Friday, May 22, 2026

AI Alert: Aligning AI agents with developmentally appropriate CHC (select) cognitive domains—a thought provoking AI approach for school-age children and youth


 The source article is an arXiv open-access article available here.  I have no idea if it has undergone peer-review and is a pre-print or if it is working draft paper.  

Click on inserted images to enlarge for easier viewing.


Dr. Kevin McGrew with major assist from Google NotebookLM.  (Click here for brief explanation of how IQs Corner creates AI Alerts from article PDFs)


ChildAgentEval, a psychometrically grounded benchmark designed to evaluate how well artificial intelligence aligns with the cognitive development of children and youth. Inspired by the Wechsler Intelligence Scale for Children-IV four-factor CHC model (the study does not use actual WISC-IV items as per test security standards), this framework assesses multimodal AI agents across ten interactive subtests covering the CHC domains like working memory (Gwm), verbal abstraction and vocabulary (Gc), fluid-visual spatial reasoning (Gf/Gv), and processing speed (Gs).

The study reveals that standard prompting—simply asking an AI to "act like a child"—does not authentically replicate pediatric developmental cognition, as models often maintain adult-level reasoning. The authors introduced a skill-guided distillation strategy that applies data-driven filters to simulate age-appropriate cognitive constraints and limitations. While AI agents can successfully adapt their vocabulary, the research highlights significant challenges in mimicking human-like bottlenecks in perceptual processing and memory retention

ChildAgentEval is the first psychometrically grounded interactive benchmark designed to measure cognitive age alignment in multimodal large language model (MLLM) agents. The study addresses a critical gap in current AI development: while state-of-the-art agents excel at complex reasoning, they often do not scaffold learning within a child's Zone of Proximal Development, often using adult-level abstractions that exceed a young user's cognitive grasp. The authors advocate for a shift in AI development from maximizing raw capability to ensuring developmental appropriateness for young users (e.g., school-age children and youth)—a feature that is much needed if AI is to be used in with school-age children and youth. 

The ChildAgentEval Framework

Inspired by the Wechsler Intelligence Scale for Children (WISC-IV), the benchmark evaluates agents across ten interactive subtests mapped to four primary CHC cognitive factors:

  • Crystallized Intelligence (Gc): Verbal abstraction and vocabulary.
  • Fluid and Visual-Spatial Reasoning (Gf/Gv): Rule induction and spatial problem-solving.
  • Working Memory (Gwm): Information retention and manipulation.
  • Processing Speed (Gs): Quickness of visual scanning and timed execution.

Unlike static evaluations, ChildAgentEval utilizes a Playwright-driven browser environment where agents must perform physical actions like clicking and typing to solve tasks. The framework also implements clinical protocols such as reversal and discontinuation rules to ensure developmental validity.

Skill-Guided Distillation

A core contribution of the research is a data-driven skill distillation strategy that moves beyond simple "act like a child" prompts. By analyzing a multi-source corpus of real child and adolescent interactions (ages 6–17), the researchers developed cognitive profile vectors. These vectors are converted into executable constraints via five cognitive filter modules injected into the agent's prompt, memory, and reasoning layers:

  1. Vocabulary Abstraction Filter: Limits academic concepts and controls syntactic complexity.
  2. Working Memory Mask: Physically simulates shorter memory spans by injecting noise or restricting cross-page information.
  3. Reasoning Budget Controller: Restricts the depth of multi-step logic.
  4. Visual Reliance Module: Reproduces cognitive biases, such as being misled by physical arrangement illusions.
  5. Social Perspective Filter: Restricts reasoning to age-appropriate viewpoints, such as first-person vs. institutional perspectives.

Key Findings and Experimental Results

The study evaluated several proprietary models (e.g., GPT-5.4, Gemini-3.1-Pro) and open-weight models (e.g., Qwen3.5-27B). The experiments revealed three major insights:

  • Standard Prompting Fails: Merely asking an agent to "act younger" does not reliably change its underlying cognitive behavior; most models continue to maximize correctness regardless of the requested age.
  • Skill Guidance Enables Alignment: In high-performing proprietary models, the distillation method successfully induced monotonic score trajectories, where performance expanded naturally as the target age increased—the results produced performances associated with developmental growth curves of cognitive abilities.
  • Uneven Domain Alignment: While agents easily adapted their linguistic style (Gc), they struggled to authentically simulate human-like limits in working memory and perceptual reasoning. This "domain dissociation" suggests that MLLM architectures currently lack the structural developmental bottlenecks found in biological cognition.

Conclusion

The authors conclude that for sensitive applications like educational tutoring, technical correctness must be secondary to developmental appropriateness. ChildAgentEval establishes a new paradigm for AI safety and alignment, shifting the focus from maximizing raw capability to authentic cognitive simulation.





Thursday, May 21, 2026

Breaking legal and research news: SCOTUS (US Supreme Court) decision re multiple IQ scores in intellectual disability (ID) death penalty case (Hamm v Smith) - A DIG decision upholding the lower court decision

Today SCOTUS ruled on the Hamm v Smith multiple IQ score Atkins intellectual disability (ID) case that was argued before SCOTUS last December. This important decision, with all concurring and dissenting opinions, is available here.  Interested parties should read all the opinions. 

All documents and the history of the case can be found in a prior IQs Corner post.  As noted in the prior lengthy post, APA submitted an Amicus brief that, based on the reading of some of the justices opinions, suggests it (together with a AAIDD Amicus Brief) played an important role in the decision.




Briefly, when SCOTUS dismisses the writ of certiorari as improvidently granted it means the justices recognized that they made a mistake in agreeing to hear the case and decided to dismiss it without issuing a ruling on the merits. In legal shorthand, this is often referred to as a "DIG" (Dismissed as Improvidently Granted). This typically happens after the Court has accepted the case, reviewed the briefs, and sometimes even heard oral arguments. 

In practice, this means that the lower court decision—which is consistent with APA’s Amicus brief—that argued that multiple IQ scores should be viewed holistically  augmented by clinical judgement—stands. This means Mr. Smith will not be executed and, more importantly, the ruling does not upset the status of prior Atkins case law.  Given the recent direction and flavor of SCOTUS, there was a fear that SCOTUS might render a formal decision that would set back prior SCOTUS Atkins ID decisions.  So, this is a win for maintaining the principles of clinical judgement, the opinions and standards established by the established medical communities (both APA’s, AAIDD), and the need for a holistic approach to the interpretation of multiple IQ scores together with adaptive behavior.

A reading of the concurring opinions indicates that APA’s brief, as well as several key APA and AAIDD ID-related publications referenced in their decisions, were influential in the decision.  Several of the dissenting opinions reveal some troubling thinking by several justices. 
As a potential conflict of interest (COI) notice, I (Dr. Kevin McGrew), together with Dr. Joel Schneider and Dr. Cecil Reynolds (as noted on page three the APA amicus brief), were consultants to APA in the drafting of that brief.
It should be noted that as a result of the work of APA Amicus brief working committee, a peer-reviewed paper outlining a psychometrically sound approach (developed by Dr. Joel Schneider) for integrating multiple IQ scores was started during the deliberations and was recently published (Schneider, Reynolds, McGrew & Salekin, 2026) after the oral argument's.  This issue will likely to be revisited in future state cases, with Schneider et al. (2026) now elevated to a scientifically sound multiple IQ composite score method.

Tuesday, May 19, 2026

Research Alert: Sex Differences in Cognitive Processing Speed (Gs) Across the Lifespan: A Meta-Analysis - ProQuest

This is another quick email-based post.  
 
This is for a doctoral thesis that is not open access, although you can read some of the first pages and the abstract at the provided link.  If interested you can order and pay for a copy or, if at a university, you can request via ProQuest.  Given that Dr. Mathew Reynolds is the primary advisor, a scholar whom I hold in high esteem for his research skills, I am looking forward to seeing this in the future, most likely as a journal pub.
 
File under Gs as per the CHC taxonomy
 
Sex Differences in Cognitive Processing Speed Across the Lifespan: A Meta-Analysis - ProQuest 
https://www.proquest.com/openview/6be586738a6bfd359b6c8be220de7071/1?pq-origsite=gscholar&cbl=18750&diss=y

Research Alert: Cognitive Networks for Knowledge Modeling: A Gentle Introduction for Data‐ and Cognitive Scientists

Quick FYI email-based Research Alert.  Is open access👍

 

 

Cognitive Networks for Knowledge Modeling: A Gentle Introduction for Data- and Cognitive Scientists - Haim - 2026 - WIREs Cognitive Science - Wiley Online Library 

https://wires.onlinelibrary.wiley.com/doi/10.1002/wcs.70026

 

ABSTRACT

In this paper, we introduce the reader to the field of cognitive network science, that is, the application of network science methods to study human cognition and knowledge structures. Cognitive networks are representations of associative knowledge between concepts in a cognitive system apt at acquiring, storing, processing and producing language, that is, the mental lexicon. In a cognitive network, nodes represent concepts with links expressing relations, such as semantic, syntactic, phonological and visual connections, for example, “canine” and “dog” (nodes) linked by “being synonyms” (link). Hence, cognitive networks represent associative knowledge in mathematical, measurable and quantifiable ways. Can such structure be used to gain insights over cognitive phenomena? We explore this research question by reviewing recent, pioneering key applications and limitations of cognitive networks across visual, auditory, and semantic language processing tasks, either in healthy or clinical populations. We also review applications of cognitive networks modeling language acquisition, reconstructing text content and assessing creativity or personality traits in individuals. Our paper also gently introduces the reader to mathematical notations, definitions and measures about single-layer and multiplex networks as well as hypergraphs. Last but not least, across phonological, semantic and syntactic networks, we guide the reader through relevant psychological frameworks, datasets and software packages that might all aid current and future cognitive network scientists.

This article is categorized under:

  • Psychology > Memory
  • Psychology > Theory and Methods
  • Linguistics > Cognitive

Graphical Abstract

Cognitive network science helps organize associative knowledge—that is, the connections between concepts. These connections play a key role in cognitive processes such as language understanding and context interpretation, even though they are not obvious in language use. For example, we do not see syntactic links, as depicted in the figure, between words in a written or spoken text. Further information can be highlighted visually in cognitive representations, such as the emotional valence of words (here indicated with the colors blue for positive, red for negative, gray for neutral and purple for a connection between positive and negative concepts). Giving structure to knowledge via cognitive networks represents a new frontier. This gentle primer offers a clear overview and introduces tools for cognitive scientists and psychologists interested in exploring cognitive representations.


Click on image to enlarge for easy viewing



 


Thursday, May 14, 2026

Research Alert: Technological Devices for Developing Working Memory in Children with ADHD: A Systematic Review

Email phone-based quick FYI post.  Open access article👍
 
Technological Devices for Developing Working Memory in Children with ADHD: A Systematic Review 
https://www.mdpi.com/2673-5318/7/3/104
 

Abstract

Attention-Deficit/Hyperactivity Disorder (ADHD) is frequently associated with working-memory (WM) weaknesses that affect learning and everyday functioning. This systematic review examined the extent to which technology-delivered interventions improve WM in children and adolescents with ADHD, with primary emphasis on standardized objective WM outcomes and secondary consideration of rating-based or in-app measures. Following PRISMA 2020 guidelines, we searched PubMed, PsycINFO, Web of Science, Scopus, ERIC, and IEEE Xplore and identified 22 eligible studies spanning PC-based training, mobile interventions, AI-adaptive programs, wearables, and neurofeedback. Across modalities, the clearest near-transfer signal emerged from process-specific PC-based interventions and some AI-adaptive programs evaluated with standardized objective WM tests. Mobile and neurofeedback approaches appeared promising in some studies, but the evidence was more heterogeneous and was more often supported by ratings, in-app composites, or less rigorous designs. Overall, current evidence suggests that technology-assisted WM interventions are most promising when they are process-specific, adaptive, and delivered at a sufficient dose, although conclusions remain constrained by heterogeneity in study design, outcome type, and methodological rigor.
 

Pardon typos and spelling errors-Message may be sent from iPhone and I've always had spelling problems :)

*****************************************
Kevin S. McGrew, PhD
Educational & School Psychologist
Director
Institute for Applied Psychometrics (IAP)
https://www.themindhub.com
******************************************

Wednesday, May 13, 2026

AI Brief: What does Grok say about Richard Snow’s aptitude trait complexes for learning?

 The following is a Grok AI generated summary when prompted (by Dr. Kevin McGrew) to explain Richard Snow’s aptitude complexes on March 7, 2026).

(Grok refers to a generative artificial intelligence developed by xAI, an Elon Musk company, as well as a foundational term in science fiction for deep understanding. Several editorial changes were made to insure accuracy, and select links added, by McGrew—to the Grok summary).

 


Richard E. Snow’s aptitude complexes (introduced in the 1980s–1990s, with key publications in 1987, 1992, and especially the 1996 chapter with Lyn Corno and Douglas Jackson) represent a major shift in how educational psychologists think about aptitudemoving far beyond the traditional narrow view of aptitude as “just cognitive ability” (e.g., IQ or a single test score).

 

Core Definition

Snow redefined aptitude as:

the degree of readiness to learn and to perform well in a particular situation or in a fixed domain.”

It is a predisposition or propensity—a latent quality that enables more advanced performance under specific conditions. Aptitudes are situational and domain-specific: what makes someone ready to succeed in math class may be different from what works in a history seminar or a hands-on lab.  


Aptitude Complexes (the key innovation)

Snow argued that single constructs (like “fluid reasoning” or “achievement motivation”) are insufficient. Instead, success in learning comes from aptitude complexes—dynamic constellations or critical combinations of variables that work together as a coordinated system.

These complexes draw from the classic “trilogy of the mind”:

•  Cognition — abilities and processes for analyzing, interpreting, and solving (e.g., reasoning, knowledge, strategies, cognitive style, CHC abilities).

•  Affect — emotions, anxiety, self-concept, emotion regulation, personality traits.

•  Conation — motivation, volition, goal-setting, effort, persistence, will (the “want to” and “stick with it” aspects).

An aptitude complex is not just a list of traits—it is how these elements assemble and coordinate in real time within a specific task and context. They are amalgams of cognitive, conative, and affective characteristics.

 

The Two Pathways That Build Aptitude Complexes

Snow (and later Corno et al., 2002) described aptitudes developing through two parallel, interacting pathways (sometimes called the commitment pathway and the performance/action pathway):

1.  Commitment Pathway (motivational/affective/volitional)

     •  Assembles motivational resources that energize effort.

     •  Affective and volitional processes modulate how the work proceeds (e.g., regulating anxiety, sustaining intention).

     •  Outcome: Propensity (how likely the person is to engage and persist).

2.  Performance (Action) Pathway (cognitive)

     •  Assembles and deploys cognitive resources (abilities, strategies, knowledge) to do the task.

     •  Outcome: Ability/accuracy in execution.

When the two pathways coordinate effectively in a given “problem space” (the task + the broader situation/context), successful performance occurs. Over repeated experiences, these coordinated processes coalesce into stable aptitude complexes. The learner’s perception of the situation’s constraints and affordances is crucial—aptitudes are dynamic and transactional (person ↔ environment). 

Aptitudes are both input (what the student brings to learning) and product (they develop and change through education).


Snow Academic Aptitude Model (SAAM)

Snow synthesized decades of research into a provisional taxonomy (called the Snow Academic Aptitude Model or SAAM in an unpublished working white paper [6-15-04; Increasing the Chance of No Child Being Left Behind: Beyond Cognitive and Achievement Abilities, by Kevin McGrew, David Johnson, Anna Casio, Jeffrey Evans]) which is now available for download here. It treats academic aptitudes as a “multivariate repertoire” of readiness for general and domain-specific learning. It explicitly includes affect (personality and social-emotional skills) and conative related skills and traits (e.g., achievement motivation, goal orientation, freedom from debilitating anxiety, positive self-concept, self-efficacy, interests) alongside cognitive abilities. The model emphasizes that “there is more to knowing about an individual’s propensity for academic learning than simply knowing their IQ scores!” 

 

Why This Mattered (and Still Matters)

Snow’s work grew out of his famous Aptitude-Treatment Interaction (ATI) research with Lee Cronbach. He showed that simple aptitude scores rarely predict learning outcomes well because real-world performance depends on these richer, situation-specific complexes.

This framework directly inspired later theories (including Kevin McGrew’s 2022 CAMML model, which updates Snow’s complexes with contemporary CHC cognitive theory, Big Five personality, and self-regulated learning mechanisms while retaining the commitment/action pathway structure).

In short, Snow’s aptitude complexes gave us a whole-person, dynamic, domain-specific way to understand why two students with the same IQ can have dramatically different learning outcomes: their aptitude complexes (the coordinated cognitive + affective + conative “package” or “amalgam”) fit the demands of the task and context differently.

Snow’s legacy is summarized in the 2002 book Remaking the Concept of Aptitude (Corno et al.), which remains the definitive extension of his ideas.

 


Research Alert: (Lets hear it for conative abilities!!!!) Non-cognitive skills mediate education-related polygenic score associations with academic achievement across development

Important “in press” (and downloadable copy) article available here.  This is a quick email-generated FYI post.
 
I know at times my posting about non-cognitive conative abilities may seem repetitive.  It is—these skills (motivation, perseverance, mindset, learning strategies, self-regulatory strategies) are important and need to have more attention in school-based assessments for struggling learners!!!  See recent May 6 “AI Brief” about the need to get the triology-of-the-mind band back together post for more info.

Abstract 

The role of environmental, developmental, and psychological processes in translating genetic dispositions into observed academic achievement remains under-investigated. Here, we examine whether non-cognitive skills—including motivation, attitudes, and emotional and behavioural functioning—mediate the genetic prediction of academic achievement across development. We analyse data from 5,016 children enrolled in the Twins Early Development Study at ages 7, 9, 12, and 16, as well as their parents and  teachers. We find that non-cognitive skills mediated between less than 5 and up to 64% of the genetic prediction of academic achievement. Mediation effects are larger and more robust for motivation and attitudes (β ≈ 0.13) than for emotional and behavioural functioning (β ≈ 0.01–0.03). This pattern holds longitudinally and is replicated in within-family analyses, where non-cognitive skills accounted for up to 83% of the total mediation effects. These findings highlight the contribution of non-cognitive skills beyond shared familial factors, likely reflecting how children evoke and select experiences that align with their genetic propensity and lead to differences in academic development.
 
Select quotes from article
 
The term ‘non-cognitive skills' describes attitudes and characteristics  that impact life outcomes beyond what cognitive tests can measure and predict. These skills encompass motivation, perseverance, mindset, learning strategies, social skills, and self-regulatory strategies.  Non-cognitive skills are associated with educational outcomes beyond cognitive ability. Studies have found that self-efficacy and personality predict academic achievement beyond cognitive ability across compulsory education. Other studies have linked personality, self-regulation, and motivation to academic performance. More recently, our research highlighted that the association between non-cognitive skills and academic achievement increases substantially across compulsory education, from age 7 to 16. Previous research also showed that greater self-control and, to a lesser extent, interpersonal skills, partly mediated the genetic prediction of adult educational attainment.

…the mediating role of education-specific non-cognitive skills increased developmentally, pointing to the growing importance of students' perceived non-cognitive profiles and experiences in their academic journeys. This developmental increase is consistent with the possibility that, as they grow up, children become more aware of their aptitudes and appetites towards learning. As they gain greater self-awareness and autonomy, students might become increasingly more able to shape their environmental contexts in ways that allow them to cultivate these non-cognitive skills and, in turn, foster their academic performance

Lets hear it for the CAMML framework.

Tuesday, May 12, 2026

Research Alert: The development of a universal screening measure for young children: Assessing social and emotional competencies in early childhood.

A quick FYI email post.  Article is NOT open access.😕
 
The development of a universal screening measure for young children: Assessing social and emotional competencies in early childhood. 
https://psycnet.apa.org/record/2027-67366-001
 
Wadington, M., Eklund, K., Kilgus, S. P., & von der Embse, N. P. (2026). The development of a universal screening measure for young children: Assessing social and emotional competencies in early childhood. School Psychology. Advance online publication. https://doi.org/10.1037/spq0000747

Abstract

Research highlights the importance of the early identification of social, emotional, and behavioral concerns in young children; however, there are limitations regarding the usability and technical adequacy of available measures. The purpose of the present study was the initial development and validation of the Social, Academic, and Emotional Behavior Risk Screener–Early Childhood measure, a novel tool designed to assess social, emotional, and behavioral functioning for preschool-aged children. Current analyses examined internal structure, reliability, concurrent validity, and diagnostic accuracy. Data were collected from 299 children, ages 2–6, and 42 educators from six early childhood centers in the Midwest and Southeastern regions of the United States. Results of a series of factor analyses provided support for a four-factor model and yielded adequate estimates of the internal consistency reliability of each factor. Correlational and receiver operating characteristic curve findings yielded strong support for the concurrent validity and diagnostic accuracy of the Social, Academic, and Emotional Behavior Risk Screener–Early Childhood Total Behavior, Social Behavior, Early Learning Behavior, and Challenging Behavior scales relative to Devereux Early Childhood Assessment for Preschoolers–Second Edition scales. Less support was found for the Anxious Behavior scale. Limitations, implications for practice, and future directions are also discussed.