Showing posts with label mental retardation. Show all posts
Showing posts with label mental retardation. Show all posts

Thursday, March 01, 2012

IAP101 Brief #12: Use of IQ component part scores as indicators of general intelligence in SLD and MR/ID diagnosis

   
            Historically the concept of general intelligence (g), as operationalized by intelligence test battery global full scale IQ scores, has been central to the definition and classification of individuals with a specific learning disability (SLD) as well as individuals with an intellectual disability (ID).  More recently, contemporary definitions and operational criteria have elevated intelligence test battery composite or part scores to a more prominent role in diagnosis and classification of SLD and more recently in ID.
            In the case of SLD, third-method consistency definitions prominently feature component or part scores in (a) the identification of consistency between low achievement and relevant cognitive abilities or processing disorders and (b) the requirement that an individual demonstrate relative cognitive and achievement strengths (see Flanagan, Fiorello & Ortiz, 2010).  The global IQ score is de-emphasized in the third-method SLD methods.
            In contrast, the 11th edition of the AAIDD Intellectual Disability: Definition, Classification, and Systems of Supports manual (AAIDD, 2010) placed general intelligence, and thus global composite IQ scores, as central to the definition of intellectual functioning.  This has not been without challenge.  For example, the AAIDD ID definition has been criticized for an over-reliance on the construct of general intelligence and for ignoring contemporary psychometric theoretical and empirical research that has converged on a multidimensional hierarchical model of intelligence (viz., Cattell-Horn-Carroll or CHC theory).
The potential constraints of the “ID-as-a-general-intelligence-disability” definition was anticipated by the Committee on Disability Determination for Mental Retardation, in its National Research Council report “Mental Retardation:  Determining Eligibility for Social Security Benefits” (Reschly, Meyers & Hartel, 2001).  This national committee of experts concluded that “during the next decade, even greater alignment of intelligence tests and the IQ scores derived from them and the Horn-Cattell and Carroll models is likely.  As a result, the future will almost certainly see greater reliance on part scores, such as IQ scores for Gc and Gf, in addition to the traditional composite IQ.  That is, the traditional composite IQ may not be dropped, but greater emphasis will be placed on part scores than has been the case in the past” (Reschly et al., 2002, p. 94).  The committee stated that “whenever the validity of one or more part scores (subtests, scales) is questioned, examiners must also question whether the test’s total score is appropriate for guiding diagnostic decision making.  The total test score is usually considered the best estimate of a client’s overall intellectual functioning.  However, there are instances in which, and individuals for whom, the total test score may not be the best representation of overall cognitive functioning.” (p. 106-107).
            The increased emphasis on intelligence test battery composite part scores in SLD and ID diagnosis and classification raises a number of measurement and conceptual issues (Reschly et al., 2002).  For example, what are statistically significant differences?  What is a meaningful difference?  What appropriate cognitive abilities should serve as proxies of general intelligence when the global IQ is questioned?  What should be the magnitude of the total test score? 
Appropriate cognitive abilities will only be the only issue discussed here.  This issue addresses  which component or part scores are more correlated with general intelligence (g)—that is, what component part scores are high g-loaders?  The traditional consensus has been that measures of Gc (crystallized intelligence; comprehension-knowledge) and Gf (fluid intelligence or reasoning) are the highest g-loading measures and constructs and are the most likely candidates for elevated status when diagnosing ID (Reschly et al., 2002).  Although not always stated explicitly, the third method consistency SLD definitions specify that an individual must demonstrate “at least an average level of general cognitive ability or intelligence” (Flanagan et al., 2010, p.745), a statement that implicitly suggests cognitive abilities and component scores with high g-ness.
Table 1 is intended to provide guidance when using component part scores in the diagnosis and classification of SLD and ID (click on images to enlarge and use the browser zoom feature  to view; it is recommended you click here to access a PDF copy of the table..and also zoom in on it).  Table 1 presents a summary of the comprehensive, nationally normed, individually administered intelligence batteries that possess satisfactory psychometric characteristics (i.e., national norm samples, adequate reliability and validity for the composite g-score) for use in the diagnosis of ID and SLD.



The Composite g-score column lists the global general intelligence score provided by each intelligence battery.  This score is the best estimate of a persons general intellectual ability, which currently is most relevant to the diagnosis of ID as per AAIDD.  All composite g-scores listed in Table 1 meet Jensens (1998) psychometric sampling error criteria as valid estimates of general intelligence.  As per Jensens number of tests criterion, all intelligence batteries g-composites are based on a minimum of nine tests that sample at least three primary cognitive ability domains.  As per Jensens variety of tests criterion (i.e., information content, skills and demands for a variety of mental operations), the batteries, when viewed from the perspective of CHC theory, vary in ability domain coveragefour (CAS, SB5), five (KABC-II, WISC-IV, WAIS-IV), six (DAS-II) and seven (WJ III) (Flanagan, Ortiz & Alfonso, 2007; Keith & Reynolds, 2010).   As recommended by Jensen (1998), the particular collection of tests used to estimate g should come as close as possible, with some limited number of tests, to being a representative sample of all types of mental tests, and the various kinds of test should be represented as equally as possible (p. 85).  Users should consult sources such as Flanagan et al. (2007) and Keith and Reynolds, 2010) to determine how each intelligence battery approximates Jensens optimal design criterion, the specific CHC domains measured, and the proportional representation of the CHC domains in each batteries composite g-score.
Also included in Table 1 are the component part scales provided by each battery (e.g., WAIS-IV Verbal Comprehension Index, Perceptual Reasoning Index, Working Memory Index, and Processing Speed Index), followed by their respective within-battery g-loadings.[1]  Examination of the g-ness of composite scores from existing batteries (see last three columns in Table 1) suggests the traditional assumption that measures of Gf and Gc are the best proxies of general intelligence may not hold across all intelligence batteries.[2] 
In the case of the SB5, all five composite part scores are very similar in g-loadings (h2 = .72 to .79).  No single SB5 composite part score appears better than the other SB5 scores for suggesting average general intelligence (when the global IQ score is not used for this purpose).  At the other extreme is the WJ III where the Fluid Reasoning, Comprehension-Knowledge, Long-term Storage and Retrieval cluster scores are the best g-proxies for part-score based interpretation within the WJ III.  The WJ III Visual Processing and Processing Speed clusters are not composite part scores that should be emphasized as indicators of general intelligence.  Across all batteries that include a processing speed component part score (DAS-II, WAIS-IV, WISC-IV, WJ III) the respective processing speed scale is always the weakest proxy for general intelligence and thus, would not be viewed as a good estimate of general intelligence. 
            It is also clear that one cannot assume that composites with similar sounding names of measured abilities should have similar relative g-ness status within different batteries.  For example, the Gv (visual-spatial or visual processing) clusters in the DAS-II (Spatial Ability), SB5 (Visual-Spatial Processing) are relatively strong g-measures within their respective battery, but the same cannot be said for the WJ III Visual Processing cluster.  Even more interesting are the differences in the WAIS-IV and WISC-IV relative g-loadings for similarly sounding index scores. 
For example, the Working Memory Index is the highest g-loading component part score (tied with Perceptual Reasoning Index) in the WAIS-IV but is only third (out of four) in the WISC-IV.   The Working Memory Index is comprised of the Digit Span and Arithmetic subtests in the WAIS-IV and the Digit Span and the Letter-Number Sequencing subtests in the WISC-IV.  The Arithmetic subtest has been reported to be a factorially complex test which may tap fluid intelligence (Gf-RQ—quantitative reasoning), quantitative knowledge (Gq), working memory (Gsm), and possible processing speed (Gs; Keith & Reynolds, 2010; Phelps, McGrew, Knopik & Ford, 2005).   The factorially complex characteristics of the Arithmetic subtest (which, in essence, makes it function like a mini-g proxy) would explain why the WAIS-IV Working Memory Index is a good proxy for g in the WAIS-IV but not in the WISC-IV. The WAIS-IV and WISC-IV Working Memory Index scales, although named the same, are not measuring identical constructs.

A critical caveat is that the g-loadings cannot be compared across different batteries.  g-loadings may change when the mixture of measures included in the analyses change.  Different "flavors" of g can result (Carroll, 1993; Jensen, 1998). The only way to compare the g-ness across batteries is with appropriately designed cross- or joint-battery analysis (e.g., WAIS-IV, SB5 and WJ III analyzed in a common sample).
The above within and across intelligence battery examples illustrates that those who use component part scores as an estimate of a person’s general intelligence must be aware of the composition and psychometric g-ness of the component scores within each intelligence battery.  Not all component part scores in different intelligence batteries are created equal (with regard to g-ness).  Also, not all similarly named factor-based composite scores may measure the same identical construct and may vary in degree of within battery g-ness.  This is not a new problem in the context of naming factors in factor analysis, and by extension, factor-based intelligence test composite scores, Cliff (1983) described this nominalistic fallacy in simple language—“if we name something, this does not mean we understand it” (p. 120). 




[1] As noted in the footnotes in Table 1, all composite score g-loadings were computed by Kevin McGrew by entering the smallest number (and largest age ranges covered) of the published correlation matrices within each intelligence batteries technical manual (note the exception for the WJ III) in order to obtain an average g-loading estimate.  It would have been possible to calculate and report these values for each age-differentiated correlation matrix for each intelligence battery.  However, the purpose of this table is to provide the best possible average value across the entire age-range of each intelligence battery.  Floyd and colleagues have published age-differentiated g-loadings for the DAS-II and WJ III.  Those values were not used as they are based on the use of the principal common factor analysis method, a method that  analyzes the reliable shared variance among tests.  Although principal factor and principal component loadings typically will order measures in the same relative position, the principal factor loadings typically will be lower.  Given that the imperfect manifest composite scale scores are those that are utilized in practice, and to also allow uniformity in the calculation of the g-loadings reported in Table 1, principal component analysis was used in this work. The same rationale was used for not using the latent factor loadings on a higher-order g-factor in SEM/CFA analysis of each test battery.  Loadings from CFA analyses represent the relations between the underlying theoretical ability constructs and g purged of measurement error.  Also, frequently the final CFA solutions reported in a batteries technical manual (or independent journal articles) allow tests to be factorially complex (load on more than one latent factor), a measurement model that does not resemble the real world reality of the manifest/observed composite scores used in practice.  Latent factor loadings on a higher-order g-factor will often differ significantly from principal component loadings based on the manifest measures, both in absolute magnitude and relative size (e.g., see high Ga loading on g in WJ III technical manual which is at variance with the manifest variable based Ga loading reported in Table 1) 
[2] The h2 values are the values that should be used to compare the relative amount of g-variance present in the component part scores within each intelligence battery.

Wednesday, November 30, 2011

Is it possible to fine-tune the human brain clock?



I just made a guest blog post at the IM-Home blog with the above title. It is the fourth blog in my introductory series that explains why I am so interested in mental timing and brain-clock based neurotechnologies. You can find the new post here.

Generated by: Tag Generator

Tuesday, February 01, 2011

Special journal issue on Forensic Research in Offenders with Intellectual & Developmental Disabilities

Thanks to Karen Salekin for alerting me to a two-part special issue of Psychology, Crime and Law: Forensic Research in Offenders with Intellectual & Developmental Disabilities. I've provided picture images of the table of contents below. The actual TOC and articles for the two issues can be accessed here and here.

Double click on images to enlarge







- iPost using BlogPress from my Kevin McGrew's iPad

Generated by: Tag Generator


Thursday, November 11, 2010

Research brief: More support for Flynn effect --study with MR/ID adults







Nijman, E. E., Scheirs, J. G. M., Prinsen, M. J. H., Abbink, C. D., & Blok, J. B. (2010). Exploring the Flynn effect in mentally retarded adults by using a nonverbal intelligence test for children. Research in Developmental Disabilities, 31(6), 1404-1411.

Abstract

Increases in the scores on IQ tests across generations have been called the Flynn effect (FE). One of the unresolved questions is whether the FE affects all subsamples of the intellectual ability distribution equally. The present study was aimed at determining the size of the FE in moderately mentally retarded individuals. A nonverbal intelligence test developed for children, the Snijders-Oomen Nonverbal Intelligence Test (SON), was administered to 32 retarded adults with a mental age of 3–6 years. Sixty-nine children with a biological age in the same range and with normal intelligence served as a comparison group. Both an older and a more recent version of the SON were presented to all participants in a counterbalanced order. The proportion of items answered correctly was taken as a measure of the dependent variable. It was found that a FE existed in both the group of children and in the group of retarded adults, but that the FE was largest in the latter group. The importance of not using obsolete test norms when diagnosing mental retardation was stressed, and possible causes of the Flynn effect were discussed.





- iPost using BlogPress from my Kevin McGrew's iPad

Saturday, April 10, 2010

Research Briefs 4-10-10: Working memory, executive functioning and ID/MR

The cognitive neuroscience constructs of working memory and executive function (EF) are hot topics in all areas of cognition.  Although working memory measures are now showing up on most contemporary IQ tests, the measurement of executive functioning (EF) is not...and is typically measured by special purpose tests, tests that all suffer from a solid empirical foundation of research that clearly specifies the elements of, relations between, operational definitions of, etc. the components of EF.

I was thus excited to see the table of contents for the recent issue of the Journal of Intellectual Disability Research (see below), as it focuses primarily on working memory and EF research for people with ID/MR.  My excitement was short-lived as I found that, at least at my Universities library, access to articles in this journal are not allowed until one year after publication.  Darn.Re

Ball, S. L., Holland, A. J., Watson, P. C., & Huppert, F. A. (2010). Theoretical exploration of the neural bases of behavioural disinhibition, apathy and executive dysfunction in preclinical Alzheimer's disease in people with Down's syndrome: potential involvement of multiple frontal-subcortical neuronal circuits. Journal of Intellectual Disability Research, 54, 320-336.

Carretti, B., Belacchi, C., & Cornoldi, C. (2010). Difficulties in working memory updating in individuals with intellectual disability. Journal of Intellectual Disability Research, 54, 337-345.

Henry, L., Cornoldi, C., & Mahler, C. (2010). Special issues on 'working memory and executive functioning in individuals with intellectual disabilities'. Journal of Intellectual Disability Research, 54, 293-294.

Henry, L., & Winfield, J. (2010). Working memory and educational achievement in children with intellectual disabilities. Journal of Intellectual Disability Research, 54, 354-365.

Lanfranchi, S., Jerman, O., DalPont, E., Alberti, A., & Vianello, R. (2010). Executive function in adolescents with Down Syndrome. Journal of Intellectual Disability Research, 54, 308-319.

Mosse, E. K., & Jarrold, C. (2010). Searching for the Hebb effect in Down syndrome: evidence for a dissociation between verbal short-term memory and domain-general learning of serial order. Journal of Intellectual Disability Research, 54, 295-307.

Schuchardt, K., Gebhardt, M., & Maehler, C. (2010). Working memory functions in children with different degrees of intellectual disability. Journal of Intellectual Disability Research, 54, 346-353.

Willner, P., Bailey, R., Parry, R., & Dymond, S. (2010). Evaluation of executive functioning in people with intellectual disabilities. Journal of Intellectual Disability Research, 54, 366-379.

Willner, P., Bailey, R., Parry, R., & Dymond, S. (2010). Evaluation of the ability of people with intellectual disabilities to 'weigh up' information in two tests of financial reasoning. Journal of Intellectual Disability Research, 54, 380-391.

Technorati Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Wednesday, April 07, 2010

Psychometric PS to Johnston v Florida (2010) denied appeal re: new WAIS-IV scores

This is a follow-up to my brief comments yesterday regarding the Johstone v Fl (2010) denied MR/ID appeal of two days ago.

As mentioned in the decision and my blog comment, the WAIS-III/WAIS-IV tests correlated .94 in a study reported in the WAIS-IV technical manual.  This is a very high correlation...but does NOT mean that the two tests should be expected to provide identical IQ scores.  I discuss these issues in a prior IAP AP101 report.

The tests have different norm dates and thus, the later version (WAIS-IV) would be expected to provide a lower score based on the Flynn effect.  More importantly, as reported in the IAP AP101 report, when one calculates the standard deviation of the difference score (see page 6 of that report) for a correlation of .94, the resulting value is 5.2 (round to 5 for ease of discussion).  This means that, on average, the WAIS-III/WAIS-IV (even if highly correlated at the .94 level) would in the general population be expected to display a range of difference scores from -5 to +5...or a range of 10 IQ points......in 68% of the population.  Please review that prior report for further explanation and discussion.

Technorati Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Friday, February 12, 2010

Critque of proposed DSM5 intellectual disability criteria: Guest post by Dr. Dale Watson


Without question, the DSM-5 Proposed Draft Revision document has been generating considerable chatter among psychologists.  With regard to Atkins cases, the proposed definition of intellectual disability (ID) is no exception....emails and listservs have been busy debating and critiquing the ID proposed criteria. 

Dr. Dale Watson has set out a well-written set of concerns and issues in the guest blog post below---which is reproduced "as is" from Dr. Watson.  Kudos to Dale for providing ICDP with his perspective.

Dr. Dale Watson's critique of the proposed DSM-V ID criteria follows:

The DSM-5 Proposed Draft Revisions to the Criterion sets for Mental Disorders have recently become available.  The proposed criteria for the diagnosis of Intellectual Disability retain the three-pronged model of diagnosis used by both the DSM-IV and the AAIDD.  However, the revised language, though more precise in some ways, is also potentially problematic for a number of reasons and requires further clarification.  The following critique outlines concerns regarding the revised language and is a request for further clarification and/or specificity in the diagnostic language.

The first prong of the revision appears initially to improve the specificity of the IQ requirements stating, “Current intellectual deficits of two or more standard deviations below the population mean, which generally translates into performance in the lowest 3% of a person’s age and cultural group, or an IQ of 70 or below.”  Certainly using “standard deviations below the population mean” adds a degree of precision and perhaps allows for consideration of “Flynn Effect” changes in the population mean.  However, in an Atkins context, this language also appears to foreclose sole reliance on historical test scores in establishing the diagnosis in that it requires “current intellectual deficits.”  In addition, by eliminating the DSM-IV “IQ of approximately 70 or below” there must be some concern that this proposal establishes a “bright-line” cutoff of “70 or below” for the diagnosis of an Intellectual Disability.  Despite the rationale provided by the Work Group that the proposed criteria add “rigor to wording regarding psychometrics [with] (no change in cut –off)” one must be concerned that this is, in effect, a change in the cutoff.  In justifying the changes in the coding procedure, the rationale indicates, “Inaccuracy of testing no longer a factor.”  Does this mean that eliminating the word “approximately,” as used in DSM-IV and by the AAIDD, also eliminates consideration of the Standard Error of Measurement?  Does this mean, as is the practice in a number of death-penalty states, that it is no longer “possible to diagnose Mental Retardation with IQ scores between 71 and 75…”  (DSM-IV, p. 48).  If that were the case, the proposed criteria would not be consistent with clinical practice nor would it, by ignoring the standard error or measurement, be “adding [psychometric] rigor.”

The revision language also acknowledges the importance of cultural sensitivity, which, on the face of it, should not be objectionable.  However, by stating that the IQ requirement “translates into performance in the lowest 3% of a person’s age and cultural group…” there is a risk that this will be used as a rationale to establish subgroup norms rather than relying, as indicated, on the “population mean.”  Certainly we have seen attempts by some psychologists to use membership in a presumed cultural group to inflate IQ scores thus making legitimately intellectually disabled individuals eligible for the death penalty.  These attempts have included the use of Heaton’s WAIS-III demographically adjusted norms and what have been termed “IQ-Quality” scores, both of which inflate the obtained IQ scores based upon subgroup membership to establish that individuals do not have an Intellectual Disability.  IQs, by definition, must reference population rather than subgroup standards.

There is also either ambiguity or a substantially increased demand for deficits in adaptive function, in the proposed language for the second prong of the diagnosis.  The proposed criteria require:

 [C]oncurrent deficits in at least two domains of adaptive functioning of at least two or more standard deviations, which generally translates into performance in the lowest 3 % of a person’s age and cultural group, or standard scores of 70 or below.  This should be measured with individualized, standardized, culturally appropriate, psychometrically sound measures.  Adaptive behavior domains typically include:
  • Conceptual skills (communication, language, time, money, academic)
  • Social skills (interpersonal skills, social responsibility, recreation, friendships)
  • Practical skills (daily living skills, work, travel).
The ambiguity arises when one considers what is meant by a “domain.”  DSM-IV required deficits in at least two “areas” (similar to what appear to be sub-domains above).  In contrast, AAIDD has rightfully relied upon, based on factor analytic studies and the work of Stephen Greenspan, deficits in the domains of Conceptual, Social and Practical skills.  However, the AAIDD requires adaptive function deficits in only one as opposed to two domains.  The current AAIDD manual operationally defines significant limitations in adaptive behavior as “performance that is approximately two standard deviations below the mean of either (a) one of the following three types of adaptive behavior: conceptual, social, or practical or (b) an overall score on a standardized measure of conceptual, social, or practical skills.  The assessment instrument’s standard error of measurement must be considered when interpreting the individual’s obtained scores” p. 43.  The impact of requiring deficits in two of the three domains, as apparently required by the proposed criteria, has long been recognized.  For example, The 2002 AAMR Mental Retardation: Definition, Classification, and Systems of Supports manual described the impact that requiring deficits in two or more of the three domains would have on the prevalence of intellectual disabilities:

…the probability of a person having significant deficits (2 SDs below the mean) in two or in all three domains of adaptive behavior is extremely low compared to the probability of scoring two standard deviations or below on only one domain.  In fact, simulation studies have demonstrated that the probability of a person scoring two standard deviations below the mean on more than one domain would be so low that almost no one with an IQ in the upper mental retardation range would be identified as having mental retardation (K.F. Widaman, personal communication, November 9, 2001) (p. 78).

The proposed DSM-5 language, rather than having “Consistency with AAIDD practices,” as stated in the rationale, appears to fly in the face of those standards and would insure that “almost no one with an IQ in the upper mental retardation range would be identified as having” an intellectual disability.  In addition, by failing to note the importance of considering the “instrument’s standard error of measurement” the proposed criteria once again suggests a “bright-line” cut-off for both the intellectual and adaptive functioning requirements.  Effectively, and contrary to the stated rationale, these proposals represent changes in the cutoff scores.  In an attempt to establish unambiguous criteria for the intellectual and adaptive behavior diagnostic prongs the proposed language ignores a fundamental understanding of the nature of test scores, i.e., that some degree of imprecision is inherent.

The practical impact of these proposed criteria within a clinical context would be to reduce the number of individuals diagnosed with an Intellectual Disability.  Within an Atkins context, these changes would make more individuals eligible for the death penalty.

The Work Groups for DSM-5 are soliciting comments upon the proposed diagnostic criteria until April 20, 1010 at www.dsm5.org.  I would urge anyone with concerns regarding the criteria to submit their comments.

Dale G. Watson, Ph.D.
Clinical and Forensic Neuropsychologist
watson.dale@comcast.net

Technorati Tags: , , , , , , , , , , , , , , , , , , , , , , , ,

Wednesday, February 10, 2010

DSM-5: Proposed draft revisions: Intellectual Disability

The Proposed Draft Revisions to DSM-5 are now available on line.  MR/ID is described as the following:

Intellectual Disability

A. Current intellectual deficits of two or more standard deviations below the population mean, which generally translates into performance in the lowest 3% of a person's age and cultural group, or an IQ of 70 or below. This should be measured with an individualized, standardized, culturally appropriate, psychometrically sound measure.

B. And concurrent deficits in at least two domains of adaptive functioning of at least two or more standard deviations, which generally translates into performance in the lowest 3 % of a person's age and cultural group, or standard scores of 70 or below. This should be measured with individualized, standardized, culturally appropriate, psychometrically sound measures. Adaptive behavior domains typically include:
  • Conceptual skills (communication, language, time, money, academic)
  • Social skills (interpersonal skills, social responsibility, recreation, friendships)
  • Practical skills (daily living skills, work, travel)
C. With onset during the developmental period.

Code no longer based on IQ level.

Technorati Tags: , , , , , , , , , , , , , , , , , , , , ,

Tuesday, February 09, 2010

Stephen Greenspan on "60 Minutes" to discuss gullibility

Stephen Greenspan (ANNALS OF GULLIBILITY, Praeger, 2009) is scheduled to appear on a  60 Minute segment scheduled to air on Sun, 2/14 @ 7 pm ET/PT; 6PM Central/ Mtn.  Greenspan has written extensively re: the critical importance of the concept of gullibility as it relates to the definition and identification of individuals with intellectual disabilities/mental retardation (ID/MR).

According to Greenspan, the segment will focus mainly on Ponzi scheme implications, and will not get much or all into other implications, such as criminal justice, cognitive disabilities, Atkins MR death penalty cases, etc.

Technorati Tags: , , , , , , , , , ,

Sunday, January 31, 2010

Intellectual heterogeneity of MR/ID as evidence against AAIDD "stuck on g" green manual: Even in cleary genetic-based syndromes (Williams Syndrome)

In the last in my series of posts re: concerns I have with the AAIDD 11th Edition ID definition and classification manual, one  point I raised (re: my concern for the AAIDD "stuck on g" position) was the fact that individuals with ID/MR should not be stereotyped as having a single type of cognitive disability (simply poor g---which also implies, for many, a "flat profile" of cognitive abilities). 

Although not so stated in the AAIDD manual, the elevation of general intelligence to such high status, combined with statements that current intelligence measures are not available to reliably and validly assess multiple cognitive abilities (a statement that is simply wrong--see PPT show link in last post in series), implicitly has the potential to convey this stereotype.  I argued that individuals with ID/MR show just as much heterogeneity in profiles of cognitive abilities as individuals without ID/MR.

This past week a colleague reminded me of one article that makes my point clear.  Within the field of ID/MR, there are a number of rare genetic-based disorders.  Such genetically-based disorders typically result in a greater degree of similarity (homogeneity) among individuals with the condition.  Williams syndrome (WS) is one such ID/MR disorder.  Of course, individuals with WS are not those being evaluated in typical Atkins death penalty cases, but the common assumption and lore is that WS individuals show a "syndrome-specific pattern of cognitive strengths/weaknesses"----high verbal abilities and much lower visual-spatial abilities.

I would argue, as have others, that this WS syndrome-specific cognitive stereotype is largely due to the fact that historically MR/ID researchers only had the V/P organized Wechsler batteries as their primary IQ battery...and that the "profile" may be due to this research being constrained by batteries that did not validly measure a greater breadth of cognitive functioning.  This is not a criticism of the past research, as researchers had limited theories of intelligence and measures of constructs from which to work.  However, now that CHC theory has emerged as the consensus psychometric model of cognitive abilities and, more importantly, there are a significant number of well-standardized and psychometrically sound IQ batteries of multiple cognitive abilities, I'm not surprised that a syndrome with a strong genetic core, which typically results in more within-group similarity, when measured by more contemporary CHC-based IQ batteries display considerable variability/heterogeneity in patterns of cognitive abilities. 

Below is the abstract for  2005 study that reported that WS individuals do NOT display the classic and historical syndrome-specific pattern of cognitive strengths and weaknesses when measured with a more contemporary CHC-based cognitive battery (WJ-R:  conflict of interest note--I am a coauthor of the next edition..the WJ III).

This study clearly suggests that even a population of individuals with a shared genetic causal mechanism display significant individual differences in patterns of cognitive abilities.  If this is found in ID/MR populations with a strong shared genetic causal mechanism, one would be hard-pressed to argue that such variability does not exist for more milder forms of ID/MR and the general population.

My point (again)---I'm very concerned that the AAIDD 11th Edition ID manual's "stuck on g" position is out of synch with contemporary intelligence theory and measurement and has the potential to cause serious harm when potentially life-altering decisions are made on the basis of a single g-based composite IQ scores that ignores the heterogeneity of human cognitive abilities across the ability spectrum and different disorders.

Porter, M. A. & Coltheart, M.  Cognitive Heterogeneity in Williams Syndrome.  Developmental Neuropsychology, 27 (2), 275-306. (click here to view article)


Abstract
This study used the Woodcock-Johnson Tests of Cognitive Ability-Revised to investigate a wide range of cognitive abilities in people with Williams syndrome (WS). It involved a comparatively large sample of 31 people with WS, but took a case-series approach. The study addressed the widespread claims of a characteristic "WS cognitive profile" by looking for heterogeneity rather than homogeneity. People with WS showed a variety of preserved (significantly above mental age [MA]), expected (at MA), and significantly impaired (significantly below MA) levels of functioning. Such results provide clear evidence for heterogeneity in cognitive functions within WS. We found the most homogeneity on a test of phonological processing and a test of phonological short-term memory, with half of the WS sample performing at MA levels on these tests. Interestingly, no WS individual showed a weakness on a test of nonverbal reasoning, and only one WS individual showed a weakness on a test of verbal comprehension. In addition, we found that strengths on analysis-synthesis and verbal analogies occurred only for WS individuals with an MA less than 5.5 years (our sample median MA); people with an MA greater than 5.5 years performed at MA level on these 2 tests. Results also provided preliminary evidence for distinct subgroups of WS people based on their cognitive strengths and weaknesses on a broad range of cognitive functions. On the basis of the findings, caution should be made in declaring a single cognitive profile that is characteristic of all individuals with WS. Just as there is heterogeneity in genetic and physical anomalies within WS, not all WS individuals share the same cognitive strengths and weaknesses. Also, not all WS individuals show the profile of a strength in verbal abilities and a weakness in spatial functions.

Technorati Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Tuesday, January 26, 2010

AAIDD intellectual disability manual (11th edition): Intelligence component -1 standard deviation below average: Final in 3-part series


[Note...this is cross-blog post.  It also has been posted to IQs Corner sister blog--ICDP]

This is my third (and final) comment in my series of comments re: the intellectual component of the new AAIDD ID/MR definition and classification manual.  I urge readers, if they have not done so, to read my original post.  In the first post I outlined the reason for the series.  I also highlighted positive features of the AAIDD component (chapter 4) of the manual and acknowledged that no manual will be perfect.  In the second post, I presented a comparative analysis of the literature cited in the 2002 and 2010 manuals regarding the nature and definition of intelligence. It was my conclusion that the 2010 manual (11th edition-the green book) failed to incorporate significant consensus-based advances regarding the nature of psychometric theories of intelligence and contemporary intelligence tests based upon these theories.

This final post is intended to provide the foundations for the conclusions in my second critical analysis post. To be honest, I've struggled with how to articulate these concerns in a brief format. This has been the major reason for the delay in this final post. I've struggled with not wanting to be a simple critic who does not offer substantive evidence or guidance. More importantly, I did not want to be a critic who did not try to help rectify the issues identified. Thus, I decided to take a more ambitious educational approach to my concerns regarding the AAIDD ID 2010 manual. Thus, my final post in the form of a lengthy PowerPoint presentation that is intended to educate and provide background information regarding my criticisms.

Below is a description of the PowerPoint presentation which is available via my SlideShare space.  This is an online  presentation that can also be downloaded to your respective hard drive for off-line viewing and use. In addition, I have made available a PDF copy of the slides presentation can be accessed by clicking here. [Warning....the PDF version is very large...30+MB....and should only be downloaded when you have a high speed connection]

Description of presentation:  This presentation traces the evolution of psychometric theories of intelligence from Spearman's g to contemporary CHC. In addition, it simultaneously tracks the evolution of psychometric tests of intelligence as they relate to psychometric theories. Finally, there is a special emphasis on tracking changes in the AAMR/AAIDD intellectual disability (mental retardation) classification manuals over the same period. It is concluded that despite significant advances in psychometric theories of intelligence and contemporary psychometric intelligence tests, the official 2010 AAIDD manual is significantly behind these developments. The 2010 AAIDD manual is "stuck on g" and has failed to incorporate advances in both psychometric theories and tests of intelligence.  A significant intelligence theory--AAIDD ID/MR definition gap exists tat has potential serious consequences for individuals with ID/MR.

Below is my final set of critical summary comments (2nd slide from the end) presented at the end of the presentation.
Despite the widespread acceptance and recognition of the contemporary CHC (aka Extended Gf-Gc) theory of intelligence by intelligence scholars, a 2002 national panel of MR/ID experts, and the clear movement in applied IQ test development to test batteries grounded in the CHC framework, AAIDD continues to be “stuck on g”

The AAIDD definition of intelligence is out-of-date.  A major intelligence theory—AAIDD ID definition gap exists

Contemporary intelligence scholars, experts, and test developers recognize that although g (general intelligence)  may exist at the apex of the CHC taxonomy of human cognitive abilities, there are broad (stratum II) abilities that are important (i.e., have differential validities) that can be assessed and, when interpreted appropriately, can provide a more valid and multidimensional picture of an individuals intellectual functioning.

AAIDD’s continued use of the statement (with regard to measurement of multiple cognitive abilities) that “until such measures of multiple intelligences can be assessed reliably and validly, it is the position of AAIDD that intellectual functioning…is best conceptualized and captured by a general factor of intelligence” is simply wrong!  Reliable and valid measures of the broad CHC ability domains exist and have been published  in most intelligence batteries published from 1989 to 2008. 

The AAIDD g-position is at odds with the known heterogeneity of abilities within the ID (and general) population and fails to recognize that although a g-based total composite score may often represent the best single index of a person’s intellectual functioning, often the g-based composite score may lead to inaccurate conclusions regarding a person’s intellectual functioning and in these cases more attention should be focused on the component part scores.  The stuck on g position has the potential to result in serious consequences for individuals, such as denial of special education services; denial of SS benefits, and unjust execution as in “Atkins MR/ID death penalty cases”.

As I stated in my original post, "ideally I hope that my forthcoming critical comments, combined with a spirited back-and-forth dialogue, will produce productive scholarly discourse, discourse that may result in AAIDD upgrading/revising their current written statement regarding the first prong of an ID diagnosis—intellectual functioning (Chapter 4) via new position papers or journal articles, web-based clarifications, and/or the publication of more specific professional guidelines."  Finally, I extend an invitation to members of the committee (that drafted the 2010 manual) to forward  me any professional responses to my series,  which I will post as "guest post responses" at the ICDP blog.

Technorati Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Tuesday, January 05, 2010

The Wechsler-like IQ subtest scaled score metric: The potential for misuse, misinterpretation and impact on critical life decisions---draft report in search of feedback




The following are the first three paragraphs (and a critical figure) of a draft of an IAP Applied Psychometrics 101 Brief Report (#5).  The complete report can be download in PDF format by clicking here.  A web-page version of the complete report can be found by clicking here (note - the web page verision may NOT display two embedded figures....viewing the PDF copy may be necessary)

I'm providing this initial draft report with the expressed intent of soliciting feedback and comments regarding the accuracy and soundness of my analyses and logic.  I'm looking for critical feedback to improve the report.  This is a draft report that will be revised if comments suggest important changes.  Please read it in the spirit of "tossing out some critical ideas" for reflective analysis and feedback.  Feedback can be sent directly to me (iap@earthlink.net) or could be provided in the form of listserv thread discussions at the NASP and/or CHC listservs.


I've recently been skimming James Flynn's new book (What is Intelligence:  Beyond the Flynn Effect) to better understand the methodology and interpretation of the Flynn effect. Of particular interest to me (as an applied measurement person) is his analysis of the individual subtest scores from the various Wechsler scales across time. As most psychologists know, Wechsler subtest scaled scores (ss) are on a scale with a mean (M) = 10 and a standard deviation (SD) = 3. The subtest ss range from 1 to 19.  In Appendix 1 of his book, Flynn states "it is customary to score subtests on a scale in which the SD is 3, as opposed to IQ scores which are scaled with SD set at 15. To convert to IQ, just multiply subtest gains by five, as was done to get the IQ gains in the last column."  At first glance, this statement makes it sound as if the transformation of subtest ss to IQ SS is an easy (“just multiply….”; emphasis added by me) and mathematically acceptable procedure without problems. However, on close inspection this transformation has the potential to introduce unknown sources of error into the precision of the transformed SS scores.  It is the goal of this brief technical post to explain the issues involved when making this ss-to- IQ SS conversion.

The ss 1-19 scale has a long history in the Wechsler batteries. For sample, in Appendix 1 of Measurement of Adult Intelligence (Wechsler, 1944), Wechsler described the steps used to translate subtest raw scores to the new ss metric. The Wechsler batteries have continued this tradition in each new revision, although the methodology and procedures to calculate the ss 1-19 values have become more sophisticated over time.   Although the methods used to develop the Wechsler ss 1-19 scale may have become more sophisticated, the resultant underlying scale for each subtest has not…scores still range from 1-19 (M=10; SD=3).  Also, the most recent Stanford-Binet—5th Edition (SB5; Roid, 2003) and Kaufman Assessment Battery for Children-2nd Edition (KABC-II) have both adopted the same ss 1-19 scale for their respective individual subtests.

Why is this relatively crude (to be defined below) scale metric still used in some intelligence batteries when other contemporary intelligence batteries provide subtest scale metrics with finer measurement resolution?  For example, the DAS-II (Elliott, 2007) places individual test scores on the T-scale (M=50; SD=10), with scores that range from 10-90.  The WJ III (McGrew & Woodcock, 2001) places all test and composite scores on the standard score (SS) metric associated with full scale and composite scores (M=100; SD=15).  The critical question to be asked is “are there advantages or disadvantages to retaining the historical ss 1-19 scale or, are their real advantages to having individual test scales with finer measurement resolution (DAS-II; WJ III)?”

......continued............
(complete report available at links in first paragraph of this post)

[Double click on image to enlarge]





Technorati Tags: , ,, , , , , , , , , , , , , , , , , , , , , , , , , ,


Tuesday, December 22, 2009

State special education definitions of MR/ID

Thanks to Randy Floyd for sending this article to me.  The article summarizes state eligibility guidelines for MR/ID as promulgated by state education agencies--thus covering the school-age population of students with potential MR/ID and special education services.  State law governing the definition and criteria for Atkin's decisions do not directly correspond with state special education rules, laws, and regulations.

Bergeron, R., Floyd, R., & Shands, E.  (2008).  States’ Eligibility Guidelines for Mental Retardation: An Update and Consideration of Part Scores and Unreliability of IQs.  Education and Training in Developmental Disabilities, 43(1), 123–131. (click here to view)

Abstract
Mental retardation (MR) has traditionally been defined as a disorder in intellectual and adaptive functioning beginning in the developmental period. Guided by a federal definition of MR described in the Individuals with Disabilities Education Act, it is the responsibility of each of the United States to describe eligibility guidelines for special education services. The purpose of this study was to examine eligibility guidelines for MR for the 50 states and the District of Columbia. This study examined the terms used to describe MR, the use of classification levels, the cutoff scores, and the adaptive behavior considerations for each state. In addition, this study examined guidelines for consideration of intelligence test part scores and consideration of the unreliability of IQs through consideration of the standard error of measurement (SEM) or an IQ range. As found in previous studies, results revealed great variation in the specific eligibility guidelines for MR from state to state. The greatest variation appeared to be across the adaptive behavior considerations. Approximately 20% of states (10) recommend consideration of intelligence test part scores, and approximately 39% of states (20) recommend attention to unreliability of IQs through consideration of the SEM or an IQ range.


Technorati Tags: , , , , , , , , , , , , , , , ,