Tuesday, 11 June 2013

Vocabulary: humanity’s greatest achievement?

Words are our greatest tools. They allow us to guide, warn, praise and admonish each other and to cooperate in complex tasks. Even restricting ourselves to oral traditions, we can pass down stories which preserve some of the wisdom of our ancestors. Once we master the art of reading and writing we can supersede the restrictions of memory and draw on the wisdom of successive generations. Knowledge is power, and the accumulated knowledge of many centuries is the most powerful of all.

Vocabulary acquisition is an enthralling process to study, as children learn the rules of grammar, and sometimes apply them with greater consistency than the idiosyncrasies of actual spoken adult language would deem correct, if you have gotted the point. Children may refer to little rodents as “mouses” because children have noticed a language rule, and applied it correctly. The English language is not always correct, for idiosyncratic and historical reasons. The English who goed West on the Mayflower were full of reformist zeal, and managed to improve the unwritten constitution by writing it down, and improved writing by removing some of the colourful miss-spellings of the English language and by making spelling more logical and less colorful, but they did not get round to a full purging of languistic errors and exceptions, or at the very least they have not gotten round to it yet (this is a recondite joke, because all English speakers at that time said “gotten” and then with time the English moved to “got” for the past tense, no doubt just to aggravate the colonials).

Some people have the simplistic notion that vocabulary must be determined by mere exposure to spoken language. That is necessary, but far from sufficient, as even children work out. They notice patterns, informal rules, and the contexts in which communication takes place.  “The acquisition of meaning is based on the eduction of meaning from the contexts in which the words are encountered”. (So, even if the word “eduction” in the quotation from page 146 of Jensen’s “Bias in mental Testing” is unfamiliar, you will not be surprised to deduce that it means “To assume or work out from given facts; deduce”). The meaning of a word is acquired in some contexts which permits at least some partial inference as to its meaning. By hearing or reading the word in different contexts, through a process of generalization, discrimination and eduction one can guess at the essence of the meaning of the word, so as to use it (experimentally) oneself the next time a similar context presents itself. Words move from being unfamiliar to familiar, from familiar but not really understood to being familiar and partly understood (at which stage the explanations given about the meaning of the word are threadbare and inaccurate), and from there to being explained by use of synonyms (though those can range from partial to full understanding as shown by power of the explanations and definitions).

Testing vocabulary precisely is quite complicated, because you have to test how well subjects really understand the words in question. It is a bit like trying to find out whether people can really handle heavy machinery, as opposed to boasting about it. Typically vocabulary tests work up from common to rare words, and specify what sorts of definitions and explanations will get full points. On multiple choice questions, the use of distractor items often reveals that many people have misunderstood the meanings of words that are new to them. For example, some people who think they know how to define FATUOUS are distracted by the option large. An argument may be witless, silly and pointless, but not obese.  

A very short vocabulary test, which correlates 0.71 with IQ, is the ten word test in the General Social Survey (US). Can something so crude yield interesting results? Yes. Razib Khan has a very informative post on this. In my view, no survey should be conducted without including a test like this, which provides a very good estimate of intelligence.


A stab at using this test to calculate the intellectual demands of particular jobs is provided by The Audacious Epigone. Note the low score for academics.


Nonetheless, long vocabulary tests, or dynamic computer—administered tests which adapt quickly to your difficulty level provide reasonable estimates of your total word store. In that sense, intelligence ranges from 0 to 45,000 words (the real upper limit if one avoids technical jargon), and one can put a single number on it, on a proper ratio scale. Rating people by the size of their word stores makes sense.  Although 3000 words will provide a great deal of cost-effective and very useful communication, additional words bring conceptual benefits. Knowledge of the 3000 most frequent words in the English language will probably result in your understanding 95% of what is said to you, and knowledge of 5000 “word families” (the main word and its variants, like quick: quickly, quicker, quickest) should mean that you would be able to understand 99.9%. Why have more? The answer is that much of good thinking depends on a powerful vocabulary. Carpentry can be done with a standard tool set, cabinet making requires finer, more specialised tools.

As a rough guide, teenagers have about 12,000 words and college students 17,000. Older adults have 17,000 to 21,000 words, and a minority have many more.  Some conceited person referred to 20,000 words as being “the incoherence boundary”. I eschew such contemptuous judgements.

Bright children acquire vocabulary faster than duller children, and thus brighter adults have larger vocabularies, because they require fewer contexts to work out word meanings, and make more subtle discriminations in meanings between similar words. In 1962 Alice Heim, to whom a statue should be raised somewhere, designed a new test of verbal reasoning called “The word in context”.  Charming and intriguing, it featured an unfamiliar foreign word in a descriptive paragraph. Subsequent paragraphs gave more descriptive context, until the elusive meaning was potentially resolved. A brilliant idea, but the test took too long for it to be used in psychometry.

Most crucially, bright people often observe things and have thoughts about things before they learn the words they need to express them. They have the need before the word, so that when the word comes into view they seize on it with pleasure, and relief. Such words are treasured, and stick in the mind because of their elegant utility. Words for which we have no need are shapeless, and never stick in the mind.

So, a word is not just something we have heard, like a bird-song. A word is a cog in a meaning machine. To learn a word is to mine the essence of a context, to condense a cloud of implications into the condensate of definition.

You may wish to discuss this post with someone you love. Be warned: even in autochthonous pairings, discrepancies in terpsichorean accomplishments can lead to uxoricide.


Do not be captious after reading this post.

Saturday, 8 June 2013

By the age of three, a clear gap in ability

 
Jason  Malloy has been working on an interesting question, which is to determine at whether there is a clear gap in ability between black and white three year old children in America.

http://humanvarieties.org/2013/05/26/the-onset-and-development-of-b-w-ability-differences-early-infancy-to-age-3-part-1/comment-page-1/#comment-741

Some years ago there was much talk in Britain about tests which found that all racial groups began schooling at age 5 with identical “school readiness” results, but then progressively diverged as the years went by, strongly suggesting that schools were failing to teach British black children properly. I was never able to get to the bottom of what these tests were, and what range of skills they covered, and whether there were ceiling effects on the tests, but it was clear that pre-school intelligence testing results would cast light on whether children of different genetic backgrounds really started school with the same level of ability.

Malloy found 29 usable studies containing 35 different samples of children born between 1936 and 2000. There is IQ data for 2569 black children and 2762 white children, age 3

“A majority of these samples contain control groups of whites who were tested at similar times, and under similar conditions. When we compare the 20 samples with both blacks and whites, we get a difference of almost exactly 1 standard deviation: the black IQ is 85.4 and the white IQ is 100.8 (15.4 points/1.03σ).”
 
The figure shows the studies plotted out by year.
 

There is no particular trend (one early outlier study was excluded) and the pre-school intelligence gap looks pretty solid. The next post will discuss measurement issues. The predictive power of intelligence tests increase with subject age. Conventionally 7 years gives the first reliable indication of adult intelligence. 11 years is far better, but there is still more maturation to take place.

One more post to follow on this topic.  










Friday, 7 June 2013

Patronage and the arts of courtly flattery

IMG_3804

I have been reading my favourite blogs for years, without paying a penny for the privilege. The material was there, free, so I just read it. When some of my much-visited bloggers admitted they were facing destitution, I sent them some dollars every now and then. Life being what it is, I only did that when they reminded me that they could not continue writing without some income to support them. It was hit and miss. One day, reading Steve Sailer’s blog, I noticed a little ”Flattr” button at the end of each post.

flattr.com is a way of making social micro-donations. You set aside a fixed sum which will be distributed each month according to which blogs you think deserve support. The more you flatter different writers by clicking on their “flattr” buttons the more finely the donations are distributed, but they never exceed your maximum monthly sum. Think of it as a way of paying to read a newspaper, but ensuring that your payment goes to those columnists whose writings you most value, and to no-one else.

And now, back to the program.

ORIGINAL PAPER: How clever were the Victorians? A comment on Woodley et al. (2013) by Elijah L. Armstrong


Abstract: Woodley et al. (2013) cite declines in simple reaction time as evidence of dysgenesis. In this paper it is conceded that these declines are strong evidence for a dysgenic trend. However, declines in g cannot be inferred from reaction time declines alone.

Woodley et al. (2013) are quite correct that the existence of a secular decline in reaction time suggests dysgenesis. However, the secular decline in reaction time is probably a poor quantification, per se, of the exact dysgenesis rate. Woodley (2012) argues that the Flynn effect may be caused by increasing specialization in cognitive abilities. If specialization has truly increased, one would expect to see a secular decline in certain such abilities. These abilities are probably, for the most part, not measured on typical g-loaded tests (Lynn, 1998 gives the example of farming ability), but it is nevertheless to be expected that some g-loaded tests will show a secular decline. It may be responded that reaction time’s shared variance with g is wholly genetic (Woodley et al. cite Rijsdijk et al., 1998 on this matter) and therefore changes in specialization will have a minor impact. However, even if the environmental influences on reaction time are different from those on IQ, there may still be considerable environmental influences. Moreover, reaction time influences mortality rates (Deary & Der, 2005). Declining reaction time independent of g fits into Woodley’s (2012) life-history model because there would be less pressure to develop a mortality-mediating ability in a less environmentally harsh environment. It should be noted as well that even though simple reaction time shows little or no training effect (Kida et al., 2005), there may be other processes that decrease simple reaction time, such as imprinting (Armstrong & Woodley, under review).

While Woodley et al. extract declines in g from the declines in reaction time (given a .54 correlation), simply multiplying the decline in reaction time by the g-loading is not sufficient to establish a decline in g (cf. Dickens & Flynn, 2001 for discussion of a similar issue). Using a similar procedure on IQ tests for the Flynn effect would imply high g gains (say, if performance on a test with a g-loading of 0.8 has increased by a d of 1, this procedure would imply that g has increased by a d of 0.8). However, the Flynn effect is not on g (Woodley, 2011, 2012a, 2012b; te Nijenhuis & van der Flier, 2013).

A number of similar declines (approximately 1 SD since the Victorian era) on other highly g-loaded tests or abilities would corroborate Woodley et al.’s dysgenesis estimate. To the best of my knowledge, though, there are few tests that have shown a secular decline; the SATs have, but the population has grown increasingly representative (e.g., Williams and Ceci, 1997; Sailer, 2011a, 2012). Piagetian tasks show a decline (Shayer et al., 2007), and if the decline in g estimated from secular trends in Piagetian tasks is comparable to the decline in g estimated from secular trends, this would corroborate a 1 SD dysgenesis estimate. Likewise, if the decline in IQ among wealthy countries that are no longer experiencing the Flynn effect (e.g., Sundet et al., 2004) was similar to the decline measured using reaction time, Woodley et al.’s estimate would be validated.

Finally, it should be noted that a g decline of 1 SD is difficult to believe (cf. Charlton, 2013; Flynn, 1987; Guha, 2001 for discussion of a similar issue).[1] A community with average levels of g 1 SD higher than modern populations would be supermen. Ashkenazi Jews, who are a tremendously successful ethnic group, appear to have IQs around 110 (e.g., Cochran et al., 2005; Lynn, 2011; Sailer, 2011b). Hence even the most intellectually successful ethnic group would have IQs five points lower than the Victorians, if Woodley et al. are correct. This process of devolution is made quite incredible by the fact that it is hypothesized to have occurred in only 130 years (Cochran, 2012).

References

Armstrong, E., and Woodley, M. A. The rule-dependence model explains the commonalities between the Flynn effect and IQ gains via retesting. Under review.

Charlton, B. (2013). "Extraordinary claims require extraordinary evidence" - with respect to the claim of intelligence decline since Victorian times. Retrieved from http://charltonteaching.blogspot.com/2013/05/extraordinary-claims-require.html.

Cochran, G., et al. (2006). Natural history of Ashkenazi intelligence. Journal of Biosocial Science, 38, 659-693.

Cochran, G. (2012). The long and short of it. Retrieved from http://westhunt.wordpress.com/2012/11/04/the-long-and-short-of-it/.

Deary, I., and Der. G. (2005). Reaction time explains IQ’s association with death. Psychological Science, 16, 64-69.

Dickens, W., and Flynn, J. R. (2001). Heritability estimates versus large environmental effects: The IQ paradox resolved. Psychological Review, 108, 346-369.

Flynn, J. R. (1987). Massive IQ gains in 14 nations: What IQ tests really measure. Psychological Review, 101, 179-191.

Guha, S. (2001). A philosopher’s paradise––in inspired lunacy. Retrieved from http://www.amazon.com/review/R2ER6NZI1H2WJ/ref=cm_aya_cmt?ie=UTF8&ASIN=0631224262#wasThisHelpful.

Rijsdijk, F. V., et al. (1998). The genetic basis of the relation between speed-of-information-processing and IQ. Behavioural Brain Research, 95, 77-84.

Lynn, R. (1998). In support of the nutrition theory. In U. Neisser (Ed.), The rising curve: Long-term gains in IQ and related measures (pp. 207-215). Washington, D. C.: American Psychological Association.

Lynn, R. (2011). The Chosen People. Augusta, GA: Washington Summit Publishers.

Sailer, S. (2011a). SAT score changes by race since 1996. Retrieved from http://isteve.blogspot.com/2011/09/sat-score-changes-by-race-since-1996.html

Sailer, S. (2011b). Lynn on the Jews: Yes, it’s intelligence –– but there’s something else too. Retrieved from http://www.vdare.com/articles/lynn-on-the-jews-yes-it-s-intelligence-but-there-s-something-else-too.

Sailer, S. (2012). SAT and ACT: How hard are they scraping the bottom of the barrel and are they finding any diamonds in the rough? Retrieved from http://isteve.blogspot.com/2012/10/sat-and-act-how-hard-are-they-scraping.html

Shayer, M., et al. (2007). Thirty years on – a large anti-Flynn effect? The Piagetian test Volume & Heaviness norms 1975–2003. British Journal of Educational Psychology, 77, 25-41.

Sundet, J.M., et al. (2004). The end of the Flynn effect?
A study of secular trends in mean intelligence test scores of Norwegian conscripts during half a century. Intelligence, 32, 349-362.

te Nijenhuis, J., & van der Flier, H. (in press). Is the Flynn effect on g?: A meta-analysis. Intelligence.

Williams, W. M., and Ceci, S. J. (1997). Are Americans becoming more or less alike? Trends in race, class, and ability differences in intelligence. American Psychologist, 52, 1226-1235.

Woodley, M. A. (2011a). Heterosis doesn’t cause the Flynn effect: A critical examination of Mingroni (2007). Psychological Review, 118, 689-693.

Woodley, M. A. (2012a). The social and scientific temporal correlates of genotypic intelligence and the Flynn effect. Intelligence, 40, 189–204.

Woodley, M. A. (2012b). A life history model of the Lynn-Flynn effect. Personality
and Individual Differences, 53, 152–156.

Woodley, M.A., et al. (in press) Were the Victorians cleverer than us? The decline in general intelligence estimated from a meta-analysis of the slowing of simple reaction time. Intelligence.



[1] Charlton’s discussion of this line of reasoning is critical.

Thursday, 6 June 2013

Motion Quotient, and why you need a friend

Duje Tadin, who did the motion quotient work with one of his research team, explains that for proper results you need to use controlled conditions, because even the level of illumination in the room will affect the findings

"Unfortunately, there is no good way to compare the results across computers. For example, "suppression" is weaker at low contrast, so somebody will have a weaker suppression looking at the video in a bright room than in a dark environment. Also the actual monitor frame rate will make a big difference. What can be compared is if two people look at the video under same conditions.
This provides the same info:
http://www.telegraph.co.uk/science/science-video/10076355/US-scientists-unveil-new-IQ-measure-take-the-test.html

So, this is why you will need a friend. However, the results could potentially damage the friendship. I leave that dilemma to you.

Wednesday, 5 June 2013

Shibboleth: Test your vocabulary (and your honesty)

Shibboleth is simply a word that you will mispronounce unless you know how a particular tribe, cohort or gang pronounces it. It serves as a password to an exclusive community. Naturally, like all human culture, it has a dark side. Pronounce the word wrong while trying to worm your way into such a secluded community, and you may be banished, or attacked for your impudence.

Here at Psychological Comments we try to be more cognitively demanding. Your pronunciation of words is of marginal interest. Your reaction times, on the other hand, have some predictive value, so we tend to pore over those to test the quality of our readers. HBDchick is in pole position on this measure, and you are encouraged to test your own.

However, why not test yourself on something very closely related to intelligence, and something profoundly human: your vocabulary? Sure, you will be doing an intelligence test, but why not? The link is below

http://testyourvocab.com/

A bold reader, Elijah Armstrong is in pole position on this one with 32,800 words.

Best of all, this test requires the testee to be honest. You can make the number up by pretending to know the  words offered up to you, but in that case you would be missing the point, and failing to understand yourself.

No such problem would ever afflict readers of this blog.

A very good morning to you all.

Tuesday, 4 June 2013

The Motion Quotient and other distractions


Galton believed that cleverness would be associated with the capacity to make fine sensory discriminations. As usual, he was way ahead of his time, but he did not have the statistics available to analyse his results with sufficient power. Once Binet had taken the broad-brush educational approach by using a non-theoretical selection of mental tasks to develop intelligence quotients (skills expressed by comparison with peers of the same age) research on sensory intelligence lapsed. Only when under pressure to explain the underlying basis of intelligence in the 1960’s did researchers return with renewed interest to sensory measures like inspection time, choice reaction time, brain waves, and sensory nerve transmission times. They put together a reasonable case that whatever intelligence is, it can be measured weakly by physiological surrogates with little intellectual content, and minimal cultural influences.

Now a new test of sensation has entered the lists. Melnick et al. A Strong Interactive Link between Sensory Discriminations and Intelligence
Current Biology 23, 1013–1017, June 3, 2013 ª2013 Elsevier Ltd All rights reserved http://dx.doi.org/10.1016/j.cub.2013.04.053

Already dubbed the “Motion Quotient”, this procedure looks at the sensation of movement caused by displays of lines in either a small or large visual field. This is better seen in action than described in words, although the sequence of test examples below is not itself very well explained. The test items come too quickly for you to be able to record your answers, but they ask two questions: do the lines appear to move to the right or the left, and is it easier to make that judgement with a small focussed display or a larger screen version?


My somewhat confused answer was that I had little idea what was going on, but of course it was much easier to see the lines moving on a big display than a small central display. It was “bloody obvious”, to use a British colloquialism.

However, my impressions turn out to be of little consequence. As you would expect from any inspection task, the experimenters adaptively adjusted the stimulus duration to estimate the shortest exposure durations sufficient for threshold level performance. In other words, the experiment has to be calibrated to each individual person, and you cannot really work anything out about yourself by looking at the above demonstration.

There is a lot to like in this study. They drew their volunteers from the general population, and tested them face to face on the Wechsler individual intelligence test, which is the best validated. That must have been time consuming. They know exactly how their result fits in with inspection time research, namely that they are getting a higher correlation than is usual in this often disappointing line of research.  They have convincing explanation as to why their somewhat more complex “grating stimulus” is a better, and more ecologically valid test of sensory discrimination, namely that it tests the subject’s ability to detect the important signal from the unimportant but distracting background noise. They say: “Rapid processing is of limited utility unless it is restricted to the most relevant information” Finally, they can show a correlation between their test and intelligence which is probably around 0.7 which is as good as the correlation between a subtest and the overall IQ figure.

So, why am I not celebrating their impressive result? First, we have been here before. The early results on inspection time looked almost as good. Second, one should be slightly on guard when the result is in line with what one wants. Third, one should be particularly on guard when the explanation for the results are given by clever researchers who have thought through their experiment carefully.

Nope, I have only one real gripe.

N=67