Wordsmith.org: the magic of words

Wordsmith Talk

About Us | What's New | Search | Site Map | Contact Us  

Previous Thread
Next Thread
Print Thread
#105336 06/11/2003 1:28 AM
Joined: Jan 2003
Posts: 171
member
member
Offline
Joined: Jan 2003
Posts: 171
This may have been discussed in some form or other before I found my way to this forum, but I came across this item in today's local paper and thought it might be of interest:
"You've got about 600,000 English words to work with. But only 43 of them make up half of everything you say. Only nine go into a quarter of what you say. These nine are:
and, be, have, it, of, the, to, will, you. So contends that language expert Robert Chapman."
I have no information regarding how he arrived at these figures, but it seems to me that a statistical study of "what people say" would be a very difficult undertaking and would require a number of clearly stated parameters.


Joined: Apr 2002
Posts: 148
member
member
Offline
Joined: Apr 2002
Posts: 148
And is it purely 'saying' - verbal - or also writing? I couldn't believe that if it related also to writing, since it is usually significantly more lexically dense (ratio of content words to structural words). This, of course, is one of the distinguishing differences between the two (speaking and writing), and one big thing that children need to learn when figuring out how to 'write' - don't write as you speak!

Hmm, sorry, did I go off topic??


Joined: Jan 2001
Posts: 13,858
wwh Offline
Carpal Tunnel
Carpal Tunnel
Offline
Joined: Jan 2001
Posts: 13,858
One use I can see for such studies would be to help persons studying foreign languages. I used to be able to read German and French, but didn't have the words needed for even simple conversation. I suppose the same thing would be true of foreigners learning English.


Joined: Mar 2000
Posts: 1,027
old hand
old hand
Joined: Mar 2000
Posts: 1,027
help persons studying foreign languages -
On closer inspection, I have some doubts about the usefulness of learning words according to their statistical frequency. The most frequent words quoted above are merely the "glue" that holds together what you really want to say. Urgent communication - disregarding grammar - can often do without them.



Joined: Sep 2001
Posts: 6,296
Carpal Tunnel
Carpal Tunnel
Offline
Joined: Sep 2001
Posts: 6,296
The 600,000 figure is way too low. The latest impossibly, impossible count I read about had the figure 200,000 times higher--and that article was from several years ago.

We have discussed here the impossibility of determining an accurate count of English vocabulary, especially so many very fine words never find their way into a dictionary, wonderful words such as my little pet 'google.'

But, for those who attempt to count professionally however they go about it, the count was over 800,000 several years back.

I, for one, think it would be fascinating to have a way of determining a person's personal vocabulary--including proper nouns. Seems one should get credit for all the persons, places, book titles, etc., one is familiar with.


Joined: Jun 2002
Posts: 7,210
Carpal Tunnel
Carpal Tunnel
Joined: Jun 2002
Posts: 7,210
200,000 times (emphasis added)

now you're getting into googol range!



formerly known as etaoin...
Joined: Jul 2002
Posts: 742
sjm Offline
old hand
old hand
Offline
Joined: Jul 2002
Posts: 742
>The latest impossibly, impossible count I read about had the figure 200,000 times higher--and that article was from several years ago.

Say What?!120,000,000,000 words?


Joined: Dec 2000
Posts: 13,803
Carpal Tunnel
Carpal Tunnel
Joined: Dec 2000
Posts: 13,803
googol range

Lessee, 10,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,
000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000/120,
000,000,000=83,333,333,333,333,333,333,333,333,333,333,333,333,333,333,
333,333,333,333,333,333,333,333,333,333,333,333,333,333,333,333.

That's a couple orders of magnitude of difference. Not exactly googol range.


Joined: Apr 2002
Posts: 148
member
member
Offline
Joined: Apr 2002
Posts: 148
I actually think that learning the 'glue' (structural language) is a very important part of language to learn, and know, especially in a foreign language... hard to make sense if you can't explain who is doing what to whom in your sentence, and I think it would enable you to ask questions to get the rest of the needed, content, language


Joined: Jun 2002
Posts: 7,210
Carpal Tunnel
Carpal Tunnel
Joined: Jun 2002
Posts: 7,210
googol range

Lessee, 10,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,
000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000/120,
000,000,000=83,333,333,333,333,333,333,333,333,333,333,333,333,333,333,
333,333,333,333,333,333,333,333,333,333,333,333,333,333,333,333.

That's a couple orders of magnitude of difference. Not exactly googol range.


augh! my nit's been picked! I claim hyperbolic extension.
(besides, who could pass up the google/googol bit? not I, said the nit, licentiously.)



formerly known as etaoin...
Joined: Apr 2000
Posts: 10,542
Carpal Tunnel
Carpal Tunnel
Joined: Apr 2000
Posts: 10,542
>=83,333,333,333,333,333,333,333,333,333,333,333,333,333,333, 333,333,333,333,333,333,333,333,333,333,333,333,333,333,333,333.
That's a couple orders of magnitude of difference. Not exactly googol range.

not exactly a couple, either! <g>


Joined: Jun 2002
Posts: 7,210
Carpal Tunnel
Carpal Tunnel
Joined: Jun 2002
Posts: 7,210
thppt!



formerly known as etaoin...
Joined: Mar 2000
Posts: 11,613
Carpal Tunnel
Carpal Tunnel
Joined: Mar 2000
Posts: 11,613
Interesting post, JH. Who is Robert Chapman, anyway? I have my doubts about that "But only 43 of them make up half of everything you say", too. Let me try and think, here--and of course this will have to discount proper nouns, as they would change from person to person. Here we go--as many as I can think of, of the words I use most commonly:
a, an, and, the, I, you, he, she, it, his, hers, yours, theirs, it's (apostrophe deliberate--I don't use "its" all that often), we, they, your, yours, be, am, is, are, were, was, go, going, to, for, some, too, went, gone, out, in, of, about, just, only, few, quite, think, thinking, thought, ask, asked, asking, read, reading, talk, talked, talking...
That's 51, and I haven't even gotten to the nouns yet. Did Mr. Chapman give a list of these 43?


Joined: Jan 2003
Posts: 171
member
member
Offline
Joined: Jan 2003
Posts: 171
Chapman was the author of "A Dictionary of American Slang," published in 1960. I wasn't familiar with the man or his work until I came across a short item in the newspaper that made the statement I posted. I'm not prepared to defend it, having no information on his methodology. Your list of most-used words casts reasonable doubt on his conclusions. Unfortunately, the 43 words were not included in the article.


Joined: Mar 2000
Posts: 1,027
old hand
old hand
Joined: Mar 2000
Posts: 1,027
Im a little surprised you didn't include arrrrgh..


Joined: Mar 2000
Posts: 11,613
Carpal Tunnel
Carpal Tunnel
Joined: Mar 2000
Posts: 11,613
Augh!



Moderated by  Jackie 

Link Copied to Clipboard
Disclaimer: Wordsmith.org is not responsible for views expressed on this site. Use of this forum is at your own risk and liability - you agree to hold Wordsmith.org and its associates harmless as a condition of using it.

Home | Today's Word | Yesterday's Word | Subscribe | FAQ | Archives | Search | Feedback
Wordsmith Talk | Wordsmith Chat

© 1994-2025 Wordsmith

Powered by UBB.threads™ PHP Forum Software 8.0.0