Wordsmith.org
Posted By: JohnHawaii Interesting item on English - 06/11/03 01:28 AM
This may have been discussed in some form or other before I found my way to this forum, but I came across this item in today's local paper and thought it might be of interest:
"You've got about 600,000 English words to work with. But only 43 of them make up half of everything you say. Only nine go into a quarter of what you say. These nine are:
and, be, have, it, of, the, to, will, you. So contends that language expert Robert Chapman."
I have no information regarding how he arrived at these figures, but it seems to me that a statistical study of "what people say" would be a very difficult undertaking and would require a number of clearly stated parameters.

Posted By: alexis Re: Interesting item on English - 06/11/03 01:38 AM
And is it purely 'saying' - verbal - or also writing? I couldn't believe that if it related also to writing, since it is usually significantly more lexically dense (ratio of content words to structural words). This, of course, is one of the distinguishing differences between the two (speaking and writing), and one big thing that children need to learn when figuring out how to 'write' - don't write as you speak!

Hmm, sorry, did I go off topic??

Posted By: wwh Re: Interesting item on English - 06/11/03 02:22 AM
One use I can see for such studies would be to help persons studying foreign languages. I used to be able to read German and French, but didn't have the words needed for even simple conversation. I suppose the same thing would be true of foreigners learning English.

Posted By: wsieber Re: Interesting item on English - 06/11/03 07:24 AM
help persons studying foreign languages -
On closer inspection, I have some doubts about the usefulness of learning words according to their statistical frequency. The most frequent words quoted above are merely the "glue" that holds together what you really want to say. Urgent communication - disregarding grammar - can often do without them.


Posted By: Wordwind Re: Interesting item on English - 06/11/03 09:03 AM
The 600,000 figure is way too low. The latest impossibly, impossible count I read about had the figure 200,000 times higher--and that article was from several years ago.

We have discussed here the impossibility of determining an accurate count of English vocabulary, especially so many very fine words never find their way into a dictionary, wonderful words such as my little pet 'google.'

But, for those who attempt to count professionally however they go about it, the count was over 800,000 several years back.

I, for one, think it would be fascinating to have a way of determining a person's personal vocabulary--including proper nouns. Seems one should get credit for all the persons, places, book titles, etc., one is familiar with.

Posted By: Buffalo Shrdlu Re: Interesting item on English - 06/11/03 09:35 AM
200,000 times (emphasis added)

now you're getting into googol range!

Posted By: sjm Re: Interesting item on English - 06/11/03 10:15 AM
>The latest impossibly, impossible count I read about had the figure 200,000 times higher--and that article was from several years ago.

Say What?!120,000,000,000 words?

Posted By: Faldage Re: Interesting item on English - 06/11/03 10:38 AM
googol range

Lessee, 10,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,
000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000/120,
000,000,000=83,333,333,333,333,333,333,333,333,333,333,333,333,333,333,
333,333,333,333,333,333,333,333,333,333,333,333,333,333,333,333.

That's a couple orders of magnitude of difference. Not exactly googol range.

Posted By: alexis Re: Interesting item on English - 06/11/03 10:51 AM
I actually think that learning the 'glue' (structural language) is a very important part of language to learn, and know, especially in a foreign language... hard to make sense if you can't explain who is doing what to whom in your sentence, and I think it would enable you to ask questions to get the rest of the needed, content, language

Posted By: Buffalo Shrdlu Re: Interesting item on English - 06/11/03 12:31 PM
googol range

Lessee, 10,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,
000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000/120,
000,000,000=83,333,333,333,333,333,333,333,333,333,333,333,333,333,333,
333,333,333,333,333,333,333,333,333,333,333,333,333,333,333,333.

That's a couple orders of magnitude of difference. Not exactly googol range.


augh! my nit's been picked! I claim hyperbolic extension.
(besides, who could pass up the google/googol bit? not I, said the nit, licentiously.)

Posted By: tsuwm Re: Interesting item on English - 06/11/03 12:36 PM
>=83,333,333,333,333,333,333,333,333,333,333,333,333,333,333, 333,333,333,333,333,333,333,333,333,333,333,333,333,333,333,333.
That's a couple orders of magnitude of difference. Not exactly googol range.

not exactly a couple, either! <g>

Posted By: Buffalo Shrdlu Re: Interesting item on English - 06/11/03 12:38 PM
thppt!

Posted By: Jackie Re: Interesting item on English - 06/30/03 07:03 PM
Interesting post, JH. Who is Robert Chapman, anyway? I have my doubts about that "But only 43 of them make up half of everything you say", too. Let me try and think, here--and of course this will have to discount proper nouns, as they would change from person to person. Here we go--as many as I can think of, of the words I use most commonly:
a, an, and, the, I, you, he, she, it, his, hers, yours, theirs, it's (apostrophe deliberate--I don't use "its" all that often), we, they, your, yours, be, am, is, are, were, was, go, going, to, for, some, too, went, gone, out, in, of, about, just, only, few, quite, think, thinking, thought, ask, asked, asking, read, reading, talk, talked, talking...
That's 51, and I haven't even gotten to the nouns yet. Did Mr. Chapman give a list of these 43?

Posted By: JohnHawaii Re: Interesting item on English - 07/01/03 01:40 AM
Chapman was the author of "A Dictionary of American Slang," published in 1960. I wasn't familiar with the man or his work until I came across a short item in the newspaper that made the statement I posted. I'm not prepared to defend it, having no information on his methodology. Your list of most-used words casts reasonable doubt on his conclusions. Unfortunately, the 43 words were not included in the article.

Posted By: wsieber Re: Interesting item on English - 07/01/03 01:16 PM
Im a little surprised you didn't include arrrrgh..

Posted By: Jackie Re: Interesting item on English - 07/01/03 06:03 PM
Augh!

© Wordsmith.org