English Grammar: A Resource Book for Students


Some numerical facts
In the corpus of approximately 7.3 million words, used in Chapters 3 and 4, there are
2,320 instances of the different forms of the lemma set. We associate together the forms
set, sets, and setting as instances of the word set, and the frequency of each is:

Form      Frequency   Share of lemma
set       1855        (80%)
sets      219         (9%)
setting   246         (11%)
Other possible associates such as setter and settee are ignored.
Set is thus one of the commonest words in the language – the uninflected form
is ranked number 272. However, if we compare the relative frequency of the inflected
forms sets and setting, we see that they are not nearly as common as set, being
approximately 9 per cent and 11 per cent of the lemma.
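
The percentages quoted above can be checked with a few lines of arithmetic. The following is only a minimal sketch, taking 1,855 as the count for set (the figure that agrees with the stated total of 2,320 instances); the names form_counts and lemma_shares are illustrative and are not part of the original study.

```python
# Counts for the forms of the lemma "set". The count for "set" is taken
# as 1,855 so that the three counts add up to the stated total of 2,320.
form_counts = {"set": 1855, "sets": 219, "setting": 246}

lemma_total = sum(form_counts.values())


def lemma_shares(counts):
    """Return each form's share of the lemma as a whole-number percentage."""
    total = sum(counts.values())
    return {form: round(100 * n / total) for form, n in counts.items()}


shares = lemma_shares(form_counts)
for form, n in form_counts.items():
    print(f"{form:<8}{n:>6}  ({shares[form]}%)")
```

Rounded to whole numbers, the shares come out at 80, 9 and 11 per cent, which matches the figures given for the three forms.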
This is a commonly observed pattern, where one of the forms is much more
common than any other. Similar, if less dramatic, tendencies are shown for decline
and yield in Chapters 3 and 4 respectively. This means that if sets or setting has a use
which is not shared by set, we have much less evidence to go on. Whatever criteria
we use, there is nearly ten times as much evidence available for set.
It could be argued that, in one respect at least, the inflection of set is untypical,
and that the frequency of forms of set will reflect the oddity. Set is one of a handful
of verbs in English which do not have a separate past tense form. So whatever frequency
is assigned to walk and walked, say and said, etc. is not differentiated in set.
To complicate matters further, all three forms of the lemma set are also readily
available as nouns, so the picture is not at all straightforward.
However, compared to the vast majority of words, even the least common form
sets is generously represented. But when we look for combinations of even these
frequent words, the expectations are not promising.
If a corpus is held to be representative of the language as a whole, the probability
of occurrence of a word-form can be expressed in general as a relation
between the frequency of the word-form in the corpus and the total number of
word-forms in the corpus.
In the case of set this is:
1855 / 7,300,000, or approximately 0.00025
This means that the chance of set being the next word in the text is about
250 per million, or one occurrence in every 3,935 words.
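
The calculation can be sketched in a few lines, assuming the counts given in the text (1,855 occurrences of set in a corpus of approximately 7.3 million word-forms). The function name relative_frequency is illustrative only. Because the corpus size is itself an approximation, the results should be read as rough estimates.

```python
# Figures taken from the text; the corpus size is the approximate
# 7.3 million word-forms, not an exact count.
SET_COUNT = 1855
CORPUS_SIZE = 7_300_000


def relative_frequency(count, corpus_size):
    """Estimate the probability of a word-form as count / corpus size."""
    return count / corpus_size


p_set = relative_frequency(SET_COUNT, CORPUS_SIZE)
per_million = p_set * 1_000_000
words_per_occurrence = CORPUS_SIZE / SET_COUNT

print(f"probability: {p_set:.5f}")
print(f"occurrences per million words: {per_million:.0f}")
print(f"one occurrence in every {words_per_occurrence:,.0f} words")
```

The quotient works out to about 0.00025, that is, roughly 250 occurrences per million words, or one occurrence in every 3,935 words.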
