English Grammar: a resource Book for Students
Download 1.74 Mb. Pdf ko'rish
|
English Grammar- A Resource Book for Students
Some numerical facts
In the corpus of approximately 7.3 million words, used in Chapters 3 and 4, there are 2,320 instances of the different forms of the lemma set. We associate together the forms set, sets, and setting as instances of the word set, and the frequency of each is: 228 E X T E N S I O N set 1885 (80%) sets 219 (9%) setting 246 (11%) Other possible associates such as setter and settee are ignored. Set is thus one of the commonest words in the language – the uninflected form is ranked number 272. However, if we compare the relative frequency of the inflected forms sets and setting, we see that they are not nearly as common as set, being approximately 9 per cent and 11 per cent of the lemma. This is a commonly observed pattern, where one of the forms is much more common than any other. Similar, if less dramatic, tendencies are shown for decline and yield in Chapters 3 and 4 respectively. This means that if sets or setting has a use which is not shared by set, we have much less evidence to go on. Whatever criteria we use, there is nearly ten times as much evidence available for set. It could be argued that, in one respect at least, the inflection of set is untypical, and that the frequency of forms of set will reflect the oddity. Set is one of a handful of verbs in English which do not have a separate past tense form. So whatever fre- quency is assigned to walk and walked, say and said, etc. is not differentiated in set. To complicate the picture further, all three forms of the lemma set are also readily available as nouns, and the picture is not at all straightforward. However, compared to the vast majority of words, even the least common form sets is generously represented. But when we look for combinations of even these frequent words, the expectations are not promising. If a corpus is held to be representative of the language as a whole, the prob- ability of occurrence of a word-form can be expressed in general as a relation between the frequency of the word-form in the corpus and the total number of word-forms in the corpus. In the case of set this is: 1855 7,300,000 or 0.0025 This means that the chance of set being the next word in the text is about 250 per million, or one occurrence in every 3,935 words. Download 1.74 Mb. Do'stlaringiz bilan baham: |
Ma'lumotlar bazasi mualliflik huquqi bilan himoyalangan ©fayllar.org 2024
ma'muriyatiga murojaat qiling
ma'muriyatiga murojaat qiling