Some notes on the thematic meta-metric of the national corpus of Uzbek dialects


Download 0.69 Mb.
Pdf ko'rish
bet1/5
Sana25.10.2023
Hajmi0.69 Mb.
#1721406
  1   2   3   4   5
Bog'liq
Some notes on the thematic meta-metric of the national corpus of Uzbek dialects



Philosophical Readings XIII.4 (2021), pp. 3267-3275. 3267 
Info@philosophicalreadings.org
10.5281/zenodo.5815371 
Some notes on the thematic meta-metric of the national corpus of Uzbek dialects 
(On the example of dialects using the sound "j" in Boysun district) 
Kholova Muyassar Abdulhakimovna
Basic doctoral student of the Department of Uzbek Linguistics, Termez State University 
+99897 530 50 21. xolovam@tersu.uz 
Bakhtiyor Rajabovich Mengliyev 
Doctor of Philology, Professor, Tashkent State University of the Uzbek Language and Literature 
Abstract: The article deals with the role of dialectal texts in the world linguistic corpus, the Uzbek 
national corpus, the creation of a system for accurate, easy and convenient search for necessary 
information, meta-measurement of small parts of the dialect and large texts: social, economic and 
political disunity and the degree to which it is convenient to draw conclusions when studying 
thematic tags of texts in Uzbek dialects. 
Keywords: Uzbek National Corpus, dialectal text, thematic meta-metric, passport meta-metric, 
thematic: social, economic and political divisibility. 
 
It is known that the first linguistic corpuses in science began to appear in the 60s of the last 
century. In 1963, at Brown University in the United States, the first large Brown Corpus was 
created, followed by other languages and the American version of 500 prose texts in 50 genres and 
2,000 words in English was developed by the co-authors U. Francis and H. Kucher. 
It is known that Austria, China (Mandarin dialects corpus [21]), Germany [22], Portugal [23], 
Czech Republic [24], Scandinavia [25], Poland, Lithuania, Georgia ( for example: Cor-Dial-Sin; 
Helsinki corpus of English dialects [26]; Norwegian dialect corpus [27]; Archive fur gesprochenes 
Deutsch, Die bayerische Dialektadebank; LEXDIALGRAM, etc.), Moscow, Slavic-Kuban
Vologda, Saratov and Kazan Such dialectal corpus as in the central cities of Russia (Dialectic 
podkorpus v sostave Natsionalnogo korpusa russkogo yazika, Saratovskiy dialektologicheskiy 
korpus, e-library of Russian dialects [28], electronic corpus of Kuban dialect culture [8]). In the 
creation of the national corpus in Russian linguistics: the structure of the Russian national corpus 
("Natsionalniy korpus russkogo yazika"), the work done in St. Petersburg, Saratov institutes and 
Tomsk [29], the corpus and bases such as the Boysun dialect base [30] evidence. 
The structure of dialectal corpus can be divided into 2: 1. Dialectal corpus based on the formal 
aspect of other language corpus, taking into account the features of their dialects: Portuguese corpus 
- The syntax-oriented Corpus of Portuguese dialects, Polish corpus - Dialesty I gwary Polish 
Compendium internetowe, Spanish corpus - Corpus Oral y Sonoro del Español, Bulgarian corpus - 
Bulgarian dialestology as Living, Georgian corpus (2 054 705 words, 3400 texts), Mandarin corpus 
in Chinese (this dialect is in Chinese) manifests itself in diversity and variability), and so on. 2. 
Dialectal corpus based on the dialect of a certain region: Kuban dialect corpus, Saratov dialect 
corpus, dialectical corpus of Ustya district of Arkhangelsk region. 
As well as, the correct identification of the direction of Uzbek dialect corpus is one of the 
basic parameters of dialectological research. The authorial corpus being created to date [4, 34] and 
the synonym database [1,15] can be considered as actions in the process of corporatizing a language 
that is in the process of development in linguistics. In the dialectological corpus, the anthropocentric 
survey is used to gradually expand the database so that full texts can be used. Examples of this are 
the texts in the electronic library of the Bulgarian, Scandinavian, Estonian, Portuguese, Scottish and 
Tomsk dialectological corpus [10, 15]. In the process of observing the Russian National Corpus, the 
dialectological corpus under the corpus: words, phrases, sentences, text sequences can be used in 
the search for dialectological texts [9, 34]. A. Sirazitdinov mentions about the corpus of Bashkir 
dialects, emphasizes the need to work on dialect dictionaries to replenish the database. [11]. In 
Uzbek grammar, prof. N. Rajabov says that professional dialects are mainly derived from the 


Philosophical Readings XIII.4 (2021), pp. 3267-3275. 3268 
Info@philosophicalreadings.org
10.5281/zenodo.5815371 
Qarluq group, which is an urban dialect, while dialects for livestock, grain, cotton, and agriculture 
are derived from the Kipchak group. [6]. 
It should be noted here that, if we look at linguistics in a broader context, we can observe a 
homogeneity in the thematic division of dialectal texts (taking into account the process of gradual 
growth in harmony with the place of people in society). However, on a global scale, concepts such 
as housing, household, politics, social life, and religion, which are inextricably linked to the role of 
the ethnic population in social life, do not occupy the same place in the worldview of Dialect 
representatives. [As an example, we can take the lifestyle of the population living only in the Tomsk 
region and in the territory of Uzbekistan]. The high level of this or that concept depends on the 
lifestyle and occupation of the local people [15]. The dialectal units of all dialects can be listed as 
common themes: 1) personal life (concepts related to man); 2) home; 3) pets; 4) ceremonies; 5) 
belief; 6) policy; 7) production (manual labor); 8) such as social life. From the macro themes 
mentioned above, “human privacy”: parents, children, relatives, marriage and family life, villagers; 
"household": garden, construction, tools; "faith": one can observe a state of division into micro-
themes, such as religious ceremonies, celebrations, and rituals [10]. Prof. S. Rakhimov notes that 
the lexical units reflecting the way of life of the population living in Surkhandarya district are based 
on the scheme of A. Shermatov and A. Ishayev: a) relatives; b) terms denoting a human organ; d) 
household items; e) food terms; f) construction terms; d) clothing terms; h) terms related to 
agriculture; terms related to animal husbandry; i) ethnographisms, etc. In the creation of the Dialect 
corpus, the description of events should also take into account the time interval that indicates the 
possession of a specific thematic characteristic (for example, the 30s, 50s, 60s, etc.). Research 
analysis shows that dialectal units written in the language of the older generation reflect the 
traditional culture of the farmers. The harmony of the daily life of ordinary peasants with their life 
experiences is the basis for the nominative nature of the concepts. 
The Tomsk dialectical corpus consists of 16 macro-themes: "Work", "Life", "Nature", 
"Kindness", "Event", etc., as well as 64 micro-themes. For example, on the topics "Work", "Land 
cultivation", "Harvesting of fruits", "Growing", "Logging", "Hunting", "Pine cone (harvesting of 
spruce nuts)", "Collecting medicinal. herbs ". There are micro-themes such as "Housework for 
women", "Housework for men" and "Fishing", and in the same dialectal corpus the topics move 
from one to another. Family, villagers, love ties, strangers (Buryats, Tatars, Khakass, townspeople, 
etc.); tools of labor, work, occupation, craft, garden; rural history, housing, collective farm, food 
preparation, flax, potatoes, livestock, beekeeping, cedar production, hunting, fishing, (timber 
production) industry, construction, strawberries, mushrooms, wire and flowers, food, disease and 
healing, clothing, utensils, intoxication, magic and faith, temple, prophecies, fortune-telling, rituals, 
wedding, mourning, baptism, holidays, weather, entertainment, politics, nature, events, product of 
modernity, human dignity, etc. [2] 
Based on the thematic division of the above language corpus (TDC), given that not all themes 
reflected in them may reflect the lifestyle of the ethnic people (cedar production, fishing, 
strawberries, mushrooms, drunkenness, temple, baptism, the paint of the present), interface 
reflecting the national mentality in the corpus of Uzbek national dialects. 


Philosophical Readings XIII.4 (2021), pp. 3267-3275. 3269 
Info@philosophicalreadings.org
10.5281/zenodo.5815371 
It should be noted that the work on this building is the result of the work not one person, but 
several groups, and the project is underway. Below we present the author's "Online Database 
Interface", which acts as a base for the dialect corpus. 
Boysun dialect 

Download 0.69 Mb.

Do'stlaringiz bilan baham:
  1   2   3   4   5




Ma'lumotlar bazasi mualliflik huquqi bilan himoyalangan ©fayllar.org 2024
ma'muriyatiga murojaat qiling