Some notes on the thematic meta-metric of the national corpus of Uzbek dialects

bet	1/5
Sana	25.10.2023
Hajmi	0.69 Mb.
	#1721406

1 2 3 4 5

Bog'liq
Some notes on the thematic meta-metric of the national corpus of Uzbek dialects

Kholova Muyassar Abdulhakimovna

Philosophical Readings XIII.4 (2021), pp. 3267-3275. 3267
Info@philosophicalreadings.org
10.5281/zenodo.5815371
Some notes on the thematic meta-metric of the national corpus of Uzbek dialects
(On the example of dialects using the sound "j" in Boysun district)
Kholova Muyassar Abdulhakimovna,
Basic doctoral student of the Department of Uzbek Linguistics, Termez State University
+99897 530 50 21. xolovam@tersu.uz
Bakhtiyor Rajabovich Mengliyev
Doctor of Philology, Professor, Tashkent State University of the Uzbek Language and Literature
Abstract: The article deals with the role of dialectal texts in the world linguistic corpus, the Uzbek
national corpus, the creation of a system for accurate, easy and convenient search for necessary
information, meta-measurement of small parts of the dialect and large texts: social, economic and
political disunity and the degree to which it is convenient to draw conclusions when studying
thematic tags of texts in Uzbek dialects.
Keywords: Uzbek National Corpus, dialectal text, thematic meta-metric, passport meta-metric,
thematic: social, economic and political divisibility.

It is known that the first linguistic corpuses in science began to appear in the 60s of the last
century. In 1963, at Brown University in the United States, the first large Brown Corpus was
created, followed by other languages and the American version of 500 prose texts in 50 genres and
2,000 words in English was developed by the co-authors U. Francis and H. Kucher.
It is known that Austria, China (Mandarin dialects corpus [21]), Germany [22], Portugal [23],
Czech Republic [24], Scandinavia [25], Poland, Lithuania, Georgia ( for example: Cor-Dial-Sin;
Helsinki corpus of English dialects [26]; Norwegian dialect corpus [27]; Archive fur gesprochenes
Deutsch, Die bayerische Dialektadebank; LEXDIALGRAM, etc.), Moscow, Slavic-Kuban,
Vologda, Saratov and Kazan Such dialectal corpus as in the central cities of Russia (Dialectic
podkorpus v sostave Natsionalnogo korpusa russkogo yazika, Saratovskiy dialektologicheskiy
korpus, e-library of Russian dialects [28], electronic corpus of Kuban dialect culture [8]). In the
creation of the national corpus in Russian linguistics: the structure of the Russian national corpus
("Natsionalniy korpus russkogo yazika"), the work done in St. Petersburg, Saratov institutes and
Tomsk [29], the corpus and bases such as the Boysun dialect base [30] evidence.
The structure of dialectal corpus can be divided into 2: 1. Dialectal corpus based on the formal
aspect of other language corpus, taking into account the features of their dialects: Portuguese corpus
- The syntax-oriented Corpus of Portuguese dialects, Polish corpus - Dialesty I gwary Polish
Compendium internetowe, Spanish corpus - Corpus Oral y Sonoro del Español, Bulgarian corpus -
Bulgarian dialestology as Living, Georgian corpus (2 054 705 words, 3400 texts), Mandarin corpus
in Chinese (this dialect is in Chinese) manifests itself in diversity and variability), and so on. 2.
Dialectal corpus based on the dialect of a certain region: Kuban dialect corpus, Saratov dialect
corpus, dialectical corpus of Ustya district of Arkhangelsk region.
As well as, the correct identification of the direction of Uzbek dialect corpus is one of the
basic parameters of dialectological research. The authorial corpus being created to date [4, 34] and
the synonym database [1,15] can be considered as actions in the process of corporatizing a language
that is in the process of development in linguistics. In the dialectological corpus, the anthropocentric
survey is used to gradually expand the database so that full texts can be used. Examples of this are
the texts in the electronic library of the Bulgarian, Scandinavian, Estonian, Portuguese, Scottish and
Tomsk dialectological corpus [10, 15]. In the process of observing the Russian National Corpus, the
dialectological corpus under the corpus: words, phrases, sentences, text sequences can be used in
the search for dialectological texts [9, 34]. A. Sirazitdinov mentions about the corpus of Bashkir
dialects, emphasizes the need to work on dialect dictionaries to replenish the database. [11]. In
Uzbek grammar, prof. N. Rajabov says that professional dialects are mainly derived from the

Philosophical Readings XIII.4 (2021), pp. 3267-3275. 3268
Info@philosophicalreadings.org
10.5281/zenodo.5815371
Qarluq group, which is an urban dialect, while dialects for livestock, grain, cotton, and agriculture
are derived from the Kipchak group. [6].
It should be noted here that, if we look at linguistics in a broader context, we can observe a
homogeneity in the thematic division of dialectal texts (taking into account the process of gradual
growth in harmony with the place of people in society). However, on a global scale, concepts such
as housing, household, politics, social life, and religion, which are inextricably linked to the role of
the ethnic population in social life, do not occupy the same place in the worldview of Dialect
representatives. [As an example, we can take the lifestyle of the population living only in the Tomsk
region and in the territory of Uzbekistan]. The high level of this or that concept depends on the
lifestyle and occupation of the local people [15]. The dialectal units of all dialects can be listed as
common themes: 1) personal life (concepts related to man); 2) home; 3) pets; 4) ceremonies; 5)
belief; 6) policy; 7) production (manual labor); 8) such as social life. From the macro themes
mentioned above, “human privacy”: parents, children, relatives, marriage and family life, villagers;
"household": garden, construction, tools; "faith": one can observe a state of division into micro-
themes, such as religious ceremonies, celebrations, and rituals [10]. Prof. S. Rakhimov notes that
the lexical units reflecting the way of life of the population living in Surkhandarya district are based
on the scheme of A. Shermatov and A. Ishayev: a) relatives; b) terms denoting a human organ; d)
household items; e) food terms; f) construction terms; d) clothing terms; h) terms related to
agriculture; terms related to animal husbandry; i) ethnographisms, etc. In the creation of the Dialect
corpus, the description of events should also take into account the time interval that indicates the
possession of a specific thematic characteristic (for example, the 30s, 50s, 60s, etc.). Research
analysis shows that dialectal units written in the language of the older generation reflect the
traditional culture of the farmers. The harmony of the daily life of ordinary peasants with their life
experiences is the basis for the nominative nature of the concepts.
The Tomsk dialectical corpus consists of 16 macro-themes: "Work", "Life", "Nature",
"Kindness", "Event", etc., as well as 64 micro-themes. For example, on the topics "Work", "Land
cultivation", "Harvesting of fruits", "Growing", "Logging", "Hunting", "Pine cone (harvesting of
spruce nuts)", "Collecting medicinal. herbs ". There are micro-themes such as "Housework for
women", "Housework for men" and "Fishing", and in the same dialectal corpus the topics move
from one to another. Family, villagers, love ties, strangers (Buryats, Tatars, Khakass, townspeople,
etc.); tools of labor, work, occupation, craft, garden; rural history, housing, collective farm, food
preparation, flax, potatoes, livestock, beekeeping, cedar production, hunting, fishing, (timber
production) industry, construction, strawberries, mushrooms, wire and flowers, food, disease and
healing, clothing, utensils, intoxication, magic and faith, temple, prophecies, fortune-telling, rituals,
wedding, mourning, baptism, holidays, weather, entertainment, politics, nature, events, product of
modernity, human dignity, etc. [2]
Based on the thematic division of the above language corpus (TDC), given that not all themes
reflected in them may reflect the lifestyle of the ethnic people (cedar production, fishing,
strawberries, mushrooms, drunkenness, temple, baptism, the paint of the present), interface
reflecting the national mentality in the corpus of Uzbek national dialects.

Philosophical Readings XIII.4 (2021), pp. 3267-3275. 3269
Info@philosophicalreadings.org
10.5281/zenodo.5815371
It should be noted that the work on this building is the result of the work not one person, but
several groups, and the project is underway. Below we present the author's "Online Database
Interface", which acts as a base for the dialect corpus.
Boysun dialect

Download 0.69 Mb.

Do'stlaringiz bilan baham:

1 2 3 4 5