Statistical Analysis of Indivisible Word-Combinations: on Material of Ukrainian National Linguistic Corpus.
DOI:
https://doi.org/10.31558/1815-3070.2018.36.24
Keywords:
association measure, phraseological units, mutual information, indivisible word-combination, statistics, the Ukrainian language
Abstract
Frequency data for word-combinations were obtained from the Ukrainian National Linguistic Corpus, the mutual information (MI) association measure was computed for multicomponent units, the results were analyzed, and a correlation between the MI value and the type of indivisible word-combination was established.
For the analyzed indivisible word-combinations, the MI association measure ranges from 8.64 (Ukr. мав працювати, "was to work") to 44.63 (Ukr. майстер спорту міжнародного класу, "Master of Sports of International Class"), which means that the combination of components in all these units is non-random. Within a single text corpus, the MI value depends on such factors as the absolute frequency of the construction, the absolute frequencies of its components, the number of components, and the type of indivisible word-combination.
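The abstract does not state the formula used for multicomponent units. As a minimal sketch, assuming the common n-gram generalization of pointwise mutual information, MI = log2(f(w1…wn) · N^(n−1) / (f(w1) · … · f(wn))), the dependence on construction frequency, component frequencies, and number of components can be illustrated as follows. The function name and all frequency counts here are hypothetical and are not taken from the Ukrainian National Linguistic Corpus.

```python
import math
from typing import Sequence


def mutual_information(ngram_freq: int,
                       component_freqs: Sequence[int],
                       corpus_size: int) -> float:
    """MI of an n-component unit under the assumed n-gram generalization.

    Equivalent to log2(P(w1..wn) / (P(w1) * ... * P(wn))), with each
    probability estimated as an absolute frequency divided by corpus size.
    """
    if ngram_freq <= 0 or corpus_size <= 0:
        raise ValueError("frequencies and corpus size must be positive")
    if len(component_freqs) < 2 or any(f <= 0 for f in component_freqs):
        raise ValueError("need at least two components with positive frequency")

    # Work in log space so that N^(n-1) never has to be built explicitly.
    log_n = math.log2(corpus_size)
    log_joint = math.log2(ngram_freq) - log_n
    log_independent = sum(math.log2(f) - log_n for f in component_freqs)
    return log_joint - log_independent


# Hypothetical counts, chosen only to show the trend.
N = 1024
two_word = mutual_information(8, [16, 32], N)
three_word = mutual_information(8, [16, 32, 32], N)
print(two_word, three_word)
```

Under this assumed formula, each additional component contributes a further factor of N divided by that component's frequency, so longer units tend to receive higher values. That is consistent with the four-component unit having the highest MI reported in the abstract.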