FORMAL-FUNCTIONAL MODELS OF THE UZBEK ELECTRON CORPUS

Nilufar Abdurakhmonova

Abstract


The paper is devoted to the structure and its linguistic annotation for building Uzbek Corpus. Linguistic annotation, metadata and corpus manager as formal-functional model of the corpus are important for usage for many purposes. The fact that the platform allows users to address language and literature issues, use it online. The Uzbek corpus based on structural and sub corpus models, which partially represented in this paper, is going on process to develop Uzbek language technology.

Keywords: Uzbek corpus, morphoanalyzer, metadata, parallel corpora, text analysis, corpus manager.


Full Text:

PDF

References


Sulevmanov, D., Gatiatullin, A., Prokopyev, N., Abdurakhmonova, N. (2020) Turkic morpheme web portal as a platform for turkology research International Conference on Information Science and Communications Technologies, ICISCT 2020, 2020, 9351500.

Khusainov, A., Suleymanov, D., Gilmullin, R., Minsafina, A., Kubedinova, L., Abdurakhmonova. N. (2020) First Results of the “TurkLang-7” Project: Creating Russian-Turkic Parallel Corpora and MT Systems CMLS 2020 CEUR Workshop Proceedings, 2020, pp. 90-101.

Khusainov, A., Suleymanov, D., Gilmullin, R., Gatiatullin, A. (2018) Building the Tatar-Russian NMT system based on re-translation of multilingual data Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 11107 LNAI, pp. 163–170.

Абдураҳмонова Н. (2020) Замонавий корпусларнинг компьютер моделлари // Ўзбекистонда хорижий тиллар. -2020. - № 1(30). - Б. 50-58. https://doi.org/ 10.36078/

Мухамедшин, Д.Р., Сулейманов Д.Ш. (2018) Система корпус-менеджер: архитектура и модели корпусных данных Программные продукты и системы / Software & Systems 4 (31) – C. 6.

В. П. Захаров, И. В. Азарова, О. А. Митрофанова, А. М. Попов, М. В. Хохлова (2019) Моделирование в корпусной лингвистике Специализированные корпусы русского языка, Санкт-Петербургский государственный университет. -C. 19.

Erhard Hinrichs, Marie Hinrichs, Thomas Zastrow, Gerhard Heyer, Volker Boehlke, Uwe Quasthoff, Helmut Schmid, Ulrich Heid, Fabienne Fritzinger, Alexander Siebert, and Jorg Didakowski. (2009) Weblicht: Web-based LRT services for German. In Workshop on linguistic processing pipelines, GSCL Jahrestagung, Potsdam.

Аброскин А. А. Поиск по корпусу: проблемы и методы их решения // Национальный корпус русского языка: 2006-2008. Новые результаты и перспективы. СПб.: Нестор-История, 2009, 277-282.

https://uz.wikipedia.org/wiki/O%CA%BBzbek_tili

Jinyi Zhang, Tadahiro Matsumoto (2019) Corpus Augmentation for Neural Machine Translation with ChineseJapanese Parallel Corpora / Applied sciences (9), 2036.




Copyright (c) 2021 Author(s)

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

© 2012-2022 ANGLISTICUM. Journal of the Association-Institute for English Language and American Studies,Tetovo, North Macedonia.

ISSN (print): 1857-8179. ISSN (online): 1857-8187.

Disclaimer: Articles on Anglisticum have been reviewed and authenticated by the Authors before sending for the publication.

The Journal, Editors and the editorial board are not entitled or liable to either justify or responsible for inaccurate and misleading data if any. It is the sole responsibility of the Author concerned.