Preview

NSU Vestnik. Series: Linguistics and Intercultural Communication

Advanced search

USE OF UNIVERSAL DEPENDENCIES FOR GRAMMATICAL ANALYSIS OF THE MULTILINGUAL TEXT (ON THE EXAMPLE OF PREDICATIVE)

https://doi.org/10.25205/1818-7935-2018-16-2-19-33

Abstract

The paper combines the initiative of universal dependencies (UD), with the aim to promote developing cross-linguistically consistent annotation scheme of grammatical analysis. The purpose of this initiative is to simplify cross-language research methods, develop a unified interlanguage linguistic typology, and thus lay the foundations for the use of automated multilingual systems and a universal cross-language text parser. The paper points out the main problems of grammatical analysis pertinent to a multilingual text, advantages of using common language features, and purposes of the universal dependencies project. The major principles of universal dependencies are discussed based on the example of morphology and syntax features of Russian, English and German. In the second part we illustrate how to conduct a corpus research on predicatives using UD. The article proposes a technique of automatic identification of predicatives, examines their frequency distribution in the Russian UD corpus, and the semantic categorization of the most often used predicatives.

About the Author

E. V. Lyukina
National Research University Higher School of Economics
Russian Federation


References

1. Дяченко П. В., Иомдин Л. Л., Лазурский А. В., Митюшин Л. Г., Подлесская О. Ю., Сизов В. Г., Фролова Т. И., Цинман Л. Л. Современное состояние глубоко аннотированного корпуса текстов русского языка (СинТагРус) // Национальный корпус русского языка: 10 лет проекту. Труды Института русского языка им. В. В. Виноградова. М., 2015. Вып. 6. С. 272-299.

2. Летучий А. Б. Предикатив. Материалы для проекта корпусного описания русской грамматики (http://rusgram.ru). На правах рукописи. М., 2017.

3. Русская грамматика / Под ред. Н. Ю. Шведовой. М.: Наука, 1980. Т. 1: Фонетика. Фонология. Ударение. Интонация. Словообразование. Морфология; Т. 2: Синтаксис.

4. Шелякин М. А. Функциональная грамматика русского языка. М.: Русский язык, 2001.

5. Щерба Л. В. Языковая система и речевая деятельность. М.: Едиториал УРСС, 2004.

6. Duden. Mannheim; Leipzig; Wien; Zurich, 1995. Bd. 4: Grammatik der deutschen Gegenwartssprache.

7. Marneffe M. et al. Universal Stanford Dependencies: A cross-linguistic typology, 2006.

8. McDonald R., Nivre J., Quirmbach-Brundage Y., Goldberg Y., Das D., Ganchev K., Hall K., Petrov S., Zhang H., Täckström O., Bedini C., Bertomeu Castelló N., Lee J. Universal dependency annotation for multilingual parsing // ACL, 2013.

9. Nivre J., Marneffe M., Ginter F., Goldberg Y., Hajic J., Manning C., McDonald R., Petrov S., Pyysalo S., Silveira N., Tsarfaty R., Zeman D. Universal Dependencies v1: A Multilingual Treebank Collection // Proceedings of the Tenth International Conference on Language Resources and Evaluation. European Language Resources Association (ELRA). Paris, France, 2016.

10. Nivre J. Towards a Universal Grammar for Natural Language Processing // CICLing 2015: Proceedings of Computational Linguistics and Intelligent Text Processing. Cairo, Egypt, 2015. Vol. 9041. Р. 3-16.

11. Petrov S., Das D., McDonald R. A universal part-of-speech tagset // LREC, 2012.

12. Straka M., Hajič J., Straková J. UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing // Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). Portorož, Slovenia, 2016.

13. Zeman D., Mareček D., Popel M., Ramasamy L., Štěpánek J., Žabokrtský Z., Hajič J. HamleDT: To parse or not to parse? // LREC, 2012. Р. 2735-2741.

14. Zeman D. Reusable tagset conversion using tagset drivers // LREC, 2008. Р. 213-218.


Review

For citations:


Lyukina E.V. USE OF UNIVERSAL DEPENDENCIES FOR GRAMMATICAL ANALYSIS OF THE MULTILINGUAL TEXT (ON THE EXAMPLE OF PREDICATIVE). NSU Vestnik. Series: Linguistics and Intercultural Communication. 2018;16(2):19-33. (In Russ.) https://doi.org/10.25205/1818-7935-2018-16-2-19-33

Views: 175


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 1818-7935 (Print)