
NSU Vestnik. Series: Linguistics and Intercultural Communication


Structural Models of Terminological Word Combinations for Marking up a Corpus of Scientific and Technical Texts

https://doi.org/10.25205/1818-7935-2021-19-3-45-56

Abstract

The article presents structural models of terminological phrases from the subject area “Welding” as a basis for creating automated tools to mark up a corpus of scientific and technical texts. It outlines the place of scientific and technical corpora in corpus linguistics and the prospects for further research on them. The research is relevant because corpora of scientific and technical texts need to be created in general, and tools for the automatic detection of terms need to be provided in particular. The authors argue that the main problem in designing such corpora is the automatic markup of terminological phrases. The current state of the term system of the subject area “Welding” is analysed. The results of the analysis of two-, three-, four- and five-component terminological phrases from “Welding” and their structural models are presented and illustrated with examples. The need to list all possible structural models of terminological combinations is also substantiated. It is established that a new component is most often added to a basic terminological combination by introducing one more postpositional attribute, whose function is to add a specific feature to the basic meaning. The novelty of the study lies in a theoretical approach to building a database of structural models of terminological phrases, which may serve as the core of a supersource database on the structure of multicomponent scientific and technical terms. An approach to the automatic markup of multicomponent terms is also proposed. It should also be helpful in future corpus research for identifying candidate word combinations as scientific and technical terms.
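The idea of using structural models for markup can be illustrated with a minimal sketch. It is not the authors' implementation: the part-of-speech patterns in `MODELS`, the function name `mark_up_terms`, the tag names, and the English example tokens are all assumptions made for illustration, and the input is assumed to be already tagged by some external tool. The sketch treats each structural model as a sequence of tags and marks the longest matching span at each position as a candidate term.

```python
from typing import List, Sequence, Tuple

Token = Tuple[str, str]  # (word form, part-of-speech tag), tagged elsewhere

# Hypothetical structural models, written as tag sequences.
# These are illustrative only and are not taken from the article.
MODELS: List[Tuple[str, ...]] = [
    ("ADJ", "NOUN"),
    ("NOUN", "NOUN"),
    ("ADJ", "NOUN", "NOUN"),
    ("NOUN", "NOUN", "NOUN"),
]


def mark_up_terms(
    tokens: Sequence[Token],
    models: Sequence[Tuple[str, ...]] = MODELS,
) -> List[Tuple[int, int, Tuple[str, ...]]]:
    """Return (start, end, model) spans of candidate terms.

    At each position the longest matching model wins, and matched
    spans do not overlap. `end` is exclusive.
    """
    by_length = sorted(models, key=len, reverse=True)
    spans = []
    i = 0
    while i < len(tokens):
        for model in by_length:
            window = tuple(tag for _, tag in tokens[i:i + len(model)])
            if window == model:
                spans.append((i, i + len(model), model))
                i += len(model)  # skip past the matched candidate
                break
        else:
            i += 1  # no model starts here; move on
    return spans


# Hand-tagged example sentence (an assumption, not corpus data).
sentence: List[Token] = [
    ("the", "DET"), ("manual", "ADJ"), ("arc", "NOUN"),
    ("welding", "NOUN"), ("requires", "VERB"),
    ("filler", "NOUN"), ("metal", "NOUN"),
]

for start, end, model in mark_up_terms(sentence):
    phrase = " ".join(word for word, _ in sentence[start:end])
    print(phrase, model)
```

A matcher of this kind only proposes candidates. Whether a matched word combination is really a term would still need to be checked, for instance against a term list or by a terminologist.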

About the Authors

I. I. Butenko
Peoples' Friendship University of Russia; Bauman Moscow State Technical University
Russian Federation


N. S. Nikolaeva
Bauman Moscow State Technical University
Russian Federation


T. D. Margaryan
Bauman Moscow State Technical University
Russian Federation





For citations:


Butenko I.I., Nikolaeva N.S., Margaryan T.D. Structural Models of Terminological Word Combinations for Marking up a Corpus of Scientific and Technical Texts. NSU Vestnik. Series: Linguistics and Intercultural Communication. 2021;19(3):45-56. (In Russ.) https://doi.org/10.25205/1818-7935-2021-19-3-45-56



This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 1818-7935 (Print)