“Diacronia” bibliometric database (BDD)

ROMTEXT – a Fundamental Instrument for the New Edition of the Dictionary of the Romanian Language

Publication: Revue roumaine de linguistique, LXIII (4), p. 409-414
Publisher:Editura Academiei
Abstract:The article is a short presentation of the ROMTEXT project, a dated and annotated corpus of selected texts from the bibliography of the Dictionary of the Romanian Language, from the 16th − 21st centuries. The project aims at supporting the new digital edition of the thesaurus-dictionary, developed by the Lexicology and Lexicography Department of the “Iorgu Iordan − Al. Rosetti” Institute of Linguistics, Romanian Academy (2017–2019). ROMTEXT shall include over 500 literary and non- literary texts, obtained by optical recognition of the best editions, with assisted corrections. Subsequently, these texts shall be annotated from a morphological, syntactic and semantic point of view, by a team of lexicographers, with computer assistance. ROMTEXT shall have two concordance searching interfaces: one for lexicographers and one for the public. Results limitation and selection methods are also provided based on the text metadata. Due to its design and results, ROMTEXT shall be one of the most modern and versatile corpus linguistics available in Romanian.
Key words:corpus linguistics, Dictionary of the Romanian Language; lemmatization; annotated corpus; reference corpus, Romanian language
Language: English

Citations to this publication: 1

References in this publication: 2

The citations/references list is based on indexed publications only, and may therefore be incomplete.
For any and all inquiries related to the database, please contact us at [Please enable javascript to view.].