A fairly considerable lack of Polish-Ukrainian electronic language resources, especially large dictionaries and parallel corpora, is presently felt among the professional translators and those who learn either language. A morpho-syntactically annotated aligned corpus with a parallel concordancer can be used in many cases as a traditional dictionary. One of its possible advantages is being an up-to-date database of numerous examples of language use, which could be of use while dealing with neologisms and new lexical meanings. The authors aim is to create such a corpus and make it publicly available through the Internet. At the moment one can advise the prototype of the parallel Polish-Ukrainian corpus on-line from the website http://corpus.domeczek.pl. It contains ca. 50 parallel bitexts compliant with the XCES standard. Not all sentence segments have been tagged yet, the work on morphological annotation is on-going as well. The corpus is aligned presently only at the paragraph level. The authors are convinced that such a parallel corpus with an efficient search engine, a flexible query language and a friendly user interface will be one of the most powerful tools for all Polish and Ukrainian researchers, translators and students.
Financed by the National Centre for Research and Development under grant No. SP/I/1/77065/10 by the strategic scientific research and experimental development program:
SYNAT - “Interdisciplinary System for Interactive Scientific and Scientific-Technical Information”.