Search results for: Jaroslava Hlaváčová

Items from 1 to 7 out of 7 results

article

Distribuce předpon v českém sylabotónickém trocheji

Petr Plecháč, Jaroslava Hlaváčová, Kristýna Merthová, Robert Kolár

Slovo a slovesnost: časopis pro otázky teorie a kultury jazyka (Slovo... > 2017 > 78 > 4 > 322-332

The article deals with the use of prefixes in the Czech accentual syllabic trochee. We test a hypothesis raised by Miroslav Červenka, Květa Sgallová, and Petr Kaiser which states that some authors in the 19th century used prefixes to moderate rhythmical irregularities. In our analysis – based on automatic prefix recognition in a large body of poetic texts from the Corpus of Czech Verse – we observe...

chapter

Dispersion of Words in a Language Corpus

Jaroslava Hlaváčová, Pavel Rychlý

Lecture Notes in Computer Science > Text, Speech and Dialogue > Posters > 321-324

This paper proposes new measures for dealing with word dispersion in a language corpus - reduced frequency and rarity. Their calculation is described and some results from the Czech National Corpus (CNC) presented. Some previous approaches are briefly mentioned.

chapter

Morphological Guesser of Czech Words

Jaroslava Hlaváčová

Lecture Notes in Computer Science > Text, Speech and Dialogue > Text > 70-75

If a corpus is submitted to a morphological analysis, there always remain some words that the analyser could not recognize (foreign names, misspellings,...). However, if a human reads the texts, he usually understands them, even if he does not knowas manywords as there are in the lexicon used by the morphological analyser. The language itself helps him to recognize unknown words. It is not only semantics...

chapter

Affisix: Tool for Prefix Recognition

Jaroslava Hlaváčová, Michal Hrušecký

Lecture Notes in Computer Science > Text, Speech and Dialogue > Text > 85-92

In the paper, we present a software tool Affisix for automatic recognition of prefixes. On the basis of an extensive list of words in a language, it determines the segments – candidates for prefixes. There are two methods implemented for the recognition – the entropy method and the squares method. We briefly describe the methods, propose their improvements and present the results of experiments with...

chapter

Prefix Recognition Experiments

Jaroslava Hlaváčová, Michal Hrušecký

Lecture Notes in Computer Science > Text, Speech and Dialogue > Conference Papers > 235-242

The paper deals with automatic methods for prefix extraction and their comparison. We present experiments with Czech and English and compare the results with regard to the size and type (wordforms vs. lemmas) of input data.

chapter

Variants and Homographs

Jaroslava Hlaváčová, Markéta Lopatková

Lecture Notes in Computer Science > Text, Speech and Dialogue > Text > 93-100

We discuss two types of asymmetry between wordforms and their(morphological) characteristics, namely (morphological) variants and homographs. We introduce a concept of multiple lemma that allows for unique identification of wordform variants as well as ‘morphologically-based’ identification of homographic lexemes. The deeper insight into these concepts allows further refining of morphological dictionaries...

article

Adaptation of machine translation for multilingual information retrieval in the medical domain

Pavel Pecina, Ondřej Dušek, Lorraine Goeuriot, Jan Hajič, more

Artificial Intelligence In Medicine > 2014 > 61 > 3 > 165-185

We investigate machine translation (MT) of user search queries in the context of cross-lingual information retrieval (IR) in the medical domain. The main focus is on techniques to adapt MT to increase translation quality; however, we also explore MT adaptation to improve effectiveness of cross-lingual IR.Our MT system is Moses, a state-of-the-art phrase-based statistical machine translation system...

Filter options

Publication date

Set your own date range

INFONA - science communication portal

Search results for: Jaroslava Hlaváčová