Using the tools of corpus linguistics, this study identifies the basic means of knowledge construction in research articles in economics. The results suggest that discourse-signals realize conditional prediction and empirical hypothesis within the macro-speech acts of hypothesis, analysis/interpretation
military slang; (B) a high proportion of abbreviations; (C) frequent linguistic devices expressing the mutuality and collectiveness of the soldiers’ enterprise. The texts were subjected to keyword and collocation analyses, which identified several of their stylistic features (such as the use of English-based expressions, protocol
This article addresses the criteria by which multi-word terms should be selected and verified for iSybislaw, the bibliographic database of Slavic linguistics publications. The text is based on Russian and Polish linguistic material. Such criteria as actuality, frequency, transparency of form are
The paper presents the results of a contrastive study of the Polish and German discourse on the Centre Against Expulsions and Erika Steinbach at the level of lexis. The quantitative and qualitative analysis of keywords using the DIMEAN method, supplemented by the analysis of
The basic idea is to search for anchor words such as "abstract" or "keywords" followed by their equivalents in another language. Text fragments that follow anchor words are likely to supply new entries for bilingual lexica.
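The anchor-word idea described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the English/Polish anchor pair, the function names, and the assumption that both keyword lists are comma- or semicolon-separated and given in the same order are all hypothetical.

```python
import re
from typing import List, Optional, Tuple


def fragment_after(anchor: str, text: str) -> Optional[str]:
    """Return the text that follows `anchor` (and an optional colon) on its line."""
    pattern = rf"(?im)^\s*{re.escape(anchor)}\s*:?\s*(.+)$"
    match = re.search(pattern, text)
    return match.group(1).strip() if match else None


def split_items(fragment: str) -> List[str]:
    """Split a keyword fragment on commas or semicolons, dropping empty items."""
    items = (item.strip(" .") for item in re.split(r"[;,]", fragment))
    return [item for item in items if item]


def candidate_entries(text: str, anchor_pair: Tuple[str, str]) -> List[Tuple[str, str]]:
    """Pair items that follow an anchor word with those that follow its equivalent.

    Assumption (not from the source): items are aligned by position, so the
    pairing is only attempted when both lists have the same length.
    """
    source_fragment = fragment_after(anchor_pair[0], text)
    target_fragment = fragment_after(anchor_pair[1], text)
    if source_fragment is None or target_fragment is None:
        return []
    source_items = split_items(source_fragment)
    target_items = split_items(target_fragment)
    if len(source_items) != len(target_items):
        return []
    return list(zip(source_items, target_items))


# Hypothetical input: an article header with keyword lines in two languages.
sample = (
    "Keywords: corpus, keyword, collocation\n"
    "Słowa kluczowe: korpus, słowo kluczowe, kolokacja\n"
)
pairs = candidate_entries(sample, ("keywords", "słowa kluczowe"))
```

The pairs produced this way are only candidates; positional alignment can fail when the two lists differ in order or length, so a real lexicon-building pipeline would need manual or statistical verification.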
The article presents one of the problems connected with building the iSybislaw information retrieval system. The system presents a bibliographic database of world Slavic linguistics, available online at www.isybislaw.ispan.waw.pl. Keywords are an important element of this system, and one of the essential goals of the international team of Slavists cooperating with the Centrum...
This article discusses the automatic extraction of relevant words from sets of texts. The author briefly presents three methods for extracting words from a corpus according to their frequency, or words whose occurrence next to each other is not random. First, he focuses on the keyword analysis method
This paper proposes a strategy for summary sentence selection in query-focused multi-document summarization, based on extracting keywords from the relevant document set. It calculates a query-related feature and a topic-related feature for every word in the relevant document set, then obtains the importance of the word
disambiguate the term letter from an internal, bottom-up perspective. Such a perspective may offer "a key for disclosing historical forms of communication" (Hübler and Busse 2012: 1) as first-order phenomena. This is achieved in an analysis of categorial labels (keywords) and metacommunicative clues found in the internal
Keyword extraction is a long-standing topic in Natural Language Processing. However, most methods have been too complicated and slow to be applied in real applications, for example in web-based systems. This paper proposes an approach which will complete some preparatory work focusing on exploring the
Little attention has been paid so far to the keywords and lexical bundles typical of English as used in the pharmaceutical field. Conducted from a register perspective (Biber & Conrad, 2009), this exploratory and descriptive research is intended to fill the gap in corpus linguistics studies on phraseology and
in keyword assignment precision from 18 to 29 percent and in F-measure from 17.2 to 27.6 for 5 keywords assigned to a document. The further filtering out of the top 10 frequent items improves precision by 4 percent and collocation segmentation improves precision by 9 percent on the average, over 21 languages tested.
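The precision and F-measure figures above are computed per document over the assigned keywords. The following is a minimal sketch of such a computation, assuming exact-match comparison against a gold keyword set and the balanced F1 variant. The source does not state which matching rule or F-measure weighting was used, and the function name and sample data are hypothetical.

```python
from typing import Iterable, Tuple


def precision_recall_f1(assigned: Iterable[str], gold: Iterable[str]) -> Tuple[float, float, float]:
    """Score assigned keywords against gold keywords by exact match.

    Assumption: balanced F1 (harmonic mean of precision and recall).
    """
    assigned_set, gold_set = set(assigned), set(gold)
    hits = len(assigned_set & gold_set)
    precision = hits / len(assigned_set) if assigned_set else 0.0
    recall = hits / len(gold_set) if gold_set else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical document with 5 assigned keywords and 4 gold keywords.
assigned_keywords = ["corpus", "keyword", "collocation", "frequency", "lexicon"]
gold_keywords = ["corpus", "keyword", "segmentation", "precision"]
p, r, f = precision_recall_f1(assigned_keywords, gold_keywords)
```

Corpus-level figures such as those reported would then be obtained by averaging these per-document scores, again under the assumptions stated above.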
solutions for Indic scripts and languages such as Sanskrit has hampered information extraction from a large body of documents of cultural and historical importance. This chapter presents two relevant topics in this area. First, we describe the use of script-specific keyword spotting for Sanskrit documents that makes use of
This paper presents an automatic pronunciation transliteration method with acoustic and contextual analysis for a Chinese-English mixed-language keyword spotting (KWS) system. Often, we need to develop robust Chinese-English mixed-language spoken language technology without Chinese-accented English acoustic data. In
The summary/abstract and keywords supplied by the author in the original article are the basis of the bibliographic record, but they are not a sufficient source for the bibliographer's work. The bibliographer's task is above all to check whether the summary/abstract and keywords correspond to the content of the article itself. The next task is to find in the language of the bibliography (in our case this is...
The term keyword, created in 1954 by Pierre Guiraud, is also used in contemporary linguistics to describe words that, because of their frequency of occurrence, are characteristic of a particular text, the genre of a text or the writer’s style. The indication of keywords defined in this way in Protestant funeral
We present a comparison of four unsupervised algorithms to automatically acquire the set of keywords that best characterise a particular multimedia archive: the Belga News Archive. Such keywords provide the basis of a controlled vocabulary for indexing the pictures in this archive. Our comparison shows that the
This article considers the features of keyword synonymy in iSybislaw, the bibliographic database of world Slavic linguistics publications. The issue of keywords in an information retrieval system is examined in connection with synonymy in linguistic terminology. Relations are established between general
Financed by the National Centre for Research and Development under grant No. SP/I/1/77065/10 by the strategic scientific research and experimental development program:
SYNAT - “Interdisciplinary System for Interactive Scientific and Scientific-Technical Information”.