Serwis Infona wykorzystuje pliki cookies (ciasteczka). Są to wartości tekstowe, zapamiętywane przez przeglądarkę na urządzeniu użytkownika. Nasz serwis ma dostęp do tych wartości oraz wykorzystuje je do zapamiętania danych dotyczących użytkownika, takich jak np. ustawienia (typu widok ekranu, wybór języka interfejsu), zapamiętanie zalogowania. Korzystanie z serwisu Infona oznacza zgodę na zapis informacji i ich wykorzystanie dla celów korzytania z serwisu. Więcej informacji można znaleźć w Polityce prywatności oraz Regulaminie serwisu. Zamknięcie tego okienka potwierdza zapoznanie się z informacją o plikach cookies, akceptację polityki prywatności i regulaminu oraz sposobu wykorzystywania plików cookies w serwisie. Możesz zmienić ustawienia obsługi cookies w swojej przeglądarce.
In this study, shallow parsing is applied on Turkish sentences. These sentences are used to train and test the per-formances of various learning algorithms with various features specified for shallow parsing in Turkish.
Machine Translation (MT) is very useful in supporting multicultural communication. Existing Statistical Machine Translation (SMT) which requires high quality and quantity of corpora and Rule-Based Machine Translation (RBMT) which requires bilingual dictionaries, morphological, syntax, and semantic analyzer are scarce for low-resource languages. Due to the lack of language resources, it is difficult...
The ultimate aim of this research is to develop a Rule Based Machine Translation System (RBMT) using sentence simplification. The sentence pattern for English is SVO and Tamil is SOV. Complex and larger sentence are not easy to parse and translate. So, the sentence simplifier is also accommodated in the rule based system to split a large sentence into simple multiple sentences. Machine translation...
Machine translation (MT) between natural languages is an infamously difficult problem in Natural Language Processing that is still very much being researched. This research study explores the efficacy of developing an adaptive translator using Lexical Functional Grammars. The main research objective is building a machine translator generator for multilingual communication, i.e. developing a system...
Speech discourse comprehension is crucial for developing intelligent speech processing technologies. The present research aims to establish a multi-layered annotation scheme for Chinese discourse that contains inter-related information of phonetics, phonology, syntax, semantics and pragmatics. This research provides a theoretical foundation and analytical support for discourse comprehension by examining...
This paper presents methods for automatic generation of phonetic databases (The Morphological and Phonetic Dictionary, The Phonetic Dictionary of Syllables, The Rhyming Dictionary) for a natural language, starting from a set of linguistic knowledge bases. The knowledge bases are developed by means of the GRAALAN (Grammar Abstract Language) system. The exemplification of this process will be described...
The task of tagging and allotting the correct Part of Speech (POS) to text given its context is not obvious and requires expertise and use of considerable resources. Automating such task and building tools that can carry such job is crucial and imperative to advance in major areas of natural language processing. A limited numbers of Part of Speech Taggers exist currently for Arabic and their availability...
In statistical machine translation, many translation errors may easily occur especially when the word orders are very different between source language and target language, especially with asymmetric morphological structures. The paper investigates combining a rule-based reordering model with conventional dependency parsing at the source side, which can alleviate both the asymmetry of morphological...
This paper focuses on integrating linguistically motivated and statistically derived information into language modeling. We use discriminative language models (DLMs) as a complementary approach to the conventional -gram language models to benefit from discriminatively trained parameter estimates for overlapping features. In our DLM approach, relevant information is encoded as features. Feature weights...
Transfer Grammar is an integral component of a Rule based Machine Translation system. In this paper, we describe a subset of the transfer grammar developed for Tamil to Hindi Machine Translation system, i.e., the transfer of nominal constructions from Tamil to Hindi. Nominal constructions in Tamil, which is an agglutinative language, take multiple suffixes which may be case markers or other suffixes...
This paper presents an empirical study of grammatical distribution of Chinese negation markers based on available corpus data. Most previous studies adopted theoretical approaches and focused on the roles of two dominant negators, bu and mei(you) in Chinese negation, paying very little attention to other complementary negators such as bie and fei . To achieve a comprehensive view on Chinese negation,...
In this paper, we discuss an HPSG grammar of Persian (PerGram) that is implemented in the TRALE system. We describe some of the phenomena which are currently covered. While working on the grammar, we developed a test suite with positive and negative examples from the linguistic literature. To be able to test the coverage of the grammar with respect to naturally occurring sentences, we use a subcorpus...
The Quran is a significant religious text, followed by the 1.5 billion believers of the Islamic faith worldwide. The text dates to 610-632 CE and is written in Quranic Arabic, the direct ancestor language of modern standard Arabic in use today. This paper presents the Quranic Arabic Dependency Treebank (QADT) and reports on the approaches and solutions used to apply Natural Language Processing to...
Collocation is such a language phenomenon that a sequence of words or terms which co-occur more often than would be expected by chance. It is different from frozen idioms or free word combinations in a continuum ranging from field of morphology and syntax. Collocation has been studied thoroughly in corpus and computational linguistics. A mastery of good collocation is vital for second language or...
Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.