Wyniki wyszukiwania

rozdział

Shallow parsing in Turkish

Ozan Topsakal, Onur Acikgoz, Ali Tunca Gurkan, Ali Bugra Kanburoglu, więcej

2017 International Conference on Computer Science and Engineering (UBMK) > 480 - 485

2017 International Conference on Computer Science and Engineering (UBMK)

In this study, shallow parsing is applied on Turkish sentences. These sentences are used to train and test the per-formances of various learning algorithms with various features specified for shallow parsing in Turkish.

rozdział

Pivot-Based Hybrid Machine Translation to Support Multilingual Communication

Arbi Haza Nasution, Nesi Syafitri, Panji Rachmat Setiawan, Des Suryani

2017 International Conference on Culture and Computing (Culture and Computing) > 147 - 148

2017 International Conference on Culture and Computing (Culture and Computing)

Machine Translation (MT) is very useful in supporting multicultural communication. Existing Statistical Machine Translation (SMT) which requires high quality and quantity of corpora and Rule-Based Machine Translation (RBMT) which requires bilingual dictionaries, morphological, syntax, and semantic analyzer are scarce for low-resource languages. Due to the lack of language resources, it is difficult...

rozdział

Improving the rule based machine translation system using sentence simplification (english to tamil)

B. Kavirajan, M. Anand Kumar, K.P. Soman, S. Rajendran, więcej

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 957 - 963

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

The ultimate aim of this research is to develop a Rule Based Machine Translation System (RBMT) using sentence simplification. The sentence pattern for English is SVO and Tamil is SOV. Complex and larger sentence are not easy to parse and translate. So, the sentence simplifier is also accommodated in the rule based system to split a large sentence into simple multiple sentences. Machine translation...

rozdział

An Adaptive Machine Translator for Multilingual Communication

Ryan Lane, Ajay Bansal

2017 IEEE 26th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE) > 21 - 23

2017 IEEE 26th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE)

Machine translation (MT) between natural languages is an infamously difficult problem in Natural Language Processing that is still very much being researched. This research study explores the efficacy of developing an adaptive translator using Lexical Functional Grammars. The main research objective is building a machine translator generator for multilingual communication, i.e. developing a system...

rozdział

A linguistic annotation scheme of Chinese discourse structures and study of prosodic interactions

Yuan Jia, Aijun Li

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)

Speech discourse comprehension is crucial for developing intelligent speech processing technologies. The present research aims to establish a multi-layered annotation scheme for Chinese discourse that contains inter-related information of phonetics, phonology, syntax, semantics and pragmatics. This research provides a theoretical foundation and analytical support for discourse comprehension by examining...

rozdział

Methods for automatic generation of GRAALAN-based phonetic databases

Stefan - Stelian Diaconescu, Monica - Mihaela Rizea, Felicia - Carmen Codirlasu, Mihaela Ionescu, więcej

2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 8

2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

This paper presents methods for automatic generation of phonetic databases (The Morphological and Phonetic Dictionary, The Phonetic Dictionary of Syllables, The Rhyming Dictionary) for a natural language, starting from a set of linguistic knowledge bases. The knowledge bases are developed by means of the GRAALAN (Grammar Abstract Language) system. The exemplification of this process will be described...

rozdział

Toward an efficient Arabic Part of Speech Tagger

Ahmed Abdelali, Yahya O. Mohamed Elhadj, Rachid Bouziane

2013 ACS International Conference on Computer Systems and Applications (AICCSA) > 1

2013 ACS International Conference on Computer Systems and Applications (AICCSA)

The task of tagging and allotting the correct Part of Speech (POS) to text given its context is not obvious and requires expertise and use of considerable resources. Automating such task and building tools that can carry such job is crucial and imperative to advance in major areas of natural language processing. A limited numbers of Part of Speech Taggers exist currently for Arabic and their availability...

rozdział

Dependency Parsing on Source Language with Reordering Information in SMT

Lei Chen, Miao Li, Miantao He, Hui Liu

2012 International Conference on Asian Language Processing > 133 - 136

2012 International Conference on Asian Language Processing (IALP)

In statistical machine translation, many translation errors may easily occur especially when the word orders are very different between source language and target language, especially with asymmetric morphological structures. The paper investigates combining a rule-based reordering model with conventional dependency parsing at the source side, which can alleviate both the asymmetry of morphological...

artykuł

Discriminative Language Modeling With Linguistic and Statistically Derived Features

Ebru Arisoy, Murat Saraclar, Brian Roark, Izhak Shafran

IEEE Transactions on Audio, Speech, and Language Processing > 2012 > 20 > 2 > 540 - 550

This paper focuses on integrating linguistically motivated and statistically derived information into language modeling. We use discriminative language models (DLMs) as a complementary approach to the conventional $n$ -gram language models to benefit from discriminatively trained parameter estimates for overlapping features. In our DLM approach, relevant information is encoded as features. Feature weights...

rozdział

Nominal Transfer from Tamil to Hindi

Sobha Lalitha Devi, V Kavitha, Pravin Pralayankar, S Menaka, więcej

2010 International Conference on Asian Language Processing > 270 - 273

2010 International Conference on Asian Language Processing (IALP 2010)

Transfer Grammar is an integral component of a Rule based Machine Translation system. In this paper, we describe a subset of the transfer grammar developed for Tamil to Hindi Machine Translation system, i.e., the transfer of nominal constructions from Tamil to Hindi. Nominal constructions in Tamil, which is an agglutinative language, take multiple suffixes which may be case markers or other suffixes...

rozdział

Two Cores in Chinese Negation System: A Corpus-Based View

Hio Tong Chan, Chunyu Kit

2010 International Conference on Asian Language Processing > 87 - 90

2010 International Conference on Asian Language Processing (IALP 2010)

This paper presents an empirical study of grammatical distribution of Chinese negation markers based on available corpus data. Most previous studies adopted theoretical approaches and focused on the roles of two dominant negators, bu and mei(you) in Chinese negation, paying very little attention to other complementary negators such as bie and fei . To achieve a comprehensive view on Chinese negation,...

rozdział

PerGram: A TRALE implementation of an HPSG fragment of Persian

Stefan Muller, Masood Ghayoomi

Proceedings of the International Multiconference on Computer Science and Information Technology > 461 - 467

2010 International Multiconference on Computer Science and Information Technology (IMCSIT 2010)

In this paper, we discuss an HPSG grammar of Persian (PerGram) that is implemented in the TRALE system. We describe some of the phenomena which are currently covered. While working on the grammar, we developed a test suite with positive and negative examples from the linguistic literature. To be able to test the coverage of the grammar with respect to naturally occurring sentences, we use a subcorpus...

rozdział

A Dependency Treebank of the Quran using traditional Arabic grammar

Kais Dukes, Tim Buckwalter

2010 The 7th International Conference on Informatics and Systems (INFOS) > 1 - 7

2010 7th International Conference on Informatics and Systems (INFOS 2010)

The Quran is a significant religious text, followed by the 1.5 billion believers of the Islamic faith worldwide. The text dates to 610-632 CE and is written in Quranic Arabic, the direct ancestor language of modern standard Arabic in use today. This paper presents the Quranic Arabic Dependency Treebank (QADT) and reports on the approaches and solutions used to apply Natural Language Processing to...

rozdział

Collocation Studies from Chinese English Learner's Perspective

Wenxin Xiong

2010 International Conference on Intelligent Computing and Cognitive Informatics > 175 - 178

2010 International Conference on Intelligent Computing and Cognitive Informatics (ICICCI 2010)

Collocation is such a language phenomenon that a sequence of words or terms which co-occur more often than would be expected by chance. It is different from frozen idioms or free word combinations in a continuum ranging from field of morphology and syntax. Collocation has been studied thoroughly in corpus and computational linguistics. A mastery of good collocation is vital for second language or...

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania

Shallow parsing in Turkish

Pivot-Based Hybrid Machine Translation to Support Multilingual Communication

Improving the rule based machine translation system using sentence simplification (english to tamil)

An Adaptive Machine Translator for Multilingual Communication

A linguistic annotation scheme of Chinese discourse structures and study of prosodic interactions

Methods for automatic generation of GRAALAN-based phonetic databases

Toward an efficient Arabic Part of Speech Tagger

Dependency Parsing on Source Language with Reordering Information in SMT

Discriminative Language Modeling With Linguistic and Statistically Derived Features

Nominal Transfer from Tamil to Hindi

Two Cores in Chinese Negation System: A Corpus-Based View

PerGram: A TRALE implementation of an HPSG fragment of Persian

A Dependency Treebank of the Quran using traditional Arabic grammar

Collocation Studies from Chinese English Learner's Perspective

Opcje filtrowania

Data publikacji

Dostępność treści

Typ publikacji

Słowa kluczowe

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Dostępność treści

Typ publikacji

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu