Search results

chapter

A grapheme-level approach for constructing a Korean morphological analyzer without linguistic knowledge

Jihun Choi, Jonghem Youn, Sang-goo Lee

2016 IEEE International Conference on Big Data (Big Data) > 3872 - 3879

Morphological analysis is an essential step for processing the Korean language, due to the highly agglutinative properties of the language. In this paper, we propose a novel approach for constructing a Korean morphological analyzer that can capture linguistic properties using graphemes as basic processing units. Since our model does not utilize prior linguistic knowledge, the model can be applied to other...

chapter

ExATO - High Quality Term Extraction for Portuguese and English

Lucelene Lopes, Paulo Fernandes, Renata Vieira

2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI) > 540 - 545

This paper presents a novel version of ExATO, a term extractor originally designed to extract relevant terms from corpora in Portuguese. This new version handles not only corpora in Portuguese but also texts in English. This extension is likely to offer the same quality pattern already achieved for Portuguese. In this paper, we draw the analysis of results in parallel corpora...

chapter

Debate on political reforms in Twitter: A hashtag-driven analysis of political polarization

Mirko Lai, Cristina Bosco, Viviana Patti, Daniela Virone

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 9

Political debates about a reform may spark national controversies, leading members of the community to polarize their opinions and sentiment about the topic addressed. With the rise of social media like Twitter, users are encouraged to voice and share their strong and polarized views, and in general people are exposed to broader viewpoints than they were before. The large amount of user-generated...

chapter

Combined Classification for Extracting Named Entities from Arabic Texts

Feriel Ben Fraj Trabelsi, Chiraz Ben Othmane Zribi, Wiem Kouki

2015 First International Conference on Arabic Computational Linguistics (ACLing) > 55 - 60

In this paper, we describe an approach for extracting named entities from Arabic texts. The Arabic language is hard to process because of characteristics that influence even NE extraction. In our case, we consider that named entity extraction can be treated as a typical classification problem. Indeed, this extraction consists of searching for text portions that can be classified in a NE...

chapter

Revisiting Arabic Part of Speech Tagsets

Yahya O.M. Elhadj, Ahmed Abdelali, Rachid Bouziane, Adel H. Ammar

2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA) > 793 - 802

Assigning the appropriate grammatical category to a word given a context is a very important step in major areas of natural language processing. A limited number of part-of-speech taggers currently exist for Arabic. These taggers mainly adopt tagsets that were developed for languages such as English. In this paper we present an effort to propose revised categories for Arabic POS tags that would...

chapter

A semisupervised associative classification method for POS tagging

Pratibha Rani, Vikram Pudi, Dipti Misra Sharma

2014 International Conference on Data Science and Advanced Analytics (DSAA) > 156 - 162

We present here a data mining approach for part-of-speech (POS) tagging, an important natural language processing (NLP) classification task. We propose a semi-supervised associative classification method for POS tagging. Existing methods for building POS taggers require extensive domain and linguistic knowledge and resources. Our method uses a combination of a small POS tagged corpus and untagged...

chapter

Resolving issues in parsing technique in machine translation from Hindi language to English language

Shachi Mall, Umesh Chandra Jaiswal

2014 International Conference on Computer and Communication Technology (ICCCT) > 55 - 58

This paper describes the development of a parser algorithm which is used for Hindi-English machine translation (MT). Machine translation requires analysis, transfer and generation steps to produce target language output from a source language input. A structural representation of Hindi sentences encodes the information of Hindi sentences, and a transfer module can be designed to generate English sentences...

chapter

Resolution to Chinese combinational ambiguity combined corpus-based method with linguistics knowledge

JiangYang Liu, Ying Liu

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD) > 3 > 1469 - 1473

Combinational ambiguity is a challenging issue in Chinese word segmentation in that its disambiguation depends on contextual information. This paper collects contextual information of 28 typical combinational ambiguity strings, and makes use of lexical, syntactic and semantic knowledge and a large-scale corpus to summarize the rules of these combinational ambiguity strings. Using these rules to...

INFONA - science communication portal
