Search results

chapter

Morpheme-based product features categorization in Chinese reviews mining

Shu Zhang, Wenjie Jia, Yingju Xia, Yao Meng, more

2010 6th International Conference on Advanced Information Management and Service (IMS) > 324 - 329

2010 6th International Conference on Advanced Information Management and Service (IMS 2010)

Pursuing on the analysis of product reviews, an unsupervised product features categorization method is proposed. Morphemes as smallest linguistic meaningful unit are induced in measuring the intra relationship among product features instead of words. Opinion words around product features are chosen to represent the inter relationship among product features instead of full context information. The...

chapter

Web-Based Variant of the Lesk Approach to Word Sense Disambiguation

M.A.R. Gaona, A. Gelbukh, S. Bandyopadhyay

2009 Eighth Mexican International Conference on Artificial Intelligence > 103 - 107

2009 Eighth Mexican International Conference on Artificial Intelligence (MICAI 2009)

Word Sense Disambiguation (WSD) is the task of selecting the meaning of a word based on the context in which the word occurs. The principal statistical WSD approaches are supervised and unsupervised learning. The Lesk method is an example of unsupervised disambiguation. We present a measure for sense assignment useful for the simple Lesk algorithm. We use word co-occurrences of the gloss and the context,...

chapter

Semi-supervised word sense disambiguation based on weakly controlled sense induction

B. Broda, M. Piasecki

2009 International Multiconference on Computer Science and Information Technology > 17 - 24

2009 International Multiconference on Computer Science and Information Technology (IMCSIT)

Word Sense Disambiguation in text is still a difficult problem as the best supervised methods require laborious and costly manual preparation of training data. On the other hand, the unsupervised methods express significantly lower accuracy and produce results that are not satisfying for many application. The goal of this work is to develop a model of Word Sense Disambiguation which minimises the...

chapter

Structure learning for natural language processing

Yizhao Ni, C.J. Saunders, S. Szedmak, M. Niranjan

2009 IEEE International Workshop on Machine Learning for Signal Processing > 1 - 6

2009 IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2009)

We applied a structure learning model, Max-Margin Structure (MMS), to natural language processing (NLP) tasks, where the aim is to capture the latent relationships within the output language domain. We formulate this model as an extension of multi-class Support Vector Machine (SVM) and present a perceptron-based learning approach to solve the problem. Experiments are carried out on two related NLP...

chapter

A segmentation method for crossing ambiguity string based on mutual information and t-test difference

Zhiying Lu, Qianqian Zhao, Le Yang

2009 IEEE Youth Conference on Information, Computing and Telecommunication > 371 - 374

2009 IEEE Youth Conference on Information, Computing and Telecommunication (YC-ICT 2009)

One nodus existing in Chinese word segmentation is the ambiguity problem of which more than 85% are crossing ambiguity, therefore it is significant to decrease the error in dealing with the crossing ambiguity. Taking the advantage of the characteristics of the crossing ambiguity string, a novel method based on the mutual information and t-test difference is proposed to deal with the ambiguities in...

chapter

A method of Chinese named entity recognition based on maximum entropy model

Ning Hui, Yang Hua, Tan Ya-zhou, Wu Hao

2009 International Conference on Mechatronics and Automation > 2472 - 2477

2009 IEEE International Conference on Mechatronics and Automation

There are many connotative semantic features in Chinese which can help Chinese named entity recognition. Moreover, one of the important strongpoint of maximum entropy model is that it can syncretize features in different granularity and level. With that in mind, many Chinese named entity semantic knowledge bases were established by extracting information from corpus in this paper. However, because...

chapter

A New Approach To Accent Restoration Of Vietnamese Texts Using Dynamic Programming Combined With Co-Occurrence Graph

Hoang Trong Nghia, Do Phuc

2009 IEEE-RIVF International Conference on Computing and Communication Technologies > 1 - 4

2009 IEEE-RIVF International Conference on Computing and Communication Technologies (RIVF). Research, Innovation and Vision for the Future

In this paper, we would like to introduce a new approach to recover Vietnamese text's accents. Given a Vietnamese text in which accents are lost, our goal is to seek for a recovered text that yields a best lexical probability. Using a dynamic programming approach, we first build a model of language for Vietnamese as a lexical database which gives lexical probabilities to Vietnamese sentences. Second,...

chapter

A Study on Machine Translation of Register-Specific Terms in Tea Classics

Jiang Xin, Zhao Lixin, Wu Di

2009 WASE International Conference on Information Engineering > 1 > 57 - 60

2009 WASE International Conference on Information Engineering (ICIE)

The rich heritage of Chinese tea culture has attracted an increasing number of people in the world, but the translating of such classical and specialized literature proves to be extremely arduous. Machine translation (MT) is introduced to facilitate the decrypting process. However, when the popular online translation Systran is tried bi-directionally on a high-frequency wordlist from 24 ancient tea...

chapter

Temporal Relations with Signals: The Case of Italian Temporal Prepositions

T. Caselli, F. Dell'Orletta, I. Prodanof

2009 16th International Symposium on Temporal Representation and Reasoning > 125 - 132

2009 16th International Symposium on Temporal Representation and Reasoning (TIME 2009)

This paper presents a maximum entropy tagger for the identification of intra-sentential temporal relations between temporal expressions and eventualities mediated by temporal signals in constructions of the kind "eventuality + signal + temporal relation". The tagger reports an accuracy rate of 90.8%, outperforming the baseline (81.8%). One of the main results of this work is represented...

chapter

Context-Based Approach for Covering Ambiguity Resolution in Chinese Word Segmentation

Su-qin Feng, Su-qin Hou

2009 Second International Conference on Information and Computing Science > 2 > 43 - 46

Second International Conference on Information and Computing Science, ICIC 2009

Covering ambiguity is a vital issue in Chinese word segmentation. Challenges are that disambiguation is depending on the contextual information. This paper collected contextual information statistics of covering ambiguity words and found a context calculation mode by using log likelihood ratio. A weighing calculation formula is designed for considering contextual informationpsilas window size and...

chapter

Ensemble Similarity Measures for Clustering Terms

A. Ittoo, L. Maruster

2009 WRI World Congress on Computer Science and Information Engineering > 4 > 315 - 319

2009 WRI World Congress on Computer Science and Information Engineering, CSIE

Clustering semantically related terms is crucial for many applications such as document categorization, and word sense disambiguation. However, automatically identifying semantically similar terms is challenging. We present a novel approach for automatically determining the degree of relatedness between terms to facilitate their subsequent clustering. Using the analogy of ensemble classifiers in machine...

chapter

Word-Sense Disambiguation using maximum entropy model

N. Chatterjee, R. Misra

2009 Proceeding of International Conference on Methods and Models in Computer Science (ICM2CS) > 1 - 4

2009 International Conference on Methods and Models in Computer Science (ICM2CS)

Natural languages are typically replete with homographs, words which have more than one meaning. Consequently, machine understanding of natural language sentences sometimes suffers from certain ambiguities in getting the correct sense of a word in a given sentence. In this work we present a trainable model for word sense disambiguation (WSD) for resolving this ambiguity. The proposed model applies...

chapter

Resources for Nepali Word Sense Disambiguation

N. Shrestha, P.A.V. Hall, S.K. Bista

2008 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 5

2008 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

Word sense disambiguation (WSD) is a process of identifying proper meaning of words that may have multiple meanings. It is regarded as one of the most challenging problems in the field of natural language processing (NLP). Nepali Language also has words that have multiple meanings, thus giving rise to the problem of WSD in it. In this paper, we investigate the impact of NLP resources like morphology...

chapter

Grammar-Based Automatic Extraction of Definitions

A. Iftene, I. Pistol, D. Trandabat

2008 10th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing > 110 - 115

2008 10th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC 2008)

The paper describes the development and usage of a grammar developed to extract definitions from documents. One of the most important practical usages of the developed grammar is the automatic extraction of definitions from web documents. Three evaluation scenarios were run, the results of these experiments being the main focus of the paper. One scenario uses an e-learning context and previously annotated...

chapter

Automatic Judgment of the Subjectivity and Objectivity of the Chinese Words

Zhang Jing, Jin Hao

2010 International Conference on Intelligent Computing and Cognitive Informatics > 160 - 163

2010 International Conference on Intelligent Computing and Cognitive Informatics (ICICCI 2010)

The effective automatic judgment of the Chinese words sentiment polarity, the most important part of the Chinese sentiment analysis, can improve the building of the subjectivity lexicon and the efficiency of the sentiment analysis. The technology of the Chinese word subjectivity and objectivity judgment is discussed and analyzed, the subjectivity dictionary is defined and the subjective feature model...

INFONA - science communication portal

Search results

Morpheme-based product features categorization in Chinese reviews mining

Web-Based Variant of the Lesk Approach to Word Sense Disambiguation

Semi-supervised word sense disambiguation based on weakly controlled sense induction

Structure learning for natural language processing

A segmentation method for crossing ambiguity string based on mutual information and t-test difference

A method of Chinese named entity recognition based on maximum entropy model

A New Approach To Accent Restoration Of Vietnamese Texts Using Dynamic Programming Combined With Co-Occurrence Graph

A Study on Machine Translation of Register-Specific Terms in Tea Classics

Temporal Relations with Signals: The Case of Italian Temporal Prepositions

Context-Based Approach for Covering Ambiguity Resolution in Chinese Word Segmentation

Ensemble Similarity Measures for Clustering Terms

Word-Sense Disambiguation using maximum entropy model

Resources for Nepali Word Sense Disambiguation

Grammar-Based Automatic Extraction of Definitions

Automatic Judgment of the Subjectivity and Objectivity of the Chinese Words

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options