Search results

Items from 1 to 20 out of 21 results

article

Summarizing Online Reviews Using Aspect Rating Distributions and Language Modeling

Giuseppe Di Fabbrizio, Ahmet Aker, Robert Gaizauskas

IEEE Intelligent Systems > 2013 > 28 > 3 > 28 - 37

Product and service reviews are abundantly available online, but selecting relevant information from them involves a significant amount of time. The authors address this problem with Starlet, a novel approach for extracting multidocument summarizations that considers aspect rating distributions and language modeling. These features encourage the inclusion of sentences in the summary that preserve...

chapter

Designing effective web mining-based techniques for OOV translation

Haitao Yu, Fuji Ren, Degen Huang, Lishuang Li

Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering(NLPKE-2010) > 1 - 8

2010 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE 2010)

Due to a limited coverage of the existing bilingual dictionary, it is often difficult to translate the Out-Of-Vocabulary terms (OOV) in many natural language processing tasks. In this paper, we propose a general cascade mining technique of three steps, it leverages OOV category to optimize the effectiveness of each step. OOV category based expansion policy is suggested to get more relevant mixed-language...

chapter

A Methodology for Integrating Network Theory and Topic Modeling and its Application to Innovation Diffusion

Jana Diesner, Kathleen M Carley

2010 IEEE Second International Conference on Social Computing > 687 - 692

2010 IEEE Second International Conference on Social Computing (SocialCom 2010). the Second IEEE International Conference on Privacy, Security, Risk and Trust (PASSAT 2010)

Text data pertaining to socio-technical networks often are analyzed separately from relational data, or are reduced to the fact and strength of the flow of information between nodes. Disregarding the content of text data for network analysis can limit our understanding of the effects of language use in networks. We present a computational and interdisciplinary methodology that addresses this limitation...

chapter

Blog Hotness Evaluation Model Based on Text Opinion Analysis

Jianjiang Li, Xuechun Zhang, Yu Weng, Changjun Hu

2009 Eighth IEEE International Conference on Dependable, Autonomic and Secure Computing > 235 - 240

2009 International Conference on Dependable, Autonomic and Secure Computing (DASC 2009)

Aiming at the deficiencies of traditional blog hotness evaluation methods, the paper presents a blog hotness evaluation model based on text opinion analysis (named BHEM-TOA). The model not only considers the number of reviews, comments and publication time of the blog topic, but also focuses on the comment opinion. BHEM-TOA emphasizes subjective opinions of reviewers about the blog topic. It utilizes...

chapter

Using Semantic Web for Information Retrieval Based on Clonal Selection Strategy

Jianming Zhang, Xinliang Tan, Xuehua Huang, Yan Wang

2009 Second International Symposium on Computational Intelligence and Design > 1 > 513 - 516

2009 Second International Symposium on Computational Intelligence and Design (ISCID 2009)

It is well known that information retrieval systems based entirely on syntactic contents have serious limitations. In order to achieve high precision and recall on IR systems, the incorporation of natural language processing techniques that provide semantic information is needed. For this reason, by determining the semantic for the constituents of documents, a clustering method is presented in this...

chapter

Chinese Text Clustering Method Based on Semantics and Special Domain

Dong Jianquan, Zhang Jinchao

2009 International Conference on Web Information Systems and Mining > 195 - 199

2009 International Conference on Web Information Systems and Mining (WISM 2009)

In view of ignoring semantic relationship between words, high dimensionality of data and computational complexity when current text clustering algorithms deal with Chinese texts. This paper presents a new method to cluster Chinese texts based on semantics in a specific field-TCBS (Text Clustering Based on Semantics) algorithm. The algorithm is based on the agglomerative hierarchical clustering algorithm,...

chapter

Analysis on degree words for Chinese emotion expressions based on syntactic parse and rules

Yan Sun, Changqin Quan, F. Ren

2009 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 6

2009 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

Analysis of emotions in texts has wide-ranging applications. In the analysis of emotional expressions, degree words are important for expressing emotion intensity of emotions. With the support of a large Chinese emotion corpus (Ren-CECps), in this paper, we present analysis on degree words for Chinese emotion expressions based on syntactic parse and rules. At first, Ren-CECps is used to extract the...

chapter

Research on Katakana phrase translation based on bi-directional integration

Guiping Zhang, Yonglei Gao, Duo Ji, Xiaona Ren

2009 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 6

2009 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

In order to solve the problem of Katakana reduced to English in Japanese-English translation, we employ the phrase-based statistical machine translation model to perform Katakana phrase (or word) translation from Japanese to English. The Katakana phrase is segmented into words by CRF, and then Japanese-English and English-Japanese bi-directional integration translation is carried out on those segmented...

chapter

Demo of Antelogue: Pronoun Resolution for Dialogues

E. Miltsakaki

2009 IEEE International Conference on Semantic Computing > 567 - 568

2009 IEEE International Conference on Semantic Computing (ICSC)

We present Antelogue, a novel pronoun resolution architecture for dialogues based on efficient filtering of potential antecedents through a simple look-up of information using existing resources (gender, number, NER, etc). Our system does not require large labelled datasets for training or complex handcrafted rules. We will demo the system's real time performance on dialogues extracted from the screenplays...

chapter

A new evaluating method for Chinese text summarization not requiring

Chan Wang, Lei Li, Yixin Zhong

2009 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 7

2009 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

With the rapid development of text summarization, evaluation methods for automatic Chinese text summarization system are becoming more and more important in natural language processing, which can promote development of text summarization greatly. This paper analyzes the existed methods for automatic summarization evaluation, and introduces a new evaluation method based on cluster. The main idea of...

chapter

A Model for Chinese Sentence Ordering Based on Markov Model

Yanxiang He, Gongfu Peng, Weidong Wen

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery > 7 > 457 - 461

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2009)

In this paper, we discuss a method to improve the sentence ordering task in Chinese. The way we approach is based on the analysis of Markov model, which can train transition probability in raw corpus. We iteratively calculate the largest transition probability path in Markov model to confirm the correct order. The method avoids judging the first sentence, which could lead to an instable result in...

chapter

A Multi-agent Model for English Text Chunking

Ying-Hong Liang, Jin-xiang Li, Jun Cao, De-pang Wang

2009 International Conference on Information Technology and Computer Science > 1 > 88 - 92

2009 International Conference on Information Technology and Computer Science (ITCS 2009)

Traditional English text chunking approach is to identify phrases using only one model and same features. It is shown that one model could not consider each phrasepsilas characteristics, and same features are not suitable to all phrases. In this paper, a multi-agent text chunking model is proposed. This model uses individual sensitive features of each phrase to identify different phrases. Through...

chapter

Linguistic Steganography Detection Algorithm Using Statistical Language Model

Peng Meng, Liusheng Hang, Wei Yang, Zhili Chen, more

2009 International Conference on Information Technology and Computer Science > 2 > 540 - 543

2009 International Conference on Information Technology and Computer Science (ITCS 2009)

Steganography is a technique for embedding secret messages into carriers. Linguistic steganography is a branch of text steganography. Research on attacking methods against linguistic steganography plays an important role in information security (IS) area. In this paper, a linguistic steganography detecting algorithm using statistical language model (SLM) is presented. An experiment to detect text...

chapter

An Anaphora Based Information Retrieval Model Extension

F. Santiago do Carmo Pereira, H. Seibel Junior, S.A.A. de Freitas

2009 WRI World Congress on Computer Science and Information Engineering > 4 > 330 - 334

2009 WRI World Congress on Computer Science and Information Engineering, CSIE

Classical information retrieval models are based on representation of document terms without considering linguistic elements. This article presents a model based on the Discourse Nominal Structure; which lets us take linguistic characteristics of text into account. The model presented is evaluated in comparison with the vector space model. Based on observations during the experimentation we propose...

chapter

Combining Lexical Resources with Fuzzy Set Theory for Recognizing Textual Entailment

Jin Feng, Yiming Zhou, T. Martin

2008 International Seminar on Business and Information Management > 2 > 54 - 57

2008 International Seminar on Business and Information Management (ISBIM 2008)

Textual Entailment (TE) recognition is a task which consists in recognizing if a textual expression, the text T, entails another expression, the hypothesis H. Recently it is treated as a common solution for modeling language variability. Textual entailment captures a broad range of semantic oriented inferences needed for many Natural Language Processing (NLP) applications, like Information Retrieval...

chapter

Automatic Segmentation of Hierarchy Feature without Lexicon for Chinese Text Based on Iterative Learning

Shaohua Jiang, Yanzhong Dang

2008 International Conference on Computer Science and Software Engineering > 1 > 657 - 661

2008 International Conference on Computer Science and Software Engineering (CSSE 2008)

Chinese features extraction is indispensable in a processing of Chinese natural language because it is beneficial to Chinese text knowledge discovery and information retrieval. Chinese Segmentation is the precondition of features extraction. To conquer the disadvantage of current Chinese segmentation methods, such as lexicon-based scheme, syntax and rules-based scheme, statistics-based scheme and...

chapter

Chinese Term Recognition and Extraction Based on Hidden Markov Model

Yonghua Cen, Zhe Han, PeiPei Ji

2008 IEEE Pacific-Asia Workshop on Computational Intelligence and Industrial Application > 2 > 219 - 224

2008 Pacific-Asia Workshop on Computational Intelligence and Industrial Application. PACIIA 2008

Motivated by the probabilistic characteristics of syntax compositions especially POS (part of speech) matching of Chinese textual information and the inner structures of most unlexicalized Chinese domain terms, a system framework to recognize and extract domain-specific Chinese terms based on hidden Markov model (HMM) was proposed and implemented. The system learns the HMM parameters by the input...

chapter

Compute the Term Contributed Frequency

Cheng-Lung Sung, Hsu-Chun Yen, Wen-Lian Hsu

2008 Eighth International Conference on Intelligent Systems Design and Applications > 2 > 325 - 328

2008 Eighth International Conference on Intelligent Systems Design and Applications

In this paper, we propose an algorithm and data structure for computing the term contributed frequency (tcf) for all N-grams in a text corpus. Although term frequency is one of the standard notions of frequency in Corpus-Based Natural Language Processing (NLP), there are some problems regarding the use of the concept to N-grams approaches such as the distortion of phrase frequencies. We attempt to...

chapter

An Approach of Chunk Parsing and Entity Relation Extracting to Chinese Based on Conditional Random Fields Model

Jun-hua Wu, Jing Zhou

2008 Eighth International Conference on Intelligent Systems Design and Applications > 1 > 489 - 494

2008 Eighth International Conference on Intelligent Systems Design and Applications

Conditional random fields (CRFs) model is the valid probabilistic model to segment and label sequence data. Comparing with other statistical models, such as HMM, MEHMM, CRFs process the data sequence in terms of the context of data. Chunk analysis is a shallow parsing method to simplify natural language processing. And entity relation extraction is used in establishing relationship between entities...

chapter

HowNet based evaluation for Chinese text summarization

Chan Wang, Lixia Long, Lei Li

2008 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 6

2008 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

With the rapid development of text summarization, evaluation methods for automatic summarization system is becoming more and more important in natural language processing, which can promote development of text summarization greatly. This paper analyzes the existed methods for automatic summarization evaluation, and introduces a new evaluation method based on HowNet. The original tests have shown that...

Data set:
ieee
Keywords:
DATA MINING
NATURAL LANGUAGE PROCESSING
TEXT ANALYSIS
COMPUTATIONAL MODELING

Publication date

Set your own date range

Publication type

book (20)
article (1)

Keywords

FEATURE EXTRACTION (7)
COMPUTATIONAL LINGUISTICS (5)
TRAINING (5)
ACCURACY (4)
HUMANS (4)
INFORMATION RETRIEVAL (4)
MACHINE LEARNING (4)
ALGORITHM DESIGN AND ANALYSIS (3)
CLUSTERING ALGORITHMS (3)
DATA MODELS (3)
INTERNET (3)
LEARNING (ARTIFICIAL INTELLIGENCE) (3)
ANALYTICAL MODELS (2)
ANAPHORA RESOLUTION (2)
CHARACTER RECOGNITION (2)
CLASSIFICATION ALGORITHMS (2)
CORRELATION (2)
GRAMMARS (2)
HIDDEN MARKOV MODELS (2)
INFORMATION PROCESSING (2)
INFORMATION SERVICES (2)
LABELING (2)
MARKOV PROCESSES (2)
MATHEMATICAL MODEL (2)
PATTERN CLUSTERING (2)
PROBABILISTIC LOGIC (2)
PROBABILITY (2)
PROBABILITY DENSITY FUNCTION (2)
SIMILARITY (2)
SPEECH (2)
STATISTICAL ANALYSIS (2)
TEXT MINING (2)
WEB SITES (2)
AFFECTIVE COMPUTING (1)
AGGLOMERATIVE HIERARCHICAL CLUSTERING ALGORITHM (1)
ANTECEDENT FILTERING (1)
ANTELOGUE (1)
ARRAYS (1)
AUTOMATIC SEGMENTATION (1)
AUTOMATIC SUMMARIZATION EVALUATION (1)
AUTOMATIC SUMMARIZATION SYSTEM (1)
AUTOMATIC SUMMARY (1)
BHEM-TOA (1)
BI-DIRECTIONAL INTEGRATION (1)
BIDIRECTIONAL CONTROL (1)
BIDIRECTIONAL INTEGRATION (1)
BILINGUAL DICTIONARY (1)
BIOLOGICAL SYSTEM MODELING (1)
BLOG HOTNESS (1)
BLOG HOTNESS EVALUATION MODEL (1)
BOOK REVIEWS (1)
CHANGE AGENTS (1)
CHARACTERISTIC WORDS (1)
CHINESE ACADEMY OF SCIENCES (1)
CHINESE CHARACTER INDEX (1)
CHINESE CHARACTERS (1)
CHINESE EMOTION CORPUS (1)
CHINESE EMOTION EXPRESSIONS (1)
CHINESE HIERARCHY FEATURE EXTRACTION (1)
CHINESE NATURAL LANGUAGE (1)
CHINESE SEGMENTATION METHOD (1)
CHINESE SENTENCE ORDERING (1)
CHINESE TERM RECOGNITION (1)
CHINESE TEXT (1)
CHINESE TEXT CLUSTERING (1)
CHINESE TEXT KNOWLEDGE DISCOVERY (1)
CHINESE TEXT PROCESSING (1)
CHINESE TEXT SUMMARIZATION (1)
CHINESE TEXT SUMMARIZATION EVALUATING METHOD (1)
CHINESE TEXT UNDERSTANDING CHUNK ANALYSIS (1)
CHINESE TEXTUAL INFORMATION (1)
CHUNK PARSING (1)
CLIR (1)
CLOANL SELECTON ALGORITHM (1)
CLONAL SELECTION STRATEGY (1)
CLUSTER (1)
CLUSTERING METHOD (1)
CLUSTERING METHODS (1)
CO-OCCURRENCE NETWORK (1)
COMPLEXITY THEORY (1)
COMPUTATIONAL COMPLEXITY (1)
COMPUTER ARCHITECTURE (1)
COMPUTERS (1)
CONTEXT (1)
COREFERENCE (1)
CORPUS-BASED NATURAL LANGUAGE PROCESSING (1)
DATA ANALYSIS (1)
DATA CORPUSES (1)
DATA DIMENSIONALITY (1)
DATA STRUCTURE (1)
DATA STRUCTURES (1)
DATABASES (1)
DEGREE WORDS (1)
DIALOGUE (1)
DIRECTED ACYCLIC GRAPH (1)
DIRECTED GRAPHS (1)
more

INFONA - science communication portal

Search results

Summarizing Online Reviews Using Aspect Rating Distributions and Language Modeling

Designing effective web mining-based techniques for OOV translation

A Methodology for Integrating Network Theory and Topic Modeling and its Application to Innovation Diffusion

Blog Hotness Evaluation Model Based on Text Opinion Analysis

Using Semantic Web for Information Retrieval Based on Clonal Selection Strategy

Chinese Text Clustering Method Based on Semantics and Special Domain

Analysis on degree words for Chinese emotion expressions based on syntactic parse and rules

Research on Katakana phrase translation based on bi-directional integration

Demo of Antelogue: Pronoun Resolution for Dialogues

A new evaluating method for Chinese text summarization not requiring

A Model for Chinese Sentence Ordering Based on Markov Model

A Multi-agent Model for English Text Chunking

Linguistic Steganography Detection Algorithm Using Statistical Language Model

An Anaphora Based Information Retrieval Model Extension

Combining Lexical Resources with Fuzzy Set Theory for Recognizing Textual Entailment

Automatic Segmentation of Hierarchy Feature without Lexicon for Chinese Text Based on Iterative Learning

Chinese Term Recognition and Extraction Based on Hidden Markov Model

Compute the Term Contributed Frequency

An Approach of Chunk Parsing and Entity Relation Extracting to Chinese Based on Conditional Random Fields Model

HowNet based evaluation for Chinese text summarization

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options