Search results for: Xiaojie Wang

Items from 1 to 16 out of 16 results

chapter

Chinese subjectivity analysis using bilingual knowledge and adaptation technology

Rongjun Li, Yuan Kuang, Xiaojie Wang

2010 IEEE 2nd Symposium on Web Society > 331 - 335

2010 IEEE 2nd Symposium on Web Society (SWS 2010)

Research in opinion analysis have drawn a great attention these days. Many of the effective opinion analysis system are based on supervised learning technology. However there lack of annotation sentiment corpora for Chinese opinion analysis. The purpose of our work is try to make use of annotation English corpora, where are rich and reliable to improve opinion analysis in Chinese. We propose a approach...

chapter

Feature distributions in exponential language models

Huixing Jiang, Xiaojie Wang

2009 IEEE International Conference on Network Infrastructure and Digital Content > 252 - 256

2009 IEEE International Conference on Network Infrastructure and Digital Content (IC-NIDC 2009)

Considering of the features' distribution but not just the counts of features' appearances in sequence makes exponential language models more powerful to capture the global language phenomena. This paper constructs an exponential language model with binary variables' distributions of features, and uses minimum sample risk training method to train model by utilizing more features and adjusting their...

chapter

Automatic Extract Product-Entity from Untagged Review

Rongjun Li, Xiaojie Wang, Songxiang Cen, Yu Mao

2009 Second International Symposium on Knowledge Acquisition and Modeling > 3 > 281 - 284

2009 Second International Symposium on Knowledge Acquisition and Modeling (KAM 2009)

In this paper, we present a new method to extract product entity from Chinese customer reviews. The approach requires no segmentation, no domain dictionary and little prior domain knowledge, which is more suitable for domain with resource-limited. Quite different from the previous work, the proposed method first get the entity candidates use a general version bootstrapping algorithm and then distribute...

chapter

Incorporating multi-task learning in conditional random fields for chunking in semantic role labeling

Saike He, Taozheng Zhang, Xue Bai, Xiaojie Wang, more

2009 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 5

2009 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

This paper presents a novel application of incorporating Alternating Structure Optimization (ASO) to conduct the task of text chunking of Semantic Role Labeling (SRL) in Chinese texts. ASO is a competent linear algorithm based on the theory of multi-task learning. In this paper, by constructing several SRL tasks to constitute a multi-task, we are able to encode the inference obtained by ASO algorithm...

chapter

Research on knowledge elements in exponential language model

Huixing Jiang, Xiaojie Wang

2009 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 5

2009 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

This paper presents an exponential language model (ELM) for modeling and managing knowledge elements. The model has been developed based on minimum sample risk (MSR) algorithm, which is a discriminative training method. ELM uses features to capture global, domain, or sentential language phenomena that is composed of name entities, part of speech strings, personal usage words, positions of words, sentence...

chapter

Exploiting lexical information for function tag labeling

Caixia Yuan, Xiaojie Wang, Fuji Ren

2008 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 8

2008 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

This paper proposes an novel approach to annotate function tags for unparsed text. What distinguishes our work from other attempts in such task is that we assign function tags directly basing on lexical information other than on parsed trees. In order to demonstrate the effectiveness and versatility of our method, we investigate two statistical models for automatic annotation, one is log-linear maximum...

chapter

Ambiguity solution of pinyin segmentation in continuous Pinyin-to-Character conversion

Juan Wen, Xiaojie Wang, Wenzhi Xu, Huixing Jiang

2008 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 7

2008 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

Chinese Pinyin-to-character conversion is a key technology in Chinese Pinyin input system. In sentence based Pinyin-to-character conversion, segmentation of Pinyin string has important influence on performance of Pinyin-to-character conversion. There are lots of ambiguities in segmentation of Pinyin string. This paper classifies them into overlap and combinational ambiguities, and proposes disambiguation...

chapter

Exploiting syntactic and semantic information in coarse chinese question classification

Xin Kang, Xiaojie Wang, Fuji Ren

2008 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 7

2008 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

Recent years have seen great process in studying English question classification. In our research, we learn Chinese question classification by exploiting the result of lexical, syntactic and semantic parsing on question sentences. Support vector machines are adopted to train a classifier on 6 coarse categories using single and combination of different parsing results as features. We find that even...

chapter

Chinese Named Entity Recognition with new contextual features

Ying Qin, Taozheng Zhang, Xiaojie Wang

2008 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 6

2008 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

Chinese named entity recognition (NER) is studied in two directions: inner structure and outer surroundings. Inner structural analyses induce constitutions of person, location and organization name from the point of linguistics. However inner structural rules for named entities only provide necessary conditions for a sequence of Chinese characters being an entity name but not sufficient. Whether a...

chapter

Identification of Noun Phrase with Various Granularities

Ying Qin, Xiaojie Wang, Yixin Zhong

2007 International Conference on Natural Language Processing and Knowledge Engineering > 197 - 202

2007 IEEE International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE '07)

Since noun phrases are the most popular phrases in texts, noun phrase identification is one of vital subtasks of natural language processing. Generally Chinese noun phrases have hierarchical inner structures. This paper proposes an approach of defining various levels of granularity for noun phrases, catering for different application demands. Three levels of granularity noun phrases are proposed,...

chapter

Chinese Verb Sense Disambiguation Using AdaBoosting

Juan Wen, Ying Qin, Xiaojie Wang

2007 International Conference on Natural Language Processing and Knowledge Engineering > 316 - 321

2007 IEEE International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE '07)

This paper uses the adaptive boosting (AdaBoosting) algorithm to the task of word sense disambiguation (WSD) for Chinese verbs. The AdaBoosting algorithm is a kind of ensemble learning method used for classification. We have implemented the classifier using a feature set combining collocation features, syntactic features and semantic features. We test the model on eight polysemous verbs in Chinese...

chapter

Semantic Role Labeling for multi-VP clauses in Chinese

Jie Cai, Caixia Yuan, Xiaojie Wang, Yixin Zhong

2007 International Conference on Natural Language Processing and Knowledge Engineering > 367 - 372

2007 IEEE International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE '07)

We have built a semantic role labeling (SRL) system for Chinese clauses, with some desired information of the main-predicate in each clause and the relevant functional slots. The ability of our SRL system dealing with simple clauses is considered as the basic performance of the system. When processing more complex clauses (namely clauses with more than one verb phrase in our definition of complex...

chapter

Automatic Entity Relation Extraction Based on Maximum Entropy

Suxiang Zhang, Juan Wen, Xiaojie Wang, Lei Li

Sixth International Conference on Intelligent Systems Design and Applications > 1 > 540 - 544

2006 6th International Conference on Intelligent Systems Design and Applications

Entity relation extraction (RE) is an very important research domain in information extraction, we can regard RE as a classification problem in this paper, RE is still original study field in Chinese language now, maximum entropy (ME)-based machine learning is the first time to be used to extract entity relations between named entities from Chinese texts, Thirteen features have been designed for entity...

chapter

Combining Multi-knowledge for Chinese Word Segmentation Disambiguation

Ying Qin, Suxiang Zhang, Xiaojie Wang

Sixth International Conference on Intelligent Systems Design and Applications > 1 > 551 - 556

2006 6th International Conference on Intelligent Systems Design and Applications

In the task of Chinese word segmentation, there are two main segmentation ambiguities, overlapping ambiguity and combination ambiguity. The paper analyzes properties of ambiguities and supposes multi-knowledge approach to disambiguate. Multi-knowledge refers to the knowledge from statistic of large corpus and syntactic, semantic or discourse information about ambiguous words. Class based N-gram and...

chapter

The Research and Application about the Information Extraction in Chinese Domain

Suxiang Zhang, Juan Wen, Ying Qin, Xiaojie Wang, more

2006 8th international Conference on Signal Processing > 3

2006 8th International Conference on Signal Processing

A specific prototype information service system was proposed by this paper, which can send interesting information to user with database search way from unstructured text. In order to achieve this goal, two fundamental issues were studied by using maximum entropy (ME) algorithm, which is named entity recognition and relation extraction. Our named entity recognition approach is distinguished from most...

chapter

A Practical Approach to Resolving Combination Ambiguity in Chinese Word Segmentation

Ying Qin, Suxiang Zhang, Xiaojie Wang

2006 8th international Conference on Signal Processing > 3

2006 8th International Conference on Signal Processing

In Chinese word segmentation task, combination ambiguity is one of challenges not being well settled. The main obstacle exists in the detection of ambiguous words in given texts and their proper segmentations. This paper puts forward a practical approach to automatically collecting ambiguous words and disambiguating based on maximum entropy principle. The experimental result reveals the approach of...

Filter options

Keywords:
NATURAL LANGUAGE PROCESSING

Publication date

Set your own date range

Content availability

Available (13)
None (3)

Keywords

DATA MINING (8)
TEXT ANALYSIS (7)
TRAINING (6)
MAXIMUM ENTROPY METHODS (5)
FEATURE EXTRACTION (4)
GRAMMARS (4)
LEARNING (ARTIFICIAL INTELLIGENCE) (4)
SEMANTIC FEATURE (3)
BOOK REVIEWS (2)
CHINESE LANGUAGE (2)
CHINESE WORD SEGMENTATION (2)
CLASSIFICATION PROBLEM (2)
COMBINATION AMBIGUITY (2)
COMPUTATIONAL LINGUISTICS (2)
CONTEXT (2)
EQUATIONS (2)
EXPONENTIAL LANGUAGE MODELS (2)
GRAMMAR (2)
HIDDEN MARKOV MODELS (2)
INFORMATION EXTRACTION (2)
LABELING (2)
MACHINE LEARNING (2)
MATHEMATICAL MODEL (2)
MINIMUM SAMPLE RISK (2)
MORPHOLOGY (2)
PROBABILITY (2)
PROBABILITY DENSITY FUNCTION (2)
SEMANTIC INFORMATION (2)
STATISTICAL ANALYSIS (2)
STATISTICAL MODEL (2)
STRONTIUM (2)
SUPPORT VECTOR MACHINES (2)
SYNTACTIC PARSING (2)
ADABOOSTING ALGORITHM (1)
ADAPTATION TECHNOLOGY (1)
ALTERNATING STRUCTURE OPTIMIZATION (1)
AMBIGUITY RESOLUTION (1)
AMBIGUOUS WORDS (1)
ANNOTATION ENGLISH CORPORA (1)
ANNOTATION SENTIMENT CORPORA (1)
ASO (1)
ASO ALGORITHM (1)
AUTOMATIC ANNOTATION (1)
AUTOMATIC ENTITY RELATION EXTRACTION (1)
AUTOMATIC EXTRACT PRODUCT-ENTITY (1)
BASELINE N-GRAM MODELS (1)
BILINGUAL KNOWLEDGE (1)
BINARY VARIABLE'S DISTRIBUTION (1)
BINARY VARIABLES (1)
BOOTSTRAPPING ALGORITHM (1)
CHARACTER RECOGNITION (1)
CHINESE CHARACTER (1)
CHINESE CHARACTER CONVERSION (1)
CHINESE CUSTOMER REVIEWS (1)
CHINESE DOMAIN (1)
CHINESE INTERNET CHAT CORPUS (1)
CHINESE MOBILE SHORT MESSAGES (1)
CHINESE NAMED ENTITY RECOGNITION (1)
CHINESE NER SYSTEM (1)
CHINESE NOUN PHRASES (1)
CHINESE PERSON NAMES (1)
CHINESE PINYIN (1)
CHINESE QUESTION CLASSIFICATION (1)
CHINESE SUBJECTIVITY ANALYSIS (1)
CHINESE TEXT CHUNKING TASK (1)
CHINESE TEXTS (1)
CHINESE VERB SENSE DISAMBIGUATION (1)
CLASSIFICATION (1)
CLASSIFICATION ALGORITHMS (1)
CLASSIFICATION TREE ANALYSIS (1)
COAE2008 CORPUS (1)
COLLOCATION FEATURE (1)
COMBINATIONAL AMBIGUITIES (1)
COMPETENT LINEAR ALGORITHM (1)
COMPUTATIONAL MODELING (1)
CONDITIONAL RANDOM FIELD (1)
CONDITIONAL RANDOM FIELDS (1)
CONTEXT MODELING (1)
CONTEXTUAL FEATURES (1)
CONTEXTUAL KNOWLEDGE (1)
CONTINUOUS PINYIN-TO-CHARACTER CONVERSION (1)
CRF (1)
CRFS (1)
DICTIONARIES (1)
DISAMBIGUATION (1)
DISAMBIGUATION ALGORITHMS (1)
DISCOURSE INFORMATION (1)
DISCRIMINATIVE TRAINING METHOD (1)
DOMAIN ADAPTATION (1)
EDUCATIONAL INSTITUTIONS (1)
ENCODING (1)
ENCODING SCHEMES (1)
ENGINES (1)
ENSEMBLE LEARNING METHOD (1)
ENTITY EXTRACTION (1)
ENTITY RECOGNITION (1)
ENTITY RELATION EXTRACTION AND EVALUATION (1)
ENTROPY (1)
EXPONENTIAL LANGUAGE MODEL (1)
more

INFONA - science communication portal

Search results for: Xiaojie Wang

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options