Search results for: Li Huang

Items from 1 to 5 out of 5 results

chapter

Automatic text categorization based on Jensen-Shannon Divergence

Xiangdong Li, Denghui Sun, Li Huang Wuhan

2011 International Conference on E-Business and E-Government (ICEE) > 1 - 4

2011 International Conference on E-Business and E-Government (ICEE)

This paper studies the principle of text categorization in which Jensen-Shannon Divergence is used to calculate text similarity, comparing its accuracy of classification and time taking to the traditional Cosine Similarity algorithm. Experimental research shows that Jensen-Shannon Divergence algorithm will reach better results when test materials remain unchanged.

chapter

Name Disambiguation Using Semantic Association Clustering

Hai Jin, Li Huang, Pingpeng Yuan

2009 IEEE International Conference on e-Business Engineering > 42 - 48

2009 IEEE International Conference on e-Business Engineering. ICEBE 2009

Due to homonyms, abbreviations, etc., name ambiguity is widely available in Web and e-document. For example, when integrating heterogeneous literature databases, because there are different name specifications, different authors may be thought of as the same author, and vice versa. Therefore, name ambiguity makes data robust even dirty and lowers the precision of information retrieval. In this paper,...

chapter

A new marketing effectiveness metric based on web data mining

Li Huang, Xiangmin Zhang

2009 1st IEEE Symposium on Web Society > 5 - 9

2009 1st IEEE Symposium on Web Society (SWS)

The dominance of the Internet in our lives sees permanent changes of how marketers conduct their marketing and measure their marketing performance. Traditional measurement methods fall short for not being timely and effective. In this study, we propose the use of Web data, in a quantitative metric, to assess market impact of brands. The metric consists of three independent dimensions, covering measures...

chapter

Duplicate Records Cleansing with Length Filtering and Dynamic Weighting

Li Huang, Hai Jin, Pingpeng Yuan, Fan Chu

2008 Fourth International Conference on Semantics, Knowledge and Grid > 95 - 102

2008 Fourth International Conference on Semantics, Knowledge and Grid (SKG)

Due to diversity of data formats, missing of certain properties, imprecise records in heterogeneous literature databases, there exist duplicate records when integrating heterogeneous databases. Duplicate records lower the efficiency of information retrieval. In this paper, we propose an approach, named length filtering and dynamic weighting (LFDW) for duplicate records cleansing. There are three steps...

chapter

Similarity Computation of Chinese Question Based on Chunk

Zheng-Tao Yu, Lei Hu, Li Huang, Jing-Hui Deng, more

2006 International Conference on Machine Learning and Cybernetics > 17 - 22

Proceedings of 2006 International Conference on Machine Learning and Cybernetics

The currently similarity computation methods of Chinese sentence and their shortcomings are analyzed at first. According to the characteristic of the Chinese question sentence, Chinese question general chunk and special chunk are defined, and then a similarity computation method of Chinese question based on chunk is proposed. In this method, the semantic similarity of words is computed on the basis...

Filter options

Keywords:
INFORMATION RETRIEVAL

Publication date

Set your own date range

Keywords

CLUSTERING ALGORITHMS (2)
DATA MINING (2)
EQUATIONS (2)
MATHEMATICAL MODEL (2)
ACCURACY (1)
CHINESE QUESTION SENTENCE (1)
CHUNK PARSING THEORY (1)
CHUNK SIMILARITY (1)
CITESSEER (1)
CLASSIFICATION ALGORITHMS (1)
CLUSTING (1)
COMPANIES (1)
COMPUTATIONAL LINGUISTICS (1)
CONSUMER BEHAVIOUR (1)
COSINE SIMILARITY (1)
CURRENT MARKET POSITION (1)
CUSTOMER AWARENESS (1)
CUSTOMER BEHAVIOUR (1)
CUSTOMER SATISFACTION (1)
DATA FORMAT DIVERSITY (1)
DATA HANDLING (1)
DATABASES (1)
DBLP (1)
DISTANCE MEASUREMENT (1)
DISTRIBUTED DATABASES (1)
DOCUMENT HANDLING (1)
DUPLICATE RECORDS CLEANSING (1)
DYNAMIC SLIDING-WINDOW ALGORITHM (1)
DYNAMIC WEIGHTING (1)
E-DOCUMENT (1)
FILTERING (1)
FUZZY NAME MATCHING METHOD (1)
FUZZY SET THEORY (1)
GENERAL CHUNK (1)
GOVERNMENT (1)
GOVERNMENT RELATION MEASURE (1)
GRAMMARS (1)
HETEROGENEOUS DATABASES (1)
HETEROGENEOUS LITERATURE DATABASE (1)
HEURISTIC ALGORITHMS (1)
HIDDEN MARKOV MODELS (1)
HMM LEARNING METHOD (1)
HOWNET (1)
INDEXES (1)
INFORMATION SERVICES (1)
INSTANT CUSTOMER FEEDBACK (1)
INTERNET (1)
JENSEN-SHANNON DIVERGENCE (1)
KNN ALGORITHM (1)
LENGTH FILTERING (1)
LIBRA (1)
LIBRARIES (1)
MARKETING DATA PROCESSING (1)
MARKETING EFFECTIVENESS METRIC (1)
MARKETING PERFORMANCE MEASUREMENT (1)
MEDIA (1)
MERGING (1)
NAME DISAMBIGUATION (1)
NAME SPECIFICATION (1)
NATURAL LANGUAGES (1)
PATTERN CLUSTERING (1)
PATTERN MATCHING (1)
PROBABILITY DENSITY FUNCTION (1)
PUBLIC RELATION MEASURE (1)
PUBLIC RELATIONS (1)
PUBLISHING (1)
RESEARCH AND DEVELOPMENT (1)
SAND (1)
SEARCH ENGINE (1)
SEARCH ENGINES (1)
SEMANTIC ASSOCIATION (1)
SEMANTIC ASSOCIATION CLUSTERING (1)
SEMANTIC ASSOCIATION-BASED NAME DISAMBIGUATION METHOD (1)
SENTENCE SIMILARITY (1)
SIMILARITY COMPUTATION METHODS (1)
SPECIAL CHUNK (1)
SUN (1)
SUPPORT VECTOR MACHINES (1)
SVM LEARNING METHODS (1)
TEXT ANALYSIS (1)
TEXT CATEGORIZATION (1)
WEB DATA MINING (1)
WEB SITES (1)
WEBSITE (1)
WORD SEMANTIC SIMILARITY (1)
WORD SIMILARITY (1)
more

INFONA - science communication portal

Search results for: Li Huang

Automatic text categorization based on Jensen-Shannon Divergence

Name Disambiguation Using Semantic Association Clustering

A new marketing effectiveness metric based on web data mining

Duplicate Records Cleansing with Length Filtering and Dynamic Weighting

Similarity Computation of Chinese Question Based on Chunk

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options