Search results for: Wen-Lian Hsu

Items from 1 to 4 out of 4 results

chapter

Compute the Term Contributed Frequency

Cheng-Lung Sung, Hsu-Chun Yen, Wen-Lian Hsu

2008 Eighth International Conference on Intelligent Systems Design and Applications > 2 > 325 - 328

2008 Eighth International Conference on Intelligent Systems Design and Applications

In this paper, we propose an algorithm and data structure for computing the term contributed frequency (tcf) for all N-grams in a text corpus. Although term frequency is one of the standard notions of frequency in Corpus-Based Natural Language Processing (NLP), there are some problems regarding the use of the concept to N-grams approaches such as the distortion of phrase frequencies. We attempt to...

chapter

A Survey of State of the Art Biomedical Text Mining Techniques for Semantic Analysis

Hong-Jie Dai, J.Y.-W. Lin, Chi-Hsin Huang, Pei-Hsuan Chou, more

2008 IEEE International Conference on Sensor Networks, Ubiquitous, and Trustworthy Computing (sutc 2008) > 410 - 417

2008 IEEE International Conference on Sensor Networks, Ubiquitous, and Trustworthy Computing (SUTC '08)

The abstract is to be in fully-justified italicized text, at the top of the left-hand column as it is here, below the author information. Use the word "Abstract" as the title, in 12-point Times, boldface type, centered relative to the column, initially capitalized. The abstract is to be in 10-point, single-spaced type, and up to 150 words in length. Leave two blank lines after the abstract,...

chapter

Web Directory Integration Using Conditional Random Fields

Terry Wu, Wen-lian Hsu

2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'6) > 540 - 543

2006 IEEE/WIC/ACM International Conference on Web Intelligence

The purpose of integrating web directories is to transfer instances from a source to a target directory. Unlike conventional text categorization, in directory integration, there is extra information about the source directory that can be used to improve the classification accuracy. Many approaches exploit the measured similarity between two corresponding classes to enhance traditional text classifiers...

chapter

Chinese Word Segmentation with Minimal Linguistic Knowledge: An Improved Conditional Random Fields Coupled with Character Clustering and Automatically Discovered Template Matching

Richard Tsai, Hong-jie Dai, Hsieh-chuan Hung, Cheng-lung Sung, more

2006 IEEE International Conference on Information Reuse&Integration > 274 - 279

Proceedings of the 2006 IEEE International Conference on Information Reuse and Integration

This paper addresses three major problems of closed task Chinese word segmentation (CWS): word overlap, tagging sentences interspersed with non-Chinese words, and long named entity (NE) identification. For the first, we use additional bigram features to approximate trigram and tetragram features. For the second, we first apply K-means clustering to identify non-Chinese characters. Then, we employ...

Filter options

Keywords:
TEXT ANALYSIS

Publication date

Set your own date range

Keywords

NATURAL LANGUAGE PROCESSING (3)
CONDITIONAL RANDOM FIELDS (2)
ABSTRACT (1)
ALGORITHM DESIGN AND ANALYSIS (1)
ARRAYS (1)
ART BIOMEDICAL TEXT MINING TECHNIQUES (1)
BIGRAM FEATURE (1)
BIOLOGY COMPUTING (1)
CHARACTER CLUSTERING (1)
CHINESE TEXT (1)
CHINESE WORD SEGMENTATION (1)
CLASSIFICATION (1)
COMPUTATIONAL LINGUISTICS (1)
COMPUTATIONAL MODELING (1)
CORPUS-BASED NATURAL LANGUAGE PROCESSING (1)
DATA MINING (1)
DATA STRUCTURE (1)
DATA STRUCTURES (1)
DIRECTED ACYCLIC GRAPH (1)
DIRECTED GRAPHS (1)
FINITE-STATE MODEL (1)
INFORMATION RETRIEVAL (1)
INTERNET (1)
K-MEANS CLUSTERING (1)
LINGUISTIC KNOWLEDGE (1)
MARKOV PROCESS (1)
MARKOV PROCESSES (1)
NAMED ENTITY IDENTIFICATION (1)
NAMED ENTITY RECOGNITION (1)
PATTERN CLUSTERING (1)
PROBABILITY (1)
RAILS (1)
RANDOM PROCESSES (1)
RELATION EXTRACTION (1)
SEMANTIC ANALYSIS (1)
SEMANTIC ROLE LABELLING (1)
SUFFIX ARRAY (1)
TAGGING SENTENCES (1)
TEMPLATE MATCHING (1)
TERM CONTRIBUTED FREQUENCY (1)
TERM FREQUENCY (1)
TETRAGRAM FEATURE (1)
TEXT MINING (1)
TIME FREQUENCY ANALYSIS (1)
TRIGRAM FEATURE (1)
TWO-TAGGER METHOD (1)
WEB DIRECTORY INTEGRATION (1)
WORD OVERLAP (1)
more

INFONA - science communication portal

Search results for: Wen-Lian Hsu

Compute the Term Contributed Frequency

A Survey of State of the Art Biomedical Text Mining Techniques for Semantic Analysis

Web Directory Integration Using Conditional Random Fields

Chinese Word Segmentation with Minimal Linguistic Knowledge: An Improved Conditional Random Fields Coupled with Character Clustering and Automatically Discovered Template Matching

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options