Information Retrieval Technology
4th Asia Information Retrieval Symposium, AIRS 2008, Harbin, China, January 15-18, 2008 Revised Selected Papers

Hang Li, Ting Liu, Wei-Ying Ma, Tetsuya Sakai, Kam-Fai Wong, Guodong Zhou

chapter

Research on Asynchronous Communication-Oriented Page Searching

Yulian Fei, Min Wang, Wenjuan Chen

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 412-417

Research on asynchronous communication-oriented page searching aims to solve the new problems that the adoption of asynchronous communication technology poses for search engines. At present, full-text search engine crawlers mostly adopt algorithms based on hyperlink analysis. The crawler searches only the contents of the HTML page and ignores the code in the script region. But it is...

chapter

A Novel Fuzzy Kernel C-Means Algorithm for Document Clustering

Yingshun Yin, Xiaobin Zhang, Baojun Miao, Lili Gao

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 418-423

The Fuzzy Kernel C-Means (FKCM) algorithm can improve accuracy significantly compared with classical Fuzzy C-Means algorithms for nonlinear separability, high dimension and clusters with overlaps in input space. Despite these advantages, several issues limit its application in the real world, such as local optima, outliers, the need to assign the c parameter in advance, and slow convergence...

chapter

Cov-HGMEM: An Improved Hierarchical Clustering Algorithm

Sanming Song, Qunsheng Yang, Yinwei Zhan

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 424-429

In this paper we present an improved method for hierarchical clustering of Gaussian mixture components, derived from the Hierarchical Gaussian Mixture Expectation Maximization (HGMEM) algorithm. Like HGMEM, it is efficient in reducing a large mixture of Gaussians into a smaller mixture while still preserving the component structure of the original model. Compared with the HGMEM algorithm, it takes covariance...

chapter

Improve Web Image Retrieval by Refining Image Annotations

Peng Huang, Jiajun Bu, Chun Chen, Kangmiao Liu, et al.

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 430-435

Automatic image annotation techniques have been proposed to overcome the so-called semantic gap between low-level image features and high-level concepts in content-based image retrieval systems. Due to the limitations of these techniques, current state-of-the-art automatic image annotation models still produce some concepts irrelevant to image semantics, which are an obstacle to getting high-quality image retrieval...

chapter

Story Link Detection Based on Event Model with Uneven SVM

Xiaoyan Zhang, Ting Wang, Huowang Chen

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 436-441

Topic Detection and Tracking refers to automatic techniques for locating topically related materials in streams of data. At its core, story link detection determines whether two stories are about the same topic. Many representation models have been used in story link detection so far, but few of them are specific to stories. This paper proposes an event model based on the characters of...

chapter

Video Temporal Segmentation Using Support Vector Machine

Shaohua Teng, Wenwei Tan

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 442-447

A first step required to allow video indexing and retrieval of visual data is to perform a temporal segmentation, that is, to find the location of camera-shot transitions, which can be either abrupt or gradual. We adopt an SVM technique to decide whether a shot transition exists within a given video sequence. An active learning strategy is used to accelerate the training of SVM classifiers. We also introduce...

chapter

Using Multiple Combined Ranker for Answering Definitional Questions

Junkuo Cao, Lide Wu, Xuanjing Huang, Yaqian Zhou, et al.

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 448-453

This paper presents a Multiple Combined Ranker (MCR) approach for answering definitional questions. In general, our MCR approach first extracts as much knowledge related to the question target as possible, then uses this knowledge to pick appropriate answers. The knowledge includes both online definitions and related terms (RT). In our system, extraction of related terms is different from traditional...

chapter

Route Description Using Natural Language Generation Technology

XueYing Zhang

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 454-459

This paper aims to solve the problems of generating natural language route descriptions in Chinese way-finding systems, on the basis of datasets from geographical information systems and natural language generation technology. The techniques for deriving important information, e.g. paths, roads, directions and landmarks, from geographical information systems are discussed in detail. Through examples we...

chapter

Some Question to Monte-Carlo Simulation in AIB Algorithm

Sanming Song, Qunsheng Yang, Yinwei Zhan

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 460-465

Hierarchical clustering algorithms are efficient in reducing the bytes needed to describe the original information while preserving the original information structure. Information Bottleneck (IB) theory is a hierarchical clustering framework derived from information theory. The Agglomerative Information Bottleneck (AIB) algorithm is a suboptimal agglomerative clustering procedure designed for optimizing...

chapter

An Opinion Analysis System Using Domain-Specific Lexical Knowledge

Youngho Kim, Yuchul Jung, Sung-Hyon Myaeng

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 466-471

In this paper, we describe an opinion analysis system using domain-specific lexical knowledge in Korean economic news. We tested our hypothesis that such domain-specific knowledge helps enhance the performance of statistically based approaches and obtained a promising result.

chapter

A New Algorithm for Reconstruction of Phylogenetic Tree

ZhiHua Du, Zhen Ji

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 472-477

In this paper, we present a new algorithm for reconstructing large phylogenetic...

chapter

A Full Distributed Web Crawler Based on Structured Network

Kunpeng Zhu, Zhiming Xu, Xiaolong Wang, Yuming Zhao

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 478-483

Distributed Web crawlers have recently received more and more attention from researchers. A fully decentralized crawler without a centralized managing server seems to be an interesting architectural paradigm for realizing large-scale information collecting systems, owing to its scalability, failure resilience and increased autonomy of nodes. This paper provides a novel fully distributed Web crawler system which...

chapter

A Simulated Shallow Dependency Parser Based on Weighted Hierarchical Structure Learning

Zhiming Kang, Chun Chen, Jiajun Bu, Peng Huang, et al.

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 484-489

In past years, much research has been done on data-driven dependency parsing, and performance has increased steadily. Dependency grammar has an important inherent characteristic: the nodes closer to the root usually contribute more to audiences than the others. However, this has been ignored in previous research, in which every node in a dependency structure is considered to play the same role...

chapter

One Optimized Choosing Method of K-Means Document Clustering Center

Hongguang Suo, Kunming Nie, Xin Sun, Yuwei Wang

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 490-495

A center selection method based on sub-graph division is presented. After the similarity matrix is constructed, disconnected graphs can be established by taking the text nodes as the vertices of the graph, and the graphs are then analyzed. With this method, the number of cluster centers and the centers themselves can be determined automatically within the allowable error range. Noisy data can be eliminated effectively...

chapter

A Model for Evaluating the Quality of User-Created Documents

Linh Hoang, Jung-Tae Lee, Young-In Song, Hae-Chang Rim

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 496-501

In this paper, we propose a model for evaluating the quality of general user-created documents. The model is based on a supervised classification approach, in which output scores are taken as the quality of a given document. In order to utilize both textual and non-textual attributes of documents, we incorporated a number of objectively measurable, real-valued features selected based on predefined criteria...

chapter

Filter Technology of Commerce-Oriented Network Information

Min Wang, Yulian Fei

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 502-507

With network information growing day by day, people engaged in commerce are calling for a commerce-oriented search engine. The primary step in building such a search engine is to get commercial information efficiently from the Internet. This paper introduces a method for filtering commerce-oriented information from the Internet. By this method, Spider decides the passing orientation by judging...

chapter

IR Interface for Contrasting Multiple News Sites

Masaharu Yoshioka

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 508-513

In order to utilize news articles from multiple news sites, it is better to understand the characteristics of each news site. In this paper, a concept of contrast set mining is applied for analyzing the characteristic difference between each news site and all others. The News Site Contrast (NSContrast) system is also proposed based on this mining technique. This system is applied to a news article...

chapter

Real-World Mood-Based Music Recommendation

Magnus Mortensen, Cathal Gurrin, Dag Johansen

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 514-519

We present a music recommendation system that incorporates both collaborative filtering and mood-based recommendations. The benefits of incorporating mood-based recommendations over both content/genre-based and collaborative filtering-based recommendation are illustrated by means of a real-world, one-month-long user evaluation in which 54 users took part.

chapter

News Page Discovery Policy for Instant Crawlers

Yong Wang, Yiqun Liu, Min Zhang, Shaoping Ma

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 520-525

Many news pages with high freshness requirements are published on the Internet every day. They should be downloaded immediately by instant crawlers; otherwise, they will soon become outdated. In the past, instant crawlers only downloaded pages from a manually generated news website list. Bandwidth is wasted in downloading non-news pages because news websites do not publish news pages exclusively...

chapter

An Alignment-Based Approach to Semi-supervised Relation Extraction Including Multiple Arguments

Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee, Kwangil Ko, et al.

Lecture Notes in Computer Science > Information Retrieval Technology > Poster Session > 526-536

We present an alignment-based approach to the semi-supervised relation extraction task involving more than two arguments. We concentrate on improving not only the precision of the extracted results but also the coverage of the method. Our relation extraction method is based on an alignment-based pattern matching approach, which makes the method more flexible. In addition, we extract all relationships...

Keywords

CLUSTERING (3)
FEATURE SELECTION (3)
SEMI-SUPERVISED LEARNING (3)
TEXT CATEGORIZATION (3)
CONTENT-BASED IMAGE RETRIEVAL (2)
FOCUSED CRAWLING (2)
INFORMATION EXTRACTION (2)
K-MEANS (2)
LANGUAGE MODELING APPROACH (2)
MACHINE LEARNING (2)
MULTI-DOCUMENT SUMMARIZATION (2)
NAMED ENTITY RECOGNITION (2)
ONTOLOGY (2)
SEARCH ENGINE (2)
SVM (2)
TEXT CLASSIFICATION (2)
3D MODEL RETRIEVAL (1)
ACTIVE LEARNING (1)
APRIORI ALGORITHM (1)
ASSOCIATED WORD (1)
ASSOCIATION RULES (1)
ASYNCHRONOUS COMMUNICATION (1)
AUGMENTED INFORMATION (1)
AXIOMATIC APPROACH (1)
BETWEEN-SEGMENT SIMILARITY (1)
BIOMEDICAL NAMED ENTITY (1)
BISECTING K-MEANS CLUSTERING (1)
CATEGORY-SPECIFIC TERMS (1)
CHINESE INFORMATION PROCESSING (1)
CHINESE TEMPORAL EXPRESSIONS (1)
CHINESE TEXT SIMILARITY (1)
CHINESE WEB SEARCH RESULTS (1)
CHINESE WORD SEGMENTATION (1)
CLASSIFICATION (1)
CO-OCCURRENCE MEASURES (1)
COLLABORATIVE FILTERING (1)
COMMERCE-ORIENTED SPIDER (1)
COMPLETE-ARBITRARY PASSAGE (1)
COMPLETELY-ARBITRARY PASSAGE (1)
CONCEPTNET (1)
CONDITIONAL RANDOM FIELD (1)
CONTENT EXTRACTION (1)
CONTENT FILTERING (1)
CORPUS (1)
CRAWLER (1)
CROSS-LANGUAGE INFORMATION RETRIEVAL (1)
DCM (1)
DEFINITIONAL QUESTION ANSWERING (1)
DERIVED QUERY (1)
DISCRETIZATION (1)
DISSIMILARITY MEASURE (1)
DISTANCE MEASURE (1)
DIVIDE-AND-CONQUER (1)
DOCUMENT CLUSTERING (1)
DOCUMENT FREQUENCY (1)
DOCUMENTS (1)
DOMAIN VERB (1)
DYNAMIC PROGRAMMING (1)
EDIT DISTANCE (1)
ELECTRONIC MAP (1)
ENSEMBLE LEARNING (1)
ENTROPY (1)
EVENT MODEL (1)
FEATURE ORIENTED SAMPLES (1)
FEATURE SPACE (1)
FILTERING TRANSFORM (1)
FULL DISTRIBUTED (1)
FUZZY ENTROPY (1)
FUZZY KERNEL C-MEANS (1)
FUZZY NEURAL NETWORKS (1)
FUZZY SET (1)
GRAPH RANKING (1)
GRAPH REPRESENTATION (1)
GRAPHICAL MODEL (1)
HIERARCHICAL CLUSTERING (1)
HIERARCHICAL MODELING (1)
HIERARCHICAL TAXONOMY INTEGRATION (1)
HIERARCHICAL THESAURI INFORMATION (1)
HIERARCHICAL TOPIC TAXONOMY (1)
HOWNET (1)
HYPONYMY (1)
IMAGE DATABASES (1)
INFORMATION RETRIEVAL (1)
INITIAL CENTER (1)
INVERSE DOCUMENT FREQUENCY (1)
KERNEL VALIDITY INDEX (1)
KEYWORDS DEPENDENCY PROFILE (1)
LANGUAGE MODEL (1)
LATENT DIRICHLET ALLOCATION (1)
LEARNING FROM POSITIVE AND UNLABELED EXAMPLES (LPU) (1)
LEXICAL CHAIN (1)
LIFELOGGING (1)
LSA THEORY (1)
MAX-SUM MODEL (1)
MAXIMUM ENTROPY MODELING (1)
MEDICAL IMAGING (1)
MINING CHAT CONVERSATIONS (1)
MOOD (1)
MULTI-SCALE FUSION (1)
MULTIMEDIA RETRIEVAL (1)

INFONA - science communication portal
