Items from 21 to 40 out of 56 results

chapter

Research on Applicability of Sentence Similarity Algorithms in Text Copy Detection

Kong Sheng, Wang Yu

2010 International Conference on E-Product E-Service and E-Entertainment > 1 - 5

2010 International Conference on E-Product E-Service and E-Entertainment (ICEEE 2010)

Sentence similarity computation is a research topic in natural language processing, and plays an important role in example-based machine translation, information retrieval, text mining and other fields. Sentence similarity algorithms have different applicability in different environments. This paper reviewed and analyzed five kinds of sentence similarity algorithm, and tested...

chapter

Dominating ranking algorithm for information retrieval

Huilin Liu, Zhiqing Li, Junchang Xin, Chen Chen

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 5 > 2456 - 2460

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

There are many ranking algorithms used in Web information retrieval. However, current algorithms have some problems: they either rely on different formulas to calculate the similarity between documents and the query, or train on large amounts of data to obtain a formula that calculates that similarity. We know that this process is very complex, and sometimes...

chapter

Chinese text categorization study based on CBM learning

Yan Zhan, Hao Chen

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 4 > 1511 - 1514

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Text Categorization (TC) is an important component in many information organization and information management tasks. In many TC applications, the case-base grows at a fast rate and this causes inefficiency in the case retrieval process. Using Case-Base Maintenance learning via the GC (Generalization Capability) algorithm, which can reduce the number of cases used by the KNN algorithm, can improve efficiency...

chapter

Pattern Matching with Flexible Wildcards and Recurring Characters

Haiping Wang, Fei Xie, Xuegang Hu, Peipei Li, more

2010 IEEE International Conference on Granular Computing > 782 - 786

2010 IEEE International Conference on Granular Computing (GrC-2010)

Pattern matching is an important task, which is widely used in many fields, such as information retrieval and bioinformatics. Recently, a much more flexible pattern matching problem with wildcards has been proposed. Chen et al. introduced local constraints, global constraints and the one-off condition into the task of pattern matching, and the most representative algorithm SAIL was designed. However,...

chapter

Web document clustering based on a new niching Memetic Algorithm, Term-Document Matrix and Bayesian Information Criterion

C Cobos, C Montealegre, M Mejia, M Mendoza, more

IEEE Congress on Evolutionary Computation > 1 - 8

2010 IEEE Congress on Evolutionary Computation

This paper introduces a new description-centric algorithm for web document clustering based on Memetic Algorithms with Niching Methods, Term-Document Matrix and Bayesian Information Criterion. The algorithm defines the number of clusters automatically. The Memetic Algorithm provides a combined global and local strategy for a search in the solution space and the Niching methods to promote diversity...

chapter

Framework for analysis and improvement of data-fusion algorithms

Mohammad Othman Nassar, Ghassan Kanaan, Hussain A H Awad

2010 2nd IEEE International Conference on Information Management and Engineering > 379 - 382

2010 2nd IEEE International Conference on Information Management and Engineering (ICIME 2010)

Data-fusion techniques have been investigated by many researchers and have been used in implementing several information retrieval systems. Introducing a new or improved data-fusion algorithm is an active research area in the research community. We propose a framework for analysis and improvement of data-fusion algorithms; this framework is going to be: first, a supportive tool for researchers...

chapter

Research and Implement on Genetic Algorithm and Ant Colony Algorithm in Chinese Question Answering System

Shuling Di, Pilian He, Huan Li

2010 Second International Conference on Computer Engineering and Applications > 1 > 166 - 169

2010 Second International Conference on Computer Engineering and Applications (ICCEA 2010)

This paper first transformed the process of Chinese question answering into agent coalition formation, and then obtained the solution by using a combination of a genetic algorithm and an ant colony algorithm. The idea and routine of the algorithm were given. The coding scheme, selection scheme, crossover operator, mutation operator and so on of a genetic algorithm suitable for Chinese question answering agent...

chapter

A Study on Tactics for Corporate Website Development Aiming at Search Engine Optimization

Mo Yunfeng

2010 Second International Workshop on Education Technology and Computer Science > 3 > 673 - 675

2010 2nd International Workshop on Education Technology and Computer Science (ETCS)

Along with the rapid growth of network information, using search engines to search for information has become an integral part of everyday life. In recent years, research has focused on the search engine optimization technologies used to quickly publish business information onto the search engines, by which higher rankings can be kept. The present paper analyzes the impact of receiving and recording...

chapter

Information retrieval of video images library based on multi-modal

Xiaoping Li, Libai Ha, Jinghui Chen, Xiaoxing Lv, more

4th International Conference on New Trends in Information Science and Service Science > 386 - 389

2010 4th International Conference on New Trends in Information Science and Service Science (NISS 2010)

In distance education retrieval, image retrieval and video retrieval are the most prominent tasks. This paper presents a multi-modal multimedia information retrieval system oriented to distance education, and makes a preliminary attempt at applying a support vector machine (SVM) relevance feedback algorithm to image classification.

chapter

Obtaining term similarities on concept extraction study

K Balkan, H Takci

National Conference on Electrical, Electronics and Computer Engineering > 578 - 582

2010 National Conference on Electrical, Electronics and Computer Engineering (ELECO 2010)

Concept extraction promises to improve the performance of term-based text mining, which has high complexity. The first phase of concept extraction is to detect the terms whose frequency is notable enough to represent the documents. Grouping these terms is an important step on the way to concepts. Transition from terms to concepts; by clustering the terms according to similarities...

chapter

An Improved PageRank Algorithm Based on Latent Semantic Model

Xiaoyun Chen, Baojun Gao, Ping Wen

2009 International Conference on Information Engineering and Computer Science > 1 - 4

2009 International Conference on Information Engineering and Computer Science. ICIECS 2009

The traditional PageRank (PR) takes into account only the Web link structure; when distributing rank scores it treats all links equally, which results in topic drift. In this paper, a latent semantic model (LSM) is used to calculate the similarity between Web pages, and the LSMPageRank (LPR) algorithm is introduced. In this algorithm, the value of a parent page is distributed to the child on the basis...
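
As an illustration of the "treats all links equally" behaviour mentioned in the abstract above, here is a minimal sketch of the textbook PageRank iteration. It is not the LPR algorithm from the paper; the function name `pagerank`, the damping factor of 0.85, the iteration count, and the toy graph are all assumptions chosen for the example.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Textbook PageRank by power iteration (illustrative sketch only).

    links: dict mapping each page to the list of pages it links to.
    Returns a dict mapping each page to its rank; ranks sum to 1.
    """
    # Collect every page that appears as a source or as a link target.
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}

    for _ in range(iterations):
        # Every page starts with the "random jump" share.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # Each outgoing link receives an equal share of the
                # page's rank, regardless of topical similarity.
                share = damping * rank[page] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # A page with no outgoing links spreads its rank evenly.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


# Hypothetical three-page graph used only for demonstration.
toy_links = {"A": ["B", "C"], "B": ["C"], "C": ["A"]}
toy_ranks = pagerank(toy_links)
```

The equal split in `share` is the step that a similarity-weighted variant would replace with weights proportional to some page-to-page similarity score.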

chapter

Integration of mutual information and text mining methods for extracting gene-gene interactions from gene expression data

D.H. Millis, J.L. Solka, L.K. Matukumalli

2009 IEEE International Conference on Bioinformatics and Biomedicine Workshop > 357

2009 IEEE International Conference on Bioinformatics and Biomedicine Workshop, BIBMW

Mutual information algorithms have been used for the identification of gene-gene interactions in gene expression data. These methods have been hindered by a high false-positive rate. We explored the use of free-text abstracts as an additional source of information for assessing the biological relevance of predicted gene interactions. Our results suggest that the performance of a mutual information...

chapter

Document Classification Algorithm Based on IB and LS-SVM

Ziqiang Wang, Xia Sun

2009 Third International Symposium on Intelligent Information Technology Application > 1 > 279 - 282

2009 Third International Symposium on Intelligent Information Technology Application

Document classification has received extensive attention in the past few decades due to its wide applications in many fields. To efficiently deal with this problem, a novel document classification algorithm based on information bottleneck (IB) and the least squares version of SVM (LS-SVM) is proposed in this paper. Extensive experimental results on the real-world document corpus show that the proposed algorithm...

chapter

Meaningful Inner Link Objects for Automatic Text Categorization

Jau-Ji Shen, Jia-Chiuan Wu

2009 Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing > 266 - 269

2009 Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing. IIH-MSP 2009

This paper presents a novel approach for automatic text categorization. The mainstream of research on rule-based classifiers regards a document as a container of terms, and generates rules by using the term distribution in documents. Generally speaking, there must exist some kind of semantic relevance between terms and paragraphs in a document. We call it Meaningful Inner Link Objects (MILO), which...

chapter

CTSC: Core-Tag Oriented Spectral Clustering Algorithm on Web2.0 Tags

Yexi Jiang, Changjie Tang, Kaikuo Xu, Yu Chen, more

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery > 1 > 460 - 464

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2009)

With the rapid development of Web2.0 communities, many researchers from the fields of data mining and information retrieval have been attracted by the concept of folksonomy. Finding the semantic correlation of tags is a pressing requirement for Web2.0 applications. However, no existing algorithm tackles this task very well. This paper proposes a core-tag oriented clustering method to handle the task...

chapter

Pagerank algorithm improvement by page relevance measurement

Chia-Chen Yen, Jih-Shih Hsu

2009 IEEE International Conference on Fuzzy Systems > 502 - 506

2009 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)

The Pagerank algorithm evaluates the importance of web pages by link analysis, and in recent years many techniques have been proposed to improve the traditional Pagerank algorithm against the biases of link spamming. The modified algorithms should consider not only correctness but also efficiency. This paper proposes an associated pagerank algorithm for search engines to...

chapter

Research and Design of Internet Public Opinion Analysis System

Quanlong Guan, Saizhi Ye, Guoxiang Yao, Huanming Zhang, more

2009 IITA International Conference on Services Science, Management and Engineering > 173 - 177

2009 IITA International Conference on Services Science, Management and Engineering (SSME)

The Internet is becoming a platform for spreading public opinion. It is important to grasp Internet public opinion in time and to understand its trends correctly. Text classification plays a fundamental role in a number of information management and retrieval tasks. But Web-page classification is much more difficult than pure-text classification due to a large variety of noisy information...

chapter

A heuristic web information retrieval method

Yi Yang, Yan-ni Peng, Can Tang

2009 4th International Conference on Computer Science & Education > 1292 - 1296

2009 4th International Conference on Computer Science & Education (ICCSE 2009)

In view of the poor information retrieval performance caused by numerous disordered and semi-structured web information, a heuristic web information retrieval method which combines query intention classification research and topic-specific retrieval is proposed. In this method, a web retrieval model based on a scheme of one pretreatment and two retrievals is presented, and it discusses its designing...

chapter

Effective Keyword Search for Precise Return Information over XML Documents

Ying Lou, Zhanhuai Li, Meng Han, Juan Xu

2009 WRI Global Congress on Intelligent Systems > 4 > 567 - 571

2009 WRI Global Congress on Intelligent Systems (GCIS)

Meaningful and useful return information is extraordinarily important for information retrieval and XML keyword search. In this work, based on an analysis of the structure of XML documents, we propose an algorithm to classify returned matched nodes, and we present a formal analysis of LCA (lowest common ancestor) node ranking and LCA subtree refining to obtain precise return information. Experimental studies show...

chapter

Comparison Probabilistic Latent Semantic Indexing Model In Chinese Information Retrieval

Xie Fang, Liu Xiaoguang, Hu Quan

2009 International Forum on Information Technology and Applications > 3 > 559 - 562

2009 International Forum on Information Technology and Applications (IFITA)

With the growth of information on the Internet, Web mining has become the focus of information retrieval. By a certain metric of similarity, Web clustering groups similar Web documents. But the classical clustering algorithms are aimless in searching the solution space and lack semantic features. In this paper, the probabilistic latent semantic indexing (PLSI) models which use word segmentation,...

Keywords:
ALGORITHM DESIGN AND ANALYSIS
CLASSIFICATION ALGORITHMS
INFORMATION RETRIEVAL


INFONA - science communication portal
