Advanced search

Advanced search in people

From:

To:

Items from 1 to 20 out of 148 results

article

Protein interaction network constructing based on text mining and reinforcement learning with application to prostate cancer

Fei Zhu, Quan Liu, Xiaofang Zhang, Bairong Shen

IET Systems Biology > 2015 > 9 > 4 > 106 - 112

Constructing interaction network from biomedical texts is a very important and interesting work. The authors take advantage of text mining and reinforcement learning approaches to establish protein interaction network. Considering the high computational efficiency of co-occurrence-based interaction extraction approaches and high precision of linguistic patterns approaches, the authors propose an interaction...

chapter

Machine learning based biomedical named entity recognition

N. Kanya, T. Ravi

IET Chennai Fourth International Conference on Sustainable Energy and Intelligent Systems (SEISCON 2013) > 380 - 384

IET Chennai Fourth International Conference on Sustainable Energy and Intelligent Systems (SEISCON 2013)

The biomedical society makes wide use of text mining technology. Named Entity (NE) extraction is one of the most primary and significant tasks in biomedical information extraction of text mining technology. Named Entity Recognition (NER) involves processing structured and unstructured documents to recognize the definite kinds of entities and categorization of them into some predefined classes. Several...

chapter

Sentiment classification in online reviews using FRN algorithm

I. Hemalatha, G. P. Saradhi Varma, A. Govardhan

IET Chennai Fourth International Conference on Sustainable Energy and Intelligent Systems (SEISCON 2013) > 357 - 362

IET Chennai Fourth International Conference on Sustainable Energy and Intelligent Systems (SEISCON 2013)

The internet is rich in directional text (i.e., text containing opinions and emotions). World Wide Web provides volumes of text-based data about consumer preferences, stored in online review websites, web forums, blogs, etc. Sentiment analysis is a technique to classify people's opinions in product reviews, blogs or social networks has emerged as a method for mining opinions from such text archives...

chapter

Detection of Verbatim or Partial Duplication from Multiple Source Documents Using Data Mining Techniques and Case-Based Reasoning Methodologies

C Chaudhuri, A Chaudhuri

2011 Second International Conference on Emerging Applications of Information Technology > 129 - 132

Second International Conference on Emerging Applications of Information Technology (EAIT 2011)

This paper aims to specify a Case-Based Reasoning strategy for correctly classifying, storing and preventing duplication efforts of electronic text material. Preservation of complete source documents for checking similarity between them pose a daunting amount of spatial and computational complexity to researchers in this area. The problem is partially solved by applying certain preprocessing steps...

chapter

Classification of brand names based on n-grams

P Warintarawej, A Laurent, P Pompidor, B Laurent

2010 International Conference of Soft Computing and Pattern Recognition > 12 - 17

2010 International Conference of Soft Computing and Pattern Recognition (SoCPaR 2010)

Supervised classification has been extensively addressed in the literature as it has many applications, especially for text categorization or web content mining where data are organized through a hierarchy. On the other hand, the automatic analysis of brand names can be viewed as a special case of text management, although such names are very different from classical data. They are indeed often neologisms,...

chapter

Query Expansion for UMLS Metathesaurus Disambiguation Based on Automatic Corpus Extraction

Antonio Jimeno-Yepes, A R Aronson

2010 Ninth International Conference on Machine Learning and Applications > 965 - 968

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

Word sense disambiguation (WSD) is an intermediate task within information retrieval and information extraction, which attempts selecting the proper sense of ambiguous terms. In the biomedical domain, general WSD has not received much attention compared to the disambiguation of specific categories of entities like proteins and genes or diseases. Statistical learning approaches have achieved better...

chapter

Hot keyword identification for extracting web public opinion

Zhiqi Fang, Yue Ning, Tingshao Zhu

5th International Conference on Pervasive Computing and Applications > 116 - 121

2010 5th International Conference on Pervasive Computing and Applications (ICPCA 2010)

Internet is becoming an increasingly important platform for ordinary life and work. It is expected that keyword extraction can help people quickly find hot spots on the web, since keywords in a document provide important information about the content of the document. In this paper, we propose to use text clustering method based on semi-supervised learning to get focuses of social topics in a large...

chapter

Title Page i

2010 IEEE International Conference on Data Mining > i

2010 10th IEEE International Conference on Data Mining (ICDM 2010)

The following topics are dealt with: data mining; local clustering; spatiotemporal event detection; time series; Markov models; email classification; data stream; parallel mining; Bayesian network; unsupervised learning; missing values prediction; anomaly detection; decision tree; binary classifier; data similarity matrix; data mapping; support vector machine; Mapreduce; document similarity; social...

chapter

Learning Preferences with Millions of Parameters by Enforcing Sparsity

Xi Chen, Bing Bai, Yanjun Qi, Qihang Lin, more

2010 IEEE International Conference on Data Mining > 779 - 784

2010 10th IEEE International Conference on Data Mining (ICDM 2010)

We study the retrieval task that ranks a set of objects for a given query in the pair wise preference learning framework. Recently researchers found out that raw features (e.g. words for text retrieval) and their pair wise features which describe relationships between two raw features (e.g. word synonymy or polysemy) could greatly improve the retrieval precision. However, most existing methods can...

chapter

Sentence-Level and Document-Level Sentiment Mining for Arabic Texts

N Farra, E Challita, R A Assi, H Hajj

2010 IEEE International Conference on Data Mining Workshops > 1114 - 1119

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

In this work, we investigate sentiment mining of Arabic text at both the sentence level and the document level. Existing research in Arabic sentiment mining remains very limited. For sentence-level classification, we investigate two approaches. The first is a novel grammatical approach that employs the use of a general structure for the Arabic sentence. The second approach is based on the semantic...

chapter

A refined weighted K-Nearest Neighbors algorithm for text categorization

Fang Lu, Qingyuan Bai

2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering > 326 - 330

2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering (ISKE 2010)

Text categorization is one important task of text mining, for automated classification of large numbers of documents. Many useful supervised learning methods have been introduced to the field of text classification. Among these useful methods, K-Nearest Neighbor (KNN) algorithm is a widely used method and one of the best text classifiers for its simplicity and efficiency. For text categorization,...

chapter

Extracting Parallel Texts from the Web

Le Quang Hung, Le Anh Cuong

2010 Second International Conference on Knowledge and Systems Engineering > 147 - 151

2010 Second International Conference on Knowledge and Systems Engineering (KSE)

Parallel corpus is the valuable resource for some important applications of natural language processing such as statistical machine translation, dictionary construction, cross-language information retrieval. The Web is a huge resource of knowledge, which partly contains bilingual information in various kinds of web pages. It currently attracts many studies on building parallel corpora based on the...

chapter

A mutual information and information entropy pair based feature selection method in text classification

Zhili Pei, Yuxin Zhou, Lisha Liu, Lihua Wang, more

2010 International Conference on Computer Application and System Modeling (ICCASM 2010) > 6 > V6-258 - V6-261

2010 International Conference on Computer Application and System Modeling (ICCASM 2010)

Text classification is an important research field of data mining topics. This article brings a mutual information and information entropy pair based feature selection method (MIIEP_FS) based on the theory of information entropy and information entropy pair concept. This method measure the classification effect using feature by mutual information method and show the difference extent between the features...

chapter

Web Text Categorization for Large-scale Corpus

Zhijuan Jia, Jianbo Mu

2010 International Conference on Computer Application and System Modeling (ICCASM 2010) > 8 > V8-188 - V8-191

2010 International Conference on Computer Application and System Modeling (ICCASM 2010)

Corpus is the set of language materials which are stored in computers and can use computers to search, query and analyze for enterprise decision-makers. Automated text categorization has been extensively studied and various techniques for document categorization. But based on the current scarcity of Chinese corpus, especially in the field of text categorization, the Chinese categorization corpus is...

chapter

Automatic Mining of Human Activity Attributes from Weblogs

Nguyen Minh The, Takahiro Kawamura, Hiroyuki Nakagawa, Yasuyuki Tahara, more

2010 IEEE/ACIS 9th International Conference on Computer and Information Science > 633 - 638

2010 IEEE/ACIS 9th International Conference on Computer and Information Science (ICIS 2010)

In this paper, we define an activity by five basic attributes: actor, action, object, time and location. The goal of this paper is to describe a method to automatically extract all attributes in each sentence retrieved from Japanese weblogs. Previous work had some limitations, such as high setup cost, inability of extracting all attributes, limitation on the types of sentences that can be handled,...

chapter

Using Text Analysis to Understand the Structure and Dynamics of the World Wide Web as a Multi-Relational Graph

Harish Sethu, Alexander Yates

2010 IEEE Second International Conference on Social Computing > 683 - 686

2010 IEEE Second International Conference on Social Computing (SocialCom 2010). the Second IEEE International Conference on Privacy, Security, Risk and Trust (PASSAT 2010)

A representation of the World Wide Web as a directed graph, with vertices representing web pages and edges representing hypertext links, underpins the algorithms used by web search engines today. However, this representation involves a key oversimplification of the true complexity of the Web: an edge in the traditional Web graph represents only the existence of a hyperlink; information on the context...

chapter

A new polarity clustering algorithm based on semantic criterion function for text of the Chinese commentary

Bin Xu, Yufeng Zhang

2010 3rd International Conference on Advanced Computer Theory and Engineering(ICACTE) > 4 > V4-116 - V4-119

2010 3rd International Conference on Advanced Computer Theory and Engineering (ICACTE 2010)

The mining methods for comment text polarity are usually used to adopted supervised learning algorithms, but supervised learning algorithms require significant manual labor marked the training set, and its text set in dealing with will be also faced with dimension disaster, sparse vector, high spatial and temporal complexity, low recall and precision rates that cannot be used for a flood of text polarity...

chapter

Extraction of purpose data using surface text patterns

P Kiran Mayee, Rajeev Sangal, Soma Paul

Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering(NLPKE-2010) > 1 - 7

2010 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE 2010)

This paper presents the concept of surface text patterns for extracting purpose data from the web. In order to obtain an optimal set of patterns, we have developed a method for learning purpose patterns automatically. A corpus was downloaded from the Internet using bootstrapping by providing a few hand-crafted examples of each purpose pattern to a generic search engine. This corpus was then tagged...

chapter

A supervised machine learning approach of extracting coexpression relationship among genes from literature

Richa Tiwari, Chengcui Zhang, Thamar Solorio

2010 IEEE International Conference on Information Reuse&Integration > 98 - 103

2010 IEEE International Conference on Information Reuse & Integration (IRI 2010)

It is vital to develop automatic information extraction systems to help researchers cope up with the vast amount of data available on the Internet. In this paper, we describe a framework to extract precise information about coexpression relationship among genes, from published literature using a supervised machine learning approach. We use a graphical model, Dynamic Conditional Random Fields (DCRFs),...

chapter

Document Relevance Identifying and its Effect in Query-Focused Text Summarization

Tingting He, Fang Li, Liang Ma

2010 IEEE International Conference on Granular Computing > 206 - 211

2010 IEEE International Conference on Granular Computing (GrC-2010)

There is an important issue that text summarization has to embody personal information need and provide indicative message to user. In this paper, a method of acquiring relevant documents based on user-feedback information and transductive inference SVM machine learning is presented. This method can well avoid the subjectivity of deciding relevant documents empirically. Furthermore, a sentence selection...

Keywords:
DATA MINING
LEARNING (ARTIFICIAL INTELLIGENCE)
TEXT ANALYSIS

Publication date

Set your own date range

Content availability

Available (144)
None (4)

Publication type

book (134)
article (14)

Keywords

MACHINE LEARNING (68)
FEATURE EXTRACTION (52)
TRAINING (44)
TEXT MINING (41)
PATTERN CLASSIFICATION (40)
NATURAL LANGUAGE PROCESSING (39)
SUPPORT VECTOR MACHINES (38)
CLASSIFICATION ALGORITHMS (35)
INFORMATION RETRIEVAL (32)
ACCURACY (30)
TEXT CATEGORIZATION (28)
CLASSIFICATION (24)
INTERNET (22)
TEXT CLASSIFICATION (16)
SUPPORT VECTOR MACHINE (15)
DICTIONARIES (13)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (13)
ONTOLOGIES (12)
TRAINING DATA (12)
INFORMATION EXTRACTION (11)
PATTERN CLUSTERING (11)
KERNEL (10)
LEARNING SYSTEMS (10)
WORLD WIDE WEB (10)
ALGORITHM DESIGN AND ANALYSIS (9)
MACHINE LEARNING ALGORITHMS (9)
SVM (9)
CONTEXT (8)
DATABASES (8)
HIDDEN MARKOV MODELS (8)
PROBABILITY DENSITY FUNCTION (8)
PROTEINS (8)
SUPPORT VECTOR MACHINE CLASSIFICATION (8)
WEB PAGES (8)
ARTIFICIAL NEURAL NETWORKS (7)
BAYES METHODS (7)
CLUSTERING ALGORITHMS (7)
COMPUTATIONAL MODELING (7)
OPINION MINING (7)
SEARCH ENGINES (7)
SEMANTICS (7)
SUPERVISED LEARNING (7)
ARTIFICIAL INTELLIGENCE (6)
BIOINFORMATICS (6)
BIOLOGY COMPUTING (6)
CONDITIONAL RANDOM FIELDS (6)
EQUATIONS (6)
FEATURE SELECTION (6)
LABELING (6)
NATURAL LANGUAGES (6)
ONTOLOGY LEARNING (6)
OPTIMIZATION (6)
QUERY PROCESSING (6)
SEMI-SUPERVISED LEARNING (6)
TAGGING (6)
TESTING (6)
WEB SITES (6)
BAYESIAN METHODS (5)
COMPLEXITY THEORY (5)
COMPUTATIONAL LINGUISTICS (5)
COMPUTERS (5)
CORRELATION (5)
DATA MODELS (5)
DOCUMENT CLASSIFICATION (5)
ENTROPY (5)
GRAMMARS (5)
HUMANS (5)
INDEXING (5)
KNOWLEDGE DISCOVERY (5)
MATHEMATICAL MODEL (5)
MEDICAL COMPUTING (5)
NEURAL NETS (5)
PATTERN RECOGNITION (5)
RANDOM PROCESSES (5)
SELF-ORGANISING FEATURE MAPS (5)
SENTIMENT ANALYSIS (5)
STATISTICAL ANALYSIS (5)
TERMINOLOGY (5)
TEXT SUMMARIZATION (5)
WEB MINING (5)
XML (5)
BOOK REVIEWS (4)
BOOTSTRAPPING (4)
CLUSTERING (4)
FILTERING (4)
FUZZY SET THEORY (4)
GENETIC ALGORITHMS (4)
IMAGE CLASSIFICATION (4)
IMAGE RETRIEVAL (4)
INFERENCE MECHANISMS (4)
MACHINE LEARNING TECHNIQUE (4)
MACHINE LEARNING TECHNIQUES (4)
ONTOLOGY (4)
PATTERN MATCHING (4)
SAMPLING METHODS (4)
SEMANTIC WEB (4)
SPEECH (4)
more

INFONA - science communication portal

Advanced search

Advanced search in people

Protein interaction network constructing based on text mining and reinforcement learning with application to prostate cancer

Machine learning based biomedical named entity recognition

Sentiment classification in online reviews using FRN algorithm

Detection of Verbatim or Partial Duplication from Multiple Source Documents Using Data Mining Techniques and Case-Based Reasoning Methodologies

Classification of brand names based on n-grams

Query Expansion for UMLS Metathesaurus Disambiguation Based on Automatic Corpus Extraction

Hot keyword identification for extracting web public opinion

Title Page i

Learning Preferences with Millions of Parameters by Enforcing Sparsity

Sentence-Level and Document-Level Sentiment Mining for Arabic Texts

A refined weighted K-Nearest Neighbors algorithm for text categorization

Extracting Parallel Texts from the Web

A mutual information and information entropy pair based feature selection method in text classification

Web Text Categorization for Large-scale Corpus

Automatic Mining of Human Activity Attributes from Weblogs

Using Text Analysis to Understand the Structure and Dynamics of the World Wide Web as a Multi-Relational Graph

A new polarity clustering algorithm based on semantic criterion function for text of the Chinese commentary

Extraction of purpose data using surface text patterns

A supervised machine learning approach of extracting coexpression relationship among genes from literature

Document Relevance Identifying and its Effect in Query-Focused Text Summarization

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Advanced search

Advanced search in people

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options