Search results

Items from 1 to 11 out of 11 results

chapter

Document classification efficiency of phrase-based techniques

N. Kapalavayi, S.N.J. Murthy, Gongzhu Hu

2009 IEEE/ACS International Conference on Computer Systems and Applications > 174 - 178

2009 7th IEEE/ACS International Conference on Computer Systems and Applications (AICCSA-2009)

Due to the exponential growth of available text documents in digital form, it is of great importance to develop techniques for automatic document classification based on the textual contents. Earlier document classification techniques have used keyword-based features and related statistics to achieve good results when

chapter

An Examination of the Effectiveness of Social Tagging for Resource Discovery

D.H.-L. Goh, Chei Sian Lee, A.Y.K. Chua, K. Razikin

2008 International Workshop on Information-Explosion and Next Generation Search > 23 - 30

2008 International Workshop on Information-Explosion and Next Generation Search (INGS)

Social tagging allows users to assign keywords (tags) to resources facilitating their future access by the tag creator, and possibly by other users. In terms of its support for resource discovery, social tagging has both proponents and critics. The goal of this paper investigates if tags are an effective means for

chapter

Classification and clustering for neuroinformatics: Assessing the efficacy on reverse-mapped NeuroNLP data using standard ML techniques

Nidheesh Melethadathil, Priya Chellaiah, Bipin Nair, Shyam Diwakar

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1065 - 1070

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

quality of text-mined data while efficacy relied on the context of the choice of techniques. Although developments of automated keyword extraction methods have made differences in the quality of data selection, the efficacy of the Natural Language Processing (NLP) methods using verified keywords remain a challenge. In this

chapter

A framework for measuring similarity between terms in Short Text Categorization

Nandini V., Janani Chitra R., P. Uma Maheswari

2016 Online International Conference on Green Engineering and Technologies (IC-GET) > 1 - 7

2016 Online International Conference on Green Engineering and Technologies (IC-GET)

(MWE) and they do not scale very well. This paper proposes a clustering and classification algorithm for semantic similarity using sample web pages. Further improvement is to analyze the short text for classification and labeling the short text according to the keyword and producing the result for the end user. This type

chapter

A framework for measuring similarity between Terms in Short Text Categorization

Nandini V., Janani Chitra. R, P. Uma Maheswari

2016 Online International Conference on Green Engineering and Technologies (IC-GET) > 1 - 7

2016 Online International Conference on Green Engineering and Technologies (IC-GET)

chapter

Feature Reduction for Text Categorization Using Cluster-Based Discriminant Coefficient

Li-Ju Gao, Been-Chian Chien

2012 Conference on Technologies and Applications of Artificial Intelligence > 137 - 142

2012 Conference on Technologies and Applications of Artificial Intelligence (TAAI)

Text classification is an important research topic for managing numerous electronic documents. Feature reduction is the key issue for text classification with high dimensional keywords. A document analysis method called discriminant coefficient was proposed to reduce features and achieve high precisiontext

chapter

Classifying Web Pages Using Information Extraction Patterns Preliminary Results and Findings

Lay-Ki Soon, Sang Ho Lee

2010 Sixth International Conference on Signal-Image Technology and Internet Based Systems > 195 - 202

Sixth International Conference on Signal-Image Technology & Internet-Based Systems (SITIS 2010)

Web page classification plays an essential role in facilitating more efficient information retrieval and information processing. Conventionally, web text documents are represented by term frequency matrix for classification purpose. However, considering the limitations of representing documents using terms or keywords

chapter

A novel term weighting scheme with distributional coefficient for text categorization with support vector machine

Yuan Ping, Ya-jian Zhou, Yi-xian Yang, Wei-ping Peng

2010 IEEE Youth Conference on Information, Computing and Telecommunications > 182 - 185

2010 IEEE Youth Conference on Information, Computing and Telecommunications (YC-ICT 2010)

In text categorization, vectorizing a document by probability distribution is an effective dimension reduction way to save training time. However, the data sets that share many common keywords between categories affect the classification performance seriously. To address that problem, firstly, we conduct an effective

chapter

Text categorization of Enron email corpus based on information bottleneck and maximal entropy

Man Wang, Yifan He, Minghu Jiang

IEEE 10th INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS > 2472 - 2475

2010 10th International Conference on Signal Processing (ICSP 2010)

of the classifier. Our experimental results shows that these measures can improve the classifier's performances, for keywords change too rapidly in emails while address groups are much steadier.

chapter

Hybrid text mining model for document classification

K A Vidhya, G Aghila

2010 The 2nd International Conference on Computer and Automation Engineering (ICCAE) > 1 > 210 - 214

2nd International Conference on Computer and Automation Engineering (ICCAE 2010)

likelihood in the entire training documents where the training and test data are split randomly into k-subsets like 2/3 for training and 1/3 for test data. In addition, it also utilizes two level hierarchy structures for training documents like features from title, keywords and content with the predefined knowledge available

chapter

Automatic Categorization of Image Databases Using Web Folksonomies

P. Capasso, A. Chianese, V. Moscato, A. Penta, more

2008 Tenth IEEE International Symposium on Multimedia > 685 - 690

2008 Tenth IEEE International Symposium on Multimedia

Traditional image classification techniques are based on the analysis of low-level visual features or on textual information. In this paper, we describe a novel solution which tries to improve image analysis and processing algorithms by incorporating keywords and textual annotation produced by humans in a folksonomy

Filter options

Keywords:
CLASSIFICATION
TEXT CATEGORIZATION

Publication date

Set your own date range

Keywords

TEXT ANALYSIS (6)
CLASSIFICATION ALGORITHMS (4)
DATA MINING (4)
TRAINING (4)
WEB PAGES (4)
ACCURACY (3)
ALGORITHM DESIGN AND ANALYSIS (3)
CLUSTERING (3)
FEATURE EXTRACTION (3)
FEATURE SELECTION (3)
INFORMATION RETRIEVAL (3)
INTERNET (3)
CLUSTERING ALGORITHMS (2)
CONTEXT (2)
FEATURE REDUCTION (2)
INDEXING (2)
PERSONALIZED WEB SEARCH (2)
SEARCH ENGINES (2)
SEMANTICS (2)
SHORT TEXT CATEGORIZATION (2)
SINGULAR VALUE DECOMPOSITION (2)
SINGULAR VALUE DECOMPOSITION (SVD) (2)
SUPPORT VECTOR MACHINES (2)
TERM SIMILARITY (2)
TEXT MINING (2)
ANIMALS (1)
APPROXIMATION METHODS (1)
ARTIFICIAL NEURAL NETWORKS (1)
AUTOMATIC CATEGORIZATION (1)
BAYES METHODS (1)
BAYESIAN METHODS (1)
BIOLOGY (1)
BLOGS (1)
BUILDINGS (1)
CLASSIFICATION PERFORMANCE (1)
CLASSIFIER PERFORMANCE (1)
CLUSTERING METHODS (1)
COMPUTER SCIENCE (1)
DATA SETS (1)
DATABASES (1)
DECISION TREE (1)
DECISION TREES (1)
DIMENSION REDUCTION (1)
DISCRIMINANT COEFFICIENT (1)
DISTRIBUTIONAL COEFFICIENT (1)
DOCUMENT CLASSICATION (1)
DOCUMENT CLASSIFICATION (1)
ELECTRONIC MAIL (1)
EMAIL CORPUS (1)
EMAIL TEXT (1)
ENRON EMAIL CORPUS (1)
ENTROPY (1)
ERROR ANALYSIS (1)
FEATURE CLUSTERING (1)
FILTERING ALGORITHMS (1)
FLICKR (1)
HORSES (1)
HYBRID POWER SYSTEMS (1)
HYBRID TEXT MINING MODEL (1)
IMAGE ANALYSIS (1)
IMAGE CATEGORIZATION (1)
IMAGE CLASSIFICATION (1)
IMAGE DATABASES (1)
IMAGE PROCESSING (1)
IMAGE RETRIEVAL (1)
INDEXES (1)
INFORMATION BOTTLENECK (1)
INFORMATION EXTRACTION (1)
INFORMATION EXTRACTION PATTERNS (1)
INFORMATION GAIN (1)
INFORMATION PROCESSING (1)
INFORMATION REPRESENTATION (1)
KERNEL (1)
KEY WORD CLUSTERING (1)
KEYWORD ASSIGNMENT (1)
KEYWORD BASED FEATURE (1)
KEYWORD-BASED AND PHRASE-BASED FEATURES (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
MACHINE LEARNING (1)
MACHINE LEARNING ALGORITHMS (1)
MATRIX ALGEBRA (1)
MATRIX CONVERTERS (1)
MAXIMAL ENTROPY (1)
MULTIMEDIA INFORMATION RETRIEVAL (1)
MUSIC (1)
NAïVE BAYES (1)
NAIVE BAYES (1)
NAVIGATION (1)
NEUROINFORMATICS (1)
NEURONLP (1)
ORGANIZING (1)
PATTERN CLUSTERING (1)
PHRASE BASED TECHNIQUE (1)
PROBABILISTIC VALUE (1)
PROBABILITY (1)
PROBABILITY DISTRIBUTION (1)
RELEVANCE FREQUENCY (1)
RESOURCE DISCOVERY (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options