Search results

Items from 1 to 20 out of 198 results

chapter

Enhanced intelligent text categorization using concise keyword analysis

Amir Mohammad Shahi, Biju Issac, Jashua Rajesh Modapothala

2012 International Conference on Innovation Management and Technology Research > 574 - 579

2012 International Conference on Innovation Management and Technology Research (ICIMTR)

Supervised learning is a popular approach to text classification among the research community as well as within software development industry. It enables intelligent systems to solve various text analysis problems such as document organization, spam detection and report scoring. However, the extremely difficult and time intensive process of creating a training corpus makes it inapplicable to many...

chapter

Classification of email using BeaKS: Behavior and keyword stemming

Veena H Bhat, Vandana R Malkani, P Deepa Shenoy, K R Venugopal, more

TENCON 2011 - 2011 IEEE Region 10 Conference > 1139 - 1143

TENCON 2011 - 2011 IEEE Region 10 Conference

Spam mails are one of the greatest challenges faced by internet service providers, organizations and internet users in unison. Spam mails may be targeted, with a malicious intent or just as a commercial marketing activity - on the whole unwanted by everyone except the dispatcher. Spam filters continuously evolve as spammers go techno-savvy and creative. Machine learning algorithms have been popularly...

chapter

A small vocabulary automatic filipino speech profanity suppression system using hybrid Hidden Markov Model/Artificial Neural Network (HMM/ANN) keyword spotting framework

Fernando I. Ablaza, Timothy Oliver D. Danganan, Bryan Paul L. Javier, Kevin S. Manalang, more

2014 International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM) > 1 - 5

2014 International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM)

Markov Model/ Artificial Neural Network (HMM/ANN) keyword spotting framework. The feature extraction method used was Mel-Frequency Cepstral Coefficients (MFCC). The ANN is a 3-layer feedforward neural network using Multi-Layer Perceptron (MLP). In recognizing the words, an HMM decoder was used which implemented the Viterbi

chapter

Keyword Based Semantic Search for Mobile Data

Jihoon Ko, Sangjin Shin, Sungkwang Eom, Minjae Song, more

2014 IEEE 15th International Conference on Mobile Data Management > 1 > 245 - 248

2014 15th IEEE International Conference on Mobile Data Management (MDM)

Most of the mobile platforms provide a keyword based full text search (FTS) for users to find what they want. However, FTS has difficulties in dealing with the cases where a user cannot remember the exact keywords about target data or the number of search results is too many. To overcome these limitations of FTS, we

chapter

A Lattice-Based Method for Keyword Spotting in Online Chinese Handwriting

Heng Zhang, Cheng-Lin Liu

2011 International Conference on Document Analysis and Recognition > 1064 - 1068

2011 International Conference on Document Analysis and Recognition (ICDAR)

This paper proposes a lattice-based method for keyword spotting in online Chinese handwriting to improve the trade-off between accuracy and speed, and to overcome the out-of-vocabulary (OOV) problem of lexicon-driven approach. Using a character string recognition algorithm, the lattice-based method generates a

chapter

Keyword Spotting from Online Chinese Handwritten Documents Using One-vs-All Trained Character Classifier

Heng Zhang, Da-Han Wang, Cheng-Lin Liu

2010 12th International Conference on Frontiers in Handwriting Recognition > 271 - 276

2010 12th International Conference on Frontiers in Handwriting Recognition (ICFHR 2010)

This paper presents a text query-based method for keyword spotting from online Chinese handwritten documents. The similarity between a text word and handwriting is obtained by combining the character similiarity scores given by a character classifier. To overcome the ambiguity of character segmentation, multiple

chapter

A corpus-based approach for keyword identification using supervised learning techniques

J. TeCho, C. Nattee, T. Theeramunkong

2008 5th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology > 1 > 33 - 36

2008 5th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)

This paper presents a corpus-based approach for extracting keywords from a text written in a language that has no word boundary. Based on the concept of Thai character cluster, a Thai running text is preliminarily segmented into a sequence of inseparable units, called TCCs. To enable the handling of a large-scaled

chapter

Keyword Search over Dynamic Categorized Information

M. Bhide, V.T. Chakaravarthy, K. Ramamritham, P. Roy

2009 IEEE 25th International Conference on Data Engineering > 258 - 269

2009 IEEE 25th International Conference on Data Engineering. ICDE 2009

Consider an information repository whose content is categorized. A data item (in the repository) can belong to multiple categories and new data is continuously added to the system. In this paper, we describe a system, CS*, which takes a keyword query and returns the relevant top-K categories. In contrast, traditional

chapter

Open-vocabulary keyword detection from super-large scale speech database

N. Kanda, H. Sagawa, T. Sumiyoshi, Y. Obuchi

2008 IEEE 10th Workshop on Multimedia Signal Processing > 939 - 944

2008 IEEE 10th Workshop on Multimedia Signal Processing (MMSP)

This paper presents our recent attempt to make a super-large scale spoken-term detection system, which can detect any keyword uttered in a 2,000-hour speech database within a few seconds. There are three problems to achieve such a system. The system must be able to detect out-of-vocabulary (OOV) terms (OOV problem

chapter

Automated classification of radiology reports for acute lung injury: Comparison of keyword and machine learning based natural language processing approaches

I. Solti, C.R. Cooke, Fei Xia, M.M. Wurfel

2009 IEEE International Conference on Bioinformatics and Biomedicine Workshop > 314 - 319

2009 IEEE International Conference on Bioinformatics and Biomedicine Workshop, BIBMW

This paper compares the performance of keyword and machine learning-based chest x-ray report classification for Acute Lung Injury (ALI). ALI mortality is approximately 30 percent. High mortality is, in part, a consequence of delayed manual chest x-ray classification. An automated system could reduce the time to

chapter

Keyword Extraction Algorithm Based on

She-Xue Bing, Fu-Lei

2013 International Conference on Computational and Information Sciences > 664 - 665

2013 Fifth International Conference on Computational and Information Sciences (ICCIS)

The search engine, keyword extraction is an important technique. In this paper, aiming at the defects of the traditional keyword extraction algorithm, we proposed an improved weight computation strategy. The experimental results show that, the improved method's results are significantly better results than the

chapter

Reducing redundancy in XML Keyword Search by indirect-SLCA

Gao Dandan, Wang Xinjun, Zhang Lihua

2008 IEEE International Symposium on IT in Medicine and Education > 174 - 177

2008 IEEE International Symposium on IT in Medicine and Education (ITME)

In this paper, we study the problem of the data redundancy in XML Keyword Search by SLCA and propose a new mode to resolve it. We begin by introducing the notion of SLCA and analyzing its faults. Then we propose the concept of Indirect-SLCA (ISLCA) to reduce the redundancy basing on the notion of Heterogeneous node

chapter

Implementing spam detection using Bayesian and Porter Stemmer keyword stripping approaches

B. Issac, W.J. Jap

TENCON 2009 - 2009 IEEE Region 10 Conference > 1 - 5

TENCON 2009. 2009 IEEE Region 10 Conference

developed by implementing the keyword stripping using the Porter Stemmer algorithm. This could make the keyword search more efficient, as the root or stem word is only considered. Experimental results on two public spam corpuses are also discussed at the end.

chapter

An unsupervised language model adaptation based on keyword clustering and query availability estimation

A. Ito, Y. Kajiura, S. Makino, M. Suzuki

2008 International Conference on Audio, Language and Image Processing > 1412 - 1418

2008 International Conference on Audio, Language and Image Processing

Language model adaptation using text data downloaded from the WWW is an efficient way to train a topic-specific LM. We are developing an unsupervised LM adaptation method using data in the Web. The one key point of unsupervised Web-based LM adaptation is how to select keywords to compose the search query. In this

chapter

Chinese Keyword Extraction Using Semantically Weighted Network

Qian Chen, Zengru Jiang, Jinqiang Bian

2014 Sixth International Conference on Intelligent Human-Machine Systems and Cybernetics > 2 > 83 - 86

2014 6th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC)

The complex network theory is widely used in the field of keyword extraction. Through analyzing the insufficient of keyword extraction algorithms using traditional complex network, this paper proposes a new method to extract Chinese keyword based on semantically weighted network. On the basis of K-nearest neighbor

chapter

Hot keyword identification for extracting web public opinion

Zhiqi Fang, Yue Ning, Tingshao Zhu

5th International Conference on Pervasive Computing and Applications > 116 - 121

2010 5th International Conference on Pervasive Computing and Applications (ICPCA 2010)

Internet is becoming an increasingly important platform for ordinary life and work. It is expected that keyword extraction can help people quickly find hot spots on the web, since keywords in a document provide important information about the content of the document. In this paper, we propose to use text clustering

chapter

Keyword extraction of web pages based on domain thesaurus

Guowan He, Jie Wang, Yafeng Zhang, Yan Peng

2014 IEEE 3rd International Conference on Cloud Computing and Intelligence Systems > 310 - 314

2014 IEEE 3rd International Conference on Cloud Computing and Intelligence Systems (CCIS)

This paper presents a keyword extraction method of web pages based on domain thesaurus. The method extracts keywords from web pages based on traditional statistic features, such as frequency and location, and it also evaluates the weight of candidate keywords combining with their relation of domain thesaurus. This

chapter

Keyword spotting system for Tamil isolated words using Multidimensional MFCC and DTW algorithm

Senthildevi K. A, Chandra E

2015 International Conference on Communications and Signal Processing (ICCSP) > 550 - 554

2015 International Conference on Communications and Signal Processing (ICCSP)

Audio mining is a speaker independent speech processing technique and is related to data mining. Keyword spotting plays an important role in audio mining. Keyword spotting is retrieval of all instances of a given keyword in spoken utterances. It is well suited to data mining tasks that process large amount of speech

chapter

Research of Duplicate Record Cleaning Technology Based on a Reformative Keywords Matching Algorithm

Yan Hu, Wei Li, Ying Qiu, Wei Wu

2009 International Conference on E-Business and Information System Security > 1 - 5

2009 International Conference on E-Business and Information System Security (EBISS)

Based on the analysis of the insufficiencies of the present Chinese matching algorithms, by examining the characteristics of approximately duplicate records, this paper proposes a method of duplicate record cleaning based on a reformative keywords matching algorithm. Experiments show that this method improves Recall

chapter

Region-Based Semi-automatic Annotation Using the Bag of Words Representation of the Keywords

N. Van Nguyen, A. Boucher, J.-M. Ogier, S. Tabbone

2009 Fifth International Conference on Image and Graphics > 422 - 427

Fifth International Conference on Image and Graphics (ICIG 2009)

approach has a limit as only the annotations of found images during the interaction are updated. In this paper we introduce a novel method of semi-automatic annotation. The method is using visual feature representations of keywords which are improved during the region-based relevance feedback. The experiments show that this

Keywords:
ACCURACY

Publication date

Set your own date range

Content availability

Available (195)
None (3)

Publication type

book (181)
article (17)

Publication language

English (197)
German (1)

Keywords

DATA MINING (56)
FEATURE EXTRACTION (46)
TRAINING (38)
CLASSIFICATION ALGORITHMS (33)
INFORMATION RETRIEVAL (27)
INTERNET (26)
SEMANTICS (25)
SUPPORT VECTOR MACHINES (25)
TEXT ANALYSIS (25)
WEB PAGES (21)
SEARCH ENGINES (20)
MACHINE LEARNING (18)
ALGORITHM DESIGN AND ANALYSIS (17)
DATABASES (17)
VISUALIZATION (16)
ONTOLOGIES (13)
PATTERN CLASSIFICATION (13)
SPEECH (13)
CLASSIFICATION (12)
CLUSTERING ALGORITHMS (12)
CONTEXT (12)
NATURAL LANGUAGE PROCESSING (12)
HIDDEN MARKOV MODELS (11)
IMAGE RETRIEVAL (11)
KEYWORD EXTRACTION (11)
LEARNING (ARTIFICIAL INTELLIGENCE) (11)
IMAGE COLOR ANALYSIS (10)
PROBABILITY DENSITY FUNCTION (10)
VECTORS (10)
ARTIFICIAL NEURAL NETWORKS (9)
COMPUTATIONAL MODELING (9)
DICTIONARIES (9)
INDEXING (9)
QUERY PROCESSING (9)
SPEECH RECOGNITION (9)
TAGGING (9)
TEXT CATEGORIZATION (9)
WEB SITES (9)
DOCUMENT HANDLING (8)
MATHEMATICAL MODEL (8)
SENTIMENT ANALYSIS (8)
VOCABULARY (8)
CORRELATION (7)
EQUATIONS (7)
INDEXES (7)
KEYWORD SEARCH (7)
SUPPORT VECTOR MACHINE (7)
TWITTER (7)
CONTENT-BASED RETRIEVAL (6)
FILTERING (6)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (6)
PATTERN CLUSTERING (6)
TEXT MINING (6)
BAYESIAN METHODS (5)
BLOGS (5)
CHARACTER RECOGNITION (5)
COMPUTERS (5)
DECISION TREES (5)
EIGENVALUES AND EIGENFUNCTIONS (5)
ELECTRONIC MAIL (5)
GOOGLE (5)
HISTORY (5)
HTML (5)
LATTICES (5)
MACHINE LEARNING ALGORITHMS (5)
OPTIMIZATION (5)
PATTERN MATCHING (5)
QUERY FORMULATION (5)
SEARCH PROBLEMS (5)
SERVERS (5)
SOFTWARE (5)
SPEECH PROCESSING (5)
STATISTICAL ANALYSIS (5)
SVM (5)
UNSOLICITED ELECTRONIC MAIL (5)
XML (5)
ADAPTATION MODEL (4)
BAYES METHODS (4)
CLUSTERING (4)
COMPUTER ARCHITECTURE (4)
EDUCATION (4)
EMOTION RECOGNITION (4)
FEATURE SELECTION (4)
GLOBAL POSITIONING SYSTEM (4)
HUMANS (4)
IMAGE CLASSIFICATION (4)
IMAGE SEGMENTATION (4)
INFORMATION FILTERING (4)
KERNEL (4)
KEYWORDS (4)
LIBRARIES (4)
MEDIA (4)
MULTIMEDIA COMMUNICATION (4)
NAIVE BAYES (4)
PREDICTIVE MODELS (4)
RECOMMENDER SYSTEMS (4)
SOCIAL NETWORK SERVICES (4)
WEB SEARCH (4)
WORD PROCESSING (4)
more

Data set

ieee (187)
Springer (4)
Elsevier (3)
Wiley (3)
PSJD (1)

INFONA - science communication portal

Search results

Enhanced intelligent text categorization using concise keyword analysis

Classification of email using BeaKS: Behavior and keyword stemming

A small vocabulary automatic filipino speech profanity suppression system using hybrid Hidden Markov Model/Artificial Neural Network (HMM/ANN) keyword spotting framework

Keyword Based Semantic Search for Mobile Data

A Lattice-Based Method for Keyword Spotting in Online Chinese Handwriting

Keyword Spotting from Online Chinese Handwritten Documents Using One-vs-All Trained Character Classifier

A corpus-based approach for keyword identification using supervised learning techniques

Keyword Search over Dynamic Categorized Information

Open-vocabulary keyword detection from super-large scale speech database

Automated classification of radiology reports for acute lung injury: Comparison of keyword and machine learning based natural language processing approaches

Keyword Extraction Algorithm Based on

Reducing redundancy in XML Keyword Search by indirect-SLCA

Implementing spam detection using Bayesian and Porter Stemmer keyword stripping approaches

An unsupervised language model adaptation based on keyword clustering and query availability estimation

Chinese Keyword Extraction Using Semantically Weighted Network

Hot keyword identification for extracting web public opinion

Keyword extraction of web pages based on domain thesaurus

Keyword spotting system for Tamil isolated words using Multidimensional MFCC and DTW algorithm

Research of Duplicate Record Cleaning Technology Based on a Reformative Keywords Matching Algorithm

Region-Based Semi-automatic Annotation Using the Bag of Words Representation of the Keywords

Filter options

Publication date

Content availability

Publication type

Publication language

Keywords

Data set

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Publication language

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options