Search results

Items from 1 to 15 out of 15 results

chapter

Alternatives for Page Skew Compensation in Writer Identification

Jin Chen, Daniel Lopresti

2013 12th International Conference on Document Analysis and Recognition > 927 - 931

2013 12th International Conference on Document Analysis and Recognition (ICDAR)

Traditionally, page images undergo pre-processing before the later stages of document analysis are applied. One common pre-processing step is to calculate and correct for the presence of simple page skew through a compensating rotation. Such operations modify the original input image, however, and in doing so may discard or obscure useful information. In this paper, we examine the impact of page deskewing...

chapter

Chinese Chunk Recognition Using HMSVM Method

Wang Zhong-Hua, Qi Hui

2010 International Conference on Artificial Intelligence and Computational Intelligence > 1 > 3 - 7

2010 International Conference on Artificial Intelligence and Computational Intelligence (AICI 2010)

Hidden Markov Support Vector Machines is a novel structural SVMs model. Its efficiency has been proved in label sequence learning task such as English text chunking. In this paper, we treat Chinese chunk recognition as a label sequence learning problem. After giving the definition of Chinese chunk, we apply HMSVM to solve Chinese chunk problem. The results of experiment show that it achieves a better...

chapter

Holistic Urdu Handwritten Word Recognition Using Support Vector Machine

Malik Waqas Sagheer, Chun Lei He, Nicola Nobile, Ching Y Suen

2010 20th International Conference on Pattern Recognition > 1900 - 1903

2010 20th International Conference on Pattern Recognition (ICPR 2010)

Since the Urdu language has more isolated letters than Arabic and Farsi, a research on Urdu handwritten word is desired. This is a novel approach to use the compound features and a Support Vector Machine (SVM) in offline Urdu word recognition. Due to the cursive style in Urdu, a classification using a holistic approach is adapted efficiently. Compound feature sets, which involves in structural and...

chapter

A new text categorization method based on HMM and SVM

Chen Donghui, Liu zhijing

2010 2nd International Conference on Computer Engineering and Technology > 7 > V7-383 - V7-386

2010 2nd International Conference on Computer Engineering and Technology (ICCET)

This paper has put forward a new method to improve the performance of text categorization. The new method combines HMM (Hidden Markov Model) and SVM (Support Vector Machines). HMMs are used to as a feature extractor and then a new feature vector is normalized as the input of SVMs, so the trained SVMs can classify unknown texts successfully. The experimental results prove that the method is more effective...

chapter

The sensitive feature selection for both English and Chinese text chunking

Liang Ying-Hong, Li Jin-xiang, Zhou De-fu, Wang De-peng

2010 The 2nd International Conference on Computer and Automation Engineering (ICCAE) > 4 > 305 - 309

2nd International Conference on Computer and Automation Engineering (ICCAE 2010)

Traditional text chunking approach is to identify many phrases using only one model, and the same features are used to identify these phrases too. So the helpful features of each phrase are ignored. In fact, different phrases have different helpful features. In this paper, the concept of ??sensitive feature?? is proposed, and the sensitive features of eleven English types and seven Chinese types of...

chapter

Tagger voting improves morphosyntactic tagging accuracy on Croatian texts

Z Agić, M Tadić, Zdravko Dovedan

Proceedings of the ITI 2010, 32nd International Conference on Information Technology Interfaces > 61 - 66

2010 32nd International Conference on Information Technology Interfaces (ITI 2010)

We present results of an experiment dealing with combining outputs of five part-of-speech taggers via tagger voting in order to improve the overall accuracy of morphosyntactic tagging of Croatian texts using a subset of the Multext-East v3 tagset. The increase in accuracy over the best-performing single tagger is shown to exist, but not to be statistically significant. We discuss the performance of...

chapter

Extraction of coexpression relationship among genes from biomedical text using dynamic conditional random fields

R. Tiwari, Chengcui Zhang, Wei-Bang Chen

2009 22nd IEEE International Symposium on Computer-Based Medical Systems > 1 - 4

2009 22nd IEEE International Symposium on Computer-Based Medical Systems (CBMS)

Text mining tools and algorithms are being successfully used for information extraction especially on large corpus like biomedical publications. These tools not only aid in information extraction but also in forming new theories and relationships between various fields of biomedical research. Extraction of gene-gene or gene-disease relationship is one such application. In this paper, we introduce...

chapter

Stochastic Segment Modeling for Offline Handwriting Recognition

P. Natarajan, K. Subramanian, A. Bhardwaj, R. Prasad

2009 10th International Conference on Document Analysis and Recognition > 971 - 975

2009 10th International Conference on Document Analysis and Recognition (ICDAR)

In this paper, we present a novel approach for incorporating structural information into the hidden Markov modeling (HMM) framework for offline handwriting recognition. Traditionally, structural features have been used in recognition approaches that rely on accurate segmentation of words into smaller units (sub-words or characters). However, such segmentation based approaches do not perform well on...

chapter

Self-Teaching Semantic Annotation Method for Knowledge Discovery from Text

Kaiquan Xu, S.S. Liao, R.Y.K. Lau, Lejian Liao, more

2009 42nd Hawaii International Conference on System Sciences > 1 - 7

2009 42nd Hawaii International Conference on System Sciences. HICSS-42

As much valuable domain knowledge is hidden in enterprises' text repositories (e.g., email archives, digital libraries, etc.), it is desirable to develop effective knowledge management tools to process this unstructured data so as to extract domain knowledge for business decision making. Ontology-based semantic annotation of documents is one of the promising ways for knowledge discovery from text...

chapter

Online Writer-Independent Character Recognition Using a Novel Relational Context Representation

S. Izadi, C.Y. Suen

2008 Seventh International Conference on Machine Learning and Applications > 867 - 870

2008 Seventh International Conference on Machine Learning and Applications

Transforming handwriting into digital text and recognition of handwritten patterns opens a vast scope of application opportunities from searching for handwritten notes and document management to causing actions by writing symbols. Despite receiving a great attention, a massive number of applications, and a huge research effort, recognition of handwritten text has not still reached a desired efficiency...

chapter

Estimating the readability of handwritten text - a Support Vector Regression based approach

A. Schlapbach, F. Wettstein, H. Bunke

2008 19th International Conference on Pattern Recognition > 1 - 4

ICPR 2008 19th International Conference on Pattern Recognition

This paper presents a new approach to estimating the readability of handwritten text. The estimation task is posed as a regression problem. A novel support vector regression (SVR) system is used to estimate the recognition rate of a text recognizer on a given text. The estimated recognition rates are used to classify text as either readable or unreadable. Unreadable text can then be filtered out prior...

chapter

Hidden Markov Models and Text Classifiers for Information Extraction on Semi-Structured Texts

F.A. Barros, E.F.A. Silva, R.B.C. Prudencio, V.M. Filho, more

2008 Eighth International Conference on Hybrid Intelligent Systems > 417 - 422

2008 8th International Conference on Hybrid Intelligent Systems (HIS)

Information extraction (IE) aims to extract from textual documents only the fragments which correspond to datafields required by the user. In this paper, we present new experiments evaluating a hybrid machine learning approach for IE that combines text classifiers and hidden Markov models (HMM). In this approach, a text classifier technique generates an initial output, which is refined by an HMM,...

chapter

A New Fuzzy Support Vector Machine Method for Named Entity Recognition

A. Mansouri, L.S. Affendy, A. Mamat

2008 International Conference on Computer Science and Information Technology > 24 - 28

2008 International Conference on Computer Science and Information Technology

Recognizing and extracting exact name entities, like Persons, Locations, Organizations, Dates and Times are very useful to mining information from electronics resources and text. Learning to extract these types of data is called Named Entity Recognition (NER) task. Proper named entity recognition and extraction is important to solve most problems in hot research area such as Question Answering and...

chapter

A new text classification method based on HMM-SVM

Jing Wang, Yong Yao, Zhi Jing Liu

2007 International Symposium on Communications and Information Technologies > 1516 - 1519

2007 International Symposium on Communications and Information Technologies

Text classification has been considered as a hot research area in data mining. This paper presents a new approach combining hidden Markov model (HMM) with support vector machine (SVM) for text classification. HMMs are used to as a feature extractor and then a new feature vector is normalized as the input of SVMs, so the trained SVMs can classify unknown texts successfully. The experimental results...

chapter

Similarity Computation of Chinese Question Based on Chunk

Zheng-Tao Yu, Lei Hu, Li Huang, Jing-Hui Deng, more

2006 International Conference on Machine Learning and Cybernetics > 17 - 22

Proceedings of 2006 International Conference on Machine Learning and Cybernetics

The currently similarity computation methods of Chinese sentence and their shortcomings are analyzed at first. According to the characteristic of the Chinese question sentence, Chinese question general chunk and special chunk are defined, and then a similarity computation method of Chinese question based on chunk is proposed. In this method, the semantic similarity of words is computed on the basis...

Filter options

Keywords:
SUPPORT VECTOR MACHINES
TEXT ANALYSIS
HIDDEN MARKOV MODELS

Publication date

Set your own date range

Keywords

FEATURE EXTRACTION (10)
TRAINING (9)
DATA MINING (5)
INFORMATION RETRIEVAL (5)
CLASSIFICATION ALGORITHMS (4)
HANDWRITING RECOGNITION (4)
HANDWRITTEN CHARACTER RECOGNITION (4)
NATURAL LANGUAGE PROCESSING (4)
PATTERN CLASSIFICATION (4)
LEARNING (ARTIFICIAL INTELLIGENCE) (3)
SUPPORT VECTOR MACHINE (3)
COMPUTATIONAL LINGUISTICS (2)
FEATURE SELECTION (2)
GRAMMARS (2)
HIDDEN MARKOV MODEL (2)
INFORMATION EXTRACTION (2)
LABELING (2)
MACHINE LEARNING (2)
SPEECH (2)
STOCHASTIC PROCESSES (2)
TEXT CATEGORIZATION (2)
TEXT CLASSIFICATION (2)
TEXT RECOGNITION (2)
ACCURACY (1)
ARABIC CHARACTERS (1)
ARTIFICIAL NEURAL NETWORKS (1)
ASSEMBLY (1)
BIBLIOGRAPHIC REFERENCES (1)
BIBLIOGRAPHIC SYSTEMS (1)
BIOMEDICAL TEXT MINING (1)
BOUNDARY RECOGNITION (1)
BUSINESS DATA PROCESSING (1)
BUSINESS DECISION MAKING (1)
CENPARMI URDU WORDS DATABASE (1)
CHARACTER RECOGNITION (1)
CHINESE CHUNK (1)
CHINESE CHUNK RECOGNITION (1)
CHINESE QUESTION SENTENCE (1)
CHINESE TEXT (1)
CHUNK PARSING THEORY (1)
CHUNK SIMILARITY (1)
CITIES AND TOWNS (1)
CLASSIFICATION (1)
COEXPRESSION RELATIONSHIP INFORMATION EXTRACTION (1)
COMPOUND FEATURE SETS (1)
CONNECTED COMPONENT (1)
CROATIAN LANGUAGE (1)
CROATIAN TEXT (1)
CURRENT FEATURE DESCRIPTORS (1)
CYBERNETICS (1)
DATA MODELS (1)
DECISION MAKING (1)
DIGITAL TEXT (1)
DOCUMENT MANAGEMENT (1)
DYNAMIC CONDITIONAL RANDOM FIELD (1)
ENGLISH TEXT (1)
ENGLISH TEXT CHUNKING (1)
ESTIMATION (1)
FEATURE EXTRACTOR (1)
FEATURE VECTOR (1)
FINANCE (1)
FUZZY MEMBERSHIP FUNCTION (1)
FUZZY SET THEORY (1)
FUZZY SUPPORT VECTOR MACHINE (1)
GENE-DISEASE RELATIONSHIP (1)
GENERAL CHUNK (1)
GENETICS (1)
GRAMMATICAL DEPENDENCY (1)
GRAPHICAL MODELS (1)
HANDWRITTEN ARABIC DOCUMENT (1)
HANDWRITTEN CHARACTER IMAGE (1)
HANDWRITTEN CHARACTER REPRESENTATION (1)
HANDWRITTEN NOTES (1)
HANDWRITTEN PATTERN RECOGNITION (1)
HANDWRITTEN TEXT READABILITY (1)
HANDWRITTEN TEXT RECOGNITION (1)
HIDDEN MARKOV MODELING (1)
HIDDEN MARKOV SUPPORT VECTOR MACHINES (1)
HMM (1)
HMM LEARNING METHOD (1)
HMSVM (1)
HOLISTIC URDU HANDWRITTEN WORD RECOGNITION (1)
HOWNET (1)
HYBRID MACHINE LEARNING APPROACH (1)
IMAGE CLASSIFICATION (1)
IMAGE MATCHING (1)
IMAGE SEGMENTATION (1)
INFORMATION MINING (1)
INFORMATION PROCESSING (1)
ISOLATED LETTERS (1)
KNOWLEDGE DISCOVERY (1)
KNOWLEDGE MANAGEMENT TOOL (1)
LABEL SEQUENCE LEARNING (1)
LARGE CORPUS (1)
LOGIC GATES (1)
MACHINE LEARNING ALGORITHMS (1)
MATHEMATICAL MODEL (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options