Search results

Items from 1 to 6 out of 6 results

chapter

Converting printed Sinhala documents to formatted editable text

S Ajward, N Jayasundara, S Madushika, R Ragel

2010 Fifth International Conference on Information and Automation for Sustainability > 138 - 143

2010 5th International Conference on Information and Automation for Sustainability (ICIAfS)

Digitizing printed document is always a challenge faced by the computing society. Digitization of text not only allows users to easily modify and reprint printed documents, but also is a need of the day due to the use of word-search capability available at disposal in this era. Converting a printed document into a stream of characters using OCR (optical character recognition) techniques is a widely...

chapter

Determination of Bloom's cognitive level of question items using artificial neural network

Norazah Yusof, Chai Jing Hui

2010 10th International Conference on Intelligent Systems Design and Applications > 866 - 870

10th International Conference on Intelligent Systems Design and Applications (ISDA 2010)

We propose a classification model for the cognitive level of question items in examinations based on Bloom's taxonomy. The model implements the artificial neural network approach, which is trained using the scaled conjugate gradient learning algorithm. Several data preprocessing techniques such as word extraction, stop word removal, stemming, and vector representation are applied to a feature set...

chapter

Optimal Hash List for Word Frequency Analysis

Sheng-Lan Peng

2010 International Conference on Web Information Systems and Mining > 1 > 242 - 245

2010 International Conference on Web Information Systems and Mining (WISM 2010)

Word frequency analysis plays an essential role in many data mining tasks of large-scale data set based on text corpus, and hash list is a very simple but efficient structure for frequent pattern discovering. In this paper, a Poisson approximation approach is exploited to analyze the space efficiency of hash list under different parameters on probability. Based on our theoretical model, an optimal...

chapter

A Novel Approach to Improve the Accuracy of Web Retrieval

Vitaly Klyuev, Vladimir Oleshchuk

2010 5th International Conference on Future Information Technology > 1 - 5

2010 5th International Conference on Future Information Technology (FutureTech)

General purpose search engines utilize a very simple view on text documents: They consider them as bags of words. It results that after indexing, the semantics of documents is lost. In this paper, we introduce a novel approach to improve the accuracy of Web retrieval. We utilize the WordNet and WordNet SenseRelate All Words Software as main tools to preserve the semantics of the sentences of documents...

chapter

Categorization of news articles using neural text categorizer

Taeho Jo

2009 IEEE International Conference on Fuzzy Systems > 19 - 22

2009 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)

This research proposes the application of NTC (neural text categorizer) for categorizing news articles. Even if the research on text categorization has been progressed very much, documents should be still encoded into numerical vectors. Encoding so causes the two main problems: huge dimensionality and sparse distribution. The idea of this research as the solution to the problems is to encode documents...

chapter

Clustering news groups using inverted index based NTSO

Taeho Jo

2009 First International Conference on Networked Digital Technologies > 1 - 7

2009 First International Conference on Networked Digital Technologies (NDT 2009)

This research proposes NTSO (neural text self organizer) as the approach to text clustering and sets inverted index as the basis for execution of the NTSO. For using one of traditional approaches, documents should be encoded into numerical vectors and encoding so causes the two main problems: the huge dimensionality and the sparse distribution. This research proposes that documents should be encoded...

Filter options

Keywords:
WORD PROCESSING
ARTIFICIAL NEURAL NETWORKS

Publication date

Set your own date range

Keywords

DATA MINING (3)
NEURAL NETS (3)
ENCODING (2)
FEATURE EXTRACTION (2)
TEXT CATEGORIZATION (2)
TRAINING (2)
ACCURACY (1)
ALGORITHM DESIGN AND ANALYSIS (1)
AMBIGUITY PROBLEM (1)
APPROXIMATION METHODS (1)
APPROXIMATION THEORY (1)
ARTIFICIAL NEURAL NETWORK (1)
BIOINFORMATICS (1)
BLOOM'S COGNITIVE LEVEL (1)
BLOOMS TAXONOMY (1)
CHARACTER RECOGNITION (1)
CLASSIFICATION MODEL (1)
CLUSTERING ALGORITHMS (1)
COMPUTER ARCHITECTURE (1)
COMPUTING SOCIETY (1)
CONJUGATE GRADIENT LEARNING ALGORITHM (1)
CONJUGATE GRADIENT METHODS (1)
CONVERGENCE (1)
DATA MINING TASKS (1)
DATA PREPROCESSING TECHNIQUES (1)
DIGITAL PRINTING (1)
DIGITIZING PRINTED DOCUMENT (1)
DOCUMENT FREQUENCY (1)
DOCUMENT IMAGE PROCESSING (1)
EDITABLE SCANNED DOCUMENTS (1)
FEATURE REDUCTION METHODS (1)
FEATURE VECTOR (1)
FINITE ELEMENT METHODS (1)
FORMATTED EDITABLE TEXT (1)
FORMATTING FEATURE (1)
FREQUENT PATTERN DISCOVERY (1)
GENOMICS (1)
HASH LIST (1)
HORIZONTAL PROFILING (1)
INDEXES (1)
INVERTED INDEX BASED NTSO (1)
KERNEL (1)
LAYOUT (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
MACHINE LEARNING (1)
NATURAL LANGUAGE PROCESSING (1)
NEURAL TEXT CATEGORIZER (1)
NEURAL TEXT SELF ORGANIZER (1)
NEWS ARTICLES CATEGORIZATION (1)
NEWS GROUPS CLUSTERING (1)
OCR TECHNIQUE (1)
OPEN-SOURCE WORD PROCESSING TOOL (1)
OPTICAL CHARACTER RECOGNITION (1)
OPTICAL CHARACTER RECOGNITION SOFTWARE (1)
OPTICAL IMAGING (1)
PATTERN CLASSIFICATION (1)
PEDIATRICS (1)
POISSON APPROXIMATION (1)
POISSON APPROXIMATION APPROACH (1)
PRINTED SINHALA DOCUMENT (1)
QUERY KEYWORDS (1)
QUERY PROCESSING (1)
QUESTION ITEMS COGNITIVE LEVEL (1)
RANDOM VARIABLES (1)
SCALED CONJUGATE GRADIENT LEARNING ALGORITHM (1)
SEARCH ENGINES (1)
SEMANTIC TREE (1)
SEMANTIC WEB (1)
SEMANTICS (1)
SINHALA DOCUMENT FORMATTING (1)
SINHALA LANGUAGE (1)
SOFTWARE (1)
SPACE EFFICIENCY (1)
SRI LANKA NATIVE LANGUAGE (1)
STEMMING (1)
STOCHASTIC PROCESSES (1)
STOP WORD REMOVAL (1)
STRING VECTOR ENCODING (1)
SUPPORT VECTOR MACHINE CLASSIFICATION (1)
SUPPORT VECTOR MACHINES (1)
TAXONOMY (1)
TEXT CLUSTERING (1)
TEXT CORPUS (1)
TEXT DIGITIZATION (1)
TEXT EDITING (1)
TEXT RECOGNITION (1)
TIME FREQUENCY ANALYSIS (1)
TREE DATA STRUCTURES (1)
VECTOR REPRESENTATION (1)
VERTICAL PROFILING (1)
WEB RETRIEVAL ACCURACY (1)
WORD EXTRACTION (1)
WORD FREQUENCY (1)
WORD FREQUENCY ANALYSIS (1)
WORD-SEARCH CAPABILITY (1)
WORDNET (1)
WORDNET SENSERELATE (1)
more

INFONA - science communication portal

Search results

Converting printed Sinhala documents to formatted editable text

Determination of Bloom's cognitive level of question items using artificial neural network

Optimal Hash List for Word Frequency Analysis

A Novel Approach to Improve the Accuracy of Web Retrieval

Categorization of news articles using neural text categorizer

Clustering news groups using inverted index based NTSO

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options