Search results

Items from 1 to 20 out of 133 results

chapter

Machine Identification of High Impact Research through Text and Image Analysis

Marko Stamenovic, Sam Schick, Jiebo Luo

2017 IEEE Third International Conference on Multimedia Big Data (BigMM) > 98 - 104

2017 IEEE Third International Conference on Multimedia Big Data (BigMM)

The volume of academic paper submissions and publications is growing at an ever increasing rate. While this flood of research promises progress in various fields, the sheer volume of output inherently increases the amount of noise. We present a system to automatically separate papers with a high from those with a low likelihood of gaining citations as a means to quickly find high impact, high quality...

chapter

Opinion mining and analysis: A literature review

Vandana Singh, Sanjay Kumar Dubey

2014 5th International Conference - Confluence The Next Generation Information Technology Summit (Confluence) > 232 - 239

2014 5th International Conference- Confluence The Next Generation Information Technology Summit

Sentiment analysis or opinion mining consists of many different fields like natural language processing, text mining, decision making and linguistics. It is a type of text analysis that classifies the text and makes decisions by extracting and analyzing the text. Opinions can be categorized as positive or negative, and it measures the degree of positive or negative sentiment associated with that event (people, organization,...

chapter

Comparing Mining Algorithms for Predicting the Severity of a Reported Bug

A Lamkanfi, S Demeyer, Q D Soetens, T Verdonck

2011 15th European Conference on Software Maintenance and Reengineering > 249 - 258

2011 15th European Conference on Software Maintenance and Reengineering (CSMR 2011)

A critical item of a bug report is the so-called "severity", i.e. the impact the bug has on the successful execution of the software system. Consequently, tool support for the person reporting the bug in the form of a recommender or verification system is desirable. In previous work we made a first step towards such a tool: we demonstrated that text mining can predict the severity of a given...

chapter

Study on question classification approach mixing multiple semantic characteristics together

LiGuo Duan, YanQin Niu, JunJie Chen

2011 3rd International Conference on Computer Research and Development > 1 > 354 - 357

2011 3rd International Conference on Computer Research and Development (ICCRD 2011)

This article proposes such a question classification approach that integrates multiple semantic features. It is aimed at these two questions in Chinese question classification models: inaccurate semantic information extraction and too slow processing speed caused by too high Eigenvector dimension. With the help of HowNet and the support vector machine and syntactic and semantic information of question...

chapter

SVM-based knowledge topic identification toward the autonomous knowledge acquisition

Keedong Yoo

2011 IEEE 9th International Symposium on Applied Machine Intelligence and Informatics (SAMI) > 149 - 154

2011 IEEE 9th International Symposium on Applied Machine Intelligence and Informatics (SAMI)

One of the most serious problems that conventional knowledge management (KM) encompasses has been pointed out tardy and ineffective acquisition of knowledge. To resolve this problem, knowledge must be autonomously acquired according to its context of use by applying the technique of keyword extraction in machine learning algorithm-based text mining. Once the topic of the given knowledge can be identified...

chapter

Investigating analysis of speech content through text classification

S Ezzat, N E Gayar, M M Ghanem

2010 International Conference of Soft Computing and Pattern Recognition > 105 - 110

2010 International Conference of Soft Computing and Pattern Recognition (SoCPaR 2010)

The field of Text Mining has evolved over the past years to analyze textual resources. However, it can be used in several other applications. In this research, we are particularly interested in performing text mining techniques on audio materials after translating them into texts in order to detect the speakers' emotions. We describe our overall methodology and present our experimental results. In...

chapter

Title Page i

2010 IEEE International Conference on Data Mining > i

2010 10th IEEE International Conference on Data Mining (ICDM 2010)

The following topics are dealt with: data mining; local clustering; spatiotemporal event detection; time series; Markov models; email classification; data stream; parallel mining; Bayesian network; unsupervised learning; missing values prediction; anomaly detection; decision tree; binary classifier; data similarity matrix; data mapping; support vector machine; Mapreduce; document similarity; social...

chapter

Vote-Based LELC for Positive and Unlabeled Textual Data Streams

Bo Liu, Yanshan Xiao, Longbing Cao, P S Yu

2010 IEEE International Conference on Data Mining Workshops > 951 - 958

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

In this paper, we extend LELC (PU Learning by Extracting Likely Positive and Negative Micro-Clusters) method to cope with positive and unlabeled data streams. Our developed approach, which is called vote-based LELC, works in three steps. In the first step, we extract representative documents from unlabeled data and assign a vote score to each document. The assigned vote score reflects the degree of...

chapter

Improving Arabic document categorization: Introducing local stem

Eiman Tamah Al-Shammari

2010 10th International Conference on Intelligent Systems Design and Applications > 385 - 390

10th International Conference on Intelligent Systems Design and Applications (ISDA 2010)

Stemming is a fundamental step in processing textual data preceding the tasks of text mining, Information Retrieval (IR), and natural language processing (NLP). The common goal of stemming is to standardize words by reducing a word to its base (root or stem), thus can be also considered a feature reduction technique. This paper aims at presenting a new dictionary free, content-based Arabic stemmer...

chapter

Extracting Topics Information from Conference Web Pages Using Page Segmentation and SVM

Yaw-Huei Chen, Sin-Sian Li, Yu-Ta Chen

2010 International Conference on Technologies and Applications of Artificial Intelligence > 270 - 277

2010 International Conference on Technologies and Applications of Artificial Intelligence (TAAI 2010)

Conference web pages display their topics information in different ways, and conferences in different domains accept papers on different topics. Automatic extraction of topics information from conference web pages is thus a difficult task and has not received much attention from the research community. In this paper, we propose a method for extracting topics information that uses a web page segmentation...

chapter

Extracting Parallel Texts from the Web

Le Quang Hung, Le Anh Cuong

2010 Second International Conference on Knowledge and Systems Engineering > 147 - 151

2010 Second International Conference on Knowledge and Systems Engineering (KSE)

Parallel corpus is the valuable resource for some important applications of natural language processing such as statistical machine translation, dictionary construction, cross-language information retrieval. The Web is a huge resource of knowledge, which partly contains bilingual information in various kinds of web pages. It currently attracts many studies on building parallel corpora based on the...

chapter

A mutual information and information entropy pair based feature selection method in text classification

Zhili Pei, Yuxin Zhou, Lisha Liu, Lihua Wang, et al.

2010 International Conference on Computer Application and System Modeling (ICCASM 2010) > 6 > V6-258 - V6-261

2010 International Conference on Computer Application and System Modeling (ICCASM 2010)

Text classification is an important research field within data mining. This article presents a mutual information and information entropy pair based feature selection method (MIIEP_FS) based on the theory of information entropy and the information entropy pair concept. This method measures the classification effect of a feature by the mutual information method and shows the extent of the difference between the features...

chapter

Web Text Categorization for Large-scale Corpus

Zhijuan Jia, Jianbo Mu

2010 International Conference on Computer Application and System Modeling (ICCASM 2010) > 8 > V8-188 - V8-191

2010 International Conference on Computer Application and System Modeling (ICCASM 2010)

Corpus is the set of language materials which are stored in computers and can use computers to search, query and analyze for enterprise decision-makers. Automated text categorization has been extensively studied and various techniques for document categorization. But based on the current scarcity of Chinese corpus, especially in the field of text categorization, the Chinese categorization corpus is...

chapter

Stock price prediction using financial news articles

M I Y Kaya, M E Karsligil

2010 2nd IEEE International Conference on Information and Financial Engineering > 478 - 482

2010 2nd IEEE International Conference on Information and Financial Engineering (ICIFE 2010)

Stock price prediction is one of the most important issues to be investigated in academic and financial researches. Data mining techniques are frequently involved in the studies aimed to achieve this problem. In this paper we investigate predicting stock prices using financial news articles. A prediction model, finding and analyzing correlation between contents of news articles and stock prices and...

chapter

A Comparison of Stylometric and Lexical Features for Web Genre Classification and Emotion Classification in Blogs

Elisabeth Lex, Andreas Juffinger, Michael Granitzer

2010 Workshops on Database and Expert Systems Applications > 10 - 14

2010 21st International Conference on Database and Expert Systems Applications

In the blogosphere, the amount of digital content is expanding and for search engines, new challenges have been imposed. Due to the changing information need, automatic methods are needed to support blog search users to filter information by different facets. In our work, we aim to support blog search with genre and facet information. Since we focus on the news genre, our approach is to classify blogs...

chapter

Sentiment text classification of customers reviews on the Web based on SVM

Huosong Xia, Min Tao, Yi Wang

2010 Sixth International Conference on Natural Computation > 7 > 3633 - 3637

2010 Sixth International Conference on Natural Computation (ICNC)

As a developing endeavor of data mining on semi-structured information, sentiment analysis of comments on the Internet has aroused people's great interest recently. This paper analyzes the influence of different stop word removal methods on the result of text classification and presents a more effective stop word removal list. The experiment is based on the sentiment comments which have been grasped...

chapter

Internet Public Opinion Hotspot Detection and Analysis Based on Kmeans and SVM Algorithm

Hong Liu

2010 International Conference of Information Science and Management Engineering > 1 > 257 - 261

2010 International Conference of Information Science and Management Engineering. ISME 2010

Rapid progress of network arouses much attention on Internet public opinion, it is important to grasp the internet public opinion in time and understand the trends of their opinion correctly. Text mining plays a fundamental role in categorization and monitoring of internet public opinion, but internet public opinion is much more difficult than pure-text process because of their semi-structured characteristic...

chapter

An event ontology construction approach to web crime mining

Li Cunhua, Hu Yun, Zhong Zhaoman

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 5 > 2441 - 2445

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Along with the rapid popularity of the Internet, crime information on the web is becoming increasingly rampant, and the majority of them are in the form of text. Because a lot of crime information in documents is described through events, event-based semantic technology can be used to study the patterns and trends of web-oriented crimes. In our research project on cyber crime mining, we construct...

chapter

Document Relevance Identifying and its Effect in Query-Focused Text Summarization

Tingting He, Fang Li, Liang Ma

2010 IEEE International Conference on Granular Computing > 206 - 211

2010 IEEE International Conference on Granular Computing (GrC-2010)

There is an important issue that text summarization has to embody personal information need and provide indicative message to user. In this paper, a method of acquiring relevant documents based on user-feedback information and transductive inference SVM machine learning is presented. This method can well avoid the subjectivity of deciding relevant documents empirically. Furthermore, a sentence selection...

chapter

Intelligent Information Technology Based Drug Knowledge Platform

Cunhua Li, Zhaoman Zhong, Hongwei Dai

2010 International Conference on Internet Technology and Applications > 1 - 4

2010 International Conference on Internet Technology and Applications (iTAP 2010)

In this paper, a drug knowledge platform based on intelligent information technology is constructed. Unlike the traditional virtual drug compound library, which focuses on drug patents only, the aim of the new platform is to design a system including information retrieval, information extraction, construction of drug compound and drug ontology, structure based virtual screening, and text classification...

Data set:
ieee
Keywords:
DATA MINING
TEXT ANALYSIS
SUPPORT VECTOR MACHINES

Content availability

Available (130)
None (3)

Publication type

book (125)
article (8)

Keywords

FEATURE EXTRACTION (61)
CLASSIFICATION ALGORITHMS (49)
PATTERN CLASSIFICATION (49)
TRAINING (48)
SUPPORT VECTOR MACHINE (44)
ACCURACY (40)
LEARNING (ARTIFICIAL INTELLIGENCE) (38)
MACHINE LEARNING (38)
TEXT CATEGORIZATION (35)
TEXT MINING (33)
SVM (31)
NATURAL LANGUAGE PROCESSING (27)
TEXT CLASSIFICATION (22)
CLASSIFICATION (21)
INFORMATION RETRIEVAL (21)
INTERNET (18)
KERNEL (17)
SUPPORT VECTOR MACHINE CLASSIFICATION (16)
FEATURE SELECTION (15)
INFORMATION EXTRACTION (11)
TESTING (11)
PROBABILITY DENSITY FUNCTION (10)
MACHINE LEARNING ALGORITHMS (9)
ONTOLOGIES (9)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (8)
SVM CLASSIFIER (8)
WEB SITES (8)
DATABASES (7)
ARTIFICIAL NEURAL NETWORKS (6)
CONFERENCES (6)
CONTEXT (6)
DOCUMENT CLASSIFICATION (6)
GRAMMARS (6)
NATURAL LANGUAGES (6)
PROTEINS (6)
SENTIMENT CLASSIFICATION (6)
STATISTICAL ANALYSIS (6)
TRAINING DATA (6)
WEB PAGES (6)
BAYES METHODS (5)
CHINESE TEXT (5)
DOCUMENT IMAGE PROCESSING (5)
EQUATIONS (5)
HIDDEN MARKOV MODELS (5)
INDEXING (5)
INFORMATION FILTERING (5)
LEARNING SYSTEMS (5)
SENTIMENT ANALYSIS (5)
TAGGING (5)
VISUALIZATION (5)
ARTIFICIAL INTELLIGENCE (4)
BIOINFORMATICS (4)
BLOGS (4)
COMPUTERS (4)
DICTIONARIES (4)
ENTROPY (4)
FEATURE SELECTION METHOD (4)
GENETIC ALGORITHMS (4)
GENETICS (4)
KNOWLEDGE BASED SYSTEMS (4)
MATHEMATICAL MODEL (4)
NEURAL NETS (4)
PATTERN CLUSTERING (4)
PIXEL (4)
PREDICTION ALGORITHMS (4)
SAMPLING METHODS (4)
SEARCH ENGINES (4)
SHAPE (4)
SPEECH (4)
TEXT PROCESSING (4)
VECTOR SPACE MODEL (4)
VIDEO SIGNAL PROCESSING (4)
WORD PROCESSING (4)
ACRONYM EXTRACTION (3)
ALGORITHM DESIGN AND ANALYSIS (3)
ANALYTICAL MODELS (3)
ARRAYS (3)
BIOLOGY COMPUTING (3)
CHINESE TEXT CATEGORIZATION (3)
CHINESE TEXT CLASSIFICATION (3)
CITIES AND TOWNS (3)
CLUSTERING (3)
CLUSTERING ALGORITHMS (3)
COMPLEXITY THEORY (3)
COMPUTATIONAL LINGUISTICS (3)
COMPUTATIONAL MODELING (3)
COMPUTER SCIENCE (3)
DECISION TREES (3)
DIGITAL LIBRARIES (3)
DIGITAL LIBRARY (3)
DISTANCE MEASUREMENT (3)
EDUCATIONAL INSTITUTIONS (3)
ELECTRONIC MAIL (3)
FILTERING (3)
HANDWRITING RECOGNITION (3)
HOWNET (3)
IMAGE CLASSIFICATION (3)

INFONA - science communication portal
