Search results

Items from 141 to 160 out of 3,961 results

1 ...
5
6
7
8
9
10
11

chapter

Isolating critical data points from boundary region with feature selection

A. Anitha, E. Kannan

2014 IEEE International Conference on Computational Intelligence and Computing Research > 1 - 4

2014 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC)

Immense databases may contain critical instances or chunks-a small heap of records or instances which has domain specific information. These chunks of information are useful in future decision making for improving classification accuracy for labeling of critical, unlabeled instances by reducing false positives and false negatives. Classification process may be assessed based on efficiency and effectiveness...

chapter

An enhanced feature selection method comprising rough set and clustering techniques

A. Murugan, T. Sridevi

2014 IEEE International Conference on Computational Intelligence and Computing Research > 1 - 4

2014 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC)

Feature selection or variable reduction is a fundamental problem in data mining, refers to the process of identifying the few most important features for application of a learning algorithm. The best subset contains the minimum number of dimensions retaining a suitably high accuracy on classifier in representing the original features. The objective of the proposed approach is to reduce the number...

chapter

Scalable parallel clustering approach for large data using genetic possibilistic fuzzy c-means algorithm

Juby Mathew, R. Vijayakumar

2014 IEEE International Conference on Computational Intelligence and Computing Research > 1 - 7

2014 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC)

In various domains, big data play crucial and related processes because of the latest developments in the digital planet. Such irrepressible data growth has led to bring clustering algorithms to segment the data into small sets to perform associated processes with them. However, the challenge continues in dealing with large data, because most of the algorithms are compatible only with small data....

chapter

Incremental Ensemble Classifier Addressing Non-stationary Fast Data Streams

Brandon S. Parker, Latifur Khan, Albert Bifet

2014 IEEE International Conference on Data Mining Workshop > 716 - 723

2014 IEEE International Conference on Data Mining Workshop (ICDMW)

Classification of data points in a data stream is a fundamentally different set of challenges than data mining on static data. While streaming data is often placed into the context of "Big Data" (or more specifically "Fast Data") wherein one-pass algorithms are used, true data streams offer additional hurdles due to their dynamic, evolving, and non-stationary nature. During the...

chapter

Emotion Recognition from Text Based on Automatically Generated Rules

Shadi Shaheen, Wassim El-Hajj, Hazem Hajj, Shady Elbassuoni

2014 IEEE International Conference on Data Mining Workshop > 383 - 392

2014 IEEE International Conference on Data Mining Workshop (ICDMW)

With the growth of the Internet community, textual data has proven to be the main tool of communication in human-machine and human-human interaction. This communication is constantly evolving towards the goal of making it as human and real as possible. One way of humanizing such interaction is to provide a framework that can recognize the emotions present in the communication or the emotions of the...

chapter

Examination of Reliability of Missing Value Recovery in Data Mining

Shigang Liu, Honghua Dai

2014 IEEE International Conference on Data Mining Workshop > 306 - 313

2014 IEEE International Conference on Data Mining Workshop (ICDMW)

Missing data imputation is an important task in cases where it is crucial to use all available data and no discard records with missing values. However, most of the existing algorithms are focused on missing at random (MAR) or missing completely at random (MCAR). In this paper, an information decomposition imputation (IDIM) algorithm using fuzzy membership function is proposed for addressing the missing...

chapter

An Improved Semi-supervised K-Means Algorithm Based on Information Gain

Liu Zhenpeng, Guo Ding, Zhang Xizhong, Wang Xu, more

2014 IEEE 17th International Conference on Computational Science and Engineering > 1960 - 1963

2014 IEEE 17th International Conference on Computational Science and Engineering (CSE)

The traditional K-means algorithm is sensitive to the initial center, and equates the importance of dimension data for multidimensional data. So it is unable to block the effects of dimensional data dimension, nor can it well reflect the influence of each dimension of clustering. The semi-supervised clustering introduces a small amount of sample points, so that it can significantly reduce the number...

chapter

Check-in Location Prediction Using Wavelets and Conditional Random Fields

Roland Assam, Thomas Seidl

2014 IEEE International Conference on Data Mining > 713 - 718

2014 IEEE International Conference on Data Mining (ICDM)

The widespread adoption of ubiquitous devices does not only facilitate the connection of billions of people, but has also fuelled a culture of sharing rich, high resolution locations through check-ins. Despite the profusion of GPS and WiFi driven location prediction techniques, the sparse and random nature of check-in data generation have ushered diverse problems, which have prompted the prediction...

chapter

Sequence Classification Based on Delta-Free Sequential Patterns

Pierre Holat, Marc Plantevit, Chedy Raissi, Nadi Tomeh, more

2014 IEEE International Conference on Data Mining > 170 - 179

2014 IEEE International Conference on Data Mining (ICDM)

Sequential pattern mining is one of the most studied and challenging tasks in data mining. However, the extension of well-known methods from many other classical patterns to sequences is not a trivial task. In this paper we study the notion of &#x3B4;-freeness for sequences. While this notion has extensively been discussed for item sets, this work is the first to extend it to sequences. We...

chapter

Popular Items or Niche Items: Flexible Recommendation Using Cosine Patterns

Yaqiong Wang, Junjie Wu, Zhiang Wu, Hua Yuan, more

2014 IEEE International Conference on Data Mining Workshop > 205 - 212

2014 IEEE International Conference on Data Mining Workshop (ICDMW)

Recent years have witnessed the explosive growth of recommender systems in various exciting application domains such as electronic commerce, social networking, and location-based services. A great many algorithms have been proposed to improve the accuracy of recommendation, but until recently the long tail problem rising from inadequate recommendation of niche items is recognized as a real challenge...

chapter

Mp-Dissimilarity: A Data Dependent Dissimilarity Measure

Sunil Aryal, Kai Ming Ting, Gholamreza Haffari, Takashi Washio

2014 IEEE International Conference on Data Mining > 707 - 712

2014 IEEE International Conference on Data Mining (ICDM)

Nearest neighbour search is a core process in many data mining algorithms. Finding reliable closest matches of a query in a high dimensional space is still a challenging task. This is because the effectiveness of many dissimilarity measures, that are based on a geometric model, such as lp-norm, decreases as the number of dimensions increases. In this paper, we examine how the data distribution can...

chapter

A Joint Model for Topic-Sentiment Evolution over Time

Mohamed Dermouche, Julien Velcin, Leila Khouas, Sabine Loudcher

2014 IEEE International Conference on Data Mining > 773 - 778

2014 IEEE International Conference on Data Mining (ICDM)

Most existing topic models focus either on extracting static topic-sentiment conjunctions or topic-wise evolution over time leaving out topic-sentiment dynamics and missing the opportunity to provide a more in-depth analysis of textual data. In this paper, we propose an LDA-based topic model for analyzing topic-sentiment evolution over time by modeling time jointly with topics and sentiments. We derive...

chapter

SNOC: Streaming Network Node Classification

Ting Guo, Xingquan Zhu, Jian Pei, Chengqi Zhang

2014 IEEE International Conference on Data Mining > 150 - 159

2014 IEEE International Conference on Data Mining (ICDM)

Many real-world networks are featured with dynamic changes, such as new nodes and edges, and modification of the node content. Because changes are continuously introduced to the network in a streaming fashion, we refer to such dynamic networks as streaming networks. In this paper, we propose a new classification method for streaming networks, namely streaming network node classification (SNOC). For...

chapter

Finding Valuable Yelp Comments by Personality, Content, Geo, and Anomaly Analysis

Jay Koven, Hossein Siadati, Ching-Yung Lin

2014 IEEE International Conference on Data Mining Workshop > 1215 - 1218

2014 IEEE International Conference on Data Mining Workshop (ICDMW)

User reported experiences and opinions are used by peers to make decisions about where to go and what to buy. Unfortunately, not all users or opinions are honest. Many opinions are fabricated and may be submitted by automated systems or by people who are recruited by businesses and search engine optimizers to write good reviews. Such reviews and ratings are called spam reviews. These are misleading...

chapter

Modeling risk prediction of diabetes — A preventive measure

Bakshi Rohit Prasad, Sonali Agarwal

2014 9th International Conference on Industrial and Information Systems (ICIIS) > 1 - 6

2014 9th International Conference on Industrial and Information Systems (ICIIS)

Databases in clinical scenario have tremendous amount of data regarding patients and clinical history associated. Here, data mining plays vital role in searching for patterns within huge clinical data that could provide useful basis of knowledge for efficient and effective decision-making. Classification mechanism is widely used tool of data mining employed in healthcare applications to facilitate...

chapter

Optimization of feature selection method for high dimensional data using fisher score and minimum spanning tree

Bharat Singh, Jitendra Singh Sankhwar, Om Prakash Vyas

2014 Annual IEEE India Conference (INDICON) > 1 - 6

2014 Annual IEEE India Conference (INDICON)

For classification of High Dimensional data, feature selection is the most important step for obtaining optimal result with respect to processing power required and time taken. Feature selection is a method by which the most relevant feature is selected from a set of features containing redundant and irrelevant features thereby reducing the load on the classification algorithm. This paper proposes...

chapter

Stream Mining Using Statistical Relational Learning

Swarup Chandra, Justin Sahs, Latifur Khan, Bhavani Thuraisingham, more

2014 IEEE International Conference on Data Mining > 743 - 748

2014 IEEE International Conference on Data Mining (ICDM)

Stream mining has gained popularity in recent years due to the availability of numerous data streams from sources such as social media and sensor networks. Data mining on such continuous streams possess a variety of challenges including concept drift and unbounded stream length. Traditional data mining approaches to these problems have difficulty incorporating relational domain knowledge and feature...

chapter

Genetic algorithm based wrapper feature selection on hybrid prediction model for analysis of high dimensional data

R C Anirudha, Remya Kannan, Nagamma Patil

2014 9th International Conference on Industrial and Information Systems (ICIIS) > 1 - 6

2014 9th International Conference on Industrial and Information Systems (ICIIS)

Data mining concepts have been extensively used for disease prediction in the medical field. Many Hybrid Prediction Models (HPM) have been proposed and implemented in this area, however, there is always a need for increasing accuracy and efficiency. The existing methods take into account all the features to build the classifier model thus reducing the accuracy and increasing the overall processing...

chapter

A Paralleled Big Data Algorithm with MapReduce Framework for Mining Twitter Data

LI Bing, Keith C.C. Chan

2014 IEEE Fourth International Conference on Big Data and Cloud Computing > 121 - 128

2014 IEEE International Conference on Big Data and Cloud Computing (BdCloud)

Some recent studies have suggested that public opinions expressed in social media may be correlated with various social issues. To find out what actually can be discovered in social media data, we need data mining. Data mining approaches that can handle massive amount of data have recently been referred to as big data algorithms. In this paper, we propose a big data algorithm to handling Twitter data...

chapter

Feature Extraction Model to Identify At -- Risk Level of Students in Academia

Mamta Singh, Jyoti Singh, Arpana Rawal

2014 International Conference on Information Technology > 221 - 227

2014 International Conference on Information Technology (ICIT)

Since four decades, a sincere concern has aroused among managerial, professional, towards the satisfaction of teaching-learning objective in Academia. Huge span of time has already been spent revealing student's profile patterns using predictive modeling methods, however, very little effort is put up in identifying the causative features responsible for varied students' performances followed by decisive...

1 ...
5
6
7
8
9
10
11

Keywords:
ACCURACY
DATA MINING

Publication date

Set your own date range

Content availability

Available (3,890)
None (71)

Keywords

FEATURE EXTRACTION (828)
TRAINING (796)
CLASSIFICATION ALGORITHMS (784)
ALGORITHM DESIGN AND ANALYSIS (485)
PATTERN CLASSIFICATION (467)
SUPPORT VECTOR MACHINES (454)
LEARNING (ARTIFICIAL INTELLIGENCE) (398)
PROBABILITY DENSITY FUNCTION (345)
CLUSTERING ALGORITHMS (317)
DATA MODELS (313)
ARTIFICIAL NEURAL NETWORKS (309)
MACHINE LEARNING (290)
DATABASES (285)
CLASSIFICATION (278)
DECISION TREES (254)
INTERNET (242)
PATTERN CLUSTERING (221)
COMPUTATIONAL MODELING (199)
MATHEMATICAL MODEL (196)
TESTING (195)
TEXT ANALYSIS (184)
PREDICTIVE MODELS (183)
ESTIMATION (182)
REMOTE SENSING (181)
NOISE (179)
IMAGE CLASSIFICATION (176)
FEATURE SELECTION (173)
EQUATIONS (168)
PREDICTION ALGORITHMS (167)
PIXEL (161)
IMAGE SEGMENTATION (160)
CORRELATION (158)
EDUCATIONAL INSTITUTIONS (152)
SUPPORT VECTOR MACHINE (151)
NATURAL LANGUAGE PROCESSING (148)
COMPUTERS (145)
TRAINING DATA (141)
INDEXES (138)
GENETIC ALGORITHMS (136)
INFORMATION RETRIEVAL (134)
OPTIMIZATION (129)
SOFTWARE (129)
FUZZY SET THEORY (127)
STATISTICAL ANALYSIS (126)
KERNEL (124)
MONITORING (124)
COMPLEXITY THEORY (122)
SHAPE (120)
APPROXIMATION METHODS (118)
DISTANCE MEASUREMENT (118)
SIGNAL PROCESSING (116)
PATTERN RECOGNITION (105)
PRINCIPAL COMPONENT ANALYSIS (105)
ROBUSTNESS (105)
ENTROPY (101)
ASSOCIATION RULES (100)
DECISION TREE (98)
ROUGH SET THEORY (98)
CONFERENCES (97)
NEURAL NETS (97)
IMAGE COLOR ANALYSIS (96)
SUPPORT VECTOR MACHINE CLASSIFICATION (95)
IMAGE PROCESSING (92)
PROBABILITY (92)
TRANSFORMS (91)
WEB PAGES (91)
OPTIMISATION (90)
DISEASES (88)
BAYES METHODS (86)
HUMANS (86)
SPEECH (85)
CONTEXT (84)
WIRELESS SENSOR NETWORKS (84)
HEURISTIC ALGORITHMS (82)
ANALYTICAL MODELS (80)
DECISION MAKING (80)
IMAGE RESOLUTION (80)
CLUSTERING (78)
DATA ANALYSIS (78)
HIDDEN MARKOV MODELS (76)
IMAGE EDGE DETECTION (76)
SVM (76)
GENETICS (75)
SIGNAL PROCESSING ALGORITHMS (74)
BAYESIAN METHODS (73)
BUILDINGS (73)
CAMERAS (73)
VECTORS (71)
REGRESSION ANALYSIS (70)
VISUALIZATION (70)
CLASSIFICATION TREE ANALYSIS (69)
SECURITY OF DATA (69)
BIOLOGICAL SYSTEM MODELING (68)
FILTERING (68)
WEB SITES (68)
ARTIFICIAL INTELLIGENCE (67)
COMPUTER SCIENCE (67)
COMPUTER VISION (66)
more

Data set

ieee (3,960)
Springer (1)

INFONA - science communication portal

Search results

Isolating critical data points from boundary region with feature selection

An enhanced feature selection method comprising rough set and clustering techniques

Scalable parallel clustering approach for large data using genetic possibilistic fuzzy c-means algorithm

Incremental Ensemble Classifier Addressing Non-stationary Fast Data Streams

Emotion Recognition from Text Based on Automatically Generated Rules

Examination of Reliability of Missing Value Recovery in Data Mining

An Improved Semi-supervised K-Means Algorithm Based on Information Gain

Check-in Location Prediction Using Wavelets and Conditional Random Fields

Sequence Classification Based on Delta-Free Sequential Patterns

Popular Items or Niche Items: Flexible Recommendation Using Cosine Patterns

Mp-Dissimilarity: A Data Dependent Dissimilarity Measure

A Joint Model for Topic-Sentiment Evolution over Time

SNOC: Streaming Network Node Classification

Finding Valuable Yelp Comments by Personality, Content, Geo, and Anomaly Analysis

Modeling risk prediction of diabetes — A preventive measure

Optimization of feature selection method for high dimensional data using fisher score and minimum spanning tree

Stream Mining Using Statistical Relational Learning

Genetic algorithm based wrapper feature selection on hybrid prediction model for analysis of high dimensional data

A Paralleled Big Data Algorithm with MapReduce Framework for Mining Twitter Data

Feature Extraction Model to Identify At -- Risk Level of Students in Academia

Filter options

Publication date

Content availability

Keywords

Data set

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options