Search results

Items from 1 to 20 out of 3,530 results

chapter

A Feasible Direction Method for Optimization Problem with Orthogonal Constraint in Feature Selection

Jianyu Miao, Yong Shi, Lingfeng Niu

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 824 - 829

2017 IEEE International Conference on Data Mining Workshops (ICDMW)

Feature selection, as a fundamental component of building robust models, plays an important role in many machine learning and data mining tasks. Since acquiring labeled data is particularly expensive in both time and effort, unsupervised feature selection on unlabeled data has recently gained considerable attention. Without label information, unsupervised feature selection needs alternative criteria...

chapter

Ship route extraction and clustering analysis based on automatic identification system data

Sainan Wang, Suixiang Gao, Wenguo Yang

2017 Eighth International Conference on Intelligent Control and Information Processing (ICICIP) > 33 - 38

2017 Eighth International Conference on Intelligent Control and Information Processing (ICICIP)

This paper considers ship route extraction and clustering problem based on Automatic Identification System (AIS) data. For the ships with known Maritime Mobile Service Identify (MMSI), we propose a ship route extraction method by using AIS data. For ship route clustering, hierarchical clustering method is selected. We firstly define a distance between ship routes to measure the dissimilarity of them...

chapter

High-Dimensional Density Estimation for Data Mining Tasks

Alexander Kuleshov, Alexander Bernstein, Yury Yanovich

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 523 - 530

2017 IEEE International Conference on Data Mining Workshops (ICDMW)

Consider a problem of estimating an unknown high dimensional density whose support lies on unknown low-dimensional data manifold. This problem arises in many data mining tasks, and the paper proposes a new geometrically motivated solution for the problem in manifold learning framework, including an estimation of an unknown support of the density. Firstly, tangent bundle manifold learning problem is...

chapter

Taming Wild High Dimensional Text Data with a Fuzzy Lash

Amir Karami

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 518 - 522

2017 IEEE International Conference on Data Mining Workshops (ICDMW)

The bag of words (BOW) represents a corpus in a matrix whose elements are the frequency of words. However, each row in the matrix is a very high-dimensional sparse vector. Dimension reduction (DR) is a popular method to address sparsity and high-dimensionality issues. Among different strategies to develop DR method, Unsupervised Feature Transformation (UFT) is a popular strategy to map all words on...

chapter

Discovery of Informal Topics from Post Traumatic Stress Disorder Forums

Reilly Grant, David Kucher, Ana M. Leon, Jonathan Gemmell, more

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 452 - 461

2017 IEEE International Conference on Data Mining Workshops (ICDMW)

Post Traumatic Stress Disorder (PTSD) is a public health problem afflicting millions of people each year. It is especially prominent among military veterans. Understanding the language, attitudes, and topics associated with PTSD presents an important and challenging problem. Based on their expertise, mental health professionals have constructed a formal definition of PTSD. However, even the most assiduous...

chapter

Efficient Computation of Multiple Density-Based Clustering Hierarchies

Antonio Cavalcante Araujo Neto, Joerg Sander, Ricardo J. G. B. Campello, Mario A. Nascimento

2017 IEEE International Conference on Data Mining (ICDM) > 991 - 996

2017 IEEE International Conference on Data Mining (ICDM)

HDBSCAN*, a state-of-the-art density-based hierarchical clustering method, produces a hierarchical organization of clusters in a dataset w.r.t. a parameter mpts. While the performance of HDBSCAN* is robust w.r.t. mpts, choosing a "good" value for it can be challenging: depending on the data distribution, a high or low value for mpts may be more appropriate, and certain data clusters may...

chapter

Distributed Representations of Subgraphs

Bijaya Adhikari, Yao Zhang, Naren Ramakrishnan, B. Aditya Prakash

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 111 - 117

2017 IEEE International Conference on Data Mining Workshops (ICDMW)

There has been a surge in research interest in learning feature representation of networks in recent times. Researchers, motivated by the recent successes of embeddings in natural language processing and advances in deep learning, have explored various means for network embedding. Network embedding is useful as it can exploit off-the-shelf machine learning algorithms for network mining tasks like...

chapter

Distance and Density Clustering for Time Series Data

Ruizhe Ma, Rafal Angryk

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 25 - 32

2017 IEEE International Conference on Data Mining Workshops (ICDMW)

Clustering is an important branch in the field of data mining as well as statistical analysis and is widely used in exploratory analysis. Many algorithms exist for clustering in the Euclidean space. However, time series clustering introduces new problems, such as inadequate distance measure, inaccurate cluster center description, lack of efficient and accurate clustering techniques. When dealing with...

chapter

AnySCAN: An Efficient Anytime Framework with Active Learning for Large-Scale Network Clustering

Weizhong Zhao, Gang Chen, Xiaowei Xu

2017 IEEE International Conference on Data Mining (ICDM) > 665 - 674

2017 IEEE International Conference on Data Mining (ICDM)

Network clustering is an essential approach to finding latent clusters in real-world networks. As the scale of real-world networks becomes increasingly larger, the existing network clustering algorithms fail to discover meaningful clusters efficiently. In this paper, we propose a framework called AnySCAN, which applies anytime theory to the structural clustering algorithm for networks (SCAN). Moreover,...

chapter

Matrix Profile VI: Meaningful Multidimensional Motif Discovery

Chin-Chia Michael Yeh, Nickolas Kavantzas, Eamonn Keogh

2017 IEEE International Conference on Data Mining (ICDM) > 565 - 574

2017 IEEE International Conference on Data Mining (ICDM)

Time series motifs are approximately repeating patterns in real-valued time series data. They are useful for exploratory data mining and are often used as inputs for various time series clustering, classification, segmentation, rule discovery, and visualization algorithms. Since the introduction of the first motif discovery algorithm for univariate time series in 2002, multiple efforts have been made...

chapter

A cognitive data stream mining technique for context-aware IoT systems

Dinithi Nallaperuma, Daswin De Silva, Damminda Alahakoon, Xinghuo Yu

IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society > 4777 - 4782

IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society

IoT systems deployed in industrial and smart factory settings generate large volumes of data at high velocity. Context awareness is mandatory for knowledge discovery and actionable insights from such high-velocity, high-volume IoT data streams. Changes to the context of a data stream are represented in the underlying data distribution. Research in concept drift aims to detect and adapt to such changes...

chapter

An effective method determining the initial cluster centers for K-means for clustering gene expression data

Deniz Tanir, Fidan Nuriyeva

2017 International Conference on Computer Science and Engineering (UBMK) > 751 - 754

2017 International Conference on Computer Science and Engineering (UBMK)

Clustering is an important tool for analyzing gene expression data. Many clustering algorithms have been proposed for the analysis of gene expression data. In this article we have clustered real life gene expression data via K-Means which is one of clustering algorithms. Also, we have proposed a new method determining the initial cluster centers for K-means. We have compared results of our method...

chapter

Exploring risk factors and predicting UPDRS score based on Parkinson's speech signals

Jianxin Zhang, Weifeng Xu, Qiang Zhang, Bo Jin, more

2017 IEEE 19th International Conference on e-Health Networking, Applications and Services (Healthcom) > 1 - 6

2017 IEEE 19th International Conference on e-Health Networking, Applications and Services (Healthcom)

The unified Parkinson's disease rating scale (UPDRS) is the most widely employed scale for tracking Parkinson's disease (PD) symptom progression. However, conventional way to achieve UPDRS, mainly based on the physical examinations of clinic patients performed by the trained medical staffs, involves the disadvantages of inconvenience and high medical expense. Hence, in this study, we try to explore...

chapter

A non-parametric density kernel in density peak based clustering

Jian Hou, Aihua Zhang

2017 Chinese Automation Congress (CAC) > 4362 - 4367

2017 Chinese Automation Congress (CAC)

Density peak (DP) based clustering algorithm is a recently proposed clustering approach and has been shown to be with great potential. This algorithm is based on the simple assumption that cluster centers have high local density and they are relatively far from each other. This observation is used to isolate cluster centers from other data. By making use of the density relationship among neighboring...

chapter

HDFS framework for efficient frequent itemset mining using MapReduce

Prajakta G. Kulkarni, Shraddha R. Khonde

2017 1st International Conference on Intelligent Systems and Information Management (ICISIM) > 171 - 178

2017 1st International Conference on Intelligent Systems and Information Management (ICISIM)

Association rule mining is a very essential data mining technique in different fields. The enormous development of the information needs increased computational power. To address this issue, it is important to study executions of mining algorithms. To find out the frequent itemsets is an essential and vital issue in numerous information mining applications. There are many algorithms present to extract...

chapter

A hybrid caching scheme based on manifold learning in CCN

Weiyuan Li, Yang Li, Wei Wang, Yonghui Xin, more

MILCOM 2017 - 2017 IEEE Military Communications Conference (MILCOM) > 424 - 429

MILCOM 2017 - 2017 IEEE Military Communications Conference (MILCOM)

Content-Centric Networking (CCN) proposals rethink the communication model around named data. In-network caching is a fundamental feature to distinguish the CCN from the current host-centric IP network. In this paper, we have proposed a hybrid caching scheme which combines the on-path one and the off-path one. We leverage the ISOMAP manifold learning algorithm to distinguish the importance of nodes...

chapter

Shadowed C-means clustering based on approximated feature space

Lingning Kong, Long Chen

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 1758 - 1763

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

The random Fourier Features method has been found very effective in approximating the kernel functions. Our former studies show that through a mixing mechanism of the feature space formed by random Fourier features and certain linear algorithms, the fuzzy clustering results in the approximated feature space are comparable to or even exceed the classical kernel-based algorithms. To increase the robustness...

chapter

A novel clustering algorithm based on searched experiences

Chun-Wei Tsai, Yong-Chun Ding, Ming-Chao Chiang, Chu-Sing Yang

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 804 - 808

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

How to reduce the computation time and how to improve the quality of the clustering result are the two major research issues. Although several efficient and effective clustering algorithms have been presented, none of which is perfect. As such, an effective clustering algorithm, which is based on the prediction of searching information to determine the search directions at later iterations and employs...

chapter

Data mining for social networks open data analysis

Roman E. Spiridonov, Vladislav D. Cvetkov, Oleg M. Yurchik

2017 IEEE II International Conference on Control in Technical Systems (CTS) > 395 - 396

2017 IEEE II International Conference on Control in Technical Systems (CTS)

Social networks are no longer a place where you can spend leisure time and chat with friends. It is also a business instrument in work with their audiences to increase brand recognition, total result from marketing and move sales up. For this purposes it's needed to make thorough analysis of the target audience, scan dozens of user profiles, reveal their interests, positions and estimate users LTV...

chapter

Spectral clustering based on JS-divergence for uncertain data

Yingxu Wang, Jiwen Dong, Jin Zhou, Lin Wang, more

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 1972 - 1975

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

Spectral clustering is one of the most effective methods of data mining, in which the adjacency matrix is constructed by using the similarity matrix. In this paper, to extend spectral clustering method for uncertain data clustering, we propose a new spectral clustering method based on JS-divergence. In the proposed method, the JS-divergence is used to construct the adjacency matrix in the spectral...

Keywords:
CLUSTERING ALGORITHMS
DATA MINING
Publication language:
English
Publication type:
book

Publication date

Set your own date range

Content availability

Available (3,456)
None (74)

Keywords

PATTERN CLUSTERING (1,376)
ALGORITHM DESIGN AND ANALYSIS (1,358)
CLASSIFICATION ALGORITHMS (626)
CLUSTERING (598)
PARTITIONING ALGORITHMS (533)
FEATURE EXTRACTION (377)
ACCURACY (317)
DATABASES (312)
DATA MODELS (257)
PROBABILITY DENSITY FUNCTION (257)
FUZZY SET THEORY (253)
INTERNET (235)
DISTANCE MEASUREMENT (233)
INDEXES (222)
CLUSTERING METHODS (214)
TRAINING (200)
HEURISTIC ALGORITHMS (182)
K-MEANS (173)
COMPLEXITY THEORY (166)
NOISE (165)
OPTIMIZATION (161)
CLUSTERING ALGORITHM (157)
LEARNING (ARTIFICIAL INTELLIGENCE) (153)
PATTERN CLASSIFICATION (153)
COMPUTATIONAL MODELING (146)
SHAPE (146)
MACHINE LEARNING (136)
INFORMATION RETRIEVAL (134)
DATA ANALYSIS (131)
COMPUTERS (128)
TEXT ANALYSIS (127)
IMAGE SEGMENTATION (124)
EDUCATIONAL INSTITUTIONS (122)
ARTIFICIAL NEURAL NETWORKS (121)
ENTROPY (120)
CORRELATION (118)
SPATIAL DATABASES (118)
PREDICTION ALGORITHMS (117)
MATHEMATICAL MODEL (113)
EQUATIONS (111)
GRAPH THEORY (108)
OPTIMISATION (108)
UNSUPERVISED LEARNING (108)
ASSOCIATION RULES (104)
K-MEANS ALGORITHM (104)
FUZZY CLUSTERING (103)
STATISTICAL ANALYSIS (100)
GENETIC ALGORITHMS (98)
VECTORS (96)
APPROXIMATION ALGORITHMS (95)
ITEMSETS (95)
MERGING (95)
CONFERENCES (92)
CLASSIFICATION (90)
DATA CLUSTERING (89)
PIXEL (88)
PROTOTYPES (87)
MACHINE LEARNING ALGORITHMS (86)
CLUSTER ANALYSIS (85)
KERNEL (85)
DISTRIBUTED DATABASES (84)
INTRUSION DETECTION (83)
K-MEANS CLUSTERING (82)
WIRELESS SENSOR NETWORKS (82)
IRIS (80)
COMPUTATIONAL COMPLEXITY (79)
SEARCH ENGINES (79)
VISUALIZATION (79)
WEB PAGES (79)
SEMANTICS (77)
COMMUNITIES (76)
IMAGE COLOR ANALYSIS (76)
TIME SERIES ANALYSIS (76)
PARTICLE SWARM OPTIMIZATION (75)
DATA VISUALIZATION (74)
ELECTRONIC MAIL (74)
SOCIAL NETWORK SERVICES (74)
GENETICS (73)
DOCUMENT HANDLING (72)
MONITORING (72)
PROBABILITY (72)
SIGNAL PROCESSING ALGORITHMS (72)
WEB SITES (72)
PATTERN RECOGNITION (71)
PRINCIPAL COMPONENT ANALYSIS (71)
SECURITY OF DATA (71)
SUPPORT VECTOR MACHINES (70)
BIOINFORMATICS (69)
SOFTWARE (67)
APPROXIMATION METHODS (66)
BIG DATA (66)
CLUSTERING ANALYSIS (66)
CONTEXT (66)
DECISION TREES (66)
MATRIX ALGEBRA (64)
CONVERGENCE (62)
ROBUSTNESS (62)
SEARCH PROBLEMS (62)
more

Data set

ieee (3,528)
Springer (1)
Wiley (1)

INFONA - science communication portal

Search results

A Feasible Direction Method for Optimization Problem with Orthogonal Constraint in Feature Selection

Ship route extraction and clustering analysis based on automatic identification system data

High-Dimensional Density Estimation for Data Mining Tasks

Taming Wild High Dimensional Text Data with a Fuzzy Lash

Discovery of Informal Topics from Post Traumatic Stress Disorder Forums

Efficient Computation of Multiple Density-Based Clustering Hierarchies

Distributed Representations of Subgraphs

Distance and Density Clustering for Time Series Data

AnySCAN: An Efficient Anytime Framework with Active Learning for Large-Scale Network Clustering

Matrix Profile VI: Meaningful Multidimensional Motif Discovery

A cognitive data stream mining technique for context-aware IoT systems

An effective method determining the initial cluster centers for K-means for clustering gene expression data

Exploring risk factors and predicting UPDRS score based on Parkinson's speech signals

A non-parametric density kernel in density peak based clustering

HDFS framework for efficient frequent itemset mining using MapReduce

A hybrid caching scheme based on manifold learning in CCN

Shadowed C-means clustering based on approximated feature space

A novel clustering algorithm based on searched experiences

Data mining for social networks open data analysis

Spectral clustering based on JS-divergence for uncertain data

Filter options

Publication date

Content availability

Keywords

Data set

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options