Search results

chapter

Accelerating Exact Similarity Search on CPU-GPU Systems

Takazumi Matsumoto, Man Lung Yiu

2015 IEEE International Conference on Data Mining > 320 - 329

2015 IEEE International Conference on Data Mining (ICDM)

In recent years, the use of Graphics Processing Units (GPUs) for data mining tasks has become popular. With modern processors integrating both CPUs and GPUs, it is also important to consider what tasks benefit from GPU processing and which do not, and apply a heterogeneous processing approach to improve the efficiency where applicable. Similarity search, also known as k-nearest neighbor search, is...

chapter

A Unified Gradient Regularization Family for Adversarial Examples

Chunchuan Lyu, Kaizhu Huang, Hai-Ning Liang

2015 IEEE International Conference on Data Mining > 301 - 309

2015 IEEE International Conference on Data Mining (ICDM)

Adversarial examples are augmented data points generated by imperceptible perturbation of input samples. They have recently drawn much attention with the machine learning and data mining community. Being difficult to distinguish from real examples, such adversarial examples could change the prediction of many of the best learning models including the state-of-the-art deep learning models. Recent attempts...

chapter

Online pattern mining for high-dimensional data streams

Yoshitaka Yamamoto, Koji Iwanuma

2015 IEEE International Conference on Big Data (Big Data) > 2880 - 2882

2015 IEEE International Conference on Big Data (Big Data)

This paper studies one-scan approximation algorithms for streaming data mining (SDM). Despite of the importance of pattern discovery in streaming data, this issue has not sufficiently addressed yet in the big data community. In this context, we briefly review the previously proposed SDM methods. There is a recent work to improve their limitation using the tecnique of online compression. It is based...

chapter

Selecting representative instances from datasets

Seyed Hamid Mirisaee, Ahlame Douzal, Alexandre Termier

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

We propose in this paper a new, alternative approach for the problem of finding a set of representative objects in large datasets. To do so, we first formulate the general Instance Selection Problem (ISP) and then study three variants of that in order to select instances from different regions of the data. These variants aim at finding the objects located in three very different locations of the data:...

chapter

Features of information flows in the backbone Internet-channel: the analysis of the statistical characteristics of the relationship between the number of packets and the time

S.V. Porshnev, A.S. Koposov, D.A. Bozhalkin

2015 9th International Conference on Application of Information and Communication Technologies (AICT) > 437 - 440

2015 9th International Conference on Application of Information and Communication Technologies (AICT)

The flows of traffic dumps of high-speed Internet backbone channel were analyzed. Streams were classified into three groups according to the amount of transmitted information. Density function was calculated for the number of packets transmitted by different classes of flows (time series) according to the method of image sources and the Rosenblatt-Parzen approximation. The obtained results show non-stationarity...

chapter

Mining incomplete data with many attribute-concept values and "do not care" conditions

Patrick G. Clark, Jerzy W. Grzymala-Busse

2015 IEEE International Conference on Big Data (Big Data) > 1597 - 1602

2015 IEEE International Conference on Big Data (Big Data)

In this paper we present novel experimental results comparing two interpretations of missing attribute values: attribute-concept values and "do not care" conditions. Experiments were conducted on 12 data sets with many missing attribute values using the MLEM2 rule induction system. In the experiments, three kinds of probabilistic approximations were used: singleton, subset and concept; with...

chapter

Improved algorithms for exact and approximate boolean matrix decomposition

Yuan Sun, Shiwei Ye, Yi Sun, Tsunehiko Kameda

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

An arbitrary m×n Boolean matrix M can be decomposed exactly as M = U○V, where U (resp. V) is an m×k (resp. k ×n) Boolean matrix and ○ denotes the Boolean matrix multiplication operator. We first prove an exact formula for the Boolean matrix J such that M = M○JT holds, where J is maximal in the sense that if any 0 element in J is changed to a 1 then this equality no longer holds. Since minimizing k...

chapter

REODM: Identify Local Outliers in Big Data

Yongchang Gao, Haowen Guan, Bin Gong

2015 IEEE International Conference on Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing > 825 - 830

2015 IEEE International Conference on Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing (CIT/IUCC/DASC/PICOM)

Outlier detection is now widely used in various fields. It attracts more and more interests in research. The density based outlier detection methods and the distance based outlier detection methods are the most frequently used outlier detection methods. In big data, the size and dimensions of data is very large. Those features make the conventional methods not suitable for big data. According to the...

chapter

Finding community structure via rough K-means in social network

Yunlei Zhang, Bin Wu

2015 IEEE International Conference on Big Data (Big Data) > 2356 - 2361

2015 IEEE International Conference on Big Data (Big Data)

Much of the data of scientific interest, particularly when independence of data is not assumed, can be represented in the form of networks where data nodes are joined together to form edges corresponding to some kind of associations or relationships. Such information networks abound, like protein interactions in biology, web page hyperlink connections in information retrieval on the Web, cellphone...

chapter

A New Approach to Attribute Reduction of Covering Information System

Fachao Li, Jinning Yang

2015 International Conference on Computer Science and Mechanical Automation (CSMA) > 180 - 184

2015 International Conference on Computer Science and Mechanical Automation (CSMA)

This paper focuses on eliminating data redundancy of covering information system. Firstly, we enumerate several usual covering reduction methods and analysis their contact among them. Secondly, we take the attributes value of explicit and implicit value into consider, and we obtain a network topology (shorthand as NT) of covering information system. Through NT, we turn the covering information system...

chapter

Efficient Distributed Data Clustering on Spark

Jia Li, Dongsheng Li, Yiming Zhang

2015 IEEE International Conference on Cluster Computing > 504 - 505

2015 IEEE International Conference on Cluster Computing (CLUSTER)

Data clustering is usually time-consuming since it by default needs to iteratively aggregate and process large volume of data. Approximate aggregation based on sample provides fast and quality ensured results. In this paper, we propose to leverage approximation techniques to data clustering to obtain the trade-off between clustering efficiency and result quality, along with online accuracy estimation...

chapter

Hybrid generalized additive neuro-fuzzy system and its adaptive learning algorithms

Yevgeniy Bodyanskiy, Galina Setlak, Dmytro Peleshko, Olena Vynokurova

2015 IEEE 8th International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS) > 1 > 328 - 333

2015 IEEE 8th International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS)

In this paper we propose architecture of hybrid generalized additive neuro-fuzzy system. Such system is hybrid of the neuro-fuzzy system of Wang-Mendel and the generalized additive models of Hastie-Tibshirani. Proposed hybrid generalized additive neuro-fuzzy system can be used for solving different tasks of computational intelligence and data stream mining. The results of experimental modelling confirm...

chapter

Learning opposites with evolving rules

H.R. Tizhoosh, S. Rahnamayan

2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE) > 1 - 8

2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)

The idea of opposition-based learning was introduced 10 years ago. Since then a noteworthy group of researchers has used some notions of oppositeness to improve existing optimization and learning algorithms. Among others, evolutionary algorithms, reinforcement agents, and neural networks have been reportedly extended into their “opposition-based” version to become faster and/or more accurate. However,...

chapter

Pruned search: A machine learning based meta-heuristic approach for constrained continuous optimization

Ruoqian Liu, Ankit Agrawal, Wei-keng Liao, Alok Choudhary, more

2015 Eighth International Conference on Contemporary Computing (IC3) > 13 - 18

2015 Eighth International Conference on Contemporary Computing (IC3)

Searching for solutions that optimize a continuous function can be difficult due to the infinite search space, and can be further complicated by the high dimensionality in the number of variables and complexity in the structure of constraints. Both deterministic and stochastic methods have been presented in the literature with a purpose of exploiting the search space and avoiding local optima as much...

chapter

A randomized proper orthogonal decomposition technique

Dan Yu, Suman Chakravorty

2015 American Control Conference (ACC) > 1137 - 1142

2015 American Control Conference (ACC)

In this paper, we consider the problem of model reduction of large scale systems, such as those obtained through the discretization of PDEs. We propose a randomized proper orthogonal decomposition (RPOD) technique to obtain the reduced order models by randomly choosing a subset of the inputs/outputs of the system to construct a suitable small sized Hankel matrix from the full Hankel matrix. It is...

chapter

Trend feature-based clustering for research funding time series data

Ma Yixuan, Gao Xuedong, Pan Baoxiang

2015 International Conference on Logistics, Informatics and Service Sciences (LISS) > 1 - 5

2015 International Conference on Logistics, Informatics and Service Sciences (LISS)

This paper presents an efficient computational method for time series clustering and application concerning research funding of universities directly under Minster of Education of People Republic of China. Presented approach was based on extraction of trend features with Haar wavelet decomposition from time series data and their use in feature-based agglomerative hierarchical clustering of monthly...

chapter

Approximating flow-sensitive pointer analysis using frequent itemset mining

Vaivaswatha Nagaraj, R. Govindarajan

2015 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) > 225 - 234

2015 IEEE/ACM International Symposium on Code Generation and Optimization (CGO)

Pointer alias analysis is a well researched problem in the area of compilers and program verification. Many recent works in this area have focused on flow-sensitivity due to the additional precision it offers. However, a flow-sensitive analysis is computationally expensive, thus, preventing its use in larger programs. In this work, we observe that a number of object sets, consisting of tens to hundreds...

chapter

A generic predictive knowledge management model for fisheries with special emphasis to the catch of oil-sardine along the south-west coast of India

Bruce Mathew, P. Sumathi

2015 International Conference on Advanced Computing and Communication Systems > 1 - 6

2015 International Conference on Advanced Computing and Communication Systems (ICACCS)

Knowledge Management is a very hot domain these days due to the increasing asset value for knowledge available within the organization. These days, knowledge is treated as a vital asset that can increase organization's competitive advantage. The potential that knowledge management has for improving fisheries management is increasingly being recognized. There are relatively few studies that have specifically...

chapter

Rough — Granular neural network model for making treatment decisions of Hepatitis C

Mohammed M. Eissa, Mohammed Elmogy, Mohammed Hashem

2014 9th International Conference on Informatics and Systems > DEKM-19 - DEKM-26

2014 9th International Conference on Informatics and Systems (INFOS)

Hepatitis C virus is a massive health issue affecting significant portions of the world's population. Applying data preprocessing, feature reduction techniques, and generating rules based on the selected features for classification tasks are considered as important steps in the knowledge discovery in databases. This paper highlights a Rough-Granular Neural Networks model that incorporates Rough Sets...

chapter

Mp-Dissimilarity: A Data Dependent Dissimilarity Measure

Sunil Aryal, Kai Ming Ting, Gholamreza Haffari, Takashi Washio

2014 IEEE International Conference on Data Mining > 707 - 712

2014 IEEE International Conference on Data Mining (ICDM)

Nearest neighbour search is a core process in many data mining algorithms. Finding reliable closest matches of a query in a high dimensional space is still a challenging task. This is because the effectiveness of many dissimilarity measures, that are based on a geometric model, such as lp-norm, decreases as the number of dimensions increases. In this paper, we examine how the data distribution can...

INFONA - science communication portal

Search results

Accelerating Exact Similarity Search on CPU-GPU Systems

A Unified Gradient Regularization Family for Adversarial Examples

Online pattern mining for high-dimensional data streams

Selecting representative instances from datasets

Features of information flows in the backbone Internet-channel: the analysis of the statistical characteristics of the relationship between the number of packets and the time

Mining incomplete data with many attribute-concept values and "do not care" conditions

Improved algorithms for exact and approximate boolean matrix decomposition

REODM: Identify Local Outliers in Big Data

Finding community structure via rough K-means in social network

A New Approach to Attribute Reduction of Covering Information System

Efficient Distributed Data Clustering on Spark

Hybrid generalized additive neuro-fuzzy system and its adaptive learning algorithms

Learning opposites with evolving rules

Pruned search: A machine learning based meta-heuristic approach for constrained continuous optimization

A randomized proper orthogonal decomposition technique

Trend feature-based clustering for research funding time series data

Approximating flow-sensitive pointer analysis using frequent itemset mining

A generic predictive knowledge management model for fisheries with special emphasis to the catch of oil-sardine along the south-west coast of India

Rough — Granular neural network model for making treatment decisions of Hepatitis C

Mp-Dissimilarity: A Data Dependent Dissimilarity Measure

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options