Search results

chapter

Improved fuzzy space-intervals based sequential pattern mining: Technical solution

Harsha Nair, E A Neeba

2015 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC) > 1 - 4

2015 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC)

One of the sub areas of the data mining includes sequential pattern mining. This mining algorithm is to find the repeating patterns after mining the sequence databases. These are used to find the relation between the various items in the data for different purposes. As these data keep changing according to the change in time, mining should be done on incremented or updated database to obtain the frequent...

chapter

Margin-based active subspace clustering

John Lipor, Laura Balzano

2015 IEEE 6th International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP) > 377 - 380

2015 IEEE 6th International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP)

Subspace clustering has typically been approached as an unsupervised machine learning problem. However in several applications where the union of subspaces model is useful, it is also reasonable to assume you have access to a small number of labels. In this paper we investigate the benefit labeled data brings to the subspace clustering problem. We focus on incorporating labels into the k-subspaces...

chapter

A Datalog Engine for Iterative Graph Algorithms on Large Clusters

Jacek Sroka, Marek Rogala, Michal Adamczyk, Jan Hidders

2015 IEEE International Conference on Data Science and Data Intensive Systems > 113 - 114

2015 IEEE International Conference on Data Science and Data Intensive Systems (DSDIS)

Distributed computations on graphs gained importance with the emergence of large graphs, e.g., in the web or social networks. Frameworks like Hadoop, Giraph and Spark are used for their processing. Yet, they require advanced programming techniques to minimize skew and data shuffling. Declarative, query-like, but at the same time efficient solutions like Pig for general purpose analytics are lacking...

chapter

A novel approach for imputation of missing values for mining medical datasets

Yelipe UshaRani, P. Sammulal

2015 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC) > 1 - 8

2015 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC)

Imputation of missing attribute values in medical datasets for extracting hidden knowledge from medical datasets is an interesting research topic of interest which is very challenging. One cannot eliminate missing values in medical records. The reason may be because some tests may not been conducted as they are cost effective, values missed when conducting clinical trials, values may not have been...

chapter

Distributed data warehouse — Experimentation with TPC-DS

Sagar Yeruva, P. V. Kumar, P. Padmanabham

2015 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC) > 1 - 5

2015 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC)

Distributed computing, data availability and data analytics supporting for strategic decision making are the essential key requirements for success of any organization business. These features are the leading frontiers in the current business and in research which opens lot of expectations by the end users. This paper attempts a design methodology for distributing the current data warehouse features...

chapter

A novel approach to solve K-center problems with geographical placement

Peter Hillmann, Tobias Uhlig, Gabi Dreo Rodosek, Oliver Rose

2015 IEEE International Conference on Service Operations And Logistics, And Informatics (SOLI) > 31 - 36

2015 IEEE International Conference on Service Operations And Logistics, And Informatics (SOLI)

The facility location problem is a well-known challenge in logistics that is proven to be NP-hard. In this paper we specifically simulate the geographical placement of facilities to provide adequate service to customers. Determining reasonable center locations is an important challenge for a management since it directly effects future service costs. Generally, the objective is to place the central...

chapter

Clustering Evolving Batch System Jobs for Online Anomaly Detection

Eileen Kuehn

2015 IEEE International Conference on Data Mining Workshop (ICDMW) > 1534 - 1535

2015 IEEE International Conference on Data Mining Workshop (ICDMW)

In batch systems monitoring information at the level of individual jobs is crucial to optimize resource utilization and prevent misusage. However, especially the usage of network resources is difficult to track. In order to understand usage patterns in modern computing clusters, a more detailed monitoring than existent solutions is required. A monitoring on job level leads to dynamic graphs of processes...

chapter

Fast Community Discovery and Its Evolution Tracking in Time-Evolving Social Networks

Yao Liu, Hong Gao, Xiaohui Kang, Qiao Liu, more

2015 IEEE International Conference on Data Mining Workshop (ICDMW) > 13 - 20

2015 IEEE International Conference on Data Mining Workshop (ICDMW)

In real world, social networks are large scale, noisy and evolutionary. Communities are inherent characteristics of human interaction in social networks. Tracking evolutionary communities in dynamic social networks has become an increasingly important research topic. Several classic incremental clustering and evolutionary clustering algorithms have been proposed. But they all face a problem of controlling...

chapter

Finding Subspace Clusters Using Ranked Neighborhoods

Emin Aksehirli, Siegfried Nijssen, Matthijs van Leeuwen, Bart Goethals

2015 IEEE International Conference on Data Mining Workshop (ICDMW) > 831 - 838

2015 IEEE International Conference on Data Mining Workshop (ICDMW)

Clustering high dimensional datasets is challenging due to the curse of dimensionality. One approach to address this challenge is to search for subspace clusters, i.e., clusters present in subsets of attributes. Recently the cartification algorithm was proposed to find such subspace clusters. The distinguishing feature of this algorithm is that it operates on a neighborhood database, in which for...

chapter

Unsupervised Learning Techniques for Detection of Regions of Interest in Solar Images

Juan M. Banda, Rafal A. Angryk

2015 IEEE International Conference on Data Mining Workshop (ICDMW) > 582 - 588

2015 IEEE International Conference on Data Mining Workshop (ICDMW)

Identifying regions of interest (ROIs) in images is a very active research problem as it highly depends on the types and characteristics of images. In this paper we present a comparative evaluation of unsupervised learning methods, in particular clustering, to identify ROIs in solar images from the Solar Dynamics Observatory (SDO) mission. With the purpose of finding regions within the solar images...

chapter

Email Engagement Segmentation Using Bipartite Graph Co-clustering

Ketong Wang, Aaron Beach

2015 IEEE International Conference on Data Mining Workshop (ICDMW) > 540 - 546

2015 IEEE International Conference on Data Mining Workshop (ICDMW)

In the industry of email marketing, it is important to send content relevant to the recipient. If the recipients are uninterested they may ignore the email or worse report it as spam. Such actions compromise the ability of the senders to deliver emails to the inboxes of other recipients and permanently harm their relationship with the uninterested recipients. Targeting highly engaged recipients with...

chapter

Paradigmatic Clustering for NLP

Julio Santisteban, Javier Tejada-Carcamo

2015 IEEE International Conference on Data Mining Workshop (ICDMW) > 814 - 820

2015 IEEE International Conference on Data Mining Workshop (ICDMW)

How can we retrieve meaningful information from a large and sparse graph?. Traditional approaches focus on generic clustering techniques and discovering dense cumulus in a network graph, however, they tend to omit interesting patterns such as the paradigmatic relations. In this paper, we propose a novel graph clustering technique modelling the relations of a node using the paradigmatic analysis. We...

chapter

Modeling the learning behaviors of massive open online courses

Zhenhui Liu, Jingjing He, Yufei Xue, Zhenzhong Huang, more

2015 IEEE International Conference on Big Data (Big Data) > 2883 - 2885

2015 IEEE International Conference on Big Data (Big Data)

With the help of Internet, Massive Open Online Courses (MOOC) are recognized as a new path to learn courses via the web instead of in the traditional classrooms. MOOC can break many limits such as distance, time, participants, on the traditional courses. At the same time, it brings some new issues, such as high drop out ratio. Nowadays increasing MOOC courses are available and even more common people...

chapter

Big data entity resolution: From highly to somehow similar entity descriptions in the Web

Vasilis Efthymiou, Kostas Stefanidis, Vassilis Christophides

2015 IEEE International Conference on Big Data (Big Data) > 401 - 410

2015 IEEE International Conference on Big Data (Big Data)

In the Web of data, entities are described by interlinked data rather than documents on the Web. In this work, we focus on entity resolution in the Web of data, i.e., identifying descriptions that refer to the same real-world entity. To reduce the required number of pairwise comparisons, methods for entity resolution perform blocking as a pre-processing step. A blocking technique places similar entity...

chapter

League Championship Algorithm for clustering

Sangeeta Yadav, Satyasai Jagannath Nanda

2015 IEEE Power, Communication and Information Technology Conference (PCITC) > 321 - 326

2015 IEEE Power, Communication and Information Technology Conference (PCITC)

In the last decade the nature inspired algorithms have gained a lot of popularity in solving complex optimization problems. Partitional clustering deals with the optimization of data points from the cluster centroids to classify a dataset into several groups (clusters). In this paper we introduce clustering as an optimization problem and solve it with a recently developed natural meta-heuristic League...

chapter

Double segmentation method for brain region using FCM and graph cut for CT scan images

Chuen Rue Ng, Joel C.M. Than, Norliza Mohd Noor, Omar Mohd Rijal

2015 IEEE International Conference on Signal and Image Processing Applications (ICSIPA) > 443 - 446

2015 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)

In the field of neuropsychiatrie disorders, it is known that brain segmentation is important for both detection and diagnosis. The segmentation of the brain, which leads to the computation of brain volume proved to be vital in the detection of many brain pathology having Computed Tomography (CT) scan as the primary modality. Due to the fact that Fuzzy c-Means (FCM) proven to be robust, it is often...

chapter

Infrastructure-less collaborative indoor positioning for time critical operations

Ankita Tondwalkar

2015 IEEE Power, Communication and Information Technology Conference (PCITC) > 834 - 838

2015 IEEE Power, Communication and Information Technology Conference (PCITC)

Localisation defines the process of determining the topographical location of sensor nodes in wireless sensor network. Current localisation method is space oriented which adopts GPS (Global Positioning System) signal to yield the location information of the nodes. The GPS signals are suited for outdoor environments, however they fail to work indoors, which makes the need to opt for terrestrial localisation...

chapter

An efficient brain mass detection with adaptive clustered based fuzzy C-mean and thresholding

Said E. El-Khamy, Rowayda A. Sadek, Mohamed A. El-Khoreby

2015 IEEE International Conference on Signal and Image Processing Applications (ICSIPA) > 429 - 433

2015 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)

Image segmentation plays an important role in analyzing medical images. Brain tumor detection is one of the applications that require brain image segmentation. Due to the complex nature of brain magnetic resonance images (MRI), the accurate computer aided detection (CAD) system for brain tumor segmentation has a lot of advantages over manual segmentation as it requires a lot of time and its results...

chapter

Adaptive regularized diffusion adaptation over multitask networks

Sadaf Monajemi, Saeid Sanei, Sim-Heng Ong, Ali H. Sayed

2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 5

2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP)

The focus of this paper is on multitask learning over adaptive networks where different clusters of nodes have different objectives. We propose an adaptive regularized diffusion strategy using Gaussian kernel regularization to enable the agents to learn about the objectives of their neighbors and to ignore misleading information. In this way, the nodes will be able to meet their objectives more accurately...

chapter

Leveraging client-side DNS failure patterns to identify malicious behaviors

Pengkui Luo, Ruben Torres, Zhi-Li Zhang, Sabyasachi Saha, more

2015 IEEE Conference on Communications and Network Security (CNS) > 406 - 414

2015 IEEE Conference on Communications and Network Security (CNS)

DNS has been increasingly abused by adversaries for cyber-attacks. Recent research has leveraged DNS failures (i.e. DNS queries that result in a Non-Existent-Domain response from the server) to identify malware activities, especially domain-flux botnets that generate many random domains as a rendezvous technique for command-&-control. Using ISP network traces, we conduct a systematic analysis...

INFONA - science communication portal

Search results

Improved fuzzy space-intervals based sequential pattern mining: Technical solution

Margin-based active subspace clustering

A Datalog Engine for Iterative Graph Algorithms on Large Clusters

A novel approach for imputation of missing values for mining medical datasets

Distributed data warehouse — Experimentation with TPC-DS

A novel approach to solve K-center problems with geographical placement

Clustering Evolving Batch System Jobs for Online Anomaly Detection

Fast Community Discovery and Its Evolution Tracking in Time-Evolving Social Networks

Finding Subspace Clusters Using Ranked Neighborhoods

Unsupervised Learning Techniques for Detection of Regions of Interest in Solar Images

Email Engagement Segmentation Using Bipartite Graph Co-clustering

Paradigmatic Clustering for NLP

Modeling the learning behaviors of massive open online courses

Big data entity resolution: From highly to somehow similar entity descriptions in the Web

League Championship Algorithm for clustering

Double segmentation method for brain region using FCM and graph cut for CT scan images

Infrastructure-less collaborative indoor positioning for time critical operations

An efficient brain mass detection with adaptive clustered based fuzzy C-mean and thresholding

Adaptive regularized diffusion adaptation over multitask networks

Leveraging client-side DNS failure patterns to identify malicious behaviors

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options