2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

chapter

Multi-objective clustering ensemble for high-dimensional data based on Strength Pareto Evolutionary Algorithm (SPEA-II)

Abdul Wahid, Xiaoying Gao, Peter Andreae

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 9

Clustering is one of the fundamental data analysis techniques, which aims to find distinct groups of similar objects and discovers hidden structures in data. A recent clustering approach, clustering ensembles tries to derive an improved clustering solution based on previously generated different candidate clustering solutions. Clustering ensembles have two steps: generating multiple candidate clustering...

chapter

Label noise correction methods

Bryce Nicholson, Jing Zhang, Victor S. Sheng, Zhiheng Wang

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 9

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

The important task of correcting label noise is addressed infrequently in literature. The difficulty of developing a robust label correction algorithm leads to this silence concerning label correction. To break the silence, we propose two algorithms to correct label noise. One utilizes self-training to re-label noise, called Self-Training Correction (STC). Another is a clustering-based method, which...

chapter

Learning urban users' choices to improve trip recommendations

Boris Chidlovskii

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 9

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

We analyze the work of urban trip planners and the relevance of trips they recommend upon user queries. We propose to improve the planner recommendations by learning from choices made by travelers who use the transportation network on the daily basis. We analyze a large collection of individual travelers' trips collected from the automated fare collection systems; we convert the trips into pair-wise...

chapter

Hermessem: A semantic-aware framework for the management and analysis of our LifeSteps

Nikos Pelekis, Stylianos Sideridis, Yannis Theodoridis

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

The explosion of available positioning information associated with the inferred or user-declared semantics of the respective locations, already contributes in what is called the big data era, posing new challenges to the mobility data management and mining research community. In this paper, motivated by a series of challenges set in [11], we present a unified framework for the management and the analysis...

chapter

TESS: Temporal event sequence summarization

Dominique Gay, Romain Guigoures, Marc Boulle, Fabrice Clerot

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

We suggest a novel method of clustering and exploratory analysis of temporal event sequences data (also known as categorical time series) based on three-dimensional data grid models. A data set of temporal event sequences can be represented as a data set of three-dimensional points, each point is defined by three variables: a sequence identifier, a time value and an event value. Instantiating data...

chapter

IOHMM for location prediction with missing data

Jiawei Hu, Yanfeng Wang, Ya Zhang

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

In recent years, the widespread adoption of GPS enabled vehicles brings the Location Based Services new opportunities. It benefits many related fields such as urban planning, city traffic modeling, personalized recommendations and driving suggestions. The service providers can understand their users better by modeling the mobility pattern and provide more personalized services by predicting the destination...

chapter

Time series contextual anomaly detection for detecting market manipulation in stock market

Koosha Golmohammadi, Osmar R. Zaiane

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

Anomaly detection in time series is one of the fundamental issues in data mining that addresses various problems in different domains such as intrusion detection in computer networks, irregularity detection in healthcare sensory data and fraud detection in insurance or securities. Although, there has been extensive work on anomaly detection, majority of the techniques look for individual objects that...

chapter

Sentiment and stock market volatility predictive modelling — A hybrid approach

Rapheal Olaniyan, Daniel Stamate, Lahcen Ouarbya, Doina Logofatu

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

The frequent ups and downs are characteristic to the stock market. The conventional standard models that assume that investors act rationally have not been able to capture the irregularities in the stock market patterns for years. As a result, behavioural finance is embraced to attempt to correct these model shortcomings by adding some factors to capture sentimental contagion which may be at play...

chapter

Mining high-utility itemsets with various discount strategies

Jerry Chun-Wei Lin, Wensheng Gan, Philippe Fournier-Viger, Tzung-Pei Hong, more

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

In recent years, mining high-utility itemsets (HUIs) has become as a key topic in data mining. However, most of the developed algorithms assume the unrealistic situations that unit profits of items remain unchanged over time. But in real-life situations, the profit of an item or itemset varies as a function of cost prices, sales prices and sales strategies. In this paper, a novel framework for mining...

chapter

An approach to cover more advertisers in Adwords

Amar Budhiraja, P. Krishna Reddy

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

Advertising through web search engines is one of the modes of online advertising and is described as Adwords problem. In Adwords, advertisers bid on keywords to display advertisements along with corresponding search results. During keyword auction, there is very high competition for the frequent keywords while little to no competition for the less frequent ones. In this paper, we have proposed an...

chapter

Interactive exploration over RDF data using formal concept analysis

Mehwish Alam, Amedeo Napoli

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

With an increased interest in machine processable data, many datasets are now published in RDF (Resource Description Framework) format in Linked Data Cloud. These data are distributed over independent resources which need to be centralized and explored for domain specific applications. This paper proposes a new approach based on interactive data exploration paradigm using Pattern Structures, an extension...

chapter

Nonparametric discovery of online mental health-related communities

Bo Dao, Thin Nguyen, Svetha Venkatesh, Dinh Phung

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

People are increasingly using social media, especially online communities, to discuss mental health issues and seek supports. Understanding topics, interaction, sentiment and clustering structures of these communities informs important aspects of mental health. It can potentially add knowledge to the underlying cognitive dynamics, mood swings patterns, shared interests, and interaction. There has...

chapter

FactorBase : Multi-relational model learning with SQL all the way

Zhensong Qian, Oliver Schulte

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

We describe FactorBase, a new SQL-based framework that leverages a relational database management system to support multi-relational model discovery. A multi-relational statistical model provides an integrated analysis of the heterogeneous and interdependent data resources in the database. We adopt the BayesStore design philosophy: statistical models are stored and managed as first-class citizens...

chapter

Duration models for activity recognition and prediction in buildings using Hidden Markov Models

Antonio Ridi, Nikos Zarkadis, Christophe Gisler, Jean Hennebert

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

Activity recognition and prediction in buildings can have multiple positive effects in buildings: improve elderly monitoring, detect intrusions, maximize energy savings and optimize occupant comfort. In this paper we apply human activity recognition by using data coming from a network of motion and door sensors distributed in a Smart Home environment. We use Hidden Markov Models (HMM) as the basis...

chapter

Multi-class learning using data driven ECOC with deep search and re-balancing

Nathalie Japkowicz, Vincent Barnabe-Lortie, Shawn Horvatic, Jie Zhou

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

Multi-class learning is an important task in Data Science. One of the ways to achieve good performance on this task is to use Error Correcting Output Codes (ECOC), which is a powerful ensemble learning method that transforms a multi-class problem into a series of binary classifiers which it uses indirectly to learn the original multi-class problem. A crucial component of ECOC is the design of the...

chapter

Improved risk predictions via sparse imputation of patient conditions in electronic medical records

Budhaditya Saha, Sunil Gupta, Svetha Venkatesh

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

Electronic Medical Records (EMR) are increasingly used for risk prediction. EMR analysis is complicated by missing entries. There are two reasons — the “primary reason for admission” is included in EMR, but the co-morbidities (other chronic diseases) are left uncoded, and, many zero values in the data are accurate, reflecting that a patient has not accessed medical facilities. A key challenge is to...

chapter

Exploiting big data in time series forecasting: A cross-sectional approach

Claudio Hartmann, Martin Hahmann, Wolfgang Lehner, Frank Rosenthal

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

Forecasting time series data is an integral component for management, planning and decision making. Following the Big Data trend, large amounts of time series data are available from many heterogeneous data sources in more and more applications domains. The highly dynamic and often fluctuating character of these domains in combination with the logistic problems of collecting such data from a variety...

chapter

Compression rate distance measure for time series

Vo Thanh Vinh, Duong Tuan Anh

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

In this work, we propose a Compression Rate Distance, a new distance measure for time series data. The main idea behind this distance is based on the Minimum Description Length (MDL) principle. The higher compression rate between two time series is, the closer they should be. Besides, we also propose a relaxed version of the new distance, called the Extended Compression Rate Distance. The Extended...

chapter

Scalable image annotation using a product compressive sampling approach

Anastasios Maronidis, Elisavet Chatzilari, Spiros Nikolopoulos, Ioannis Kompatsiaris

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

The rise of big data, which need computationally demanding manipulation has posed unprecedented challenges in the machine learning community. In this context, a variety of dimensionality reduction methods has been introduced in order to deal with the large-scale aspect of the data. However, their employment in very large scales often becomes impractical due to memory and computation limitations. In...

chapter

Constrained independence for detecting interesting patterns

Thomas Delacroix, Ahcene Boubekki, Philippe Lenca, Stephane Lallich

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 10

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

Among other criteria, a pattern may be interesting if it is not redundant with other discovered patterns. A general approach to determining redundancy is to consider a probabilistic model for frequencies of patterns, based on those of patterns already mined, and compare observed frequencies to the model. Such probabilistic models include the independence model, partition models or more complex models...

INFONA - science communication portal

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

Multi-objective clustering ensemble for high-dimensional data based on Strength Pareto Evolutionary Algorithm (SPEA-II)

Label noise correction methods

Learning urban users' choices to improve trip recommendations

Hermessem: A semantic-aware framework for the management and analysis of our LifeSteps

TESS: Temporal event sequence summarization

IOHMM for location prediction with missing data

Time series contextual anomaly detection for detecting market manipulation in stock market

Sentiment and stock market volatility predictive modelling — A hybrid approach

Mining high-utility itemsets with various discount strategies

An approach to cover more advertisers in Adwords

Interactive exploration over RDF data using formal concept analysis

Nonparametric discovery of online mental health-related communities

FactorBase : Multi-relational model learning with SQL all the way

Duration models for activity recognition and prediction in buildings using Hidden Markov Models

Multi-class learning using data driven ECOC with deep search and re-balancing

Improved risk predictions via sparse imputation of patient conditions in electronic medical records

Exploiting big data in time series forecasting: A cross-sectional approach

Compression rate distance measure for time series

Scalable image annotation using a product compressive sampling approach

Constrained independence for detecting interesting patterns

Filter options

Publication date

Keywords

INFONA - science communication portal

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)