2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

chapter

An unsupervised attribute clustering algorithm for unsupervised feature selection

Pei-Yuan Zhou, Keith C. C. Chan

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 7

The curse of dimensionality refers to the problem that one faces when analyzing datasets with thousands or hundreds of thousands of attributes. This problem is usually tackled by different feature selection methods which have been shown to effectively reduce computation time, improve prediction performance, and facilitate better understanding of datasets in various application areas. These methods...

chapter

Anomaly detection in ECG time signals via deep long short-term memory networks

Sucheta Chauhan, Lovekesh Vig

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 7

Electrocardiography (ECG) signals are widely used to gauge the health of the human heart, and the resulting time series signal is often analyzed manually by a medical professional to detect any arrhythmia that the patient may have suffered. Much work has been done to automate the process of analyzing ECG signals, but most of the research involves extensive preprocessing of the ECG data to derive vectorized...

chapter

Improved approach for protein function prediction by exploiting prominent proteins

D. Satheesh Kumar, P. Krishna Reddy

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 7

Protein-protein interaction (PPI) networks are a valuable biological data source that contains rich information useful for protein function prediction. The PPI network data set obtained from high-throughput experiments is known to be noisy and incomplete. By modeling PPI data as a graph, research efforts are being made in the literature to improve the performance of protein function prediction by extending...

chapter

MapReduce-based k-prototypes clustering method for big data

Mohamed Aymen Ben Haj Kacem, Chiheb-Eddine Ben N'cir, Nadia Essoussi

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 7

Big data clustering has recently become a challenging task in many application domains. Traditional clustering methods cannot deal with large-scale data. Furthermore, big data are often characterized by mixed types of data, including numerical and categorical attributes. Thus, we propose in this paper the parallelization of the k-prototypes clustering method (MR-KP) using...

chapter

Cluster-based data oriented hashing

Sanaa Chafik, Imane Daoudi, Mounim A. El Yacoubi, Hamid El Ouardi

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 7

Many multidimensional hashing schemes have been actively studied in recent years, providing efficient nearest neighbor search. Generally, we can distinguish several hashing families, such as learning-based hashing, which provides better hash function selectivity by learning the dataset distribution. The spatial hashing family proposes a suitable partition of the multidimensional space, more adapted...

chapter

Ensemble of deep long short term memory networks for labelling origin of replication sequences

Urminder Singh, Sucheta Chauhan, A. Krishnamachari, Lovekesh Vig

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 7

Advances in sequence data generation technologies are churning out voluminous omics data and posing a massive challenge for annotating biological functional features. Sequence data from the well-studied model organism Saccharomyces cerevisiae has been commonly used to test and validate in silico prediction methods. DNA replication is a critical step in the cellular process and the sequence location...

chapter

MIAT: A novel attribute selection approach to better predict upper gastrointestinal cancer

Avi Rosenfeld, David G. Graham, Rifat Hamoudi, Rommel Butawan, et al.

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 7

The use of data mining has led to many significant medical discoveries. However, many challenges still exist in using these methods for knowledge discovery within this field, given that the large amounts of data medical practitioners collect often create a curse of dimensionality. To address this challenge, attribute selection approaches have been developed. However, current approaches typically put...

chapter

Detecting irony and sarcasm in microblogs: The role of expressive signals and ensemble classifiers

Elisabetta Fersini, Federico Alberto Pozzi, Enza Messina

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 8

The automatic detection of sarcasm and irony in user-generated content is one of the most challenging tasks in Natural Language Processing. In this paper we address this problem by introducing Bayesian Model Averaging (BMA), an ensemble approach that takes into account several classifiers according to their reliabilities and their marginal probability predictions. The impact of the most used expressive...

chapter

An accurate rating aggregation method for generating item reputation

Ahmad Abdel-Hafez, Yue Xu, Audun Josang

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 8

Many websites presently allow users to rate item quality based on their opinions. These ratings are later used to produce item reputation scores. The majority of websites apply the mean method to aggregate user ratings. This method is very simple and is not considered an accurate aggregator. Many methods have been proposed to make aggregators produce more accurate reputation scores...

chapter

LDA based semi-supervised learning from streaming short text

Ji-De Chen, Hung-Yu Kao

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 8

With the rapid growth of real-time social media such as Twitter, many users share and discuss topics of interest through such platforms. A hashtag is a type of metadata tag that allows users to annotate the topics of their tweets. In research, for example, hashtags can improve the performance of event detection through observation of hashtag trends. Although Twitter grows rapidly, hashtag growth is...

chapter

A text block context informations based multiple Web contents extraction

Wonmoon Song, Myungwon Kim

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 8

In a Web environment, providing Web services appropriate to users' needs makes it important to quickly and accurately extract contents such as main-content, menu-list, article-list, comments, and so on from Web documents. In this paper, we propose an efficient method that extracts various contents from Web documents. In the method, text blocks are separated from the document and context information...

chapter

Integrating spatial information into probabilistic relational models

Rajani Chulyadyo, Philippe Leray

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 8

The growing trend of using spatial information in various domains has increased the need for spatial data analysis. As spatial data analysis involves the study of interactions between spatial objects, Probabilistic Relational Models (PRMs) can be a good choice for modeling probabilistic dependencies between such objects. However, standard PRMs do not support spatial objects. Here, we present a general...

chapter

Scalable extraction of timeline information from road traffic data using MapReduce

Ardi Imawan, Fadhilah Kurnia Putri, Seonga An, Han-You Jeong, et al.

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 8

Due to the increasing number of vehicles in recent years, traffic congestion is a common issue for residents of metropolises. For a better understanding of traffic congestion, data analyzed with big data technology can be provided as timeline information. However, a scalability problem arises when converting raw traffic data into timeline information due to the volume and complexity...

chapter

Dynamics of multi-campaign propagation in online social networks

M Thejaswi, Sriniketh Vijayaraghavan, Avinash Das, P. Santhi Thilagam

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 8

Ever since the advent of online social networking, people have been voluntarily posting and consuming information on the web. This new method of digital communication provides the means to spread information considerably far in a very short span of time with minimal resources. Social networks are increasingly being used to spread misinformation online due to low costs in organizing grassroots of these...

chapter

Evaluating and predicting energy consumption of data mining algorithms on mobile devices

Carmela Comito, Domenico Talia

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 8

The pervasive availability of increasingly powerful mobile computing devices such as PDAs, smartphones, and wearable sensors is widening their use in complex applications such as collaborative analysis, information sharing, and data mining in a mobile context. Energy characterization plays a critical role in determining the requirements of data-intensive applications that can be efficiently executed...

chapter

Error detection of oceanic observation data using sequential labeling

Satoshi Ono, Haruki Matsuyama, Ken-ichi Fukui, Shigeki Hosoda

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 8

Argo, a global ocean monitoring system with more than 3,600 small, lightweight drifting buoys, continuously measures oceanic temperature and salinity. The accumulated big ocean observation data helps many studies, such as investigations into climate change mechanisms. Although human experts visually confirm and revise quality control (QC) labels, it is difficult to regularize the...

chapter

Multi-task learning with selective cross-task transfer for predicting bleeding and other important patient outcomes

Che Ngufor, Sudhindra Upadhyaya, Dennis Murphree, Daryl Kor, et al.

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 8

In blood transfusion studies, it is often desirable before a surgical procedure to estimate the likelihood of a patient bleeding, the need for blood products, re-operation due to bleeding, and other important patient outcomes. Such prediction rules are crucial in allowing for optimal planning, more efficient use of blood bank resources, and identification of high-risk patient cohorts for specific perioperative...

chapter

A combination of CUSUM-EWMA for Anomaly Detection in time series data

Vyron Christodoulou, Yaxin Bi

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 8

In this work we investigate the use of parametric statistical methods for Anomaly Detection in time series data. The approach involves the use of simple and computationally efficient algorithms, the Cumulative Sum (CUSUM) and Exponentially Weighted Moving Average (EWMA), that have demonstrated an acceptable performance in detecting different shifts from the process mean. However, while the performance...
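
For readers unfamiliar with the two control-chart statistics named in this abstract, the following is a minimal sketch of their textbook forms. It does not reproduce the authors' combined method, which the truncated abstract does not describe; the function names, the smoothing weight `lam`, the slack `k`, and the threshold `h` are illustrative assumptions.

```python
def ewma(xs, lam=0.2):
    """Textbook EWMA: z_t = lam * x_t + (1 - lam) * z_{t-1}, seeded with xs[0].

    `lam` is an assumed smoothing weight, not a value from the paper.
    """
    z = xs[0]
    smoothed = []
    for x in xs:
        z = lam * x + (1 - lam) * z
        smoothed.append(z)
    return smoothed


def cusum_alarms(xs, target, k=0.5, h=4.0):
    """Two-sided tabular CUSUM around a known process mean `target`.

    `k` (slack) and `h` (decision threshold) are assumed values.
    Returns the indices at which either cumulative sum exceeds h;
    both sums are reset to zero after each alarm.
    """
    upper = lower = 0.0
    alarms = []
    for i, x in enumerate(xs):
        upper = max(0.0, upper + (x - target - k))  # accumulates upward shifts
        lower = max(0.0, lower + (target - x - k))  # accumulates downward shifts
        if upper > h or lower > h:
            alarms.append(i)
            upper = lower = 0.0
    return alarms


# Hypothetical series: ten in-control points, then a sustained upward shift.
series = [0.0] * 10 + [3.0] * 5
print(cusum_alarms(series, target=0.0))
print(ewma([0.0, 10.0], lam=0.5))
```

In this sketch CUSUM accumulates deviations beyond the slack `k` and raises an alarm once the sum passes `h`, while EWMA smooths the series so that small persistent shifts become visible; how the paper combines the two is not stated in the visible text.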

chapter

Cascading adverse drug event detection in electronic health records

Jing Zhao, Aron Henriksson, Henrik Bostrom

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 8

The ability to detect adverse drug events (ADEs) in electronic health records (EHRs) is useful in many medical applications, such as alerting systems that indicate when an ADE-specific diagnosis code should be assigned. Automating the detection of ADEs can be attempted by applying machine learning to existing, labeled EHR data. How to do this in an effective manner is, however, an open question. The...

chapter

Modeling heterogeneous clinical sequence data in semantic space for adverse drug event detection

Aron Henriksson, Jing Zhao, Henrik Bostrom, Hercules Dalianis

2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 1 - 8

The enormous amounts of data that are continuously recorded in electronic health record systems offer ample opportunities for data science applications to improve healthcare. There are, however, challenges involved in using such data for machine learning, such as high dimensionality and sparsity, as well as an inherent heterogeneity that does not allow the distinct types of clinical data to be treated...
