2017 IEEE International Conference on Data Mining Workshops (ICDMW)

chapter

Sentiment Extraction from Consumer-Generated Noisy Short Texts

Hardik Meisheri, Kunal Ranjan, Lipika Dey

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 399 - 406

Sentiment analysis or recognizing emotions from short and noisy text from social networks such as twitter has been a challenging task. Most of the existing models use word level embeddings for the final classification of the sentiments. This paper proposes a novel representation of short text derived from a combination of word embeddings and character embeddings using Bidirectional LSTM (BiLSTM)....

chapter

Phonetic-Based Microtext Normalization for Twitter Sentiment Analysis

Ranjan Satapathy, Claudia Guerreiro, Iti Chaturvedi, Erik Cambria

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 407 - 413

The proliferation of Web 2.0 technologies and the increasing use of computer-mediated communication resulted in a new form of written text, termed microtext. This poses new challenges to natural language processing tools which are usually designed for well-written text. This paper proposes a phonetic-based framework for normalizing microtext to plain English and, hence, improve the classification...

chapter

A Bootstrap Method for Automatic Rule Acquisition on Emotion Cause Extraction

Shuntaro Yada, Kazushi Ikeda, Keiichiro Hoashi, Kyo Kageura

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 414 - 421

Emotion cause extraction is one of the promising research topics in sentiment analysis, but has not been well-investigated so far. This task enables us to obtain useful information for sentiment classification and possibly to gain further insights about human emotion as well. This paper proposes a bootstrapping technique to automatically acquire conjunctive phrases as textual cue patterns for emotion...

chapter

Extracting User-Reported Mobile Application Defects from Online Reviews

Yue Wang, Hongning Wang, Hui Fang

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 422 - 429

User-generated mobile application reviews have become a gold mine for timely identifying functional defects in this type of software artifacts. In this work, we develop a hidden structural SVM model for extracting detailed defect descriptions from user reviews at the sentence level. Structured features and constraints are introduced to reduce the demand of exhaustive manual annotation at the sentence...

chapter

An CNN-LSTM Attention Approach to Understanding User Query Intent from Online Health Communities

Ruichu Cai, Binjun Zhu, Lei Ji, Tianyong Hao, et al.

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 430 - 437

Understanding user query intent is a crucial task to Question-Answering area. With the development of online health services, online health communities generate huge amount of valuable medical Question-Answering data, where user intention can be mined. However, the queries posted by common users have many domain concepts and colloquial expressions, which make the understanding of user intents very...

chapter

Process-Oriented Iterative Multiple Alignment for Medical Process Mining

Shuhong Chen, Sen Yang, Moliang Zhou, Randall Burd, et al.

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 438 - 445

Adapted from biological sequence alignment, trace alignment is a process mining technique used to visualize and analyze workflow data. Any analysis done with this method, however, is affected by the alignment quality. The best existing trace alignment techniques use progressive guide-trees to heuristically approximate the optimal alignment in O(N^2 L^2) time. These algorithms are heavily dependent on...

chapter

Exploiting PubMed for Protein Molecular Function Prediction via NMF Based Multi-label Classification

Samah Fodeh, Aditya Tiwari, Hong Yu

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 446 - 451

Gene ontology (GO) defines terms and classes used to describe gene functions and relationships between them. GO has been the standard to describing the functions of specific genes in different model organisms. GO annotation which tags genes with GO terms has mostly been a manual and time-consuming curation process. In this paper we describe the development and evaluation of an innovative predictive...

chapter

Discovery of Informal Topics from Post Traumatic Stress Disorder Forums

Reilly Grant, David Kucher, Ana M. Leon, Jonathan Gemmell, et al.

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 452 - 461

Post Traumatic Stress Disorder (PTSD) is a public health problem afflicting millions of people each year. It is especially prominent among military veterans. Understanding the language, attitudes, and topics associated with PTSD presents an important and challenging problem. Based on their expertise, mental health professionals have constructed a formal definition of PTSD. However, even the most assiduous...

chapter

RESTRAC: REference Sequence Based Space TRAnsformation for Clustering

AKM Tauhidul Islam, Sakti Pramanik, Vahid Mirjalili, Shamik Sural

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 462 - 469

Effective mining of large amount of DNA and RNA fragments obtained from next generation sequencing technologies, depends on the availability of efficient analytical tools to process them. One of the important aspects of this analysis, dealing with huge number of fragments, is partitioning them based on their level of similarities. In this paper we propose a space transformation based clustering approach...

chapter

Probable Biomarker Identification Using Recursive Feature Extraction and Network Analysis

Arpit Mishra, Abhishek Gupta, Umesh Maheswari, Laeeq Siddique

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 470 - 477

Biomarkers have tremendous potential in different phases of treatment such as risk assessment, screening/detection, diagnosis and patient's response prediction. In this paper, we present an approach for development of a generic tool for an end to end analysis of expression data to identify the probable biomarkers. We follow machine learning as well as network analysis approaches in parallel. We use...

chapter

Semi-Supervised Prediction of Comorbid Rare Conditions Using Medical Claims Data

Chirag Nagpal, Kyle Miller, Tiffany Pellathy, Marilyn Hravnak, et al.

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 478 - 485

Medical insurance claims data offer a coarse view of a patient's medical profile, including information about previous diagnoses and procedures performed. These data have been exploited in the past to predict presence of unmanifested conditions. Rarer conditions however, provide an extremely limited amount of ground truth to train supervised models, but predicting relevant co-morbidities can help...

chapter

Deep Physiological Arousal Detection in a Driving Simulator Using Wearable Sensors

Aaqib Saeed, Stojan Trajanovski, Maurice van Keulen, Jan van Erp

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 486 - 493

Driving is an activity that requires considerable alertness. Insufficient attention, imperfect perception, inadequate information processing, and sub-optimal arousal are possible causes of poor human performance. Understanding of these causes and the implementation of effective remedies is of key importance to increase traffic safety and improve driver's well-being. For this purpose, we used deep...

chapter

GB-R: A Fast and Effective Gray-Box Reconstruction of Cascade Time-Series

Hyun Ah Song, Fan Yang, Zongge Liu, Wilbert van Panhuis, et al.

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 494 - 501

Given some (but not all) monthly totals of people with measles (or counts of product-units sold, or counts of retweets), how can we recover the weekly counts? Requiring smoothness between successive weeks is reasonable - but can we do better, if we have some domain knowledge? For example, we know that measles (flu, count-of-retweets, etc) follow a specific cascade model, like the so-called 'SIS'....

chapter

Detecting Opioid Users from Twitter and Understanding Their Perceptions Toward MAT

Yiming Zhang, Yujie Fan, Yanfang Ye, Xin Li, et al.

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 502 - 509

Opioid (e.g., heroin and morphine) addiction has become one of the largest and deadliest epidemics in the United States. To combat such deadly epidemic, there is an urgent need for novel tools and methodologies to gain new insights into the behavioral processes of opioid addiction and treatment. In this paper, we design and develop an intelligent system named iOPU to automate the detection of opioid...

chapter

Robust Projective Dictionary Learning by Joint Label Embedding and Classification

Weiming Jiang, Zhao Zhang, Jie Qin, Mingbo Zhao, et al.

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 510 - 517

In this paper, we propose a new discriminative dictionary learning framework, called robust Label Embedding Projective Dictionary Learning (LE-PDL), for data classification. LE-PDL can learn a discriminative dictionary and the block-diagonal representations without using the l0-norm or l1-norm sparsity regularization, since the l0 or l1-norm constraint on the coding coefficients used in the existing...

chapter

Taming Wild High Dimensional Text Data with a Fuzzy Lash

Amir Karami

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 518 - 522

The bag of words (BOW) represents a corpus in a matrix whose elements are the frequency of words. However, each row in the matrix is a very high-dimensional sparse vector. Dimension reduction (DR) is a popular method to address sparsity and high-dimensionality issues. Among different strategies to develop DR method, Unsupervised Feature Transformation (UFT) is a popular strategy to map all words on...

chapter

High-Dimensional Density Estimation for Data Mining Tasks

Alexander Kuleshov, Alexander Bernstein, Yury Yanovich

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 523 - 530

Consider a problem of estimating an unknown high dimensional density whose support lies on unknown low-dimensional data manifold. This problem arises in many data mining tasks, and the paper proposes a new geometrically motivated solution for the problem in manifold learning framework, including an estimation of an unknown support of the density. Firstly, tangent bundle manifold learning problem is...

chapter

Multiple Queries of Information Retrieval Using Krylov Subspace Method

Youzuo Lin

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 531 - 538

The Krylov subspace based information retrieval (IR) approach has been shown to provide comparable accuracy to latent semantic indexing (LSI), while providing some computational advantages. Recently, in the area of numerical linear algebra, attention has been drawn to the block Krylov subspace methods, which are shown to be more efficient than the classic Krylov subspace methods in solving linear...

chapter

Differential Geometric Retrieval of Deep Features

Y. Qian, E. Vazquez, B. Sengupta

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 539 - 544

Comparing images to recommend items from an image-inventory is a subject of continued interest. Added with the scalability of deep-learning architectures the once 'manual' job of hand-crafting features have been largely alleviated, and images can be compared according to features generated from a deep convolutional neural network. In this paper, we compare distance metrics (and divergences) to rank...

chapter

Evaluation of Non-linearity in MIR Spectroscopic Data for Compressed Learning

Dixon Vimalajeewa, Donagh Berry, Eric Robson, Chamil Kulatunga

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 545 - 552

Mid-Infrared (MIR) spectroscopy has emerged as the most economically viable technology to determine milk values as well as to identify a set of animal phenotypes related to health, feeding, well-being and environment. However, Fourier transform-MIR spectra incurs a significant amount of redundant data. This creates critical issues such as increased learning complexity while performing Fog and Cloud...
