2017 IEEE International Conference on Data Mining Workshops (ICDMW)

chapter

Dealing with Class Imbalance the Scalable Way: Evaluation of Various Techniques Based on Classification Grade and Computational Complexity

Bernhard Schlegel, Bernhard Sick

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 69 - 78

Highly imbalanced datasets continue to be a challenge in many data mining applications. It is surprising that state-of-the-art techniques countering class imbalances are usually very computationally expensive and therefore unscalable. Most research effort has been directed toward enhancing those techniques, e.g., by focusing on borderline examples or combining multiple techniques. This is usually accompanied...

chapter

Long Tail Query Enrichment for Semantic Job Search

Layla Pournajaf, Khalifeh Aljadda, Mohammed Korayem

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 215 - 220

Online job boards are used by millions of job seekers, who browse through the postings for jobs that match their interest. Queries are crafted using terminology generated by the users, which may not match the language used in the job postings. Semantic enrichment methods attempt to fill such a lexical gap by re-writing the queries based on richer terms, which are mined using behavioral logs. However,...

chapter

Dataset Construction via Attention for Aspect Term Extraction with Distant Supervision

Athanasios Giannakopoulos, Diego Antognini, Claudiu Musat, Andreea Hossmann, et al.

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 373 - 380

Aspect Term Extraction (ATE) detects opinionated aspect terms in sentences or text spans, with the end goal of performing aspect-based sentiment analysis. The small number of available datasets for supervised ATE and the fact that they cover only a few domains raise the need for exploiting other data sources in new and creative ways. Publicly available review corpora contain a plethora of opinionated...

chapter

Sentiment Extraction from Consumer-Generated Noisy Short Texts

Hardik Meisheri, Kunal Ranjan, Lipika Dey

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 399 - 406

Sentiment analysis, or recognizing emotions from short and noisy text from social networks such as Twitter, has been a challenging task. Most of the existing models use word-level embeddings for the final classification of the sentiments. This paper proposes a novel representation of short text derived from a combination of word embeddings and character embeddings using Bidirectional LSTM (BiLSTM)....

chapter

Near-Optimal Noisy Low-Tubal-Rank Tensor Completion via Singular Tube Thresholding

Andong Wang, Zhong Jin

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 553 - 560

The problem of completing low-tubal-rank tensors from incomplete noisy observations is studied. To recover the underlying tensor, an iterative singular tube thresholding (ISTT) algorithm is proposed. To explore the statistical performance of the proposed algorithm, the estimation error in terms of the Frobenius norm is upper bounded non-asymptotically. The minimax optimal lower bound of the estimation...

chapter

Robust Self-Tuning Sparse Subspace Clustering

Guangtao Wang, Jiayu Zhou, Jingjie Ni, Tingjin Luo, et al.

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 858 - 865

Sparse subspace clustering (SSC) is an effective approach to clustering high-dimensional data. However, how to adaptively select the number of clusters/eigenvectors for different data sets, especially when the data are corrupted by noise, is a big challenge in SSC and also an open problem in the field of data mining. In this paper, considering the fact that the eigenvectors are robust to noise, we develop...

chapter

Feature Selection in Learning Using Privileged Information

Rauf Izmailov, Blerta Lindqvist, Peter Lin

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 957 - 963

The paper considers the problem of feature selection in learning using privileged information (LUPI), where some of the features (referred to as privileged ones) are only available for training, while being absent for test data. In the latest implementation of LUPI, these privileged features are approximated using regressions constructed on standard data features, but this approach could lead to polluting...
