Search results

chapter

Unsupervised feature extraction for hyperspectral images using combined low rank representation and locally linear embedding

Mengdi Wang, Jing Yu, Lijuan Niu, Weidong Sun

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1428 - 1431

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Hyperspectral images (HSIs) provide hundreds of narrow spectral bands for land covers and can thus provide more powerful discriminative information for land-cover classification. However, HSIs suffer from the curse of high dimensionality, so dimension reduction and feature extraction are essential for their application. In this paper, we propose an unsupervised feature extraction...

chapter

Post-ICA phase de-noising for resting-state complex-valued FMRI data

Li-Dan Kuang, Qiu-Hua Lin, Xiao-Feng Gong, Fengyu Cong, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 856 - 860

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Magnitude-only resting-state fMRI data have been largely investigated via independent component analysis (ICA) for extracting spatial maps (SMs) and time courses. However, the native complex-valued fMRI data have rarely been studied. Motivated by the significant improvements achieved by ICA of complex-valued task fMRI data over magnitude-only task fMRI data, we present an efficient method for de-noising...

chapter

Detecting Anomalies in the Data Residing over the Cloud

Deepali Arora, Kin Fun Li

2017 31st International Conference on Advanced Information Networking and Applications Workshops (WAINA) > 541 - 546

2017 31st International Conference on Advanced Information Networking and Applications Workshops (WAINA)

With more companies turning towards cloud computing for storage and processing of their data, the security of the cloud becomes essential. However, cloud computing is vulnerable to many security threats, including data leakages, compromised credentials, presence of unauthorized users or entities, execution of insecure applications or programming interfaces and APIs, shared technology vulnerabilities,...

chapter

An efficient approach for opinion mining from skewed twitter corpus using under sampling approach

Salina Adinarayana, E. Ilavarasan

2017 International Conference on Algorithms, Methodology, Models and Applications in Emerging Technologies (ICAMMAET) > 1 - 4

2017 International Conference on Algorithms, Methodology, Models and Applications in Emerging Technologies (ICAMMAET)

Data mining is an efficient technique for knowledge discovery from existing databases. The performance of existing algorithms degrades when they are applied to imbalanced datasets. The imbalanced nature of Twitter datasets also hinders efficient knowledge discovery. In this paper, we propose an efficient approach for knowledge discovery from imbalanced datasets specifically designed for opinion...

chapter

StiCProb: A novel feature mining approach using conditional probability

Yutian Tang, Hareton Leung

2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering (SANER) > 45 - 55

2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering (SANER)

Software Product Line Engineering is a key approach to constructing applications with systematic reuse of architecture, documents and other relevant components. To migrate legacy software into a product line system, it is essential to identify the code segments that should be constructed as features from the source base. However, this can be an error-prone and complicated task, as it involves exploring...

chapter

Does Your Configuration Code Smell?

Tushar Sharma, Marios Fragkoulis, Diomidis Spinellis

2016 IEEE/ACM 13th Working Conference on Mining Software Repositories (MSR) > 189 - 200

2016 IEEE/ACM 13th Conference on Mining Software Repositories (MSR)

Infrastructure as Code (IaC) is the practice of specifying computing system configurations through code and managing them through traditional software engineering methods. The wide adoption of configuration management and the increasing size and complexity of the associated code call for assessing, maintaining, and improving the configuration code's quality. In this context, traditional software engineering...

chapter

Analyzing Developer Sentiment in Commit Logs

Vinayak Sinha, Alina Lazar, Bonita Sharif

2016 IEEE/ACM 13th Working Conference on Mining Software Repositories (MSR) > 520 - 523

2016 IEEE/ACM 13th Conference on Mining Software Repositories (MSR)

The paper presents an analysis of developer commit logs for GitHub projects. In particular, developer sentiment in commits is analyzed across 28,466 projects within a seven-year time frame. We use the Boa infrastructure's online query system to generate commit logs as well as the files that were changed during each commit. We analyze the commits in three categories: large, medium, and small based on the...

chapter

Prediction of heart and kidney risks in diabetic prone population using fuzzy classification

S. Ananthi, V. Bhuvaneswari

2017 International Conference on Computer Communication and Informatics (ICCCI) > 1 - 6

2017 International Conference on Computer Communication and Informatics (ICCCI)

Diabetes mellitus is a group of metabolic diseases characterized by hyperglycemia resulting from defects in insulin secretion, insulin action, or both. In the current scenario, diabetes mellitus has become a major health problem among people of all ages globally. Early diagnosis of diabetes-related heart, kidney and eye complications is difficult and challenging. Data mining techniques are applied...

chapter

Latent Semantic Analysis (LSA) for syslog correlation

Gabriel Slomovitz

2017 International Conference on Electronics, Communications and Computers (CONIELECOMP) > 1 - 4

2017 International Conference on Electronics, Communications and Computers (CONIELECOMP)

Latent Semantic Analysis is a novel method to extract the principal components of a text corpus and was initially used for categorization and information search. However, due to the significant results obtained, similar to human processing, LSA has become much more than a simple method to analyze text. In this work, we propose to use LSA in order to infer the degree of similarity of syslog messages...

chapter

Study on Network Information Security Based on Big Data

Wang Jia

2017 9th International Conference on Measuring Technology and Mechatronics Automation (ICMTMA) > 408 - 409

2017 9th International Conference on Measuring Technology and Mechatronics Automation (ICMTMA)

Nowadays, APT attacks pose an extreme threat and challenge to network information security. Based on an analysis of big data techniques, the paper presents an APT security protection framework that integrates deep and three-dimensional defense strategies. In addition, big data are used to explore and analyze possible APT attacks and to locate and track threats.

chapter

Use of Data Mining Technique for Prediction of Tea Yield in the Face of Climate Change of Assam, India

Rupanjali D. Baruah, Sudipta Roy, R. M. Bhagat, L. N. Sethi

2016 International Conference on Information Technology (ICIT) > 265 - 269

2016 International Conference on Information Technology (ICIT)

Data mining is an emerging field of research in Information Technology as well as in agriculture. The present study focuses on the application of data mining techniques in tea plantations in the face of climate change to help farmers make farming decisions and achieve the expected economic return. This paper presents an analysis using data mining techniques for estimating the future yield...

chapter

Dynamic Community Mining and Tracking Based on Temporal Social Network Analysis

Xiaokang Zhou, Wei Liang, Bo Wu, Zixian Lu, more

2016 IEEE International Conference on Computer and Information Technology (CIT) > 177 - 182

2016 IEEE International Conference on Computer and Information Technology (CIT)

Nowadays, the analysis of social networks and of community evolution has become a hotly discussed topic in the social computing field. In this paper, we focus on mining and tracking dynamic communities based on social network analysis. Based on a generic framework for dynamic community discovery, a computational approach is developed to extract users' static and dynamic features for...

chapter

Prioritizing Software Maintenance Plan by Analyzing User Feedback

Kittiya Srewuttanapitikul, Pornsiri Muengchaisri

2016 International Conference on Information Science and Security (ICISS) > 1 - 5

2016 International Conference on Information Science and Security (ICISS)

Normally, when developers obtain a list of defects from users, the development team decides which defects should be fixed first. The software maintenance plan, which consists of a list of defects to be fixed sequentially, is mostly generated using developer experience to prioritize the defects. With the current strategy, the software maintenance plan may not serve customer needs well. This research...

chapter

MH-ARM: A Multi-Mode and High-Value Association Rule Mining Technique for Healthcare Data Analysis

Libao Yang, Zhe Li, Guan Luo

2016 International Conference on Computational Science and Computational Intelligence (CSCI) > 122 - 127

2016 International Conference on Computational Science and Computational Intelligence (CSCI)

The association rules mining process enables the end users to analyze, understand, and use the extracted knowledge in an intelligent system or to support the decision-making processes. To find valuable association rules from a large number of redundant rules, this paper proposes a deeper mining process, multi-mode and high value association rules mining (MH-ARM). This method takes into account the...

chapter

Cross Platform Bug Correlation Using Stack Traces

Maryam Abdul Ghafoor, Junaid Haroon Siddiqui

2016 International Conference on Frontiers of Information Technology (FIT) > 199 - 204

2016 International Conference on Frontiers of Information Technology (FIT)

A program crash is an annoying experience for users. Whenever a program crashes, an event log is generated. Sometimes built-in crash reporting programs send crash reports automatically to the development site, whereas at other times the user is presented with an option to report the crash. This reporting is often useful for the development team to diagnose and fix the problem. It happens quite often...

chapter

Predicting Traffic Congestions with Global Signatures Discovered by Frequent Pattern Mining

Jun Gao, Yi Sun, Weihua Liu, Su Yang

2016 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData) > 554 - 560

2016 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData)

We propose a traffic jam prediction method based on mining frequent patterns correlated with traffic jams. For traffic jam prediction at a given sensor, we first apply a one-dimensional clustering scheme to automatically identify which sensors are correlated with the given sensor, and to what degree, in the sense that certain volume values with a compact distribution co-occur frequently with the traffic jams...

chapter

Research on the big data system of massive open online course

Zhenwei Du, Haopeng Chen, Jianwei Jiang

2016 IEEE International Conference on Big Data (Big Data) > 1931 - 1936

2016 IEEE International Conference on Big Data (Big Data)

With no limit on time and location [1], the number of users attracted by massive open online courses (MOOCs) has increased rapidly, and many platforms have been built to provide a variety of courses. All of these trigger an explosive growth in data volume. As we know, people have encountered big data in many areas and proposed many techniques and methods to deal with it. However, people still have no sense...

chapter

SCEM: Smart & effective crowd management with a novel scheme of big data analytics

Shakti Awaghad

2016 IEEE International Conference on Big Data (Big Data) > 2000 - 2003

2016 IEEE International Conference on Big Data (Big Data)

This paper presents a novel scheme for precisely extracting knowledge from complex, massive live data streams of scenes from crowded places. The prime contribution of the proposed system is to perform enough processing over the raw and unstructured distributed data from multiple locations so that processing over distributed storage and mining can be done with...

chapter

Effective and Unsupervised Fractal-Based Feature Selection for Very Large Datasets: Removing Linear and Non-linear Attribute Correlations

Antonio C. Fraideinberze, Jose F. Rodrigues, Robson L. F. Cordeiro

2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW) > 615 - 622

2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW)

Given a very large dataset of moderate-to-high dimensionality, how can useful patterns be mined from it? In such cases, dimensionality reduction is essential to overcome the "curse of dimensionality". Although algorithms exist to reduce the dimensionality of Big Data, unfortunately, they all fail to identify/eliminate non-linear correlations between attributes. This paper tackles the problem...

chapter

Improved Time Series Classification with Representation Diversity and SVM

Rafael Giusti, Diego F. Silva, Gustavo E. A. P. A. Batista

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA) > 1 - 6

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Time series classification is an important task in data mining that has been traditionally addressed with the use of similarity-based classifiers. The 1-NN DTW is typically considered the most accurate model for temporal data. Nevertheless, some authors have recently proposed ingenious alternatives to the 1-NN DTW by using diversity of time series representation or by using DTW for feature extraction...

INFONA - science communication portal
