Search results

chapter

Demo Abstract: Visual Analytics of Higher-order Dependencies in Sensor Data

Jian Xu, Jun Tao, Nitesh Chawla, Chaoli Wang

2017 IEEE/ACM Second International Conference on Internet-of-Things Design and Implementation (IoTDI) > 297 - 298

2017 IEEE/ACM Second International Conference on Internet-of-Things Design and Implementation (IoTDI)

Existing approach to model sensor movement data as pairwise connections in networks implicitly assumes the Markov property and loses higher-order movement patterns. While the higher-order network (HON) captures higher-order movement patterns, there has not yet been a visualization tool tailored for HON. Based on our prior work, in this demo we present HoNVis, a comprehensive visualization and interactive...

chapter

Ordered multidimensional model construction of relational source for integral OLAP-modeling

Anna Korobko, Ludmila Nozhenkova

2016 IEEE 10th International Conference on Application of Information and Communication Technologies (AICT) > 1 - 5

2016 IEEE 10th International Conference on Application of Information and Communication Technologies (AICT)

Internet is a rich source of information, but it consists of miscellaneous data fragments. Analytical “surfing” on World Wide Web opens extraordinary prospects for attain and maintain competitive advantage of enterprises and for scientific researches. Exploratory OLAP is one of the main agenda of analytical processing of heterogeneous data sources. The author proposes an original approach to exploratory...

chapter

Weakly hierarchical lasso based learning to rank in best answer prediction

Qiongjie Tian, Baoxin Li

2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) > 307 - 314

2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)

In community question and answering sites, pairs of questions and their high-quality answers (like best answers selected by askers) can be valuable knowledge available to others. However lots of questions receive multiple answers but askers do not label either one as the accepted or best one even when some replies answer their questions. To solve this problem, high-quality answer prediction or best...

chapter

SVM kernel based predictive analytics on faculty performance evaluation

E. Deepak, G. Sai Pooja, R. N S Jyothi, S V Phani Kumar, more

2016 International Conference on Inventive Computation Technologies (ICICT) > 3 > 1 - 4

2016 International Conference on Inventive Computation Technologies (ICICT)

In recent years, higher education has been gaining importance in graduate students to make successful careers. So, academic organizations are given utmost importance for quality in academics to build the careers of the students. Faculty performance plays a vital role in academic institutions. In this paper, the performance of faculty members is evaluated on the basis of different parameters are taken...

chapter

A mapreduce fuzzy techniques of big data classification

Osman Hegazy, Soha Safwat, Malak El Bakry

2016 SAI Computing Conference (SAI) > 118 - 128

2016 SAI Computing Conference (SAI)

Due to the huge increase in the size of the data it becomes troublesome to perform efficient analysis using the current traditional techniques. Big data put forward a lot of challenges due to its several characteristics like volume, velocity, variety, variability, value and complexity. Today there is not only a necessity for efficient data mining techniques to process large volume of data but in addition...

chapter

An Evaluation Study on Log Parsing and Its Use in Log Mining

Pinjia He, Jieming Zhu, Shilin He, Jian Li, more

2016 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN) > 654 - 661

2016 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN)

Logs, which record runtime information of modern systems, are widely utilized by developers (and operators) in system development and maintenance. Due to the ever-increasing size of logs, data mining models are often adopted to help developers extract system behavior information. However, before feeding logs into data mining models, logs need to be parsed by a log parser because of their unstructured...

article

Online Ensemble Learning of Data Streams with Gradually Evolved Classes

Yu Sun, Ke Tang, Leandro L. Minku, Shuo Wang, more

IEEE Transactions on Knowledge and Data Engineering > 2016 > 28 > 6 > 1532 - 1545

Class evolution, the phenomenon of class emergence and disappearance, is an important research topic for data stream mining. All previous studies implicitly regard class evolution as a transient change, which is not true for many real-world problems. This paper concerns the scenario where classes emerge or disappear gradually. A class-based ensemble approach, namely Class-Based ensemble for Class...

chapter

Benchmarking concept drift adoption strategies for high speed data stream mining

Mohammed Ahmed Ali Abdualrhman, M.C Padma

2015 International Conference on Emerging Research in Electronics, Computer Science and Technology (ICERECT) > 364 - 369

2015 International Conference on Emerging Research in Electronics, Computer Science and Technology (ICERECT)

Data streams are significantly influenced by the notion change that is termed as concept drift. The act of knowledge discovery from the data streams under notion adaption is a significant act to achieve the conventional learning of the streaming data. The concept drift for conventional learning of streaming data can be done under set of notions that can be either static or dynamic. Due to the large...

chapter

A study of rainfall over India using data mining

Chowdari K.K, Girisha R, K C Gouda

2015 International Conference on Emerging Research in Electronics, Computer Science and Technology (ICERECT) > 44 - 47

2015 International Conference on Emerging Research in Electronics, Computer Science and Technology (ICERECT)

The data mining techniques are employed for efficient and real time analysis of Weather and Climate data. The main goal of studies on Climate is that users e.g. farmers, Scientist, decision & policy maker etc., from different industries e.g. Agriculture, Scientific, Aerospace etc., is required to understand the importance of various changes in weather and climate parameters like rainfall, humidity,...

chapter

Comparisons of keyphrase extraction methods in source retrieval of plagiarism detection

Hui Ning, Leilei Kong, Mingxing Wang, Cuixia Du, more

2015 4th International Conference on Computer Science and Network Technology (ICCSNT) > 1 > 661 - 664

2015 4th International Conference on Computer Science and Network Technology (ICCSNT)

In the processing of source retrieval in plagiarism detection, rationale for keywords extraction is to select only those phrases or words which maximize the chance of retrieving source documents matching the suspicious document. TF-IDF (term frequency-inverse document frequency), weighted TF-IDF (the weighted term frequency-inverse document frequency, namely, the TF-IDF of a term with a different...

chapter

A hybrid decision support system for the identification of asthmatic subjects in a cross-sectional study

Pooja M R, Pushpalatha M P

2015 International Conference on Emerging Research in Electronics, Computer Science and Technology (ICERECT) > 288 - 293

2015 International Conference on Emerging Research in Electronics, Computer Science and Technology (ICERECT)

This paper discusses the implementation of a decision support system for the prediction of asthma in a group of children with related medical factors. The system makes use of the survey data that is gathered as part of ISAAC Phase One Study, obtained through questionnaires completed by adolescents at school and at home by the parents of the children. The model is tested on cross-sectional study data...

chapter

Comparison of K-Means clustering and statistical outliers in reducing medical datasets

T. Santhanam, M. S Padmavathi

2014 International Conference on Science Engineering and Management Research (ICSEMR) > 1 - 6

2014 International Conference on Science Engineering and Management Research (ICSEMR)

Data reduction is a process of reducing the datasets in volume, almost used in all real time applications. Although there are several techniques available, many researchers have used K-Means clustering in reducing the datasets. In this paper, three different methods were used to replace missing values with mean, median and a predicted score; the cleaned datasets were reduced using K-Means clustering...

chapter

Fuzzy logic-based outlier detection for bio-medical data

Yong Ki Kim, Sang Yeun Lee, Keon Myung Lee

2014 International Conference on Fuzzy Theory and Its Applications (iFUZZY2014) > 117 - 121

2014 International Conference on Fuzzy Theory and Its Applications (iFuzzy)

Many bio-medical databases such cohort study data suffer from potential errors involved with human factors like mistyping, overlooking some fields. It is crucial to detect such errors at the data entry stage using some techniques like outlier detection. Because such data lie in high-dimensional space and contain many null values, i.e., missing values, most conventional outlier detections are not a...

chapter

The integrating between web usage mining and data mining techniques

Omer Adel Nassar, Nedhal A. Al Saiyd

2013 5th International Conference on Computer Science and Information Technology > 243 - 247

2013 5th International Conference on Computer Science and Information Technology (CSIT)

Clickstream data is one of the most important sources of information in websites usage and customers' behavior in Banks e-services. A number of web usage mining scenarios are possible depending on the available information. While simple traffic analysis based on click stream data may easily be performed to improve the e-banks services. The banks need data mining techniques to substantially improve...

chapter

Discussion on experimental teaching of data warehouse & data mining course for undergraduate education

Fangjun Wu

2012 7th International Conference on Computer Science & Education (ICCSE) > 1425 - 1429

2012 7th International Conference on Computer Science & Education (ICCSE 2012)

Nowadays data mining techniques have been widely applied to telecommunications, finance, Internet, industry, agriculture, education, software engineering, etc, thus started a continuously offering data mining courses for undergraduate and graduate levels to provide a strong academic support all over the world. Hereon, this paper described the data warehouse & data mining course for computer science...

chapter

A author topic model based unsupervised algorithm for learning topics from large text collections

S. Mercy Shalinie, K. Sundarakantham, S. Pushparathi

2011 International Conference on Recent Trends in Information Technology (ICRTIT) > 360 - 363

2011 International Conference on Recent Trends in Information Technology (ICRTIT)

With the advent of the Web and various specialized digital libraries, the automatic extraction of useful information from text has become an increasingly important research in Data mining. In this paper we present a new MH based algorithm that extracts both the topics expressed in large text document collections and also models how the authors of documents use those topics. The methodology is illustrated...

chapter

BibMiner: A service-oriented framework for bibliographic analysis service

Bin Wu, Hongqiao Tian

2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content > 206 - 209

2010 2nd IEEE International Conference on Network Infrastructure and Digital Content (IC-NIDC 2010)

The continued exponential growth in volume of literature data is giving birth to a new challenge to the bibliographic analysis service and the traditional features such as keyword search, author search and statistics services could not satisfy researchers for in-depth analysis. The emerging of community analysis in social networks is becoming a hot topic in many domains and disciplines such as sociology,...

chapter

A Novel Approach for High Dimensional Data Clustering

A. Alijamaat, M. Khalilian, N. Mustapha

2010 Third International Conference on Knowledge Discovery and Data Mining > 264 - 267

2010 3rd International Conference on Knowledge Discovery and Data Mining (WKDD 2010)

Clustering is considered as the most important unsupervised learning problem. It aims to find some structure in a collection of unlabeled data. Dealing with a large quantity of data items can be problematic because of time complexity. On the other hand high dimensional data is a challenge arena in data clustering e.g. time series data. Novel algorithms are needed to be robust, scalable, efficient...

chapter

Themes Updating Model Based on TIBG

Guangli Zhu, Shunxiang Zhang, Xiao Wei

2009 International Conference on Computational Intelligence and Software Engineering > 1 - 4

2009 International Conference on Computational Intelligence and Software Engineering

Only considering freshness or total click number of a certain theme may lead to unreasonable themes updating. To improve the reasonableness of updating themes on homepage, this paper proposes a novel model based on theme interestingness of browser group (TIBG). TIBG can be used to calculate the theme's real-time popularity which tracks information related to theme browser's actual interest. Firstly,...

chapter

Predicting NDUM Student's Academic Performance Using Data Mining Techniques

M. Wook, Y.H. Yahaya, N. Wahab, M.R.M. Isa, more

2009 Second International Conference on Computer and Electrical Engineering > 2 > 357 - 361

2009 Second International Conference on Computer and Electrical Engineering (ICCEE 2009)

The ability to predict the students' academic performance is very important in institution educational system. Recently some researchers have been proposed data mining techniques for higher education. In this paper, we compare two data mining techniques which are: Artificial neural network (ANN) and the combination of clustering and decision tree classification techniques for predicting and classifying...

INFONA - science communication portal

Search results

Demo Abstract: Visual Analytics of Higher-order Dependencies in Sensor Data

Ordered multidimensional model construction of relational source for integral OLAP-modeling

Weakly hierarchical lasso based learning to rank in best answer prediction

SVM kernel based predictive analytics on faculty performance evaluation

A mapreduce fuzzy techniques of big data classification

An Evaluation Study on Log Parsing and Its Use in Log Mining

Online Ensemble Learning of Data Streams with Gradually Evolved Classes

Benchmarking concept drift adoption strategies for high speed data stream mining

A study of rainfall over India using data mining

Comparisons of keyphrase extraction methods in source retrieval of plagiarism detection

A hybrid decision support system for the identification of asthmatic subjects in a cross-sectional study

Comparison of K-Means clustering and statistical outliers in reducing medical datasets

Fuzzy logic-based outlier detection for bio-medical data

The integrating between web usage mining and data mining techniques

Discussion on experimental teaching of data warehouse & data mining course for undergraduate education

A author topic model based unsupervised algorithm for learning topics from large text collections

BibMiner: A service-oriented framework for bibliographic analysis service

A Novel Approach for High Dimensional Data Clustering

Themes Updating Model Based on TIBG

Predicting NDUM Student's Academic Performance Using Data Mining Techniques

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options