chapter

Mining Culture-Specific Music Listening Behavior from Social Media Data

Martin Pichl, Eva Zangerle, Gunther Specht, Markus Schedl

2017 IEEE International Symposium on Multimedia (ISM) > 208 - 215

Incorporating user characteristics and contextual information has been shown to be essential for personalized music retrieval and recommendation. To this end, the current location of a user is often exploited. However, relying solely on GPS coordinates neglects the cultural background of users, which does not necessarily coincide with political borders. In this paper, we analyze culture-specific...

chapter

Efficient Computation of Multiple Density-Based Clustering Hierarchies

Antonio Cavalcante Araujo Neto, Joerg Sander, Ricardo J. G. B. Campello, Mario A. Nascimento

2017 IEEE International Conference on Data Mining (ICDM) > 991 - 996

HDBSCAN*, a state-of-the-art density-based hierarchical clustering method, produces a hierarchical organization of clusters in a dataset w.r.t. a parameter m_pts. While the performance of HDBSCAN* is robust w.r.t. m_pts, choosing a "good" value for it can be challenging: depending on the data distribution, a high or low value for m_pts may be more appropriate, and certain data clusters may...

chapter

Clusterization of objects with fuzzy parameter's values

Aleksandr O. Nazarov, Igor V. Anikin

2017 Dynamics of Systems, Mechanisms and Machines (Dynamics) > 1 - 5

We suggested a clustering method that makes it possible to build a conceptual clustering model for objects of a fuzzy nature and to increase the clustering accuracy for such objects. We used the Cobweb clustering method as a base. We modified the formula for assessing the utility of conceptual clustering for objects with fuzzy parameter values. Then we suggested a modified Cobweb version for working...

chapter

A survey of data mining technology on electronic medical records

Wencheng Sun, Zhiping Cai, Fang Liu, Shengqun Fang, et al.

2017 IEEE 19th International Conference on e-Health Networking, Applications and Services (Healthcom) > 1 - 6

Medical institutes use Electronic Medical Records (EMR) to record a series of medical events, including diagnostic information (diagnosis codes), procedures performed (procedure codes) and admission details. Many data mining technologies are applied to EMR data sets for knowledge discovery, which is valuable to medical practice. The knowledge found is conducive to developing treatment plans, improve...

chapter

An effective method determining the initial cluster centers for K-means for clustering gene expression data

Deniz Tanir, Fidan Nuriyeva

2017 International Conference on Computer Science and Engineering (UBMK) > 751 - 754

Clustering is an important tool for analyzing gene expression data. Many clustering algorithms have been proposed for the analysis of gene expression data. In this article, we have clustered real-life gene expression data using the K-means clustering algorithm. We have also proposed a new method for determining the initial cluster centers for K-means. We have compared the results of our method...

chapter

Copying case detection with data mining

Halil Hakan Tarhan, Nizamettin Aydin

2017 International Conference on Computer Science and Engineering (UBMK) > 430 - 434

Central examinations are among the measurement and evaluation tools used throughout the world to select or rank participants, to reduce the number of candidates before an interview, or to determine whether the level of education varies across regional and demographic criteria. A more objective measurement and evaluation can be made through multiple-choice questions...

chapter

Weight based movie recommendation system using K-means algorithm

Md. Tayeb Himel, Mohammed Nazim Uddin, Mohammad Arif Hossain, Yeong Min Jang

2017 International Conference on Information and Communication Technology Convergence (ICTC) > 1302 - 1306

The internet offers a diverse set of products of any particular type. When a user tries to find the best product of a certain type, going through every one of them manually is very difficult, so manual searching is not efficient. In that scenario, a recommendation system plays an important role in recommending the best products. In this study, we develop a recommendation...

chapter

A missing data imputation approach using clustering and maximum likelihood estimation

Muammer Albayrak, Kemal Turhan, Burcin Kurt

2017 Medical Technologies National Congress (TIPTEKNO) > 1 - 4

Missing data, which is frequently encountered in healthcare data for a variety of reasons, is a data mining problem that adversely affects data analysis and decision-making processes. Missing data is still an important research topic because the success of a method is influenced by many factors, such as the characteristics of the data and the type of the missing data. In this study, a clustering and...

chapter

A novel clustering algorithm based on searched experiences

Chun-Wei Tsai, Yong-Chun Ding, Ming-Chao Chiang, Chu-Sing Yang

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 804 - 808

Reducing the computation time and improving the quality of the clustering result are two major research issues. Although several efficient and effective clustering algorithms have been presented, none of them is perfect. As such, an effective clustering algorithm, which is based on the prediction of searching information to determine the search directions at later iterations and employs...

chapter

An approach of algorithmic clustering based on string compression to identify bird songs species in xeno-canto database

Guillermo Sarasa, Ana Granados, Francisco B. Rodriguez

2017 3rd International Conference on Frontiers of Signal Processing (ICFSP) > 101 - 104

In this work, we analyze the usefulness of the normalized compression distance (NCD) as a similarity measure for bird species identification from audio samples. As a first approach, we review the effect of different compression methods from 7z and the CompLearn Toolkit on subsets of bird audio samples obtained from the xeno-canto database. The performance of each compression method was measured applying...
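
For readers unfamiliar with the measure named in this abstract, the following is a minimal sketch of the standard NCD definition, NCD(x, y) = (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), where C is the compressed length. It assumes Python's zlib as the compressor; the paper itself uses compressors from 7z and the CompLearn Toolkit, so this is an illustration rather than the authors' setup. The function names and sample inputs are hypothetical.

```python
import zlib


def compressed_size(data: bytes) -> int:
    """Length in bytes of the zlib-compressed input (assumed compressor)."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized compression distance between two byte strings."""
    cx = compressed_size(x)
    cy = compressed_size(y)
    cxy = compressed_size(x + y)  # compress the concatenation
    return (cxy - min(cx, cy)) / max(cx, cy)


if __name__ == "__main__":
    # Hypothetical inputs: a repetitive string and a non-repetitive one.
    repetitive = b"abc" * 300
    varied = bytes(range(256))
    print("same input:      ", ncd(repetitive, repetitive))
    print("different inputs:", ncd(repetitive, varied))
```

Smaller values indicate that one input helps compress the other, which is what makes the measure usable as a similarity score for clustering.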

chapter

A novel data mining method for high accuracy solar radiation forecasting

M. Ghofrani, N. Niromand, R. Azimi, M. Ghayekhloo

2017 North American Power Symposium (NAPS) > 1 - 6

Accurate forecasting of solar time series is challenging due to irregularities and uncertainties of such datasets. This paper develops an advanced hybrid forecasting method for solar radiation. The proposed framework combines a novel data mining technique for clustering the time-series data with an innovative cluster selection method and a multilayer recurrent neural network (RNN) to enhance the forecast...

chapter

Acquisition and clustering for affective semantic lexicon from web

Fang Tian, Xiao Sun, Benwang Sun

2017 2nd IEEE International Conference on Computational Intelligence and Applications (ICCIA) > 255 - 259

A simple method for extracting a semantic lexicon from the Baidu Chinese Network Encyclopedia is proposed, based on one hypothesis and three filtering rules. The acquired affective lexicon includes emotional words and their lexical semantic relations, including synonyms and antonyms. The acquisition method is a recursive algorithm using seed words. The extracted affective lexicon is labeled with affective tendency...

chapter

Moving object grouping rule mining based on accumulated spatio-temporal data

Guodong Yang, Xiang Wang, Zhitao Huang

2017 2nd IEEE International Conference on Computational Intelligence and Applications (ICCIA) > 57 - 62

With the advance of mobile electronic devices and the development of positioning technology, a large volume of spatio-temporal data is collected in the form of desultory data streams, which contain a lot of potential information. In this study, we focus on discovering the composition relationships between observed moving objects over a long period. Such research can be widely used in military...

chapter

Automatic summarization and visualisation of healthcare tweets

P. G. Lavanya, Suresha Mallappa

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1557 - 1563

With social media encompassing people in all aspects of life, the information shared over these media is becoming highly relevant. The marketing and retail industries have been using social media like Twitter and Facebook extensively to collect information and promote their products. Now, it is the healthcare industry's turn to find hidden insights from the vast data available on the web...

chapter

Characteristics and causes of malnutrition across Indian states: A cluster analysis based on Indian demographic and health survey data

Nair Akash Anilkumar, Deepa Gupta, Sangita Khare, Deepika Manippady Gopalkrishna, et al.

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 2115 - 2123

Good nutrition is an essential component of life. Undernutrition is the root cause of death of over 3.5 million children under the age of five in India. To address this issue of malnutrition, an overarching national policy is desirable, but it may not be effective if the root cause of malnutrition varies across regions of the country. In this context, the attempt made in this paper is two-fold. First,...

chapter

A review on clustering of residential electricity customers and its applications

Amin Rajabi, Li Li, Jiangfeng Zhang, Jianguo Zhu, et al.

2017 20th International Conference on Electrical Machines and Systems (ICEMS) > 1 - 6

Clustering is a well-recognized data mining technique which enables the determination of underlying patterns in datasets. In electric power systems, it has been traditionally utilized for different purposes like defining customer load profiles, tariff designs and improving load forecasting. Some surveys summarized different clustering techniques which were traditionally used for customer segmentation...

chapter

A Data Science and Engineering Solution for Fast K-Means Clustering of Big Data

Karl E. Dierckens, Adrian B. Harrison, Carson K. Leung, Adrienne V. Pind

2017 IEEE Trustcom/BigDataSE/ICESS > 925 - 932

With advances in technology, high volumes of a wide variety of valuable data of different veracity can be easily collected or generated at a high velocity in the current era of big data. Embedded in these big data is implicit, previously unknown and potentially useful information. Hence, fast and scalable big data science and engineering solutions that mine and discover knowledge from these big data...

chapter

Clustering and profiling of customers using RFM for customer relationship management recommendations

Ina Maryani, Dwiza Riana

2017 5th International Conference on Cyber and IT Service Management (CITSM) > 1 - 6

The problem faced by the company is how to identify potential customers and apply CRM (Customer Relationship Management) in order to pursue the right marketing strategy and thereby benefit the company. This research aims to cluster and profile customers using the Recency, Frequency and Monetary (RFM) model to provide customer relationship management (CRM) recommendation...

chapter

Sentence structure-based summarization for Indonesian news articles

Raihannur Reztaputra, Masayu Leylia Khodra

2017 International Conference on Advanced Informatics, Concepts, Theory, and Applications (ICAICTA) > 1 - 6

Automatic multi-document summarization may help news readers retrieve information from digital news media efficiently. The summarizer creates a concise summary containing important information from a collection of articles, enabling readers to read only one text to gain information from multiple text sources. Reflecting on previous research, we propose an automatic summarization system using sentence...

chapter

The Use of Clustering Algorithms Ensemble with Variable Distance Metrics in Solving Problems of Web Mining

Pyotr V. Bochkaryov, Anna I. Guseva

2017 5th International Conference on Future Internet of Things and Cloud Workshops (FiCloudW) > 41 - 46

The article focuses on the results of research on scientific publications in different fields from the database of the All-Russian Institute for Scientific and Technical Information of the Russian Academy of Sciences (VINITI Database RAS). The purpose of the work was to increase the accuracy of partitioning large volumes of scientific data by research direction. This analysis was carried out on summaries of scientific...

INFONA - science communication portal
