Wyniki wyszukiwania

rozdział

Semi-supervised Kernel-Based Temporal Clustering

Rodrigo Araujo, Mohamed S. Kamel

2014 13th International Conference on Machine Learning and Applications > 123 - 128

2014 13th International Conference on Machine Learning and Applications (ICMLA)

In this paper, we adapt two existing methods to perform semi-supervised temporal clustering: Aligned Cluster Analysis (ACA), a temporal clustering algorithm, and Constrained Spectral Clustering, a semi-supervised clustering algorithm. In the first method, we add side information in the form of pair wise constraints to its objective function, and in the second, we add a temporal search to its framework...

rozdział

NSLPA: A Node Similarity Based Label Propagation Algorithm for Real-Time Community Detection

Qi Song, Bo Li, Weiren Yu, Jianxin Li, więcej

2014 IEEE/ACM 7th International Conference on Utility and Cloud Computing > 896 - 901

2014 IEEE/ACM 7th International Conference on Utility and Cloud Computing (UCC)

With the development of Internet, online social networks and websites generate a large amount of data. At the same time, several distributed systems, represented by Hadoop, has been proposed to handle mass data. These systems provide both efficient and convenient way to construct different kinds of algorithms. Community detection, a traditional research area, is now facing the challenge of Big Data...

rozdział

Dependency network methods for Hierarchical Multi-label Classification of gene functions

Fabio Fabris, Alex A. Freitas

2014 IEEE Symposium on Computational Intelligence and Data Mining (CIDM) > 241 - 248

2014 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)

Hierarchical Multi-label Classification (HMC) is a challenging real-world problem that naturally emerges in several areas. This work proposes two new algorithms using a Probabilistic Graphical Model based on Dependency Networks (DN) to solve the HMC problem of classifying gene functions into pre-established class hierarchies. DNs are especially attractive for their capability of using traditional,...

rozdział

An Efficient Hierarchical Clustering Algorithm via Root Searching

Wenbo Xie, Zhen Liu

2014 IEEE 17th International Conference on Computational Science and Engineering > 279 - 284

2014 IEEE 17th International Conference on Computational Science and Engineering (CSE)

As an important branch of machine learning, clustering is wildly used for data analysis in various domains. Hierarchical clustering algorithm, one of the traditional clustering algorithms, has excellent stability yet relatively poor time complexity. In this paper, we proposed an efficient hierarchical clustering algorithm by searching given nodes' nearest neighbors iteratively, which depends on an...

rozdział

Drift Detection for Multi-label Data Streams Based on Label Grouping and Entropy

Zhongwei Shi, Yimin Wen, Chao Feng, Hai Zhao

2014 IEEE International Conference on Data Mining Workshop > 724 - 731

2014 IEEE International Conference on Data Mining Workshop (ICDMW)

Many real-world applications involve multi-label data streams, so effective concept drift detection methods should be able to consider the unique properties of multi-label stream data, such as label dependence. To deal with these challenges, we proposed an efficient and effective method to detect concept drift based on label grouping and entropy for multi-label data. Two methods are proposed to group...

rozdział

Online failure prediction for HPC resources using decentralized clustering

Alejandro Pelaez, Andres Quiroz, James C. Browne, Edward Chuah, więcej

2014 21st International Conference on High Performance Computing (HiPC) > 1 - 9

2014 21st International Conference on High Performance Computing (HiPC)

Ensuring high reliability of large-scale clusters is becoming more critical as the size of these machines continues to grow, since this increases the complexity and amount of interactions between different nodes and thus results in a high failure frequency. For this reason, predicting node failures in order to prevent errors from happening in the first place has become extremely valuable. A common...

rozdział

Incomplete Big Data Clustering Algorithm Using Feature Selection and Partial Distance

Fanyu Bu, Zhikui Chen, Qingchen Zhang, Xin Wang

2014 5th International Conference on Digital Home > 263 - 266

2014 5th International Conference on Digital Home (ICDH)

Incomplete data clustering plays an important role in the big data analysis and processing. Existing algorithms for clustering incomplete high-dimensional big data have low performances in both efficiency and effectiveness. The paper proposes an incomplete high-dimensional big data clustering algorithm based on feature selection and partial distance strategy. First, a hierarchical clustering-based...

rozdział

Model-free expectation maximization for divisive hierarchical clustering of multicolor flow cytometry data

Basak Esin Kokturk, Bilge Karacali

2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 267 - 272

2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

This paper proposes a new method for automated clustering of high dimensional datasets. The method is based on a recursive binary division strategy that successively divides an original dataset into distinct clusters. Each binary division is carried out using a model-free expectation maximization scheme that exploits the posterior probability computation capability of the quasi-supervised learning...

rozdział

Collaborative Filtering Recommendation Model Based on User's Credibility Clustering

Zhao Xu, Qiao Fuqiang

2014 13th International Symposium on Distributed Computing and Applications to Business, Engineering and Science > 234 - 238

2014 13th International Symposium on Distributed Computing and Applications to Business, Engineering and Science (DCABES)

Aiming at the long response time, inaccurate recommendation and cold-start problems that faced by present recommendation algorithm, this paper, taking movie recommendation system as an example, proposes a collaborative filtering recommendation model based on user's credibility clustering. This model divides recommendation process into offline and online phases. Offline, it uses the result of user's...

rozdział

Unsupervised Image Classification by Probabilistic Latent Semantic Analysis for the Annotation of Images

Abass A. Olaode, Golshah Naghdy, Catherine A. Todd

2014 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 8

2014 International Conference on Digital Image Computing: Techniques and Applications (DICTA)

Abstract-Image annotation has been identified to be a suitable means by which the semantic gap which has made the accuracy of Content-based image retrieval unsatisfactory be eliminated. However existing methods of automatic annotation of images depends on supervised learning, which can be difficult to implement due to the need for manually annotated training samples which are not always readily available...

rozdział

An efficient motif finding algorithm for large DNA data sets

Qiang Yu, Hongwei Huo, Xiaoyang Chen, Haitao Guo, więcej

2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 397 - 402

2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

The planted (l, d) motif discovery has been successfully used to locate transcription factor binding sites in dozens of promoter sequences over the past decade. However, there has not been enough work done in identifying (l, d) motifs in the next-generation sequencing (ChIP-seq) data sets, which contain thousands of input sequences and thereby bring new challenge to make a good identification in reasonable...

rozdział

Local binary pattern texture feature for satellite imagery classification

T. Vigneshl, K. K. Thyagharajan

2014 International Conference on Science Engineering and Management Research (ICSEMR) > 1 - 6

2014 International Conference on Science Engineering and Management Research (ICSEMR)

The Texture Feature Extraction (TFE) plays an important role in satellite image processing application. This paper proposes a novel method for Satellite Imagery Classification. Our proposed method is a combination of Local Binary Pattern (LBP) and Fuzzy c-means classification algorithm. Local Binary Pattern is calculated by thresholding a 3 × 3 neighborhood of each pixel by the center pixel value...

rozdział

Automatic segmentation of brain MR images for patients with different kinds of epilepsy

Jie Wang, Rui Wang, Su Zhang, Jing Ding, więcej

2014 International Conference on Smart Computing > 216 - 220

2014 International Conference on Smart Computing (SMARTCOMP)

Idiopathic generalized epilepsy (IGE) and symptomatic generalized epilepsy (SGE) are two kinds of generalized epilepsy. In this study, we discussed the methods of automatically segmentation of MR images for patients with these two kinds of epilepsy. K-Means clustering, expectation-maximization, and fuzzy c-means algorithms were employed to perform segmentation on brain images for patients with IGE...

rozdział

An optimized approach for unbalanced big data categorizing using fuzzy clustering

Saman Fallah Mehneh, JalilGazalan Toosi, Mehrdadjalali

2014 International Congress on Technology, Communication and Knowledge (ICTCK) > 1 - 4

2014 International Congress on Technology, Communication and Knowledge (ICTCK)

Big data is a set of very large and complex data that is hard to load on computers. The main challenge in big data world is related to their search, categorize and analyze specially, when they are unbalanced. Despite, there are a lot of works in the field of big data but analyzing unbalanced big data is still a fundamental challenge in this area. In this paper we try to solve the problem of RSIO-LFCM...

rozdział

Application of neural networks in early detection and diagnosis of Parkinson's disease

Rashidah. Funke Olanrewaju, Nur Syarafina Sahari, Aibinu A. Musa, Nashrul Hakiem

2014 International Conference on Cyber and IT Service Management (CITSM) > 78 - 82

2014 International Conference on Cyber and IT Service Management (CITSM)

Parkinson's disease (PD) is a chronic neurological progressive disorder caused by lack of the chemical dopamine in the brain. Up to today, there is still no cure or prevention for PD, and usually the disease worsens gradually over time. However, this disease can be controlled with some treatment, especially in the early stage. Hence, this study proposes a method in early detection and diagnosis of...

rozdział

An Ameliorated Partitioning Clustering Algorithm

Raghavi Chouhan, Abhishek Chauhan

2014 International Conference on Computational Intelligence and Communication Networks > 520 - 524

2014 International Conference on Computational Intelligence and Communication Networks (CICN)

Original K-medoid algorithm use to take initial medoids arbitrarily that bears on the resulting clusters and it leads to unstable and empty clusters which are no meaningful and also amount of iterations can be rather high so K-Medoid is not a substitute for big databases because of its computational complexity. Also the original k-means algorithm is computationally. Though existing algorithms usually...

rozdział

Improved Intrusion Detection in DDoS Applying Feature Selection Using Rank & Score of Attributes in KDD-99 Data Set

Aditya Harbola, Jyoti Harbola, Kunwar Singh Vaisla

2014 International Conference on Computational Intelligence and Communication Networks > 840 - 845

2014 International Conference on Computational Intelligence and Communication Networks (CICN)

In today's networked environment, massive volume of data being generated, gathered and stored in databases across the world. This trend is growing very fast, year after year. Today it is normal to find databases with terabytes of data, in which vital information and knowledge is hidden. The unseen information in such databases is not feasible to mine without efficient mining techniques for extracting...

rozdział

The SOM Based Improved K-Means Clustering Collaborative Filtering Algorithm in TV Recommendation System

Zhaocai Ma, Yi Yang, Fei Wang, Caihong Li, więcej

2014 Second International Conference on Advanced Cloud and Big Data > 288 - 295

2014 Second International Conference on Advanced Cloud and Big Data (CBD)

This paper aims on collaborative filtering (CF) in TV recommendation system which combines content-based and collaborative filtering recommendation mechanism, we propose an algorithm that using the self-organizing mapping (SOM) to optimize the improved k-means (IK) clustering in collaborative filtering. The whole clustering algorithm is divided into two phases: at the first stage, the quantity of...

rozdział

A Robust Density-Based Hierarchical Clustering Algorithm

Mohammad Mohammadi, Hamid Parvin, Naser Nematbakhsh, Ali Heidarzadegan

2014 13th Mexican International Conference on Artificial Intelligence > 89 - 92

2014 13th Mexican International Conference on Artificial Intelligence (MICAI)

Clustering the genes based on their expression patterns is one of the important subjects in analyzing microarray data. Discovering the genes co-expressed in particular conditions has been done by different clustering algorithms. In these methods, the similar genes are located in the same cluster. Thus, the closer the similar genes, the further the dissimilar ones will be. Each of the applied methods...

rozdział

Sample Selection Based Active Learning for Imbalanced Data

Ikram Chairi, Souad Alaoui, Abdelouahid Lyhyaoui

2014 Tenth International Conference on Signal-Image Technology and Internet-Based Systems > 645 - 651

2014 Tenth International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)

The majority of learning systems don't take in consideration real world data problem and consider that the training sets are perfect. However, in real world data, this hypothesis is not always true. In fact, real world data is characterized by many different problems like redundancy, incoherence or the big size of data. In this paper we focus on the problem of imbalance between class. Many solutions...

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania

Semi-supervised Kernel-Based Temporal Clustering

NSLPA: A Node Similarity Based Label Propagation Algorithm for Real-Time Community Detection

Dependency network methods for Hierarchical Multi-label Classification of gene functions

An Efficient Hierarchical Clustering Algorithm via Root Searching

Drift Detection for Multi-label Data Streams Based on Label Grouping and Entropy

Online failure prediction for HPC resources using decentralized clustering

Incomplete Big Data Clustering Algorithm Using Feature Selection and Partial Distance

Model-free expectation maximization for divisive hierarchical clustering of multicolor flow cytometry data

Collaborative Filtering Recommendation Model Based on User's Credibility Clustering

Unsupervised Image Classification by Probabilistic Latent Semantic Analysis for the Annotation of Images

An efficient motif finding algorithm for large DNA data sets

Local binary pattern texture feature for satellite imagery classification

Automatic segmentation of brain MR images for patients with different kinds of epilepsy

An optimized approach for unbalanced big data categorizing using fuzzy clustering

Application of neural networks in early detection and diagnosis of Parkinson's disease

An Ameliorated Partitioning Clustering Algorithm

Improved Intrusion Detection in DDoS Applying Feature Selection Using Rank & Score of Attributes in KDD-99 Data Set

The SOM Based Improved K-Means Clustering Collaborative Filtering Algorithm in TV Recommendation System

A Robust Density-Based Hierarchical Clustering Algorithm

Sample Selection Based Active Learning for Imbalanced Data

Opcje filtrowania

Data publikacji

Dostępność treści

Typ publikacji

Słowa kluczowe

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Dostępność treści

Typ publikacji

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu