Search results

Items from 1 to 20 out of 20 results

chapter

MR-SNN: Design of parallel Shared Nearest Neighbor clustering algorithm using MapReduce

Sujing Wang, Christoph F. Eick

2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA)( > 312 - 315

2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA)

Shared Nearest Neighbor (SNN) Clustering is a well-established density based clustering algorithm, which can find clusters of different sizes, shapes, and densities. SNN has been widely adopted in numerous applications. As the size of dataset becomes extremely large nowadays, it is inefficient or even impossible for large-scale data to be stored and processed on a single machine. Therefore, the scalability...

chapter

A Recommendation System Algorithm Based on Large Scale Internet Environment

Xifeng Liu, Zhijian Wang, Feng Ye

2016 13th Web Information Systems and Applications Conference (WISA) > 108 - 112

2016 13th Web Information Systems and Applications Conference (WISA)

With the growing scale of the Internet, the amount of data is increasing rapidly as well. In order to improve the user experience, the recommendation system came into being. It recommends products to the user by analyzing the user's behavior. In the recommendation system, collaborative filtering algorithm is one of the most widely used algorithms. While the traditional collaborative filtering is no...

chapter

Wrapper scan chains balance algorithm base on twice assigned by difference and mean value

Deng Libao, Ning Fu, Qiao Liyan

2015 IEEE AUTOTESTCON > 52 - 57

2015 IEEE AUTOTESTCON

The core test application time is based on the maximum scan-in/scan-out chains. To design a well balance wrapper scan chains is an important approach to reduce the test application time and test cost. We propose a wrapper scan chains balance algorithm base on twice-assigned algorithm by the chains difference and mean value. By selecting a standard chain with its length L, calculating the mean value...

chapter

An efficient hybrid approach based on K-means and generalized fashion algorithms for cluster analysis

Akram Aghamohseni, Rasool Ramezanian

2015 AI & Robotics (IRANOPEN) > 1 - 7

2015 AI & Robotics (IRANOPEN)

Clustering is the process of grouping data objects into set of disjoint classes called clusters so that objects within a class are highly similar with one another and dissimilar with the objects in other classes. The k-means algorithm is a simple and efficient algorithm that is widely used for data clustering. However, its performance depends on the initial state of centroids and may trap in local...

chapter

An efficient projected partition algorithm based on the order among genes

Zesheng Sun, Yuhai Zhao, Dengke Meng, He Pan

2013 10th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD) > 700 - 707

2013 10th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Most existing methods perform the projected partition over gene expression data based on the untrue assumption of independence among genes. To address the problem, we propose two novel projected partition algorithms, PPA and PPA+. The basic idea of PPA is to take the order among genes as the criterion of phenotype structure discovery. Specially, in PPA, no any specific data distribution assumption...

chapter

An improved PTAS approximation algorithm for k-means clustering problem

Wang Shouqiang

2012 2nd International Conference on Uncertainty Reasoning and Knowledge Engineering > 90 - 94

2012 2nd International Conference on Uncertainty Reasoning and Knowledge Engineering (URKE)

This paper presented an improved (1+ε)-randomized approximation algorithm proposed by Ostrovsky. The running time of the improved algorithm is equation, where d,n denote the dimension and the number of the input points respectively, and α(<1) represents the separated coefficient. The successful probability is equation. Compared to the original algorithm, the improved algorithm runs more efficiency.

chapter

Network analyses of Beijing subways

Sixue Liu, Wenhao He

IEEE 10th International Conference on Industrial Informatics > 1022 - 1024

2012 10th IEEE International Conference on Industrial Informatics (INDIN)

By analyzing the topological structure of the subway network in Beijing and the people flowing in it, we build the ‘graph model based on the Sub-network Partition Algorithm’ and define the ‘weighted average path length’ to measure the time of the trips in each sub-network. The actual data from the survey proves the correctness of the model.

chapter

Designing 3D test wrappers for pre-bond and post-bond test of 3D embedded cores

Dean L. Lewis, Shreepad Panth, Xin Zhao, Sung Kyu Lim, more

2011 IEEE 29th International Conference on Computer Design (ICCD) > 90 - 95

2011 IEEE 29th International Conference on Computer Design (ICCD 2011)

3D integration is a promising new technology for tightly integrating multiple active silicon layers into a single chip stack. Both the integration of heterogeneous tiers and the partitioning of functional units across tiers leads to significant improvements in functionality, area, performance, and power consumption. Managing the complexity of 3D design is a significant challenge that will require...

chapter

Similar regular plans for mobile clients

John Tsiligaridis

Proceedings of the International Conference on > 1 - 6

2011 International Conference on Data Communication Networking (DCNET)

The broadcast problem including the plan design is considered. The data are inserted and numbered into customized size relations at a predefined order. The server ability to create a full, regular Broadcast Plan (RBP) with single and multiple channels, after some data transformations, is examined. The Basic Regular Algorithm (BRA) prepares an RBP and enables users to catch their items avoiding wasting...

chapter

Possibilistic C-Spherical Shell clustering algorithm based on conformai geometric algebra

Li Maokuan, Guan Jian

IEEE 10th INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS > 1347 - 1350

2010 10th International Conference on Signal Processing (ICSP 2010)

In this paper, a new Possibilistic C-Spherical Shell clustering (PCSS) algorithm based on conformal geometric algebra is proposed. The probability and simplicity of using the conformal geometric algebra to analyse spherical shell clustering algorithm is discussed firstly. By the conformal geometric algebra theory, patterns and prototypes in spherical shell clustering can be represented as vectors,...

chapter

CML model based max-subset shedding for sensor streams multi-joins under limited resources

Wanchang Jiang, Cong Huo

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 5 > 2250 - 2254

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Join queries over wireless sensor data streams need to be processed immediately to keep up with the input streams. Many existing algorithms do not solve the problem in context of both limited CPU and memory resources. In this paper, we propose two CML statistic model based approximate sliding window multi-joins algorithms for the system that both CPU and memory is limited, and a maximum subset of...

chapter

A Coarse Grain Reconfigurable Architecture for sequence alignment problems in bio-informatics

Pei Liu, Ahmed Hemani

2010 IEEE 8th Symposium on Application Specific Processors (SASP) > 50 - 57

2010 IEEE 8th Symposium on Application Specific Processors (SASP 2010)

A Coarse Grain Reconfigurable Architecture (CGRA) tailored for accelerating bio-informatics algorithms is proposed. The key innovation is a light weight bio-informatics processor that can be reconfigured to perform different Add Compare and Select operations of the popular sequencing algorithms. A programmable and scalable architectural platform instantiates an array of such processing elements and...

chapter

A new algorithm for small-large table outer joins in parallel DBMS

Yu Xu, Pekka Kostamaa

2010 IEEE 26th International Conference on Data Engineering (ICDE 2010) > 1018 - 1024

2010 IEEE 26th International Conference on Data Engineering (ICDE 2010)

Large enterprises have been relying on parallel database management systems (PDBMS) to process their ever-increasing data volume and complex queries. Business intelligence tools used by enterprises frequently generate a large number of outer joins and require high performance from the underlying database systems. A common type of outer joins in business applications is the small-large table outer...

chapter

An Enhancement of K-means Clustering Algorithm

Jirong Gu, Jieming Zhou, Xianwei Chen

2009 International Conference on Business Intelligence and Financial Engineering > 237 - 240

2009 International Conference on Business Intelligence and Financial Engineering (BIFE)

K-means clustering algorithm and one of its enhancements are studied in this paper. Clustering is the classification of objects into different groups, or more precisely, the partitioning of a data set into subsets (clusters), so that the data in each subset (ideally) share some common trait - often proximity according to some defined distance measure. A popular technique for clustering is based on...

chapter

Extracting activated regions of fMRI data using unsupervised learning

H. Davoudi, A. Taalimi, E. Fatemizadeh

2009 International Joint Conference on Neural Networks > 641 - 645

2009 International Joint Conference on Neural Networks (IJCNN 2009 - Atlanta)

Clustering approaches are going to efficiently define the activated regions of the brain in fMRI studies. However, choosing appropriate clustering algorithms and defining optimal number of clusters are still key problems of these methods. In this paper, we apply an improved version of Growing Neural Gas algorithm, which automatically operates on the optimal number of clusters. The decision criterion...

chapter

Text Clustering via Particle Swarm Optimization

Yanping Lu, Shengrui Wang, Shaozi Li, Changle Zhou

2009 IEEE Swarm Intelligence Symposium > 45 - 51

2009 IEEE Swarm Intelligence Symposium

This paper presents an approach which extends a particle swarm optimizer for variable weighting (PSOVW) to handle the problem of text clustering, called text clustering via particle swarm optimization (TCPSO). PSOVW has been exploited for evolving optimal feature weights for clusters and has demonstrated to improve the clustering quality of high-dimensional data. However, when applying it for text...

chapter

Clustering Based on Data Attribute Partition and Its Visualization

Y. Ren, A.L. Culen

2009 Second International Conferences on Advances in Computer-Human Interactions > 13 - 18

Second International Conferences on Advances in Computer-Human Interactions. ACHI 2009

Clustering algorithms are the core technique of data mining, machine learning, pattern matching, bioinformatics and a number of other fields. This paper proposes a new clustering method based on attribute partitioning and a novel data visualization method. In a nutshell, the idea for our method is based on two steps: 1) cluster data set using primary and secondary attributes of data; 2) map color...

chapter

Digital Watermarking Algorithm for Print-and-Scan Process Used for Printed Matter Anti-counterfeit

Dongcheng Shi, Qi Wang, Chao Liang

2008 Congress on Image and Signal Processing > 5 > 697 - 701

International Congress on Image and Signal Processing (CISP 2008)

In order to solve the pixel divergence problem brought by print and scan process, propose the algorithm that applying stretching contrast grade to adjust image pixel divergence. The parameter setting of this adjusting method uses peak signal noise ratio as standard of measurement, even though using different printer and scanner, we can also get good adjusting image. Simulation results show that the...

chapter

Hierarchical memory with block transfer

Alok Aggarwal, Ashok K. Chandra, Marc Snir

28th Annual Symposium on Foundations of Computer Science (sfcs 1987) > 204 - 216

28th Annual Symposium on Foundations of Computer Science

In this paper we introduce a model of Hierarchical Memory with Block Transfer (BT for short). It is like a random access machine, except that access to location x takes time f(x), and a block of consecutive locations can be copied from memory to memory, taking one unit of time per element after the initial access time. We first study the model with f(x) = xα for 0 ≪ α ≪ 1. A tight bound of θ(n log...