Search results

Items from 1 to 12 out of 12 results

chapter

Efficient spark-based framework for big geospatial data query processing and analysis

Isam Mashhour Aljawarneh, Paolo Bellavista, Antonio Corradi, Rebecca Montanari, more

2017 IEEE Symposium on Computers and Communications (ISCC) > 851 - 856

2017 IEEE Symposium on Computers and Communications (ISCC)

The exponential amount of geospatial data that has been accumulated in an accelerated pace has inevitably motivated the scientific community to examine novel parallel technologies for tuning the performance of spatial queries. Managing spatial data for an optimized query performance is particularly a challenging task. This is due to the growing complexity of geometric computations involved in querying...

chapter

Hadoop cluster with FPGA-based hardware accelerators for K-means clustering algorithm

Ching-Che Chung, Yu-Hsin Wang

2017 IEEE International Conference on Consumer Electronics - Taiwan (ICCE-TW) > 143 - 144

2017 IEEE International Conference on Consumer Electronics - Taiwan (ICCE-TW)

In this paper, the implementation of the K-means clustering algorithm on a Hadoop cluster with FPGA-based hardware accelerators is presented. The proposed design follows MapReduce programming model and uses Hadoop distribution file system (HDFS) for storing large dataset. The proposed FPGA-based hardware accelerator for speed up the K-means clustering algorithm is implemented on Xilinx VC707 evaluation...

chapter

A Parallel K-Medoids Algorithm for Clustering based on MapReduce

M. Omair Shafiq, Eric Torunski

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA) > 502 - 507

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

One of the most important machine learning techniques include clustering of data into different clusters or categories. There are several decent algorithms and techniques that exist to perform clustering on small to medium scale data. In the era of Big Data and with applications being large-scale and data-intensive in nature, there is a significant increment in volume, variety and velocity of data...

chapter

Application of meteorological big data

Xi Guo

2016 16th International Symposium on Communications and Information Technologies (ISCIT) > 273 - 279

2016 16th International Symposium on Communications and Information Technologies (ISCIT)

The abundant aspects of big data and it's technology are increasing due to new methods of fetching data and diverse needs. Meteorological data is also the source of big data in terms of volume, variety, veracity and velocity, and it includes structured, unstructured and hybrid forms. This paper aims to apply Hadoop architecture and MapReduce algorithm into meteorological big data. It also describes...

chapter

A modified hybrid Fuzzy clustering method for big data

Amir Khoshkbarchi, Ali Kamali, Mehdi Amjadi, Maryam Amir Haeri

2016 8th International Symposium on Telecommunications (IST) > 196 - 201

2016 8th International Symposium on Telecommunications (IST)

Clustering is among the most common data mining techniques and Fuzzy clustering can model the world even more realistically and more precisely. One of the most favorable fuzzy clustering methods is the Fuzzy C-Means (FCM) algorithm, which is actually identical to the (original) K-Means clustering algorithm fueled with a fuzzy flavor. However, there are some issues with the fuzzy clustering methods;...

chapter

MapReduce Model of Improved K-Means Clustering Algorithm Using Hadoop MapReduce

Nadeem Akthar, Mohd Vasim Ahamad, Shahbaaz Ahmad

2016 Second International Conference on Computational Intelligence & Communication Technology (CICT) > 192 - 198

2016 Second International Conference on Computational Intelligence & Communication Technology (CICT)

In today's digital world scenario, digital data is coming in and going out faster than ever before. This data is of no use until we extract some useful content from it. But, it is impractical and inefficient to use traditional database management techniques on big data. That's why, big data technologies like Hadoop comes to existence. Hadoop is an open source framework, which can be used to process...

chapter

Clustering on Big Data Using Hadoop MapReduce

Nadeem Akthar, Mohd Vasim Ahamad, Shahbaz Khan

2015 International Conference on Computational Intelligence and Communication Networks (CICN) > 789 - 795

2015 International Conference on Computational Intelligence and Communication Networks (CICN)

With the phenomenal increase in digital data, it is inefficient to run the traditional clustering algorithms on separate servers. To deal with this problem, researchers are migrating to distribute environment to implement the traditional clustering algorithms, more specifically K-means clustering. In traditional K Means Clustering, the problem of instability caused by the random initial centers exists...

chapter

An algorithm for visualization of big data in a two-dimensional space

Bo Wu, B. M. Wilamowski

IECON 2015 - 41st Annual Conference of the IEEE Industrial Electronics Society > 53 - 58

IECON 2015 - 41st Annual Conference of the IEEE Industrial Electronics Society

In this paper, a new algorithm for visualization of high-multidimensional data is described. The algorithm follows several steps. At first, centers representing several categories are selected, and Euclidean distances between these centers are calculated in a high-dimensional space. Then these centers are placed in a 2-dimensional space in such a way that distances in this 2-dimensional space are...

chapter

k-Means Performance Improvements with Centroid Calculation Heuristics Both for Serial and Parallel Environments

Jeyhun Karimov, Murat Ozbayoglu, Erdogan Dogdu

2015 IEEE International Congress on Big Data > 444 - 451

2015 IEEE International Congress on Big Data (BigData Congress)

K-means is the most widely used clustering algorithm due to its fairly straightforward implementations in various problems. Meanwhile, when the number of clusters increase, the number of iterations also tend to slightly increase. However there are still opportunities for improvement as some studies in the literature indicate. In this study, improved implementations of k-means algorithm with a centroid...

chapter

An optimized approach for unbalanced big data categorizing using fuzzy clustering

Saman Fallah Mehneh, JalilGazalan Toosi, Mehrdadjalali

2014 International Congress on Technology, Communication and Knowledge (ICTCK) > 1 - 4

2014 International Congress on Technology, Communication and Knowledge (ICTCK)

Big data is a set of very large and complex data that is hard to load on computers. The main challenge in big data world is related to their search, categorize and analyze specially, when they are unbalanced. Despite, there are a lot of works in the field of big data but analyzing unbalanced big data is still a fundamental challenge in this area. In this paper we try to solve the problem of RSIO-LFCM...

chapter

A Parallel Method for Rough Entropy Computation Using MapReduce

Si-Yuan Jing, Jin Yang, Kun She

2014 Tenth International Conference on Computational Intelligence and Security > 707 - 710

2014 Tenth International Conference on Computational Intelligence and Security (CIS)

Rough set theory has been proven to be a successful computational intelligence tool. Rough entropy is a basic concept in rough set theory and it is usually used to measure the roughness of information set. Existing algorithms can only deal with small data set. Therefore, this paper proposes a method for parallel computation of entropy using MapReduce, which is hot in big data mining. Moreover, corresponding...

chapter

PSCAN: A Parallel Structural Clustering Algorithm for Big Networks in MapReduce

Weizhong Zhao, V. Martha, Xiaowei Xu

2013 IEEE 27th International Conference on Advanced Information Networking and Applications (AINA) > 862 - 869

2013 IEEE 27th International Conference on Advanced Information Networking and Applications (AINA)

Big data such as complex networks with over millions of vertices and edges is infeasible to process using conventional computation. MapReduce is a programming model that empowers us to analyze big data in a cluster of computers. In this paper we propose a Parallel Structural Clustering Algorithm for big Networks (PSCAN) in MapReduce for the detection of clusters or community structures in big networks...

Filter options

Data set:
ieee
Keywords:
CLUSTERING ALGORITHMS
COMPUTERS
BIG DATA
Publication type:
book

Publication date

Set your own date range

Keywords

MAPREDUCE (7)
ALGORITHM DESIGN AND ANALYSIS (5)
CLUSTERING (5)
HADOOP (5)
DATA MINING (4)
ACCURACY (2)
COMPUTATIONAL MODELING (2)
DATA MODELS (2)
K-MEANS CLUSTERING (2)
AGRICULTURE (1)
APACHE HADOOP (1)
BENCHMARK TESTING (1)
CLASSIFICATION (1)
CLASSIFICATION ALGORITHMS (1)
CLUSTERING METHODS (1)
CLUSTERING; K-MEDOIDS; BIG DATA; MAPREDUCE (1)
COMMUNITIES (1)
COMMUNITY STRUCTURES (1)
COMPLEXITY THEORY (1)
CONVERGENCE (1)
DATA VISUALIZATION (1)
DATABASES (1)
DISTRIBUTED COMPUTING (1)
EDUCATIONAL INSTITUTIONS (1)
ENCODING (1)
ENTROPY (1)
FCM (1)
FIELD PROGRAMMABLE GATE ARRAYS (1)
GEOSPATIAL ANALYSIS (1)
HARDWARE (1)
INDUSTRIES (1)
INFORMATION ENTROPY (1)
INFORMATION TECHNOLOGY (1)
K-MEANS (1)
MACHINE LEARNING ALGORITHMS (1)
MAP-REDUCE (1)
METEOROLOGY (1)
MONITORING (1)
NETWORK CLUSTERING ALGORITHMS (1)
OPTIMIZATION (1)
PARALLEL ALGORITHMS (1)
PARTITIONING ALGORITHMS (1)
POWER GRID (1)
PROGRAM PROCESSORS (1)
PROGRAMMING (1)
PSO (1)
QUERYING SPATIAL DATA (1)
ROUGH SET THEORY (1)
SERVERS (1)
SET THEORY (1)
SHAPE (1)
SOFTWARE ENGINEERING (1)
SPARK (1)
SPARKS (1)
SPATIAL DATABASES (1)
STANDARDS (1)
TEXT CLASSIFICATION (1)
TIME FACTORS (1)
TOURISM (1)
TRAINING (1)
TRANSPORTATION (1)
TWITTER (1)
UNSUPERVISED LEARNING (1)
VISUALIZATION (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options