Search results

chapter

On the architecture of a clustering platform for the analysis of big volumes of data

Ioan-Daniel Borlea, Radu-Emil Precup, Florin Dragan

2016 IEEE 11th International Symposium on Applied Computational Intelligence and Informatics (SACI) > 145 - 150

In recent years, the volume of data generated by mankind has increased, and so has its complexity. Since computers have evolved and provide more processing power, it is possible to carry out the real-time analysis of big volumes of data. This paper suggests the architecture of a big data processing platform called BigTim, which is able to run clustering...

chapter

New methods of pattern analysis in the study of Iris Anderson-Fisher Data

Alexey Myachin

2016 6th International Conference on Computers Communications and Control (ICCCC) > 97 - 102

A new method of pattern analysis, based on paired index comparison is introduced. Key properties of the method are described. The effectiveness is demonstrated on the Iris Anderson-Fisher Data.

chapter

SaFe-NeC: A scalable and flexible system for network data characterization

Daniele Apiletti, Elena Baralis, Tania Cerquitelli, Paolo Garza, more

NOMS 2016 - 2016 IEEE/IFIP Network Operations and Management Symposium > 812 - 816

Nowadays, large volumes of data and measurements are being continuously generated by computer and telecommunication networks, but such volumes make it difficult to extract meaningful knowledge from them. This paper presents SaFe-NeC, an innovative methodology for analyzing network traffic by exploiting data mining techniques, i.e. clustering and classification algorithms, focusing on self-learning...

chapter

A new task scheduling method for 2 level load balancing in homogeneous distributed system

Lipika Datta

2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT) > 4320 - 4325

A distributed system consists of several autonomous nodes. In a distributed system some of the nodes may be overloaded due to a large number of job arrivals while other nodes may remain idle without any processing. The performance of a distributed system depends crucially on dividing up work effectively among the computing nodes. So a way is needed to share load across all the computing nodes. In...

chapter

Anomaly detection in smart grid traffic data for home area network

Divya M Menon, N. Radhika

2016 International Conference on Circuit, Power and Computing Technologies (ICCPCT) > 1 - 4

Strengthening of Smart Grid functionalities has become the need of the 21st Century. Security evolves to be the primary concern at the deployment level of Smart Grids. Cyber security threats and vulnerabilities in the Smart Grid network need to be addressed before the deployment of the Smart Grid. Our proposed intrusion detection scheme identifies anomalies in the Smart Grid traffic and detects attacks...

chapter

GATE: Classification and clustering of text for semi-vowel/j/-morphophonemic approach

K. Sairam Reddy, K. Sasanka, S. Prasanna, R. Venkatesan

2016 International Conference on Circuit, Power and Computing Technologies (ICCPCT) > 1 - 7

In recent years, many successful machine learning applications have been developed. Classification & Clustering is one such. This application is cross-disciplinary, now that it is based on data mining algorithms on the technical side and on graphemes and morphophonemic on the linguistic side. It will thus map the correspondence between grapheme 〈y〉 and related phonemes via morphemes in a given...

chapter

Benchmarking of Distributed Computing Engines Spark and GraphLab for Big Data Analytics

Jian Wei, Kai Chen, Yi Zhou, Qu Zhou, more

2016 IEEE Second International Conference on Big Data Computing Service and Applications (BigDataService) > 10 - 13

In this paper we evaluate and compare two representative and popular distributed processing engines for large scale big data analytics, Spark and graph based engine GraphLab. We design a benchmark suite including representative algorithms and datasets to compare the performances of the computing engines, from performance aspects of running time, memory and CPU usage, network and I/O overhead. The benchmark...

chapter

Magnetic Resonance Image Segmentation Algorithm Based on Fuzzy Clustering

Guohua Li

2016 Eighth International Conference on Measuring Technology and Mechatronics Automation (ICMTMA) > 379 - 382

Standard fuzzy c-means algorithm only considers gray information and noise tolerance ability is poor. In order to overcome the drawbacks of traditional fuzzy c-means algorithm, a kind of improved ant colony algorithm is used to optimize fuzzy c-means. Then a new kind of image segmentation algorithm is put forward based on improved fuzzy c-means method. The experiment results show that the proposed...

chapter

Automatic brain tumor tissue detection based on hierarchical centroid shape descriptor in T1-weighted MR images

Elisee Ilunga-Mbuyamba, Juan Gabriel Avina-Cervantes, Dirk Lindner, Jesus Guerrero-Turrubiates, more

2016 International Conference on Electronics, Communications and Computers (CONIELECOMP) > 62 - 67

The brain tumor tissue detection allows to localize a mass of abnormal cells in a slice of Magnetic Resonance (MR). The automatization of this process is useful for post processing of the extracted region of interest like the tumor segmentation. In order to detect this abnormal growth of tissue in an image, this paper presents a novel scheme which uses a two-step procedure; the k-means method and...

chapter

MapReduce Model of Improved K-Means Clustering Algorithm Using Hadoop MapReduce

Nadeem Akthar, Mohd Vasim Ahamad, Shahbaaz Ahmad

2016 Second International Conference on Computational Intelligence & Communication Technology (CICT) > 192 - 198

In today's digital world, digital data is coming in and going out faster than ever before. This data is of no use until we extract some useful content from it. However, it is impractical and inefficient to use traditional database management techniques on big data. That is why big data technologies like Hadoop have come into existence. Hadoop is an open source framework, which can be used to process...

chapter

A comparative study of K-Means, DBSCAN and OPTICS

Hari Krishna Kanagala, V.V. Jaya Rama Krishnaiah

2016 International Conference on Computer Communication and Informatics (ICCCI) > 1 - 6

In view of today's information available, recent progress in data mining research has led to the development of various efficient methods for mining interesting patterns in large databases. It plays a vital role in the knowledge discovery process by analyzing the huge data from various sources and summarizing it into useful information. It is helpful for analyzing the volumes of data in different domains...

chapter

Outlier analysis and Detection using K-medoids with support vector machine

R.P.S. Manikandan, A.M. Kalpana, M. NaveenaPriya

2016 International Conference on Computer Communication and Informatics (ICCCI) > 1 - 7

Spatio - temporal methods is the process of innovations and finding the patterns from the knowledge representations through outliers. This kind of data representing the (i) the states of an object (ii) position or event in space at a particular period of time. It refers to the Objects whose attribute values are entirely different from its neighbourhood. Always their locations are different even the...

chapter

A survey on online Stock forum using subspace clustering

G. Shyamala, N. Pooranam

2016 International Conference on Computer Communication and Informatics (ICCCI) > 1 - 6

Financial stock Data Analysis and future prediction in terms of Sentiments is great challenge in the big data research. Among the unlabelled opinion, opinion classification in terms of unsupervised learning algorithm will lead to classification error as data is sparse and high dimensional. To overcome this problem, the sentiment analysis to extract the opinion of each word in the stock data has been...

chapter

A new network flow grouping method for preventing periodic shrew DDoS attacks in cloud computing

ZengGuang Liu, XiaoChun Yin, Hoon Jae Lee

2016 18th International Conference on Advanced Communication Technology (ICACT) > 66 - 69

Based on the investigation of periodic shrew distributed DoS Attacks among enormous normal end-users' flow in cloud computing, this paper proposed a new method to take frequency-domain characteristics from the autocorrelation sequence of network flow as clustering feature to group end-user flow data by BIRTH algorithm, and re-merge these clustering results into new groups by overcoming the deficiency...

chapter

DDBSCAN: Different Densities-Based Spatial Clustering of Applications with Noise

Mohammad F. Hassanin, Mohamed Hassan, Abdalla Shoeb

2015 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT) > 401 - 404

Recent advances in using computer with different fields of sciences produced huge amounts of data. These data represent as an analysis tool and key to overcome many problems. Clustering is a primary process to analyze the data as well as, it's a preprocessing step before other techniques like classification. Density-Based clustering algorithms have advantages like clustering any arbitrary shapes and...

chapter

A hybrid outlier detection algorithm based on partitioning clustering and density measures

Hamada Rizk, Sherin Elgokhy, Amany Sarhan

2015 Tenth International Conference on Computer Engineering & Systems (ICCES) > 175 - 181

Outlier detection is an important issue in the realm of data mining. Several applications rely on outlier detection, such as intrusion detection, fraud detection, medical and public health data, image processing, etc. Clustering-based outlier detection algorithms are considered the most important outlier detection approaches. They provide a high detection rate; however, they suffer from high false...

chapter

Clustering on Big Data Using Hadoop MapReduce

Nadeem Akthar, Mohd Vasim Ahamad, Shahbaz Khan

2015 International Conference on Computational Intelligence and Communication Networks (CICN) > 789 - 795

With the phenomenal increase in digital data, it is inefficient to run the traditional clustering algorithms on separate servers. To deal with this problem, researchers are migrating to distribute environment to implement the traditional clustering algorithms, more specifically K-means clustering. In traditional K Means Clustering, the problem of instability caused by the random initial centers exists...

chapter

Restructuring web search results by generating feedback session and clustering pseudo documents

Bhagyashri Girdhar Salve, R. B. Wagh

2015 Conference on Power, Control, Communication and Computational Technologies for Sustainable Growth (PCCCTSG) > 299 - 303

Restructuring web search results is the best solution for ambiguous queries entered into the search engine. When ambiguous queries are entered, the search engine gives multiple results for the same query, so users do not get specific and accurate information about what they really want, and it becomes difficult for a user to get specific information related to the submitted keyword. For this reason...

chapter

From probabilistic computing approach to probabilistic rough set for solving problem related to uncertainty under machine learning

Subrata Paul, Anirban Mitra, K. Govinda Rajulu

2015 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC) > 1 - 6

Box and Tiao suggested about the prior distribution, which according to them is hypothetically representing the knowledge about anonymous constraints prior to the availability of data. It acts as a productive role in Bayesian analysis. Further, allotments of such kind also represent former knowledge or relative ignorance [4]. The chance of occurrence or predictability is defined by the term Probability...

chapter

Hierarchical clustering technique for word sense disambiguation using Hindi WordNet

Nirali Patel, Bhargesh Patel, Rajvi Parikh, Brijesh Bhatt

2015 5th Nirma University International Conference on Engineering (NUiCONE) > 1 - 5

Word Sense Disambiguation (WSD) is crucial and its significance is prominent in every application of computational linguistics. WSD is a challenging problem of Natural Language Processing (NLP). Though there are lots of algorithms for WSD available, still little work is carried out for choosing optimal algorithm for that. Three approaches are available for WSD, namely, Knowledge-based approach, Supervised...

INFONA - science communication portal
