Advanced search

chapter

Parallel k-modes algorithm based on MapReduce

Guo Tao, Ding Xiangwu, Li Yefeng

2015 Third International Conference on Digital Information, Networking, and Wireless Communications (DINWC) > 176 - 179

2015 Third International Conference on Digital Information, Networking, and Wireless Communications (DINWC)

K-modes is a typical categorical clustering algorithm. Firstly, we improve the process of K-modes: when allocating categorical objects to clusters, the number of each attribute item in clusters is updated, so that the new modes of clusters can be computed after reading the whole dataset once. In order to make K-modes capable for large-scale categorical data, we then implement K-modes on Hadoop using...

chapter

High Breakdown Bundle Adjustment

Anders Eriksson, Mats Isaksson, Tat-Jun Chin

2015 IEEE Winter Conference on Applications of Computer Vision > 310 - 317

2015 IEEE Winter Conference on Applications of Computer Vision (WACV)

Identifying the parameters of a model such that it best fits an observed set of data points is fundamental to the majority of problems in computer vision. This task is particularly demanding when portions of the data has been corrupted by gross outliers, measurements that are not explained by the assumed distributions. In this paper we present a novel method that uses the Least Quantile of Squares...

chapter

Chinese-English SMT for cross-language dialogue agent support

Xiangyu Duan, Min Zhang

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific > 1 - 4

2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Advances in Statistical Machine Translation (SMT) for breaking language barrier have been seen in recent years, and there is huge demand on cross-language dialog communication between people. In this paper we propose to leverage SMT for supporting cross-language dialog communication. Several techniques are applied to improve the performance on a dialog domain, including rescoring, system combination,...

chapter

A Hadoop-Based Output Analyzer for Large-Scale Simulation Data

Kangsun Lee, Joonho Park

2014 IEEE Fourth International Conference on Big Data and Cloud Computing > 197 - 200

2014 IEEE International Conference on Big Data and Cloud Computing (BdCloud)

As modern simulations involve large inputs and outputs over the network, there is an increasing need to store, manage and analyze the massive datasets, efficiently. In this paper, we present ARLS (After action Reviewer for Large-Scale simulation data), a Hadoop-based output analysis tool for large-scale simulation datasets. ARLS clusters distributed storages using Hadoop and analyzes the large-scale...

chapter

Prediction of university enrollment using computational intelligence

Ryan Stallings, Biswanath Samanta

2014 IEEE Symposium on Swarm Intelligence > 1 - 8

2014 IEEE Symposium On Swarm Intelligence (SIS)

This work presents a study on prediction of university enrollment using three computational intelligence (CI) techniques. The enrollment forecasting has been considered as a form of time series prediction using CI techniques that include an artificial neural network (ANN), a neuro-fuzzy inference system (ANFIS) and an aggregated fuzzy time series model. A novel form of ANN, namely, single multiplicative...

chapter

MapReduce Model Implementation on MPI Platform

Guo Yucheng

2014 13th International Symposium on Distributed Computing and Applications to Business, Engineering and Science > 88 - 91

2014 13th International Symposium on Distributed Computing and Applications to Business, Engineering and Science (DCABES)

With development of Multicore clusters the taskscheduling problem in heterogeneous cluster has become hot point of research. The method to solve this problem in Cloud computing is virtualization, which can make the heterogeneous nodes being isomorphic and then using MapReduce model for task scheduling in isomorphic nodes. But the approach has some shortcomings: virtualization itself will cause the...

chapter

Cyberinfrastructure: Applications and challenges

Qiuhui Tong, Bo Yuan, Xiu Li

2014 International Conference on Smart Computing > 74 - 80

2014 International Conference on Smart Computing (SMARTCOMP)

This paper presents a comprehensive review of Cyberinfrastructure (CI), an emerging collaborative research environment, including its representative applications in four science communities around the world. An in-depth analysis is also conducted to reveal the key functions and desired features that can be expected from modern CI systems.

chapter

A framework for management of massive knowledge in cloud environment

Chaoshi Wang, Chun Zhao, Lin Zhang

2014 7th International Conference on Biomedical Engineering and Informatics > 843 - 847

2014 7th International Conference on Biomedical Engineering and Informatics (BMEI)

The word “Cloud” has become more and more popular these days. One of its applications, Cloud for manufacturing, is also proposed as we called Cloud Manufacturing. It is realized by setup a public service cloud platform which shares manufacturing resources and knowledge. As the big data era comes, the amount of both resources and knowledge in the platform may increase much more rapidly than ever before...

chapter

A data reusing strategy based on hive

Heng Xie, Mei Wang, Jiajin Le

2014 International Conference on Data Science and Advanced Analytics (DSAA) > 367 - 373

2014 International Conference on Data Science and Advanced Analytics (DSAA)

Large scale data process has emerged as an important issue for concerned researchers. By reusing calculation results, the efficiency of large scale data process can be improved greatly. This paper proposes an efficient data reusing strategy based on the data warehouse tool-Hive, which works on MapReduce framework. Since the intermediate calculation results have been stored in DFS by different jobs...

chapter

Path prediction based on second-order Markov chain for the opportunistic networks

Yubo Deng, Wei Liu, Lei Zhang, Yongping Xiong, more

2014 IEEE Computers, Communications and IT Applications Conference > 116 - 120

2014 IEEE Computing, Communications and IT Applications Conference (ComComAp)

In the opportunistic networks, nodes carry and store the data and forward it until they encounter each other. How to choose an appropriate opportunity to forward data is pivotal for nodes' routing in this type of networks. Since nodes currently will keep a regular movement state in the scene of this paper discussed, forecasting a node's moving track in the near future would be very helpful. Through...

chapter

The anatomy of a search and mining system for digital humanities

Martyn Harris, Mark Levene, Dell Zhang, Dan Levene

IEEE/ACM Joint Conference on Digital Libraries > 165 - 168

2014 IEEE/ACM Joint Conference on Digital Libraries (JCDL)

Samtla (Search And Mining Tools with Linguistic Analysis) is an online integrated research environment designed in collaboration with historians and linguists to facilitate the study of digitised texts written in any language. It currently supports the research of two corpora: the Genizah collection held by the Taylor-Schechter Genizah Research Unit in Cambridge University, and a collection of Aramaic...

chapter

Three Statistical Approaches to Sessionizing Network Flow Data

Patrick Rubin-Delanchy, Daniel J. Lawson, Melissa J. Turcotte, Nicholas Heard, more

2014 IEEE Joint Intelligence and Security Informatics Conference > 244 - 247

2014 IEEE Joint Intelligence and Security Informatics Conference (JISIC)

The network traffic generated by a computer, or a pair of computers, is often well modelled as a series of sessions. These are, roughly speaking, intervals of time during which a computer is engaging in the same, continued, activity. This article explores a variety of statistical approaches to re-discovering sessions from network flow data using timing alone. Solutions to this problem are essential...

chapter

Statistical Frameworks for Detecting Tunnelling in Cyber Defence Using Big Data

Daniel J. Lawson, Patrick Rubin-Delanchy, Nicholas Heard, Niall M. Adams

2014 IEEE Joint Intelligence and Security Informatics Conference > 248 - 251

2014 IEEE Joint Intelligence and Security Informatics Conference (JISIC)

How can we effectively use costly statistical models in the defence of large computer networks? Statistical modelling and machine learning are potentially powerful ways to detect threats as they do not require a human level understanding of the attack. However, they are rarely applied in practice as the computational cost of deploying all but the most simple algorithms can become implausibly large...

chapter

Message from the 3M4SE 2014 Workshop Chairs

Marten van Sinderen, Luis Ferreira Pires, Maria-Eugenia Iacob

2014 IEEE 18th International Enterprise Distributed Object Computing Conference Workshops and Demonstrations > 333 - 334

2014 IEEE 18th International Enterprise Distributed Object Computing Conference Workshops and Demonstrations (EDOCW)

This section of the volume contains the proceedings of the 3M4SE 2014 workshop, held on September 1-2, 2014, in Ulm, Germany, in conjunction with the 18th IEEE International EDOC Conference on Enterprise Computing, EDOC 2014.

chapter

A Rasch analysis of Math tests using M.In.E.R.Va. platform

Grazia Messineo, Salvatore Vassallo

2014 International Conference on Education Technologies and Computers (ICETC) > 12 - 17

2014 International Conference on Education Technologies and Computers (ICETC)

We present the M.In.E.R.Va. project, an online tool to help students (both attending Secondary Schools and Universities) recovering their deficiencies in Mathematics. We also present an analysis of the answers given by the students, performed using the Rasch model, which allows to investigate both students' results and the validity of the items.

chapter

An empiric weight computation for record linkage using linearly combined fields' similarity scores

Xinran Li, Aline Guttmann, Jacques Demongeot, Jean-Yves Boire, more

2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society > 1346 - 1349

2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Record linkage is the task of identifying which records from one or more data sources refer to the same entity. Many record linkage methods were introduced and applied over the last decades. In general, the principle is to compare a range of available identifier fields in record pairs among different data sources, in order to make a linkage decision. The Fellegi-Sunter probabilistic record linkage...

chapter

Translation and Projection Algorithms for Multiple Layer Images with a Hexadecimal Grid Graph Model

Koichi Anada, Taikoh Ikeda, Shinji Koka, Akihito Kubota, more

2014 IIAI 3rd International Conference on Advanced Applied Informatics > 551 - 552

2014 IIAI 3rd International Conference on Advanced Applied Informatics (IIAIAAI)

We deal with data structures and algorithms suitable for the displaying of multiple binary raster images. Multiple images are dealt as a multiple layer image. In this paper, we introduce three algorithms for operation of images represented by hexadeci-grid as multiple layer images and show some examples for our introduced alghrithms.

chapter

Computer anomaly detection based on the moving averages of the power series distributed random sequence

Deqiang Chen

2014 11th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD) > 312 - 316

2014 11th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

In order to quickly determine the distribution of anomaly detection model based on small amounts of collected data, the moving relative entropy density deviation method (MREDD) is introduced to test the power series distributed random sequence. Through the moving averages of data analysis and comparison, the anomaly detection models can quickly be established. Experimental results show that this method...

chapter

Topic selection in latent dirichlet allocation

Biao Wang, Yang Liu, Zelong Liu, Maozhen Li, more

2014 11th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD) > 756 - 760

2014 11th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Latent Dirichlet Allocation (LDA) has been widely applied to text mining. LDA is a probabilistic topic model which processes documents as the probability distribution of topics. One challenging issue in application of LDA is to select the optimal number of topics in LDA model. This paper presents a topic selection method which considers the density of each topic and computes the most unstable topic...

chapter

Security Analysis of Delegable and Proxy Provable Data Possession in Public Cloud Storage

Yongjun Ren, Jian Shen, Jin Wang, Liming Fang

2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing > 795 - 798

2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP)

Cloud storage is now an important developmenttrend in information technology. However, informationsecurity has become an important problem to impede it forcommercial application, such as data confidentiality, integrity,and availability. In this paper, we revisit the two private PDPschemes. We show that the property of correctness cannot beachieved when active adversaries are involved in these auditingsystems...

INFONA - science communication portal

Advanced search

Advanced search in people

Parallel k-modes algorithm based on MapReduce

High Breakdown Bundle Adjustment

Chinese-English SMT for cross-language dialogue agent support

A Hadoop-Based Output Analyzer for Large-Scale Simulation Data

Prediction of university enrollment using computational intelligence

MapReduce Model Implementation on MPI Platform

Cyberinfrastructure: Applications and challenges

A framework for management of massive knowledge in cloud environment

A data reusing strategy based on hive

Path prediction based on second-order Markov chain for the opportunistic networks

The anatomy of a search and mining system for digital humanities

Three Statistical Approaches to Sessionizing Network Flow Data

Statistical Frameworks for Detecting Tunnelling in Cyber Defence Using Big Data

Message from the 3M4SE 2014 Workshop Chairs

A Rasch analysis of Math tests using M.In.E.R.Va. platform

An empiric weight computation for record linkage using linearly combined fields' similarity scores

Translation and Projection Algorithms for Multiple Layer Images with a Hexadecimal Grid Graph Model

Computer anomaly detection based on the moving averages of the power series distributed random sequence

Topic selection in latent dirichlet allocation

Security Analysis of Delegable and Proxy Provable Data Possession in Public Cloud Storage

Filter options

Publication date

Keywords

INFONA - science communication portal

Advanced search

Advanced search in people

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options