Search results

chapter

Providing Big Data Applications with Fault-Tolerant Data Migration across Heterogeneous NoSQL Databases

Marco Scavuzzo, Damian A. Tamburri, Elisabetta di Nitto

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > 26 - 32

The recent growing interest in highly available data-intensive applications has sparked the need for flexible and portable storage technologies, e.g., NoSQL databases. Unfortunately, the lack of standard interfaces and architectures for NoSQL databases makes it difficult and expensive to create portable applications, which results in vendor lock-in. Building on previous work, we aim at providing guaranteed fault-tolerant...

chapter

Conceptual model of automatic system of near duplicates detection in electronic documents

Andrii Biloshchytskyi, Alexander Kuchansky, Svitlana Biloshchytska, Anastasiia Dubnytska

2017 14th International Conference The Experience of Designing and Application of CAD Systems in Microelectronics (CADSM) > 381 - 384

This article considers a conceptual model of near-duplicate detection in electronic documents. The model separates the different data types in a document (text, numerical sequences, images, diagrams and mathematical formulas) and applies special tools to their analysis, making it possible to identify similarities between fragments of the incoming document and documents...

chapter

Research on the Classification of High Dimensional Imbalanced Data Based on the Optimizational Random Forest Algorithm

Su Bo

2017 9th International Conference on Measuring Technology and Mechatronics Automation (ICMTMA) > 228 - 231

The random forest algorithm is a new classification and prediction model algorithm. So far, there is not much research on the problem of unbalanced data for random forest classification, and no direct and effective method exists. On the basis of a feature selection algorithm based on a correlation measure, the integration feature selection method was helpful in increasing the selection probability of classification...

chapter

The Risk Measurement and Empirical Study of China's CSI 300 Index Based on GARCH Model Family

Yuqing Zhang, Zuoquan Zhang, Dingyuan Fan

2016 6th International Conference on Digital Home (ICDH) > 221 - 226

In this paper, we establish a joint GARCH-GED-VaR model and carry out an empirical analysis of the CSI 300 index to study risk early warning. First, we introduce several GARCH models, the GED distribution, and the VaR and CVaR risk measurement models, and establish the joint GARCH-GED-VaR model. Second, we select the CSI 300 index closing price from 2005 to 2014 as the sample, make the model fitting and parameter...

chapter

The Research and Application of Customer Segmentation on E-Commerce Websites

Xixi He, Chen Li

2016 6th International Conference on Digital Home (ICDH) > 203 - 208

This article constructs a three dimensional customer segmentation model based on customer lifetime value, customer satisfaction and customer activity, which more accurately divides customers into different groups. The corresponding variables are obtained by RFM model, Kano model and BG/NBD model. The customer segmentation model provides ten groups of customers with corresponding marketing strategies,...

chapter

The rapid prediction of playing index in accumulation on video websites

Fu-Lian Yin, Xue-Song Bai, Jian-Ping Chai

2016 13th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP) > 184 - 188

In order to avoid the problems in traditional forecasting methods which demand too much of various data types and have difficulty in training models, this paper proposes two rapid prediction methods which are called “One by One Comparison” and “Regression as a Whole”. By using the two methods, as long as you get playing index in the first few days of a TV drama, the total playing index accumulation...

chapter

Forecasting house price index of China using dendritic neuron model

Ying Yu, Shuangbao Song, Tianle Zhou, Hanaki Yachi, et al.

2016 International Conference on Progress in Informatics and Computing (PIC) > 37 - 41

Whether the Chinese housing market continues to prosper is related to the development of China, and it also has an impact on world finance. Forecasting the house price index is therefore very important and challenging. In this paper we propose an unsupervised learnable neuron model (DNM) by including the nonlinear interactions between excitation and inhibition on dendrites. We use...

chapter

Learning partial power grasp with task-specific contact

Miao Li

2016 IEEE International Conference on Robotics and Biomimetics (ROBIO) > 337 - 343

Generating robotic grasps for given tasks is a difficult problem. This paper proposes a learning-based approach to generate suitable partial power grasp for a set of tool-using tasks. First a number of valid partial power grasps are sampled in simulation and encoded as a probabilistic model, which encapsulates the relations among the task-specific contact, the graspable object feature and the finger...

chapter

A comparative study on code-mixed data of Indian social media vs formal text

Prakash Ranjan, Bharathi Raja, Ruba Priyadharshini, Rakesh Chandra Balabantaray

2016 2nd International Conference on Contemporary Computing and Informatics (IC3I) > 608 - 611

This paper presents comparative experimental results for code-mixed data versus normal text. We first identify the languages present in social media text. In the case of code-mixed data, existing language detectors fail to detect the language at the word level because of the use of Roman script to write their own language. So we bootstrap the language identification step and we calculate the Code Mixed Index...

chapter

MCNC: Multi-Channel Nonparametric Clustering from heterogeneous data

Thanh-Binh Nguyen, Vu Nguyen, Svetha Venkatesh, Dinh Phung

2016 23rd International Conference on Pattern Recognition (ICPR) > 3633 - 3638

Bayesian nonparametric (BNP) models have recently become popular due to their flexibility in identifying the unknown number of clusters. However, they have difficulties handling heterogeneous data from multiple sources. Existing BNP methods either treat each of these sources independently, and hence do not benefit from the correlating information between them, or require to explicitly specify data...

chapter

Clustering for point pattern data

Nhat-Quang Tran, Ba-Ngu Vo, Dinh Phung, Ba-Tuong Vo

2016 23rd International Conference on Pattern Recognition (ICPR) > 3174 - 3179

Clustering is one of the most common unsupervised learning tasks in machine learning and data mining. Clustering algorithms have been used in a plethora of applications across several scientific fields. However, there has been limited research in the clustering of point patterns - sets or multi-sets of unordered elements - that are found in numerous applications and data sources. In this paper, we...

chapter

WISDOM: Weighted incremental spatio-temporal multi-task learning via tensor decomposition

Jianpeng Xu, Jiayu Zhou, Pang-Ning Tan, Xi Liu, et al.

2016 IEEE International Conference on Big Data (Big Data) > 522 - 531

This paper presents a novel multi-task learning framework for the accurate prediction of spatio-temporal data at multiple locations. The framework encodes the data as a third-order tensor and performs supervised tensor decomposition to identify the latent factors that capture the inherent spatiotemporal variabilities of the data and their relationship to the target variable of interest. The framework...

chapter

One Method of Cloth Simulation Based on Adaptive Meshes

Huijian Han, Luyu Wang, Kai Liu

2016 12th International Conference on Computational Intelligence and Security (CIS) > 483 - 486

In this paper the adaptive mesh model is studied. First, we introduce the 3 subdivision method, which already exists. Second, in order to make the simulation result more realistic and reasonable, we extend the traditional scheme and apply two different subdivision schemes on the triangular mesh. Finally, a new mesh coarsening method is proposed; with the help of this method, we build an extended adaptive...

chapter

Predicting COPD Failure by Modeling Hazard in Longitudinal Clinical Data

Jianfei Zhang, Shengrui Wang, Josiane Courteau, Lifei Chen, et al.

2016 IEEE 16th International Conference on Data Mining (ICDM) > 639 - 648

Chronic obstructive pulmonary disease (COPD) accounts for the highest rate of hospital readmissions and is the third leading cause of death in Canada, the United States and worldwide. Predicting COPD failure provides a prognostic warning of death or readmission, and is crucial to early intervention and decision-making. The aim of this study is to perform COPD failure prediction on longitudinal data...

chapter

Multidimensional Sparse Array Storage for Data Analytics

E. J. Otoo, Hairong Wang, Gideon Nimako

2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS) > 1520 - 1529

A relational table over a set of attributes can be mapped onto a multi-dimensional array and stored as such. Such a conceptual view of relations lends itself to easy formulations of numerous analytical algorithms. This is the view taken in the representation of relations in data-warehousing to support On-Line Analytical Processing (OLAP). The main drawback of such a storage scheme is that the equivalent...

chapter

Correlation of hourly diffuse fraction of global horizontal solar radiation in Tamanrasset, Algeria

Madjid Chikh, Mourad Haddadi, Achour Mahrane, Ali Malek

2016 International Renewable and Sustainable Energy Conference (IRSEC) > 663 - 668

In order to evaluate the energy production of a solar system, the tilted global radiation is needed. Generally, only the global horizontal radiation data are available. To calculate a tilted global radiation, it is necessary to estimate the diffuse or the direct component of the horizontal solar radiation. In this article, a statistical procedure has been employed to develop correlations between the...

chapter

The use of data mining techniques to predict the ranking of E-government services

Nayla Salem Alkhatri, Nazar Zaki, Elfadil Mohammed, Musa Shallal

2016 12th International Conference on Innovations in Information Technology (IIT) > 1 - 6

The usage and improvement of information and communication technologies to enhance public sector services (e-Government) was recognized as an important task for the majority of governments in developed countries. Several countries are working hard to improve their e-Government ranking to support their sustainable development. This study employed several data mining techniques to build models that...

chapter

A latent variable clustering method for wireless sensor networks

Vladislav Vasilev, Georgi Iliev, Vladimir Poulkov, Albena Mihovska

2016 50th Asilomar Conference on Signals, Systems and Computers > 1400 - 1405

In this paper we derive a clustering method based on the Hidden Conditional Random Field (HCRF) model in order to maximize the performance of a wireless sensor. The novelty of our approach to clustering lies in the application of an index-invariant graph that we defined in a previous work and that precisely links a hyper-tree structure to the data set assumptions. We show that a set of conditional...

chapter

Extending a Message Passing Runtime to Support Partitioned, Global Logical Address Spaces

D. Brian Larkins, James Dinan

2016 First International Workshop on Communication Optimizations in HPC (COMHPC) > 11 - 16

Partitioned Global Address Space (PGAS) parallel programming models can provide an efficient mechanism for managing shared data stored across multiple nodes in a distributed memory system. However, these models are traditionally directly addressed and, for applications with loosely-structured or sparse data, determining the location of a given data element within a PGAS can incur significant overheads...

chapter

Finding Rising Stars in Heterogeneous Social Networks

Pivithuru Wijegunawardana, Kishan Mehrotra, Chilukuri Mohan

2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI) > 614 - 618

A rising star is an individual who shows the potential to become a star in the near future. We investigate the problem of finding rising stars when heterogeneous data sources are available to define the same person. The proposed solution examines multiple data sources to determine how the importance of an individual improves over time. Scores from different data sources are combined using a multi-objective...

INFONA - science communication portal
