The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
2017 the 2nd IEEE International Conference on Cloud Computing and Big Data Analysis (ICCCBDA 2017) is a comprehensive conference which focuses on cloud computing and big data analysis. The main goal of the conference is to address and deliberate on the latest technical status and recent trends in the research and applications of cloud computing and big data analysis.
Along with the explosive growth of the information in Social Network Service, the research of the quality of data has become a new hot point in related research field. High quality social data can more effectively support data mining, knowledge discovery, and can provide reliable and efficient data for users. Based on the measure problems of data quality, this paper discussed the measurement of two...
Current providers of the cloud storage service often ensure the data confidentiality by encrypting the file content and guarantee the data integrity by verifying the hash value of the file. However, when the cloud storage service fails, the availability of the user data cannot be guaranteed and nor can the cloud sharing function of the user data be supported. In addition, users have to give the provider...
the advent of online social networks has been one of the most exciting events in this decade. Many popular online social networks such as Twitter, Wechat, Weibo, LinkedIn, and Facebook have become increasingly popular. The consequences of the poor quality of data in a social network are often experienced in everyday life. This paper gives a domain ontology model, SNSQ Ontology, for data quality in...
The advancements of Internet accelerate the intelligent process of data integration in recent years. When fusing a volume of records about the same real-world entity into a single, consistent and clean representation, appropriate conflicts conciliation becomes essential for participants. This paper advocates a strategic framework for perceiving and resolving data inconsistencies via employing truth...
This paper summarizes the characteristics of the electricity load data collected every 15 minutes of power users. The single-day load data of single power user is taken as a 96-length vector, and the single-day data of n users is taken as a 96-column set. Single-user monthly data and annual data are described as a 30 (31) × 96 matrix and a 365 (366) × 96 matrix respectively. By comparing the wavelet...
Satellite remote sensing technology can extract disaster information rapidly and accurately for disaster monitoring on a regional or national basis. However, various sensors are generating huge volumes of remote sensing data for disaster management. It is urgent to handle such massive remote sensing images. In this paper, it provides the solutions for massive remote sensing data analysis and rapid...
In view of the current problems of miscellaneous data channel, huge-scale information and rough evaluation method in power grid development diagnosis analysis, a study of power grid development diagnosis system is conducted based on multi-source data analysis. With the interface to product management system (PMS), energy management system (EMS), distribution automation system (DAS), electric information...
To overcome the drawbacks of traditional convex evidence, in this paper we proposed a modified convex evidence theory model, we presented the modified combination function and use it to combine mass function of ordered propositions, we present the calculation of the parameters of the proposed combination function, and proposed a more accurate method to find the proposition which is most likely true...
As an open source implementation of GFS, Hadoop Distributed File System (HDFS) has high efficiency on handling the large files. However, due to its own master-slave structure and the storage of metadata, the efficiency is low when dealing with massive small files. It occupies large amount of NameNode memory, reduces access efficiency, and delays concurrent user access. In order to improve this performance...
Medical data are extensively used in the diagnosis of human health. So it has played a vital role for physicians as well as in medical engineering. Accordingly, many types of research are going on related to this to have a better prediction of the diseases or to improve the diagnosis quality. However, most of the researchers work on either dimensionality space or imbalanced data. Due to this, sometimes...
Database management systems have been indispensable to enterprises for decades. As the amount of data dramatically increased, database aggregation has encountered a dilemma between privacy and performance. In traditional database aggregation, all attributes have been encrypted to protect the privacy of data. However, in big data, this privacy measure is no longer feasible because cryptography will...
Recently, numerous NoSQL (Not Only SQL) data-store systems have been developed, which often involve Big Data processing. Importantly, there are no common standard APIs for accessing the different NoSQL systems. In order to solve the problem, many scholars have used different techniques to build SQL access layer for different NoSQL databases. Meanwhile another problem related to this topic needs to...
Nowadays, terrorism has evolved into such a destructive threat to the whole world that it is calling for an increasing devotion of professional researches and explorations. Machine learning, as a powerful weapon to unveil the hidden knowledge, has been successfully applied into the anti-terrorism field. The aim of this paper is as follows: by implementing anomaly detection algorithm into a famous...
Near-real-time data warehousing is an emerging area of research in order to meet the high and up-to-date demands of business organizations. This means customers transactions executed at data source level need to reflect into the data warehouse immediately that requires semi-stream join between a stream of customer transactions and disk-based master data. For this purpose a well-known algorithm called...
With the rapid development of information technology, the concept of big data is used in information collection on different things, especially for the text classification. This paper propose an improved KNN algorithm based on clustering for the automatic classification of Web text. In addition, we find a new method to find out which text in the same category belongs to the same cluster. Finally,...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.