The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
The explosion of data over the past twenty years has fostered a huge amount of research in processing semi-structured documents like HTML and XML documents on Web. Nevertheless, the explosion of semi-structured documents that originate from outside the Web domain is more challenging. The data of semi-structured documents are everywhere: in scientific research reports, official journals, electronic...
Home energy management system is an extension of the smart grid in residential sector. It is a hot topic in smart grid. This paper proposed a home energy management strategy. It uses data mining techniques to obtain useful information when analyzing the load characteristics. The home energy management system is composed of home gateways, interactive terminals, smart socket and smart appliances with...
With the development of society, Gait recognition, as an emerging biometric identification technology, has been paid more and more attention for its unique advantages of non-aggression, uniqueness, remote identification, easy to collect, and difficult to fake and hide. In this article, the gait and the recognition of the age range are combined together. A method of extracting gait features based on...
The relationship between med-long term load forecasting and socio-economic indicators is very difficultly described by an accurate mathematical model. So load forecasting needs to dig out few dominant factors from lots of socio-economic indicators. By introducing data mining technology into the association analysis of China's electricity consumption growth, many socio-economic indicators since 2000...
The problem of association rule mining is one of the most frequently studied and popular KDD tasks. Association rule mining is an important sub-branch of data mining. Based on the known current existence of association rule mining algorithms, this paper emphasizes the research work of deleting redundant association rules. It is a problem to mine quantitative association rules because the existing...
Currently, the worldwide open and distance education institutions are carrying out teaching reform measures continuously to ensure open education quality, while teaching reform courses examination data analysis results will be able to reflect open education teaching reform effectiveness objectively. So Wilcoxon Rank Sum Test based examination data analysis scheme is proposed in this paper to detect...
Although the genomics data are accumulated in an exponential growth, the molecular complexity of cancer is still hard to understand. The most remarkable characteristics of the genomic data are severely high-dimensional features with a small number of samples, such as gene expression data. The traditional data mining method has a limited ability to process these asymmetry datasets. In order to select...
The traditional relation extraction methods require the pre-defined relation types and a corpus with human tags. The information extracted by the current open relation extraction (ORE) methods is incomplete, and the relation types are finite. To solve the above problems, we propose ClausORE, which is an n-ary ORE method for Chinese text and extracts the entities and relations between entities from...
Cloud computing provides a virtual, flexible, scalable resource manage mode for Internet enterprises. As the fundamental storage architecture of cloud computing, cloud storage is proposed separately to achieve the high available, scalable storage. However, with the amount of redundant business data, more and more cloud space was occupied and more network bandwidth cost was bought in. To utilize cloud...
Nowadays, more and more people are getting engaged in the construction of the Internet, consciously or not, by posting their individual comments on it. In today's big data era, opinion mining on customer's opinions has become one of the most effective ways to roundly use the great amount of information. Opinion mining, a brand new section of unstructured information mining, is mainly related to emotional...
The network of credit reference is a typical complex network in the theoretical level and practical level. In the big data era, in order to break the privacy of the credit calculation method and promote its development, this paper proposes a big data mining algorithm on the network of credit reference, which modifies the algorithm of traditional machine learning algorithm and the algorithm of HMM...
This paper proposes a new classification method for data stream based on the combination concept drift detection and classification model. The proposed method includes a pooling mechanism, which stores classifiers corresponding to different concepts to ensure that the classification model will not do re-training when those concepts which appeared previously are present again, so as to directly sort...
In the processing of source retrieval in plagiarism detection, rationale for keywords extraction is to select only those phrases or words which maximize the chance of retrieving source documents matching the suspicious document. TF-IDF (term frequency-inverse document frequency), weighted TF-IDF (the weighted term frequency-inverse document frequency, namely, the TF-IDF of a term with a different...
With the rapid development of clustering analysis technology, there have been many application-specific clustering algorithms, such as text clustering. K-Means algorithm, as one of the classic algorithms of clustering algorithms, and a textual document clustering algorithms commonly used in the analysis process, is widely used because of its simple and low complexity. This article in view of two big...
There are three different methods for analyzing instrument directivity radiation, but the efficiencies of these different methods are never compared. In this paper we attempted to provide accurate directivity patterns of G key Bangdi by extracting simultaneous full audio data in three different methods. G key Bangdi was recorded with the musician in an anechoic room with 34 microphones distributed...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.