At present, the power system is built on top of a series of auxiliary systems, for example communication systems, monitoring systems, marketing systems and so on. All of these systems work on shared power system data, which are defined using the Common Information Model (CIM). For various reasons, errors may exist in the data, so verification technologies have been developed. So...
Latent Dirichlet Allocation (LDA) has been widely applied to text mining. LDA is a probabilistic topic model that represents documents as probability distributions over topics. One challenge in applying LDA is selecting the optimal number of topics for the model. This paper presents a topic selection method which considers the density of each topic and computes the most unstable topic...
MapReduce has become a major programming model that supports distributed and parallel processing for large-scale data-intensive applications such as web data mining, network traffic analysis, machine learning and scientific simulation. Hadoop is the most popular open-source implementation of the MapReduce programming model. In Hadoop, input files are divided into many data blocks and these blocks...
Analysis of EST data usually starts with EST clustering, the process of grouping fragments according to their original consensus long sequence. Similarity between ESTs means that parts of the sequences match each other in some way. Accurate clustering takes time quadratic in the average EST length and in the number of ESTs, and the number of ESTs in public EST databases is increasing exponentially. With...
Latent Semantic Indexing (LSI) is a widely used text mining technology due to its effectiveness in dealing with the problems of synonymy and polysemy within a proper matrix scale. However, LSI is enormously computationally intensive, especially when processing large-scale data. An effective solution is to increase the computational power available to LSI by using multiple computing nodes. In this paper...