The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
The real-time information on the Web changes dynamically and surge quickly, which cause considerable difficulty in access to interested information. How to mine hot events, how to analyze the correlation of events and how to organize information structurally are challenging tasks. In this paper, to address these problems, we propose STeller, an approach to mine context-aware story — a series of correlated...
While e-commerce has grown substantially over last several years, more and more people are utilizing this popular channel to purchase products and services. Thus the ability to predict user demographics, including gender, age and location has important applications in advertising, personalization, and recommendation. In this paper, we aim to automatically predict the users' genders based on their...
Extracting opinion words and targets is a main task in opinion mining. This paper proposes a novel approach with a dynamic process of joint propagation and refinement. In the propagation process, two initial datasets of opinion words and targets are separately obtained by given seed words and seed dependency patterns under the pre-defined extraction rules, and meanwhile new dependency patterns are...
Dynamic topic models (DTM) are of great use toanalyze the evolution of unobserved topics of a text collectionover time. Recent years have witnessed the explosive growth ofstreaming text data emerging from online media, which createsan unprecedented need for DTMs for timely event analysis. While there have been some matrix factorization methods inthe literature for dynamic topic modeling, further study...
Aspect-based opinion mining is to find elaborate opinions towards an underlying theme, perspective or viewpoint as to a subject such as a product or an event. Nowadays, with rapid growing of opinionated text on theWeb, mining aspect-level opinions has become a promising means for online public opinion analysis. In particular, the booming of various types of online media provide diverse yet complementary...
With Recommendation technology has been widely used in advertising push, e-commerce and other fields and it has shown its powerful application prospect. But with the index increasing of mobile commerce data size, the size of the recommendation system is also increased and this leads to that the traditional collaborative filtering recommendation algorithm cannot adapt to such a big data processing...
In this paper, we present CLUE, a system event analytics tool for black-box performance diagnosis in production Cloud Computing systems. CLUE provides an unified and extensible means of profiling service transactional behaviors, and builds structured data called event sketches. CLUE further offers a set of analytic tools for summarizing and analyzing event sketches by integrating data mining and statistical...
Coal enterprises informatization level falls behind other industries. In order to speed up the development of coal enterprises informatization, and to reduce management costs to maximize the profits, this paper sets up a coal sales system based on data mining as a good reference for the coal enterprises. The paper first analyses the necessity of design of such system, then defines the functional modules...
For B2B (Business to Business) E-commerce systems in cloud environment, we propose a model for mining patterns in multi-databases in this paper. It gets the global pattern by analyzing local patterns globally, and then gets the high-vote pattern and exceptional pattern. Due to the difference between single-database and multi-databases, support in single-database only has local effect. The conception...
In a data-driven Science Collaborative Framework, access authorization is a vital component to facilitate the management of the collective data and computing resources shared by researchers from geographically distributed locations. But traditional virtual organization based access control frameworks are not suitable for self-organizing, ad-hoc and opportunistic scientific collaborations, in which...
This paper studies the application of customer relationship management based on data mining technology, an overview of the basic theory of data mining knowledge, and customer relationship management theory and application of pharmaceutical companies raised customer resource management problems and solutions, practice of data mining in Customer Ratings on the main process and the general method, which...
Using data-mining technology, this paper established a new method, named the Integrated Surface Drought Index (ISDI). ISDI integrates traditional meteorological data, remotely sensed indices, and biophysical data, and attempt to describe drought from a more comprehensive perspective. The evaluation results indicated that the construction models for three phases of growth season have very high regression...
We present block-level links based content extraction (BLCE)-a method to extract content from the web pages by using the link attributes of blocks, which contains the number of links and the length of link text (anchor text).We describe how to divide one web page into blocks and how to merge the similar blocks into one, then compute the number of links and the total length of anchor text. We find...
Extraction of Land Cover Information is important to make a study of global change. Taking East China as a study area, after MODIS data of this area in the four seasons are preprocessed, spectrum analysis of typical surface features are carried out. On these basis, by using decision tree classification, selecting spectral characteristics, NDVI and classification results of the maximum likelihood method...
The techniques and tactics in net sports are the main factors of winning, while the key technology is the analysis of techniques and tactics and decision support. Using artificial intelligence, data mining and decision support technology, based on the development of multimedia and interactive data acquisition system and intelligent analysis system for the techniques and tactics in net sports, the...
With the growing scale of current computing systems, traditional configuration tuning methods become less effective because they usually assume a small number of parameters in the system. In order to handle the scalability issue of configuration tuning, this paper proposes a cooperative optimization framework, which mimics the behavior of team playing to discover the optimal configuration setting...
This paper deduces the conception of mining information model from Building Information Model. It also discusses the content and structure of mining information model, and emphasizes lifecycle management in mine construction. Furthermore, it analyzes the activities in each of phases: planning, design, construction, production and ending based-on mining information model. Finally, it illustrates that...
Open source projects often maintain open bug repositories during development and maintenance, and the reporters often point out straightly or implicitly the reasons why bugs occur when they submit them. The comments about a bug are very valuable for developers to locate and fix the bug. Meanwhile, it is very common in large software for programmers to override or overload some methods according to...
Aiming at the requirements in the process of building a SOA application system, such as dynamic service deployment, service finding and reference, the demand for adaptability to change and system scalability, this paper introduces the characteristics and drawbacks of service-oriented architecture, describes the theory about OSGi and its advantages. Through the combination of SOA with OSGi, a dynamic...
The epsilon-entropy was adopted as a measure of information in the analysis of linear Gaussian continuous time control systems under the consideration that the `virtual reproductions' of system input and output are subjected to a common high enough precision requirement. The function of system variety which describes the variation of time average information in system was defined as the difference...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.