The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
The following topics are dealt with: data process; knowledge visualization; data integration, interoperability, security, privacy; data mining and information extraction; data modelling and architectures; data quality, reliability, robustness; data streaming, parallel and distributed data mining; database design, evaluation, query and optimization; XML, Web Services, Ontologies; and social and mathematical...
Social network allows users to organize collections of resources on the web in a collaborative fashion. Collaborative filtering as a classical method has been also used in helping people to deal with information overload in folksonomy system. The problem of devising methods to solve the contextual problems emerging in the process of recommendation application over the social network is increasing...
Introducing the granular computing and rough set and carrying two basic attribute reduction arithmetic. In this paper, by anglicizing and contrasting the attribute reduction arithmetic in rough set model of granular computing, proposing the improved arithmetic for agroclimatology. At a work it takes core attributes by constructing discernable matrix, and then uses them for heuristic information to...
A computer vision technique has been applied to analyze and research road surface meteorology. The original color data in HIS and RGB models has constituted feature vectors. Robust technique has been used to remove outliers before image process. And a BP neural network has been employed to identify the images collected from road surface in four kinds of states (namely, covered by dry asphalt, water,...
In this paper we present an analysis of the importance of cost sensitive methodology in medical disease prediction. And a C5.0 based cost sensitive ensemble approach is presented to support improving the effectiveness and decreasing the misdiagnosis cost in the same time. The verification is conducted on the real cases from Changhai Hospital Renal Registered (CHRR) data which consists of 483 records,...
Based on the unascertained theory, the paper elaborated exhaustively the modeling processes of the unascertained measure model. Combined with the unascertained measure model, the safety factors affecting coal seam-roof stability are analyzed and evaluated. According to the reality of Bofang coal seam-roof stability, we can collect the measured data, determine the safety evaluation index, calculate...
In this paper the traffic safety warning technologies and methods are studied deeply based on the macro-forecast and micro-forecast methods. The macro-forecast will realize the traffic safety warning based on the macro-data such as the number of traffic accidents, the death toll and the number of motor vehicles owned and so on. And the micro-forecast is based on the micro-data such as traffic flow...
In the past few years, the volume of junk information on the Internet has grown tremendously, researchers' begun to handle this issue. In this paper, DCM (discriminative category matching) algorithm is employed to filter the Web information according to the content of information. To our knowledge, the algorithm is the first introduced into filtering. It takes the relative importance of a feature...
Web page de-duplication module is an important part of search engine system, which can improve its performance and quality with filtering the Web pages downloaded by crawler system of search engine and eliminating the duplicated Web pages. This paper from the source of duplicated Web pages - reshipment proposes a Web page de-duplication method that the information including original Web sites and...
Vehicles report-stop fraud is an important factor of highway fees loss, which is more than the proportion of 20% of the total loss in statistics. The paper uses data mining and data warehouse technology to resolve the deficiencies of traditional data management information system on the analysis of vehicles report-stop fraud. The significance of the issue and mining goal were described, and the correlational...
There are several methods to estimate fractal dimension in fractional Brownian motion model. In this paper, one of methods, the variance method, is analyzed detailedly, and some problems in the method are pointed out. To resolve the problems, an improved method is proposed. To validate the method, a comparing experiment of two methods is designed. In the experiment, a SPOT image is selected and five...
The diagnosis method of rubbing fault in rotary machinery was investigated by support vector machine combined with wavelet transform. The rubbing fault of a rotary machine was simulated with a rubbing-block. The decomposed signals at every level were continuous for the case without rubbing fault, while the decomposed signals were bursting signals at level 1, level 2 and level 3, and continuous signals...
In this work, we studied how to rapidly match remote sensing images by the semantic information of geographical objects in Grid architecture and how to slice, index, and assemble each tiled image in every grid node. We first designed the grid architecture of remote sensing images sharing and gave a new idea that searching corresponding images by semantic information of geographical objects. To each...
With the integration of global economy, import anti-dumping cases soared, particularly gathered in the chemical industry. In anti-dumping investigations of western enterprises, lower adjustment of earnings in the report period is very common to pursue political protection. In order to verify whether there was downward earnings management in anti-dumping investigations in China's chemical enterprises,...
Ajax is an important approach for improving rich interactivity between Web server and end users during Web 2.0 eras. At the same time, the structured data in AJAX Web pages can not be extracted easily due to its asynchronous loading. In this paper, we propose a technique for extracting the structured data from the AJAX based Web pages. Firstly, an AjaxFetcher component is created to fetch the dynamic...
This paper analyses the causes that give rise to the risk of credit fraud and presents an anomaly detection method by using an outlier detection model based on similar coefficient sum. It finds fraud record by computing similar coefficient sum of every two objects and an example is given to validate the model. The result show the feasibility and validity of the method. The research work furnishes...
For a classification problem, noise in real-world data can dramatically lower the predictive accuracy of a learner and increase the time in building model. Researchers have proved that preprocessing noise before learning can bring more advantages. Previous work mostly focus on class noise detection for the difficulties of attribute noise detection. In this paper, we present a cluster based noise detection...
In many researches, one of major challenges is to archive, access, and analyze various heterogeneous databases containing useful information gathered from a large volume of data. Metadata plays a critical role in database integration, so metadata model is put forward which comprises information of data content. Discussion on information integration solves the problem with manipulating the metadata...
This paper analyzes the potential causes of the performance bottleneck in I/O access paths of storage architecture and proposes a predictive approach based on feedforward to optimize the I/O performance of storage subsystems effectively, which uses a time series analysis method based on ARIMA to build the predictive and monitor model of the performance. This approach can improve the availability of...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.