The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Diversifying requirements forces a data analytics platform to deal with complicated workflows including various combinations of data integration. We have developed a new workflow scheduler for our Hadoop-based data analytics platform to cope with the issue. The new scheduler enables fine-grained dependency management efficiently on a huge monolithic workflow and can improve the utilization efficiency...
The example of how to create and organize the ETL process on isolated small scale database system which is the part of the larger information system is given in this paper. We have given short description of the database, ETL process and given directions on how to expand classic ETL process with intermediate steps in order to automatize complete process and prepare it for loading into main data warehouse...
Today, airport baggage handling is far from perfect. Baggage goes on the wrong flights, is left behind, or gets lost, which costs a lot of money for the airlines, as well as frustration for the passengers. To remedy the situation, we present a data warehouse (DW) solution for storing and analyzing spatio-temporal Radio Frequency Identification (RFID) baggage tracking data. Analysis of this data can...
The research system of data mining and analysis for statistics in Qinghai Province is a manifest analysis, inquiry, and service system built up upon the existing network of the provincial statistics bureau with the increasing national, local and social concern on statistics and the rapid expansion of the statistic information and aiming at meeting the needs of governments for statistic information...
Numerous studies have generated cost estimating relationships (CERs) for transportation projects via data analysis. Some studies collected data from databases, while others sourced data from conventional paper-based formats. When cost data were not in a consistent format, many studies failed to discuss the streamlining of pattern recognition. This work adopts a standard procedure for identifying CERs...
Cloud Computing System (CCS) aims to power the next generation data centers and enables application service providers to lease data center capabilities for deploying applications depending on user Quality of Service (QoS) requirements. Huge investments and complex managements are shifted from users to providers. To improve efficiency and simplify the management, in this paper, intelligent cloud computing...
This paper suggests the structure of patent data integration and analysis based on business intelligence (BI) to help enterprises make the effective decisions about patent strategy and orientation of technological development by extracting effective information from mass data. Firstly, the patent data is acquired from heterogeneous data sources into the local database. Then, we can load the business...
Businesses of all sizes and in different industries, as well as government agencies, are finding that they can realize significant benefits by implementing a data warehouse. A data warehouse provides the base for the powerful data analysis techniques that are available today such as data mining and multidimensional analysis, as well as the more traditional query and reporting. Having an enterprise...
Based on meteorological information background and ideology of data warehouse construction, this article analyzes multiple meteorological data sources, designs the meteorological data warehouse architecture, target data model and ETL process. The system have been deployed on ORACLE-OWB platform, a better result has been achieved.
Web log files store data related to the use of a website. Analyzing these data in detail is therefore crucial for improving the user browsing experience. However, usually Web log data are stored in flat files in different formats which hinders their analysis, thus obliging to use specific Web log analysis tools. In this context, approaches for structuring Web log data to better analyze them are highly...
This paper proposes a web-based On Line Analyse Process (OLAP) structure due to the weaknesses of OLAP system within the conventional Client/Server(C/S) structure. Studied with actual examples, a multidimensional model of web-based OLAP drilling analysis system has been designed, by applying the methods of link between Office Web Components and analysis server, various OLAP analysis operation through...
Higher education in the new 21st century, require faculty and staff in a college or a university to update the educational thinking, to absorb the advanced educational concepts, and to make full use of modern information technologies for management. This article presented a new situation of science research, teaching, as well as management, through research and practice on creating a data warehouse...
In the previous work we developed the web-based OLAP (On-line Analytical Processing) integrated with the data warehouse for hotspot data in Indonesia. This work aims to develop a visualization module for hotspot clusters resulted from OLAP operations including roll up and drill down. The data warehouse consists of hotspot data represented in multidimensional model with two dimensions: time and location...
Using data cube to analysis historical fact data online more faster than Ad-Hoc queries, but it need very large external storage. In DSMS (Data Stream Management System), due to capacity of memory is much smaller than disk, we meet even more problem in analyzing stream data by in-memory StreamCube. So, we compress StreamCube to gain more information about stream data in certain storage. We implement...
A good visualization model for Data Warehouse (DW) should be semantically enriched enough so that it can express analytical contents along with the multidimensional data model artifacts to the end-user and decision makers. In this paper the OLAP Umbrella visualization model has been proposed for the purpose. Besides preserving the classical multidimensional data model artifacts and OLAP specific contents,...
A technical framework and relative key techniques to realize a report system based on Three-layer Calculating Architecture are proposed, including metadata mapping and its application, functions of the engine, ETL module, data warehouse and etc. which were designed for this report system. All the techniques studied in this paper have been practiced and have been realized in an actual project of data...
Promoting students success requires the implementation of processes and mechanisms that allows the closely monitoring of the students academic activities. Although essential, the activities involved in this complex process do not take place in many higher education institutions due to the lack of appropriate practices and an adequate technological support that sustain these practices. To overcome...
To resolve the heterogeneous distribution of data sources of financial business and provide financial decision-making with information support, based on the studies of data warehouse technology and decision support system, the author designed the support system of financial decision-making on the basis of data warehouse applying to Jiangxi Province' s finance, which includes the overall planning of...
ETL is an important foundation of building data warehouse. Users extract the required data from the data source. and after clean the data, in accordance with pre-defined model of good data warehouse, data will be loaded into the data warehouse .The article has designed and implemented a common management tool for ETL according to the principle of ETL and the project actual needs. It can not only support...
Development of information systems for the systems health management domain are typically concerned with data collection assets, such as customer data systems or on-platform data recording devices, or the need for data warehouses to integrate copies of data from these sources. We propose that there exists a significant capability gap in this area - there is a need in the health management domain to...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.