Multidimensional arrays are commonly used in scientific and engineering applications. The disk layout of multidimensional arrays directly affects the performance of data querying. Homogeneous replica methods are widely used to maintain data reliability in most distributed storage systems and to improve data locality in some parallel processing systems. In this paper, we...
Efficient top-k query processing in highly distributed environments is useful but challenging. This paper focuses on the problem over vertically partitioned data and aims to propose efficient algorithms with lower communication cost. Two new algorithms, DBPA and BulkDBPA, are proposed in this paper. DBPA is a direct extension of the centralized algorithm BPA2 into distributed environments. Absorbing...
Data integration systems often suffer from performance bottlenecks due to the network overhead incurred by frequent and large-scale data retrievals. In this paper, we propose a query reconstruction mechanism in source wrappers that exploits data sharing across multiple data retrievals and hence optimizes query execution performance. We propose a derived-based query reconstructing method...
Data is becoming increasingly important in current computing environments and enterprise applications. In the grid, many systems and middleware aim to hide the heterogeneity of data resources and to provide a unified way to access them. However, few systems focus on data integration, which means combining structured and unstructured data residing at different data resources and...
Since query processing in data integration needs to access data from numerous widely distributed sources over the network, it is crucial to investigate how to deal with the expensive communication overhead. This paper introduces a staged data integration model for grid environments. It takes advantage of the abundant computer nodes to process integrated queries over a number of highly-distributed and...
This paper considers the impact of clustering demands on the download performance of data grids. The performance metrics are hit ratio and average access latency. For a replication strategy, we build a mechanism based on majorization theory to compare system performance for two clustering patterns. We employ the proportional replication strategy as an example to illustrate the effectiveness of this mechanism...
We have designed and implemented the Zetta data storage system (ZettaDS), a lightweight, scalable distributed data storage system for clusters. While sharing many characteristics with some modern distributed data storage systems, such as a single meta server architecture and running on inexpensive commodity components, our system is very lightweight and aims to handle lots of small files efficiently...