Real-world Web mining applications usually have different requirements, such as massive data processing, low system latency, and high scalability. To meet these requirements, we proposed a distributed text mining system with a layered architecture that divides the system functions into three layers, namely, the crawling and storage layer, the basic mining layer, and the analysis...
In many real-world topic detection tasks, the detection process is interactive: users are likely to intervene in the reasoning process by expressing their preferences. We proposed an algorithm, iOLDA, and a software framework for interactive topic evolution pattern detection based on Latent Dirichlet Allocation (LDA). To abate topics that are not of interest or not related,...
The deep web integration system employs a set of semantic mappings between the mediated schema and the schemas of web data sources. In this dynamic environment, sources often undergo changes that invalidate the mappings. Such continuous monitoring is extremely labor intensive, and poses a key bottleneck to the widespread deployment of web data integration systems in practice. The paper describes DBMFR...
Conficker is a worm whose recent outbreak formed a large botnet and became a major threat to the security of the Internet. In this paper, domain name redirection was used to monitor Conficker. Because of its low kill rate and long propagation period, we built a botnet propagation model based on Conficker monitoring. In the model, we take into account the geography, connectivity...
With the rapid development of the Internet and communication technology, massive amounts of text data have accumulated on the Internet, including text from web pages, emails, instant messengers, etc. Growing data volumes, real-time data loading, and text index creation pose enormous challenges to data-loading techniques. This paper presents a real-time data loading system, text-loader...
The deep Web integration system employs a set of semantic mappings between the mediated schema and the schemas of data sources. In this dynamic distributed environment, sources often undergo changes that invalidate the mappings. Such continuous monitoring is extremely labor intensive, and poses a key bottleneck to the widespread deployment of data integration systems in practice. The paper describes...
In this paper, we advocate that routers filter bandwidth-depleting DDoS traffic. In our view, server owners who experience an attack should work with ISP routers to defend against DDoS. The main idea is to use statistical analysis of NetFlow data to allocate weighted bandwidth at the routers. We propose a new algorithm, based on a genetic algorithm, to filter traffic on routers and maximize...
Grids can integrate distributed resources on the Web, such as computing, data, storage, software, and manpower, to accomplish collaborative tasks. The service grid combines Web services with the grid, and over the next decade many businesses will be completely transformed by using grid-enabled Web services to share not only applications but also computing power efficiently. However, in the business...