Different websites generally have different page structures, which heavily affects extraction quality when web content is collected automatically. Based on a statistical analysis of the content features and structural characteristics of news-domain web pages, this paper proposes a maximum continuous sum of text density (MCSTD) method to extract web content efficiently and effectively...
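The abstract is truncated and does not give the details of MCSTD, so the following is only a minimal sketch of the general idea the name suggests: score each block of a page by text density, then pick the contiguous run of blocks whose summed score is largest. The per-block densities, the `threshold` offset, and the function names `max_continuous_sum` and `extract_main_block` are all assumptions made for illustration, not the paper's definitions.

```python
def max_continuous_sum(scores):
    """Return (start, end, total) for the contiguous run with the largest sum.

    `end` is inclusive. This is the standard maximum-subarray scan,
    used here as a stand-in for the paper's method (an assumption).
    """
    if not scores:
        raise ValueError("scores must not be empty")
    best_sum = cur_sum = scores[0]
    best_start = best_end = cur_start = 0
    for i in range(1, len(scores)):
        if cur_sum < 0:
            # A negative running sum can only hurt, so restart the run here.
            cur_sum = scores[i]
            cur_start = i
        else:
            cur_sum += scores[i]
        if cur_sum > best_sum:
            best_sum = cur_sum
            best_start, best_end = cur_start, i
    return best_start, best_end, best_sum


def extract_main_block(densities, threshold):
    """Return (start, end) of the block range most likely to hold the main text.

    `densities` holds one hypothetical text-density score per page block.
    Subtracting `threshold` makes low-density blocks (menus, footers)
    negative so that they are penalised inside a run.
    """
    shifted = [d - threshold for d in densities]
    start, end, _ = max_continuous_sum(shifted)
    return start, end


# Hypothetical densities: navigation, article body, footer.
densities = [1, 2, 30, 45, 5, 40, 3, 1]
print(extract_main_block(densities, threshold=10))
```

With these made-up scores the selected range spans the high-density blocks in the middle and tolerates the single low-density block between them, which is the behaviour one would want for a short caption or byline inside an article body.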
Deep Web databases contain huge amounts of data that can only be accessed through front-end Web form queries. Since query clients cannot issue SPARQL queries directly against Deep Web databases, Semantic Web applications cannot take advantage of this information resource. Recently, many RDB2RDF mapping tools, relying on mapping languages such as W3C's R2RML, provide the ability to view existing...
Parallel corpora are indispensable resources for a variety of multilingual natural language processing tasks. This paper describes a system that automatically mines parallel corpora from web pages. It attempts to overcome the shortage of parallel corpora for minority languages. Drawing on existing techniques for mining bilingual corpora from the web, and combining them with the characteristics of minority languages...
E-government is an innovation in, and a change to, administrative activities that have been practiced for hundreds of years. This paper proposes an e-government Web mining method based on the Semantic Web. It first describes the design and implementation of the Web spider, a tool for collecting network data, including the basic structure, work processes, and procedures of the Web spider, the storage structure and form...