The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Like search engines, recommender systems have become a tool that cannot be ignored by websites with a large selection of products, music, news or simply webpages links. The performance of this kind of system depends on a large amount of information. At the same time, the amount of information on the Web is continuously growing, especially due to increased User Generated Content since the apparition...
Internet is a huge source of information. Search engines have indexed much of this information and are able to extract the relevant webpages that are related to a given query. However, once the search engine retrieves a set of webpages, the user has to read the webpages in order to find the relevant information. This is a time consuming task because webpages often mix information related to different...
Web page de-duplication module is an important part of search engine system, which can improve its performance and quality with filtering the Web pages downloaded by crawler system of search engine and eliminating the duplicated Web pages. This paper from the source of duplicated Web pages - reshipment proposes a Web page de-duplication method that the information including original Web sites and...
This paper analyzes the information characteristic under the Web2.0 environment from several aspects such as information creation, information activity, information communication and information users. Thus, we put forward that the information filtering is worth paying attention topic under the Web2.0 environment, introduce concept of the information filtering and classification, and discuss the detailed...
To enhance the efficiency of surfing the Internet, an integrated platform for real estate agency is established based on service-oriented architecture (SOA). It employs the concept of search engine and loads the information provided by each real estate agency to this platform. Due to various types of data structure from each real estate agency website, a unified format must be defined for integrating...
The rapid development of blogs has brought on some serious problems such as disclosure of sensitive information, spread of unhealthy information, etc. So it is very important for supervisors to detect them. The common methods based on search engines have some drawbacks such as lower efficiency and lower precision because they need to retrieve and update blog pages frequently, and to analyze all blog...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.