As a typical user-activity-based model in search engines, the click model is currently the mainstream approach to describing and analyzing user behavior in a number of search-related applications, such as automated ranking alterations, search quality metrics, and online advertising. However, most existing work in this literature considers only click-through data and ignores other aspects...
The article describes a method for constructing a model that ranks search engine results on the Internet using inductive GMDH algorithms. The method makes it possible to substantially enhance the relevance of scientific and technical information searches on the Internet, provided that spam and commercial information are sifted out. The process of discovering the web resources ranking model is described...
We review how the state of information technology has evolved for consumers vs. enterprises. We discuss some of the key challenges in enterprise search over structured data and suggest a few promising directions for the research community.
Using disaggregated data from a Chinese search engine, we jointly model ad rank and performance for hospitality-related keyword searches. As a result of our modeling framework, we can better determine the optimal keyword bidding strategy for an advertiser given the search engine's control over ad rank. Our approach removes rank bias in estimating keyword bidding performance. We then illustrate the impact...
The Internet has been at the forefront of recent developments in the application of Information and Communication Technologies (ICT). Since people have become more interested in the Internet and electronic business (e-business), a growing number of organizations have already entered or are planning to move into e-business. In the e-business environment, search engines play an important role because...
The popularization of the Internet provides people with quicker and more direct access to information. Network search data record netizens' tens of thousands of search concerns and needs, providing the necessary data basis for research on social and economic behavior. The anonymity of search behavior meets the privacy needs of patients with suspected venereal disease. This paper will use...
Online price comparison agents (shopbots) allow consumers to instantaneously receive price and other information from many online retailers. Contrary to conventional wisdom, our empirical study of the book and computer hardware categories reveals that consumers are visiting more online retailer web sites after using shopbots. This finding suggests that after searching for an item through a shopbot...
When trying to measure the effect of irreversible treatments such as training interventions, the choice of the experimental design can be difficult. A two group cross-over experimental design cannot be used due to longitudinal effects during the course of the experimental run, which can be especially large in dynamic web search environments. A standard case/control two group design also can be problematic...
The Language Model (LM) is one of the key components in Keyword Spotting (KWS). The rapid development of the World Wide Web (WWW) makes it an extremely large and valuable data source for LM training, but it is not optimal to use raw transcripts from the WWW due to the mismatch of content between the web corpus and the test data. This paper proposes a novel two-step data selection method based...
Image annotation is a promising approach to bridging the semantic gap between low-level features and high-level concepts, and it can avoid heavy manual labor. Most existing automatic image annotation approaches are based on supervised learning. They often encounter several problems, such as insufficient training data, an inability to deal with new concepts, and a limited number of semantic...
The term Deep Web (sometimes also called Hidden Web) refers to the data content that is created dynamically as the result of a specific search on the Web. In this respect, such content resides outside web pages, and is only accessible through interaction with the web site typically via HTML forms. It is believed that the size of the Deep Web is several orders of magnitude larger than that of the so-called...
With the rapid growth of data on the Web, we are facing three serious problems. Firstly, there is a huge number of data resources, which are heterogeneous and dynamic. Secondly, most data on the Web are unstructured. Thirdly, there are various kinds of Web users who have different interests and requirements. In this paper, we propose a new system architecture for unstructured data management on the Web to...
Current mobile Web search engines essentially conduct document-level ranking and retrieval. However, structured information about real-world objects embedded in static Web pages and online databases exists in huge amounts. In this paper, we explore a new paradigm to enable Web search at the object level, extracting and integrating Web information for objects relevant to a specific application domain...
Document-level information retrieval can unfortunately lead to highly inaccurate relevance ranking in answering object-oriented queries. A paradigm is proposed to enable searching at the object level. However, this reliability assumption is no longer valid in the object retrieval context when multiple copies of information about the same object typically exist. To resolve multiple copies inconsistent...
Due to the popularity of consumer electronics, large amounts of digital data are generated every day. For such a large quantity of digital data, there is no well-designed method for data arrangement and categorization. Using search engines is the preferred way to deal with this huge amount of data. However, in this approach, the data act like "passive contents" that wait for the search engine to index...
To satisfy the personalization requirements of users with different backgrounds, goals, and time periods, personalized information services have gradually become a research hotspot in the information retrieval field. This article analyzes the personalized search engine framework model and optimizes and improves the modeling technology and the establishment of the user...
The Web has become the largest information source, covering all aspects of human life. A vast amount of information is believed to be hidden in the deep Web, which search engines and crawlers cannot access directly. To extend the physical limits of human access to information, the information fusion system is introduced. The Web is very different from the traditional database community,...
Many knowledge workers are increasingly using online resources to find the latest developments in their specialty and articles of interest. To extract relevant information from such multiple online information sources, summarization is being used. Current summarization systems produce a uniform version of a summary for all users. However, summaries which are generic in nature do not cater to the user's...
Recently, many commercial products, such as Google Trends and Yahoo! Buzz, have been released to monitor past trends in search engine query frequency. However, little research has been devoted to predicting upcoming query trends, which is of great importance in providing guidelines for future business planning. In this paper, a unified solution is presented for such a purpose. Besides the classical...
The Web has the potential to become the world's largest knowledge base. In order to unleash this potential, the wealth of information available on the Web needs to be extracted and organized. There is a need for new querying techniques that are simple and yet more expressive than those provided by standard keyword-based search engines. Searching for knowledge rather than Web pages needs to consider...