In source retrieval for plagiarism detection, the rationale for keyword extraction is to select only those words or phrases that maximize the chance of retrieving source documents matching the suspicious document. TF-IDF (term frequency-inverse document frequency), weighted TF-IDF (the weighted term frequency-inverse document frequency, namely, the TF-IDF of a term with a different...
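As an illustration of the plain TF-IDF ranking mentioned in this abstract, the following is a minimal sketch, not the authors' method. The function name `tfidf_keywords`, the smoothed IDF form, and the toy corpus are assumptions made for this example; the weighted TF-IDF variant is not shown because its definition is cut off in the abstract.

```python
import math
from collections import Counter


def tfidf_keywords(doc_tokens, corpus, top_k=3):
    """Rank the terms of one document by TF-IDF against a reference corpus.

    doc_tokens: list of tokens from the suspicious document (hypothetical input).
    corpus: list of token lists used to estimate document frequencies.
    """
    n_docs = len(corpus)

    # Document frequency: number of corpus documents that contain each term.
    df = Counter()
    for tokens in corpus:
        df.update(set(tokens))

    tf = Counter(doc_tokens)
    scores = {}
    for term, count in tf.items():
        # Smoothed IDF (an assumption of this sketch) so that terms absent
        # from the corpus do not cause a division by zero.
        idf = math.log((1 + n_docs) / (1 + df[term])) + 1
        scores[term] = (count / len(doc_tokens)) * idf

    # Highest score first; ties are broken alphabetically for determinism.
    return sorted(scores, key=lambda t: (-scores[t], t))[:top_k]


corpus = [
    ["the", "cat", "sat"],
    ["the", "dog", "ran"],
    ["the", "plagiarism", "detector"],
]
suspicious = ["the", "plagiarism", "detector", "plagiarism"]
keywords = tfidf_keywords(suspicious, corpus, top_k=2)
```

Terms that are frequent in the suspicious document but rare in the corpus rank highest, which is the property that makes them useful as search queries for candidate source documents.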
By analyzing search engine logs, we can better understand the laws of users' search behavior and mine users' personality, thereby improving the performance of web information retrieval. This paper analyzes the user, query, and clickthrough data of Sogou, a large-scale Chinese search engine. We focus on the relations among users, queries, and URLs, revealing some new characteristics of Web users. The result...
To measure the diversity of user interests for the same query, this paper applies the kappa coefficient as an indicator of the consistency of users' clicks for a given query. It compares three different settings of the kappa parameters and shows that the kappa formula can be well adapted to query log analysis. Based on further analysis of the kappa results over the Sogou query log, it is revealed...
Both classification and ranking strategies have been reported to perform well in mining named entity (NE) translations from the snippets returned by Web search engines. Taking the most challenging case, organization names and their translations, as an example, this paper conducts a contrastive study of the two strategies under the SVM framework. We empirically show that the method of translation ranking...