The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
The success of the search engine may be our Newtonian paradigm for the Web. It enables us to do so much information discovery that it is difficult to imagine what we cannot do with it.
We propose a novel method for mining knowledge from linked Web pages. Unlike most conventional methods for extracting knowledge from linked data, which are based on graph theory, the proposed method is based on our associated keyword space (ASKS), which is a nonlinear version of linear multidimensional scaling (MDS
Document indexation is an essential task achieved by archivists or automatic indexing tools. To retrieve relevant documents to a query, keywords describing this document have to be carefully chosen. Archivists have to find out the right topic of a document before starting to extract the keywords. For an archivist
distances in a multidimensional scaling space. In this study, we introduce an example of a 3-D multimedia space using the Associated Keyword Space (ASKS) and demonstrate similarity relationships between various sources of data in this space.
based spam topic detection strategy through keyword extraction. In particular, spam topic is detected by using the topic model of multiple features with the keywords of clues, which integrate the corresponding feature of News, BBS and Blog. We get the min cost of 0.282 through TDT4 evaluating corpus and the satisfaction of
Two keyword-extraction ways are usually used, one is simply using the information from exactly single word like word frequency and TF.IDF, the other is based on the relationship between words. The relationship is usually described as word similarity which derives from a corpus (WordNet, HowNet) or man-made thesaurus
Online advertising has now turned to be one of the major revenue sources for today's Internet companies. Among the different channels of advertising, contextual advertising takes the great part. There are already lots of studies done for the keyword extraction problem in contextual advertising for English, however
based on keyword and/or facet searches become less effective in providing access to specific sets of user generated data. To address their limitation, we propose an approach where keywords and summarization of subset of document could be automatically generated during an interactive user session to facilitate user's
The purpose of this paper is to provide a solution of extracting appropriate keywords to identify meaningful learning-contents on the Web. There are some issues in identifying documents that have learning content. Firstly, the documents need to be identified according to the learning area of a student's school year
Keyword extraction is an important application in the area of information technology. Automatic keyword extraction can help people know what is the article primarily talking about without reading the long passage carefully. This paper mainly introduced a keyword extraction algorithm using pagerank on Synonym. Firstly
Internet is becoming an increasingly important platform for ordinary life and work. It is expected that keyword extraction can help people quickly find hot spots on the web, since keywords in a document provide important information about the content of the document. In this paper, we propose to use text clustering
Search engines on the Web have popularized the keyword-based search paradigm, while searching in databases users need to know a database schema and a query language. Keyword search techniques on the Web cannot directly be applied to databases because the data on the Internet and database are in different forms. So
This paper presents a keyword extraction technique that can be used for tracking topics over time. In our work, keywords are a set of significant words in an article that gives high-level description of its contents to readers. Identifying keywords from a large amount of on-line news data is very useful in that it can
Image annotation becomes increasingly more important as the Web continues to grow. We propose a novel approach to enhancing keyword-based Web-image annotation in folksonomy, where a volunteer user is notified what kind(s) of keywords are necessary, and that keywords have been sufficiently provided by other volunteer
Currently, the automatic keywords extraction method can only extract keywords appeared in the articles and it cannot extract the implicit keyword which does not appear in the articles. It is a difficult work to extract implicit keywords in an article in the task of automatic keywords extraction. This work can also be
Analyzing users' Web log data and extracting their interests of Web-watching behaviors are important and challenging research topics of Web usage mining. Users visit their favorite sites and sometimes search new sites by performing keyword search on search engines. Users' Web-watching behaviors can be regarded as a
Security content filtering of World Wide Web is one of the important tasks among network security. The lower precision of Web mining based on keywords is a common fault, especially when those grouchy persons used active disturbing methods to cheat and bypass various filters. To filter these few but purposively or
In order to improve searching results of Web pages and enhancing Web crawling operation, the Web page clustering based on searching keywords is proposed in this paper, which firstly employed matching degree between Web pages and searching keywords to decide the sequence of showing pages of searching results. Then
This paper presents an attempt to show the efficiency of some search engines in dealing with Arabic keywords. This can be achieved by comparing the number of retrieved pages, retrieving time, and stability (in both the number of retrieved pages and the order for each retrieved page) for each one of the selected 20
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.