Document indexation is an essential task performed by archivists or automatic indexing tools. To retrieve documents relevant to a query, the keywords describing each document have to be carefully chosen. Archivists have to find out the right topic of a document before starting to extract the keywords. For an archivist
Keyword extraction is an important application in the area of information technology. Automatic keyword extraction can help people learn what an article is primarily about without reading the whole passage carefully. This paper introduces a keyword extraction algorithm using PageRank on Synonym. Firstly
improve the performance of an English-Hindi CLIR system using English and Hindi WordNet, local expansion using the initial query, definition-based pre-query expansion and keyword ranking. The pre- and post-query expansion help to improve the performance of the English-Hindi CLIR system, and based upon past experiences the proposed
semantic net which can be applied to build a personalized search engine, and tested with single and multiple query keywords under three different calculation policies. The test results show that it can affect the ordering of pages. The personalized search based on the vocabulary semantic net greatly improves the quality of search results.
The Web forum is a key tool for building new knowledge among students in learning management systems. Unfortunately, the huge number of messages makes it difficult for tutors and teachers to quickly evaluate the progress of their students, so automated support for the analysis is needed. Our solution relies on simple statistical indices inspired by work in the text analysis field. The obtained...
The collection of concepts discussed in a document set can be represented by a geometric structure from combinatorial topology called a simplicial complex. A simplex is a set of high-frequency keywords that co-occur closely and which, we believe, carries a concept in the document set. The collection of all these
automatic transcription of a spoken document using a speech recognizer. The difficulty of this task is that the automatic transcription contains many recognition errors, so we cannot trust keywords extracted from it using a conventional method such as tf·idf. To solve this problem, we
in both Thai and English is built to help users deal with the many keywords for the same term and (3) a set of keywords from herbal usages can be combined with the name keyword. The results show that the information collected from KUIHerb is useful for searching.
search the Web effectively. In this paper, we present a QS module, denoted CQS, which assists children in finding appropriate query keywords to capture their information needs by (i) analyzing content written for/by children, (ii) examining phrases and other metadata extracted from reputable (children's) websites, and (iii
Social bookmarking tools are rapidly emerging on the Web, as can be witnessed by the overwhelming number of participants. In such spaces, users annotate resources by means of any keyword or tag that they find relevant, giving rise to lightweight conceptual structures, a.k.a. folksonomies. In this respect, needless to
Nowadays, Internet users are familiar with the Web searching process, and searching is the most common task performed on the Web. However, Web search is especially difficult for beginners when they try to use a keyword query language. Consequently, beginners usually try to find information with ambiguous
questionnaire-based survey was conducted with 40 Cypriot citizens divided into two age groups, who were asked to annotate an image dataset using a vocabulary of 52 keywords. Our results indicate that there are age differences in the way people annotate images, while the gender differences are smaller than we assumed
based on ontology. It uses the rich semantic knowledge of the ontology to upgrade retrieval from keywords to concepts, and combines it with the specialized engine to improve retrieval effectiveness and efficiency. At the end, the paper takes patent information as an example to explain its application.
Web page recommendation model traces users' Web-surfing trails, extracts the useful information including keywords, Web page URLs and users' evaluations on Web pages, and automatically generates FCA (formal concept analysis) knowledge base and enterprise ontology knowledge base with WordNet. While users are
The Web has grown into a huge mass of information resources and is diverse in content. To search such a rich source of information, one has to be very precise in using keywords in queries to retrieve the relevant documents. Most of the queries issued to search engines are short and have ambiguous context. One way to produce
keywords of different languages are also revealed. We conducted experiments on a set of Chinese-English bilingual parallel corpora to discover the relationships between documents of these languages.