The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
The World Wide Web has emerged itself to be a huge repository of knowledge. Many websites provide lot of information regarding a topic of interest. In this paper this feature of WWW is made use for the concept of a dynamic encyclopedia. Apart from traditional web search and retrieval this paper deals with the construction of a web encyclopedia page by making use of relevant information from various...
Existing work in the semantic relatedness literature has already considered various information sources such as WordNet, Wikipedia and Web search engines to identify the semantic relatedness between two words. We will show that existing semantic relatedness measures might not be directly applicable to microblogging content such as tweets due to i) the informality and short length of microblogging...
We present a method for the automatic classification of text documents into a dynamically defined set of topics of interest. The proposed approach requires only a domain ontology and a set of user-defined classification topics, specified as contexts in the ontology. Our method is based on measuring the semantic similarity of the thematic graph created from a text document and the ontology sub-graphs...
With the development of ICT the need for automated question-answering systems is becoming increasingly important. Question-answering systems are still under development and experimentation. This paper is an overview of the research area that deals with question-answering systems; it explains the concept of question-answering systems and points out the problems that occur during their development....
XML Information Retrieval is approach to identify the appropriate answer granularity and controlling to elements overlap. Recently, the demand for integrating Full Text Search and relational search has increased dramatically. The RDBMS implementation is generally much worse in the performance than the IR engine implementation. Especially, when a query is processed in the RDBMS, the number of join...
Social bookmarking tools are rapidly emerging on the Web as it can be witnessed by the overwhelming number of participants. In such spaces, users annotate resources by means of any keyword or tag that they find relevant, giving raise to lightweight conceptual structures aka folksonomies. In this respect, needless to mention that ontologies can be of benefit for enhancing information retrieval metrics...
We investigate the automatic generation of topic pages as an alternative to the current Web search paradigm. Topic pages explicitly aggregate information across documents, filter redundancy, and promote diversity of topical aspects. We propose a novel framework for building rich topical aspect models and selecting diverse information from the Web. In particular, we use Web search logs to build aspect...
The origin of a music artist or a band is an important kind of musical meta-data as it usually influences his/her/its music. In this paper, we propose three approaches to automatically determine the country of origin of a person or institution, which we apply to music artists and bands. The first approach investigates estimates of page counts returned for specific queries to Web search engines. The...
For question answering, the multi-source approach is justifiable especially when different sources provide different types of knowledge. In this paper, a variety of question and answer types are revealed. The key point this paper addresses under the framework of extensible QA is efficient and consonant usage of a number of distinct QA techniques for improving the answer confidence. To prove the extensibility...
In this paper an approach based on Wikipedia link structure for sense disambiguation is presented and evaluated. Wikipedia is used as a reference to obtain lexicographic relationships and in combination with statistical information extraction it is possible to deduce concepts related to the terms extracted from a corpus. In addition, since the corpus covers a representation of a part of the real world...
In this paper we present two adaptations of the PageRank algorithm to collections of XML documents and the experimental results obtained for the Wikipedia collection used at INEX-1 2007. These adaptations to which we referred as ldquoDOCRANK and TOPICAL_docrankrdquo allow the re-rank of the results returned by the base run execution to improve retrieval quality. Our experiments are performed on the...
The paper describes the development and usage of a grammar developed to extract definitions from documents. One of the most important practical usages of the developed grammar is the automatic extraction of definitions from web documents. Three evaluation scenarios were run, the results of these experiments being the main focus of the paper. One scenario uses an e-learning context and previously annotated...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.