The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Relation discovery is a crucial task in ontology learning process. The classical approaches for relation extraction, based on statistical, syntactical or pattern matching techniques, focus typically on the taxonomic aspect. The discovery of non-taxonomic relationships is often neglected. We extend these approaches by taking into account the document structure which bears additional knowledge. This...
Ontology Learning (OL) from a text is a process that consists of text processing, knowledge extraction, and ontology construction. For Arabic language, text processing, and knowledge extraction tasks are not mature as for Latin languages. They have not been integrated into the full Arabic OL pipeline. Currently, there is very little automated support for using knowledge from Arabic literature in semantically-enabled...
Semantic relation plays an important role in knowledge acquisition research. This paper proposes a method of semantic relation acquisition and automatic synthesis based on Wikipedia. First of all, we obtain the three kinds of basic semantic relations from Wikipedia and extend the semantic of concept aiming at the problem of semantic fuzziness in the semantic relation. Then, an automatic synthesis...
The Web has made possible many advanced text-mining applications, such as news summarization, essay grading, question answering, and semantic search. For many of such applications, statistical text-mining techniques are ineffective since they do not utilize the morphological structure of the text. Thus, many approaches use NLP-based techniques, that parse the text and use patterns to mine and analyze...
There is a substantial body of work on the extraction of relations from texts, most of which is based on pattern matching or on applying tree kernel functions to syntactic structures. Whereas pattern application is usually more efficient, tree kernels can be superior when assessed by the F-measure. In this paper, we introduce a hybrid approach to extracting meronymy relations, which is based on both...
This paper presents the concept of surface text patterns for extracting purpose data from the web. In order to obtain an optimal set of patterns, we have developed a method for learning purpose patterns automatically. A corpus was downloaded from the Internet using bootstrapping by providing a few hand-crafted examples of each purpose pattern to a generic search engine. This corpus was then tagged...
Web-scale relation extraction is crucial to building the Web people search engines. Previous extraction models, such as Snowball, focus only on single type extraction, while the real applications always require as many as possible types of relation. In this paper, we propose a novel Web-scale relation extraction framework Multi-Type Snowball (MultiSnowball). MultiSnowball targets at extracting multiple...
In this paper we present a relation extraction system which uses a combination of pattern based, structure based and statistical approaches. This system uses raw texts and Wikipedia articles to learn conceptual relations. Wikipedia structures are rich source of information in relation extraction and are well used in this system. A set of patterns are extracted for Persian language and are used to...
Automatic extraction of Chinese synonyms plays an important role in information retrieval and semantic resource construction. Based on the analyzing and comparing the different technologies of synonyms extraction, this paper proposes multi-strategy method including literal similarity algorithm, pattern matching algorithm and PageRank algorithm to extraction Chinese synonyms from encyclopedia resource...
Relation annotation (RA) is a process of marking up relations among a set of entities identified from a plain text. RA is important to enterprise applications due to its capability of revealing semantics in business environments. However, RA in business environment is different from that in news domain because the entities involved in the relations in business domain often not just refer to entities...
An open domain question answering system is one of the emerging information retrieval systems available on the World Wide Web that is becoming popular day by day to get succinct and relevant answers in response of users' questions. The validation of the correctness of the answer is an important issue in the field of question answering. In this paper, we are proposing a World Wide Web based solution...
The main limits of chatbot technology are related to the building of their knowledge representation and to their rigid information retrieval and dialogue capabilities, usually based on simple "pattern matching rules". The analysis of distributional properties of words in a texts corpus allows the creation of semantic spaces where represent and compare natural language elements. This space...
With the demand of accurate and domain specific bilingual dictionaries, research in the field of automatic dictionary extraction has become popular. Due to lack of domain specific terminology in parallel corpora, extraction of bilingual terminology from Wikipedia (a corpus for knowledge extraction having a huge amount of articles, links within different languages, a dense link structure and a number...
Automatic acquiring of synonyms plays an important role in information retrieval and semantic resource development. In order to enhance the ability of the Chinese synonyms acquiring, this paper presents pattern matching approach and PageRank algorithm approach based on the Wiki repository. In pattern matching approach, we analyze the defining and explaining paragraph of the entry in the Wiki repository...
Entity Ranking (ER) is a recently emerging search task in Information Retrieval, where the goal is not finding documents matching the query words, but instead finding entities which match types and attributes mentioned in the query. In this paper we propose a formal model to define entities as well as a complete ER system, providing examples of its application to enterprise, Web, and Wikipedia scenarios...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.