Sentiment analysis is one of the significant issues in the areas of natural language processing, computational linguistics and text mining. It has also become a potential research area in bibliographic search and opinion mining, which is our main focus in this paper. Sentiment analysis of citations on schema-based research contents, such as scientific articles and reports, may not only make an appropriate...
Word clouds have emerged as a straightforward and visually appealing visualization method for text. They are used in various contexts as a means to provide an overview by distilling text down to those words that appear with highest frequency. Typically, this is done in a static way as pure text summarization. We think, however, that there is a larger potential to this simple yet powerful visualization...
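The frequency distillation that a static word cloud relies on can be sketched as follows. This is a minimal illustration, not the method of the paper above: the function name `top_words`, the tiny `STOPWORDS` set, and the sample sentence are all assumptions made for the example.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list; a real one would be much larger.
STOPWORDS = {"the", "that", "a", "an", "and", "of", "to", "in", "is"}

def top_words(text, n=5, stopwords=STOPWORDS):
    """Return the n most frequent non-stopword tokens with their counts."""
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(t for t in tokens if t not in stopwords)
    return counts.most_common(n)

sample = "The cloud shows words; words that repeat, repeat, repeat"
ranked = top_words(sample, n=2)
for word, count in ranked:
    # In a word cloud, the count would typically drive the font size.
    print(word, count)
```

A rendering layer would then map each count to a font size; that step is omitted here.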
In this paper we address the problem of segmenting (and summarizing) a text by Formal Concept Analysis (FCA) using Lexical Chains established for the text. The first proposed method relies on the Conceptual hierarchy (Concept lattice) derived for a formal context expressing the map of Lexical Chains to the text. The second one offers a conceptual view for a segmentation, using a conceptual clustering...
Concordancing is a technique which analyzes text corpora to show how any given word or phrase in the text is used in the immediate contexts in which it appears. The main focus of this technique consists in discovering patterns and rules of authentic language use through analysis of actual usage, and generating theories of what does not account for the probable choices that speakers actually make. In...
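A keyword-in-context lookup, the basic operation behind a concordance, can be sketched as follows. This is a minimal illustration under simplifying assumptions (lowercased, regex-based tokenization and a fixed context window); the function name `concordance`, the `width` parameter, and the sample text are hypothetical and not taken from the paper above.

```python
import re

def concordance(text, keyword, width=3):
    """Return (left context, keyword, right context) for each occurrence."""
    tokens = re.findall(r"\w+", text.lower())
    key = keyword.lower()
    hits = []
    for i, tok in enumerate(tokens):
        if tok == key:
            # Take up to `width` tokens on each side of the match.
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            hits.append((left, tok, right))
    return hits

sample = "The cat sat on the mat. The dog chased the cat around the yard."
lines = concordance(sample, "cat", width=2)
for left, kw, right in lines:
    # Right-align the left context so the keyword lines up in a column.
    print(f"{left:>20} [{kw}] {right}")
```

Aligning the keyword in a single column is what lets a reader scan for recurring patterns in the surrounding words.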
The inter-sentence semantic relation is a semantic relation that exists between adjacent sentences in the context of the discourse. Accurate recognition of inter-sentence semantic relations is of great significance to text understanding, text reasoning and text structure analysis. However, because of the impact of a number of factors, such as discourse context environment, anaphora resolution and...
In this paper we present a hybrid measure of semantic word similarity using a fuzzy inference system which combines corpus-based distance measures with gloss overlap to get the final similarity between two words. We use WordNet as a lexical dictionary to get semantic information about words. We show that this new measure reasonably correlates with human judgments and the average performance...
The development of Internet technologies makes it possible to obtain data in near real time about the financial state of companies. Moreover, tools such as XBRL have been developed to deal with the automatic generation of business reports. However, the available tools are not suitable to support the current tendency towards so-called Continuous Reporting. Here, for a specific purpose, the wealth...
Ontologies are playing an increasingly important role in knowledge management and the Semantic Web. The tourism information ontology is becoming a core research field in the realm of information retrieval. An ontology construction method based on Formal Concept Analysis (FCA) to extract domain ontology from unstructured text documents is proposed. Under the framework of our ontology construction method,...
As part of the analysis of product reviews, an unsupervised product feature categorization method is proposed. Morphemes, as the smallest meaningful linguistic units, are used instead of words to measure the intra-relationship among product features. Opinion words around product features, rather than full context information, are chosen to represent the inter-relationship among product features. The...
We present a tool that facilitates the efficient extension of morphological lexica. The tool exploits information from a morphological lexicon, a morphological grammar and a text corpus to guide the acquisition process. In particular, it employs statistical models to analyze out-of-vocabulary words and predict lexical information. These models do not require any additional labeled data for training...
Tacit knowledge in requirements documents can lead to miscommunication between software engineers and other stakeholders. One way in which the presence of tacit knowledge is signalled in text is by linguistic presuppositions. In this paper, we present a brief introduction to tacit knowledge, presuppositions and the links between them. Our aim is to build a theoretically grounded system which is able...
Many extant natural language watermarking techniques demand deep structure analysis, and so suffer in reliability. We propose a scheme for natural language watermarking which embeds watermark bits into the pragmatics feature of text by rewriting sentences. In contrast, we eschew syntactic and semantic analysis. We make use of transformation templates and our templates based on pragmatics rule...