Much of the previous work in Big Data has focussed on numerical sources of information. However, with the ‘narrative turn’ in many disciplines gathering pace and commercial organisations beginning to realise the value of their textual assets, natural language data is fast catching up as an exploitable source of information for decision making. With vast quantities of unstructured textual data on the...
Ample scientific research has confirmed significant linguistic differences between truthful and deceptive discourse in both laboratory and field experiments. The current investigation focused on whether indicators of truth or deception are context-independent or are influenced by two context factors: motivation and modality. A 2 (veracity: truthful/deceptive) by 2 (incentives: high/low) by 3 (modality:...
This paper discusses the behavior of ‘kaa’ and suggests the selection of Part of Speech (POS) on the basis of linguistic evidence. It also suggests some tests that can be used for correct classification of ‘kaa’. The selection of correct POS is important for computational processing, including parsing, generation, and identification of grammatical relations.
Natural language is the preferred form for writing use cases. While a few linguistic techniques exist that extract or validate structured information from unstructured natural language use cases, they often cannot be extended beyond their primary language. Extending linguistic analysis and automated validation capabilities across multiple languages is necessary not only for widespread industrial adoption...
Concordancing is a technique that analyzes text corpora to show how any given word or phrase is used in the immediate contexts in which it appears. The main focus of this technique consists in discovering patterns and rules of authentic language use through analysis of actual usage, and generating theories of what does not account for the probable choices that speakers actually make. In...
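The idea of showing a word in its immediate contexts can be illustrated with a minimal keyword-in-context sketch. This is not the method of the paper summarized above; the function name `kwic`, the `width` parameter, the simple regex tokenizer, and the sample sentence are all assumptions made for illustration.

```python
import re


def kwic(text, keyword, width=3):
    """Return (left context, keyword, right context) for each hit.

    Hypothetical helper: tokenizes on word characters, lowercases
    everything, and takes up to `width` tokens on each side.
    """
    tokens = re.findall(r"\w+", text.lower())
    key = keyword.lower()
    rows = []
    for i, tok in enumerate(tokens):
        if tok == key:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            rows.append((left, tok, right))
    return rows


# Illustrative sample text, not taken from any corpus.
sample = "The cat sat on the mat. The dog chased the cat across the yard."

for left, word, right in kwic(sample, "cat", width=2):
    # Right-align the left context so the keyword lines up in a column.
    print(f"{left:>12} | {word} | {right}")
```

Aligning the keyword in a central column is what lets a reader scan for recurring patterns to its left and right; a real concordancer would add proper tokenization, sorting of contexts, and phrase queries.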
An intelligent machine will require the ability to converse on any subject, including philosophy. This paper describes how the recent discovery of the four Semantic Categories has made it possible for a machine to analyse any assertion, however convoluted, by means of Semantic Category Analysis: a procedure which provides a basis for future Man/Machine dialogue. Analytical examples discussed include...
In question answering systems, question taxonomy is commonly used as the representation of user information needs. This paper proposes CogQTaxo, a framework of three-dimensional question taxonomy. The dimensions represent the surface information need, the implicit information needs and the pragmatic expectations respectively. The employed linguistic classification criteria follow the cognitive process...
In this paper we present Double Tree, a new visualization of Key Word In Context (KWIC) displays targeted to support linguistic analysis. Inspired by Wattenberg's and Viégas' [1] Word Tree visualization, Double Tree extends the idea of representing KWIC results as trees. We address several issues with Word Trees with respect to the specific demands of linguists and discuss the design decisions and...
Toponym Disambiguation (TD) is a crucial technique in Geographic Information Retrieval (GIR) systems: it directly affects the quality of the subsequent assignment of geographic focus to a document, the construction of the spatial index, and the effectiveness of the retrieval model as a whole. We explore the mechanism by which human beings deal with the problem of TD. Human's...
For the majority of college students in Taiwan, learning and using the terminology of a specific domain interchangeably in Chinese and English is quite a challenge. Most of the students seek assistance from library resources or search for answers on the web. Unfortunately, the students are not able to verify the correctness of their findings or, worse, the students cannot choose the right...