This work presents the design and development of a web-based system that supports cross-language similarity analysis and plagiarism detection. A suspicious document dq in a language Lq is submitted to the system via a PHP web-based interface. The system accepts the text either as an uploaded file or pasted directly into a text area. In order to lighten large texts and provide an ideal set...
We argue that pronominal anaphora understanding must rely on the recovery of argument structure asymmetries in conjunction with principles restricting the set of possible antecedents for pronouns. We provide empirical evidence of the need for deep parsing that recovers arguments, both overt and covert, that can be possible antecedents for pronouns. We identify several limits of systems that do not rely...
This paper focuses on certain disconnected oppositions between Brazilian Portuguese (BP) and European Portuguese (EP). Assuming these oppositions result from the absence of I-to-C movement in BP, our goal is to discover what property is implied in the process and how we can derive such phenomena through the computational system.
In this paper, we present an approach to automatically extract and classify opinions in texts. We propose a similarity measure that calculates semantic distances between a word and predefined subgroups of seed words. We evaluated our algorithm on the corpus of the semantic evaluation campaign “SemEval 2007”, and obtained best Precision and F1 values of 62% and 61%, respectively. As an improvement of 20%...
The extraction of temporal information from text documents is becoming increasingly important in many applications such as natural language processing, information retrieval, and question answering. Indeed, the temporal dimension plays a key role in most of these systems, promoting better performance. Our goal is the definition of a temporal document representation, incorporating the time dimension...
Word alignment is an important supporting task for different NLP applications such as training of machine translation systems, translation lexicon induction, word sense discovery, word sense disambiguation, information extraction, and the cross-lingual projection of linguistic information. In this paper we study the main rules and guidelines required to build an aligner tool for the Arabic language which...
This paper presents a computational model of language generation, based on Phase Theory, that automatically constructs sentences from underlying numerations. This model incorporates explicit algorithms that determine selection and merger of Lexical Items from a subnumeration, determine the labels of Merged syntactic elements, account for movement of elements within a derivation, and account for when...
Pattern recognition is a very challenging multidisciplinary research area attracting researchers and practitioners. Gesture recognition is a specialized pattern recognition task with the goal of interpreting human gestures via mathematical models. One use of gesture recognition is sign language recognition, sign language being the basic communication method among deaf people. Since there is a lack...