The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Authors can be differentiated by their styles of writing. In this paper, we propose features which attempt to classify authors based on their writing styles. The features can be usage of parts of speech, punctuation marks, word lengths, sentence lengths, number of unique words used, etc. This concept is used in many fields like email classification, fraud detection, etc. We propose a module to extract...
Word sense disambiguation is the process of identifying existence of polysemous words in the text and disambiguating the appropriate sense satisfying the given context in Kannada language. The proposed methodology uses the synonyms of the target word and its surrounding words' gloss in combination with part of speech tagging to determine the overlap between the senses of the polysemous word or the...
Abstractive multi-document summarization aims at generating new sentences whose elements originate from different source sentence. It can be achieved via phrase selection and merging approach which aims at constructing new sentences by exploring syntactic units such as fine-grained noun and verb phrase. It can be also achieved by extracting semantic information from source sentence which uses the...
There is a tremendous growth in the number of Internet users every day. These users are spread all over the globe belonging to different community speaking different languages. India being a multilingual country has more than six crores people speaking Kannada (south Indian regional) language. There is demand for many applications to be effective to solve problems related to native languages. In this...
Polysemy word is a word which has multiple meanings. The same word can be used in different context to mean different things. Consider a paragraph of Kannada sentences, a person with no prior Kannada language knowledge cannot distinguish the difference in the meaning of a polysemy word. She/he might think the same meaning for all the occurrence of the word in the whole paragraph which is incorrect...
Facial recognition is a topic of interest for research as it has room for improvement in the accuracy of the recognition rate. To achieve this, either the recognition algorithm is modified or more efficient pre-processing techniques are used. This paper proposes a novel and optimized Artificial Bee Colony (ABC) algorithm, to perform facial recognition. Although the database being used here is Labeled...
The Information Extraction is a method for filtering information from large volumes of text. It includes the extraction of documents from collections and the tagging of particular terms in text. But non-text information such as graphs, images, figures, etc are common in any technical documents. Scientific charts are commonly used in graphical representation of statistical, experimental and technical...
Abstractive summarization has been explored only to some extent in recent years in English, Japanese and other foreign languages. This paper shows that abstraction can be accomplished for Indian Languages, specifically Kannada, using guided summarization approach. The sArAmsha system involves analyzing the given Kannada document and performing parts of speech tagging and stemming operations, identification...
The Internet provides many sources of different opinions, expressed through user reviews of products, blogs, and forum discussions. Systems which could automatically summarize these opinions would be immensely useful for those who wish to use this information to make decisions. The previous work in automatic summarization has completely focused on extractive summarization, in which key sentences are...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.