The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Feature selection is a strategy that aims at making text classifiers more efficient and accurate. In this paper, we proposed a novel feature selection method based on Tibetan grammar for Tibetan classification. Tibetan language express grammatical meaning through the function words and word order, and the function word has large proportions. By analyzing the Tibetan grammar and distribution of part...
The increasing number of e-commerce and social networking sites are producing large amount of data pertaining to reviews of a product, restaurant etc. A keen observation reveals that the text data gathered from any social review site are specific to a context and are subjective in nature promoting varied perceptions of sentiments. The novel idea is to define context specific grammar as semantics for...
The de facto implementation of Software Defined Networking (SDN), i.e., OpenFlow, only parses L2-L4 headers, which limits the use of SDN to employ control intelligence in application layer. In this paper, we advocate content parsing to empower SDN with finer grained control ability over traffic. Specifically, we propose a scalable content parser, called COPY, to identify and parse application layer...
This paper presents the state-of-the art dynamic sign language recognition (DSLR) system for smart home interactive applications. Our novel DSLR system comprises two main subsystems: an image processing (IP) module and a stochastic linear formal grammar (SLFG) module. Our IP module enables us to recognize the individual words of the sign language (i.e., a single gesture). In this module, we used the...
Stochastic Context-Free Grammars (SCFG) have promising application prospect in the field of Multi-Function Radars (MFR) states recognition and threat estimation, which entails the fast learning of the probability of radar grammar production based on training data. Conventional learning algorithms are limited in practical application for their high computational complexity. A new fast learning algorithm...
Chunking or shallow syntactic parsing is proving to be a task of interest to many natural language processing applications. The problem gets worse for the Arabic language because of its specific features that make it quite different and even more ambiguous than other natural languages when processed. In this paper, we present a method for chunking Arabic texts based on supervised learning. We use...
To make faster and more complete Kazakh syntactic analysis, the improved algorithm analysis Chart analysis method is presented. First introduced the tradition of bottom-up and top-down chart analysis, focusing on bottom-up analysis algorithms applications statement and found that the algorithm increases the length of the sentence lower efficiency of the algorithm. For a long sentence Kazakh left recursive...
Grammar Induction (GI) is the problem of extracting hidden regularities and syntactic patterns in languages. Not only the manner of extraction is intricate but also the definition of meaningful patterns is a challenge. Alignment Based Learning (ABL) is one of the research endeavors targeting such challenges in GI. Our present research on applying ABL to POS sequences in English, Persian and Arabic...
This paper presents a unified framework for recognizing and scoring dance motion using 2-layer classifier so that computation complexity is distributed into two layers. This research examines the performance of sliding window, hidden Markov Model (HMM) and conditional random field (CRF) as the first layer classifier to segment the input video into a sequence of motion primitive label. The second layer...
Translation of natural language has always attracted attention of scholars world-wide, be it manual or machine based. Since, the last six decades machine translation has been witnessed. It is attempted in various Indian and Foreign languages. Machine Translation has also been attempted with different techniques. The success ratio of translation has always been an encouraging factor, which kept attracting...
Question identification is a field Natural Language Processing and also Information Extraction. The aim of work is detecting Turkish tweets which are including question expressions. The application contains three stages: applying some pre-processing steps to data set for cleaning unnecessary data like Retweet, determining candidate tweets via a rule-based method and extracting tweets which are really...
In this paper we described an algorithm called NegDetector for locating concerned clinical terms mentioned in electronic narrative text clinical documents and detecting whether the particular terms appeared in different positions are negated or affirmed. The algorithm infers the status of a condition with regard to the property from simple lexical clues occurring in the context of condition, maybe...
Recognition of handwritten mathematical expressions (HMEs) has become a cutting edge research topic recently, as there are increasingly needs for pen-inputting applications. In this paper, we presented a novel framework to analyse HME layout and semantic information. This framework includes three steps, namely symbol segmentation, symbol recognition and semantic relationship analysis. For symbol segmentation,...
The pointer analysis finds out what a pointer points to in a program. This analysis is useful in static program analysis, bug detection, source navigations and so on. We propose an approach to generate new constraints for one points-to and we use the constraints to refine it. The constraints can be used to refine other program analysis as well. This new constraints are based on the sequencings among...
Since their early development, genetic programming-based algorithms have been showing to be successful at challenging problems, attaining several human-competitive results and other awards. This paper will present another achievement of such algorithms by describing how our team has won an international machine-learning competition. We have solved, by means of grammar-based genetic programming techniques,...
This work deals with the on-line recognition of hand-drawn graphical sketches with structure. We present a novel approach, in which the search for a suitable interpretation of the input is formulated as a combinatorial optimization task - the max-sum problem. The recognition pipeline consists of two main stages. First, groups of strokes possibly representing symbols of a sketch (symbol candidates)...
All of the previous syllable based Automatic Speech Recognizers (ASRs) for the Amharic language are built by training a separate acoustic model for each of the 196 distinctly pronounced Consonant-Vowel (CV) syllable. In this paper, we will demonstrate that a smaller number of acoustic models are sufficient to build a syllable based, speaker independent, continuous, Amharic ASR. It is built for weather...
Tibetan information processing is an interdiscipline of Tibetan linguistics and computer science. According to the Tibetan grammar, the formation of Tibetan word and words needs to consider the properties of components, therefore, the component decomposition for modern Tibetan words is a basic work for Tibetan information processing. This paper is based on the Tibetan grammar studying the model of...
Statistical approach with surrounding context around a space was widely used as a main feature for Thai sentence-breaking. However, it does not represent a contextual behaviour regarding an entire context in a sentence. Moreover, it does not take an advantage of Thai grammar rules to determine a sentence boundary. This paper proposes the use of a hybrid approach integrating between rule-based method...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.