Video cataloging is the main technology of video management and reuse. It helps to query tapes or films quickly and accurately, and greatly improves work efficiency. However, cataloging is currently performed manually because existing computer-aided cataloging systems cannot achieve the expected accuracy and speed. This paper proposes an algorithm to control the video structure analysis process cleverly...
This paper presents a new statistical model for describing real textured images. Our model is based on the observation that the Scale-Invariant Feature Transform (SIFT) descriptors extracted from a given image can be properly modeled by the Gamma distribution. The maximum-likelihood algorithm was used to estimate the two parameters of the Gamma distribution. The efficiency of the proposed approach was...
In this work, an effort has been made to differentiate the allied raagas in Carnatic music. Allied raagas are raagas composed using the same set of notes. Features derived from the pitch sequence are used to differentiate these raagas. The coefficients of the Legendre polynomials used to fit the pitch contours of the song clips are used for identifying raagas. Obtained features are validated...
A mathematical-morphology-based filter structure called a sieve is used to process image sequences of a talker's mouth and form visual speech features. The effects of varying the type of filter, the post-processing and hidden Markov model (HMM) parameters on recognition accuracy are investigated using two audio-visual speech databases.
Raga plays an important role in Indian classical music. A raga is made up of swaras (notes). According to the characteristics of the raga, Indian classical music is further divided into two systems: Hindustani (North Indian) classical music and Carnatic (South Indian) classical music. This paper introduces some basic terms in Indian classical music and terms associated with raga. Then we discuss different...
This paper presents a new probabilistic graphical model used to model and recognize words representing the names of Tunisian cities. The work is based on a dynamic hierarchical Bayesian network. The aim is to find the best model of Arabic handwriting to reduce the complexity of the recognition process by permitting partial recognition. We propose a segmentation of the word...
Using gait recognition methods, people can be identified by the way they walk. The most successful and efficient of these methods are based on the Gait Energy Image (GEI). In this paper, we extend the traditional Gait Energy Image by including depth information. First, GEI is extended by calculating the required silhouettes using depth data. We then formulate a completely new feature, which we call...
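For context, the conventional Gait Energy Image that this abstract builds on is commonly computed as the pixel-wise average of aligned, size-normalized binary silhouettes over a gait sequence. The following is a minimal sketch of that baseline only, not of the depth-based extension the abstract proposes. The function name `gait_energy_image` and the nested-list frame representation are assumptions made for illustration.

```python
def gait_energy_image(silhouettes):
    """Average a sequence of binary silhouette frames pixel by pixel.

    silhouettes: non-empty list of frames; each frame is a list of rows,
    each row a list of 0/1 values. All frames are assumed to be already
    aligned and to share the same dimensions (hypothetical input format).
    """
    if not silhouettes:
        raise ValueError("at least one silhouette frame is required")
    rows = len(silhouettes[0])
    cols = len(silhouettes[0][0])
    for frame in silhouettes:
        if len(frame) != rows or any(len(row) != cols for row in frame):
            raise ValueError("all frames must have the same dimensions")
    n = len(silhouettes)
    # Each output pixel is the fraction of frames in which it was foreground.
    return [
        [sum(frame[r][c] for frame in silhouettes) / n for c in range(cols)]
        for r in range(rows)
    ]


if __name__ == "__main__":
    frames = [
        [[1, 0], [1, 1]],
        [[1, 1], [0, 1]],
    ]
    print(gait_energy_image(frames))
```

Pixels that are foreground in every frame keep the value 1, while pixels that change during the walk take intermediate values, which is what lets a single image summarize motion.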
This paper introduces a new method for streamed action recognition using Motion Capture (MoCap) data. First, the histograms of action poses, extracted from MoCap data, are computed according to Hausdorff distance. Then, using a dynamic programming algorithm and an incremental histogram computation, our proposed solution recognizes actions in real time from streams of poses. The comparison of histograms...
The brain is the most complicated organ of the body. It controls the activity of all other organs. Understanding its function and its language could give us a direct communication pathway for connecting with an injured motor organ, and it could be the core of functional repair. Neurons are the vertices of a vast network that generates the brain signals. Neuronal recordings capture brain activity signatures...
In this paper, we focus on discrete expression classification using dynamic 3D sequences (4D data) recording the facial movements. A robust approach for registering 4D data is proposed and a variant of local binary patterns on three orthogonal planes is used for feature extraction. We present a fully automatic facial expression recognition pipeline. The system was evaluated on the publicly available...
Online handwriting recognition of Indian scripts has been drawing increasing attention in recent years. Related research has gained further momentum due to recent planned funding by the Govt. of India towards technology development of Indian languages and scripts. Standard databases of handwritten characters of a few Indian scripts have already become available. These include online handwritten character...
This paper addresses the problem of font retrieval using a query-by-example paradigm: given a font, retrieve the most visually similar fonts. We describe a font by (a) rendering a set of reference characters, (b) extracting a feature vector for each reference character and (c) concatenating the character-level descriptors. The similarity between two fonts is simply the similarity between the vectorial...
In this paper, we address the texture retrieval problem using wavelet distribution. We propose a new statistical scheme to represent the marginal distribution of the wavelet coefficients using a mixture of generalized Gaussian distributions (MoGG). The MoGG can capture a wide range of histogram shapes, which provides a better description of texture and enhances texture discrimination. We propose...
This paper proposes an efficient method for text-independent writer identification using a codebook. The occurrence histogram of the shapes in the codebook is used to create a feature vector for the handwriting. There is a wide variety of different shapes in the connected components obtained from handwriting. Small fragments of connected components should be used to avoid complex patterns. A new...
In human expression recognition, the representation of expression features is essential to recognition accuracy. In this work we propose a novel approach for extracting expression dynamic features from facial expression videos. Rather than utilising statistical models, e.g. the Hidden Markov Model (HMM), our approach integrates expression dynamic features into a static image, the Histogram Variances...
Much research has been conducted to identify pertinent parameters and adequate models for automatic music genre classification, which plays a significant role in multimedia applications. In principle, the categorization of music is mostly done by experts in the field, based on several musical attributes (timbre, melody, etc.). Despite the great effort expended, the results are very subjective...
This paper introduces the concept of a mapogram. A mapogram may be viewed as a special form of spatiogram, which is a histogram containing additional spatial information. Additionally, this paper presents theory relevant to the creation of a proposed mapogram. A similarity measure derived from the Bhattacharyya coefficient is obtained in order to make comparisons between mapograms. Examples using...
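As background for the similarity measure mentioned above, the Bhattacharyya coefficient between two normalized histograms p and q is the sum over bins of sqrt(p_i * q_i). The following is a minimal sketch for plain histograms only; it does not reproduce the mapogram-specific measure, which the truncated abstract does not define. The function name `bhattacharyya_coefficient` is an assumption made for illustration.

```python
import math


def bhattacharyya_coefficient(p, q):
    """Return the Bhattacharyya coefficient of two histograms.

    p, q: sequences of non-negative bin counts of equal length.
    Each histogram is normalized to sum to 1 before comparison, so the
    result lies in [0, 1]: 1 for identical shapes, 0 for no overlap.
    """
    if len(p) != len(q):
        raise ValueError("histograms must have the same number of bins")
    total_p, total_q = sum(p), sum(q)
    if total_p <= 0 or total_q <= 0:
        raise ValueError("histograms must have positive total mass")
    return sum(
        math.sqrt((a / total_p) * (b / total_q)) for a, b in zip(p, q)
    )


if __name__ == "__main__":
    print(bhattacharyya_coefficient([1, 1, 2], [2, 2, 4]))  # same shape
    print(bhattacharyya_coefficient([1, 0], [0, 1]))        # disjoint bins
```

Because both inputs are normalized first, two histograms with the same shape but different total counts compare as identical.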
Image information is widely used for content-based retrieval of image sequences. It is mainly used to segment a video by scene, which makes structural video browsing possible. The process that divides a video into shots is called "video segmentation". In video segmentation, detecting a cut, which is the turning point between scenes, is called "cut detection". In this paper,...