With the development of science and technology, computer technology is continually updated, and multimedia search technology has come into wide use. For music retrieval, text-based retrieval technology cannot meet diversified retrieval needs. Humming retrieval finds a piece of music by matching the input audio against the audio in the database, which is a more convenient way to retrieve music. In this...
Signature verification is very important in verification and authentication systems due to its widespread use in our daily lives. Previous research in this field has mainly focused on optimizing features and matching algorithms. In this paper, we present an effective alternative approach for producing a reference signature that generalizes the properties of the given sample signatures. Two experiments...
The Restricted Boltzmann Machine (RBM) has been successfully applied to many different machine learning and pattern recognition problems. Usually, a fixed learning rate (FLR) is used for training an RBM. However, the reconstruction error (RCERR) with FLR may not decrease at every iteration, which results in slow convergence. In this paper, we propose a method to dynamically choose the learning rate...
This paper introduces two novel algorithms for detecting groups of people standing or freely moving in a crowded environment. The proposed algorithms exploit low-level features extracted from videos. The first algorithm, the Link Method, uses a learning and forgetting strategy for modeling dynamics of proxemics between individuals. Two versions of this algorithm are proposed: they differ in the analysis...
Iris Recognition (IR) is a demanding field, owing to varying contrast and live tissue. Important contrast-invariant features need to be extracted to address this problem. This paper proposes a novel evolutionary feature selection algorithm, namely Dynamic Binary Particle Swarm Optimization (DBPSO), for enhanced IR. DBPSO generates a highly optimized global best vector, using which, the number...
In this paper, the effect of choosing keywords that include or exclude plosive sounds on an isolated speaker recognition system is investigated. To perform this study, a Turkish word database was created, consisting of 48 words that include plosives and 7 words without plosives. Recordings were acquired at a sampling frequency of 16 kHz in a professional recording studio with sound insulation. The...
Most of the existing techniques for palmprint recognition are based on metrics that evaluate the distance between a pair of features. These metrics are typically based on static functions. In this paper we propose a new technique for palmprint recognition based on a dynamical system approach, focusing on preliminary experimental results. The essential idea is that the procedure iteratively eliminates...
In this paper, a new set of features for addressing the problem of unsupervised query-by-example spoken term detection is proposed. The task is to find a spoken query in large speech databases. In unsupervised audio search, language-specific resources are not required. Thus this system is more appropriate in cases where sufficient training data is not available for creating an Automatic...
The use of Millimetre wave images has been proposed recently in the biometric field to overcome certain limitations when using images acquired at visible frequencies. In this paper, several body shape-based techniques were applied to model the silhouette of images of people acquired at 94 GHz. We put forward several methods for the parameterization and classification stage with the objective of finding...
In this paper, we propose a framework for the segmentation and classification of document streams. The framework is composed of two modules: segmentation and verification. Both modules use an incremental classifier that learns progressively along the stream. In the segmentation module, the relationship between two consecutive pages is classified as either continuity or rupture. Rupture is synonymous...
In this paper, we propose a novel robust and efficient minutia-based fingerprint matching algorithm. There are two key contributions. First, we apply a set of global level minutia dependent features, i.e., the qualities that measure the reliabilities of the extracted minutiae and the area of overlapping regions between the query and template images of fingerprints. The implementation of these easy-to-get...
In this paper, a novel approach for retrieving human motion capture data, based on quaternions and EMD, is presented. The method consists of two main steps: indexing and matching. In the indexing step, to address the problem of high-dimensional data, we use quaternions to represent key-joint rotation information and, mapping the distribution of the original CMU database, we apply K-means clustering to categorize query...
In this paper, we propose a method to integrate the results of different cover song identification algorithms into a single measure which, on average, gives better results than the initial algorithms. The different distance measures are fused by projecting all of them into a multi-dimensional space whose dimensionality is the number of distances considered...
Automatic recognition of human faces irrespective of the expression variations is a challenging problem. In this paper, we propose a novel method for face recognition based on ‘edge-strings’. Experimental studies on face perception have shown the significance of edge features in visual perception and learning. In the proposed technique, the edges of a face are identified, and a feature string is created...
This paper presents test results for a Query-by-singing/humming (QbSH) system, measured while varying the pitch extraction algorithm. The test verifies the matching engine of our QbSH system, whose database is constructed from polyphonic recordings such as MP3 files. For the test, we used 3 different pitch extraction algorithms, and the experimental results are obtained with our...
Event co-reference is the process of identifying descriptions of the same event across sentences, documents, or structured databases. Existing event co-reference work focuses on sentence similarity models or feature based similarity models requiring slot filling. This work shows the effectiveness of using a hybrid approach where the similarity of two events is determined by a combination of the similarity...
This paper aims to understand the components of speech that contribute to emotion characteristics in speech. Four components of speech (vocal tract, excitation, duration and intonation) are considered in this study. A Flexible Analysis Synthesis Tool (FAST) is developed to modify the features of an utterance from neutral to emotion or from emotion to neutral. The key ideas used in this work are the...
In this paper, the methods used in the literature to address online signature verification are studied. We propose a new combination of existing features for online signature verification. Finally, we examine one of the aforementioned methods and show the results. This research describes biometric elements classified into two main categories: physical and behavioural.
In this paper, we present an approach for word spotting in gray-scale Pashto documents, written in a modified Arabic script. Various profile and transitional features are extracted from gray-scale word images. The gray-scale feature vectors are then converted into binary feature vectors by replacing each value within the gray-scale feature vectors with its binary equivalent. In this way, we have enabled...
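The gray-scale-to-binary conversion described in this abstract can be illustrated with a minimal sketch. The abstract is truncated and does not state the bit width or bit order, so the 8-bit, most-significant-bit-first encoding and the function name `to_binary_features` are assumptions made purely for illustration, not the authors' implementation.

```python
def to_binary_features(gray_features, bits=8):
    """Replace each integer feature value with its fixed-width binary digits.

    Assumptions (not from the abstract): values are non-negative integers
    that fit in `bits` bits, and digits are emitted most significant first.
    """
    binary = []
    for value in gray_features:
        if not 0 <= value < 2 ** bits:
            raise ValueError(f"value {value} does not fit in {bits} bits")
        # Emit the bits of this value from the highest position to the lowest.
        binary.extend((value >> shift) & 1 for shift in range(bits - 1, -1, -1))
    return binary


if __name__ == "__main__":
    # A two-element gray-scale vector becomes a 16-element binary vector.
    print(to_binary_features([5, 255]))
```

Under these assumptions, a gray-scale vector of length n becomes a binary vector of length 8n, which can then be compared with bitwise operations.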
Dynamic Time Warping (DTW) is a non-linear distance measure widely used in query by humming. This paper presents work on cross-sentence retrieval, relaxed end-point constraints, the cost function, and improvements to these that raise DTW performance for query by humming. Based on this research, we build a prototype of a hierarchical humming matching system. In a test on a database containing 500 songs and 55...
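For readers unfamiliar with the distance measure named in this abstract, the following is a minimal sketch of textbook dynamic time warping between two pitch sequences. It does not include the cross-sentence retrieval, end-point relaxation, or cost-function improvements the paper refers to; the absolute-difference local cost and the function name `dtw_distance` are assumptions chosen for illustration.

```python
def dtw_distance(query, reference):
    """Classic DTW distance between two numeric sequences.

    Assumptions (not from the abstract): local cost is the absolute
    difference, and both end points are strictly aligned.
    """
    n, m = len(query), len(reference)
    inf = float("inf")
    # cost[i][j] holds the best cumulative cost of aligning the first i
    # query items with the first j reference items.
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(query[i - 1] - reference[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # query item repeated against the reference
                cost[i][j - 1],      # reference item repeated against the query
                cost[i - 1][j - 1],  # both sequences advance together
            )
    return cost[n][m]


if __name__ == "__main__":
    hummed = [60, 62, 64]
    stored = [60, 62, 62, 64]  # same melody with one note held longer
    print(dtw_distance(hummed, stored))
```

Because the warping path may repeat items, a hummed melody that holds a note longer than the stored version can still align at zero cost, which is why the measure suits tempo-varying humming input.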