The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Effective presentation skills can help to succeed in business, career and academy. This paper presents the design of speech assessment during the oral presentation and the algorithm for speech evaluation based on criteria of optimal intonation. As the pace of the speech and its optimal intonation varies from language to language, developing an automatic identification of language during the presentation...
This paper presented a method of evaluating the health of lithium battery based on the continuous hidden Markov model (CHMM). This paper focuses on how to use CHMM to build the evaluation model. The capacity of battery is chosen as the observation variable. The evaluation process is divided into two phases: leaning phase and evaluation phase. First, learning data is used to estimate the elements of...
In our project, our intention is to create a voice and speech recognition system in smart phones that recognizes voice and captures the speech data in Tamil and stores and converts the captured speech as text in Tamil language itself. This can be used in voice dialing, sending SMS by saying out the message and the captured message is sent to the recipient in Tamil. There has not been much consideration...
In this paper we present a system for offline recognition cursive Arabic handwritten text based on Hidden Markov Models (HMMs). The system is analytical without explicit segmentation used embedded training to perform and enhance the character models. Extraction features preceded by baseline estimation are statistical and geometric to integrate both the peculiarities of the text and the pixel distribution...
This paper describes an online handwritten cursive word recognition approach by combining segmentation-free and segmentation-based methods. To search the optimal segmentation and recognition path as the recognition result, we can attempt two methods: segmentation-free and segmentation-based, where we expand the search space using a character-synchronous beam search strategy. The probable search paths...
In this paper we present a system for offline recognition cursive Arabic handwritten text based on Hidden Markov Models (HMMs). The system is analytical without explicit segmentation used embedded training to perform and enhance the character models. Extraction features preceded by baseline estimation are statistical and geometric to integrate both the peculiarities of the text and the pixel distribution...
This paper presents a review on few notable speech recognition models that are reported in the last decade. Firstly, the models are categorized into sparse models, learning models and domain - specific models. Subsequently, the characteristics of the models have been observed using speech constraints, algorithmic constraints and performance constraints. The performance of these models reported in...
Speech recognition is a broad subject as speech is natural way of communication. The acoustic and language model for this system are available but mostly in English language [15]. In India there are so many peoples who can't understand or speak English. So the speech recognition system in English language is of no use for these people. Here we presented Isolated Hindi words recognition system which...
The difficulty of speech comprehension under background noise greatly influenced the use of hearing aids. To address this problem, this paper proposes a new hearing aids algorithm based on speech recognition and synthesis. This method is based on the pure speech to build a parameters database. Under the real noisy scene, implement speech recognition for the input speech, and then extract the corresponding...
In this paper an overview about the methods and approaches used in the past to achieve facial expression recognition as well as an approach that involves the use of neural networks that proves to be very efficient are presented. The possibility to achieve up to 70% accuracy even without extraction of facial features is substantiated. Achievements related to the latest improvements in the field of...
Dynamic Movement Primitives (DMPs)-originally a method for movement trajectory generation [1] has been also used for recognition tasks [2, 3]. However there has not been a systematic comparison between other recognition methods and DMPs using human movement data. This paper presents a comparison of commonly used Hidden Markov Model (HMM) based recognition with DMP based recognition using human generated...
Indonesian traditional dance preservation efforts that nowadays increasingly eroded by foreign culture needs to be improved and adapted to technological improvement. To answer that, BeatMe! Project developed for fulfil the needs of entertainment and traditional dance learning media by integrate 3D motion capture, processing data and visualization. This paper will explain the processing data detail...
This paper presents a system for automatic bird identification, which uses audio input. The experiments have been conducted on three groups of birds, which were created basing finishing on classification, the system is fully automated. The main problem in automatic bird recognition (ABR) is the choice of proper features and classifiers. Identification has been made using two classifiers-kNN (k Nearest...
3D sign language is a brand new technology that provides tools to create 3D signed content based on avatars. Pushed by the advances in computer graphics and many other advantages compared with videos of live signers, 3D sign language is getting more interest and lots of 3D signed scenes are being recorded and used for multiple purposes like young deaf teaching. In Tunsia, we created WebSign, a system...
We propose a segmentation based online word recognition approach which uses a Conditional Random Field (CRF) driven beam search strategy. An efficient trie-lexicon directed, breadth-first beam search algorithm is employed in a combined segmentation-and-recognition framework to accomplish real-time recognition of online handwritten cursive English words. This framework is developed by building a candidate...
This paper describes the Arabic Recognition Competition: Multi-font Multi-size Digitally Represented Text held in the context of the 11$^{th}$ International Conference on Document Analysis and Recognition (ICDAR2011), during September 18-21, 2011, Beijing, China. This first competition used the freely available Arabic Printed Text Image (APTI) database. Several research groups have started using the...
The research on noisy Tibetan speech recognition algorithm based on wavelet neural network (WNN) combined with auditory feature was carried out in this paper. The recognition classifier based on WNN was designed, and Mel Frequency Cepstrum Constant (MFCC) feature was given. Then the simulation on the given algorithm was run under the different signal to noise ratios (SNR), and the results illustrated...
In this paper, we propose a new method for computing and applying language model look-ahead in a dynamic network decoder, exploiting the sparseness of backing-off n-gram language models. Only partial (sparse) look-ahead tables are computed, with a size that depends on the number of words that have an n-gram score in the language model for a specific context, rather than a constant, vocabulary dependent...
In this paper, we design and implement an sBike (Sensorized Bike) prototype to support cyclists by recognizing various bicycling states including going straight, turning right or left, meandering, and stopping. An Android phone, which is integrated with an accelerometer, a magnetometer, and a GPS receiver, is mounted on the handle of bicycle to collect necessary data for analysis. Hidden Markov model...
The paper considers automatic visual recognition of signed expressions. The proposed method is based on modeling gestures with subunits, which is similar to modeling speech by means of phonemes. To define the subunits a data-driven procedure is applied. The procedure consists in partitioning time series, extracted from video, into subsequences which form homogeneous groups. The cut points are determined...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.