The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Based on the signal model of ear canals, a novel method for solving the inverse problem of estimating the unique solution of the ear canal area function and the eardrum reflection coefficient given the acoustic input impedance at the entrance of an ear canal is presented. Up-sampling techniques to improve the accuracy of the estimates are also presented. The performance of this method and factors...
This paper views target detection and pattern recognition as a kind of communications problem and applies error-correcting coding to the outputs of a convolutional neural network to improve the accuracy and reliability of detection and recognition of targets. The outputs of the convolutional neural network are designed according to codewords with maximum Hamming distances. The effects of the codewords...
It is known that convolutional neural networks (CNNs) are efficient for optical character recognition (OCR) and many other visual classification tasks. This paper applies error-correcting output coding (ECOC) to the CNN for segmentation-free OCR such that: 1) the CNN target outputs are designed according to code words of length N; 2) the minimum Hamming distance of the code words is designed to be...
In distributed speech recognition, vector quantization is used to reduce the number of bits for coding speech features at the user end in order to save energy for transmitting speech feature streams to remote recognizers and reduce data traffic congestion. We notice that the overall bit rate of the transmitted feature streams could be further reduced by not sending redundant frames that can be interpolated...
Voiced-unvoiced-silence (V/UV/S) classification of speech sounds is important in automatic speech/speaker recognition, speech segmentation, speech signal compression, and speech analysis. Training-based classifications suffer from lack of training databases or degrade when training and test statistics mismatch due to variances in speakers, languages, talking styles, noise, transmission channels, etc...
Knowledge about lip and glottal reflection coefficients during phonation is needed to eliminate their distortion effects on the estimates of vocal-tract area functions and glottal waves from vowel sounds. Direct measurements of these coefficients at human mouths are difficult. This paper presents a method for estimating them from vowel sounds. The estimation encounters an ill-defined inverse problem:...
This paper shows how to obtain accurate glottal waves via inverse filtering of vowel sounds and how to determine if these glottal waves contain any significant resonance of vocal tracts. We obtain vocal-tract filter (VTF) estimates for the inverse filtering from sustained vowel sounds over closed glottal phases using a new method, which minimizes the effects of glottal waves on the VTF estimates....
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.