The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This paper presents a phone segmentation method without a prior knowledge about the text contents. The proposed method is an unsupervised phone boundary detection based on band-energy tracing technique. It demonstrates a better performance than those previous works when the method was applied to TIMIT corpus. But the performance degrades when the method is applied to a Mandarin Chinese speech database,...
Unsupervised phone segmentation means that the phone boundaries in an utterance can be detected without a prior knowledge about the text contents. Usually, a spectral change in the speech signal implies the existence of a phone boundary. In this paper, the Delta Spectral Function (DSF) is defined for each frame to represent the variation of band energy for a specific band. Then a number of bands that...
The reliable detection of salient acoustic-phonetic cues in speech signal plays an important role in speech recognition based on speech landmarks. Once speech landmarks are located, not only can phone recognition be performed, but other useful information can also be derived. This paper focuses on the detection of burst onset landmarks, which are crucial to the recognition of stop and affricate consonants...
The independent component analysis (ICA) is a commonly used method to find the demixing matrix for the blind source separation (BSS). For speech signals, we should solve BSS problems in the convolutive mixing model, i.e., ICA technique is extended to the frequency domain. The cross-spectral density matrices are computed for each frequency bin instead of covariance matrices in time domain. The joint...
Reliably detecting salient phonetic-acoustic cues plays an important role in speech recognition based on speech landmarks. Once these speech landmarks are located, not only phone recognition can be performed but some other useful information can be derived as well. This paper focuses on the topic of detecting burst onset landmark, an important phonetic characteristic in stops and affricates. The proposed...
The non-stationary behavior makes stops classification one of worthy examining subject in the speech community. Over several decades, many researchers have sorted out a list of acoustic properties that are useful to identify a stop. In this paper, we extract features that are sufficient to represent the important acoustic properties of stops, like statistic moments of the burst spectrum. In combining...
The speech modification is a mechanism of changing speech characteristics and prosody for some specific applications. It is used in voice conversion, pronunciation correction, tone perception, and language learning. The most important part is the change of pitch in an utterance. Pitch extraction is an essential process for speech modification. This paper presents an efficient pitch extraction algorithm...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.