The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This paper presents a speech enhancement algorithm for Thai speech corrupted by noise. A Markov random field (MRF) is used to remove noise from the noisy speech. Specifically, the noisy speech is transformed using a short time Fourier transform (STFT) resulting in time-frequency representation coefficients. Then, it is fed into a voiced/unvoiced classification by using voice activity detector (VAD)...
This paper proposes the novel method to assess Thai speech based on fractal analysis. The fractal algorithm, namely, Higuchi's method was selected to evaluate the fractal dimension (FD) of segmented speech signals. To show the FD changes in waveform over time, the time-dependent FD (TDFD) was proposed. Probability distribution of TDFDs using kernel density estimation was used as an additional parameter...
Listening to a lecture in a classroom is a common process for studying in Thailand. Ability of learning is affected by the ability of hearing the instructors' speech. Acoustical environments of the classroom, hence, can influence speech intelligibility. In this research, acoustical parameters, reverberation time, listeners' locations in classrooms and their effects were studied. By using an assumption...
This paper presents a bi-lingual Thai-English text-to-speech synthesis (TTS) system on Android mobile devices. The system deploys a Thai text processor and a well-known open-source English text processor, which can analyzes English text at high intelligibility. With hidden Markov model (HMM) based speech unit and audio streaming optimization, it can synthesize highly smoothed sounds at a fast response...
Modern speech recognition techniques rely on large amount of speech data whose acoustic characteristics match with the operating environments to train their acoustic models. Gathering training data from loudspeakers playing recorded speech utterances are far more practical than from human speakers. This paper presents results from speech recognition experiments providing practical insights on effects...
Enhancement of speech perception is a crucial aspect for cochlear implant (CI) technology. In a tonal language such as Thai, with segments (consonants and vowels) and supra-segments (tones), many crucial acoustic cues are to be taken into account for speech processing strategy, i.e., amplitude envelopes and temporal fine structure. This paper presents a new speech synthesis algorithm for CI, which...
An automatic broad class segmentation is an important pre-processing step in speech recognition and other speech applications, for example, the speech transcription task to support the phonetic transcription of speech corpus and pronunciation error detection of phone boundaries in language learning applications. This research is aimed at the improvement of the acoustic parameters for the Thai automatic...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.