Serwis Infona wykorzystuje pliki cookies (ciasteczka). Są to wartości tekstowe, zapamiętywane przez przeglądarkę na urządzeniu użytkownika. Nasz serwis ma dostęp do tych wartości oraz wykorzystuje je do zapamiętania danych dotyczących użytkownika, takich jak np. ustawienia (typu widok ekranu, wybór języka interfejsu), zapamiętanie zalogowania. Korzystanie z serwisu Infona oznacza zgodę na zapis informacji i ich wykorzystanie dla celów korzytania z serwisu. Więcej informacji można znaleźć w Polityce prywatności oraz Regulaminie serwisu. Zamknięcie tego okienka potwierdza zapoznanie się z informacją o plikach cookies, akceptację polityki prywatności i regulaminu oraz sposobu wykorzystywania plików cookies w serwisie. Możesz zmienić ustawienia obsługi cookies w swojej przeglądarce.
In this paper, the development of Multilingual Phone Recognition System (MPRS) in the context of Indian languages is described. MPRS is a language independent Phone Recognition System (PRS) that could recognise the phonetic units present in a speech utterance of any language. We have developed two Bilingual and a quadrilingual PRS using four Indian languages — Kannada, Telugu, Bengali, and Odia. International...
This paper describes techniques for modification and incorporation of excitation source features to convert neutral speech to emotional speech. Sad and anger are considered in this work. The features that have been used are epoch strength, epoch sharpness and pitch contour. A new method of modifying and incorporating epoch strength and sharpness is proposed. New pitch contours corresponding to the...
The present work investigates the importance of excitation source features for language identification (LID). Linear prediction residual (LPR) represents the excitation source signal. By processing the LPR in sub-segmental, segmental and supra-segmental levels, we can get the language specific information present within a glottal cycle, within a sequence of a few glottal cycles and at the prosody...
Spoken language identification is a technique to model and classify the language, spoken by an unknown person. Language identification task is more challenging in environmental condition due to addition of different types of noise. Presence of noise in speech signal causes several nuisances. This paper covers several aspect of language identification in noisy environment. Experiments have been carried...
In the storyteller speech, pauses plays a significant role in introducing suspense and climax. Pauses are used to emphasize keywords, emotion-salient words and separate the phrases in the utterance. The objective of this work is to predict the position and duration of the pauses in the synthesized speech from the text-to-speech system. We analyzed the pause patterns in storyteller speech and classified...
This paper is concerned with speech signal based emotion recognition. Linear Prediction (LP) residual mainly contains source specific emotional information. LP residual is derived by inverse filtering of the speech signal. For characterizing the basic emotions, LP residual has been explored at sub-segmental level, segmental level, supra-segmental level, respectively. Gaussian mixture models (GMMs)...
This paper proposes a method for modeling the excitation signal to improve the quality of HMM-based speech synthesis system (HTS). Single optimal residual frame which closely relates to all frames of phone is chosen to represent the entire residual signal of the phone. Optimal residual frames of all phones present in the speech corpus are efficiently grouped based on positional and contextual features...
This work is mainly intended at identifying emotion contribution of different vowels in Telugu language. Instead of processing the entire speech signal we propose to focus only vowel parts of the utterance (/a/, /i/, /u/, /e/ and /o/). By analysing the vowels we can discriminate the emotions. In this work spectral and prosodic features are used for studying the effect of emotions on different vowels...
The present work explores the significance of the consonant-vowel (CV) transition and steady vowel (SV) regions for language identification (LID) task. The language-specific vocal tract information represented by Mel-frequency cepstral coefficients (MFCCs), extracted from the CV transition and steady vowel regions for LID task. The duration of CV transition and steady vowel regions are varied to analyze...
In this work, source features are explored for classifying infant cries. Different types of infant cries considered in this work are hunger, pain and wet-diaper. The various excitation source features explored in this work are source features namely epoch interval contour (EIC), epoch strength contour (ESC), epoch sharpness, slope of EIC and ESC features. In this work Gaussian Mixture Models (GMM)...
The basic goal of this work is to develop a Consonant-Vowel Recognition System (CVRS) for determining a sequence of Consonant-Vowel (CV) units present in a given speech utterance. In this work, we are focusing on developing CVRSs for Indian languages namely Bengali and Odia. This framework of developing CVRSs can be extended to any Indian languages. We have developed two separate CVRSs for Bengali...
Vowel regions play important role in various speech tasks, such as speech segmentation, speaker-verification, prosody modification and emotion conversion. The instants at which the onset and offset of vowel take place in the speech signal are known as vowel onset point and vowel offset point, respectively. Vowel regions start with the vowel onset point and end with the vowel offset point. In this...
In this paper, we proposed two-stage segmentation approach for splitting the TV broadcast news bulletins into sequence of news stories. In the first stage, speaker (news reader) specific characteristics present in initial headlines of the news bulletin are used for gross level segmentation. During second stage, errors in the gross level segmentation (first stage) are corrected by exploiting the speaker...
Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.