In this paper, we present a CALL system with a novel two-pass architecture for detecting miscues in sentence reading. The research concentrates on the effect of the system's language model (LM), which is necessary for recognizing what the speaker actually said. We first compared a one-pass baseline system with and without an LM, and found that LM can lead to relatively...
This paper presents our recent study on resolving specific acoustic problems of a computer-assisted language learning (CALL) system by modifying the acoustic model (AM) and features under an ASR framework. First, speaker-dependent cepstral mean normalization (Speaker CMN) is adopted to alleviate channel distortion, with which the average human-machine scoring correlation coefficient (ACC)...
This paper discusses vowel pronunciation quality assessment in our computer-assisted Mandarin Chinese learning system. Under the speech recognition framework, phonetic pronunciation assessment is usually based on the phonetic posterior probability score, which may be computed by normalizing the frame-based posterior probability or calculated directly on the phone segment. By the first method,...
Keyword spotting has become a very important branch of speech recognition. However, acoustic mismatch between training and testing environments often causes severe degradation in recognition performance. This paper presents an improved keyword spotting strategy. A fuzzy search algorithm is proposed to extract keyword hypotheses from a syllable confusion network (SCN). SCN is linear and naturally...
This paper presents an approach to tone recognition in Mandarin conversational telephone speech (CTS) based on a real context model. The real context model is proposed as a new concept designed with special consideration of the fact that Mandarin CTS is characterized by complicated tone behaviors due to physiological articulation. As pitch is a supra-segmental feature, current tone's pitch value is...
Noisy environments and natural spoken speech remain a challenge for speech recognition. This paper explores the problem for Mandarin speech from the aspects of signal processing, acoustic model, language model, decoding algorithm, and post-processing. The two-phase mel-warped Wiener filter algorithm is improved to obtain noise-robust features. Segmentation algorithm and gender...
This paper presents a fast vocabulary-independent audio search method for Mandarin spontaneous speech based on syllable confusion network (SCN) indexing. A confusion network is linear and naturally suitable for indexing. The feasibility of using a syllable confusion network as the lattice representation is investigated first. Since direct syllabic decoding may not have a very high accuracy, long-...
This paper discusses the tone scoring part of a Mandarin pronunciation scoring system. It recognizes the tones of isolated syllables and words using a GMM model and uses the recognition results for tone assessment. Initially, experimental results are poor on strongly accented speech. There are two reasons: one is that inaccurate forced alignment leads to incomplete F0 contours; the other is due to...
No prior study has examined the effect of intravenous injection of bone marrow mononuclear cells (MNCs) on myocardial infarction size (IS). We tested the hypothesis that transplantation of MNCs decreases IS through the release of vascular endothelial growth factor (VEGF). Immediately after ligation of the left coronary artery of immunodeficient mice, PBS or MNCs were intravenously administered. Myocardial...