Search results

Items from 101 to 120 out of 378 results

1 ...
3
4
5
6
7
8
9

chapter

Cortical encoding of phonemic context during word production

Emily M. Mugler, Matthew Goldrick, Marc W. Slutzky

2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society > 6790 - 6793

2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Brain-computer interfaces that directly decode speech could restore communication to locked-in individuals. However, decoding speech from brain signals still faces many challenges. We investigated decoding of phonemes — the smallest separable parts of speech — from ECoG signals during word production. We expanded on previous efforts to identify specific phoneme by identifying phonemes by where in...

chapter

Neural decoding of spoken vowels from human sensory-motor cortex with high-density electrocorticography

Kristofer E. Bouchard, Edward F. Chang

2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society > 6782 - 6785

2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

We present the first demonstration of single-trial neural decoding of vowel acoustic features during speech production with high performance. The ability to predict trial-by-trial fluctuations in speech production was facilitated by using high-density, large-area electrocorticography (ECoG) combined with an adaptive principal components regression. In experiments from two human neurosurgical patients...

chapter

Enhanced Out of Vocabulary Word Detection Using Local Acoustic Information

Xuyang Wang, Ta Li, Pengyuan Zhang, Jielin Pan, more

2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing > 594 - 597

2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP)

The detection of Out-of-vocabulary (OOV) words is a crucial problem for spoken term detection (STD). In this paper, the use of integration with local acoustic information is investigated to retrieve more OOV words. Tokens with high local acoustic probabilities propagated in the search space at the decoding stage will be forced to propagate to the next frame. In this way, acoustic similar words can...

chapter

Optimizing PLLR Features for Spoken Language Recognition

Mireia Diez, Amparo Varona, Mike Penagarikano, Luis Javier Rodriguez-Fuentes, more

2014 22nd International Conference on Pattern Recognition > 779 - 784

2014 22nd International Conference on Pattern Recognition (ICPR)

Phone Log-Likelihood Ratios (PLLR) have been recently introduced as features for spoken language and speaker recognition systems. This representation has proven to be an effective way of retrieving acoustic-phonotactic information into frame-level vectors, which can be easily plugged into state-of-the-art systems. In a previous work, we began the search of reduced representations of PLLRs, as a mean...

chapter

Improved mandarin spoken term detection by using deep neural network for keyword verification

Xuyang Wang, Ta Li, Yeming Xiao, Jielin Pan, more

2014 10th International Conference on Natural Computation (ICNC) > 144 - 148

2014 10th International Conference on Natural Computation (ICNC)

In this paper, we propose to use Deep Neural Network (DNN), which has been proved to be the state-of-the-art technique in speech recognition, to re-estimate the confidence of keyword hypotheses in the verification stage of spoken term detection. The speech recognition system based on DNN outperforms that based on conventional Gaussian Mixture Model (GMM) but suffers from the increased decoding time...

chapter

Decoding of attentional selection in a cocktail party environment from single-trial EEG is robust to task

Timo Lauteslager, James A. O'Sullivan, Richard B. Reilly, Edmund C. Lalor

2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society > 1318 - 1321

2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Recently it has been shown to be possible to ascertain the target of a subject's attention in a cocktail party environment from single-trial (∼60 s) electroencephalography (EEG) data. Specifically, this was shown in the context of a dichotic listening paradigm where subjects were cued to attend to a story in one ear while ignoring a different story in the other and were required to answer questions...

chapter

Modified Viterbi decoder for HMM based speech recognition system

Y Rajeev Kumar, A Venkatesh Babu, K A Naveen Kumar, John Sahaya Rani Alex

2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT) > 470 - 474

2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT)

Viterbi algorithm is a dynamic programming algorithm used to find out the most likely word uttered by the unknown speech signal. In Viterbi algorithm, the observation probabilities are calculated using Gaussian distribution function. For implementation of Viterbi decoder, these probability values are initially stored in RAM. Thus conventional Viterbi decoder requires large RAM for its execution. In...

chapter

Implementation and optimization of 1200bps MELPe based on ARM

Weiping Huang, Xiaoqun Zhao, Jingyun Xu

2014 International Conference on Audio, Language and Image Processing > 176 - 179

2014 International Conference on Audio, Language and Image Processing (ICALIP)

This paper introduces basic principles of MELPe (Enhanced Mixed-Excitation Linear Predictive), which is an enhanced algorithm of MELP. Compiling optimization and code optimization methods will be proposed based on ARM1176JZF-S kernel. The encoding time of optimized algorithm drops from 110.75ms per frame to 52.5ms per frame and decoding time drops from 14.88ms per frame to 10.73ms per frame. Efficiency...

chapter

Open domain continuous filipino speech recognition with code-switching

Federico Ang, Yoshikazu Miyanaga, Rowena Cristina Guevara, Rhandley Cajote, more

2014 IEEE International Symposium on Circuits and Systems (ISCAS) > 2301 - 2304

2014 IEEE International Symposium on Circuits and Systems (ISCAS)

It is widely known that database quality has a huge impact on speech recognition system performance, most especially when the expected domain is well represented. In this paper, we use this idea as leverage for a data-driven solution to the problem of code-switching in Filipino. Practical Filipino conversations often contain English and other loan words in varying frequencies, demanding better training...

chapter

Selection of active speaker(s) in VoIP conference bridges: From linear domain to CELP parameters domain

Emmanuel Rossignol Thepie Fapi, Eric Poulin

2014 IEEE REGION 10 SYMPOSIUM > 466 - 470

2014 IEEE Region 10 Symposium

This paper presents alternative approaches to select the mixed channels during teleconferencing involving CELP CoDecs. The proposals address the problems related to complexity and delay when classical solutions based on PCM samples are used. The principle consists of avoiding total speech decoding and to extrapolate the speech audio level based on CELP parameters, before channels selection. Only the...

chapter

A new secure and efficient scheme of ADPCM encoder based on chaotic encryption

Mimoun Hamdi, Houcemeddine Hermassi, Rhouma Rhouma, Safya Belghith

2014 1st International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) > 7 - 11

2014 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)

This paper presents a new secure variant of ADPCM encoders that are adopted by the CCITT as Adaptive Differential Pulse Code Modulation. This version provides encryption and decryption of voice simultaneously with operations ADPCM encoding and decoding. The evaluation of the scheme showed better performance in terms of speed and security.

chapter

Next generation of mixed excited linear prediction speech quality

Haresh Miyani, Aalay Mehta, Pratik Nai, Harshad Patel

2014 International Conference on Green Computing Communication and Electrical Engineering (ICGCCEE) > 1 - 6

2014 International Conference on Green Computing Communication and Electrical Engineering (ICGCCEE)

Nowadays the number of mobile subscribers is increasing all over the world, so the system for the communication has to be improved. Mixed Excited Linear Prediction (MELP) algorithm is developed for reducing the bandwidth of the signal as well as transmit more data on a single channel. This results in increase in channel capacity. MELP is basically a speech coding method, relying on a Speech Encoder...

chapter

Remote spoken document retrieval using foreground speech segmentation based isolated word recognizer

K. T. Deepak, S. R. Mahadeva Prasanna

2013 Annual IEEE India Conference (INDICON) > 1 - 4

2013 Annual IEEE India Conference (INDICON)

This work describes the development of a scheme for retrieving spoken documents in a remote fashion stored on a voice server. The spoken documents are recorded and indexed based on the frequency of occurrence of isolated keywords and are stored on the voice server. An isolated word recognizer (IWR) is developed for recognizing the identified keywords spoken in isolated fashion. The IWR employs foreground...

chapter

On the robustness of tiny decoding graphs for voice-based robotic interaction

Abdelaziz A. Abdelhamid, Waleed H. Abdulla, Bruce A. MacDonald

2013 6th IEEE Conference on Robotics, Automation and Mechatronics (RAM) > 185 - 189

2013 6th International Conference on Robotics, Automation and Mechatronics (RAM)

In this paper we study the robustness of a command decoding approach based on tiny decoding graphs for voice-based robotic interaction. This approach comprises the fusion of the grammar rules and the statistical n-gram language models to produce an elegant and quite efficient tiny decoding graph. The resulting tiny graph has several advantages such as high speed and improved robustness of command...

chapter

Performance of a single-chip low bit-rate voice transcoder

Armein Z. R. Langi

2013 Joint International Conference on Rural Information & Communication Technology and Electric-Vehicle Technology (rICT & ICeV-T) > 1 - 4

2013 Joint International Conference on Rural Information & Communication Technology and Electric-Vehicle Technology (rICT & ICeV-T)

The objective of this research is to study the performance of a high quality speech compression in real-time on a single-chip system. Based on voice over Internet protocol (VoIP) requirements, we have decided to implement a high quality speech coding (with signal-to-noise ratio, SNR of more than 10 dB), at a low bit rate of 8 kbit/s or less. The coder must have delay not more than 100 ms. The development...

chapter

A New Model-Based Prosody Coder for Mandarin Speech

Chen-Yu Chiang, Yu-Ping Hung, Sin-Horng Chen, Yih-Ru Wang

2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing > 60 - 63

2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP)

In this paper, a novel parametric prosody coding approach for Mandarin speech is proposed. It employs a hierarchical prosodic model (HPM) as a prosody generating model in the encoder to analyze the speech prosody of the input utterance to obtain a parametric representation of four prosodic-acoustic features of syllable pitch contour, syllable duration, syllable energy level, and syllable-juncture...

chapter

A New Joint Source Channel Decoding Scheme Based on Speech Signal

Di Gao, Xiaoqun Zhao

2013 International Conference on Computational and Information Sciences > 1967 - 1970

2013 Fifth International Conference on Computational and Information Sciences (ICCIS)

The study proposes a joint source channel decoding scheme with the speech source residual redundancy without changing the complexity of the decoding algorithm. As the speech parameter index could be used to calculate the transition probability of the speech coding parameter index, we can get transition matrixes on different speech parameters according to the statistics of the speech signal. Using...

chapter

A genetic algorithm with look-ahead mechanism to estimate formant synthesizer input parameters

Jonathas Trindade, Fabiola Araujo, Aldebaro Klautau, Pedro Batista

2013 IEEE Congress on Evolutionary Computation > 3035 - 3042

2013 IEEE Congress on Evolutionary Computation (CEC)

There are several commercial text-to-speech (TTS) systems that generate speech signals that sound very natural. A distinct problem is utterance copy, which consists in taking speech as input (instead of text, as in TTS) and find the input parameters that would drive a speech synthesizer to generate speech that mimics the target speech with respect to contents and speaker identity. Utterance copy is...

chapter

A comparison of audio features for elementary sound based audio classification

Robert Gubka, Michal Kuba

The International Conference on Digital Technologies 2013 > 14 - 17

2013 International Conference on Digital Technologies (DT)

In this paper we compare two sets of audio features in task of audio pattern searching based on elementary sound models. The first set of features consist of well-known mel-frequency cepstral coefficients together with their first and second order time derivatives. The second set was chosen from bag of features by particle swarm optimization algorithm and consist of following audio features: line...

chapter

Crim's French speech transcription system for ETAPE 2011

Vishwa Gupta, Gilles Boulianne, Frederic Osterrath, Pierre Ouellet

2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA) > 351 - 356

2013 8th InternationalWorkshop on Systems, Signal Processing and their Applications (WoSSPA)

This paper describes the French broadcast speech transcription system by CRIM for the ETAPE 2011 evaluation. The key elements in this recognizer include over 140,000-word dictionary, 478 hours of audio for training the acoustic models, feature-space MMI and boosted MMI discriminative training of the acoustic models, variable-frame-rate decoding with trigram language model, lattice rescoring with quadgram...

1 ...
3
4
5
6
7
8
9

Keywords:
DECODING
SPEECH

Publication date

Set your own date range

INFONA - science communication portal

Search results

Cortical encoding of phonemic context during word production

Neural decoding of spoken vowels from human sensory-motor cortex with high-density electrocorticography

Enhanced Out of Vocabulary Word Detection Using Local Acoustic Information

Optimizing PLLR Features for Spoken Language Recognition

Improved mandarin spoken term detection by using deep neural network for keyword verification

Decoding of attentional selection in a cocktail party environment from single-trial EEG is robust to task

Modified Viterbi decoder for HMM based speech recognition system

Implementation and optimization of 1200bps MELPe based on ARM

Open domain continuous filipino speech recognition with code-switching

Selection of active speaker(s) in VoIP conference bridges: From linear domain to CELP parameters domain

A new secure and efficient scheme of ADPCM encoder based on chaotic encryption

Next generation of mixed excited linear prediction speech quality

Remote spoken document retrieval using foreground speech segmentation based isolated word recognizer

On the robustness of tiny decoding graphs for voice-based robotic interaction

Performance of a single-chip low bit-rate voice transcoder

A New Model-Based Prosody Coder for Mandarin Speech

A New Joint Source Channel Decoding Scheme Based on Speech Signal

A genetic algorithm with look-ahead mechanism to estimate formant synthesizer input parameters

A comparison of audio features for elementary sound based audio classification

Crim's French speech transcription system for ETAPE 2011

Filter options

Publication date

Content availability

Keywords

Data set

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options