Search results

Items from 1 to 20 out of 3,806 results

chapter

Parallel processing capabilities in the process of speech recognition

Rakhimov Mekhriddin Fazliddinovich, Berdanov Ulug'bek Abdumurodovich

2017 International Conference on Information Science and Communications Technologies (ICISCT) > 1 - 3

Speech recognition is one of the important processes in information technology. Speech recognition plays a key role in many systems such as voice control, IP telephony, personal identification, recognition of individual words and phrases, accepting applications for reference services, and search systems. Many research companies in this area are developing and improving methods, algorithms...

chapter

Animated texts application in visualizing speech features for Foreign language learning

Nur Syafikah Binti Samsudin, Kazunori Mano

TENCON 2017 - 2017 IEEE Region 10 Conference > 1778 - 1783

Pronunciation training aids using media tools such as mobile apps and online web-based systems are widely used nowadays. These tools often provide audio-based samples and phonetic-style texts that help learners train their pronunciation without language teachers. However, learners still have difficulty in the learning process, because they find it hard to detect and...

chapter

Applications of deep learning in supervised speech separation

Shuangran Bai, Yungang Liu, Ting Zhang, Fengzhong Li

2017 Chinese Automation Congress (CAC) > 6539 - 6544

Recently, deep learning has been proposed and verified to possess a strong ability to learn and express complex features, which has brought significant research achievements in signal processing. As a challenging task in speech signal processing, monaural speech separation has long been a focus of research. From the usage of traditional signal processing methods and shallow models...

chapter

Improving the efficiency of voice control robots based on adaptive procedures

Zhadyra T. Zhumasheva, Aidana S. Kyzdarbekova, Maulen T. Abdulkhairov

2017 IEEE II International Conference on Control in Technical Systems (CTS) > 345 - 348

The paper considers the task of improving the efficiency of voice control of robots on the basis of adaptive procedures. This problem is considered in the context of noise resistance processing of speech signals for voice recognition subsystems. A solution to this problem is found in classes of adaptive algorithms for filtering voice signals based on sequential filtering. This model allows to improve...

chapter

Methods of pathology detection by speech analysis: Survey

Mustafa Berkay Yilmaz, Mounim A. El Yacoubi

2017 International Conference on Computer Science and Engineering (UBMK) > 28 - 33

Speech analysis can be used for healthcare tasks such as pathology detection. Conventionally, a speech-language pathologist is specialized to detect anomalies from speech. Speech disorders result from a variety of causes such as brain injury, stroke, hearing loss, developmental delay or emotion alteration. Content of the speech is often not of interest for pathology detection, but characteristics...

chapter

Semantic parser for easy understandable speech broadcasting

Yosuke Kobayashi, Kosuke Ishikawa, Kengo Ohta, Jay Kishigami

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE) > 1 - 2

In this paper, we propose a public-address system for broadcasting speech that is easy to understand. The system converts broadcast speech into text using a speech recognizer, and converts the text into simple text using a semantic parser. Then, having obtained text with a simple meaning, the system broadcasts it using a speech synthesizer. The proposed semantic parser can use dependency analysis to appropriately...

chapter

Unsupervised speaker segmentation framework based on sparse correlation feature

Yi Xin Sun, Yong Ma, Kai Bo Shi, Jiang Ping Hu, et al.

2017 Chinese Automation Congress (CAC) > 3058 - 3063

With the increasing stress in working and studying, mental health becomes a major problem in the current social research. Generally, researchers can analyze psychological health states by using social perception behavior. The speech signal is an important research direction in this domain. It objectively assesses the mental health of social groups through the extraction and fusion of speech features...

chapter

Enhancing speech rate estimation techniques to improve dysarthria diagnosis

James Nathaniel Carmichael

2017 8th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON) > 309 - 313

This report discusses the implementation of a computerized algorithm specifically designed to measure the syllables-per-minute rate of abnormal speech typically produced by persons suffering from an articulatory disorder known as dysarthria. This speech rate measurement application — which can also serve as a diagnostic tool in itself — has been integrated into the computerised Frenchay Dysarthria...

chapter

Noise power spectral density estimation for binaural noise reduction exploiting direction of arrival estimates

Daniel Marquardt, Simon Doclo

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 234 - 238

Noise reduction algorithms for head-mounted assistive listening devices are crucial to improve speech quality and intelligibility in background noise. For binaural hearing devices with one microphone per device, the noise power spectral density (PSD) is commonly estimated using various assumptions about the acoustic scenario. Since these methods lack robustness if the underlying assumptions are not...

chapter

Comparing statistical classifiers for emotion classification

Raseeda Hamzah, Nursuriati Jamil, Khyrina Airin Fariza Abu Samah, Nur Nabilah Abu Mangshor, et al.

2017 7th IEEE International Conference on System Engineering and Technology (ICSET) > 183 - 188

Speech emotion recognition has been widely used in human-computer interaction and applications. This paper classifies emotion into two classes: happy and angry. All speech signals, taken from a Malay spoken speech database, are preprocessed. Emotional information is obtained by applying two well-established acoustic features: Mel Frequency Cepstral Coefficients (MFCC) and Short Time Energy (STE)...

chapter

Comparison between random and daily speech database in the speech visualization

Nur Syafikah Binti Samsudin, Kazunori Mano

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 3135 - 3140

This paper presents a new technique that uses animated texts as the visualization medium for speech features, for checking and detecting language learners' pronunciation. The proposed visualization tool transforms learners' speech features such as pitch, tempo or rhythm into animated text form, and the mispronounced parts can be located by comparing them with the correct sample. In our previous experiments,...

chapter

Speech compressive sensing with ℓ₁-minimization and iteratively reweighted least squares-ℓₚ-minimization: A comparative study

Wafa Derouaz, Thouraya Merazi-Meksen

2017 5th International Conference on Electrical Engineering - Boumerdes (ICEE-B) > 1 - 4

Interest in Compressed Sensing (CS) comes from its ability to provide sampling, compression, enhancement, and encryption of the source information simultaneously. These advantages have led CS to be researched and applied in numerous speech-processing applications. In this paper, we compare ℓ₁-minimization and Iteratively Reweighted Least Squares (IRLS)-ℓₚ-minimization algorithms...

chapter

PitchKeywordExtractor: Prosody-based automatic keyword extraction for speech content

Iurii Lezhenin, Artyom Zhuikov, Natalia Bogach, Elena Boitsova, et al.

2017 Federated Conference on Computer Science and Information Systems (FedCSIS) > 265 - 269

Keyword extraction is widely used for information indexing, compressing, summarizing, etc. Existing keyword extraction techniques apply various text-based algorithms and metrics to locate the keywords. At the same time, some types of audio and audiovisual content, e.g. lectures, talks, interviews and other speech-oriented information, allow keyword search by prosodic accents made by a...

chapter

Implementation of accent recognition methods subsystem for eLearning systems

Eugen Tverdokhleb, Hennadii Dobrovolskyi, Nataliya Keberle, Natalia Myronova

2017 9th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS) > 2 > 1037 - 1041

The results of the implementation of an external accent recognition system and its integration into the massive open online course platform Moodle are reported. Accent recognition is becoming important in foreign language learning as a way to give a student feedback on the presence of a certain unwanted accent in foreign language pronunciation. Implementation of several accent recognition methods and their...

chapter

Single-channel speech separation based on deep clustering with local optimization

Taotao Fu, Ge Yu, Lili Guo, Yan Wang, et al.

2017 3rd International Conference on Frontiers of Signal Processing (ICFSP) > 44 - 49

There are many challenges in single-channel multi-person mixed speech separation, such as modeling the temporal continuity of the speech signals and improving the frame separation performance simultaneously. In this paper, a separation method based on Deep Clustering with local optimization by the improved Non-Negative Matrix Factorization (NMF) combined with Factorial Conditional Random Fields (FCRF)...

chapter

Speech recognition and classification using the compressive sensing method

Sombat Buakhlai, Sakol Udomsiri

2017 3rd International Conference on Frontiers of Signal Processing (ICFSP) > 8 - 14

The research aims to present speech signal recognition and classification using the compressive sensing method to reduce the data from speech signal cross-correlation and to compare the similarities and differences between the reference speech signal (the researcher's signal) and comparative signals. The speech signal reconstruction method is based on the solution to the underdetermined linear inverse...

chapter

Classification of Parkinson speech data by metric learning

Mahmut Kaya, Hasan Sakir Bilge

2017 International Artificial Intelligence and Data Processing Symposium (IDAP) > 1 - 5

Metric learning, one of the main topics of machine learning, is used to bring similar data closer together and to increase the distance between unrelated data in an existing space. In seeking the best solution to today's problems, choosing a good metric has a positive impact on performance. Metric learning makes use of a transformation matrix. When we examine the studies in...

chapter

Development of speech corpora for Goalparia dialect and similar languages

Tanvira Ismail, L. Joyprakash Singh

2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA) > 170 - 173

An accurate dialect identification technique helps improve the speech recognition systems found in most present-day electronic devices and is also expected to help provide new services in e-health and telemedicine, which is especially important for older and homebound people. The accuracy of a dialect identification system is highly dependent on its speech corpora. Therefore,...

chapter

Research of the speech signal reconstruction at empirical mode decomposition

A. L. Priorov, P. O. Pavlovichev, V. V. Khryashchev

2017 IEEE East-West Design & Test Symposium (EWDTS) > 1 - 7

We discuss a method of signal analysis, empirical mode decomposition, and its modification, complementary ensemble empirical mode decomposition. Both methods are used to study the reconstruction of a speech signal by means of the intrinsic mode functions obtained during the decomposition. Experiments were performed using two English databases of speech signals which contain speech...

chapter

Grapheme-to-phoneme conversion based on high-order Markov chain for spoken term detection by text query

Dmitriy Prozorov, Alexandra Tatarinova

2017 IEEE East-West Design & Test Symposium (EWDTS) > 1 - 5

Spoken term detection (STD) is a fundamental part of some speech processing applications. One STD method uses a phoneme representation of words from spoken content and a text query. The paper presents a new grapheme-to-phoneme conversion method based on a high-order Markov chain. The method is applied to the retrieval of spoken documents in Russian. The aim of this research is to evaluate the effectiveness...

Publication type:
book

Content availability

Available (3,690)
None (116)

Keywords

SPEECH PROCESSING (3,806)
SPEECH (2,236)
SPEECH RECOGNITION (1,071)
FEATURE EXTRACTION (660)
HIDDEN MARKOV MODELS (488)
ACOUSTICS (378)
NOISE (364)
TRAINING (364)
DATA MINING (327)
NATURAL LANGUAGE PROCESSING (309)
DATABASES (276)
SIGNAL TO NOISE RATIO (252)
ACCURACY (233)
ESTIMATION (227)
SIGNAL PROCESSING (220)
SPEAKER RECOGNITION (218)
SPEECH SYNTHESIS (209)
CORRELATION (202)
MEL FREQUENCY CEPSTRAL COEFFICIENT (194)
NOISE MEASUREMENT (192)
MICROPHONES (190)
SIGNAL PROCESSING ALGORITHMS (173)
SPEECH SIGNAL (172)
SPEECH ENHANCEMENT (168)
AUDIO SIGNAL PROCESSING (163)
SPEECH CODING (157)
SUPPORT VECTOR MACHINES (152)
ARTIFICIAL NEURAL NETWORKS (149)
ACOUSTIC SIGNAL PROCESSING (144)
IMAGE PROCESSING (144)
ALGORITHM DESIGN AND ANALYSIS (141)
EMOTION RECOGNITION (140)
MATHEMATICAL MODEL (138)
ROBUSTNESS (138)
GAUSSIAN PROCESSES (132)
NATURAL LANGUAGES (131)
BLIND SOURCE SEPARATION (129)
COMPUTATIONAL MODELING (124)
WAVELET TRANSFORMS (123)
CEPSTRAL ANALYSIS (116)
FILTERING THEORY (116)
SPEECH ANALYSIS (115)
TRANSFORMS (108)
LEARNING (ARTIFICIAL INTELLIGENCE) (106)
NEURAL NETS (104)
STATISTICAL ANALYSIS (102)
SIGNAL CLASSIFICATION (101)
AUDITORY SYSTEM (100)
TIME FREQUENCY ANALYSIS (100)
NOISE REDUCTION (99)
SOURCE SEPARATION (98)
CONFERENCES (97)
EQUATIONS (97)
AUTOMATIC SPEECH RECOGNITION (96)
SPECTRAL ANALYSIS (94)
SPEECH SIGNALS (94)
PATTERN CLASSIFICATION (93)
HUMANS (89)
ADAPTATION MODEL (88)
GAUSSIAN MIXTURE MODEL (88)
HIDDEN MARKOV MODEL (88)
FREQUENCY ESTIMATION (87)
HARMONIC ANALYSIS (86)
TIME-FREQUENCY ANALYSIS (86)
MEDICAL SIGNAL PROCESSING (85)
CLASSIFICATION ALGORITHMS (84)
COMPUTERS (84)
MUSIC (83)
INDEXES (82)
TESTING (80)
REVERBERATION (79)
PROBABILITY (78)
SIGNAL DENOISING (77)
DATA MODELS (76)
VECTORS (76)
DECODING (75)
ADAPTIVE FILTERS (74)
EDUCATIONAL INSTITUTIONS (74)
PROBABILITY DENSITY FUNCTION (74)
FREQUENCY DOMAIN ANALYSIS (72)
SPEECH INTELLIGIBILITY (72)
MAXIMUM LIKELIHOOD ESTIMATION (70)
DELAY (69)
INTERACTIVE SYSTEMS (69)
INFORMATION RETRIEVAL (68)
NEURAL NETWORKS (68)
SPECTROGRAM (67)
VOICE ACTIVITY DETECTION (67)
FILTERING (65)
VISUALIZATION (65)
CORRELATION METHODS (64)
PATTERN RECOGNITION (64)
SIGNAL DETECTION (64)
SPEECH SIGNAL PROCESSING (63)
INDEPENDENT COMPONENT ANALYSIS (62)
ANALYTICAL MODELS (61)
COMPLEXITY THEORY (61)
ENTROPY (61)
HMM (61)
VOCABULARY (61)

Data set

ieee (3,786)
Springer (19)
Wiley (1)

INFONA - science communication portal
