Szukanie zaawansowane

Szukanie zaawansowane w ludziach

Od:

Do:

Pozycje od 121 do 140 spośród 2 601 wyników

Poprzednia

1 ...
4
5
6
7
8
9
10

Następna

rozdział

Robust speaker verification with a two classifier format and feature enhancement

Joshua S. Edwards, Ravi P. Ramachandran, Umashanger Thayasivam

2017 IEEE International Symposium on Circuits and Systems (ISCAS) > 1 - 4

2017 IEEE International Symposium on Circuits and Systems (ISCAS)

In the presence of environmental noise, speaker verification systems inevitably see a decrease in performance. This paper proposes the (1) use of two parallel classifiers, (2) feature enhancement based on blind signal-to-noise ratio (SNR) estimation and (3) fusion, to improve the performance of speaker verification systems. The two classifiers are based on Gaussian mixture models and the partial least-squares...

rozdział

Speaker verification with mostly voiced speech for GMM/UBM and GMM/IBM systems

Vadim Ditlovich, Yuval Bistritz

2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON) > 1175 - 1180

2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON)

The paper proposes the use of just mostly voiced speech (MVS) for speaker verification (SV). The speech is partitioned into an MVS part and a non-MVS part by a simple machine classification. SV experiments were held with a standard Gaussian mixture model (GMM) with universal background model (UBM) system and a GMM with computationally improved individual background model (IBM) system. They demonstrate...

rozdział

Single channel blind source separation based on NMF and its application to speech enhancement

Yongqiang Chen

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN) > 1066 - 1069

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN)

In this paper, an improved nonnegative matrix factorization (NMF) algorithm is proposed for single channel blind source separation and applied to speech enhancement. By adding time correlation item to objective function to constrain the time-varying gain coefficients of noise, it can achieve better effect of speech enhancement. We propose an efficient algorithm to optimize objective function with...

rozdział

Research and application of combined kernel SVM in dynamic voiceprint password authentication system

Sen Zhu, Chengji Xu, Jinming Wang, Yingcai Xiao, więcej

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN) > 1052 - 1055

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN)

The kernel function plays an important role in the classification of support vector machines (SVM). In order to solve the problem that a single SVM kernel function can not achieve optimal learning ability and generalization ability in recognition classification at the same time, here we present a new combined kernel function by analyzing and comparing the characteristics of various kernel functions...

rozdział

Local training in speaker verification for PLDA

Hunny Pahuja, Priya Ranjan, Amit Ujlayan

2017 International Conference on Computing, Communication and Automation (ICCCA) > 1466 - 1469

2017 International Conference on Computing, Communication and Automation (ICCCA)

For i-vector model, normalization approach is Probabilistic linear discriminant analysis and has a significant performance for verification of speaker. However it requires a huge development data which cost a lot in many cases. Unsupervised adaption method is a possible approach, which use unlabeled data to adapt PLDA scattering matrices to the target domain. In this paper, ‘local training’ approach...

rozdział

Keynote speech 1: Progress in THz technology enabled by photonics

Cyril Renaud

2017 10th Global Symposium on Millimeter-Waves > 1

2017 10th Global Symposium on Millimeter-Waves (GSMM)

As THz and millimetre wave technologies are further developing for a range of applications, photonics is one of the key technology for its development. We will discuss the different recent advances in photonic technologies for THz and millimetre wave. In particular we will look at integration technologies and their potential for reduced foot print and lower power consumption. We will although look...

rozdział

Long short-term memory based on a reward/punishment strategy for recurrent neural networks

Jiangjiang Liu, Biao Luo, Pengfei Yan, Ding Wang, więcej

2017 32nd Youth Academic Annual Conference of Chinese Association of Automation (YAC) > 327 - 332

2017 32nd Youth Academic Annual Conference of Chinese Association of Automation (YAC)

Recurrent neural networks and their variants have received huge success in many difficult tasks, such as handwriting recognition and generation, natural language processing, acoustic modeling of speech, and so on. As a kind of recurrent neural network architectures, the long short-term memory (LSTM) has attracted great attention. Most research works focus on its structures, training algorithms and...

rozdział

Segmentation the speech of hard of hearing children

Laszlo Czap, Judit Maria Pinter, Attila K. Varga

2017 18th International Carpathian Control Conference (ICCC) > 446 - 450

2017 18th International Carpathian Control Conference (ICCC)

One service provided by our application ‘Speech Assistant System’ assisting the teaching of the hearing impaired to speak is the automatic assessment of words and sentences in the course of practice and feedback to the person. Individual speech sounds can only be correctly evaluated if they are compared with the appropriate reference speech sounds. This requires segmenting the speech to be examined...

rozdział

A Novel Concept of the Rehabilitation Training Coach Robot for Patients with Disability

Seung-Ho Han, Han-Gyu Kim, Ho-Jin Choi

2017 18th IEEE International Conference on Mobile Data Management (MDM) > 376 - 381

2017 18th IEEE International Conference on Mobile Data Management (MDM)

This paper proposes the rehabilitation treatment coach robot which will help at-home patients do their rehabilitation exercises at home without any professional trainers. The coach robot is designed to be cheap enough for patients to afford it. The robot suggests the rehabilitation program and corrects the posture of the patients during the exercise. The deep neural network is used for posture correction...

rozdział

A study of support vector machines for emotional speech recognition

Nattapong Kurpukdee, Sawit Kasuriya, Vataya Chunwijitra, Chai Wutiwiwatchai, więcej

2017 8th International Conference of Information and Communication Technology for Embedded Systems (IC-ICTES) > 1 - 6

2017 8th International Conference of Information and Communication Technology for Embedded Systems (IC-ICTES)

In this paper, efficiency comparison of Support Vector Machines (SVM) and Binary Support Vector Machines (BSVM) techniques in utterance-based emotion recognition is studied. Acoustic features including energy, Mel-frequency cepstral coefficients (MFCC), Perceptual linear predictive (PLP), Filter bank (FBANK), pitch, their first and second derivatives are used as frame-based features. Four basic emotions...

rozdział

Spoken language clustering in the i-vectors space

Stanislaw Kacprzak

2017 International Conference on Systems, Signals and Image Processing (IWSSIP) > 1 - 5

2017 International Conference on Systems, Signals and Image Processing (IWSSIP)

This paper presents the results of language clustering in the i-vectors space, a method to determine in an unsupervised manner how many languages are in a data set and which recordings contain the same language. The most dense i-vectors clusters are found using the DBSCAN algorithm in a low dimensional space obtained by the t-SNE method. Quality of clustering for spherical k-means and the proposed...

rozdział

Towards intoxicated speech recognition

Zixing Zhang, Felix Weninger, Martin Wollmer, Jing Han, więcej

2017 International Joint Conference on Neural Networks (IJCNN) > 1555 - 1559

2017 International Joint Conference on Neural Networks (IJCNN)

In a real-life scenario, the acoustic characteristics of speech often suffer from the variations induced by diverse environmental noises and different speakers. To overcome the speaker-related speech variation problem for Automatic Speech Recognition (ASR), many speaker adaptation techniques have been proposed and studied. Almost all of these studies, however, only considered the speakers' long-term...

rozdział

Biomorphic modeling of phoneme identification and classification based on an evolving fuzzy-neural network: From hardcomputing to softcomputing

Mario Malcangi, Hao Quan, Philip Grew

2017 International Joint Conference on Neural Networks (IJCNN) > 3092 - 3097

2017 International Joint Conference on Neural Networks (IJCNN)

Speech is dynamic in nature and organized in a complex time-and-frequency structure that makes it very hard to solve the issue of automatic speech recognition (ASR) for diverse speaker conditions. The hardcomputing approach to solving this issue (i.e conventional computing based on precisely-stated, analytical, mathematics-inspired models) pushed processing limits because it is highly computationally...

rozdział

On the use of deep recurrent neural networks for detecting audio spoofing attacks

Simone Scardapane, Lucas Stoffl, Florian Rohrbein, Aurelio Uncini

2017 International Joint Conference on Neural Networks (IJCNN) > 3483 - 3490

2017 International Joint Conference on Neural Networks (IJCNN)

Biometric security systems based on predefined speech sentences are extremely common nowadays, particularly in low-cost applications where the simplicity of the hardware involved is a great advantage. Audio spoofing verification is the problem of detecting whether a speech segment acquired from such a system is genuine, or whether it was synthesized or modified by a computer in order to make it sound...

rozdział

Improved speaker recognition system for stressed speech using deep neural networks

Sri Harsha Dumpala, Sunil Kumar Kopparapu

2017 International Joint Conference on Neural Networks (IJCNN) > 1257 - 1264

2017 International Joint Conference on Neural Networks (IJCNN)

Good speaker recognition systems should identify the speaker irrespective of what is spoken, including non-speech sounds that are often produced during natural conversations. In this work, the inclusion of breath sounds in the training phase of the speaker recognition is analyzed using the popular Gaussian mixture model-universal background model (GMM-UBM) and deep neural network (DNN) based systems...

rozdział

Speech-based emotion recognition and next reaction prediction

Fatemeh Noroozi, Neda Akrami, Gholamreza Anbarjafari

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

2017 25th Signal Processing and Communications Applications Conference (SIU)

Communication through voice is one of the main components of affective computing in human-computer interaction. In this type of interaction, properly comprehending the meanings of the words or the linguistic category and recognizing the emotion included in the speech is essential for enhancing the performance. In order to model the emotional state, the speech waves are utilized, which bear signals...

rozdział

Single channel speech enhancement using convolutional neural network

Tomas Kounovsky, Jiri Malek

2017 IEEE International Workshop of Electronics, Control, Measurement, Signals and their Application to Mechatronics (ECMSM) > 1 - 5

2017 IEEE International Workshop of Electronics, Control, Measurement, Signals and their Application to Mechatronics (ECMSM)

Neural networks can be used to identify and remove noise from noisy speech spectrum (denoisisng autoencoders, DAEs). The DAEs are typically implemented using the fully-connected feed-forward topology. Usually one of the following possibilities is used as DA target: 1) Ideal frequency ratio mask, which is applied to noisy spectrum to estimate the clean speech spectrum (masking) or 2) Clean speech spectrum...

rozdział

An exploratory analysis targeting diagnostic classification of AAC app usage patterns

Adham Atyabi, Beibin Li, Yeojin Amy Ahn, Minah Kim, więcej

2017 International Joint Conference on Neural Networks (IJCNN) > 1633 - 1640

2017 International Joint Conference on Neural Networks (IJCNN)

Augmentative and Alternative Communication (AAC) apps are apps that enable non-speech communicative forms. One class of AAC apps are speech-generating devices (SGDs), where icons/pictures are tapped to produce spoken words. These apps are widely used to support communication and language learning for individuals with disabilities such as autism spectrum disorder (ASD). Given that these apps are used...

rozdział

Joint optimization of modified ideal radio mask and deep neural networks for monaural speech enhancement

Wei Han, Congming Wu, Xiongwei Zhang, Qiye Zhang, więcej

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN) > 1070 - 1074

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN)

Monaural speech enhancement is a key yet challenging problem in speech area, which is always used as a pre-processing step of robust speech processing. Deep learning has proved to be very successful for solving this issue. In this paper, a new approach for enhancing the noisy speech in a single channel recording is presented. We propose a modified ideal ratio mask (IRM) which calculated by normalized...

artykuł

Deep Learning Based Binaural Speech Separation in Reverberant Environments

Xueliang Zhang, DeLiang Wang

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2017 > 25 > 5 > 1075 - 1084

Speech signal is usually degraded by room reverberation and additive noises in real environments. This paper focuses on separating target speech signal in reverberant conditions from binaural inputs. Binaural separation is formulated as a supervised learning problem, and we employ deep learning to map from both spatial and spectral features to a training target. With binaural inputs, we first apply...

Poprzednia

1 ...
4
5
6
7
8
9
10

Następna

Opcje filtrowania

Słowa kluczowe:
TRAINING
SPEECH

Data publikacji

Ustaw własny zakres dat

Dostępność treści

Dostępna (2,591)
Brak (10)

Typ publikacji

książka (2,284)
artykuł (317)

Słowa kluczowe

SPEECH RECOGNITION (1,174)
HIDDEN MARKOV MODELS (1,022)
FEATURE EXTRACTION (722)
ACOUSTICS (566)
SPEECH PROCESSING (411)
SPEAKER RECOGNITION (314)
DATABASES (313)
MEL FREQUENCY CEPSTRAL COEFFICIENT (269)
SUPPORT VECTOR MACHINES (264)
ACCURACY (248)
DATA MODELS (219)
SPEECH SYNTHESIS (192)
TRAINING DATA (190)
NEURAL NETWORKS (185)
COMPUTATIONAL MODELING (182)
TESTING (182)
ARTIFICIAL NEURAL NETWORKS (177)
VECTORS (165)
NOISE MEASUREMENT (162)
NATURAL LANGUAGE PROCESSING (159)
NOISE (156)
ADAPTATION MODELS (153)
DATA MINING (146)
AUTOMATIC SPEECH RECOGNITION (131)
EMOTION RECOGNITION (128)
SIGNAL TO NOISE RATIO (127)
MATHEMATICAL MODEL (114)
SPEECH ENHANCEMENT (114)
ADAPTATION MODEL (113)
HIDDEN MARKOV MODEL (113)
GAUSSIAN PROCESSES (107)
KERNEL (101)
CONTEXT (100)
DECODING (97)
ROBUSTNESS (96)
ESTIMATION (95)
CLASSIFICATION ALGORITHMS (93)
GAUSSIAN MIXTURE MODEL (87)
LEARNING (ARTIFICIAL INTELLIGENCE) (86)
NIST (84)
SPEAKER VERIFICATION (82)
HMM (81)
DICTIONARIES (79)
VOCABULARY (79)
CEPSTRAL ANALYSIS (73)
MAXIMUM LIKELIHOOD ESTIMATION (72)
MACHINE LEARNING (71)
MFCC (71)
DEEP NEURAL NETWORKS (70)
CORRELATION (69)
SPEECH CODING (69)
MICROPHONES (66)
ERROR ANALYSIS (65)
STATISTICAL ANALYSIS (65)
SPEAKER IDENTIFICATION (64)
TRANSFORMS (64)
PATTERN CLASSIFICATION (63)
OPTIMIZATION (62)
SPECTROGRAM (61)
VISUALIZATION (60)
ALGORITHM DESIGN AND ANALYSIS (59)
NEURAL NETS (59)
DEEP NEURAL NETWORK (58)
SUPPORT VECTOR MACHINE (58)
VOICE CONVERSION (58)
STANDARDS (55)
CLUSTERING ALGORITHMS (52)
CONTEXT MODELING (52)
TEXT ANALYSIS (50)
GMM (48)
HUMANS (48)
NATURAL LANGUAGES (48)
RECURRENT NEURAL NETWORKS (48)
DISCRIMINATIVE TRAINING (47)
SVM (47)
VECTOR QUANTIZATION (47)
NEURONS (46)
PREDICTIVE MODELS (46)
PROBABILITY (45)
ACOUSTIC SIGNAL PROCESSING (43)
CONFERENCES (43)
SIGNAL PROCESSING ALGORITHMS (43)
JOINTS (42)
EDUCATIONAL INSTITUTIONS (41)
PRINCIPAL COMPONENT ANALYSIS (41)
REVERBERATION (41)
SIGNAL CLASSIFICATION (40)
SIGNAL PROCESSING (40)
ENTROPY (39)
NEURAL NETWORK (39)
SUPPORT VECTOR MACHINE CLASSIFICATION (39)
TRAJECTORY (39)
DEEP LEARNING (37)
I-VECTOR (37)
LATTICES (37)
AUDITORY SYSTEM (36)
DECISION TREES (36)
DETECTORS (36)
więcej

Zbiór danych

ieee (2,594)
Elsevier (4)
CEJSH (1)
Springer (1)
Wiley (1)

INFONA - portal komunikacji naukowej

Szukanie zaawansowane