Search results

Items from 1 to 20 out of 30 results

chapter

Domain Adaptation for CNN Based Iris Segmentation

Ehsaneddin Jalilian, Andreas Uhl, Roland Kwitt

2017 International Conference of the Biometrics Special Interest Group (BIOSIG) > 1 - 6

2017 International Conference of the Biometrics Special Interest Group (BIOSIG)

Convolutional Neural Networks (CNNs) have shown great success in solving key artificial vision challenges such as image segmentation. Training these networks, however, normally requires plenty of labeled data, while data labeling is an expensive and time-consuming task, due to the significant human effort involved. In this paper we propose two pixel-level domain adaptation methods, introducing a training...

chapter

Local training in speaker verification for PLDA

Hunny Pahuja, Priya Ranjan, Amit Ujlayan

2017 International Conference on Computing, Communication and Automation (ICCCA) > 1466 - 1469

2017 International Conference on Computing, Communication and Automation (ICCCA)

For i-vector model, normalization approach is Probabilistic linear discriminant analysis and has a significant performance for verification of speaker. However it requires a huge development data which cost a lot in many cases. Unsupervised adaption method is a possible approach, which use unlabeled data to adapt PLDA scattering matrices to the target domain. In this paper, ‘local training’ approach...

chapter

Incremental adaptation using active learning for acoustic emotion recognition

Mohammed Abdelwahab, Carlos Busso

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5160 - 5164

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The performance of speech emotion classifiers greatly degrade when the training conditions do not match the testing conditions. This problem is observed in cross-corpora evaluations, even when the corpora are similar. The lack of generalization is particularly problematic when the emotion classifiers are used in real applications. This study addresses this problem by combining active learning (AL)...

chapter

Theoretical vulnerabilities in map speaker adaptation

Tetsushi Ohki, Akira Otsuka

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2042 - 2046

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We analyze the theoretical vulnerability of maximum a posteriori(MAP) speaker adaptation, which is widely used in practical speaker recognition systems. First, we proved that there exist a set of feature vectors, what are called wolves, which can impersonate almost all the registered speakers with probability asymptotically close to 1 with at most two trials. Second, our experiment shows that the...

chapter

Exploiting sequence information for text-dependent Speaker Verification

Subhadeep Dey, Petr Motlicek, Srikanth Madikeri, Marc Ferras

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5370 - 5374

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Model-based approaches to Speaker Verification (SV), such as Joint Factor Analysis (JFA), i-vector and relevance Maximum-a-Posteriori (MAP), have shown to provide state-of-the-art performance for text-dependent systems with fixed phrases. The performance of i-vector and JFA models has been further enhanced by estimating posteriors from Deep Neural Network (DNN) instead of Gaussian Mixture Model (GMM)...

chapter

Verification based on palm vein by estimating wavelet coefficient with autoregressive model

Fereshte Yazdani, Mehran Emadi Andani

2017 2nd Conference on Swarm Intelligence and Evolutionary Computation (CSIEC) > 118 - 122

2017 2nd Conference on Swarm Intelligence and Evolutionary Computation (CSIEC)

Biometric is a pattern recognition system that automatically identifies people according to their physiologic and behavioral properties. Among the physiologic properties, hand has a special place so that all features of hand like palm lines, inner knuckles, external knuckles and geometry could be used. More recently, the usage of blood vessels pattern in the palm, in addition to the high acceptability,...

chapter

Objective measures to improve the selection of training speakers in HMM-based child speech synthesis

Avashna Govender, Febe de Wet

2016 Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech) > 1 - 6

2016 PRASA-RobMech International Conference

Building synthetic child voices is considered a difficult task due to the challenges associated with data collection. As a result, speaker adaptation in conjunction with Hidden Markov Model (HMM)-based synthesis has become prevalent in this domain because the approach caters for limited amounts of data. An initial average voice model is trained using data from multiple speakers and adapted to resemble...

article

An Automatic Subject-Adaptable Heartbeat Classifier Based on Multiview Learning

Can Ye, B.V. K. Vijaya Kumar, Miguel Tavares Coimbra

IEEE Journal of Biomedical and Health Informatics > 2016 > 20 > 6 > 1485 - 1492

In this paper, a novel subject-adaptable heartbeat classification model is presented, in order to address the significant interperson variations in ECG signals. A multiview learning approach is proposed to automate subject adaptation using a small amount of unlabeled personal data, without requiring manual labeling. The designed subject-customized models consist of two models, namely, general classification...

chapter

New approach for human detection in spherical images

Marouane Boui, Hicham Hadj-Abdelkader, Fakhr-Eddine Ababsa, El Houssine Bouyakhf

2016 IEEE International Conference on Image Processing (ICIP) > 604 - 608

2016 IEEE International Conference on Image Processing (ICIP)

Omnidirectional cameras are commonly used in computer vision and robotics. Their main advantage is their wide field of view which allows them to acquire a 360 degree view of the scene with only one sensor and a single shot. However, few studies have investigated the human detection problem using this kind of cameras. In this paper, we propose to extend the conventional approach for human detection...

chapter

Transferring deep representation for NIR-VIS heterogeneous face recognition

Xiaoxiang Liu, Lingxiao Song, Xiang Wu, Tieniu Tan

2016 International Conference on Biometrics (ICB) > 1 - 8

2016 International Conference on Biometrics (ICB)

One task of heterogeneous face recognition is to match a near infrared (NIR) face image to a visible light (VIS) image. In practice, there are often a few pairwise NIR-VIS face images but it is easy to collect lots of VIS face images. Therefore, how to use these unpaired VIS images to improve the NIR-VIS recognition accuracy is an ongoing issue. This paper presents a deep TransfeR NIR-VIS heterogeneous...

chapter

Improved acoustic modeling of low-resource languages using shared SGMM parameters of high-resource languages

Neethu Mariam Joy, Basil Abraham, Navneeth K, S. Umesh

2016 Twenty Second National Conference on Communication (NCC) > 1 - 6

2016 Twenty Second National Conference on Communication (NCC)

In this paper, we investigate methods to improve the recognition performance of low-resource languages with limited training data by borrowing subspace parameters from a high-resource language in subspace Gaussian mixture model (SGMM) framework. As a first step, only the state-specific vectors are updated using low-resource language, while retaining all the globally shared parameters from the high-resource...

chapter

A speech emotion recognition method in cross-languages corpus based on feature adaptation

Xinran Zhang, Cheng Zha, Gang Xiao, Li Zhao

2015 International Conference on Information Technology Systems and Innovation (ICITSI) > 1 - 4

2015 International Conference on Information Technology Systems and Innovation (ICITSI)

For speech emotion recognition on cross-corpus, we study the problem of speaker feature adaptation. First, we discuss the existing approaches in adaptive emotional classification from speech signals. Second, the speaker feature adaptive approach is further studied in view of additive emotion feature distortion. Finally we verified our approaches using different cross-languages corpus, including German,...

chapter

Plda-based system for text-prompted password speaker verification

Sergey Novoselov, Timur Pekhovsky, Andrey Shulipa, Oleg Kudashev

2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) > 1 - 5

2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)

Recently we have proposed a new State-GMM-supervector extractor for solving the problem of text-dependent speaker recognition. We demonstrated that segmenting the passphrase into word states for supervector extraction makes it possible to create more accurate statistical models of speech signals and to achieve reduction of EER compared to the best state-of-the-art systems of text-dependent verification...

chapter

Speech-laughs: An HMM-based approach for amused speech synthesis

Kevin El Haddad, Stephane Dupont, Jerome Urbain, Thierry Dutoit

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4939 - 4943

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper presents an HMM-based synthesis approach for speechlaughs. The building stone of this project was the idea of the co-occurrence of smile and laughter bursts in varying proportions within amused speech utterances. A corpus with three complementary speaking styles was used to train the underlying HMM models: neutral speech, speech-smile, and finally laughter in different articulatory configurations...

chapter

Supervised domain adaptation for emotion recognition from speech

Mohammed Abdelwahab, Carlos Busso

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5058 - 5062

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

One of the main barriers in the deployment of speech emotion recognition systems in real applications is the lack of generalization of the emotion classifiers. The recognition performance achieved in controlled recordings drops when the models are tested with different speakers, channels, environments and domain conditions. This paper explores supervised model adaptation, which can improve the performance...

chapter

Unsupervised speaker adaptation of DNN-HMM by selecting similar speakers for lecture transcription

Masato Mimura, Tatsuya Kawahara

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific > 1 - 4

2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Unsupervised speaker adaptation of Deep Neural Network (DNN) is investigated for lecture transcription tasks, in which a single speaker gives a long speech and thus speaker adaptation is important. The proposed method selects similar speakers to the test data (test speaker) from the training database, which are used for retraining the baseline DNN. Several speaker characteristic features are defined...

chapter

Progress in the Raytheon BBN Arabic Offline Handwriting Recognition System

Huaigu Cao, Prem Natarajan, Xujun Peng, Krishna Subramanian, more

2014 14th International Conference on Frontiers in Handwriting Recognition > 555 - 560

2014 14th International Conference on Frontiers in Handwriting Recognition (ICFHR)

This paper presents the most recent progress and state of the art result obtained from BBN's Arabic offline handwriting recognition research. Our system is based a left-to-right hidden Markov model and integrates discriminative learning methods including discriminative MPE and n-best rescoring using the scores of glyph classifiers (SVM, DNN) and the RNNLM. Arabic-related features for n-best rescoring...

chapter

Session variability in Automatic Speaker Verification

Djellali Hayet, Amirouche Radia, Djebbar Akila, Laskri Mohamed Tayeb

2014 International Conference on Multimedia Computing and Systems (ICMCS) > 185 - 190

2014 International Conference on Multimedia Computing and Systems (ICMCS)

This paper explores the use of mismatch condition in speaker variability applied to Automatic Speaker Verification (ASV) defined as a classification task to decide whether a proclaimed identity is true or not. This paper proposes to model mismatch conditions in speaker variability from session to another. It was shown that the speaker recognition accuracy deteriorates when there is an acoustic mismatch...

chapter

A UIM/ICM based approach to content-based image retrieval

Bo Li, Zhenjiang Miao, Zhen Qin, Wenju Liu

2013 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

2013 IEEE International Conference on Multimedia and Expo (ICME)

This paper presents a new similarity measure and matching scheme for content-based image retrieval (CBIR), based on modeling positive and negative hypotheses and testing a query image against these two hypotheses. The paper proposes to calculate first a universal image model (UIM), which is built based on a large set of images. The derived UIM is then used as a reference for the calculation of adapted...

chapter

Improving online signature verification by user-specific likelihood ratio score normalization

Elhocine Boutellaa, Messaoud Bengherabi, Farid Harizi

2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA) > 296 - 300

2013 8th InternationalWorkshop on Systems, Signal Processing and their Applications (WoSSPA)

Online handwritten signature is a behavioral biometric trait with several practical applications. Examples of these applications include access control to personal devices and validation of online transactions. Several research work have been done to improve the performance of online signature verification systems. This paper presents an improvement of a recently proposed online signature verification...

Keywords:
ADAPTATION MODELS
DATABASES

Publication date

Set your own date range

Publication type

book (28)
article (2)

Keywords

HIDDEN MARKOV MODELS (11)
SPEECH (10)
COMPUTATIONAL MODELING (5)
DATA MODELS (5)
FEATURE EXTRACTION (5)
BIOLOGICAL SYSTEM MODELING (4)
FACE (4)
SPEECH RECOGNITION (4)
SUPPORT VECTOR MACHINES (4)
ACOUSTICS (3)
EMOTION RECOGNITION (3)
FACE RECOGNITION (3)
HANDWRITING RECOGNITION (3)
HMM (3)
ROBUSTNESS (3)
VECTORS (3)
ACCURACY (2)
IRIS RECOGNITION (2)
SPEAKER RECOGNITION (2)
SPEAKER VERIFICATION (2)
SPEECH SYNTHESIS (2)
STATISTICS (2)
SYNTHESIS (2)
TRAINING DATA (2)
ACOUSTIC MODEL (1)
ACTIVE APPEARANCE MODEL (1)
ACTIVE LEARNING (1)
ANCHOR MODEL (1)
ARABIC (1)
ARTIFICIAL NEURAL NETWORKS (1)
AUTHENTICATION (1)
AUTOMATIC SUBJECT ADAPTATION (1)
AUTOMATION (1)
AUTOREGRESSIVE MODEL (1)
BICYCLES (1)
BIOMETRIC (1)
BIOMETRICS (1)
BIOMETRICS (ACCESS CONTROL) (1)
CAMERAS (1)
CODEBOOK SESSION (1)
COMPUTERS (1)
CONFERENCES (1)
CONTROL (1)
CROSS-CORPUS ANALYSIS (1)
CROSS-LINGUAL (1)
CROSS-LINGUAL ADAPTATION (1)
DNN POSTERIORS (1)
DYNAMIC TIME WARPING (1)
ECG HEARTBEAT CLASSIFICATION (1)
EDUCATIONAL INSTITUTIONS (1)
ELECTROCARDIOGRAPHY (1)
ESTIMATION (1)
FACE VERIFICATION (1)
FACIAL FEATURE LOCALIZATION (1)
FACIAL LANDMARKING (1)
FACTOR ANALYSIS (1)
FAST MLLR (1)
FEATURE ADAPTATION (1)
GABOR WAVELET FEATURES (1)
GAIT (1)
GAUSSIAN MIXTURE MODEL (1)
GEOMETRY (1)
HEART BEAT (1)
HIDDEN MARKOV MODEL (1)
HISTOGRAM ADAPTATION (1)
HISTOGRAMS (1)
HOG (1)
HUMAN DETECTION (1)
HUMANS (1)
HYPOTHESIS TESTING (1)
I-VECTOR (1)
IMAGE MODELS (1)
IMAGE RETRIEVAL (1)
IMAGE SEGMENTATION (1)
IMPERSONATION (1)
INDIAN LANGUAGES (1)
INTERPOLATION (1)
IRIS (1)
LAYOUT (1)
LBP (1)
LDA (1)
LINEAR DISCRIMINANT ANALYSIS (1)
LINEAR PROGRAMMING (1)
LOW-RESOURCE (1)
MANIFOLD LEARNING (1)
MANIFOLDS (1)
MAP ADAPTATION (1)
MEASUREMENT (1)
MISMATCH CONDITION (1)
MIXTURE MODELS (1)
MULTIVIEW LEARNING (1)
NEURAL NETWORKS (1)
OMNIDIRECTIONAL CAMERA (1)
ONLINE HANDWRITING RECOGNITION (1)
OPTICAL CHARACTER RECOGNITION (1)
OPTICAL CHARACTER RECOGNITION SOFTWARE (1)
PALM VEIN (1)
more

INFONA - science communication portal

Search results

Domain Adaptation for CNN Based Iris Segmentation

Local training in speaker verification for PLDA

Incremental adaptation using active learning for acoustic emotion recognition

Theoretical vulnerabilities in map speaker adaptation

Exploiting sequence information for text-dependent Speaker Verification

Verification based on palm vein by estimating wavelet coefficient with autoregressive model

Objective measures to improve the selection of training speakers in HMM-based child speech synthesis

An Automatic Subject-Adaptable Heartbeat Classifier Based on Multiview Learning

New approach for human detection in spherical images

Transferring deep representation for NIR-VIS heterogeneous face recognition

Improved acoustic modeling of low-resource languages using shared SGMM parameters of high-resource languages

A speech emotion recognition method in cross-languages corpus based on feature adaptation

Plda-based system for text-prompted password speaker verification

Speech-laughs: An HMM-based approach for amused speech synthesis

Supervised domain adaptation for emotion recognition from speech

Unsupervised speaker adaptation of DNN-HMM by selecting similar speakers for lecture transcription

Progress in the Raytheon BBN Arabic Offline Handwriting Recognition System

Session variability in Automatic Speaker Verification

A UIM/ICM based approach to content-based image retrieval

Improving online signature verification by user-specific likelihood ratio score normalization

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options