Search results

Items from 1 to 4 out of 4 results

article

Text-Independent Speaker Identification Using the Histogram Transform Model

Zhanyu Ma, Hong Yu, Zheng-Hua Tan, Jun Guo

IEEE Access > 2016 > 4 > 9733 - 9739

In this paper, we propose a novel probabilistic method for the task of text-independent speaker identification (SI). To capture the dynamic information during SI, we design super-mel-frequency cepstral coefficient (super-MFCC) features by cascading three neighboring MFCC frames together. These super-MFCC vectors are utilized for probabilistic model training such that the speaker’s characteristics...
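
The frame-cascading step mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `stack_neighbor_frames`, the concatenation order (previous, current, next), and the choice to drop the first and last frames are all assumptions made for illustration, and the toy two-dimensional frames stand in for real MFCC vectors.

```python
from typing import List, Sequence


def stack_neighbor_frames(frames: Sequence[Sequence[float]]) -> List[List[float]]:
    """Concatenate each frame with its previous and next neighbor.

    Assumption: edge frames without two neighbors are dropped, so an
    input of T frames yields max(T - 2, 0) stacked vectors.
    """
    stacked = []
    for t in range(1, len(frames) - 1):
        # Assumed order: previous frame, current frame, next frame.
        stacked.append(list(frames[t - 1]) + list(frames[t]) + list(frames[t + 1]))
    return stacked


# Toy 2-dimensional frames standing in for MFCC vectors (hypothetical data).
toy_frames = [[0.0, 0.1], [1.0, 1.1], [2.0, 2.1], [3.0, 3.1]]
super_frames = stack_neighbor_frames(toy_frames)
```

With four input frames of dimension 2, this sketch produces two stacked vectors of dimension 6. Other boundary treatments, such as padding by repeating the edge frames, would be equally plausible.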

chapter

Regression-Based Multi-view Facial Expression Recognition

Ognjen Rudovic, Ioannis Patras, Maja Pantic

2010 20th International Conference on Pattern Recognition (ICPR 2010) > 4121 - 4124

We present a regression-based scheme for multi-view facial expression recognition based on 2D geometric features. We address the problem by mapping facial points (e.g., mouth corners) from non-frontal to frontal view, where further recognition of the expressions can be performed using a state-of-the-art facial expression recognition method. To learn the mapping functions, we investigate four regression...

chapter

Anomaly detection for sea surveillance

R. Laxhammar

2008 11th International Conference on Information Fusion (FUSION 2008) > 1 - 8

In this paper, unsupervised clustering of normal vessel traffic patterns is proposed and implemented, where patterns are represented as the momentary location, speed and course of tracked vessels. The learnt cluster models are used for anomaly detection in sea traffic. The Gaussian Mixture Model is used as the cluster model and a greedy version of the Expectation-Maximization algorithm is used as clustering...

article

Conversion Function Clustering and Selection Using Linguistic and Spectral Information for Emotional Voice Conversion

Chi-Chun Hsia, Chung-Hsien Wu, Jian-Qi Wu

IEEE Transactions on Computers > 2007 > 56 > 9 > 1245 - 1254

In emotional speech synthesis, a large speech database is required for high-quality speech output. Voice conversion needs only a compact-sized speech database for each emotion. This study designs and accumulates a set of phonetically balanced small-sized emotional parallel speech databases to construct conversion functions. The Gaussian mixture bigram model (GMBM) is adopted as the conversion function...

