Search results for: Bo Xu

Items from 1 to 5 out of 5 results

chapter

Graph-based multi-modal scene detection for movie and teleplay

Su Xu, Bailan Feng, Peng Ding, Bo Xu

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1413 - 1416

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

Automatic scene detection is a fundamental step for efficient video searching and browsing. This paper presents our current work on scene detection that integrates three effective strategies into a single framework. For each video, firstly, a coherence signal is constructed by graph modal obtained from the similarity matrix in a temporal interval. Secondly, the signal is optimized by scene transition...

chapter

Multi-modal information fusion for news story segmentation in broadcast video

Bailan Feng, Peng Ding, Jiansong Chen, Jinfeng Bai, more

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1417 - 1420

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

With the fast development of high-speed network and digital video recording technologies, broadcast video has been playing a more and more important role in our daily life. In this paper, we propose a novel news story segmentation scheme which can segment broadcast video into story units with multi-modal information fusion (MMIF) strategy. Compared with traditional methods, the proposed scheme extracts...

chapter

A preliminary exploration on tone error detection in Mandarin based on clustering

Taotao Zhu, Dengfeng Ke, Zhenbiao Chen, Bo Xu

2010 4th International Universal Communication Symposium > 48 - 51

2010 4th International Universal Communication Symposium (IUCS 2010)

This paper addresses the ongoing issue of tone error detection for Mandarin Computer Assisted Language Learning (CALL) systems. A novel approach based on clustering is proposed. The selection of different contextual tonal factors including Uni-tone, LBi-tone and RBi-tone are explored. Experimental results show that our proposed approach is feasible, obtaining an Equal Error Rate (EER) of 18.75% by...

chapter

An efficient mispronunciation detection method using GLDS-SVM and formant enhanced features

HongYan Li, JiaEn Liang, ShiJin Wang, Bo Xu

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4845 - 4848

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

Mispronunciation detection is an important component in computer assisted language learning (CALL) system. In this work, we introduce an efficient GLDS-SVM based detection method, which is successfully used in language and speaker identification systems, and combine it with traditional methods. The main ideas include: extended MFCC features with normalized formant trajectory information, and then...

chapter

Automatic Pronunciation Evaluation Based on Feature Extraction and Combination

Shuang Xu, Dengfeng Ke, Jie Jiang, Xi Yang, more

2008 3rd International Conference on Innovative Computing Information and Control > 454

2008 3rd International Conference on Innovative Computing Information and Control (ICICIC)

This paper presents an effective method for automatic pronunciation evaluation, which is based on feature extraction and combination. The proposed system extracts different kinds of evaluation features and combines them to produce an ultimate machine score, which predicts the overall pronunciation quality of a student. Experiments on a reading speech database show that most of the selected features...

Filter options

Keywords:
FEATURE EXTRACTION
HIDDEN MARKOV MODELS

Keywords

SPEECH (3)
CALL (2)
COMPUTER AIDED INSTRUCTION (2)
SUPPORT VECTOR MACHINES (2)
TESTING (2)
TRAINING (2)
VISUALIZATION (2)
ACOUSTICS (1)
ADAPTATION MODEL (1)
ANCHOR PERSON DETECTION (1)
AUDIO CLASSIFY (1)
AUDIO DATABASES (1)
AUDIO DETECTION (1)
AUTOMATIC PRONUNCIATION EVALUATION (1)
BROADCAST VIDEO (1)
CLUSTERING (1)
CLUSTERING ALGORITHMS (1)
COHERENCE (1)
COMPUTATIONAL MODELING (1)
COMPUTER ASSISTED LANGUAGE LEARNING (1)
CONTEXT (1)
DATABASES (1)
DETECTORS (1)
DISYLLABIC WORDS (1)
F0 (1)
FACE (1)
FEATURE COMBINATION (1)
GAUSSIAN PROCESSES (1)
GENERALIZED LINEAR DISCRIMINANT SEQUENCE (1)
GENERALIZED LINEAR DISCRIMINANT SEQUENCE-SUPPORT VECTOR MACHINE (1)
GLDS-SVM (1)
GRAPH-MODAL (1)
KERNEL (1)
LBI-TONE (1)
LINEAR REGRESSION (1)
MANDARIN (1)
MANDARIN COMPUTER ASSISTED LANGUAGE LEARNING SYSTEMS (1)
MEL-FREQUENCY CEPSTRAL COEFFICIENT (1)
MFCC (1)
MISPRONUNCIATION DETECTION (1)
MISPRONUNCIATION DETECTION METHOD (1)
MOTION PICTURES (1)
MULTI-MODAL (1)
NATURAL LANGUAGE PROCESSING (1)
NEWS STORY SEGMENTATION (1)
NOISE (1)
NORMALIZED FORMANT TRAJECTORY INFORMATION (1)
PATTERN CLUSTERING (1)
PRONUNCIATION QUALITY (1)
RBI-TONE (1)
READING SPEECH DATABASE (1)
REGRESSION ANALYSIS (1)
SILICON (1)
SPEECH PROCESSING (1)
SPEECH RECOGNITION (1)
STG ANALYSIS (1)
SUPPORT VECTOR MACHINE (1)
SYSTEM FUSION (1)
TONE ERROR DETECTION (1)
TOPIC CAPTION DETECTION AND TRACK (1)
UBM-GMM (1)
ULTIMATE MACHINE SCORE (1)
UNI-TONE (1)
UNIVERSAL BACKGROUND MODEL-GAUSSIAN MIXTURE MODEL (1)

INFONA - science communication portal
