Search results for: Li Lu

Items from 1 to 9 out of 9 results

chapter

Submodular data selection with acoustic and phonetic features for automatic speech recognition

Chongjia Ni, Lei Wang, Haibo Liu, Cheung-Chi Leung, et al.

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4629 - 4633

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we propose to use acoustic feature based submodular function optimization to select a subset of untranscribed data for manual transcription, and retrain the initial acoustic model with the additional transcribed data. The acoustic features are obtained from an unsupervised Gaussian mixture model. We also integrate the acoustic features with the phonetic features, which are obtained...

chapter

A study on sports video classification based on audio analysis and speech recognition

Li Lu, Qingwei Zhao, Yonghong Yan, Kun Liu

2010 International Conference on Audio, Language and Image Processing > 737 - 742

2010 International Conference on Audio, Language and Image Processing (ICALIP)

This paper proposes a method to deal with the problem of sports classification through audio analysis. First, a two-pass audio segmentation module is developed as the front-end to extract the announcer's speech from the audio streams. Then speech recognition technology is applied to the speech segments to extract keywords, which are used as features to distinguish different sports. Finally, based on the...

chapter

Audio Segmentation System for Sport Games

Junfang Zhang, Baochen Jiang, Li Lu, Qingwei Zhao

2010 International Conference on Electrical and Control Engineering > 505 - 508

2010 International Conference on Electrical and Control Engineering (ICECE 2010)

This paper proposes a two-pass audio segmentation method for sports games. The first pass performs the segmentation with a metric-based algorithm, and the second pass performs a model-based classification to extract speech segments. The audio segmentation module we developed can efficiently extract the announcer's speech from the complex sports audio stream.

chapter

A SVM-Based Audio Event Detection System

Li Lu, Fengpei Ge, Qingwei Zhao, Yonghong Yan

2010 International Conference on Electrical and Control Engineering > 292 - 295

2010 International Conference on Electrical and Control Engineering (ICECE 2010)

This paper proposes an SVM-based method to deal with the problem of detecting audio events (cheering and applause) by audio analysis. In our framework, a sliding window is first used to pre-segment the audio stream into short segments by moving from the start to the end. Second, various kinds of audio features are extracted to represent different audio sounds in each segment. Third, SVM (support vector machine)...

chapter

Detecting cheering events in sports games

Li Lu, Fengpei Ge, Qingwei Zhao, Yonghong Yan

2010 2nd International Conference on Education Technology and Computer > 1 > V1-223 - V1-227

2010 2nd International Conference on Education Technology and Computer (ICETC 2010)

This paper proposes a unified method to deal with the problem of detecting cheering events in the audio stream of live sports games. In our framework, first, a sliding window is used to pre-segment the audio stream into short segments by moving from the start to the end. Second, various kinds of audio features are extracted to represent different audio sounds in each segment. Third, GMM (Gaussian Mixture...

chapter

Commentator's Speech Extraction in Audio Stream of Sports Games

Li Lu, Fengpei Ge, Qingwei Zhao, Yonghong Yan

2009 International Conference on Research Challenges in Computer Science > 64 - 67

2009 International Conference on Research Challenges in Computer Science (ICRCCS 2009)

This paper proposes a method to deal with the problem of extracting the commentator's speech from the audio stream of live sports games. First, a two-pass metric-based audio segmentation module is developed to segment the audio stream into short segments with homogeneous acoustic features. Then a model-based classification module is adopted to extract the speech segments. For robust audio classification, various...

chapter

An Mandarin Pronunciation Quality Assessment System Using Two Kinds of Acoustic Models

Fengpei Ge, Li Lu, Changliang Liu, Fuping Pan, et al.

2009 International Conference on Research Challenges in Computer Science > 68 - 72

2009 International Conference on Research Challenges in Computer Science (ICRCCS 2009)

This paper presents our Mandarin pronunciation quality assessment system for the examination of Putonghua Shuiping Kaoshi (PSK) and investigates some measures to improve the assessment accuracy. In this paper, a selective speaker adaptation method is studied. In the adaptation module, we select well pronounced speech as the adaptation data, and adopt Maximum Likelihood Linear Regression (MLLR) to...

chapter

A Keyword Spotting Based Sports Type Determination System

Li Lu, Ran Xu, Fengpei Ge, Qingwei Zhao, et al.

2009 International Conference on Artificial Intelligence and Computational Intelligence > 2 > 361 - 365

2009 International Conference on Artificial Intelligence and Computational Intelligence (AICI 2009)

This paper proposes a novel system to automatically determine the sports type of a sports game by conducting keyword spotting on short fragments (around 10 minutes) of a sports game. In this system, we first develop an audio segmentation module as a front-end to separate announcers' speech efficiently from the complex sports audio stream. Then we employ speech recognition technology on these speech...

chapter

Sample-Based Automatic Dictionary Generation for Keyword Spotting System

Li Lu, Fengpei Ge, Ta Li, Qingwei Zhao, et al.

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery > 5 > 505 - 508

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2009)

In this paper we develop an approach to automatic, data-driven generation of pronunciation dictionaries for keyword spotting (KWS) systems. In practical applications, KWS tasks often have to deal with keywords whose pronunciations cannot be found in the dictionary. To solve this problem, we study how to derive pronunciations automatically from speech samples of keywords. Recognized sequences from...

Filter options

Keywords:
SPEECH


INFONA - science communication portal
