Search results for: Delphine Charlet

Items from 1 to 17 out of 17 results

chapter

Selecting Representative Speakers for a Speech Database on the Basis of Heterogeneous Similarity Criteria

Sacha Krstulović, Frédéric Bimbot, Olivier Boëffard, Delphine Charlet, et al.

Lecture Notes in Computer Science > Speaker Classification II > 276-292

In the context of the Neologos French speech database creation project, a general methodology was defined for the selection of representative speaker recordings. The selection aims at providing a good coverage in terms of speaker variability while limiting the number of recorded speakers. This is intended to make the resulting database both more adapted to the development of recently proposed multi-model...

chapter

Speaker diarization with unsupervised training framework

Gael Le Lan, Sylvain Meignier, Delphine Charlet, Paul Deleglise

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5560 - 5564

ICASSP 2016 - 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper investigates single and cross-show diarization based on an unsupervised i-vector framework, on French TV and Radio corpora. This framework uses speaker clustering as a way to automatically select data from unlabeled corpora to train i-vector PLDA models. Performances between supervised and unsupervised models are compared. The experimental results on two distinct test corpora (one TV, one...

chapter

Title assignment for automatic topic segments in TV broadcast news

Abdessalam Bouchekif, Geraldine Damnati, Delphine Charlet, Nathalie Camelin, et al.

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6100 - 6104

ICASSP 2016 - 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper addresses the task of assigning a title to topic segments automatically extracted from TV Broadcast News video recordings. We propose to associate a topic segment with the title of a newspaper article collected on the web at the same date. The task implies pairing newspaper articles and topic segments by maximising a given similarity measure. This approach raises several issues, such as...

chapter

Fusion of speaker and lexical information for topic segmentation: A co-segmentation approach

Delphine Charlet, Geraldine Damnati, Abdessalam Bouchekif, Ameur Douib

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5261 - 5265

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this work, we investigate how speaker-based information and lexical-based information can be fused efficiently for topic segmentation of spoken contents. While in recent work, we have proposed an early fusion scheme, so as to jointly model speaker and lexical distribution, we propose here a co-segmentation framework, between segmentations performed in the speaker space and in the lexical space...

chapter

Scene understanding for identifying persons in TV shows: Beyond face authentication

Mickael Rouvier, Benoit Favre, Meriem Bendris, Delphine Charlet, et al.

2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI) > 1 - 6

2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI)

Our goal is to automatically identify people in TV news and debates without any predefined dictionary of people. In this paper, we focus on the problem of person identification beyond face authentication in order to improve the identification results and not only where the face is detectable. We propose to use automatic scene analysis as features for people identification. We exploit two features:...

chapter

Multiple-view constrained clustering for unsupervised face identification in TV-broadcast

Meriem Bendris, Benoit Favre, Delphine Charlet, Geraldine Damnati, et al.

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 494 - 498

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Our goal is to automatically identify faces in TV broadcast without a pre-defined dictionary of identities. Most methods are based on identity detection (from OCR and ASR) and require a propagation strategy based on visual clustering. In TV content, people appear with many variations making the clustering difficult. In this case, speaker clustering can be a reliable link for face clustering. We propose...

chapter

Intra-content term weighting for topic segmentation

Abdessalam Bouchekif, Geraldine Damnati, Delphine Charlet

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7113 - 7117

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Term weighting is an important task in many applications, such as information retrieval, extraction of significant words or automatic summarization. It translates the capacity of a term to discriminate a document within a collection, or a part of a document within a whole document. This paper deals with term weighting strategies in the context of lexical cohesion based topic segmentation. The aim...

chapter

Unsupervised face identification in TV content using audio-visual sources

Meriem Bendris, Benoit Favre, Delphine Charlet, Geraldine Damnati, et al.

2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI) > 243 - 249

2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI)

Our goal is to automatically identify faces in TV content without a pre-defined dictionary of identities. Most methods are based on identity detection (from OCR and ASR) and require a propagation strategy based on visual clustering. In TV content, people appear with many variations, making the clustering very difficult. In this case, identifying speakers can be a reliable link to identify faces. In...

chapter

Impact of overlapping speech detection on speaker diarization for broadcast news and debates

Delphine Charlet, Claude Barras, Jean-Sylvain Lienard

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 7707 - 7711

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The overlapping speech detection systems developed by Orange and LIMSI for the ETAPE evaluation campaign on French broadcast news and debates are described. Using either cepstral features or a multi-pitch analysis, an F1-measure for overlapping speech detection up to 59.2% is reported on the TV data of the ETAPE evaluation set, where 6.7% of the speech was measured as overlapping, ranging from 1.2%...

chapter

Detecting politician speech in TV broadcast news shows

Delphine Charlet, Geraldine Damnati

2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI) > 1 - 6

2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI)

Politician speaker turn detection in TV Broadcast News shows is addressed in this paper. After a first role labeling pass of speaker turns among anchor, reporter and other, turns labeled as other are submitted to a politician speech detection process. The proposed approach combines acoustical and lexical cues as well as contextual information, and does not use any specific politician model (person-independent)...

chapter

Automatic error region detection and characterization in LVCSR transcriptions of TV news shows

Richard Dufour, Geraldine Damnati, Delphine Charlet

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4445 - 4448

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

This paper addresses the issue of error region detection and characterization in LVCSR transcriptions. It is a well-known phenomenon that errors are not independent and tend to co-occur in automatic transcriptions. We are interested in automatically detecting these so-called error regions. Additionally, in the context of information extraction in TVBN shows, being able to automatically characterize...

chapter

People indexing in TV-content using lip-activity and unsupervised audio-visual identity verification

Meriem Bendris, Delphine Charlet, Gerard Chollet

2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI) > 139 - 144

2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI)

Our goal is to structure TV-content by person, allowing a user to navigate through the sequences of the same person. To let a user browse through the content without restriction on people within it, this structuring has to be done without any pre-defined dictionary of people. To this end, most methods propose to index people independently by the audio and visual information, and associate the indexes...

chapter

Robust speaker turn role labeling of TV Broadcast News shows

Geraldine Damnati, Delphine Charlet

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5684 - 5687

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Speaker role recognition in TV Broadcast News shows is addressed in this paper with a particular focus on speaker turn role labeling. A mixed approach combining speaker clustering and analysis of Automatic Speech Recognition output is proposed for assigning speaker turns a role among: anchor, reporter and other. 86% classification accuracy is obtained for automatically segmented speaker turns on a...

chapter

Talking faces indexing in TV-content

Meriem Bendris, Delphine Charlet, Gérard Chollet

2010 International Workshop on Content Based Multimedia Indexing (CBMI) > 1 - 6

8th International Workshop on Content-Based Multimedia Indexing (CBMI 2010)

Our objective is to index talking faces in a TV-Context: build a description of TV-content, in terms of talking people, without any pre-defined dictionary of identities. In TV-content, because of multi-face shots and non-speaking face shots, it is difficult to determine which face is speaking. In this work, a method is proposed which clusters people independently by the audio and by the visual information...

article

Optimizing the coverage of a speech database through a selection of representative speaker recordings

Sacha Krstulović, Frédéric Bimbot, Olivier Boëffard, Delphine Charlet, et al.

Speech Communication > 2006 > 48 > 10 > 1319-1348

In the context of the Neologos French speech database creation project (funded by the French Ministry of Research in the framework of the Technolangue program), a general methodology was defined for the selection of representative speaker recordings. The selection aims at providing a good coverage in terms of speaker variability while limiting the number of recorded...

article

Speaker recognition by location in the space of reference speakers

Yassine Mami, Delphine Charlet

Speech Communication > 2006 > 48 > 2 > 127-141

Speaker representation by location in a reference space is a new technique of speaker recognition and adaptation. It consists in representing a speaker relatively rather than absolutely, by comparing him to a set of well-trained speakers. The main motivation is to obtain a compact modeling of every speaker, which gives similar performances to those of the state of the art GMM-UBM. Thus, instead of...

chapter

Speaker indexing for retrieval of voicemail messages

Delphine Charlet

2002 IEEE International Conference on Acoustics, Speech, and Signal Processing > 1 > I-121 - I-124

Proceedings of ICASSP '02

This paper addresses the task of voicemail messages retrieval according to a target speaker defined by a given voicemail message. The core metric used for speaker modeling is GMM-based and the paper focuses on the sorting algorithms of the voicemail messages. Various algorithms are studied and compared. An algorithm that sorts messages according to their inclusion rank into a cluster built on the...

INFONA - science communication portal
