Search results for: I. Potamitis

Items from 1 to 7 out of 7 results

chapter

On the First Greek-TTS Based on Festival Speech Synthesis

P. Zervas, I. Potamitis, N. Fakotakis, G. Kokkinakis

Lecture Notes in Computer Science > Text, Speech and Dialogue > Speech > 261-264

In this article we describe the first Text To Speech (TTS) system for the Greek language based on Festival architecture. We discuss practical implementation details and we capitalize on the preparation of the diphone database and on the prediction of phoneme duration module implemented with CART tree technique. Two male databases where used for two different speech synthesis engines, namely, residual...

chapter

Sound classification based on temporal feature integration

S Ntalampiras, I Potamitis, N Fakotakis

2010 4th International Symposium on Communications, Control and Signal Processing (ISCCSP) > 1 - 4

4th International Symposium on Communications, Control and Signal Processing (ISCCSP 2010)

The present work contributes to the field of generalized sound classification. We extensively examine the performance of the next three feature sets: a) MPEG-7 Audio Spectrum Projection, b) MFCC (using an alternative method for their extraction) and c) a group derived utilizing critical band based wavelet packets. Subsequently three types of temporal feature integration strategies are applied on the...

chapter

PROMETHEUS database: A multimodal corpus for research on modeling and interpreting human behavior

S. Ntalampiras, D. Arsic, A. Stormer, T. Ganchev, more

2009 16th International Conference on Digital Signal Processing > 1 - 8

2009 16th International Conference on Digital Signal Processing (DSP)

The present paper describes the construction of a multimodal database, referred to as the PROMETHEUS database, which contains recordings from heterogeneous sensors. The main purpose of this database is the development of a framework for monitoring and interpretation of human behavior in unrestricted environments of both indoor and outdoor type. It contains single-person and multi-person scenarios,...

chapter

On acoustic surveillance of hazardous situations

S. Ntalampiras, I. Potamitis, N. Fakotakis

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 165 - 168

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

The present study presents a practical methodology for automatic space monitoring based solely on the perceived acoustic information. We consider the case where atypical situations such as screams, explosions and gunshots take place in a metro station environment. Our approach is based on a two stage recognition schema, each one exploiting HMMs for approximating the density function of the corresponding...

chapter

Acoustic Monitoring of Singing Insects

T. Ganchev, I. Potamitis, N. Fakotakis

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '7 > 4 > IV-721 - IV-724

2007 IEEE International Conference on Acoustics, Speech, and Signal Processing

This work reports recent progress towards the development of a pilot system for automatic identification of singing insects. We propose a sound parameterization technique that is designed explicitly for the needs of acoustic insect recognition. It is combined with state-of-the-art classification methods that dominate speaker recognition technology. Specifically, the categorization of acoustic emissions...

chapter

Automatic acoustic identification of crickets and cicadas

I. Potamitis, T. Ganchev, N. Fakotakis

2007 9th International Symposium on Signal Processing and Its Applications > 1 - 4

2007 9th International Symposium on Signal Processing and Its Applications (ISSPA)

The general problem addressed in this work is automatic identification of insects using only the acoustic modality. In particular, we discuss the characteristics of the acoustic profiles of two target groups of insects: crickets and cicadas. Subsequently, we employ advanced machine learning techniques to categorize them on the levels of specific insect, family, subfamily, genus, and species. To deal...

chapter

Gender-dependent and speaker-dependent speech enhancement

I. Potamitis, N. Fakotakis, G. Kokkinakis

2002 IEEE International Conference on Acoustics, Speech, and Signal Processing > 1 > I-249 - I-252

Proceedings of ICASSP '02

Our work introduces a speech enhancement technique that can explicitly incorporate prior information about the gender or speaker time-frequency characteristics in its formalism. We approximate the multimodal, clean speech linear spectrum magnitude with a mixture of Gaussians pdfs using the Expectation-Maximization algorithm (EM). Subsequently. we apply the Bayesian inference framework to the degraded...

Filter options

Publication date

Set your own date range

Keywords

ACCURACY (2)
ACOUSTIC SIGNAL PROCESSING (2)
ACOUSTICS (2)
ANIMALS (2)
CICADAS (2)
CRICKETS (2)
HIDDEN MARKOV MODELS (2)
ACOUSTIC EMISSIONS (1)
ACOUSTIC INFORMATION (1)
ACOUSTIC INSECT RECOGNITION (1)
ACOUSTIC MODALITY (1)
ACOUSTIC MONITORING (1)
ACOUSTIC SIGNAL DETECTION (1)
ACOUSTIC SURVEILLANCE (1)
AIRPORTS (1)
ARTIFICIAL NEURAL NETWORKS (1)
ASYNCHRONOUS TRANSFER MODE (1)
AUDIO SIGNAL PROCESSING (1)
AUTOMATIC ACOUSTIC IDENTIFICATION (1)
AUTOMATIC SPACE MONITORING (1)
AUTOREGRESSIVE FUNCTIONS (1)
AUTOREGRESSIVE PROCESSES (1)
BIOACOUSTICS (1)
BIOLOGICAL CONTROL SYSTEMS (1)
BIOLOGY COMPUTING (1)
BIOMEDICAL ACOUSTICS (1)
CAMERAS (1)
CIVIL SAFETY (1)
CLASSIFICATION ALGORITHMS (1)
COMPUTATIONAL MODELING (1)
CONTENT BASED AUDIO RECOGNITION (1)
DATA MINING (1)
DATABASES (1)
DENSITY FUNCTION (1)
DISTANCE MEASUREMENT (1)
DISTRIBUTED DATABASES (1)
ESTIMATION (1)
EXPLOSIONS (1)
FEATURE EXTRACTION (1)
GAUSSIAN MIXTURE MODELS (1)
GAUSSIAN PROCESSES (1)
GUNSHOTS (1)
HAZARDOUS SITUATIONS (1)
HETEROGENEOUS SENSORS (1)
HMM (1)
HUMAN BEHAVIOR (1)
HUMAN-ROBOT INTERACTION INTERFACES (1)
HUMANS (1)
KATYDIDS (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LEFT-RIGHT HIDDEN MARKOV MODELS (1)
MACHINE LEARNING TECHNIQUES (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
METRO STATION ENVIRONMENT (1)
MFCC (1)
MONITORING (1)
MPEG-7 (1)
MPEG-7 AUDIO SPECTRUM PROJECTION (1)
MULTIMODAL DATABASE (1)
NEURAL NETS (1)
NOISE (1)
POWER HARMONIC FILTERS (1)
PROBABILISTIC LOGIC (1)
PROBABILISTIC NEURAL NETWORK (1)
PROBABILITY DENSITY FUNCTION (1)
PRODUCTION (1)
PROMETHEUS DATABASE (1)
PUBLIC AREAS SURVEILLANCE (1)
SCORE-LEVEL CLASSIFIER FUSION (1)
SCREAMS (1)
SENSORS (1)
SHORT-TERM STATISTICS (1)
SIGNAL TO NOISE RATIO (1)
SIGNAL-BASED SURVEILLANCE (1)
SINGING INSECTS AUTOMATIC IDENTIFICATION (1)
SMART HOME (1)
SOUND CLASSIFICATION (1)
SOUND PARAMETERIZATION TECHNIQUE (1)
SPARSE SPECTRAL REPRESENTATION (1)
SPEAKER RECOGNITION (1)
SPECTRAL MOMENTS (1)
SPEECH (1)
SPEECH ENHANCEMENT (1)
STAGE RECOGNITION SCHEME (1)
STATE-OF-THE-ART CLASSIFICATION METHODS (1)
SURVEILLANCE (1)
TEMPORAL FEATURE INTEGRATION (1)
TRANSFORM CODING (1)
VIDEO SURVEILLANCE (1)
WAVEFORM ANALYSIS (1)
WAVELET PACKETS (1)
WEIGHT MEASUREMENT (1)
ZOOLOGY (1)
more

Data set

ieee (6)
Springer (1)

INFONA - science communication portal

Search results for: I. Potamitis

On the First Greek-TTS Based on Festival Speech Synthesis

Sound classification based on temporal feature integration

PROMETHEUS database: A multimodal corpus for research on modeling and interpreting human behavior

On acoustic surveillance of hazardous situations

Acoustic Monitoring of Singing Insects

Automatic acoustic identification of crickets and cicadas

Gender-dependent and speaker-dependent speech enhancement

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options