Wyniki wyszukiwania dla: G. Rozinaj

Pozycje od 1 do 15 spośród 15 wyników

rozdział

Speech synthesis using compressed database

R. Rybarova, G. Rozinaj

2015 57th International Symposium ELMAR (ELMAR) > 105 - 108

2015 57th International Symposium ELMAR

This paper describes implementation of the speech synthesizer using a diphone database in parametric format. The paper deals also with harmonic models with noise (HNM) belonging to sinusoidal model used for database segments coding. The HNM approach of speech representation allows us to reduce significantly the database of speech segments for a concatenative speech synthesis, too.

rozdział

Context analysis using bigrams

M. Spilka, G. Rozinaj, R. Rybarova

2015 IEEE 19th International Conference on Intelligent Engineering Systems (INES) > 401 - 404

2015 IEEE 19th International Conference on Intelligent Engineering Systems (INES)

This paper focuses on using bigrams in a topic determination for speech synthesizer. It contains an explanation of a modular architecture for the speech synthesizer and importance of context analysis for customizing and quality enhancement of synthesized speech. The bigram carries information about context and in this work it is shown how to use them to improve the identification of the theme. At...

rozdział

Analysis of prosody features in Slovak

A Kondelova, J Toth, G Rozinaj

Proceedings ELMAR-2010 > 371 - 374

2010 52nd International Symposium ELMAR

In this work we were able to create module to speech synthesizer. The main function of this module is to change fundamental frequency. There are many possibilities how to change shape of melodic curve. We picked up method to modify fundamental frequency with PRAAT script. In this work we also analyzed some types of Slovak sentences with focus on prosodic curves.

rozdział

Adding Voicing Features into Speech Recognition Based on HMM in Slovak

J. Kacur, G. Rozinaj

2009 16th International Conference on Systems, Signals and Image Processing > 1 - 4

2009 16th International Conference on Systems, Signals and Image Processing

This article discusses the impact of substituting some of the basic speech features with the voiced/ unvoiced information and possibly with the estimated pitch value. As a good measure of the signal's voicing the average magnitude difference function was assumed, especially the ratio of its average value to its local minima found within the accepted ranges of the pitch. Furthermore, the pitch itself...

rozdział

Parametrization of a Slovak speech database for mobile platform speech synthesis

M.T. Nagy, G. Rozinaj, P. Hvisc

2009 International Symposium ELMAR > 225 - 228

2009 International Symposium ELMAR. ELMAR 2009

One of the main topics in the area of user-friendly human machine interface is speech synthesis. The paper deals with the application of harmonic plus noise model (HNM) for preparation of compressed speech database in a format that is that is useful for easy prosodic modification of the synthesized speech. The HNM approach of speech representation allows us to reduce significantly the database of...

rozdział

A residual modeling extension of HNM model for prosodic modification of Slovak speech

M.T. Nagy, G. Rozinaj

2008 15th International Conference on Systems, Signals and Image Processing > 453 - 456

2008 International Conference on Systems, Signals and Image Processing (IWSSIP)

This article describes an extension of HNM model used for prosodic modification of speech. HNM model represents speech signals as a sum of harmonic and noise part. The decomposition of speech signal into these two parts allows more natural sounding modifications of the signal. The parametric representation of speech provides a straightforward way of changing prosodic features of the speech. Our algorithm,...

rozdział

Comparison of automatic speech recognizer SPHINX 3.6 and SPHINX 4.0 for creating systems in Slovak language

J. Vojtko, J. Korosi, G. Rozinaj

2008 15th International Conference on Systems, Signals and Image Processing > 537 - 539

2008 International Conference on Systems, Signals and Image Processing (IWSSIP)

In this paper we discuss a topic of an automatic speech recognition system based on a system SPHINX in various versions and configurations. We compare Sphinx version 3 and 4 for recognition of Slovak speech. Other comparison is focused on the type of a language model. We have used regular grammar and bi-gram language model as compared language model.

rozdział

System for prosodic modification of corpus synthetized Slovak speech

M. Turi Nagy, G. Rozinaj, J. Cepko

2008 50th International Symposium ELMAR > 2 > 643 - 646

2008 50th International Symposium ELMAR

In this article, a design of an improvement of a TTS system for Slovak speech synthesis is described. The improvement consists of a new type of parametric corpus storing, that allows prosodic modification of the speech in real-time processing. This approach is based on a HNM (harmonic plus noise) model that represents speech signals as a sum of a harmonic and a noise part. The decomposition of speech...

rozdział

IQ kiosk in metropolitan information system

J. Vrabec, G. Rozinaj, R. Talafova

2008 50th International Symposium ELMAR > 2 > 635 - 638

2008 50th International Symposium ELMAR

The paper introduces an idea of a metropolitan information system. The aim of the system is to provide various kinds of information about the city not only for tourists and strangers but for the citizens of the city, as well. The main principle is based on a philosophy of accessing data from the Internet and to provide a user-friendly interface to these data using various types of intelligent kiosks...

rozdział

IQ Kiosk - multimedia intelligent terminal

J. Vrabec, G. Rozinaj

ELMAR 2007 > 203 - 206

49th International Symposium ELMAR-2007

In this paper we describe a proposal of a multimedia kiosk that can be used to provide diverse information to the wide public. We propose two versions of the intelligent kiosk. First version is placed on public places and offer three-dimensional human head displayed on a large display that gives information about city, institutions, weather, etc. It is a system with integrated microphone array, camera...

rozdział

Speech synthesis for mobile phone

R. Talafova, G. Rozinaj, J. Cepko

ELMAR 2007 > 167 - 170

49th International Symposium ELMAR-2007

This project is about speech synthesis and creating a speech synthesiser for a mobile cell phone. The first part of this project is about speech synthesis. From the all type of synthesis only diphones synthesis is discussed further, because its features for a mobile cell phone are superior, compared to the other types. This work further analyses implementation of speech synthesiser -this means loading...

rozdział

A hybrid pitch period estimation method based on HNM model

M.T. Nagy, G. Rozinaj, A. Palenik

ELMAR 2007 > 175 - 178

49th International Symposium ELMAR-2007

Pitch period estimation (also called fundamental frequency estimation) is widely needed in speech processing for many purposes. In our system for prosodic modification of speech, the pitch period estimation is used as a basis for frame length detection. The pitch period estimation method used in the system is a hybrid method that is based on YIN fundamental frequency estimation algorithm and a method...

rozdział

The training of Slovak speech recognition system based on Sphinx 4 for GSM networks

J. Vojtko, J. Kacur, G. Rozinaj

ELMAR 2007 > 147 - 150

49th International Symposium ELMAR-2007

In the submitted paper we present the training process of HMM models that are designed to be used in ASR systems employed in GSM networks. First a brief overview regarding the current problems and applications of ASR systems is given, followed by the description of MOBILDAT-SK speech database and the SPHINX 4 and SphitixTrain capabilities. Then the process of HMM models training is presented utilizing...

rozdział

Face Feature Detection for 3D Model of Talking Head with Speech Synthesis

R. Talafova, G. Rozinaj

2007 14th International Workshop on Systems, Signals and Image Processing and 6th EURASIP Conference focused on Speech and Image Processing, Multimedia Communications and Services > 137 - 139

2007 14th International Workshop in Systems, Signals and Image Processing and 6th EURASIP Conference focused on Speech and Image Processing, Multimedia Communications and Services - EC-SIPMCS 2007

Several new original algorithms for face feature detection are presented within this paper. The detected objects are used to produce realistic 3D model of human face. Presented methods have been tested and the results are discussed in the paper. The facial feature detection is based on human skin chromaticity and morphological characteristics of the human head. Output of the skin detection is used...

rozdział

MABox - Multimodal Microphone Array Algorithm Development System

J. Vrabec, G. Rozinaj, J. Vojtko

2007 14th International Workshop on Systems, Signals and Image Processing and 6th EURASIP Conference focused on Speech and Image Processing, Multimedia Communications and Services > 281 - 283

2007 14th International Workshop in Systems, Signals and Image Processing and 6th EURASIP Conference focused on Speech and Image Processing, Multimedia Communications and Services - EC-SIPMCS 2007

In this work a design and a realization of multimodal microphone array algorithm development system which is proposed to develop new microphone array algorithm named MABox is presented. This device incorporates microphone array with four microphones, ADC cards and development software. Microphones are integrated in a separate directional box pointed to the speaker, the box is connected via USB and...

Opcje filtrowania

Data publikacji

Ustaw własny zakres dat

Słowa kluczowe

SPEECH (8)
SPEECH SYNTHESIS (7)
DATABASES (6)
SPEECH RECOGNITION (6)
HNM MODEL (4)
PROSODIC MODIFICATION (4)
HARMONIC ANALYSIS (3)
HARMONIC PLUS NOISE MODEL (3)
HMM (3)
MICROPHONE ARRAY (3)
MOBILE HANDSETS (3)
NATURAL LANGUAGE PROCESSING (3)
NOISE (3)
POWER HARMONIC FILTERS (3)
SPEECH PROCESSING (3)
DATA MINING (2)
FEATURE EXTRACTION (2)
HIDDEN MARKOV MODELS (2)
HUMAN COMPUTER INTERACTION (2)
MOBILE PHONE (2)
PROSODY (2)
SINUSOIDAL MODELING (2)
SPEECH CODING (2)
SPEECH-BASED USER INTERFACES (2)
SYNTHESIZERS (2)
TRAINING (2)
TTS SYNTHESIS (2)
3D MODEL (1)
ACCURACY (1)
ADC CARDS (1)
ANNOTATION FILE (1)
ARRAY (1)
ARRAY SIGNAL PROCESSING (1)
ASR (1)
ASR SYSTEMS (1)
AUDIO DATABASES (1)
AUTOMATIC SPEECH RECOGNITION (1)
AUTOMATIC SPEECH RECOGNITION SPHINX 3.6 (1)
AUTOMATIC SPEECH RECOGNITION SYSTEM (1)
BEAM FORMING (1)
BEAMFORMING ALGORITHM (1)
BIGRAM LANGUAGE MODEL (1)
BIOLOGICAL SYSTEM MODELING (1)
C# (1)
CD PHONEME MODELS (1)
CELLULAR RADIO (1)
CITIES AND TOWNS (1)
COMPRESSED PARAMETRIC DATABASE (1)
CONCATENATIVE SPEECH SYNTHESIS (1)
CONTEXT (1)
CONTEXT DEPENDENT PHONEMES (1)
CONTEXT INDEPENDENT PHONEMES (1)
CORPUS SYNTHETIZED SLOVAK SPEECH (1)
DECODING (1)
DEVELOPMENT SOFTWARE ENVIRONMENT (1)
DIPHONE (1)
DIPHONE SPEECH SYNTHESIS (1)
DIPHONES (1)
DIPHONES SYNTHESIS (1)
DSP (1)
FACE ANIMATION (1)
FACE FEATURE DETECTION (1)
FACE RECOGNITION (1)
FRAME LENGTH DETECTION (1)
FREQUENCY ESTIMATION (1)
FUNDAMENTAL FREQUENCY (1)
FUNDAMENTAL FREQUENCY DETECTION (1)
FUNDAMENTAL FREQUENCY ESTIMATION (1)
GRAMMAR (1)
GRAPHICAL ENGINE (1)
GSM NETWORKS (1)
HARMONIC PLUS NOISE MODEL (HNM) (1)
HCI (1)
HMM MODELS (1)
HUMAN SKIN CHROMATICITY (1)
HYBRID PITCH PERIOD ESTIMATION METHOD (1)
INFORMATION SYSTEMS (1)
INTELLIGENT KIOSK (1)
INTELLIGENT KIOSKS (1)
INTELLIGENT SPEECH COMMUNICATION INTERFACE (1)
INTERNET (1)
IQ KIOSK (1)
JAVA APPLICATION (1)
JAVA MICROPHONE (1)
MABOX (1)
MASPER TRAINING SCHEME (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MELODIC CURVE (1)
METROPOLITAN INFORMATION SYSTEM (1)
MFCC (1)
MICROPHONE ARRAYS (1)
MOBILDAT (1)
MOBILDAT-SK (1)
MOBILDAT-SK SPEECH DATABASE (1)
MOBILE (1)
MOBILE APPLICATIONS (1)
MOBILE CELL PHONE (1)
MOBILE DEVICES (1)
MOBILE PLATFORM SPEECH SYNTHESIS (1)
MODIFICATION (1)
więcej

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: G. Rozinaj

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu