Search results for: Bartosz Ziółko

Items from 1 to 16 out of 16 results

chapter

Speech/music discrimination for analysis of radio stations

Stanislaw Kacprzak, Blazej Chwiecko, Bartosz Ziolko

2017 International Conference on Systems, Signals and Image Processing (IWSSIP) > 1 - 4

2017 International Conference on Systems, Signals and Image Processing (IWSSIP)

A computationally efficient feature, called Minimum Energy Density (MED), was applied to discriminate between speech and music in audio signals from radio station programs. The presented binary classifier is based on testing two features: energy distribution and differences between energy in channels. We analyzed 240 hours of signals from 10 Polish radio stations. Our analysis enables us to provide...

chapter

HMM-based breath and filled pauses elimination in ASR

Piotr Zelasko, Tomasz Jadczyk, Bartosz Ziolko

2014 International Conference on Signal Processing and Multimedia Applications (SIGMAP) > 255 - 260

2014 International Conference on Signal Processing and Multimedia Applications (SIGMAP)

The phenomena of filled pauses and breaths pose a challenge to Automatic Speech Recognition (ASR) systems dealing with spontaneous speech, including recognizer modules in Interactive Voice Response (IVR) systems. We suggest a method based on Hidden Markov Models (HMM), which is easily integrated into HMM-based ASR systems and allows detection of those disturbances without incorporating additional parameters...

chapter

Comparison of language models trained on written texts and speech transcripts in the context of automatic speech recognition

Sebastian Dziadzio, Aleksandra Nabozny, Aleksander Smywinski-Pohl, Bartosz Ziolko

2015 Federated Conference on Computer Science and Information Systems (FedCSIS) > 193 - 197

2015 Federated Conference on Computer Science and Information Systems (FedCSIS)

We investigate whether language models used in automatic speech recognition (ASR) should be trained on speech transcripts rather than on written texts. By calculating the log-likelihood statistic for part-of-speech (POS) n-grams, we show that there are significant differences between written texts and speech transcripts. We also test the performance of language models trained on speech transcripts and...

chapter

Detecting recorded speech for Polish language

Bartosz Stolinski, Bartosz Ziolko

AFRICON 2015 > 1 - 5

IEEE AFRICON 2015

This paper analyses three possible methods of detecting recorded speech and tests their applicability to voicemail detection. The methods chosen for testing were: transmission channel characteristics extraction with PFCC, recorded speech detection with trained pattern classifier, differences in transmission channels and speech recognition. Most of the tests gave results...

chapter

Efficient vectorized architecture for Feedback Delay Network reverberator with policy based design

Ireneusz Gawlik, Tomasz Pedzimaz, Szymon Palka, Bartosz Ziolko

2015 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA) > 124 - 127

2015 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

The Feedback Delay Network (FDN) is used as an artificial digital reverberation algorithm. Being one of the most natural-sounding approaches, it has been widely implemented in many sound processing software products. Although FDN is a very potent tool for artificial reverberation, achieving proper perceptual quality of acoustic simulation usually demands additional modifications to signal processing...

chapter

Length of phonemes in a context of their positions in Polish sentences

Magdalena Igras, Bartosz Ziolko, Mariusz Ziolko

2013 International Conference on Signal Processing and Multimedia Applications (SIGMAP) > 59 - 64

2013 International Conference on Signal Processing and Multimedia Applications (SIGMAP)

The paper presents statistical phonetic data of Polish collected from a corpus. Lengths of phonemes vary from 5 ms to 670 ms. Average durations of Polish phonemes are presented, as well as an important anomaly of longer phonemes at the end of sentences, which is the main topic of the paper. This observation can be used in speech recognition for automatic insertion of dots and sentence modelling....

chapter

Linguistically motivated tied-state triphones for Polish speech recognition

Piotr Zelasko, Bartosz Ziolko, Tomasz Jadczyk, Tomasz Pedzimaz

2015 IEEE 2nd International Conference on Cybernetics (CYBCONF) > 251 - 254

2015 IEEE 2nd International Conference on Cybernetics (CYBCONF)

The paper presents one possible approach to building a triphone model for automatic speech recognition of Polish. Even though classifiers are well developed and described, such a task is not trivial because of the lack of sufficient training data and the importance of the calculation time spent on training the model. To overcome this problem, some states are typically tied using data-driven criteria...

chapter

Design of integer filters for transmultiplexer perfect reconstruction

Bartosz Ziolko, Mariusz Ziolko, Michal Nowak

2005 13th European Signal Processing Conference > 1 - 4

2005 13th European Signal Processing Conference

A new and efficient method of designing the transmultiplexer filters is presented. The bilinear equations posed for the FIR filters are solved to achieve perfect reconstruction. For a given combining filter bank a separation filter bank can be developed by solving a set of algebraic equations. Some examples of a two-channel and four-channel transmultiplexer system are provided to illustrate the method...

chapter

Statistics of diphones and triphones presence on the word boundaries in the Polish language. Applications to ASR

Bartosz Ziolko, Piotr Zelasko, Dawid Skurzok

XXII Annual Pacific Voice Conference (PVC) > 1 - 6

2014 XXII Annual Pacific Voice Conference (PVC)

Recognition of continuous speech is one of the major challenges in automatic speech recognition (ASR), especially in phonetically complex languages (e.g. Polish). To improve ASR of the Polish language, we obtained phoneme statistics to locate diphones and triphones within running speech sequences. We found that these clusters are more likely to occur on word boundaries than within the...

chapter

Is phoneme length and phoneme energy useful in automatic speaker recognition?

Magdalena Igras, Bartosz Ziolko, Mariusz Ziolko

XXII Annual Pacific Voice Conference (PVC) > 1 - 5

2014 XXII Annual Pacific Voice Conference (PVC)

The paper presents an analysis of prosodic parameters of speech (energy, phoneme duration) as features characteristic of a speaker. The most significant parameters of the features were investigated using the CORPORA speech database and described statistically. We observed that phoneme duration depends on the speaker, as does the preboundary lengthening of the phonemes in sentences. An average phoneme energy...

chapter

Wavelet method for breath detection in audio signals

Magdalena Igras, Bartosz Ziolko

2013 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

2013 IEEE International Conference on Multimedia and Expo (ICME)

An algorithm for automatic detection of breath events in a speech signal is suggested in this paper. The occurrence of breath events in recordings is discussed, as well as their statistical parameters. The role of breath pauses in signalling punctuation and the emotional or physical state of the speaker, in both spontaneous and read speech, is also described. Wavelet parameters of energy in...

chapter

Confidence measure by substring comparison for automatic speech recognition

Bartosz Ziolko, Tomasz Jadczyk, Dawid Skurzok, Mariusz Ziolko

2012 International Conference on Audio, Language and Image Processing > 314 - 318

2012 International Conference on Audio, Language and Image Processing (ICALIP)

Two possible confidence measures for automatic speech recognition are presented along with results of tests where they were applied. One of them is widely known and is based on comparing the strongest hypothesis with an average of the next few hypotheses. We found it not efficient in all cases, which is why we came up with our own method based on comparison of substrings. The new algorithm was found useful...

chapter

Evaluation of errors in Polish phones segmentation for different types of transitions

Bartosz Ziolko, Mariusz Ziolko, Suresh Manandhar, Richard C Wilson

Melecon 2010 - 2010 15th IEEE Mediterranean Electrotechnical Conference > 1435 - 1438

MELECON 2010 - 2010 15th IEEE Mediterranean Electrotechnical Conference

The paper presents an evaluation of Polish phone segmentation for different types of phones. The categorisation was done based on acoustic properties. The segmentation method is based on the discrete wavelet transform and was already published. The results show that several types of transitions, especially from and to vowels, cause more errors than others.

chapter

Lossless JPEG-Base Compression of Transmultiplexed Images

Przemyslaw Sypka, Mariusz Ziolko, Bartosz Ziolko

2006 IEEE 12th Digital Signal Processing Workshop & 4th IEEE Signal Processing Education Workshop > 531 - 534

2006 IEEE 12th Digital Signal Processing Workshop & 4th IEEE Signal Processing Education Workshop

A transmultiplexer designed to combine images into one image to be sent through a single communication channel is presented. The considered system can be equipped with integer-to-integer filters to enable lossless compression. The efficiency of lossless JPEG compression applied to transmultiplexed signals has been verified.

chapter

Lossy Compression Approach to Transmultiplexed Images

Przemyslaw Sypka, Mariusz Ziolko, Bartosz Ziolko

Proceedings ELMAR 2006 > 289 - 292

Proceedings ELMAR 2006

The application of image compression methods in transmultiplexer systems is presented. The specific energy distribution in the combined image spectrum makes the standard compression methods, especially those based on frequency decomposition, inefficient. Two cases are described and compared: the compression of the combined image and the preliminary compression of input images before transmultiplexing...

article

Wavelet method of speech segmentation

Bartosz Ziolko, Suresh Manandhar, Richard C. Wilson, Mariusz Ziolko

2006 14th European Signal Processing Conference > 2006 > 1 - 5

2006 14th European Signal Processing Conference

In this paper a new method of speech segmentation is suggested. It is based on power fluctuations of the wavelet spectrum for a speech signal. In most approaches to speech recognition, the speech signals are segmented using constant-time segmentation. Constant segmentation needs to use windows to decrease the boundary distortions. A more natural approach is to segment the speech signals on the basis...

