Search results

chapter

Speech pause time: A potential biomarker for depression detection

Zhenyu Liu, Huanyu Kang, Lei Feng, Lan Zhang

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 2020 - 2025

Detecting depression via speech has been an attractive topic in recent years. Significant correlation was found between speech pause time and depressive severity. In the present study, 92 depressed patients and 92 age-, gender- and education level-matched control participants were examined to investigate three temporal characteristics of speech: recording time (RT), phonation time (PT) and speech pause...

chapter

PSD estimation of multiple sound sources in a reverberant room using a spherical microphone array

Abdullah Fahim, Prasanga N. Samarasinghe, Thushara D. Abhayapala

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 76 - 80

We propose an efficient method to estimate source power spectral densities (PSDs) in a multi-source reverberant environment using a spherical microphone array. The proposed method utilizes the spatial correlation between the spherical harmonics (SH) coefficients of a sound field to estimate source PSDs. The use of the spatial cross-correlation of the SH coefficients allows us to employ the method...

chapter

Low-Complexity Kalman filter for multi-channel linear-prediction-based blind speech dereverberation

Thomas Dietzen, Simon Doclo, Ann Spriet, Wouter Tirry, et al.

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 284 - 288

Multi-channel linear prediction (MCLP) has been shown to be a suitable framework for tackling the problem of blind speech dereverberation. In recent years, a number of adaptive MCLP algorithms have been proposed, whereby the majority operates in the short-time Fourier transform (STFT) domain. In this paper, we focus on the STFT-based Kalman filter solution to the adaptive MCLP task. Similarly to all...

chapter

Unsupervised speaker segmentation framework based on sparse correlation feature

Yi Xin Sun, Yong Ma, Kai Bo Shi, Jiang Ping Hu, et al.

2017 Chinese Automation Congress (CAC) > 3058 - 3063

With the increasing stress in working and studying, mental health becomes a major problem in the current social research. Generally, researchers can analyze psychological health states by using social perception behavior. The speech signal is an important research direction in this domain. It objectively assesses the mental health of social groups through the extraction and fusion of speech features...

chapter

Software for an objective evaluation of the quality of syllables's pronunciation in speech rehabilitation

E. Y. Kostyuchenko, R. V. Mescheryakov, D. I. Novokhrestova, A. V. Pyatkov, et al.

2017 IEEE II International Conference on Control in Technical Systems (CTS) > 267 - 270

In this paper, a description of the software is provided that allows to obtain an automated objective quantitative assessment of the patient's speech quality. The evaluation is carried out in the process of speech rehabilitation after surgical treatment of oncological diseases of the organs of the speech-forming tract. The evaluation is obtained based on comparing patient records before the operation...

chapter

A new correlated set-membership partial-update NLMS (SM-PU-NLMSCOR) algorithm for acoustic noise reduction

Abdelhak Cheffi, Mohamed Djendi, Abderrezak Guessoum

2017 5th International Conference on Electrical Engineering - Boumerdes (ICEE-B) > 1 - 4

This paper addresses the problem of speech quality enhancement and acoustic noise reduction by adaptive filtering algorithms. In this paper, we propose a new version of the set-membership partial-update normalized least mean square (SM-PU-NLMS) algorithm. The proposed algorithm is based on the use of the cross-correlation of the output error signal and the noisy to control the variable-step-size in...

chapter

Multiple fundamental frequencies estimation approaches based on multi-scale product analysis

Jihen Zeremdini, Mohamed Anouar Ben Messaoud, Aicha Bouzid

2017 3rd International Conference on Frontiers of Signal Processing (ICFSP) > 55 - 58

This paper describes three methods for multiple fundamental frequencies estimation based on the multi-scale product analysis. The three methods use the autocorrelation of the multi-scale product analysis for the target pitch estimation. For the intrusion pitch, each one has its techniques. The first one uses the classic comb filtering. The second method employs the rectangular comb filter followed...

chapter

Multi-channel estimation of power spectral density matrix using inter-frame and inter-band information

Raziyeh Ranjbaryan, Hamid Reza Abutalebi

2017 25th European Signal Processing Conference (EUSIPCO) > 1634 - 1638

In this paper, we address the estimation of power spectral density (PSD) matrix. The accurate estimation of PSD matrix plays an important role in many speech enhancement methods. In traditional PSD estimation methods, only the information of previous frames is employed through a forgetting factor. In the current research, we consider the correlation of inter-band components and incorporate their information...

chapter

Signal Analysis for Voice Evaluation in Parkinson’s Disease

Domenico Mirarchi, Patrizia Vizza, Giuseppe Tradigo, Nicola Lombardo, et al.

2017 IEEE International Conference on Healthcare Informatics (ICHI) > 530 - 535

Parkinson's Disease (PD) is a neurodegenerative disorder that is frequently correlated with vowel articulation difficulties. The phonation problem that arises in patients affected by PD is commonly known as Parkinsonian Dysarthria and is identified by vocal signal analysis. The analysis supports physicians and specialists in early detection and monitoring of dysarthria, aiming to increase patients' quality of life...

chapter

Identification of correlation between blood relations using speech signal

Palli Padmini, Shikha Tripathi, Kaustav Bhowmick

2017 IEEE International Conference on Signal Processing, Informatics, Communication and Energy Systems (SPICES) > 1 - 6

This paper presents a study of how speech features have comparable parameters amongst blood relations. Mel Frequency Cepstral Coefficients (MFCC) has been used for extracting the features of input speech signal, along with vector quantization through modified k-means LBG (Linde, Buzo, and Gray) algorithm are implemented to analyse and estimate the similarity to perform related studies. The study is...

chapter

Sensitivity analysis of the multi-frame MVDR filter for single-microphone speech enhancement

Dorte Fischer, Simon Doclo

2017 25th European Signal Processing Conference (EUSIPCO) > 603 - 607

Recently, a multi-frame minimum variance distortionless response (MFMVDR) filter for single-microphone noise reduction has been proposed, which exploits speech correlation across consecutive time frames. It has been shown that the MFMVDR filter achieves impressive results when the speech interframe correlation vector can be accurately estimated. In this paper, we analyze the influence of estimation...

chapter

Blind Source Separation and Identification for Speech Signals

Jie Yin, Zhiliang Liu, Yaqiang Jin, Dandan Peng, et al.

2017 International Conference on Sensing, Diagnostics, Prognostics, and Control (SDPC) > 398 - 402

Background noise reduction has been studied for many years. However, unwanted human speech noise suppression is not well discussed due to sparsity of the speech signal. Traditional blind source separation (BSS) methods such as independent component analysis (ICA) assume the prior knowledge of the number of sources and require that the number of sources must equal the number of sensors. Above limitations...

chapter

Disordered speech quality estimation using linear prediction

Yousef S Ettomi Ali, Vijay Parsa, Phillip Doyle, Soulaimane Berkane

2017 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM) > 1 - 5

Tracheoesophageal (TE) speech is generated by patients who have undergone a total laryngectomy where the larynx (voice box) is removed and replaced by a tracheoesophageal puncture. This work presents a novel low complexity algorithm to estimate the degree of severity of disordered TE speech. The proposed algorithm uses features which are computed from 32-ms voiced frames of the speech signal. A 21-st...

chapter

Continuous affect prediction using eye gaze

Jonny O'Dwyer, Ronan Flynn, Niall Murray

2017 28th Irish Signals and Systems Conference (ISSC) > 1 - 6

In recent times, there has been significant interest in the machine recognition of human emotions, due to the suite of applications to which this knowledge can be applied. A number of different modalities, such as speech or facial expression, individually and with eye gaze, have been investigated by the affective computing research community to either classify the emotion (e.g. sad, happy, angry)...

chapter

Assessing freezing of gait in Parkinson's disease using analysis of hypokinetic dysarthria

Zoltan Galaz, Jiri Mekyska, Tomas Kiska, Vojtech Zvoncak, et al.

2017 40th International Conference on Telecommunications and Signal Processing (TSP) > 735 - 738

Hypokinetic dysarthria (HD) and freezing of gait (FOG) are frequent symptoms of Parkinson's disease (PD). The aim of this work is to reveal pathological mechanisms common for HD and FOG, and use acoustic analysis of dysarthric speech to assess the gait difficulties in PD. We used a correlation analysis to investigate a relationship between speech features and FOG evaluated by freezing of gait questionnaire...

chapter

A new look at the automatic mapping between Arabic distinctive phonetic features and acoustic cues

Yousef Alotaibi, Yasser Seddiq, Ali Meftah, Sid-Ahmed Selouani, et al.

2017 40th International Conference on Telecommunications and Signal Processing (TSP) > 368 - 371

In this paper, the multidimensional phonological feature structure of Arabic is investigated. Our goal is to assess the performance of statistical and connectionist approaches in performing the complex mappings between distinctive phonetic features (DPF) and associated acoustic cues. The present study explores the mapping between 29 phonological voicing, place, and manner features and Mel-frequency...

chapter

Identification of hypokinetic dysarthria using acoustic analysis of poem recitation

Jan Mucha, Zoltan Galaz, Jiri Mekyska, Tomas Kiska, et al.

2017 40th International Conference on Telecommunications and Signal Processing (TSP) > 739 - 742

Up to 90% of patients with Parkinson's disease (PD) suffer from hypokinetic dysarthria (HD). In this work, we analysed the power of conventional speech features quantifying imprecise articulation, dysprosody, speech dysfluency and speech quality deterioration extracted from a specialized poem recitation task to discriminate dysarthric and healthy speech. For this purpose, 152 speakers (53 healthy...

chapter

Speaker-independent speech emotion recognition based on random forest feature selection algorithm

Wei-Hua Cao, Jian-Ping Xu, Zhen-Tao Liu

2017 36th Chinese Control Conference (CCC) > 10995 - 10998

Feature selection is a crucial step in the development of a system for identifying emotions in speech. How to select high correlation features is an open question. This paper focuses on feature selection method which aims to extract the most effective acoustic features to improve the performance of emotion recognition. Emotional feature selection of speaker-independent speech based on Random Forest...

chapter

Pre-processing voice signals for voice recognition systems

Gulmira K. Berdibaeva, Oleg N. Bodin, Valery V. Kozlov, Dmitry I. Nefed'ev, et al.

2017 18th International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices (EDM) > 242 - 245

The article considers the pre-processing voice signals for voice recognition systems based on the use of artificial neural networks. Based segmentation preprocessing is put in the speech signal according to a phonetic transcription of language, in order to reduce the amount of data supplied to the input of the neural network, which considerably improves its input data sensitivity. Application of numerical...

chapter

Speaker verification with mostly voiced speech for GMM/UBM and GMM/IBM systems

Vadim Ditlovich, Yuval Bistritz

2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON) > 1175 - 1180

The paper proposes the use of just mostly voiced speech (MVS) for speaker verification (SV). The speech is partitioned into an MVS part and a non-MVS part by a simple machine classification. SV experiments were held with a standard Gaussian mixture model (GMM) with universal background model (UBM) system and a GMM with computationally improved individual background model (IBM) system. They demonstrate...

INFONA - science communication portal
