Serwis Infona wykorzystuje pliki cookies (ciasteczka). Są to wartości tekstowe, zapamiętywane przez przeglądarkę na urządzeniu użytkownika. Nasz serwis ma dostęp do tych wartości oraz wykorzystuje je do zapamiętania danych dotyczących użytkownika, takich jak np. ustawienia (typu widok ekranu, wybór języka interfejsu), zapamiętanie zalogowania. Korzystanie z serwisu Infona oznacza zgodę na zapis informacji i ich wykorzystanie dla celów korzytania z serwisu. Więcej informacji można znaleźć w Polityce prywatności oraz Regulaminie serwisu. Zamknięcie tego okienka potwierdza zapoznanie się z informacją o plikach cookies, akceptację polityki prywatności i regulaminu oraz sposobu wykorzystywania plików cookies w serwisie. Możesz zmienić ustawienia obsługi cookies w swojej przeglądarce.
We investigate the role of top-down task drive attention in the cocktail party problem. In a recently proposed computational model of top-down attention it is possible to simulate the cocktail party problem and make predictions about sensitivity to confounders under different levels of attention. Based on such simulations we expect that under strong top-down attention pattern recognition is improved...
A great problem in speech communication is to identify the shape and characteristics of the vocal apparatus. This task is normally done by using a vocal tract model, based on the calculation of the function, area would be performed. We will show that these models have good performance in experiments. A mathematical model of voice box has been obtained. We suggest a new numerical method without saturation,...
Sound reproduced and perceived in different environments, or transmitted via diverse transmission channels shows distinctive acoustic characteristics, sometimes quoted simply as acoustics. Acoustics of an auditorium can be described by the transfer function from the source to the receiver, or the impulse response in the time domain, which can be measured by instrumentation using a number of methods...
The goal of our work is to create natural verbal interaction between humans and speech dialogue agents. In this paper, we focus on generations of back-channel for speech dialogue agents the same way humans do. To create such a system, the system needs to predict the appropriate timing of back-channel on the basis of the human's speech. For the prediction model, we use a neuro-dynamical system called...
In a leading service economy like India, services lie at the very center of economic activity. Competitive organizations now look not only at the skills and knowledge, but also at the behavior required by an employee to be successful on the job. Emotionally competent employees can effectively deal with occupational stress and maintain psychological well-being. This study explores the scope of the...
In this paper we present an experiment addressing the effect of voice pitch on the evaluation of a social robot receptionist. Twenty eight test participants interacted with two “female” robot characters: one with a high-pitched, exuberant voice, the other with a low-pitched, calm voice. Our results show that the high pitch robot was perceived significantly more attractive in terms of voice, behavior...
This paper presents reliability of MLP in speaker identification using characteristics extracted from their voices. Classification accuracy depends on speaking condition and varies up to 23% depending on the selected speaking condition. Results of simulation experiment show that MLP is effective in speaker identification, especially in the case of retelling and synchronous speech where we achieved...
Recent human performance research at the Naval Surface Warfare Center, Dahlgren Division (NSWCDD) has shown that increasing the number of concurrent voice communications tasks individual Navy watchstanders must handle is an uncompromising empirical barrier to streamlining crew sizes in future shipboard combat information centers. Subsequent work on this problem at the Naval Research Laboratory (NRL)...
Robotic systems are today capable of performing patrolling and surveillance tasks in indoor structured environments. However, they need to be designed by taking into account the operational environment and the specific task to be accomplished. This dependency from the specific features of task and environment (contextual information, according to Turner [11]), severely restricts their practical deployment...
This research reports the development and evaluation of Malay emotional voice corpora through listening evaluation, and how the numbers of emotion choices offered to evaluators affect the result of the evaluation. The voice corpora comprises of three emotions, namely anger, sadness and happiness being expressed by two male and two female actors. The voice corpora were evaluated in two separate listening...
This paper presented a robust sound recognition work applied to awareness for health/children/elderly care. Specific sound awareness services can be activated based on recognized sound classes for detecting human activities as health care. To attain this goal, this study developed key technologies as follows: 1) SNR-aware subspace signal enhancement, 2) pitch and power density-based sound/speech discrimination,...
Spoken interactions usually have accurate timing and alignment between interlocutors: turn-taking and topic flow are managed in a manner that provides conversational fluency and smooth progress of the interaction. Turn-taking and topic flow are also important in applications such as robot companions that interact with a user in real time. The creation of a multimodal conversational corpus for modeling...
Human Speech conveys speaker's emotional state along with linguistic intelligence. Meaning of a speech sample changes when it is uttered with different emotions. The present paper gives a description of different types of studies conducted to analyze, perceive and recognize commonly occurring emotions in Hindi speech. These have been classified as anger, happiness, fear, sadness, surprise in addition...
This paper presents experiments where natural and spontaneous cognitive processes, in particular those who lead to the attribution of personality traits to unacquainted people, are used as a natural form of feature extraction. In particular, personality assessments provided by human judges are used as features to distinguish between professional and non-professional speakers. The same task is performed...
We consider how the continuity of form in natural sounds may be used to discover sparse time-frequency representations. To proceed, we describe a method to represent any sound as a collection of contours in the time-frequency plane. By analysing the signal in many time-scales, an over-complete set of shapes is generated for a given sound. From this redundant set of shapes the simplest, most parsimonious...
During face-to-face interpersonal interaction, people have a tendency to mimic each other. People not only mimic postures, mannerisms, moods or emotions, but they also mimic several speech-related behaviors. In this paper we describe how visual and vocal behavioral information expressed between two interlocutors can be used to detect and identify visual and vocal mimicry. We investigate expressions...
Social signal processing has the ambitious goal of bridging the social intelligence gap between computers and humans. Nowadays, computers are not only the new interaction partners of humans, but also a privileged interaction medium for social exchange between humans. Consequently, enhancing machine abilities to interpret and reproduce social signals is a crucial requirement for improving computer-mediated...
Online discussion boards are an important medium for collaboration. The goal of our work is to understand how messages and individual discussants contribute to Q&A discussions. We present a novel network model for capturing in-formation roles of messages and discussants, and show how we identify useful answers to the initial question. We first classify information seeking or information providing...
With the rapid development of the Internet, steganography on Voice over IP (VoIP) has been attracted a lot of research efforts. To date, existing VoIP steganography research commonly focus on information hiding in the LSB bits of Network Audio Streams, yet, we found this approach may raise serious security threat, where the hidden information may be easily removed, detected and attacked. Towards this...
The increasing ability of industrial robots to perform complex tasks in collaboration with humans requires more capable ways of communication and interaction. Traditional systems use separate interfaces such as touchscreens or control panels in order to operate the robot, or to communicate its state and prospective actions to the user. Transferring human communication, such as gestures to technical...
Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.