Wyniki wyszukiwania

rozdział

The Role of Top-Down Attention in the Cocktail Party: Revisiting Cherry's Experiment after Sixty Years

Letizia Marchegiani, Seliz G. Karadogan, Tobias Andersen, Jan Larsen, więcej

2011 10th International Conference on Machine Learning and Applications and Workshops > 1 > 183 - 188

2011 Tenth International Conference on Machine Learning and Applications (ICMLA 2011)

We investigate the role of top-down task drive attention in the cocktail party problem. In a recently proposed computational model of top-down attention it is possible to simulate the cocktail party problem and make predictions about sensitivity to confounders under different levels of attention. Based on such simulations we expect that under strong top-down attention pattern recognition is improved...

rozdział

Identification of predominent frequencies in a speech signal using modeling of vocal chord

V. S. Balaji, N. R. Raajan, Har Narayan Upadhayay

2011 INTERNATIONAL CONFERENCE ON RECENT ADVANCEMENTS IN ELECTRICAL, ELECTRONICS AND CONTROL ENGINEERING > 478 - 481

2011 International Conference on Recent Advancements in Electrical, Electronics and Control Engineering (ICONRAEeCE)

A great problem in speech communication is to identify the shape and characteristics of the vocal apparatus. This task is normally done by using a vocal tract model, based on the calculation of the function, area would be performed. We will show that these models have good performance in experiments. A mathematical model of voice box has been obtained. We suggest a new numerical method without saturation,...

rozdział

A Cognitive Model to Mimic an Aspect of Low Level Perception of Sound: Modelling Reverberation Perception by Statistical Signal Analysis

Francis F. Li

2011 Second International Conference on Innovations in Bio-inspired Computing and Applications > 348 - 351

2011 2nd International Conference on Innovations in Bio-Inspired Computing and Applications (IBICA)

Sound reproduced and perceived in different environments, or transmitted via diverse transmission channels shows distinctive acoustic characteristics, sometimes quoted simply as acoustics. Acoustics of an auditorium can be described by the transfer function from the source to the receiver, or the impulse response in the time domain, which can be measured by instrumentation using a number of methods...

rozdział

Predicting listener back-channels for human-agent interaction using neuro-dynamical model

Shotaro Sano, Shun Nishide, Hiroshi G. Okuno, Tetsuya Ogata

2011 IEEE/SICE International Symposium on System Integration (SII) > 18 - 23

2011 IEEE/SICE International Symposium on System Integration (SII 2011)

The goal of our work is to create natural verbal interaction between humans and speech dialogue agents. In this paper, we focus on generations of back-channel for speech dialogue agents the same way humans do. To create such a system, the system needs to predict the appropriate timing of back-channel on the basis of the human's speech. For the prediction model, we use a neuro-dynamical system called...

rozdział

Jitter measurements for performance enhancement in the service sector

Agnes Jacob, P. Mythili

2011 Annual IEEE India Conference > 1 - 4

2011 Annual IEEE India Conference (INDICON)

In a leading service economy like India, services lie at the very center of economic activity. Competitive organizations now look not only at the skills and knowledge, but also at the behavior required by an employee to be successful on the job. Emotionally competent employees can effectively deal with occupational stress and maintain psychological well-being. This study explores the scope of the...

rozdział

The influence of voice pitch on the evaluation of a social robot receptionist

Andreea Niculescu, Betsy van Dijk, Anton Nijholt, Swee Lan See

2011 International Conference on User Science and Engineering (i-USEr ) > 18 - 23

2011 International Conference on User Science and Engineering (i-USEr 2011)

In this paper we present an experiment addressing the effect of voice pitch on the evaluation of a social robot receptionist. Twenty eight test participants interacted with two “female” robot characters: one with a high-pitched, exuberant voice, the other with a low-pitched, calm voice. Our results show that the high pitch robot was perceived significantly more attractive in terms of voice, behavior...

rozdział

Speaker identification in smart environments with multilayer perceptron

Jasmina Novakovic

2011 19thTelecommunications Forum (TELFOR) Proceedings of Papers > 1418 - 1421

2011 19th Telecommunications Forum Telfor (TELFOR)

This paper presents reliability of MLP in speaker identification using characteristics extracted from their voices. Classification accuracy depends on speaking condition and varies up to 23% depending on the selected speaking condition. Results of simulation experiment show that MLP is effective in speaker identification, especially in the case of retelling and synchronous speech where we achieved...

rozdział

Facilitating the watchstander's voice communications task in future Navy operations

Derek Brock, Christina Wasylyshyn, Brian McClimens, Dennis Perzanowski

2011 - MILCOM 2011 Military Communications Conference > 2222 - 2226

MILCOM 2011 - 2011 IEEE Military Communications Conference

Recent human performance research at the Naval Surface Warfare Center, Dahlgren Division (NSWCDD) has shown that increasing the number of concurrent voice communications tasks individual Navy watchstanders must handle is an uncompromising empirical barrier to streamlining crew sizes in future shipboard combat information centers. Subsequent work on this problem at the Naval Research Laboratory (NRL)...

rozdział

User-friendly security robots

Gabriele Randelli, Luca Iocchi, Daniele Nardi

2011 IEEE International Symposium on Safety, Security, and Rescue Robotics > 308 - 313

2011 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR)

Robotic systems are today capable of performing patrolling and surveillance tasks in indoor structured environments. However, they need to be designed by taking into account the operational environment and the specific task to be accomplished. This dependency from the specific features of task and environment (contextual information, according to Turner [11]), severely restricts their practical deployment...

rozdział

Assessing the naturalness of malay emotional voice corpora

Mumtaz B. Mustafa, Raja N. Ainon, Roziati Zainuddin, Zuraidah M. Don, więcej

2011 International Conference on Speech Database and Assessments (Oriental COCOSDA) > 174 - 179

2011 Oriental COCOSDA 2011 - International Conference on Speech Database and Assessments

This research reports the development and evaluation of Malay emotional voice corpora through listening evaluation, and how the numbers of emotion choices offered to evaluators affect the result of the evaluation. The voice corpora comprises of three emotions, namely anger, sadness and happiness being expressed by two male and two female actors. The voice corpora were evaluated in two separate listening...

rozdział

Robust sound recognition applied to awareness for health/children/elderly care

Jhing-Fa Wang, Po-Yi Shih, Zhon-Hua Fu, Sheng-Chieh Lee

2011 IEEE International Conference on Systems, Man, and Cybernetics > 216 - 219

2011 IEEE International Conference on Systems, Man and Cybernetics - SMC

This paper presented a robust sound recognition work applied to awareness for health/children/elderly care. Specific sound awareness services can be activated based on recognized sound classes for detecting human activities as health care. To attain this goal, this study developed key technologies as follows: 1) SNR-aware subspace signal enhancement, 2) pitch and power density-based sound/speech discrimination,...

rozdział

A multimodal corpus for modeling turn management in multi-party conversations

H. Furukawa, M. Nishida, K. Jokinen, S. Yamamoto

2011 International Conference on Speech Database and Assessments (Oriental COCOSDA) > 142 - 146

2011 Oriental COCOSDA 2011 - International Conference on Speech Database and Assessments

Spoken interactions usually have accurate timing and alignment between interlocutors: turn-taking and topic flow are managed in a manner that provides conversational fluency and smooth progress of the interaction. Turn-taking and topic flow are also important in applications such as robot companions that interact with a user in real time. The creation of a multimodal conversational corpus for modeling...

rozdział

Emotions in Hindi speech- analysis, perception and recognition

S.S. Agrawal

2011 International Conference on Speech Database and Assessments (Oriental COCOSDA) > 7 - 13

2011 Oriental COCOSDA 2011 - International Conference on Speech Database and Assessments

Human Speech conveys speaker's emotional state along with linguistic intelligence. Meaning of a speech sample changes when it is uttered with different emotions. The present paper gives a description of different types of studies conducted to analyze, perceive and recognize commonly occurring emotions in Hindi speech. These have been classified as anger, happiness, fear, sadness, surprise in addition...

rozdział

Humans as feature extractors: Combining prosody and personality perception for improved speaking style recognition

Gelareh Mohammadi, Alessandro Vinciarelli

2011 IEEE International Conference on Systems, Man, and Cybernetics > 363 - 366

2011 IEEE International Conference on Systems, Man and Cybernetics - SMC

This paper presents experiments where natural and spontaneous cognitive processes, in particular those who lead to the attribution of personality traits to unacquainted people, are used as a natural form of feature extraction. In particular, personality assessments provided by human judges are used as features to distinguish between professional and non-professional speakers. The same task is performed...

rozdział

Contour representations of sound

Yoonseob Lim, Barbara Shinn-Cunningham, Timothy Gardner

2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 317 - 320

2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

We consider how the continuity of form in natural sounds may be used to discover sparse time-frequency representations. To proceed, we describe a method to represent any sound as a collection of contours in the time-frequency plane. By analysing the signal in many time-scales, an over-complete set of shapes is generated for a given sound. From this redundant set of shapes the simplest, most parsimonious...

rozdział

Towards visual and vocal mimicry recognition in human-human interactions

Xiaofan Sun, Khiet P. Truong, Maja Pantic, Anton Nijholt

2011 IEEE International Conference on Systems, Man, and Cybernetics > 367 - 373

2011 IEEE International Conference on Systems, Man and Cybernetics - SMC

During face-to-face interpersonal interaction, people have a tendency to mimic each other. People not only mimic postures, mannerisms, moods or emotions, but they also mimic several speech-related behaviors. In this paper we describe how visual and vocal behavioral information expressed between two interlocutors can be used to detect and identify visual and vocal mimicry. We investigate expressions...

rozdział

Recent developments in social signal processing

Albert Ali Salah, Maja Pantic, Alessandro Vinciarelli

2011 IEEE International Conference on Systems, Man, and Cybernetics > 380 - 385

2011 IEEE International Conference on Systems, Man and Cybernetics - SMC

Social signal processing has the ambitious goal of bridging the social intelligence gap between computers and humans. Nowadays, computers are not only the new interaction partners of humans, but also a privileged interaction medium for social exchange between humans. Consequently, enhancing machine abilities to interpret and reproduce social signals is a crucial requirement for improving computer-mediated...

rozdział

Analyzing Answers in Threaded Discussions Using a Role-Based Information Network

Jeon-Hyung Kang, Jihie Kim

2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing > 111 - 117

2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust (PASSAT) / 2011 IEEE Third Int'l Conference on Social Computing (SocialCom)

Online discussion boards are an important medium for collaboration. The goal of our work is to understand how messages and individual discussants contribute to Q&A discussions. We present a novel network model for capturing in-formation roles of messages and discussants, and show how we identify useful answers to the initial question. We first classify information seeking or information providing...

rozdział

Adaptive VoIP Steganography for Information Hiding within Network Audio Streams

Erchi Xu, Bo Liu, Liyang Xu, Ziling Wei, więcej

2011 14th International Conference on Network-Based Information Systems > 612 - 617

2011 14th International Conference on Network-Based Information Systems (NBiS)

With the rapid development of the Internet, steganography on Voice over IP (VoIP) has been attracted a lot of research efforts. To date, existing VoIP steganography research commonly focus on information hiding in the LSB bits of Network Audio Streams, yet, we found this approach may raise serious security threat, where the hidden information may be easily removed, detected and attacked. Towards this...

rozdział

A human-centered approach to robot gesture based communication within collaborative working processes

Tobias Ende, Sami Haddadin, Sven Parusel, Tilo Wusthoff, więcej

2011 IEEE/RSJ International Conference on Intelligent Robots and Systems > 3367 - 3374

2011 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2011)

The increasing ability of industrial robots to perform complex tasks in collaboration with humans requires more capable ways of communication and interaction. Traditional systems use separate interfaces such as touchscreens or control panels in order to operate the robot, or to communicate its state and prospective actions to the user. Transferring human communication, such as gestures to technical...

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania

The Role of Top-Down Attention in the Cocktail Party: Revisiting Cherry's Experiment after Sixty Years

Identification of predominent frequencies in a speech signal using modeling of vocal chord

A Cognitive Model to Mimic an Aspect of Low Level Perception of Sound: Modelling Reverberation Perception by Statistical Signal Analysis

Predicting listener back-channels for human-agent interaction using neuro-dynamical model

Jitter measurements for performance enhancement in the service sector

The influence of voice pitch on the evaluation of a social robot receptionist

Speaker identification in smart environments with multilayer perceptron

Facilitating the watchstander's voice communications task in future Navy operations

User-friendly security robots

Assessing the naturalness of malay emotional voice corpora

Robust sound recognition applied to awareness for health/children/elderly care

A multimodal corpus for modeling turn management in multi-party conversations

Emotions in Hindi speech- analysis, perception and recognition

Humans as feature extractors: Combining prosody and personality perception for improved speaking style recognition

Contour representations of sound

Towards visual and vocal mimicry recognition in human-human interactions

Recent developments in social signal processing

Analyzing Answers in Threaded Discussions Using a Role-Based Information Network

Adaptive VoIP Steganography for Information Hiding within Network Audio Streams

A human-centered approach to robot gesture based communication within collaborative working processes

Opcje filtrowania

Data publikacji

Dostępność treści

Słowa kluczowe

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Dostępność treści

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu