Search results

Items from 1 to 16 out of 16 results

chapter

A cloud-based framework for Thai Large Vocabulary Speech Recognition

Sila Chunwijitra, Chanchai Junlouchai, Kamthorn Krairaksa, Vataya Chunwijitra, more

2016 13th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON) > 1 - 6

2016 13th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)

This paper presents an improvement of a distributed Thai speech recognizer (SR). Two main objectives of the improvement are investigated; 1) the response time in terms of a real-time factor (RTF), 2) the cloud computing deployment. The proposed framework adapts and migrates the baseline collaborative DSR system to the Docker platform. Multiple containers are shared system resources such as CPU, memory,...

chapter

Analysis of long-term and large-scale experiments on robot dialogues using a cloud robotics platform

Komei Sugiura, Koji Zettsu

2016 11th ACM/IEEE International Conference on Human-Robot Interaction (HRI) > 525 - 526

2016 11th ACM/IEEE International Conference on Human-Robot Interaction (HRI)

To build conversational robots, roboticists are required to have deep knowledge of both robotics and spoken dialogue systems. Unlike using stand-alone speech recognition/ synthesis toolkits, a cloud robotics platform for human-robot communication enables high-quality speech recognition and synthesis that is optimized to human-robot interactions. This is challenging because we need to build a wide...

chapter

Enhance run-time performance with a collaborative distributed speech recognition framework

Nattapong Kurpukdee, Phuttapong Sertsi, Sila Chunwijitra, Vataya Chunwijitra, more

2015 International Computer Science and Engineering Conference (ICSEC) > 1 - 6

2015 International Computer Science and Engineering Conference (ICSEC)

This paper presents an improvement of a distributed Thai speech recognizer, aiming to enhance system response time as measured by a real-time factor (RTF) for a better user experience. The system is designed based on a collaborative multi-agents and task workers concept. A Streaming Agent is introduced to manage speech signal transfer while a Recognition Agent is applied to manage speech recognition...

chapter

TREN - Turkish speech recognition platform

Hasan Palaz, Alper Kanak, Yucel Bicil, Mehmet Ugur Dogan, more

2005 13th European Signal Processing Conference > 1 - 4

2005 13th European Signal Processing Conference

TREN (Turkish Recognition ENgine) is a modular, HMM-based (Hidden Markov Model) and speaker-independent speech recognition system whose system software architecture is based on Distributed Component Object Model (DCOM). TREN contains specialized modules that allow a full interoperable platform including a Turkish speech recognizer, feature extractor, end-point detector and a performance monitoring...

chapter

Multi-user real-time speech recognition with a GPU

Jungsuk Kim, Wonyong Sung

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1617 - 1620

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

We have developed a multi-user large vocabulary speech recognition system employing a fully composed one-level weighted finite state transducer (WFST) based network on a Graphics Processing Unit (GPU). This system improves the overall throughput and latency of speech recognition engine which processes multiple users' utterances at the same time with efficient scheduling, parameter sharing, and communication...

chapter

Voice controlled environment for the assistive tools and living space control

Vytautas Rudzionis, Rytis Maskeliunas, Kestutis Driaunys

2012 Federated Conference on Computer Science and Information Systems (FedCSIS) > 1075 - 1080

2012 Federated Conference on Computer Science and Information Systems (FedCSIS)

This paper describes our efforts developing the smart home environment for the assistive living. The key element of the smart environment is the ubiquitous voice user interface with several additional capabilities (such as the recognition of several gestures). This work is a further development of voice controlled devices. The presence of the commercial speech recognition engines and our experience...

chapter

Swar-Suchak: Open source voice enabled information retrieval system

Punyabrata Ghatak, Mohan Singh, Chandan Kumar Goyal, Saurabh Banga, more

2011 International Conference on Recent Trends in Information Technology (ICRTIT) > 689 - 694

2011 International Conference on Recent Trends in Information Technology (ICRTIT)

The constant improvement of both hardware and software related to mobile computing is enhancing the capabilities of mobile devices. The present day mobile phones can run rich stand alone applications as well as distributed client-server applications that access information via a web gateway. This changed environment brings new opportunities as well as constraints for mobile application developers...

chapter

The Use of Cloud Speech Recognition Technology in Vehicle Diagnosis Applications

Shi-Huang Chen, Jun-Yu Chen, Kuo-Yuan Lu

2011 Fifth International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing > 567 - 570

2011 Fifth International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing (IMIS)

This paper developed a speech controlled interface with cloud computing technology for vehicle on-board diagnostic (OBD) system. The proposed vehicle OBD system is constructed by two parts. They are OBD embedded global position system (GPS-OBD) module and vehicle surveillance server. The speech recognition task is performed in vehicle surveillance server, instead of GPS-OBD module. The speech signal...

chapter

A Study on Speech Control Interface for Vehicle On-Board Diagnostic System

Shi-Huang Chen, Yu-Ru Wei

2010 Fourth International Conference on Genetic and Evolutionary Computing > 614 - 617

2010 Fourth International Conference on Genetic and Evolutionary Computing (ICGEC 2010)

This paper developed a speech controlled interface for vehicle on-board diagnostic (OBD) system. The proposed vehicle OBD system contains three parts. They are OBD embedded global position system (GPS-OBD) module, speech controlled interface, and vehicle surveillance server. The GPSOBD module is designed to monitor the real-time location as well as operation information of vehicle. The real-time location...

chapter

Real-time speaker adapted speech to speech translation system in mobile environment

Yong Guan, Lin Zheng, Jilei Tian

IEEE 10th INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS > 577 - 580

2010 10th International Conference on Signal Processing (ICSP 2010)

In this paper, a real-time speech to speech translation (S2ST) system in mobile environment is designed and implemented as a client-server architecture. Particularly, we apply cross lingual speaker adaptation to adapt synthesized speech to enrolling speaker to ensure personalization. This realtime S2ST system provides streaming way, multi-threading and speaker adapted speech to speech translation...

chapter

Personalized IVR system in contact center

M Soujanya, S Kumar

2010 International Conference on Electronics and Information Engineering > 1 > V1-453 - V1-457

2010 International Conference on Electronics and Information Engineering (ICEIE 2010)

One of the important challenges in today's contact center solution is to provide the service to the customer in a cost effective manner without disregarding the customer This paper describes the implementation of Personalized IVR system in Contact Center. Personalized IVRs are used to provide self service to the customer so as reducing the burden from the customer care representatives also called...

chapter

The Asian network-based speech-to-speech translation system

S. Sakti, N. Kimura, M. Paul, C. Hori, more

2009 IEEE Workshop on Automatic Speech Recognition&Understanding > 507 - 512

2009 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU 2009)

This paper outlines the first Asian network-based speech-to-speech translation system developed by the Asian Speech Translation Advanced Research (A-STAR) consortium. The system was designed to translate common spoken utterances of travel conversations from a certain source language into multiple target languages in order to facilitate multiparty travel conversations between people speaking different...

chapter

Implementation of G.729 Codec Based on DaVinci Technology

Xiangping Kong, Hezhi Lin, Lianfen Huang, Jianan Lin, more

2008 International Conference on MultiMedia and Information Technology > 11 - 14

2008 International Conference on Multimedia and Information Technology (MMIT 2008)

As the widely application of digital multi-media technology, G.729 has become one of the most popular audio standards. In this paper, the TMS320DM6446 which is a Davinci-based multi-core processor is chosen as our platform. We present our implementation method of G.729 algorithm which is compatible with eXpress DSP algorithm interface standard - digital media (xDAIS-DM, also called "xDM")...

chapter

A Speech-Enabled Assistive Collaborative Platform for Educational Purposes with User Personalization

V. Kolias, C. Kolias, I. Anagnostopoulos, G. Kambourakis, more

2008 Third International Workshop on Semantic Media Adaptation and Personalization > 157 - 163

2008 Third International Workshop on Semantic Media Adaptation and Personalization

With the proliferation of Web 2.0 applications, collaborative learning has gathered a lot of attention due its potentiality in the e-learning field. Forums, Wikis and Blogs for example are only some of the applications that exploit the collaborative nature of e-learning. However, these applications are originally designed for access from desktop systems and access to them when on the move can prove...

chapter

Design and Implementation of Voice Web Pages for Online Shopping Based on .NET and Streaming Media

Guoqiang Di, Yaoyao Liu, Lingchao Han, Jianping Wu

2008 International Conference on Management of e-Commerce and e-Government > 226 - 229

2008 International Conference on Management of e-Commerce and e-Government

Voice interaction is a conversation mode between human and computer pursued by people. Based on the .NET platform, the techniques of data binding and text to speech are used to create voice homepages at the web sites of online shopping, and the technique of streaming media is used to transfer data, which make it possible to browse text and play voice simultaneously, and even enable a blind person...

chapter

Thai voice application gateway

D. Kaitrungrit, M.N. Dailey, C. Wutiwiwatchai

2008 5th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology > 1 > 101 - 104

2008 5th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)

We propose and implement a low-cost Thai voice gateway that combines current technology in network systems and telephony. It enhances traditional telephony-based applications with access to resources on the Web. The system is based on open standards for speech technology and existing open source software. It supports the VoiceXML markup language for voice dialogs, the MRCP protocol for communication...

Filter options

Data set:
ieee
Keywords:
SERVERS
ENGINES
SPEECH

Publication date

Set your own date range

Keywords

SPEECH RECOGNITION (15)
SPEECH-BASED USER INTERFACES (4)
DISTRIBUTED SPEECH RECOGNITION (3)
HIDDEN MARKOV MODELS (3)
SPEECH SYNTHESIS (3)
CLIENT-SERVER ARCHITECTURE (2)
CLIENT-SERVER SYSTEMS (2)
CLOUD COMPUTING (2)
COLLABORATION (2)
DATABASES (2)
GLOBAL POSITION SYSTEM (GPS) (2)
GLOBAL POSITIONING SYSTEM (2)
LANGUAGE TRANSLATION (2)
LOGIC GATES (2)
MEDIA (2)
MOBILE COMMUNICATION (2)
ON-BOARD DIAGNOSTIC (OBD) (2)
OPEN SOURCE SOFTWARE (2)
REAL TIME SYSTEMS (2)
SPEECH-CONTROLLED INTERFACE (2)
SURVEILLANCE (2)
TELEMATICS (2)
TEXT TO SPEECH (2)
THAI SPEECH RECOGNIZER (2)
TIME FACTORS (2)
VEHICLES (2)
VOICEXML (2)
.NET PLATFORM (1)
3G (1)
3G MOBILE COMMUNICATION (1)
ACCURACY (1)
ACOUSTICS (1)
ADAPTATION MODEL (1)
ADAPTIVE HYPERMEDIA (1)
AMR CODING (1)
ANDROID (1)
API (1)
ASIAN LANGUAGES (1)
ASIAN NETWORK (1)
ASIAN SPEECH TRANSLATION ADVANCED RESEARCH (1)
ASP.NET 2003 (1)
AUDIO CODING (1)
AUDIO STANDARDS (1)
AUTOMATIC SPEECH RECOGNITION (1)
AUTOMATIC SPEECH RECOGNITION (ASR) (1)
CALL CENTER (1)
CALL CENTRES (1)
CODEC ENGINE (1)
CODECS (1)
COLLABORATIVE E-LEARNING (1)
COLLABORATIVE LEARNING (1)
COMPUTER AIDED INSTRUCTION (1)
COMPUTER ARCHITECTURE (1)
COMPUTERS (1)
CONTACT CENTER SOLUTION (1)
CONTAINERS (1)
CONTAINERS AND CLUSTERS (1)
CONTROL SYSTEMS (1)
CORRELATION (1)
COST EFFECTIVE MANNER (1)
CROSS LINGUAL SPEAKER ADAPTATION (1)
CUSTOMER CARE REPRESENTATIVES (1)
CUSTOMER SERVICES (1)
DATA BINDING (1)
DATA COMPRESSION (1)
DATA COMPRESSION ALGORITHM (1)
DATA MINING (1)
DAVINCI TECHNOLOGY (1)
DAVINCI-BASED MULTICORE PROCESSOR (1)
DIGITAL MEDIA (1)
DIGITAL MULTIMEDIA TECHNOLOGY (1)
DIGITAL SIGNAL PROCESSING (1)
DIGITAL SIGNAL PROCESSING CHIPS (1)
DRIVER INFORMATION SYSTEMS (1)
DSP (1)
DSP ALGORITHM (1)
E-LEARNING FIELD (1)
EDUCATIONAL PURPOSES (1)
EDUCATIONAL RESOURCES (1)
ELECTRONIC COMMERCE (1)
EMBEDDED GLOBAL POSITION SYSTEM (1)
EMBEDDED SYSTEMS (1)
EXPERT SYSTEMS (1)
FESTIVAL (1)
G.729 (1)
G.729 CODEC (1)
GPU (1)
GRAMMAR (1)
GRAPHICS PROCESSING UNIT (1)
GROUPWARE (1)
INSTRUMENTS (1)
INTERACTIVE SYSTEMS (1)
INTERACTIVE VOICE RESPONSE SYSTEMS (IVRS) (1)
INTERFACE STANDARD (1)
INTERNET (1)
INTERNETWORKING (1)
LVCSR (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options