Advanced search

chapter

Improved acoustic modeling of low-resource languages using shared SGMM parameters of high-resource languages

Neethu Mariam Joy, Basil Abraham, Navneeth K, S. Umesh

2016 Twenty Second National Conference on Communication (NCC) > 1 - 6

2016 Twenty Second National Conference on Communication (NCC)

In this paper, we investigate methods to improve the recognition performance of low-resource languages with limited training data by borrowing subspace parameters from a high-resource language in subspace Gaussian mixture model (SGMM) framework. As a first step, only the state-specific vectors are updated using low-resource language, while retaining all the globally shared parameters from the high-resource...

chapter

An Innovative Statistical Tool for Automatic OWL-ERD Alignment

Arianna Pipitone, Francesca Anastasio, Roberto Pirrone

2016 IEEE Tenth International Conference on Semantic Computing (ICSC) > 96 - 99

2016 IEEE Tenth International Conference on Semantic Computing (ICSC)

Aligning two representations of the same domain with different expressiveness is a crucial topic in nowadays semantic web and big data research. OWL ontologies and Entity Relation Diagrams are the most widespread representations whose alignment allows for semantic data access via ontology interface, and ontology storing techniques. The term ""alignment" encompasses three different processes:...

chapter

High-Level Surveillance Event Detection Using an Interval-Based Query Language

Sven Helmer, Fabio Persia

2016 IEEE Tenth International Conference on Semantic Computing (ICSC) > 39 - 46

2016 IEEE Tenth International Conference on Semantic Computing (ICSC)

We propose a language based on relational algebra extended by intervals for detecting high-level surveillance events from a video stream. The operators we introduce for describing temporal constraints are based on the well-known Allen's interval relationships. The semantics of our language are clearly defined and we illustrate its usefulness by expressing typical events in it and showing the promising...

chapter

An Efficient Approach of Training Artificial Neural Network to Recognize Bengali Hand Sign

Alvi Mahadi, Fatema Tuj Johora, Mohammad Abu Yousuf

2016 IEEE 6th International Conference on Advanced Computing (IACC) > 152 - 157

2016 IEEE 6th International Conference on Advanced Computing (IACC)

This work proposes a system that percepts handsigns and gestures via computer vision system and extractsufficient amount of images from it. After applying imageprocessing and extracting the features of the images, the systemuses an algorithm to recognize the hand signs and gestures. Inthe process of recognizing the hand signs, the Artificial NeuralNetwork (ANN) is being trained with some specific...

article

Robust Face Sketch Style Synthesis

Shengchuan Zhang, Xinbo Gao, Nannan Wang, Jie Li

IEEE Transactions on Image Processing > 2016 > 25 > 1 > 220 - 232

Heterogeneous image conversion is a critical issue in many computer vision tasks, among which example-based face sketch style synthesis provides a convenient way to make artistic effects for photos. However, existing face sketch style synthesis methods generate stylistic sketches depending on many photo-sketch pairs. This requirement limits the generalization ability of these methods to produce arbitrarily...

chapter

MFCC feature with optimized frequency range: An essential step for emotion recognition

Subhasmita Sahoo, Aurobinda Routray

2016 International Conference on Systems in Medicine and Biology (ICSMB) > 162 - 165

2016 International Conference on Systems in Medicine and Biology (ICSMB)

One of the major challenge in human emotion recognition is extraction of features containing maximum prosodic information. The accuracy of entire emotion detection system eventually relies upon the efficiency of the selected feature. When it comes to identifying emotions from voice, ambiguity in detection can never be completely avoided due to several reasons. Exclusion of redundant information to...

chapter

Emotion recognition from facial image analysis using composite similarity measure aided bidimensional empirical mode decomposition

Arghya Bhattacharya, Dwaipayan Choudhury, Debangshu Dey

2016 IEEE First International Conference on Control, Measurement and Instrumentation (CMI) > 336 - 340

2016 IEEE First International Conference on Control, Measurement and Instrumentation (CMI)

The aim of this work is to automatically detect and analyse the emotions from the digital videos and images. Initially the images are extracted from pre-recorded videos, from which the faces are cropped automatically. The training dataset is formed with minimal number of images per subject for each emotion. Bi-dimensional Empirical Mode Decomposition (BEMD) is used to decompose the images in its Intrinsic...

chapter

Generalizing a closed-form correlation model of oriented bandpass natural images

Zeina Sinno, Alan C. Bovik

2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP) > 373 - 377

2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP)

Building natural scene statistic models is a potentially transformative development for a wide variety of visual applications, ranging from the design of faithful image and video quality models to the development of perceptually optimized image enhancing techniques. Most predominant statistical models of natural images only characterize the univariate distributions of divisively normalized bandpass...

chapter

English learning system of oral phonation based on phoneme and smart phone platform

Sun Yutong, Li Xuan

2015 7th International Conference on Modelling, Identification and Control (ICMIC) > 1 - 8

2015 7th International Conference on Modelling, Identification and Control (ICMIC)

Design a software system on smart phone platform. The purpose of this system is providing a reasonable method to evaluate the English accent of non-native speakers, based on the phoneme recognition and fluency assessment, taking advantage of Hidden Markov Model (HMM). Meanwhile, this paper would use the neural net algorithm to combine the objective scoring and experts' scoring to increase the accuracy...

chapter

A layered two-step Hidden Markov Model positioning method for mine environments based on Wi-Fi signals

Junyi Yu, Yongfeng Huang

2015 4th International Conference on Computer Science and Network Technology (ICCSNT) > 1 > 1160 - 1164

2015 4th International Conference on Computer Science and Network Technology (ICCSNT)

The safety of miners is of interest to all countries. In the event of a coal mine disaster, how to locate the miners remains the biggest and most urgent issue. The aim of this study is to propose a precise positioning method for underground mine environments. In this paper, a layered two-step Hidden Markov Model is proposed to simulate human walking in underground mine environments and an improved...

chapter

Stressed speech analysis using sparse representation over temporal information based dictionary

Bhanu Priya, S. Dandapat

2015 Annual IEEE India Conference (INDICON) > 1 - 6

2015 Annual IEEE India Conference (INDICON)

In this paper, a novel sparse representation over learned and exemplar dictionaries is explored to estimate the speech information of stressed speech. Stressed speech contains speech and stress informations. The acoustic variabilities are induced due to presence of stress information, which results in degradation of the performance of speech recognition system. In this work, the acoustic variabilities...

chapter

Reduced feature extraction for emotional speech recognition

Hemanta Kumar Palo, Mihir Narayan Mohanty

2015 Annual IEEE India Conference (INDICON) > 1 - 5

2015 Annual IEEE India Conference (INDICON)

There has been a considerable use of acoustic features for speaker identification and recognition. Few of these features have also been used by researchers to recognize emotions in speech effectively. Here an attempt is made to characterize human speech emotions with acoustic features as speech rate, formant frequencies, amplitude and energy initially. Further, a reduced acoustic feature set based...

chapter

American midland dialect identification using prosodic features and SVM

A. Etman, A. A. Louis Beex

2015 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) > 516 - 521

2015 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)

There has been confusion about the American Midland dialect for a long time. Since 1968, researchers have been looking for an answer to the question of whether it exists or not. Starting with Bailey, who was unsuccessful in identifying the Midland dialect based on vocabulary only, as vocabulary varies within the same community, and ending with Johnson, who proved that the Midland region is a separate...

chapter

An HMM approach for synthesizing amused speech with a controllable intensity of smile

Kevin El Haddad, Huseyin Cakmak, Alexis Moinet, Stephane Dupont, more

2015 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) > 7 - 11

2015 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)

Smile is not only a visual expression. When it occurs together with speech, it also alters its acoustic realization. Being able to synthesize speech altered by the expression of smile can hence be an important contributor for adding naturalness and expressiveness in interactive systems. In this work, we present a first attempt to develop a Hidden Markov Model (HMM)-based synthesis system allowing...

chapter

Deep neural network based acoustic model using speaker-class information for short time utterance

Hiroshi Seki, Kazumasa Yamamoto, Seiichi Nakagawa

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1222 - 1225

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

In speech recognition, it is preferable not to hypothesize the details, e.g., specific age and gender, of a target user. However, speaker independence is one of the things that degrades ASR performance. In this work, we propose a speaker adaptation method to recognize a short time utterance. There have been several studies on speaker-independent DNN-HMM in which i-vector is computed, and the additional...

chapter

Static hand gesture recognition for vietnamese sign language (VSL) using principle components analysis

Thao Nguyen Thi Huong, Tien Vu Huu, Thanh Le Xuan, San Vu Van

2015 International Conference on Communications, Management and Telecommunications (ComManTel) > 138 - 141

2015 International Conference on Communications, Management and Telecommunications (ComManTel)

Nowadays, hand gesture is one of main considerations for hearing impaired people because they use sign language to communicate with each other and to normal people. In general, the normal people have difficulties with sign language therefore they need an interpreter supporting communication. Then the automatic hand gesture recognition system is needed to help hearing impaired people integrating into...

chapter

Use of Multiple Classifier System for Gender Driven Speech Emotion Recognition

Pravina P. Ladde, Vaishali S. Deshmukh

2015 International Conference on Computational Intelligence and Communication Networks (CICN) > 713 - 717

2015 International Conference on Computational Intelligence and Communication Networks (CICN)

This paper proposes a system that allows recognizing a person's emotional state with the help of recording audio signals. This system is able to recognize four emotions (anger, happiness, sadness and neutral) This emotion recognition technique is mainly composed of two subsystems as - 1) gender recognition (GR) and 2) emotion recognition (ER). It has been proved experimentally that the performance...

chapter

Hidden Markov models & principal component analysis for multispectral palmprint identification

Abdallah Meraoumia, Maarouf Korichi, Salim Chitroub, Ahmed Bouridane

2015 5th International Conference on Information & Communication Technology and Accessibility (ICTA) > 1 - 6

2015 5th International Conference on Information & Communication Technology and Accessibility (ICTA)

Automatic personal identification from their physical and behavioral traits, called biometrics technologies, is now needed in many fields such as: surveillance systems, access control systems, physical buildings and many more applications. In this paper, we propose an efficient online personal identification system based on Multi-Spectral Palmprint images (MSP) using Hidden Markov Model (HMM) and...

chapter

Rangegram: A novel payload based anomaly detection technique against web traffic

Mayank Swarnkar, Neminath Hubballi

2015 IEEE International Conference on Advanced Networks and Telecommuncations Systems (ANTS) > 1 - 6

2015 IEEE International Conference on Advanced Networks and Telecommuncations Systems (ANTS)

Application specific intrusion detection methods are used to detect network intrusions targeted at applications. Normally such detection methods require payload or packet content analysis. One of the prominent method of payload modeling and analysis is sequence or ngram modeling. Normally ngrams generated from a packet are compared with a database of ngrams seen during training phase. Depending on...

chapter

Real-time gesture recognition based on motion quality analysis

Celine Jost, Pierre De Loor, Lexis Nedelec, Elisabetta Bevacqua, more

2015 7th International Conference on Intelligent Technologies for Interactive Entertainment (INTETAIN) > 47 - 56

2015 7th International Conference on Intelligent Technologies for Interactive Entertainment (INTETAIN)

This paper presents a robust and anticipative realtime gesture recognition and its motion quality analysis module. By utilizing a motion capture device, the system recognizes gestures performed by a human, where the recognition process is based on skeleton analysis and motion features computation. Gestures are collected from a single person. Skeleton joints are used to compute features which are stored...

INFONA - science communication portal

Advanced search

Advanced search in people

Improved acoustic modeling of low-resource languages using shared SGMM parameters of high-resource languages

An Innovative Statistical Tool for Automatic OWL-ERD Alignment

High-Level Surveillance Event Detection Using an Interval-Based Query Language

An Efficient Approach of Training Artificial Neural Network to Recognize Bengali Hand Sign

Robust Face Sketch Style Synthesis

MFCC feature with optimized frequency range: An essential step for emotion recognition

Emotion recognition from facial image analysis using composite similarity measure aided bidimensional empirical mode decomposition

Generalizing a closed-form correlation model of oriented bandpass natural images

English learning system of oral phonation based on phoneme and smart phone platform

A layered two-step Hidden Markov Model positioning method for mine environments based on Wi-Fi signals

Stressed speech analysis using sparse representation over temporal information based dictionary

Reduced feature extraction for emotional speech recognition

American midland dialect identification using prosodic features and SVM

An HMM approach for synthesizing amused speech with a controllable intensity of smile

Deep neural network based acoustic model using speaker-class information for short time utterance

Static hand gesture recognition for vietnamese sign language (VSL) using principle components analysis

Use of Multiple Classifier System for Gender Driven Speech Emotion Recognition

Hidden Markov models & principal component analysis for multispectral palmprint identification

Rangegram: A novel payload based anomaly detection technique against web traffic

Real-time gesture recognition based on motion quality analysis

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Advanced search

Advanced search in people

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options