Search results

Items from 21 to 40 out of 2,639 results

chapter

Research on voiceprint recognition based on weighted clustering recognition SVM algorithm

Yang Wu, Lihong Xu, Yandong Chen, Xueyang Zhang

2017 Chinese Automation Congress (CAC) > 1144 - 1148

2017 Chinese Automation Congress (CAC)

Support vector machine (SVM) algorithm received much attention in the research of voiceprint recognition, especially for small sample datasets. However, with the increase of recognition number and speech features number, the rate of model training and recognition is significantly reduced. In order to solve the problem, a new weighted clustering algorithm is proposed, which use “one to one” SVM model...

chapter

Traffic pattern modeling, trajectory classification and vehicle tracking within urban intersections

Cheng-En Wu, Wen-Yen Yang, Hai-Che Ting, Jia-Shung Wang

2017 International Smart Cities Conference (ISC2) > 1 - 6

2017 International Smart Cities Conference (ISC2)

Traffic behavioral monitoring within urban intersections is an essential issue in the Intelligent Transportation Systems (ITS) for a smart city. This paper investigates on gathering traffic information within an urban intersection where accidents frequently occur. In this paper, traffic pattern modeling, trajectory classification and a real-time vehicle tracker within the urban intersection are proposed...

chapter

Semi-Supervised travel mode detection from smartphone data

Mohsen Rezaie, Zachary Patterson, Jia Yuan Yu, Ali Yazdizadeh

2017 International Smart Cities Conference (ISC2) > 1 - 8

2017 International Smart Cities Conference (ISC2)

With the advent of the incorporation of GPS receivers and then GPS-enabled smartphones in transportation data collection, many studies have looked at how to infer meaningful information from this data. Research in this field has concentrated on the use of heuristics and supervised machine learning methods to detect: trip ends, trip itineraries, travel mode and trip purpose. All the methods used until...

chapter

Single-channel speech separation based on deep clustering with local optimization

Taotao Fu, Ge Yu, Lili Guo, Yan Wang, more

2017 3rd International Conference on Frontiers of Signal Processing (ICFSP) > 44 - 49

2017 3rd International Conference on Frontiers of Signal Processing (ICFSP)

There are many challenges in single-channel multi-person mixed speech separation, such as modeling the temporal continuity of the speech signals and improving the frame separation performance simultaneously. In this paper, a separation method based on Deep Clustering with local optimization by the improved Non-Negative Matrix Factorization (NMF) combined with Factorial Conditional Random Fields (FCRF)...

chapter

Novel alignment method for DNN TTS training using HMM synthesis models

Sinisa Suzic, Tijana Delic, Darko Pekar, Vladimir Ostojic

2017 IEEE 15th International Symposium on Intelligent Systems and Informatics (SISY) > 271 - 276

2017 IEEE 15th International Symposium on Intelligent Systems and Informatics (SISY)

In order to train neural networks (NN) for text-to-speech synthesis (TTS), phonetic segmentation must be performed. The most accurate segmentation is performed manually, but the process of creating manual alignments is costly and time-consuming, so automatic procedures are preferable. In this paper, a simple alignment method based on models trained during hidden Markov Model (HMM) based TTS system...

chapter

Human action recognition with hidden Markov models and neural network derived poses

Egbert Gedat, Pascal Fechner, Richard Fiebelkorn, Ralf Vandenhouten

2017 IEEE 15th International Symposium on Intelligent Systems and Informatics (SISY) > 157 - 162

2017 IEEE 15th International Symposium on Intelligent Systems and Informatics (SISY)

A human action recognition method is introduced that detects a set of actions in videos by a temporal expansion with hidden Markov models of a pose detection with an artificial neural network. The method was set-up and tested using eleven actions from the MOCAP motion capture database comprising 3,947 frames. A poses alphabet of fourteen relevant poses was defined to be learned by an artificial neural...

chapter

A hardware/software co-design architecture for ultrasonic flaw detection with Hidden Markov Model and wavelet transform

Kushal Virupakshappa, Erdal Oruklu

2017 IEEE International Ultrasonics Symposium (IUS) > 1

2017 IEEE International Ultrasonics Symposium (IUS)

This work presents an embedded hardware architecture for real-time ultrasonic NDE applications that incorporate Hidden Markov Model (HMM) based statistical signal methods. HMM has been successfully used in applications like audio segment retrieval, speech/language recognition and image processing applications. Recently, we proposed a new Hidden Markov Model (HMM) based ultrasonic flaw detection algorithm...

chapter

A hardware/software co-design architecture for ultrasonic flaw detection with Hidden Markov Model and Wavelet Transform

Kushal Virupakshappa, Erdal Oruklu

2017 IEEE International Ultrasonics Symposium (IUS) > 1 - 4

2017 IEEE International Ultrasonics Symposium (IUS)

This work presents an embedded hardware architecture for real-time ultrasonic NDE applications that incorporate Hidden Markov Model (HMM) based statistical signal methods. Proposed algorithm is a combination of Discrete Wavelet Transform (DWT) for pre-processing A-scan signals and HMM for classification of the flaw presence. For this study, a MicroZed FPGA with Xilinx Zynq-7020 System-on-Chip (SoC)...

chapter

A rule based fuzzy gesture recognition system to interact with Sphero 2.0 using a smart phone

Aykut Beke, Ahmet Arda Yuceler, Tufan Kumbasar

2017 International Artificial Intelligence and Data Processing Symposium (IDAP) > 1 - 4

2017 International Artificial Intelligence and Data Processing Symposium (IDAP)

In this study, we will present a rule based fuzzy gesture recognition system where a user will interact with a spherical robot with hand gestures performed with a smart phone and the droid will respond by imitating this movements. In this context, we will take up the Gesture Recognition, Fuzzy Logic and Internet of Things (IoT) frameworks to construct such a Human-Machine Interface (HMI). In the proposed...

chapter

Voice transformation using pitch and spectral mapping

Anisha Yathigiri, Meenalatha Bathula, Susmitha Kothapalli, Susmitha Vekkot, more

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1540 - 1544

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

This paper provides a voice transformation model that uses pitch data and Feed-forward Neural Networks on Line Spectral Frequency. The aim of this work is to achieve the transformation of a speech signal produced by a source speaker by modifying voice individuality parameters such that it appears to be spoken by a chosen target speaker, without modifying the message contents. Most of the previous...

chapter

Unsupervised multiview learning with partial distribution information

Shashini De Silva, Jinsub Kim, Raviv Raich

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 6

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)

We consider a training data collection mechanism wherein, instead of annotating each training instance with a class label, additional features drawn from a known class-conditional distribution are acquired concurrently. Considering true labels as latent variables, a maximum likelihood approach is proposed to train a classifier based on these unlabeled training data. Furthermore, the case of correlated...

chapter

HMM based cross-OSN user migration modeling and forecasting

Zhihao Tian, Kai Niu, Zhiqiang He

2017 2nd IEEE International Conference on Computational Intelligence and Applications (ICCIA) > 234 - 238

2017 2nd IEEE International Conference on Computational Intelligence and Applications (ICCIA)

With the incredible growth of OSNs (online social networks), users have numerous choices every moment. However, due to the limit of time and resources, only a small part of OSNs are chosen to remain social and active by users. The dynamic changes of users' interests entail user migration. Understanding user migration behavior is important to improve business intelligence and retain users. In this...

chapter

Prediction of cardiac arrhythmia type using clustering and regression approach (P-CA-CRA)

Prathibhamol Cp, Anjana Suresh, Gopika Suresh

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 51 - 54

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Cardiac Arrhythmia is a disease dealing with improper beating of heart. The improper condition may be fast beating or slow beating associated with heart. This paper proposes a detection or prediction scheme in the type of cardiac arrhythmia disease. It uses a clustering approach and regression methodology. The clustering approach used is DBSCAN and for regression, multiclass logistic regression is...

chapter

A system for detecting professional skills from resumes written in natural language

Emil St. Chifu, Viorica Rozina Chifu, Iulia Popa, Ioan Salomie

2017 13th IEEE International Conference on Intelligent Computer Communication and Processing (ICCP) > 189 - 196

2017 13th IEEE International Conference on Intelligent Computer Communication and Processing (ICCP)

In this paper, we present a new method for detecting professional skills (as noun phrases) from resumes written in natural language. The proposed method uses an ontology of skills, the Wikipedia encyclopedia, and a set of standard multi word part-of-speech patterns in order to detect the professional skills. First, the method checks to see if there are, in the text of the resumes, skills that are...

chapter

Isolated forest in keystroke dynamics-based authentication: Only normal instances available for training

Kai Song, Yujie Zhou, Hongming Liu, Nianhao Zhu

2017 2nd IEEE International Conference on Computational Intelligence and Applications (ICCIA) > 63 - 67

2017 2nd IEEE International Conference on Computational Intelligence and Applications (ICCIA)

Keystroke dynamics, which is a biometric characteristic that depends on typing style of users. In the past thirty years, dozens of classifiers have been proposed for distinguishing people using keystroke dynamics; many have obtained excellent results in evaluation. However, a more common case is that only normal instances are available and none of the rare classes are observed. It leads us to use...

chapter

A comparison of different part-of-speech tagging technique for text in Bahasa Indonesia

Ahmad Zuli Amrullah, Rudy Hartanto, I Wayan Mustika

2017 7th International Annual Engineering Seminar (InAES) > 1 - 5

2017 7th International Annual Engineering Seminar (InAES)

Part of speech tagging has some different methods or techniques to the problem in assigning each word of a text with a part-of-speech tag. In this paper, we conducted some part-of-speech tagging techniques for Bahasa Indonesia experiments using statistical approach (Unigram, Hidden Markov Models) and Brill's tagger. In this study, we used Supervised POS Tagging approach requiring a large number of...

chapter

Pitch prediction from Mel-generalized cepstrum — a computationally efficient pitch modeling approach for speech synthesis

M V Achuth Rao, Prasanta Kumar Ghosh

2017 25th European Signal Processing Conference (EUSIPCO) > 1629 - 1633

2017 25th European Signal Processing Conference (EUSIPCO)

Text-to-speech (TTS) systems are often used as part of the user interface in wearable devices. Due to limited memory and computational/battery power in wearable devices, it could be useful to have a TTS system which requires less memory and is less computationally intensive. Conventional speech synthesis systems has separate modeling for pitch (FO-model) and spectral representation, namely Mel generalized...

chapter

Automatic detection of bird species from audio field recordings using HMM-based modelling of frequency tracks

Peter Jancovic, Munevver Kokuer

2017 25th European Signal Processing Conference (EUSIPCO) > 1779 - 1783

2017 25th European Signal Processing Conference (EUSIPCO)

This paper presents an automatic system for detection of bird species in field recordings. A sinusoidal detection algorithm is employed to segment the acoustic scene into isolated spectro-temporal segments. Each segment is represented as a temporal sequence of frequencies of the detected sinusoid, referred to as frequency track. Each bird species is represented by a set of hidden Markov models (HMMs),...

chapter

What makes audio event detection harder than classification?

Huy Phan, Philipp Koch, Fabrice Katzberg, Marco Maass, more

2017 25th European Signal Processing Conference (EUSIPCO) > 2739 - 2743

2017 25th European Signal Processing Conference (EUSIPCO)

There is a common observation that audio event classification is easier to deal with than detection. So far, this observation has been accepted as a fact and we lack of a careful analysis. In this paper, we reason the rationale behind this fact and, more importantly, leverage them to benefit the audio event detection task. We present an improved detection pipeline in which a verification step is appended...

chapter

Local frame match distance: A novel approach for exemplar gesture recognition

Radu Tudor Ionescu, Marius Popescu, Christopher Conly, Vassilis Athitsos

2017 25th European Signal Processing Conference (EUSIPCO) > 788 - 792

2017 25th European Signal Processing Conference (EUSIPCO)

Gesture recognition using a training set of limited size for a large vocabulary of gestures is a challenging problem in computer vision. With few examples per gesture class, researchers often employ state-of-the-art exemplar-based methods such as Dynamic Time Warping (DTW). This paper makes two contributions in the area of exemplar-based gesture recognition. As an alternative to DTW, we first introduce...

Keywords:
TRAINING
HIDDEN MARKOV MODELS

Publication date

Set your own date range

Content availability

Available (2,627)
None (12)

Keywords

SPEECH (916)
SPEECH RECOGNITION (756)
FEATURE EXTRACTION (722)
ACOUSTICS (426)
HIDDEN MARKOV MODEL (334)
ACCURACY (312)
COMPUTATIONAL MODELING (307)
DATABASES (290)
DATA MODELS (286)
DATA MINING (233)
SUPPORT VECTOR MACHINES (228)
TRAINING DATA (212)
HMM (211)
HANDWRITING RECOGNITION (184)
TESTING (183)
NATURAL LANGUAGE PROCESSING (175)
MATHEMATICAL MODEL (161)
ARTIFICIAL NEURAL NETWORKS (151)
VECTORS (146)
NEURAL NETWORKS (137)
LEARNING (ARTIFICIAL INTELLIGENCE) (136)
ADAPTATION MODELS (135)
SPEECH PROCESSING (132)
SPEECH SYNTHESIS (129)
CONTEXT (116)
MEL FREQUENCY CEPSTRAL COEFFICIENT (116)
DECODING (111)
IMAGE SEGMENTATION (111)
SPEAKER RECOGNITION (109)
PROBABILITY (105)
AUTOMATIC SPEECH RECOGNITION (104)
HUMANS (99)
ADAPTATION MODEL (97)
TRAJECTORY (94)
CLASSIFICATION ALGORITHMS (93)
VOCABULARY (93)
GESTURE RECOGNITION (89)
GAUSSIAN PROCESSES (85)
MAXIMUM LIKELIHOOD ESTIMATION (85)
MARKOV PROCESSES (83)
CHARACTER RECOGNITION (82)
ERROR ANALYSIS (82)
PATTERN RECOGNITION (81)
TEXT ANALYSIS (80)
ESTIMATION (79)
DICTIONARIES (77)
VITERBI ALGORITHM (77)
NOISE (76)
PREDICTIVE MODELS (76)
IMAGE RECOGNITION (75)
MACHINE LEARNING (74)
PATTERN CLASSIFICATION (73)
OPTIMIZATION (72)
VISUALIZATION (71)
KERNEL (67)
ROBUSTNESS (66)
TAGGING (66)
CLUSTERING ALGORITHMS (63)
STATISTICAL ANALYSIS (63)
CONTEXT MODELING (62)
SHAPE (62)
FACE RECOGNITION (61)
JOINTS (57)
NOISE MEASUREMENT (57)
RECURRENT NEURAL NETWORKS (57)
IMAGE CLASSIFICATION (56)
NEURONS (56)
SUPPORT VECTOR MACHINE (55)
LABELING (53)
TRANSFORMS (53)
CONDITIONAL RANDOM FIELDS (52)
SENSORS (52)
STANDARDS (52)
ALGORITHM DESIGN AND ANALYSIS (51)
HANDWRITTEN CHARACTER RECOGNITION (51)
NEURAL NETS (50)
SEMANTICS (49)
BAYES METHODS (47)
DETECTORS (47)
FACE (47)
IMAGE SEQUENCES (47)
PRINCIPAL COMPONENT ANALYSIS (47)
CONFERENCES (46)
CAMERAS (45)
PROBABILISTIC LOGIC (43)
TEXT RECOGNITION (43)
DISCRIMINATIVE TRAINING (42)
NATURAL LANGUAGES (42)
COMPUTER VISION (41)
ENTROPY (41)
INFORMATION RETRIEVAL (41)
SIGNAL TO NOISE RATIO (41)
HEURISTIC ALGORITHMS (40)
IMAGE MOTION ANALYSIS (40)
LATTICES (40)
PATTERN CLUSTERING (40)
PIXEL (40)
BIOLOGICAL SYSTEM MODELING (39)
more

INFONA - science communication portal

Search results

Research on voiceprint recognition based on weighted clustering recognition SVM algorithm

Traffic pattern modeling, trajectory classification and vehicle tracking within urban intersections

Semi-Supervised travel mode detection from smartphone data

Single-channel speech separation based on deep clustering with local optimization

Novel alignment method for DNN TTS training using HMM synthesis models

Human action recognition with hidden Markov models and neural network derived poses

A hardware/software co-design architecture for ultrasonic flaw detection with Hidden Markov Model and wavelet transform

A hardware/software co-design architecture for ultrasonic flaw detection with Hidden Markov Model and Wavelet Transform

A rule based fuzzy gesture recognition system to interact with Sphero 2.0 using a smart phone

Voice transformation using pitch and spectral mapping

Unsupervised multiview learning with partial distribution information

HMM based cross-OSN user migration modeling and forecasting

Prediction of cardiac arrhythmia type using clustering and regression approach (P-CA-CRA)

A system for detecting professional skills from resumes written in natural language

Isolated forest in keystroke dynamics-based authentication: Only normal instances available for training

A comparison of different part-of-speech tagging technique for text in Bahasa Indonesia

Pitch prediction from Mel-generalized cepstrum — a computationally efficient pitch modeling approach for speech synthesis

Automatic detection of bird species from audio field recordings using HMM-based modelling of frequency tracks

What makes audio event detection harder than classification?

Local frame match distance: A novel approach for exemplar gesture recognition

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options