Search results for: Bhiksha Raj

Items from 1 to 7 out of 7 results

chapter

An approach for self-training audio event detectors using web data

Benjamin Elizalde, Ankit Shah, Siddharth Dalmia, Min Hun Lee, more

2017 25th European Signal Processing Conference (EUSIPCO) > 1863 - 1867

2017 25th European Signal Processing Conference (EUSIPCO)

Audio Event Detection (AED) aims to recognize sounds within audio and video recordings. AED employs machine learning algorithms commonly trained and tested on annotated datasets. However, available datasets are limited in number of samples and hence it is difficult to model acoustic diversity. Therefore, we propose combining labeled audio from a dataset and unlabeled audio from the web to improve...

chapter

SphereFace: Deep Hypersphere Embedding for Face Recognition

Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6738 - 6746

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper addresses deep face recognition (FR) problem under open-set protocol, where ideal face features are expected to have smaller maximal intra-class distance than minimal inter-class distance under a suitably chosen metric space. However, few existing algorithms can effectively achieve this criterion. To this end, we propose the angular softmax (A-Softmax) loss that enables convolutional neural...

chapter

Adaptation of SVM for MIL for inferring the polarity of movies and movie reviews

Joana Correia, Isabel Trancoso, Bhiksha Raj

2016 IEEE Spoken Language Technology Workshop (SLT) > 258 - 264

2016 IEEE Spoken Language Technology Workshop (SLT)

Polarity detection is a research topic of major interest, with many applications including detecting the polarity of product reviews. However, in some cases, the polarity of the product reviews might not be available while the polarity of the product itself might be, prohibiting the use of any form of fully supervised learning technique. This scenario, while different, is close to that of multiple...

chapter

Efficient autism spectrum disorder prediction with eye movement: A machine learning framework

Wenbo Liu, Li Yi, Zhiding Yu, Xiaobing Zou, more

2015 International Conference on Affective Computing and Intelligent Interaction (ACII) > 649 - 655

2015 International Conference on Affective Computing and Intelligent Interaction (ACII)

We propose an autism spectrum disorder (ASD) prediction system based on machine learning techniques. Our work features the novel development and application of machine learning methods over traditional ASD evaluation protocols. Specifically, we are interested in discovering the latent patterns that possibly indicate the symptom of ASD underneath the observations of eye movement. A group of subjects...

chapter

Privacy-preserving Query-by-Example Speech Search

Jose Portelo, Alberto Abad, Bhiksha Raj, Isabel Trancoso

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1797 - 1801

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper investigates a new privacy-preserving paradigm for the task of Query-by-Example Speech Search using Secure Binary Embeddings, a hashing method that converts vector data to bit strings through a combination of random projections followed by banded quantization. The proposed method allows performing spoken query search in an encrypted domain, by analyzing ciphered information computed from...

article

Learning-Based Auditory Encoding for Robust Speech Recognition

Yu-Hsiang Bosco Chiu, Bhiksha Raj, Richard M. Stern

IEEE Transactions on Audio, Speech, and Language Processing > 2012 > 20 > 3 > 900 - 914

This paper describes an approach to the optimization of the nonlinear component of a physiologically motivated feature extraction system for automatic speech recognition. Most computational models of the peripheral auditory system include a sigmoidal nonlinear function that relates the log of signal intensity to output level, which we represent by a set of frequency dependent logistic functions. The...

chapter

Learning contextual relevance of audio segments using discriminative models over AUD sequences

Sourish Chaudhuri, Bhiksha Raj

2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 197 - 200

2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Effective retrieval of multimodal data involves performing accurate segmentation and analysis of such data. With easy access to a number of audio and video sharing platforms online, user-generated content with considerably less than ideal recording conditions has increased rapidly. One major issue with such content is the presence of semantically irrelevant segments in such recordings. This leads...

Filter options

Keywords:
FEATURE EXTRACTION

Publication date

Set your own date range

Publication type

book (6)
article (1)

Keywords

TRAINING (5)
ACOUSTICS (2)
EUROPE (2)
FACE (2)
SPEECH (2)
SUPPORT VECTOR MACHINES (2)
ARTIFICIAL NEURAL NETWORKS (1)
AUDIO SEGMENT SELECTION (1)
AUDITORY MODEL (1)
AUDS (1)
AUTISM SPECTRUM DISORDER (1)
BAG-OF-WORDS (1)
COMPUTATIONAL MODELING (1)
CONTEXT (1)
DATA MINING (1)
DATA PRIVACY (1)
DETECTORS (1)
DICTIONARIES (1)
DISCRIMINATIVE TRAINING (1)
DOC2VEC (1)
DYNAMIC TIME WARPING (1)
EUCLIDEAN DISTANCE (1)
EYE TRACKING (1)
FACE RECOGNITION (1)
HAMMING DISTANCE (1)
HEURISTIC ALGORITHMS (1)
HIDDEN MARKOV MODELS (1)
HISTOGRAMS (1)
IMDB (1)
KERNEL (1)
LARGE MARGIN DISCRIMINATIVE TRAINING (1)
MANIFOLDS (1)
MEASUREMENT (1)
MEDIA (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MOTION PICTURES (1)
MULTIPLE INSTANCE LEARNING (1)
NOISE (1)
OPTIMIZATION (1)
QUANTIZATION (SIGNAL) (1)
QUERY-BY-EXAMPLE SPEECH SEARCH (1)
ROBUST AUTOMATIC SPEECH RECOGNITION (1)
SECURE BINARY EMBEDDINGS (1)
SENTIMENT ANALYSIS (1)
SPEECH RECOGNITION (1)
SPORTS EQUIPMENT (1)
SUPPORT VECTOR MACHINE (1)
SVM (1)
TESTING (1)
TRAINING DATA (1)
VISUALIZATION (1)
YOUTUBE (1)
more

INFONA - science communication portal

Search results for: Bhiksha Raj

An approach for self-training audio event detectors using web data

SphereFace: Deep Hypersphere Embedding for Face Recognition

Adaptation of SVM for MIL for inferring the polarity of movies and movie reviews

Efficient autism spectrum disorder prediction with eye movement: A machine learning framework

Privacy-preserving Query-by-Example Speech Search

Learning-Based Auditory Encoding for Robust Speech Recognition

Learning contextual relevance of audio segments using discriminative models over AUD sequences

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options