Search results

Items from 1 to 4 out of 4 results

chapter

Semantic Regularisation for Recurrent Image Annotation

Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4160 - 4168

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The CNN-RNN design pattern is increasingly widely applied in a variety of image annotation tasks including multi-label classification and captioning. Existing models use the weakly semantic CNN hidden layer or its transform as the image embedding that provides the interface between the CNN and RNN. This leaves the RNN overstretched with two jobs: predicting the visual concepts and modelling their...

chapter

Learning semantic attributes via a common latent space

Ziad Al-Halah, Tobias Gehrig, Rainer Stiefelhagen

2014 International Conference on Computer Vision Theory and Applications (VISAPP) > 2 > 48 - 55

2014 International Conference on Computer Vision Theory and Applications (VISAPP)

Semantic attributes represent an adequate knowledge that can be easily transferred to other domains where lack of information and training samples exist. However, in the classical object recognition case, where training data is abundant, attribute-based recognition usually results in poor performance compared to methods that used image features directly. We introduce a generic framework that boosts...

chapter

The story of a single cell: Peeking into the semantics of spikes

Roi Kliper, T Serre, D Weinshall, I Nelkenz

2010 2nd International Workshop on Cognitive Information Processing > 281 - 286

2010 2nd International Workshop on Cognitive Information Processing (CIP 2010)

Traditionally, the modeling of sensory neurons has focused on the characterization and/or the learning of input-output relations. Motivated by the view that different neurons impose different partitions on the stimulus space, we propose instead to learn the structure of the stimulus space, as imposed by the cell, by learning a cell specific distance function or kernel. Metaphorically speaking, this...

chapter

Classifying laughter and speech using audio-visual feature prediction

Stavros Petridis, Ali Asghar, Maja Pantic

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 5254 - 5257

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

In this study, a system that discriminates laughter from speech by modelling the relationship between audio and visual features is presented. The underlying assumption is that this relationship is different between speech and laughter. Neural networks are trained which learn the audio-to-visual and visual-to-audio features mapping for both classes. Classification of a new frame is performed via prediction...

Filter options

Keywords:
PREDICTIVE MODELS
TRAINING
VISUALIZATION

Publication date

Set your own date range

Keywords

SEMANTICS (2)
ARTIFICIAL NEURAL NETWORKS (1)
AUDIO SIGNAL PROCESSING (1)
AUDIO-TO-VISUAL FEATURE MAPPING (1)
AUDIO-VISUAL SYSTEMS (1)
AUDIOVISUAL FEATURE-LEVEL FUSION (1)
AUDIOVISUAL SPEECH / LAUGHTER FEATURE RELATIONSHIP (1)
BIOELECTRIC POTENTIALS (1)
BRAIN (1)
BRAIN MODELS (1)
CELL RESPONSE (1)
CELL SPECIFIC DISTANCE FUNCTION (1)
CELL-SPECIFIC DISTANCE FUNCTION (1)
CELLULAR BIOPHYSICS (1)
COMPUTATIONAL MODELING (1)
DECODING (1)
DECORRELATION (1)
FEATURE EXTRACTION (1)
INFEROTEMPORAL CORTEX (1)
INPUT-OUTPUT RELATION (1)
LATENT SPACE (1)
LAUGHTER (1)
LAUGHTER-VS-SPEECH DISCRIMINATION (1)
MACAQUE MONKEY (1)
NEURAL DATA (1)
NEURAL NETS (1)
NEURAL NETWORKS (1)
NEURONS (1)
NEUROPHYSIOLOGY (1)
OBJECT CLASSIFICATION (1)
PARTIAL LEAST SQUARES ANALYSIS (1)
PATTERN CLASSIFICATION (1)
PREDICTION-BASED CLASSIFICATION (1)
PREFRONTAL CORTEX (1)
SEMANTIC ATTRIBUTES (1)
SEMANTIC PARTITION (1)
SENSORY NEURON MODELING (1)
SPEECH (1)
SPEECH FEATURE (1)
SPEECH PROCESSING (1)
SPIKE SEMANTICS (1)
STIMULUS SPACE (1)
VISUAL-TO-AUDIO FEATURE MAPPING (1)
more

INFONA - science communication portal

Search results

Semantic Regularisation for Recurrent Image Annotation

Learning semantic attributes via a common latent space

The story of a single cell: Peeking into the semantics of spikes

Classifying laughter and speech using audio-visual feature prediction

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options