Search results

chapter

Comparing and combining unimodal methods for multimodal recognition

Satoru Ishikawa, Jorma Laaksonen

2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI) > 1 - 6

2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI)

Multimodal recognition has recently become more attractive and common method in multimedia information retrieval. In many cases it shows better recognition results than using only unimodal methods. Most of current multimodal recognition methods still depend on unimodal recognition results. Therefore, in order to get better recognition performance, it is important to choose suitable features and classification...

chapter

Model-based video content representation

Lukas Diem, Maia Zaharieva

2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI) > 1 - 6

2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI)

Recurring visual elements in videos commonly represent central content entities, such as main characters and dominant objects. The automated detection of such elements is crucial for various application fields ranging from compact video content summarization to the retrieval of videos sharing common visual entities. Recent approaches for content-based video analysis commonly require for prior knowledge...

chapter

Simple tag-based subclass representations for visually-varied image classes

Xinchao Li, Peng Xu, Yue Shi, Martha Larson, more

2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI) > 1 - 6

2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI)

In this paper, we present a subclass-representation approach that predicts the probability of a social image belonging to one particular class. We explore the co-occurrence of user-contributed tags to find subclasses with a strong connection to the top level class. We then project each image onto the resulting subclass space, generating a subclass representation for the image. The advantage of our...

chapter

Fish activity tracking and species identification in underwater video

Ekram Hossain, S. M. Shaiful Alam, Amin Ahsan Ali, M Ashraful Amin

2016 5th International Conference on Informatics, Electronics and Vision (ICIEV) > 62 - 66

2016 International Conference on Informatics, Electronics and Vision (ICIEV)

In this paper we propose an automatic marine life monitoring system. First task in the monitoring process is to detect underwater moving objects as fishes. Second Task is to identify the species of the detected fish. Third task is to track the detected fish to avoid multiple counting and record their activities. Detection is performed using GMM based background subtraction method, classification is...

chapter

Image-based approach for the detection of counterfeit banknotes of Bangladesh

Mohammad Shorif Uddin, Pronaya Prosun Das, Md. Shamim Ahmed Roney

2016 5th International Conference on Informatics, Electronics and Vision (ICIEV) > 1067 - 1072

2016 International Conference on Informatics, Electronics and Vision (ICIEV)

Currency duplication also known as counterfeit currency is a vulnerable threat on economy. It is now a common phenomenon due to advanced printing and scanning technology. Bangladesh has been facing serious problem by the increasing rate of fake notes in the market. To get rid of this problem various fake note detection methods are available around the world and most of these are hardware based and...

chapter

Classifying digital X-ray images into different human body parts

Sanad Saha, Asif Mahmud, Amin Ahsan Ali, Md. Ashraful Amin

2016 5th International Conference on Informatics, Electronics and Vision (ICIEV) > 67 - 71

2016 International Conference on Informatics, Electronics and Vision (ICIEV)

In medical information retrieval research, automatically classifying X-ray images based on body-parts is a challenging problem. In ImageCLEF's 2015 campaign there was a contest where the participants were challenged to cluster X-ray images into different groups based on presence of particular body-part in that X-ray image. In brief the challenge was to classify given X-ray images primarily into five...

chapter

Role of voxel selection and ROI in fMRI data analysis

Raheel Zafar, Aamir Saeed Malik, Nidal Kamel, Sarat C Dass

2016 IEEE International Symposium on Medical Measurements and Applications (MeMeA) > 1 - 6

2016 IEEE International Symposium on Medical Measurements and Applications (MeMeA)

Functional magnetic resonance imaging (fMRI) is one of the most popular and reliable modality to measure brain activities. The quality of fMRI data is best among other modalities such as Electroencephalography (EEG) and Magnetoencephalography (MEG). In fMRI, normally number of features are more than the number of instances so it is necessary to select the features and do dimension reduction to remove...

chapter

An improved interest point detector for human action recognition

Songtao Ding, Shiru Qu

2016 Chinese Control and Decision Conference (CCDC) > 4355 - 4360

2016 Chinese Control and Decision Conference (CCDC)

In this work, we present a method of human action recognition based on detection of interest points by spatial and temporal constraints. Firstly, the improved Harris-Laplace algorithm is proposed to solve the problem of multi-scale. Then, the bag-of-visual features (BoV) model is used for feature extraction, and is built the visual dictionary with K-means clustering. We train the Support Vector Machine...

chapter

Multi-speaker voice activity detection using a camera-assisted microphone array

Trond F. Bergh, Ines Hafizovic, Sverre Holm

2016 International Conference on Systems, Signals and Image Processing (IWSSIP) > 1 - 4

2016 International Conference on Systems, Signals and Image Processing (IWSSIP)

We present a method for voice activity detection of multiple concurrent speakers using a camera-assisted microphone array. The proposed method uses face detection to identify locations of potential speech sources, and uses this information in an adaptive beamforming procedure to form a spatially directed detection algorithm to identify voice activity for individual speakers. Voice activity is classified...

chapter

Effect of voxel selection on temporal mesh model for brain decoding

Arman Afrasiyabi, Itir Onal, Fatos T. Yarman Vural

2016 24th Signal Processing and Communication Application Conference (SIU) > 2249 - 2252

2016 24th Signal Processing and Communication Application Conference (SIU)

In this study, we combine a voxel selection method with temporal mesh model to decode the discriminative information distributed in functional Magnetic Resonance Imaging (fMRI) data. We first employ one way Analysis of Variance (ANOVA) feature selection to select the most informative voxels. Then, we form meshes around selected voxels with their spatial and functional neighbors by employing the Mesh...

chapter

Scene nudity level detection with deep nets

Savas Ozkan, Ersin Esen, Ilkay Atil, Gozde Bozdagi Akar

2016 24th Signal Processing and Communication Application Conference (SIU) > 2069 - 2072

2016 24th Signal Processing and Communication Application Conference (SIU)

In this paper, we present an approach that can detect scene nudity level with high precision using different deep net configurations. For this purpose, a recent approach [1] which has intense and very deep convolution layers is used. During net modelling, we strive to obtain most successful net configuration by comparing different Dropout models and image sizes -64 × 64, 128 × 128-. Additionally,...

chapter

Structured output tracking with deep neural network and optical flow

Youngjoo Jo, Jun-Cheol Park, Dae-Shik Kim

2016 2nd International Conference on Control, Automation and Robotics (ICCAR) > 350 - 356

2016 2nd International Conference on Control, Automation and Robotics (ICCAR)

The deep learning of neural network works on vision recognition and classification tasks briskly, and it can extract great features of an image for classification. Recently, many approaches have studied the visual tracking in two-ways with these characteristics. First, they can regard tracking problem as classifying each video and frame by learning all dataset. Second, use the deep neural network...

chapter

Fusing Deep Convolutional Networks for Large Scale Visual Concept Classification

Hilal Ergun, Mustafa Sert

2016 IEEE Second International Conference on Multimedia Big Data (BigMM) > 210 - 213

2016 IEEE Second International Conference on Multimedia Big Data (BigMM)

Deep learning architectures are showing great promise in various computer vision domains including image classification, object detection, event detection and action recognition. In this study, we investigate various aspects of convolutional neural networks (CNNs) from the big data perspective. We analyze recent studies and different network architectures both in terms of running time and accuracy...

chapter

CNUSVM: Hybrid CNN-Uneven SVM Model for Imbalanced Visual Learning

Mengyue Geng, Yaowei Wang, Yonghong Tian, Tiejun Huang

2016 IEEE Second International Conference on Multimedia Big Data (BigMM) > 186 - 193

2016 IEEE Second International Conference on Multimedia Big Data (BigMM)

Recently, deep Convolutional Neural Networks (CNNs) have been used to achieve state-of-the-art performance on a wide range of visual learning tasks. However, when facing some imbalanced learning tasks where the training samples are unevenly distributed among different classes, CNNs tend to produce performance bias toward the majority class, making them not suitable for applications in which the recognition...

chapter

Support vector machines with time series distance kernels for action classification

Mohammad Ali Bagheri, Qigang Gao, Sergio Escalera

2016 IEEE Winter Conference on Applications of Computer Vision (WACV) > 1 - 7

2016 IEEE Winter Conference on Applications of Computer Vision (WACV)

Despite the outperformance of Support Vector Machine (SVM) on many practical classification problems, the algorithm is not directly applicable to multi-dimensional trajectories having different lengths. In this paper, a new class of SVM that is applicable to trajectory classification, such as action recognition, is developed by incorporating two efficient time-series distances measures into the kernel...

chapter

On visual vocabulary size in SVM classification

Weixue Liu, Hongxia Cui, Jian Hou, Jianxin Kang

2016 IEEE International Conference on Industrial Technology (ICIT) > 962 - 967

2016 IEEE International Conference on Industrial Technology (ICIT)

Codebook has been shown to be an effective image representation method. In this method, discriminative local features, e.g., SIFT, are extracted from images and then pooled together. All these local features are then clustered and the centers of all the clusters form a codebook. By counting the distribution of local features on these codes, we obtain a histogram of local features as the global feature...

chapter

Learning deep-sea substrate types with visual topic models

Arnold Kalmbach, Maia Hoeberechts, Alexandra Branzan Albu, Herve Glotin, more

2016 IEEE Winter Conference on Applications of Computer Vision (WACV) > 1 - 9

2016 IEEE Winter Conference on Applications of Computer Vision (WACV)

We propose and evaluate a method for learning deep-sea substrate types using video recorded with a remotely operated vehicle (ROV). The goal of this work is to create a labelled spatial map of substrate types from ROV video in order to support biological and geological domain research. The output of our method describes the mixtures of geological features such as sediment and types of lava flow in...

chapter

Adapting attributes by selecting features similar across domains

Siqi Liu, Adriana Kovashka

2016 IEEE Winter Conference on Applications of Computer Vision (WACV) > 1 - 8

2016 IEEE Winter Conference on Applications of Computer Vision (WACV)

Attributes are semantic visual properties shared by objects. They have been shown to improve object recognition and to enhance content-based image search. While attributes are expected to cover multiple categories, e.g. a dalmatian and a whale can both have "smooth skin", we find that the appearance of a single attribute varies quite a bit across categories. Thus, an attribute model learned...

chapter

A driver fatigue detection method based on multi-sensor signals

Hao Yin, Yuanqi Su, Yuehu Liu, Danchen Zhao

2016 IEEE Winter Conference on Applications of Computer Vision (WACV) > 1 - 7

2016 IEEE Winter Conference on Applications of Computer Vision (WACV)

Fatigue during long-time driving threatens the safety of drivers and transportation. In this paper, we provide an effective method based on multi-sensor signals collected from Kinect2.0 camera and PPG pulse sensor to build a driver fatigue detection system. Unlike most traditional works, we define the transitional process of fatigue and elaborate its effect on training classifiers. The simulation...

chapter

Relevance Feedback Based CBIR System Using SVM and Bayes Classifier

Navneet Kaur, Sonika Jindal, Bhavneet Kaur

2016 Second International Conference on Computational Intelligence & Communication Technology (CICT) > 214 - 218

2016 Second International Conference on Computational Intelligence & Communication Technology (CICT)

Image search techniques were not generally basedon visual features but on the textual annotation of images. Images were firstly annotated with text and then searched usinga text-based approach from traditional database managementsystems which is time consuming and difficult to manage. Toovercome this problem, CBIR (Content Based Image Retrieval) is introduced which is becoming the hottest research...

INFONA - science communication portal

Search results

Comparing and combining unimodal methods for multimodal recognition

Model-based video content representation

Simple tag-based subclass representations for visually-varied image classes

Fish activity tracking and species identification in underwater video

Image-based approach for the detection of counterfeit banknotes of Bangladesh

Classifying digital X-ray images into different human body parts

Role of voxel selection and ROI in fMRI data analysis

An improved interest point detector for human action recognition

Multi-speaker voice activity detection using a camera-assisted microphone array

Effect of voxel selection on temporal mesh model for brain decoding

Scene nudity level detection with deep nets

Structured output tracking with deep neural network and optical flow

Fusing Deep Convolutional Networks for Large Scale Visual Concept Classification

CNUSVM: Hybrid CNN-Uneven SVM Model for Imbalanced Visual Learning

Support vector machines with time series distance kernels for action classification

On visual vocabulary size in SVM classification

Learning deep-sea substrate types with visual topic models

Adapting attributes by selecting features similar across domains

A driver fatigue detection method based on multi-sensor signals

Relevance Feedback Based CBIR System Using SVM and Bayes Classifier

Filter options

Publication date

Content availability

Keywords

Data set

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options