Scene classification is a key problem in the interpretation of high-resolution remote sensing imagery. State-of-the-art methods, e.g. the bag-of-visual-words model and its various extensions, as well as topic models, share similar procedures: patch sampling, feature description/learning, and classification. Patch sampling is the first and key procedure and has a great influence on the results...
Scene classification for high-resolution remotely sensed imagery has been widely investigated in recent years. However, there are few public, widely accepted, and large-scale datasets for benchmarking different methods. This paper presents a new, large dataset consisting of 5000 high-resolution remote sensing images, manually labeled into 20 semantic classes for scene classification. Each class...
An approach to creating fixation density maps (FDM) for stereoscopic images is proposed in this paper, overcoming the shortcomings of current methods. A new representation of stereoscopic images, similar to Computed Tomography (CT), is used, which can show more information such as depth and the discomfort zone. In addition, we follow the eye tracker's built-in 2D calibration with a 3D offline calibration to gain...
Hand-gesture-based input has quickly emerged as an alternative for human-3DTV interaction. However, it limits the user experience, and the problem becomes severe when gesture recognition in an uncontrolled TV room is not accurate or robust enough and a large variety and number of gestures are required. In this paper, we present a simple and fast human-3DTV interaction method that combines the advantages of touchless...
Falls are a leading cause of accidental injury deaths and a key cause of significant health problems, especially for elderly people who live alone. To help such people seek assistance after a fall and to keep records of key daily movements, we propose a simple yet effective system that monitors daily activities and in-house locations using a smartphone. We also test the system for the optimum arrangement...
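The abstract above does not specify the detection algorithm; a common baseline for smartphone fall monitoring is thresholding the accelerometer magnitude for a near-free-fall dip followed by an impact spike. The sketch below illustrates that baseline only; the threshold values and the `detect_fall` function are illustrative assumptions, not the paper's method.

```python
import math

def detect_fall(samples, g=9.81, free_fall_thresh=0.5, impact_thresh=2.5):
    """Flag a fall when a near-free-fall dip is followed shortly by an impact spike.

    samples: list of (ax, ay, az) accelerometer readings in m/s^2.
    Thresholds are in multiples of gravity and are illustrative only.
    """
    mags = [math.sqrt(ax**2 + ay**2 + az**2) / g for ax, ay, az in samples]
    for i, m in enumerate(mags):
        if m < free_fall_thresh:  # device briefly in near free fall
            # look for an impact spike within the next few samples
            if any(m2 > impact_thresh for m2 in mags[i + 1:i + 10]):
                return True
    return False
```

A stationary trace (magnitude near 1 g throughout) is not flagged, while a dip-then-spike trace is.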
The precision of visual matching and the trade-off between accuracy and time efficiency have long been bottlenecks of image search systems. This work addresses the two problems simultaneously by introducing the coupled Multi-Index (cMI) structure. First, by combining SIFT and color features at the indexing level, the discriminative power of visual words is greatly enhanced. Second, by reducing the...
This study explores the eye movement patterns and underlying cognitive processes of optical reasoning in science major and non-science major students who have different prior knowledge of optical concepts. Thirty-three science major and 33 non-science major undergraduate students were involved in this study. The results showed that the science major students and non-science major students have improved...
The bag of visual words is a well established representation in diverse computer vision problems. Taking inspiration from the fields of text mining and retrieval, this representation has proved to be very effective in a large number of domains. In most cases, a standard term-frequency weighting scheme is considered for representing images and videos in computer vision. This is somewhat surprising,...
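The standard term-frequency weighting the abstract refers to, and the tf-idf alternative borrowed from text retrieval, can be sketched as a re-weighting of the image-by-visual-word count matrix. The `tfidf_weight` function and its smoothed idf formula below are one common variant, given here as an illustration rather than the paper's scheme.

```python
import numpy as np

def tfidf_weight(counts):
    """Re-weight an image-by-visual-word count matrix with tf-idf.

    counts: (n_images, n_words) array of raw visual-word counts.
    Returns L2-normalized tf-idf vectors, one row per image.
    """
    tf = counts / np.maximum(counts.sum(axis=1, keepdims=True), 1)
    df = (counts > 0).sum(axis=0)                       # images containing each word
    idf = np.log((1 + counts.shape[0]) / (1 + df)) + 1  # smoothed idf
    w = tf * idf
    norms = np.linalg.norm(w, axis=1, keepdims=True)
    return w / np.maximum(norms, 1e-12)
```

Words occurring in every image receive the minimum idf, so they contribute less to matching than rare, discriminative words.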
This paper presents a subject centric group feature for person re-identification. Our approach is inspired by the observation that people often tend to walk alongside others or in a group. We argue that co-travelers' information, including geometry and visual cues, can reduce the re-identification ambiguity and lead to better accuracy, compared to approaches that rely only on visual cues. We introduce...
The performance of different action recognition techniques has recently been studied by several computer vision researchers. However, the potential improvement in classification through classifier fusion by ensemble-based methods has remained unattended. In this work, we evaluate the performance of an ensemble of action learning techniques, each performing the recognition task from a different perspective...
Large amounts of available training data and increasing computing power have led to the recent success of deep convolutional neural networks (CNN) on a large number of applications. In this paper, we propose an effective semantic pixel labelling using CNN features, hand-crafted features and Conditional Random Fields (CRFs). Both CNN and hand-crafted features are applied to dense image patches to produce...
Learning to count is a learning strategy that has been recently proposed in the literature for dealing with problems where estimating the number of object instances in a scene is the final objective. In this framework, the task of learning to detect and localize individual object instances is seen as a harder task that can be avoided by casting the problem as that of computing a regression value from...
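The regression formulation described above can be sketched in its simplest global form: map an image-level feature vector directly to a count with ridge regression. The feature representation and the helper names (`fit_count_regressor`, `predict_count`) are toy assumptions for illustration; real counting-by-regression methods typically regress local density maps.

```python
import numpy as np

def fit_count_regressor(features, counts, ridge=1e-3):
    """Ridge least-squares map from global image features to object counts.

    features: (n, d) array; counts: (n,) ground-truth counts.
    Returns a weight vector w such that features @ w approximates counts.
    """
    X, y = np.asarray(features, float), np.asarray(counts, float)
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + ridge * np.eye(d), X.T @ y)

def predict_count(w, feature):
    """Predict a (non-negative) count for one feature vector."""
    return max(0.0, float(np.asarray(feature, float) @ w))
```

No detector or localizer is ever trained: the count is read off directly from the regressed value.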
Compared to image representations based on low-level local descriptors, deep neural activations of Convolutional Neural Networks (CNNs) are richer in mid-level representation, but poorer in geometric invariance properties. In this paper, we present a straightforward framework for better image representation by combining the two approaches. To take advantage of both representations, we extract a fair...
In this paper, we evaluate the generalization power of deep features (ConvNets) in two new scenarios: aerial and remote sensing image classification. We evaluate experimentally ConvNets trained for recognizing everyday objects for the classification of aerial and remote sensing images. ConvNets obtained the best results for aerial images, while for remote sensing, they performed well but were outperformed...
Controlling absolute magnitudes of fingertip force is an important skill in many haptic interactions such as surgical operations and mechanical assembly. A fundamental question in force control is how quickly humans can output a target force with the expected accuracy. In this paper, humans' capability to control absolute magnitudes of fingertip force under audio or visual feedback was observed through...
In this paper we introduce a new video description framework that replaces traditional Bag-of-Words with a combination of Fisher Kernels (FK) and Vector of Locally Aggregated Descriptors (VLAD). The main contributions are: (i) a fast algorithm to densely extract global frame features, easier and faster to compute than spatio-temporal local features; (ii) replacing the traditional k-means based vocabulary...
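VLAD, one of the two encodings named above, aggregates the residuals of local descriptors to their nearest vocabulary centroid. The sketch below is a minimal NumPy version of that standard encoding (with the usual power- and L2-normalization), not the paper's full video pipeline.

```python
import numpy as np

def vlad_encode(descriptors, centroids):
    """Vector of Locally Aggregated Descriptors.

    For each local descriptor, accumulate its residual to the nearest
    centroid; concatenate the per-centroid sums, then apply power- and
    L2-normalization. descriptors: (n, d); centroids: (k, d).
    Returns a (k*d,) encoding vector.
    """
    X = np.asarray(descriptors, float)
    C = np.asarray(centroids, float)
    # squared distances to every centroid, then hard assignment
    d2 = ((X[:, None, :] - C[None, :, :]) ** 2).sum(-1)
    assign = d2.argmin(axis=1)
    v = np.zeros_like(C)
    for i, a in enumerate(assign):
        v[a] += X[i] - C[a]           # residual to assigned centroid
    v = v.ravel()
    v = np.sign(v) * np.sqrt(np.abs(v))  # power normalization
    n = np.linalg.norm(v)
    return v / n if n > 0 else v
```

Unlike a bag-of-words histogram, the encoding keeps the direction of each residual, which is what makes a small vocabulary sufficient.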
This paper deals with automatic systems for image recipe recognition. For this purpose, we compare and evaluate leading vision-based and text-based technologies on a new, very large multimodal dataset (UPMC Food-101) containing about 100,000 recipes for a total of 101 food categories. Each item in this dataset is represented by one image plus textual information. We present extensive experiments on recipe...
Human action recognition is at the core of computer vision and has great application value in intelligent human-computer interaction. On the basis of Bag-of-Words (BoW), this work presents a framework for action recognition that combines Huffman coding with an Implicit Action Model (IAM). Specifically, Huffman coding, which outperforms the naïve Bayesian method, is a robust estimation of visual words' conditional...
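For reference, standard Huffman coding builds prefix-free codes from symbol frequencies, giving frequent symbols shorter codes. The sketch below is the textbook construction over a frequency table (e.g. visual-word counts); how the cited paper derives conditional probabilities from it is not shown in the abstract, so this is the generic algorithm only.

```python
import heapq
from itertools import count

def huffman_codes(freqs):
    """Build prefix-free codes from symbol frequencies (standard Huffman).

    freqs: dict symbol -> frequency. Returns dict symbol -> bit string.
    Frequent symbols receive shorter codes.
    """
    if len(freqs) == 1:
        return {next(iter(freqs)): "0"}
    tick = count()  # tie-breaker so the heap never compares dicts
    heap = [(f, next(tick), {s: ""}) for s, f in freqs.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        f1, _, c1 = heapq.heappop(heap)   # two least-frequent subtrees
        f2, _, c2 = heapq.heappop(heap)
        merged = {s: "0" + code for s, code in c1.items()}
        merged.update({s: "1" + code for s, code in c2.items()})
        heapq.heappush(heap, (f1 + f2, next(tick), merged))
    return heap[0][2]
```

The resulting code lengths approximate the negative log-probabilities of the symbols, which is the usual link between Huffman trees and probability estimation.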
This paper demonstrates a high-performance brain-computer interface (BCI) that allows users to dial phone numbers. The system is based on Canonical Correlation Analysis (CCA) and Steady-State Visual Evoked Potentials (SSVEP). Through six buttons (9 Hz, 10 Hz, 11 Hz, 12 Hz, 13 Hz, 14 Hz) displayed on the screen, subjects can choose a number by gazing at the computer interface. This proposed EEG (Electroencephalography)...
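The CCA-based SSVEP detection named above can be sketched as follows: for each candidate frequency, correlate the multi-channel EEG with a sine/cosine reference and pick the frequency with the largest canonical correlation. The sketch uses the fundamental frequency only (real decoders usually add harmonics), and the function names are assumptions for illustration.

```python
import numpy as np

def max_canonical_corr(X, Y):
    """Largest canonical correlation between data matrices X (T, p) and Y (T, q)."""
    Qx, _ = np.linalg.qr(X - X.mean(0))
    Qy, _ = np.linalg.qr(Y - Y.mean(0))
    s = np.linalg.svd(Qx.T @ Qy, compute_uv=False)
    return float(s[0])

def detect_ssvep_freq(eeg, fs, candidates):
    """Pick the stimulus frequency whose sin/cos reference best matches the EEG.

    eeg: (T, n_channels) array; fs: sampling rate in Hz;
    candidates: list of stimulus frequencies in Hz.
    """
    t = np.arange(eeg.shape[0]) / fs
    scores = {}
    for f in candidates:
        ref = np.column_stack([np.sin(2 * np.pi * f * t),
                               np.cos(2 * np.pi * f * t)])
        scores[f] = max_canonical_corr(eeg, ref)
    return max(scores, key=scores.get)
```

Because CCA finds the channel combination that best aligns with the reference, no per-subject training is needed, which is a large part of SSVEP-CCA's appeal.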
We present PET- the Pascal animal classes Eye Tracking database. Our database comprises eye movement recordings compiled from forty users for the bird, cat, cow, dog, horse and sheep trainval sets from the VOC 2012 image set. Different from recent eye-tracking databases such as [1, 2], a salient aspect of PET is that it contains eye movements recorded for both the free-viewing and visual search task...