Search results

chapter

Spatial pyramid context-aware moving vehicle detection and tracking in urban aerial imagery

Mahdieh Poostchi, Kannappan Palaniappan, Guna Seetharaman

2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) > 1 - 6

2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)

Persistent detection and tracking of moving vehicles in airborne imagery provide indispensable information for many traffic surveillance applications including traffic monitoring and management, navigation systems, activity recognition and event detection. This paper presents a collaborative Spatial Pyramid Context-aware detection and Tracking system (SPCT) for moving vehicles in dense urban aerial...

chapter

Urban Fusion: Visualizing Urban Data Fused with Social Feeds via a Game Engine

Jan Perhac, Wei Zeng, Shiho Asada, Stefan Mueller Arisona, more

2017 21st International Conference Information Visualisation (IV) > 312 - 317

2017 21st International Conference Information Visualisation (IV)

This paper presents a framework which allows urban planners to navigate and interact with large datasets fused with social feeds in real-time, enhanced by a virtual reality (VR) capability, which further promotes the knowledge discovery process and allows to interact with urban data in natural yet immersive way. A challenge in urban planning is making decisions based on datasets which are many times...

chapter

Gaze Embeddings for Zero-Shot Image Classification

Nour Karessli, Zeynep Akata, Bernt Schiele, Andreas Bulling

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6412 - 6421

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Zero-shot image classification using auxiliary information, such as attributes describing discriminative object properties, requires time-consuming annotation by domain experts. We instead propose a method that relies on human gaze as auxiliary information, exploiting that even non-expert users have a natural ability to judge class membership. We present a data collection paradigm that involves a...

chapter

Representing word image using visual word embeddings and RNN for keyword spotting on historical document images

Hongxi Wei, Hui Zhang, Guanglai Gao

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1368 - 1373

2017 IEEE International Conference on Multimedia and Expo (ICME)

Visual words of Bag-of-Visual-Words (BoVW) framework are independent each other, which results in not only discarding spatial orders between visual words but also lacking semantic information. This study is inspired by word embeddings that a similar embedding procedure is applied to a large number of visual words. By this way, the corresponding embedding vectors of the visual words can be formulated...

chapter

A novel framework for enhancement of the low lighting video

Jianhua Pang, Sheng Zhang, Wencang Bai

2017 IEEE Symposium on Computers and Communications (ISCC) > 1366 - 1371

2017 IEEE Symposium on Computers and Communications (ISCC)

A novel and effective framework for the enhancement of low lighting images is proposed in this paper. The novel framework presents an optimized de-haze algorithm on inverted images to enhance the low-dynamic-range images which optimizes the complicated process of computing the parameters A and t(x). The improved gamma correction is used to enhance the image contrast for providing better visual performance...

chapter

Real time video summarization on mobile platform

Pradeep Choudhary, Sowmya P. Munukutla, K. S. Rajesh, Alok S. Shukla

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1045 - 1050

2017 IEEE International Conference on Multimedia and Expo (ICME)

Recent advances in technology and rapid growth of consumer electronics have made tremendous amount of multimedia information available to the general population. Browsing through large collections of consumer videos and manually creating summaries can be tedious. Automatic summarization techniques will give the user an easy way to look up important content of a collection of media and to browse media...

chapter

Object State Recognition for Automatic AR-Based Maintenance Guidance

Pavel Dvorak, Radovan Josth, Elisabetta Delponte

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1244 - 1250

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

This paper describes a component of an Augmented Reality (AR) based system focused on supporting workers in manufacturing and maintenance industry. Particularly, it describes a component responsible for verification of performed steps. Correct handling is crucial in both manufacturing and maintenance industries and deviations may cause problems in later stages of the production and assembly. The primary...

chapter

The human detection in images using the depth map

Dmitriy Tatarenkov, Dmitry Podolsky

2017 Systems of Signal Synchronization, Generating and Processing in Telecommunications (SINKHROINFO) > 1 - 4

2017 Systems of Signal Synchronization, Generating and Processing in Telecommunications (SINKHROINFO)

In today world the necessity for the autonomous mobile robots and vehicles is increasing. The safety autonomous moving demands the reliable and fast detection algorithms. The Histogram of Oriented Gradients (HOG) descriptors show significantly outperforms the existing feature sets for a human detection. Though the given method has a lot of type I errors. The amount of these errors can be decreased...

chapter

Evaluation of Different Histogram Distances for Temporal Segmentation in Digital Videos of Football Matches from TV Broadcast

Rodrigo Chacon-Quesada, Francisco Siles-Canales

2017 International Conference and Workshop on Bioinspired Intelligence (IWOBI) > 1 - 7

2017 International Conference and Workshop on Bioinspired Intelligence (IWOBI)

This article gives a more robust justification for the use of the Bhattacharyya distance in the algorithm used by our Automated Sport Analysis System named ACE in the first of its three perception modules. Such first module consists in the temporal segmentation of television video broadcasts, aiming to break down the video into shots, delimited by scene boundaries. An evaluation of other seven histogram...

chapter

Novel Method for Storyboarding Biomedical Videos for Medical Informatics

Sema Candemir, Sameer Antani, Zhiyun Xue, George Thoma

2017 IEEE 30th International Symposium on Computer-Based Medical Systems (CBMS) > 127 - 132

2017 IEEE 30th International Symposium on Computer-Based Medical Systems (CBMS)

We propose a novel method for developing static storyboard for video clips included with biomedical research literature. The technique uses both visual and audio content in the video to select candidate key frames for the storyboard. From the visual channel, the Intra-frames are extracted using FFmpeg tool. IBM Watson speech-to-text service is used to extract words from the audio channel, from which...

chapter

HHT-based remote respiratory rate estimation in thermal images

Duan-Yu Chen, Jyun-Ci Lai

2017 18th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD) > 263 - 268

2017 18th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)

Thermal image has many applications on image processing such as human detection, face recognition and physiological signal evaluation, etc. The respiratory rate is an important physiological signal, and it is highly related to emotion and some diseases. Therefore, we propose a non-contact method to estimate the respiratory rate from thermal image in this paper. Thermal image can provide the information...

chapter

MPEG-7 visual descriptors selection for burn characterization by multidimensional scaling match

Constantin Vertan, Mihai-Sorin Badea, Corneliu Florea, Laura Florea, more

2017 E-Health and Bioengineering Conference (EHB) > 253 - 256

2017 E-Health and Bioengineering Conference (EHB)

This paper presents a new approach towards the selection of color image features to be used in the classification of burn wounds. The features are selected such that they generate similarity matrices and multidimensional scaling (MDS) plots that match the similarity matrix and the MDS-plot resulting from a subjective visual burn area similarity test performed by trained surgeons. We show that standard...

chapter

Color image enhancement using histogram equalization method without changing hue and saturation

Su-Ling Lee, Chien-Cheng Tseng

2017 IEEE International Conference on Consumer Electronics - Taiwan (ICCE-TW) > 305 - 306

2017 IEEE International Conference on Consumer Electronics - Taiwan (ICCE-TW)

In this paper, a color image enhancement method is presented by using intensity histogram equalization (HE) approach without changing hue and saturation in HSI color space. The proposed method has better visual colorfulness than the conventional HE method because hue and saturation are preserved in the enhancement process. The back-lighting image and night-time image are used to demonstrate the effectiveness...

chapter

Automatic endosomal structure detection and localization in fluorescence microscopic images

Dongyun Lin, Zhiping Lin, Ramraj Velmurugan, Raimund J. Ober

2017 IEEE International Symposium on Circuits and Systems (ISCAS) > 1 - 4

2017 IEEE International Symposium on Circuits and Systems (ISCAS)

This paper proposes a modified spatially-constrained similarity measure (mSCSM) method for endosomal structure detection and localization under the bag-of-words (BoW) framework. To our best knowledge, the proposed mSCSM is the first method for fully automatic detection and localization of complex subcellular compartments like endosomes. Essentially, a new similarity score and a novel two-stage output...

chapter

A survey on key frame extraction methods of a MPEG video

Shivangi Pandey, Prashant Dwivedy, Sunil Meena, Anjali Potnis

2017 International Conference on Computing, Communication and Automation (ICCCA) > 1192 - 1196

2017 International Conference on Computing, Communication and Automation (ICCCA)

The key frame extraction helps us to make obtainable summary of a video. After studying a variety of diverse methods of Key frame extraction, we will have comparative analysis of the methods depending on their important features and result. If we want to present the entire video within a squat interval of time, video summary becomes the best alternative for this. This has become a very essential work...

chapter

Simple texture descriptors for classifying monochrome planetary rover terrains

Dhara K. Shukla, Krzysztof Skonieczny

2017 IEEE International Conference on Robotics and Automation (ICRA) > 5495 - 5502

2017 IEEE International Conference on Robotics and Automation (ICRA)

Planetary rovers face mobility hazards associated with various classes of terrains they traverse: sand, bedrock, and rock-strewn terrain. This work develops visual classifiers for these 3 terrain types for single monochrome navigation images from the NASA Mars Exploration Rover missions. The classifiers are based primarily on visual texture, captured in histograms of edges filter responses at various...

chapter

Cross-modal visuo-tactile object recognition using robotic active exploration

Pietro Falco, Shuang Lu, Andrea Cirillo, Ciro Natale, more

2017 IEEE International Conference on Robotics and Automation (ICRA) > 5273 - 5280

2017 IEEE International Conference on Robotics and Automation (ICRA)

In this work, we propose a framework to deal with cross-modal visuo-tactile object recognition. By cross-modal visuo-tactile object recognition, we mean that the object recognition algorithm is trained only with visual data and is able to recognize objects leveraging only tactile perception. The proposed cross-modal framework is constituted by three main elements. The first is a unified representation...

chapter

Design of an SSVEP-based BCI system with visual servo module for a service robot to execute multiple tasks

Shili Sheng, Peipei Song, Lingyue Xie, Zhendong Luo, more

2017 IEEE International Conference on Robotics and Automation (ICRA) > 2267 - 2272

2017 IEEE International Conference on Robotics and Automation (ICRA)

Brain-computer interface (BCI) systems can translate the human mind into control commands, which makes it feasible to improve the life quality of physically challenged people. However, in real-life situations, it is still difficult for users to utilize robots to provide basic services with BCI systems. We aimed to propose a BCI-based system with a visual servo module to operate a service robot. We...

chapter

Structural superpixel descriptor for visual tracking

Wenjun Huang, Ruimin Hu, Chao Liang, Weijian Ruan, more

2017 International Joint Conference on Neural Networks (IJCNN) > 3146 - 3152

2017 International Joint Conference on Neural Networks (IJCNN)

Object representation is a major component in object tracking, however, most conventional patch-based methods just simply decompose the object into patches with grid or stochastic rectangles. This kind of decomposition ignores the intrinsic structure of object, leading to low discriminative power and weak representation effectiveness when similar objects appear or under background clutters. In this...

chapter

Multi-index fusion via similarity matrix pooling for image retrieval

Xin Chen, Jun Wu, Shaoyan Sun, Qi Tian

2017 IEEE International Conference on Communications (ICC) > 1 - 6

ICC 2017 - 2017 IEEE International Conference on Communications

Different kinds of features hold some distinct merits, making them complementary to each other. Inspired by this idea an index level multiple feature fusion scheme via similarity matrix pooling is proposed in this paper. We first compute the similarity matrix of each index, and then a novel scheme is used to pool on these similarity matrices for updating the original indices. Compared with the existing...

INFONA - science communication portal

Search results

Spatial pyramid context-aware moving vehicle detection and tracking in urban aerial imagery

Urban Fusion: Visualizing Urban Data Fused with Social Feeds via a Game Engine

Gaze Embeddings for Zero-Shot Image Classification

Representing word image using visual word embeddings and RNN for keyword spotting on historical document images

A novel framework for enhancement of the low lighting video

Real time video summarization on mobile platform

Object State Recognition for Automatic AR-Based Maintenance Guidance

The human detection in images using the depth map

Evaluation of Different Histogram Distances for Temporal Segmentation in Digital Videos of Football Matches from TV Broadcast

Novel Method for Storyboarding Biomedical Videos for Medical Informatics

HHT-based remote respiratory rate estimation in thermal images

MPEG-7 visual descriptors selection for burn characterization by multidimensional scaling match

Color image enhancement using histogram equalization method without changing hue and saturation

Automatic endosomal structure detection and localization in fluorescence microscopic images

A survey on key frame extraction methods of a MPEG video

Simple texture descriptors for classifying monochrome planetary rover terrains

Cross-modal visuo-tactile object recognition using robotic active exploration

Design of an SSVEP-based BCI system with visual servo module for a service robot to execute multiple tasks

Structural superpixel descriptor for visual tracking

Multi-index fusion via similarity matrix pooling for image retrieval

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options