Search results

Items from 21 to 40 out of 455 results

chapter

Deep Local Video Feature for Action Recognition

Zhenzhong Lan, Yi Zhu, Alexander G. Hauptmann, Shawn Newsam

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1219 - 1225

We investigate the problem of representing an entire video using CNN features for human action recognition. End-to-end learning of CNN/RNNs is currently not possible for whole videos due to GPU memory limitations and so a common practice is to use sampled frames as inputs along with the video labels as supervision. However, the global video labels might not be suitable for all of the temporally local...

chapter

Signal Classification in Quotient Spaces via Globally Optimal Variational Calculus

Gregory S. Chirikjian

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 735 - 743

A ubiquitous problem in pattern recognition is that of matching an observed time-evolving pattern (or signal) to a gold standard in order to recognize or characterize the meaning of a dynamic phenomenon. Examples include matching sequences of images in two videos, matching audio signals in speech recognition, or matching framed trajectories in robot action recognition. This paper shows that all of...

chapter

A novel fast terminal sliding mode control method based on immersion and invariance for course control of USV

Xue Wentao, Zhang Chen, Li Jianzhen, Wang Yulong

2017 36th Chinese Control Conference (CCC) > 3212 - 3217

The course control of a water-jet-propelled unmanned surface vehicle (USV) is addressed using a novel fast terminal sliding mode control approach based on system immersion and manifold invariance (FTSMC-I&I). The control scheme can ensure that all error signals globally exponentially converge to the origin in finite time through the novel fast terminal sliding mode controller. In addition, the I&I method...

chapter

Semantic Instance Segmentation for Autonomous Driving

Bert De Brabandere, Davy Neven, Luc Van Gool

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 478 - 480

Semantic instance segmentation remains a challenge. We propose to tackle the problem with a discriminative loss function, operating at pixel level, that encourages a convolutional network to produce a representation of the image that can easily be clustered into instances with a simple post-processing step. Our approach of combining an off-the-shelf network with a principled loss function inspired...

chapter

Deceiving Google’s Cloud Video Intelligence API Built for Summarizing Videos

Hossein Hosseini, Baicen Xiao, Radha Poovendran

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1305 - 1309

Despite the rapid progress of the techniques for image classification, video annotation has remained a challenging task. Automated video annotation would be a breakthrough technology, enabling users to search within the videos. Recently, Google introduced the Cloud Video Intelligence API for video analysis. As per the website, the system can be used to "separate signal from noise, by retrieving...

chapter

Learning Dynamic GMM for Attention Distribution on Single-Face Videos

Yun Ren, Zulin Wang, Mai Xu, Haoyu Dong, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1632 - 1641

The past decade has witnessed the popularity of video conferencing, such as FaceTime and Skype. In video conferencing, almost every frame has a human face. Hence, it is necessary to predict attention on face videos by saliency detection, as saliency can be used as guidance on the region-of-interest (ROI) for content-based applications. To this end, this paper proposes a novel approach for saliency...

chapter

Object-Specific Style Transfer Based on Feature Map Selection Using CNNs

Ayumu Shinya, Nguyen Duc Tung, Tomohiro Harada, Ruck Thawonmas

2017 Nicograph International (NicoInt) > 88

We propose a method for transferring an arbitrary style to only a specific object in an image. Style transfer is the process of combining the content of one image and the style of another image into a new image. Our results show that the proposed method can realize style transfer to a specific object.

chapter

Integrating a Priori Probabilistic Knowledge into Classification for Image Description

Andrea Apicella, Anna Corazza, Francesco Isgro, Giuseppe Vettigli

2017 IEEE 26th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE) > 197 - 199

This paper discusses a possible implementation of the integration of knowledge from a probabilistic ontology in the automatic description of images. This combination not only provides the relations existing between the different segments, but also improves the classification accuracy, as the context often gives cues suggesting the correct class of the segment.

chapter

Keyboard recognition from scale-invariant feature transform

Ming-Te Chao, Yung-Sheng Chen

2017 IEEE International Conference on Consumer Electronics - Taiwan (ICCE-TW) > 205 - 206

Based on the scale-invariant feature transform, this paper presents an approach to keyboard recognition. Not only can the skewed keyboard be corrected, but the keys in the keyboard can also be located. Experimental results confirm the feasibility of the proposed method.

chapter

Phasic maximal and local maximal occurrence representation for video-based person re-identification

Gang Liu, Chang Tian, Ze-Min Wu

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN) > 1187 - 1190

This paper proposes a new spatio-temporal appearance feature named Phasic Maximal and Local Maximal Occurrence (PM-LOMO) representation for video-based person re-identification. To perform temporal alignment of the sequence, we select the optimal period of the walking cycle and divide frames into several phases based on the extreme points of the sequence's Flow Energy Profile (FEP). To describe the...

chapter

Pattern detection and recognition in SAR images

Ievgen M. Gorovyi, Dmytro S. Sharapov

2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON) > 123 - 126

Synthetic aperture radar (SAR) is a powerful tool for remote sensing of the Earth's surface. In this paper, several applications of pattern detection and recognition algorithms for extracting information from SAR images are discussed. In particular, the use of optical flow techniques for automatic estimation of moving target displacements from a sequence of single-look SAR images is proposed...

chapter

The use of ART-2 neural network for processing information signals of non-destructive testing

R. M. Galagan, A. S. Momot

2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON) > 981 - 985

This paper describes a universal approach to developing an intelligent automated digital signal processing system for acoustic testing devices that use the free vibrations method and artificial neural networks. The system solves the problem of defect recognition and classification, and improves testing performance in comparison with traditional instruments.

chapter

Real-time lane marking detection using modified 1-bit transform based pre-processing

Ayhan Kucukmanisa, Ramazan Duvar, Oguzhan Urhan

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

In this work, an image processing based lane-detection approach is proposed. In the proposed approach, candidate pixels that can be used for lane markings are detected by making use of 1-bit transform as a pre-processing step. Next, feature points are extracted via Sobel filter and candidate lane markings are decided employing a correlation and Hough transform based approach. Finally, Kalman filter...

chapter

Effect of patch based training on object localization with convolutional neural networks

Semih Orhan, Yalin Bastanlar

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

In recent years, Convolutional Neural Networks (CNNs) have shown great performance not only in image classification and image recognition tasks but also in several other computer vision tasks. Many models with different numbers of layers and depths have been proposed. In this work, deep neural networks are used to try to identify the locations of leopards. To accomplish this task, two different...

chapter

Deep distance metric learning for maritime vessel identification

Erhan Gundogdu, Berkan Solmaz, Aykut Koc, Veysel Yucesoy, et al.

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

This paper addresses the problem of maritime vessel identification by exploiting the state-of-the-art techniques of distance metric learning and deep convolutional neural networks since vessels are the key constituents of marine surveillance. In order to increase the performance of visual vessel identification, we propose a joint learning framework which considers a classification and a distance metric...

chapter

Scene detection via depth maps of 3 dimensional videos

Huseyin Bayrak, Gokce Nur Yilmaz

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

Scene detection via processing of multimedia data is a significant research area for the advancement of video technologies and applications. Currently, scene detection is mostly performed manually, which is time consuming and costly. Therefore, it is important to develop algorithms that can automatically segment scenes to support the advancement of these technologies and applications. With...

chapter

Understanding a city from its visuals: An interdisciplinary program proposal

Ceyhun Burak Akgul, Onur Nizam Sonmez, Sema Alacam

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

This study aims to provide an overview of the intersection and interaction between the fields of architecture, urban modeling and planning and the field of computer vision. The reflection of the methods and approaches of fields such as visual recognition, natural language processing, data mining and data visualization onto architecture and urban studies is investigated and potentials of inter/transdisciplinary encounters...

chapter

Unclassified wheat identification with bag of contour fragments

Ahmet Okan Onarcan, Kemal Ozkan, Murat Olgun

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

Production of high-quality wheat is of great importance, especially in solving nutrition problems. Wheat must be separated into classes in order to specify its quality. Here, recognition of high-quality and unclassified wheat is performed. The most distinctive feature between high-quality and poor-quality wheat is the difference in shape. In this study, Bag of Contour Fragments (BCF) was used as a shape...

chapter

Action recognition with skeletal volume and deep learning

Ali Seydi Keceli, Aydin Kaya, Ahmet Burak Can

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

The use of depth sensors in activity recognition is an emerging technology in human-computer interaction and motion recognition. In this study, an approach to identify single-person activities using deep learning on depth image sequences is presented. First, a 3D volumetric template is generated using skeletal information obtained from a depth video. The generated 3D volume is used for extracting...

chapter

Object detection with convolutional context features

Emre Can Kaya, A. Aydin Alatan

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

A novel extension to the Fast R-CNN object detection algorithm is proposed in order to learn convolutional context features for better determining the boundaries of objects. For input images, the hypothesis windows and the context around those windows are learned through convolutional layers as two parallel networks. The resulting object and context feature maps are combined in such a way that they preserve...

Keywords:
CONFERENCES
PATTERN RECOGNITION

INFONA - science communication portal
