Search results

Items from 141 to 160 out of 9,934 results

1 ...
5
6
7
8
9
10
11

chapter

A saliency detection model combined local and global features

Pin Wang, Guohui Tian, Huanzhao Chen

2017 Chinese Automation Congress (CAC) > 2863 - 2870

2017 Chinese Automation Congress (CAC)

Most present methods of saliency detection emphasize too much on the local contrast while ignore the global feature of image. The detailed characteristics of the image can be reflected based on the local comparison of image. However, the overall saliency of the image cannot be reflected. In this paper, a saliency detection model combined local and global features was proposed. Firstly, a local feature...

chapter

Research on heart sound recognition based on support vector machine

Yutai Wang, Boyuan Sun, Xinghai Yang, Qingfang Meng

2017 Chinese Automation Congress (CAC) > 62 - 65

2017 Chinese Automation Congress (CAC)

The present status of heart sound recognition is introduced in the paper. In order to improve the performance of heart sound recognition, a new model based on SVM is proposed. Firstly, the wavelet transform is used to reduce the noise of the heart sound, and then MFCC feature is extracted from heart sound. On this basis, the Support Vector Machine is used to build the classification model. In the...

chapter

Moving object tracking with feature learning and inheriting

Sun Xiaoyan, Chang Faliang

2017 Chinese Automation Congress (CAC) > 899 - 903

2017 Chinese Automation Congress (CAC)

Moving object tracking with discriminative model is very popular in recent years, which focuses on online selecting highly informative features to maximize the separability between object and background. An adapted particle filter tracker with online learning and inheriting discriminative model is proposed in this paper. Top-ranked discriminative features are selected into appearance model by Online...

chapter

Robust object detection for tiny and dense targets in VHR aerial images

Haining Xie, Tian Wang, Meina Qiao, Mengyi Zhang, more

2017 Chinese Automation Congress (CAC) > 6397 - 6401

2017 Chinese Automation Congress (CAC)

Object detection in Very High Resolution (VHR) optical remote sensing images is a challenged work for objects are usually dense and tiny. With random orientation, various backgrounds as well as unpredictable noise make traditional image processing methods perform badly. In this paper, we propose using state-of-art Region-based fully convolutional networks to solve object detection tasks in aerial...

chapter

Face liveness detection based on enhanced local binary patterns

Xiaolei Liu, Runge Lu, Wei Liu

2017 Chinese Automation Congress (CAC) > 6301 - 6305

2017 Chinese Automation Congress (CAC)

For face recognition systems, impostors can obtain legal identity authentication by presenting the printed images, the downloaded images or candid videos to the sensor. In this paper, an enhanced face local binary feature (ELBP) of a face map is extracted as a classification feature to identify whether the face map is a real face or a fake face. Compared with the dynamic or static methods proposed...

chapter

Recognition and simulation of parachute action based on continuous hidden Markov model

Xuan Gong, Liang Han, Jiangyun Wang, Maopeng Ran

2017 Chinese Automation Congress (CAC) > 4108 - 4113

2017 Chinese Automation Congress (CAC)

Building a human-computer interactive parachute simulator is an efficient way to avoid the high risk and high cost of field parachute training. In this paper, a novel dynamic recognition and simulation approach of parachute training is developed. Firstly we process the skeletal data acquired by Kinect and enforce the indication of the trainees' parachute posture, where principle component analysis...

chapter

MemNet: A Persistent Memory Network for Image Restoration

Ying Tai, Jian Yang, Xiaoming Liu, Chunyan Xu

2017 IEEE International Conference on Computer Vision (ICCV) > 4549 - 4557

2017 IEEE International Conference on Computer Vision (ICCV)

Recently, very deep convolutional neural networks (CNNs) have been attracting considerable attention in image restoration. However, as the depth grows, the longterm dependency problem is rarely realized for these very deep models, which results in the prior states/layers having little influence on the subsequent ones. Motivated by the fact that human thoughts have persistency, we propose a very deep...

chapter

Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training

Rakshith Shetty, Marcus Rohrbach, Lisa Anne Hendricks, Mario Fritz, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4155 - 4164

2017 IEEE International Conference on Computer Vision (ICCV)

While strong progress has been made in image captioning recently, machine and human captions are still quite distinct. This is primarily due to the deficiencies in the generated word distribution, vocabulary size, and strong bias in the generators towards frequent captions. Furthermore, humans – rightfully so – generate multiple, diverse captions, due to the inherent ambiguity in the captioning task...

chapter

Regional Interactive Image Segmentation Networks

JunHao Liew, Yunchao Wei, Wei Xiong, Sim-Heng Ong, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2746 - 2754

2017 IEEE International Conference on Computer Vision (ICCV)

The interactive image segmentation model allows users to iteratively add new inputs for refinement until a satisfactory result is finally obtained. Therefore, an ideal interactive segmentation model should learn to capture the user's intention with minimal interaction. However, existing models fail to fully utilize the valuable user input information in the segmentation refinement process and thus...

chapter

Unsupervised Learning of Stereo Matching

Chao Zhou, Hong Zhang, Xiaoyong Shen, Jiaya Jia

2017 IEEE International Conference on Computer Vision (ICCV) > 1576 - 1584

2017 IEEE International Conference on Computer Vision (ICCV)

Convolutional neural networks showed the ability in stereo matching cost learning. Recent approaches learned parameters from public datasets that have ground truth disparity maps. Due to the difficulty of labeling ground truth depth, usable data for system training is rather limited, making it difficult to apply the system to real applications. In this paper, we present a framework for learning stereo...

chapter

Online Multi-object Tracking Using CNN-Based Single Object Tracker with Spatial-Temporal Attention Mechanism

Qi Chu, Wanli Ouyang, Hongsheng Li, Xiaogang Wang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4846 - 4855

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper, we propose a CNN-based framework for online MOT. This framework utilizes the merits of single object trackers in adapting appearance models and searching for target in the next frame. Simply applying single object tracker for MOT will encounter the problem in computational efficiency and drifted results caused by occlusion. Our framework achieves computational efficiency by sharing...

chapter

Chinese license plate character recognition based on convolution neural network

Donghui Yao, Wenxing Zhu, Yanjun Chen, Lidong Zhang

2017 Chinese Automation Congress (CAC) > 1547 - 1552

2017 Chinese Automation Congress (CAC)

Considering the problems of low recognition rate and poor robustness in traditional recognition algorithms, we propose a license plate character recognition algorithm based on convolution neural network. In this paper, we adopt a coarse-to-fine strategy for designing the network architecture. Through the convolutional layers and pooling layers, features of input images will be extracted and then sent...

chapter

Multi-label Learning of Part Detectors for Heavily Occluded Pedestrian Detection

Chunluan Zhou, Junsong Yuan

2017 IEEE International Conference on Computer Vision (ICCV) > 3506 - 3515

2017 IEEE International Conference on Computer Vision (ICCV)

Detecting pedestrians that are partially occluded remains a challenging problem due to variations and uncertainties of partial occlusion patterns. Following a commonly used framework of handling partial occlusions by part detection, we propose a multi-label learning approach to jointly learn part detectors to capture partial occlusion patterns. The part detectors share a set of decision trees via...

chapter

Spatio-Temporal Person Retrieval via Natural Language Queries

Masataka Yamaguchi, Kuniaki Saito, Yoshitaka Ushiku, Tatsuya Harada

2017 IEEE International Conference on Computer Vision (ICCV) > 1462 - 1471

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper, we address the problem of spatio-temporal person retrieval from videos using a natural language query, in which we output a tube (i.e., a sequence of bounding boxes) which encloses the person described by the query. For this problem, we introduce a novel dataset consisting of videos containing people annotated with bounding boxes for each second and with five natural language descriptions...

chapter

SSD-6D: Making RGB-Based 3D Detection and 6D Pose Estimation Great Again

Wadim Kehl, Fabian Manhardt, Federico Tombari, Slobodan Ilic, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1530 - 1538

2017 IEEE International Conference on Computer Vision (ICCV)

We present a novel method for detecting 3D model instances and estimating their 6D poses from RGB data in a single shot. To this end, we extend the popular SSD paradigm to cover the full 6D pose space and train on synthetic model data only. Our approach competes or surpasses current state-of-the-art methods that leverage RGBD data on multiple challenging datasets. Furthermore, our method produces...

chapter

A scale-invariant framework for image classification with deep learning

Yalong Jiang, Zheru Chi

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 1019 - 1024

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

In this paper, we propose a scale-invariant framework based on Convolutional Neural Networks (CNNs). The network exhibits robustness to scale and resolution variations in data. Previous efforts in achieving scale invariance were made on either integrating several variant-specific CNNs or data augmentation. However, these methods did not solve the fundamental problem that CNNs develop different feature...

chapter

Blind quality assessment for contrast changed images

Dixiu Zhong, Ping Shi, Da Pan, Ming Hou, more

2017 IEEE 3rd Information Technology and Mechatronics Engineering Conference (ITOEC) > 494 - 498

2017 IEEE 3rd Information Technology and Mechatronics Engineering Conference (ITOEC)

Contrast of image plays an important role in image perception quality and is also susceptive to various factors during image acquisition process. However, only a few image quality evaluation algorithms have been focused on the contrast-changed image quality assessment (IQA), and none of these methods belongs to blind IQA algorithms. Therefore, they cannot be applied to the case when the reference...

chapter

Binaural and log-power spectra features with deep neural networks for speech-noise separation

Alfredo Zermini, Qingju Liu, Yong Xu, Mark D. Plumbley, more

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP) > 1 - 6

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP)

Binaural features of interaural level difference and interaural phase difference have proved to be very effective in training deep neural networks (DNNs), to generate time-frequency masks for target speech extraction in speech-speech mixtures. However, effectiveness of binaural features is reduced in more common speech-noise scenarios, since the noise may over-shadow the speech in adverse conditions...

chapter

Analysis of data fusion techniques for multi-microphone audio event detection in adverse environments

Irene Martin-Morato, Maximo Cobos, Francesc J. Ferri

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP) > 1 - 6

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP)

Acoustic event detection (AED) is currently a very active research area with multiple applications in the development of smart acoustic spaces. In this context, the advances brought by Internet of Things (IoT) platforms where multiple distributed microphones are available have also contributed to this interest. In such scenarios, the use of data fusion techniques merging information from several sensors...

chapter

Improving missing issue-commit link recovery using positive and unlabeled data

Yan Sun, Celia Chen, Qing Wang, Barry Boehm

2017 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE) > 147 - 152

2017 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE)

Links between issue reports and corresponding fix commits are widely used in software maintenance. The quality of links directly affects maintenance costs. Currently, such links are mainly maintained by error-prone manual efforts, which may result in missing links. To tackle this problem, automatic link recovery approaches have been proposed by building traditional classifiers with positive and negative...

1 ...
5
6
7
8
9
10
11

Keywords:
TRAINING
FEATURE EXTRACTION

Publication date

Set your own date range

Content availability

Available (9,864)
None (70)

Keywords

SUPPORT VECTOR MACHINES (2,144)
ACCURACY (1,520)
CLASSIFICATION ALGORITHMS (1,256)
DATABASES (1,219)
DATA MINING (1,118)
FACE (1,044)
FACE RECOGNITION (980)
TESTING (964)
IMAGE CLASSIFICATION (865)
ARTIFICIAL NEURAL NETWORKS (842)
PRINCIPAL COMPONENT ANALYSIS (743)
KERNEL (741)
HIDDEN MARKOV MODELS (722)
MACHINE LEARNING (665)
VISUALIZATION (665)
IMAGE SEGMENTATION (658)
SPEECH (648)
LEARNING (ARTIFICIAL INTELLIGENCE) (640)
HISTOGRAMS (622)
NEURAL NETWORKS (620)
IMAGE COLOR ANALYSIS (578)
VECTORS (577)
IMAGE RECOGNITION (541)
SHAPE (539)
OBJECT DETECTION (503)
DETECTORS (500)
SUPPORT VECTOR MACHINE (476)
PATTERN RECOGNITION (470)
COMPUTATIONAL MODELING (466)
PATTERN CLASSIFICATION (460)
PIXEL (446)
TRAINING DATA (429)
CLASSIFICATION (411)
SPEECH RECOGNITION (377)
ROBUSTNESS (372)
NEURAL NETS (371)
NEURONS (369)
SUPPORT VECTOR MACHINE CLASSIFICATION (356)
ALGORITHM DESIGN AND ANALYSIS (355)
FEATURE SELECTION (355)
COMPUTER VISION (347)
SVM (346)
CAMERAS (329)
CORRELATION (304)
IMAGE EDGE DETECTION (303)
ELECTROENCEPHALOGRAPHY (301)
MATHEMATICAL MODEL (296)
DATA MODELS (293)
HUMANS (292)
HANDWRITING RECOGNITION (291)
ESTIMATION (289)
SEMANTICS (286)
DICTIONARIES (272)
TRANSFORMS (272)
WAVELET TRANSFORMS (272)
DEEP LEARNING (258)
CHARACTER RECOGNITION (255)
OBJECT RECOGNITION (248)
CONVOLUTION (247)
MEL FREQUENCY CEPSTRAL COEFFICIENT (244)
TEXT ANALYSIS (235)
OPTIMIZATION (231)
NEURAL NETWORK (222)
CLUSTERING ALGORITHMS (216)
LIGHTING (206)
IMAGE RESOLUTION (204)
ENTROPY (198)
STATISTICAL ANALYSIS (198)
MEASUREMENT (195)
NOISE (195)
ACOUSTICS (189)
CONTEXT (186)
IMAGE RETRIEVAL (182)
IMAGE REPRESENTATION (180)
CONFERENCES (178)
EMOTION RECOGNITION (178)
IMAGE PROCESSING (177)
COMPUTER ARCHITECTURE (176)
NATURAL LANGUAGE PROCESSING (175)
GENETIC ALGORITHMS (172)
BOOSTING (169)
SIGNAL PROCESSING (169)
FACE DETECTION (168)
PREDICTIVE MODELS (168)
BIOLOGICAL NEURAL NETWORKS (167)
ENCODING (166)
STANDARDS (165)
IMAGE RECONSTRUCTION (164)
VEHICLES (163)
DECISION TREES (162)
PCA (161)
THREE-DIMENSIONAL DISPLAYS (161)
EIGENVALUES AND EIGENFUNCTIONS (154)
IMAGE TEXTURE (154)
MEDICAL IMAGE PROCESSING (154)
VIDEOS (154)
BIOMETRICS (ACCESS CONTROL) (150)
EDUCATIONAL INSTITUTIONS (150)
more

INFONA - science communication portal

Search results

A saliency detection model combined local and global features

Research on heart sound recognition based on support vector machine

Moving object tracking with feature learning and inheriting

Robust object detection for tiny and dense targets in VHR aerial images

Face liveness detection based on enhanced local binary patterns

Recognition and simulation of parachute action based on continuous hidden Markov model

MemNet: A Persistent Memory Network for Image Restoration

Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training

Regional Interactive Image Segmentation Networks

Unsupervised Learning of Stereo Matching

Online Multi-object Tracking Using CNN-Based Single Object Tracker with Spatial-Temporal Attention Mechanism

Chinese license plate character recognition based on convolution neural network

Multi-label Learning of Part Detectors for Heavily Occluded Pedestrian Detection

Spatio-Temporal Person Retrieval via Natural Language Queries

SSD-6D: Making RGB-Based 3D Detection and 6D Pose Estimation Great Again

A scale-invariant framework for image classification with deep learning

Blind quality assessment for contrast changed images

Binaural and log-power spectra features with deep neural networks for speech-noise separation

Analysis of data fusion techniques for multi-microphone audio event detection in adverse environments

Improving missing issue-commit link recovery using positive and unlabeled data

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options