2016 Visual Communications and Image Processing (VCIP)

Items from 1 to 20 out of 183 results

Front cover

Keynotes: Deep learning for visual understanding: Effectiveness vs. efficiency

Shuicheng Yan

In this talk, I shall introduce a series of research efforts related to deep learning for visual understanding, focusing on three aspects: 1) how to keep the model size small while maintaining high accuracy, 2) how to design proper objective functions to enhance algorithmic learnability, and 3) how to design proper network structures to accelerate inference.

Keynotes: Multimedia-the past, today, and the future

Ari Visa

Multimedia is defined by Webster Dictionary as "using or involving several forms of communication or expression". A commonly cited example is a multimedia exhibit of photographs, films, and music. In the 1993 first edition of McGraw-Hill's Multimedia: Making It Work, Tay Vaughan declared "Multimedia is any combination of text, graphic art, sound, animation, and video that...

Message from the VCIP 2016 program chairs

Schedule manuscript

Welcome message from general co-chairs

Area chairs

Special session chairs

Keynotes: From geo-referenced pixels to knowledge

Luc Vincent

Our planet is photographed on a daily basis by dozens of imaging satellites, hundreds of airplanes and drones, and thousands of cars collecting street-level imagery. This imagery is critical to consumer products such as Google Earth or Google Street View, which let users travel virtually and explore any destination around the world. In addition, it is used by governments and commercial entities to...

Tutorials: 3D video processing techniques for immersive contents generation

Yo-Sung Ho

With the emerging market of 3D imaging products, 3D video has become an active area of research and development in recent years. 3D video is the key to providing more realistic and immersive perceptual experiences than the existing 2D counterpart. There are many applications of 3D video, such as 3D movies and 3DTV, which are considered the main drivers of the next-generation technical revolution. Stereoscopic...

Organizing committee

A novel mode decision for depth map coding in 3D-AVS

Jing Su, Falei Luo, Shanshe Wang, Shiqi Wang, et al.

In this paper, a new mode decision scheme is proposed for depth map coding in 3D-AVS. The paper makes two main contributions. First, an improved distortion estimation model of synthesized views is proposed. Second, for the mode decision of depth map coding, the distortion is represented as the weighted sum of the depth distortion and the estimated distortion of the synthesized...

Internal-video mode dependent directional transform

Xiaolei Li, Cuiling Lan, Yunhui Shi, Wenpeng Ding, et al.

As projections of the real world, videos usually have many repeated patterns with similar structures across regions, presenting strong non-local correlations. Moreover, different videos have different characteristics. Exploiting the non-local correlations through off-line training of transforms has attracted considerable attention over the past years for compression. However, the samples used for...

Shot boundary detection using convolutional neural networks

Jingwei Xu, Li Song, Rong Xie

Video shot boundary detection (SBD) is necessary for further video analysis such as video retrieval and annotation. Great efforts have been made to improve the speed and accuracy of SBD algorithms. Most works use frame histograms as features to measure similarity for detection. However, when changes between consecutive shot boundaries are small and their backgrounds are highly similar, most state-of-the-art...

Background-foreground information based bit allocation algorithm for surveillance video on high efficiency video coding (HEVC)

Xiujuan Li, Yimamu'aishan Abudoulikemu

Bit allocation plays an important role in rate control because it determines the calculation of the other parameters of the rate control model. For surveillance videos, however, bit distribution analysis shows that the foreground parts should get more bits and be encoded at higher quality than the background parts. By utilizing the background and foreground information (BFI) provided by surveillance videos,...

Scene flow estimation through 3D analysis of multi-focus images

Hiroyoshi Fujii, Kazuya Kodama, Takayuki Hamamoto

If scene flow expressed in three-dimensional (3D) vector fields is robustly estimated from multi-view or multi-focus images, we can develop advanced 3D motion tracking and motion compensation for 3D video compression. In this study, based on a synthesis of multi-focus images from multi-view images, we propose a novel method for analyzing 3D scene flow accurately at low computational cost as an extension...

Pyramid stereo matching for spherical panoramas

Jian Weng, Wei Zhang, Weidong Zhang, Jianjie Gao

This paper presents a novel pyramid stereo matching method to improve the matching accuracy of panoramas. Initial camera parameters and feature correspondences are obtained from Structure From Motion (SFM) with normal images extracted from two panoramas. Then a stereo matching pyramid is constructed to refine the feature correspondences layer by layer, and the correspondence is corrected in the original...

Error models of finite word length arithmetic in CNN accelerator design

Yuan Gao, Zhenyu Liu, Dongsheng Wang

The convolutional neural network (CNN) is a state-of-the-art machine learning algorithm. For CNN accelerator implementations, fixed-point and floating-point are two typical numeric representations. Because of rounding effects, reducing the word length saves hardware and power overhead at the cost of computation accuracy. The inherent robustness of neural networks makes it possible...

Q-DNN: A quality-aware deep neural network for blind assessment of enhanced images

Qingbo Wu, Hongliang Li, Fanman Meng, King N. Ngan

Image enhancement is widely popular due to its capability of producing "better" visual quality for specific applications. Although many enhancement algorithms have been developed in recent years, studies on the blind assessment of enhanced images are still lacking. In this paper, we propose a data-driven blind image quality assessment (BIQA) method based on the quality-aware deep...

Face recognition using training data with artificial occlusions

Hao Liu, Huiping Duan, Hongyu Cui, Yunjie Yin

In face recognition for criminal identification, the training data are always clean while the probe data are occluded by sunglasses, scarves, or other facial accessories. Occlusions in the probe data severely degrade the recognition performance. We find that introducing artificial occlusions into the training data is helpful in this situation. The incremental training data are decomposed into a class-specific...
