Search results

Items from 1 to 20 out of 750 results

chapter

Region of Interest Autoencoders with an Application to Pedestrian Detection

Jerome Williams, Gustavo Carneiro, David Suter

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 8

We present the Region of Interest Autoencoder (ROIAE), a combined supervised and reconstruction model for the automatic visual detection of objects. More specifically, we augment the detection loss function with a reconstruction loss that targets only foreground examples. This allows us to exploit more effectively the information available in the sparsely populated foreground training data used in...

chapter

Learning Robust Visual-Semantic Embeddings

Yao-Hung Hubert Tsai, Liang-Kang Huang, Ruslan Salakhutdinov

2017 IEEE International Conference on Computer Vision (ICCV) > 3591 - 3600

Many of the existing methods for learning joint embedding of images and text use only supervised information from paired images and their textual attributes. Taking advantage of the recent success of unsupervised learning in deep neural networks, we propose an end-to-end learning framework that is able to extract more robust multi-modal representations across domains. The proposed method combines representation...

chapter

SurfaceNet: An End-to-End 3D Neural Network for Multiview Stereopsis

Mengqi Ji, Juergen Gall, Haitian Zheng, Yebin Liu, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2326 - 2334

This paper proposes an end-to-end learning framework for multiview stereopsis. We term the network SurfaceNet. It takes a set of images and their corresponding camera parameters as input and directly infers the 3D model. The key advantage of the framework is that both photo-consistency as well as geometric relations of the surface structure can be directly learned for the purpose of multiview stereopsis...

chapter

Satellite super-resolution images depending on deep learning methods: A comparative study

Hatem Magdy Keshk, Xu-Cheng Yin

2017 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC) > 1 - 7

The deep learning neural network is a recent development that has become the subject of research in the computer vision and remote sensing disciplines. Super resolution (SR) images can be obtained using deep neural network methods that achieve a higher performance than all previous traditional methods. Here, in this study, the objective is to describe existing deep learning methods for SR satellite...

chapter

Fine-Tuning Infinity Restricted Boltzmann Machines

Leandro Aparecido Passos, Joao Paulo Papa

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 63 - 70

Restricted Boltzmann Machines (RBMs) have received special attention in the last decade due to their outstanding results in a number of applications, such as face and human motion recognition, and collaborative filtering, among others. However, one of the main concerns about RBMs is related to the number of hidden units, which is application-dependent. Infinite RBM (iRBM) was proposed as an alternative...

chapter

Single Image Super-Resolution Using Multiple Extreme Learning Machine Regressors

Daniel Luis Cosmo, Fernando Kentaro Inaba, Evandro Ottoni Teatini Salles

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 397 - 404

This paper presents a new technique to solve the single image super resolution reconstruction problem based on multiple extreme learning machine regressors, called here MELM. The MELM employs a feature space of low resolution images, divided into subspaces, and one regressor is trained for each one. In the training task, we employ a color dataset containing 91 images, with approximately 5.3 million...

chapter

Stacked sparse autoencoder based fault detection and location method for modular five-level converters

Qiaoxuan Yin, Bin Duan, Mengjun Shen, Xiangshuai Qu

IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society > 1580 - 1585

This paper presents a novel method for fault detection and location in modular five-level converters (MFLC) based on stacked sparse autoencoder (SSAE). SSAE is composed of multiple SAE and a softmax classifier. The capacitor voltage signals of all sub-modules (SMs) in the MFLC circuit are combined into a multi-channel signal. By moving a window along the multi-channel signal, a set of signal segments...

chapter

A detail enhancement strategy for face sketch synthesis based on NSST

Weiguo Wan, Hyo Jong Lee

2017 International Conference on Information and Communication Technology Convergence (ICTC) > 784 - 788

Face sketch synthesis plays an important role in both law enforcement and digital entertainment. The existing methods for sketch synthesis always suffer from noise and blurring effects. To resolve these problems, a nonsubsampled Shearlet transform (NSST) based detail enhancement strategy is proposed. The exemplar-based method is first adopted to synthesize the primary sketch, then the final sketch...

chapter

Minutia-based enhancement of fingerprint samples

Patrick Schuch, Simon Schulz, Christoph Busch

2017 International Carnahan Conference on Security Technology (ICCST) > 1 - 6

Image enhancement is a common pre-processing step before the extraction of biometric features from a fingerprint sample. This can be essential especially for images of low image quality. An ideal fingerprint image enhancement should intend to improve the end-to-end biometric performance, i.e. the performance achieved on biometric features extracted from enhanced fingerprint samples. We use a model...

chapter

Texture-centralized deep convolutional neural network for single image super resolution

Chengqi Li, Zhigang Ren, Bo Yang, Xingyu Wan, more

2017 Chinese Automation Congress (CAC) > 3707 - 3710

There has been significant progress in single image super-resolution (SR) using deep convolutional neural networks. In this paper, we propose a modified deep convolutional neural network model incorporated with image texture priors for single image SR. The model consists of a particular feature extraction layer followed by image reconstruction process, aiming to centralize on the image texture information...

chapter

MemNet: A Persistent Memory Network for Image Restoration

Ying Tai, Jian Yang, Xiaoming Liu, Chunyan Xu

2017 IEEE International Conference on Computer Vision (ICCV) > 4549 - 4557

Recently, very deep convolutional neural networks (CNNs) have been attracting considerable attention in image restoration. However, as the depth grows, the long-term dependency problem is rarely realized for these very deep models, which results in the prior states/layers having little influence on the subsequent ones. Motivated by the fact that human thoughts have persistency, we propose a very deep...

chapter

Image Super-Resolution Using Dense Skip Connections

Tong Tong, Gen Li, Xiejie Liu, Qinquan Gao

2017 IEEE International Conference on Computer Vision (ICCV) > 4809 - 4817

Recent studies have shown that the performance of single-image super-resolution methods can be significantly boosted by using deep convolutional neural networks. In this study, we present a novel single-image super-resolution method by introducing dense skip connections in a very deep network. In the proposed network, the feature maps of each layer are propagated into all subsequent layers, providing...

chapter

3D-PRNN: Generating Shape Primitives with Recurrent Neural Networks

Chuhang Zou, Ersin Yumer, Jimei Yang, Duygu Ceylan, more

2017 IEEE International Conference on Computer Vision (ICCV) > 900 - 909

The success of various applications including robotics, digital content creation, and visualization demands a structured and abstract representation of the 3D world from limited sensor data. Inspired by the nature of human perception of 3D shapes as a collection of simple parts, we explore such an abstract shape representation based on primitives. Given a single depth image of an object, we present...

chapter

Iterative denoising-based mesh-to-grid reconstruction with hyperparametric adaptation

Jan Koloda, Michel Batz, Andre Kaup

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP) > 1 - 5

This paper presents a new method for the reconstruction of images from samples located at non-integer mesh positions. This is a common scenario for many image processing applications such as multi-image super-resolution, frame-rate up-conversion, or virtual view synthesis in multi-camera systems. The proposed method consists of an iterative procedure that employs adaptive denoising in order to reduce...

chapter

Block-based compressed sensing of images via deep learning

Amir Adler, David Boublil, Michael Zibulevsky

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP) > 1 - 6

Compressed sensing (CS) is a signal processing framework for efficiently reconstructing a signal from a small number of measurements, obtained by linear projections of the signal. Block-based CS is a lightweight CS approach that is mostly suitable for processing very high-dimensional images and videos: it operates on local patches, employs a low-complexity reconstruction operator and requires significantly...

chapter

FompNet: Compressive sensing reconstruction with deep learning over wireless fading channels

Lei Bo, Hancheng Lu, Yujiao Lu, Jianwen Meng, more

2017 9th International Conference on Wireless Communications and Signal Processing (WCSP) > 1 - 6

With the ability to reconstruct signals from a highly incomplete number of samples, Compressive Sensing (CS) has been proposed in bandwidth-constrained scenarios like remote sensing, where signals exhibit some degree of redundancy. In CS, reconstruction approaches are of great importance. However, current reconstruction approaches are of high computational complexity because they use greedy or convex...

chapter

Face sketch synthesis using conditional adversarial networks

Chikontwe Philip, Lee Hyo Jong

2017 International Conference on Information and Communication Technology Convergence (ICTC) > 373 - 378

In this paper, we explore the use of the recent conditional generative adversarial network framework for image-to-image translation applied to the domain of heterogeneous face sketch synthesis. Since the inception of the adversarial framework in 2014, great success has been noted with several variants to date. Further, we introduce a new dataset for composite sketch images. In particular we explore...

chapter

Fuzzy restricted Boltzmann machine and deep belief network: A comparison on image reconstruction

Feng Shuang, C. L. Philip Chen

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 1828 - 1833

The fuzzy restricted Boltzmann machine (FRBM) is demonstrated to have better generative and discriminative capabilities than traditional RBM. We now further investigate and compare the generative ability of DBN with FRBM on image reconstruction. The DBN is pre-trained by stacking RBMs layer by layer and then fine-tuned by the wake-sleep algorithm. Then the FRBM, RBM and DBN are compared in detail...

chapter

Deep learning-bat high-dimensional missing data estimator

Collins Leke, A. R. Ndjiongue, Bhekisipho Twala, Tshilidzi Marwala

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 483 - 488

In recent years, several new methods for missing data estimation have been developed. Real world datasets possess the properties of big data being volume, velocity and variety. With an increase in volume which includes sample size and dimensionality, existing imputation methods have become less effective and accurate. Much attention has been given to narrow Artificial Intelligence frameworks courtesy...

chapter

Learning Lightprobes for Mixed Reality Illumination

David Mandl, Kwang Moo Yi, Peter Mohr, Peter M. Roth, more

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR) > 82 - 89

This paper presents the first photometric registration pipeline for Mixed Reality based on high quality illumination estimation using convolutional neural networks (CNNs). For easy adaptation and deployment of the system, we train the CNNs using purely synthetic images and apply them to real image data. To keep the pipeline accurate and efficient, we propose to fuse the light estimation results from...

Keywords:
TRAINING
IMAGE RECONSTRUCTION

