Wyniki wyszukiwania

Pozycje od 61 do 80 spośród 1 136 wyników

Poprzednia

Następna

rozdział

Constructing a hierarchical tree for image annotation

Jiwei Hu, Kin-Man Lam, Ping Lou, Quan Liu

2017 IEEE International Conference on Multimedia and Expo (ICME) > 265 - 270

2017 IEEE International Conference on Multimedia and Expo (ICME)

Image annotation is always an easy task for humans but a tough task for machines. Inspired by human's thinking mode, there is an assumption that the computer has double systems. Each of the systems can handle the task individually and in parallel. In this paper, we introduce a new hierarchical model for image annotation, based on constructing a novel, hierarchical tree, which consists of exploring...

rozdział

Learning deep and sparse feature representation for fine-grained object recognition

M. Srinivas, Yen-Yu Lin, Hong-Yuan Mark Liao

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1458 - 1463

2017 IEEE International Conference on Multimedia and Expo (ICME)

In this paper, we address fine-grained classification which is quite challenging due to high intra-class variations and subtle inter-class variations. Most modern approaches to fine-grained recognition are established based on convolutional neural networks (CNN). Despite the effectiveness, these approaches still suffer from two major problems. First, they highly rely on large sets of training data,...

rozdział

A novel convolutional neural network architecture for image super-resolution based on channels combination

Cun Liu, Yuanxiang Li, Jianhua Luo, Yongjun Zhou

2017 20th International Conference on Information Fusion (Fusion) > 1 - 8

2017 20th International Conference on Information Fusion (Fusion)

Several models based on deep neural networks have applied to single image super-resolution and obtained great improvements in terms of both reconstruction accuracy and computational performance. All these methods focus either on performing the super-resolution (SR) reconstruction operation in the high resolution (HR) space after upscaling with a single filter, usually bicubic interpolation, or optimizing...

rozdział

Single-channel speech separation based on robust sparse Bayesian learning

Zhe Wang, Guoan Bi, Xiumei Li

2017 13th IEEE International Conference on Control & Automation (ICCA) > 113 - 117

2017 13th IEEE International Conference on Control & Automation (ICCA)

This paper describes a novel algorithm to improve the performance of sparsity based single-channel speech separation(SCSS) problem based on compressed sensing which is an emerging technique for efficient data reconstruction. The conventional approach assumes the mixing conditions and source signals are stationary. For practical applications of audio source separation, however, we face the challenges...

rozdział

Domain transfer sparse representation for single sample face recognition

Venice Erin Liong, Haibin Yan

2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW) > 663 - 668

2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)

In this paper, we propose a new single sample face recognition approach under the widely used sparse representation-based classification (SRC) framework. Previous work has shown that SRC only works well when there are sufficient number of training samples per person and not suitable for SSFR. To address this, we propose a domain transfer sparse representation-based classification (DT-SRC) method by...

rozdział

VIDEOWHISPER: Towards unsupervised learning of discriminative features of videos with RNN

Na Zhao, Hanwang Zhang, Mingxing Zhang, Richang Hong, więcej

2017 IEEE International Conference on Multimedia and Expo (ICME) > 277 - 282

2017 IEEE International Conference on Multimedia and Expo (ICME)

We present VidedWhisfer, a novel approach for unsupervised video representation learning, in which video sequence is treated as a self-supervision entity based on the observation that the sequence encodes video temporal dynamics (e.g., object movement and event evolution). Specifically, for each video sequence, we use a pre-learned visual dictionary to generate a sequence of high-level semantics,...

rozdział

Single depth image super-resolution with multiple residual dictionary learning and refinement

Lijun Zhao, Huihui Bai, Jie Liang, Anhong Wang, więcej

2017 IEEE International Conference on Multimedia and Expo (ICME) > 739 - 744

2017 IEEE International Conference on Multimedia and Expo (ICME)

Learning-based image super-resolution methods often use large datasets to learn texture features. When these methods are applied to depth images, emphasis should be given on learning the geometrical structures at object boundaries, since depth images do not have much texture information. In this paper, we develop a scheme to learn multiple residual dictionaries from only one external image. After...

rozdział

Parsimonious Coding and Verification of Offline Handwritten Signatures

Elias N. Zois, Ilias Theodorakopoulos, Dimitrios Tsourounis, George Economou

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 636 - 645

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

A common practice for addressing the problem of verifying the presence, or the consent of a person in many transactions is to utilize the handwritten signature. Among others, the offline or static signature is a valuable tool in forensic related studies. Thus, the importance of verifying static handwritten signatures still poses a challenging task. Throughout the literature, gray-level images, composed...

rozdział

Recognition and retrieval of sound events using sparse coding convolutional neural network

Chien-Yao Wang, Andri Santoso, Seksan Mathulaprangsan, Chin-Chin Chiang, więcej

2017 IEEE International Conference on Multimedia and Expo (ICME) > 589 - 594

2017 IEEE International Conference on Multimedia and Expo (ICME)

This paper proposes a novel deep convolutional neural network (CNN), called sparse coding convolutional neural network (SC-CNN), to address the problem of sound event recognition and retrieval task. Unlike the general framework of a CNN, in which feature learning process is performed hierarchically, the proposed framework models the whole memorizing procedures in the human brain, including encoding,...

rozdział

Deep hybrid residual learning with statistic priors for single image super-resolution

Risheng Liu, Xiangyu Wang, Xin Fan, Haojie Li, więcej

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1111 - 1116

2017 IEEE International Conference on Multimedia and Expo (ICME)

This paper considers single image super-resolution (SISR), which is an important low-level vision task and has various applications in multimedia society. Recently, deep neural networks have archived good performance on this field. But most of existing deep models are based on the fully data-dependent network architecture, thus missing majority of domain-knowledge of the super-resolution task. To...

rozdział

Non-negative dictionary learning with pairwise partial similarity constraint

Xu Zhou, Pak Lun Kevin Ding, Baoxin Li

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1410 - 1415

2017 IEEE International Conference on Multimedia and Expo (ICME)

Discriminative dictionary learning has been widely used in many applications such as face retrieval / recognition and image classification, where the labels of the training data are utilized to improve the discriminative power of the learned dictionary. This paper deals with a new problem of learning a dictionary for associating pairs of images in applications such as face image retrieval. Compared...

rozdział

Balanced Two-Stage Residual Networks for Image Super-Resolution

Yuchen Fan, Honghui Shi, Jiahui Yu, Ding Liu, więcej

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1157 - 1164

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

In this paper, balanced two-stage residual networks (BTSRN) are proposed for single image super-resolution. The deep residual design with constrained depth achieves the optimal balance between the accuracy and the speed for super-resolving images. The experiments show that the balanced two-stage structure, together with our lightweight two-layer PConv residual block design, achieves very promising...

rozdział

Towards a continuous speech corpus for banking domain automatic speech recognition

George Suciu, Stefan-Adrian Toma, Romulus Cheveresan

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

This paper presents the work done towards developing a speech corpus for Romanian, for automatic speech recognition for the banking domain. This work is done in the context of the Speech2Process project, which aims at creating a system which allows interaction between customers and agents in the contact center much easier. The application to use the banking corpus will provide automatic response to...

rozdział

MaRePhoR — An open access machine-readable phonetic dictionary for Romanian

Stefan-Adrian Toma, Adriana Stan, Mihai-Lica Pura, Traian Barsan

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) > 1 - 6

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)

This paper introduces a novel open access resource, the machine-readable phonetic dictionary for Romanian — MaRePhoR. It contains over 70,000 word entries, and their manually performed phonetic transcription. The paper describes the dictionary format and statistics, as well as an initial use of the phonetic transcription entries by building a grapheme to phoneme converter based on decision trees....

rozdział

Gait-Watch: A Context-Aware Authentication System for Smart Watch Based on Gait Recognition

Weitao Xu, Yiran Shen, Yongtuo Zhang, Neil Bergmann, więcej

2017 IEEE/ACM Second International Conference on Internet-of-Things Design and Implementation (IoTDI) > 59 - 70

2017 IEEE/ACM Second International Conference on Internet-of-Things Design and Implementation (IoTDI)

With recent advances in mobile computing and sensing technology, smart wearable devices have pervaded our everyday lives. The security of these wearable devices is becoming a hot research topic because they store various private information. Existing approaches either only rely on a secret PIN number or require an explicit user authentication process. In this paper, we present Gait-watch, a context-aware...

rozdział

Online Bayesian Learning for Remote-Sensing Imagery Compression

Zizhuo Zhang, Shaoyang Li, Xiaoming Tao, Linhao Dong, więcej

2017 IEEE 85th Vehicular Technology Conference (VTC Spring) > 1 - 5

2017 IEEE 85th Vehicular Technology Conference: VTC2017-Spring

This work investigates a statistical technique for high performance remote-sensing imagery compression. By exploiting existing remote-sensing data sets, useful structural and texture prior information can be learned. The main methodologies are Bayesian dictionary learning and stochastic approximation. A Bayesian network simulating the generation mechanism of remote- sensing images is modelled. The...

rozdział

Real-time hand posture and gesture-based touchless automotive user interface using deep learning

V. John, M. Umetsu, A. Boyali, S. Mita, więcej

2017 IEEE Intelligent Vehicles Symposium (IV) > 869 - 874

2017 IEEE Intelligent Vehicles Symposium (IV)

In this study, a vision based in-car entertainment user interface is presented. The user interface is designed using a hand posture and gesture recognition algorithm in deep learning framework. The hand posture recognition algorithm is formulated using the convolutional neural network to perform the fundamental tasks in the user interface. The hand gesture recognition algorithm is formulated using...

rozdział

Joint Dictionary Learning for Person Re-identification

Yunlu Xu, Jie Guo, Zheng Huang

2017 IEEE Second International Conference on Data Science in Cyberspace (DSC) > 505 - 510

2017 IEEE Second International Conference on Data Science in Cyberspace (DSC)

Person re-identification is known as matching an individual captured in one or more cameras using a gallery of provided candidates from a different camera view. It is a hard task owing to variations in illumination, viewpoints, poses and small number of annotated training individuals. For obtaining the proper distance metrics, we propose a novel approach based on dictionary learning. Our method decomposes...

rozdział

Vehicle sparse recognition via class dictionary learning

Ji-xin Liu, Ning Sun, Guang Han, Haigen Yang

2017 2nd International Conference on Image, Vision and Computing (ICIVC) > 185 - 188

2017 2nd International Conference on Image, Vision and Computing (ICIVC)

As the main body of modern traffic, transport vehicle is the focus of intelligent transportation systems. For three typical vehicles (including the automobile, motorcycle and bicycle), this paper proposes a new transport vehicle recognition system via class dictionary learning. For solving problems in the traditional transport vehicle recognition under sparse recognition framework, our method use...

rozdział

Single image super resolution based on feature enhancement

Shiyao Suo, Xiaohai He, Honggang Chen, Shuhua Xiong, więcej

2017 2nd International Conference on Image, Vision and Computing (ICIVC) > 473 - 477

2017 2nd International Conference on Image, Vision and Computing (ICIVC)

In most of the existing regression-based methods, mapping matrices are directly learnt from features which are extracted from the interpolation results of low-resolution (LR) images. Nevertheless, this kind of features usually suffer from many artifacts which may produce bad effects on image super-resolution (SR) reconstruction. In this paper, we propose an effective single image super-resolution...

Poprzednia

Następna

Opcje filtrowania

Słowa kluczowe:
TRAINING
DICTIONARIES

Data publikacji

Ustaw własny zakres dat

Dostępność treści

Dostępna (1,133)
Brak (3)

Słowa kluczowe

FEATURE EXTRACTION (272)
IMAGE RECONSTRUCTION (180)
SPARSE REPRESENTATION (179)
ENCODING (167)
DICTIONARY LEARNING (151)
FACE (121)
SUPPORT VECTOR MACHINES (113)
VECTORS (111)
ACCURACY (110)
DATABASES (103)
FACE RECOGNITION (99)
IMAGE RESOLUTION (96)
TESTING (89)
SPARSE MATRICES (86)
VISUALIZATION (80)
NATURAL LANGUAGE PROCESSING (78)
HIDDEN MARKOV MODELS (77)
OPTIMIZATION (76)
TRAINING DATA (75)
SPEECH (71)
MACHINE LEARNING (69)
DATA MINING (68)
MATCHING PURSUIT ALGORITHMS (68)
ALGORITHM DESIGN AND ANALYSIS (66)
CLASSIFICATION ALGORITHMS (65)
IMAGE CODING (61)
KERNEL (61)
SPARSE CODING (57)
ROBUSTNESS (49)
SEMANTICS (48)
IMAGE CLASSIFICATION (47)
LEARNING (ARTIFICIAL INTELLIGENCE) (47)
CLASSIFICATION (46)
HISTOGRAMS (46)
COMPUTATIONAL MODELING (45)
IMAGE SEGMENTATION (42)
SPEECH RECOGNITION (40)
DATA MODELS (38)
CONTEXT (37)
SUPER-RESOLUTION (36)
CLUSTERING ALGORITHMS (34)
IMAGE REPRESENTATION (33)
SIGNAL RESOLUTION (33)
TEXT ANALYSIS (33)
NEURAL NETWORKS (30)
SIGNAL PROCESSING ALGORITHMS (30)
COMPRESSED SENSING (29)
NOISE MEASUREMENT (29)
PRINCIPAL COMPONENT ANALYSIS (29)
SHAPE (28)
HYPERSPECTRAL IMAGING (27)
IMAGE RECOGNITION (27)
ADAPTATION MODELS (26)
LEARNING SYSTEMS (26)
MATHEMATICAL MODEL (26)
NOISE (26)
ACOUSTICS (24)
LIGHTING (24)
MANIFOLDS (24)
TAGGING (24)
TRANSFORMS (24)
CORRELATION (23)
OBJECT RECOGNITION (23)
INTERPOLATION (22)
JOINTS (22)
K-SVD (22)
LINEAR PROGRAMMING (22)
NOISE REDUCTION (22)
SPARSE REPRESENTATIONS (22)
COMPLEXITY THEORY (21)
ESTIMATION (21)
SENTIMENT ANALYSIS (21)
APPROXIMATION ALGORITHMS (20)
COMPUTER VISION (20)
DETECTORS (20)
EDUCATIONAL INSTITUTIONS (20)
IMAGE COLOR ANALYSIS (20)
IMAGE EDGE DETECTION (20)
PROBABILITY (20)
CHARACTER RECOGNITION (19)
COMPUTATIONAL LINGUISTICS (19)
MEASUREMENT (19)
PIXEL (19)
BAYES METHODS (18)
CONFERENCES (18)
APPROXIMATION METHODS (17)
ARTIFICIAL NEURAL NETWORKS (17)
CAMERAS (17)
PREDICTION ALGORITHMS (17)
TEXT CATEGORIZATION (17)
COLLABORATION (16)
ENTROPY (16)
EQUATIONS (16)
INDEXES (16)
INTERNET (16)
PATTERN RECOGNITION (16)
PSNR (16)
SUPPORT VECTOR MACHINE CLASSIFICATION (16)
więcej

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Dostępność treści

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu