Search results

Items from 1 to 20 out of 34 results

chapter

Human action recognition based on self-learned key frames and features extraction

Qi Fu, Lina Liu, Shiwei Ma

2017 Chinese Automation Congress (CAC) > 3498 - 3502

Human action recognition is one of the most active research areas of computer vision. With the rapid development of deep learning, using neural networks for action recognition has become a popular topic. This paper proposes a self-learned action recognition method based on neural networks. The proposed method trains dictionaries with a sparse autoencoder (SAE) and extracts the key frames with sparse...

chapter

Leveraging deep neural networks with nonnegative representations for improved environmental sound classification

Victor Bisot, Romain Serizel, Slim Essid, Gael Richard

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 6

This paper introduces the use of representations based on nonnegative matrix factorization (NMF) to train deep neural networks with applications to environmental sound classification. Deep learning systems for sound classification usually rely on the network to learn meaningful representations from spectrograms or hand-crafted features. Instead, we introduce an NMF-based feature learning stage before...

chapter

LCNN: Lookup-Based Convolutional Neural Network

Hessam Bagherinezhad, Mohammad Rastegari, Ali Farhadi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 860 - 869

Porting state of the art deep learning algorithms to resource constrained compute platforms (e.g. VR, AR, wearables) is extremely challenging. We propose a fast, compact, and accurate model for convolutional neural networks that enables efficient learning and inference. We introduce LCNN, a lookup-based convolutional neural network that encodes convolutions by few lookups to a dictionary that is trained...

chapter

Broad learning system: Feature extraction based on K-means clustering algorithm

Zhulin Liu, Jin Zhou, C. L. Philip Chen

2017 4th International Conference on Information, Cybernetics and Computational Social Systems (ICCSS) > 683 - 687

The recently proposed Broad Learning System [1] demonstrates efficient and effective learning capability. This model has also been shown to be suitable for incremental learning algorithms by taking advantage of random vector flat neural networks. In this paper, a modified BLS structure based on K-means feature extraction is developed. Compared with the original broad learning system, acceptable performance...

chapter

A novel convolutional neural network architecture for image super-resolution based on channels combination

Cun Liu, Yuanxiang Li, Jianhua Luo, Yongjun Zhou

2017 20th International Conference on Information Fusion (Fusion) > 1 - 8

Several models based on deep neural networks have been applied to single image super-resolution and obtained great improvements in terms of both reconstruction accuracy and computational performance. All these methods focus either on performing the super-resolution (SR) reconstruction operation in the high resolution (HR) space after upscaling with a single filter, usually bicubic interpolation, or optimizing...

chapter

Balanced Two-Stage Residual Networks for Image Super-Resolution

Yuchen Fan, Honghui Shi, Jiahui Yu, Ding Liu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1157 - 1164

In this paper, balanced two-stage residual networks (BTSRN) are proposed for single image super-resolution. The deep residual design with constrained depth achieves the optimal balance between the accuracy and the speed for super-resolving images. The experiments show that the balanced two-stage structure, together with our lightweight two-layer PConv residual block design, achieves very promising...

article

Hybrid CNN and Dictionary-Based Models for Scene Recognition and Domain Adaptation

Guo-Sen Xie, Xu-Yao Zhang, Shuicheng Yan, Cheng-Lin Liu

IEEE Transactions on Circuits and Systems for Video Technology > 2017 > 27 > 6 > 1263 - 1274

Convolutional neural network (CNN) has achieved the state-of-the-art performance in many different visual tasks. Learned from a large-scale training data set, CNN features are much more discriminative and accurate than the handcrafted features. Moreover, CNN features are also transferable among different domains. On the other hand, traditional dictionary-based features (such as BoW and spatial pyramid...

chapter

Class-wise deep dictionary learning

Vanika Singhal, Prerna Khurana, Angshul Majumdar

2017 International Joint Conference on Neural Networks (IJCNN) > 1125 - 1132

In this work we propose a new framework for combined feature extraction and classification. The base idea stems from sparse representation based classification, wherein the training samples from each class are assumed to form a basis for representing the same. Later studies learned a basis for each class using dictionary learning; these were shallow techniques where only one level of dictionary...

chapter

Low-dose CT denoising with convolutional neural network

Hu Chen, Yi Zhang, Weihua Zhang, Peixi Liao, et al.

2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017) > 143 - 146

To reduce the potential radiation risk, low-dose CT has attracted much attention. However, simply lowering the radiation dose will lead to significant deterioration of the image quality. In this paper, we propose a noise reduction method for low-dose CT via deep neural network without accessing original projection data. A deep convolutional neural network is trained to transform low-dose CT images...

chapter

Supervised monaural source separation based on autoencoders

Keiichi Osako, Yuki Mitsufuji, Rita Singh, Bhiksha Raj

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 11 - 15

In this paper, we propose a new supervised monaural source separation based on autoencoders. We employ the autoencoder for the dictionary training such that the nonlinear network can encode the target source with high expressiveness. The dictionary is trained by each target source without the mixture signal, which makes the system independent from the context where the dictionaries will be used. In...

chapter

Signal representations in modern signal processing

Rebecca Willett

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6453 - 6457

The last decade of John Cozzens's tenure at the NSF witnessed the advent of theory and methods at the heart of modern data science. These advances include (but are not limited to) compressed sensing, sparse coding, inference methods robust to outliers and missing data, and convex optimization tools that facilitate a host of novel inference methods. This paper describes how these methods evolved from...

chapter

Epithelium-stroma classification in histopathological images via convolutional neural networks and self-taught learning

Yue Huang, Han Zheng, Chi Liu, Gustavo Rohde, et al.

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1073 - 1077

Epithelium-stroma classification is commonly considered an important preprocessing step for morphological quantitative analysis in image-based histological research on oncologic diseases. However, large-scale accurate ground-truth labeling is expensive in histopathological image analysis, so classification performance remains limited by insufficient labeled training samples....

chapter

Deep Sparse-coded Network (DSN)

Youngjune Gwon, Miriam Cha, H. T. Kung

2016 23rd International Conference on Pattern Recognition (ICPR) > 2610 - 2615

We present Deep Sparse-coded Network (DSN), a deep architecture based on multilayer sparse coding. It has been considered difficult to learn a useful feature hierarchy by stacking sparse coding layers in a straightforward manner. The primary reason is the modeling assumption for sparse coding that takes in a dense input and yields a sparse output vector. Applying a sparse coding layer on the output...

chapter

Application of pronunciation knowledge on phoneme recognition by LSTM neural network

Bo Zhang, Yuqin Gan, Yan Song, Benlai Tang

2016 23rd International Conference on Pattern Recognition (ICPR) > 2906 - 2911

When applied to phoneme recognition, the Connectionist Temporal Classification (CTC) objective function allows a neural network to be trained with the phoneme level transcriptions of training utterances. A limitation of the CTC is that it cannot be applied directly to network training with large speech corpora, since those corpora usually only have word level transcriptions. This work extends the...

chapter

Supervised dictionary learning in BoF framework for Scene Character recognition

Maroua Tounsi, Ikram Moalla, Adel M. Alimi

2016 23rd International Conference on Pattern Recognition (ICPR) > 3987 - 3992

In recent years, growing attention has been paid to recognizing text in natural scene images. Scene Character recognition (SCR) is an important step in automating the process of reading text in natural scenes.

chapter

Few-View CT reconstruction method based on deep learning

Ji Zhao, Zhiqiang Chen, Li Zhang, Xin Jin

2016 IEEE Nuclear Science Symposium, Medical Imaging Conference and Room-Temperature Semiconductor Detector Workshop (NSS/MIC/RTSD) > 1 - 4

To reduce the patient's dose, few-view CT reconstruction promises to be a good approach. The key to better reconstruction is the sparse view artifacts. In recent years, DL (deep learning) has attracted a lot of attention because of its outstanding performance in image processing. We propose a deep learning method for few-view CT reconstruction. Our method directly learns an end-to-end mapping between the full-view/few-view...

chapter

Primi speech recognition based on deep neural network

Wenjun Hu, Meijun Fu, Wenlin Pan

2016 IEEE 8th International Conference on Intelligent Systems (IS) > 667 - 671

In order to improve the performance of the Primi speech recognition system, a novel method based on a deep neural network is proposed. The deep neural network has two distinct characteristics: high capacity and a highly complex network structure. On the Kaldi platform, a neural network containing four hidden layers is used for Primi speech recognition. The...

chapter

Dynamic Neural Networks for Text Classification

Lea Vega, Andres Mendez-Vazquez

2016 International Conference on Computational Intelligence and Applications (ICCIA) > 6 - 11

This research proposes an approach for text classification that uses a simple neural network called the Dynamic Text Classifier Neural Network (DTCNN). The network takes as input word vectors of variable dimension without information loss, called Dynamic Token Vectors (DTV). The proposed neural network is designed for the classification of large and short texts into categories. The learning...

chapter

Coupled deep auto-encoder with image edge information for image super-resolution

Yinggan Tang, Chunning Bu, Liying Zhao

2016 IEEE International Conference on Information and Automation (ICIA) > 1708 - 1713

Image super-resolution aims to recover a fine-resolution image from one or more low-resolution image(s). In this paper, we propose a novel image super-resolution approach based on the recent development of the coupled deep auto-encoder. In the training step, the vector of the local low resolution (LR) and high resolution (HR) image patches and the corresponding edge information are extracted to be the...

chapter

Classification from generation: Recognizing deep grammatical information during reading from rapid event-related fMRI

Tali Bitan, Alex Frid, Hananel Hazan, Larry M. Manevitz, et al.

2016 International Joint Conference on Neural Networks (IJCNN) > 4637 - 4642

A novel fMRI classification method designed for rapid event-related fMRI experiments is described and applied to the classification of reading aloud of isolated words in Hebrew. Three comparisons of different grammatical complexity were performed: (i) words versus asterisks, (ii) “with diacritics versus without diacritics”, and (iii) “with root versus no root”. We discuss the most difficult task and,...

Keywords:
DICTIONARIES
NEURAL NETWORKS

Publication type

book (30)
article (4)

Keywords

FEATURE EXTRACTION (8)
MACHINE LEARNING (8)
DEEP LEARNING (6)
IMAGE RECONSTRUCTION (6)
IMAGE RESOLUTION (6)
CONVOLUTIONAL NEURAL NETWORKS (4)
DICTIONARY LEARNING (4)
SUPPORT VECTOR MACHINES (4)
COMPUTATIONAL MODELING (3)
COMPUTER ARCHITECTURE (3)
CONVOLUTION (3)
DEEP NEURAL NETWORK (3)
DEEP NEURAL NETWORKS (3)
ENCODING (3)
HIDDEN MARKOV MODELS (3)
NOISE MEASUREMENT (3)
SPEECH (3)
SUPER-RESOLUTION (3)
ACCURACY (2)
BACKPROPAGATION (2)
CHARACTER RECOGNITION (2)
CLASSIFICATION (2)
COMPUTED TOMOGRAPHY (2)
CONTEXT (2)
CONVOLUTIONAL CODES (2)
DECODING (2)
DICTIONARY (2)
IMAGE RECOGNITION (2)
IMAGE SUPER-RESOLUTION (2)
INTERPOLATION (2)
KERNEL (2)
MARKET RESEARCH (2)
MATCHING PURSUIT ALGORITHMS (2)
NEURAL NETWORK (2)
NEURONS (2)
SPARSE CODING (2)
TESTING (2)
TEXT CATEGORIZATION (2)
TEXT RECOGNITION (2)
TRAINING DATA (2)
VISUALIZATION (2)
ACOUSTICS (1)
ADABOOST (1)
ALGORITHM DESIGN AND ANALYSIS (1)
AUTO-ENCODER (1)
AUTOENCODER (1)
AUTOMATA (1)
BENCHMARK TESTING (1)
BIOLOGICAL NEURAL NETWORKS (1)
BROAD LEARNING SYSTEM (1)
CASCADED DETECTOR (1)
CHANNELS COMBINATION (1)
CHINESE NAMED ENTITY RECOGNITION (1)
CLASSIFICATION ALGORITHMS (1)
CLUSTERING ALGORITHMS (1)
CNN (1)
COGNITIVE PROCESSING (1)
COMPONENT (1)
CONCATENATIVE SYNTHESIS (1)
CONDITIONAL RANDOM FIELDS (1)
CONFERENCES (1)
CONNECTIONIST TEMPORAL CLASSIFICATION (1)
CONVOLUTIONAL NEURAL NETWORK (1)
CONVOLUTIONAL NEURAL NETWORKS (CNNS) (1)
CORPUS-BASED (1)
COST FUNCTION (1)
COVARIANCE MATRIX (1)
CYBERNETICS (1)
DATABASES (1)
DEEP BELIEF NETWORK(DBN) (1)
DEEP CONVOLUTIONAL NEURAL NETWORKS (1)
DEEP NETWORK (1)
DEGRADATION (1)
DETECTORS (1)
DIGITAL TV (1)
DNN (1)
DOMAIN ADAPTATION (DA) (1)
DROPOUT (1)
EDGE INFORMATION (1)
EDUCATIONAL INSTITUTIONS (1)
EEG (1)
END-TO-END SCENE TEXT RECOGNITION (1)
EPITHELIUM-STROMA CLASSIFICATION (1)
FEATURE REPRESENTATION (1)
FEW-VIEW (1)
FILTERING (1)
FISHER VECTOR (1)
FUNCTIONAL MAGNETIC RESONANCE IMAGING (FMRI) (1)
HISTOPATHOLOGICAL IMAGE ANALYSIS (1)
HUMAN ACTION RECOGNITION (1)
HYPERSPECTRAL (1)
HYPERSPECTRAL IMAGING (1)
IMAGE CODING (1)
IMAGE COLOR ANALYSIS (1)
IMAGE EDGE DETECTION (1)
IMPACTS ON EMOTION (1)
INCREMENTAL LEARNING (1)

INFONA - science communication portal
