Search results

chapter

Atrous Faster R-CNN for Small Scale Object Detection

Tongfan Guan, Hao Zhu

2017 2nd International Conference on Multimedia and Image Processing (ICMIP) > 16 - 21

2017 2nd International Conference on Multimedia and Image Processing (ICMIP)

Deep Convolutional Neural Networks based object detection has made significant progress recent years. However, detecting small scale objects is still a challenging task. This paper addresses the problem and proposes a unified deep neural network building upon the prominent Faster R-CNN framework. This paper has two main contributions. Firstly, an Atrous Region Proposal Network (ARPN) is proposed to...

chapter

Low-rank and sparse soft targets to learn better DNN acoustic models

Pranay Dighe, Afsaneh Asaei, Herve Bourlard

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5265 - 5269

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Conventional deep neural networks (DNN) for speech acoustic modeling rely on Gaussian mixture models (GMM) and hidden Markov model (HMM) to obtain binary class labels as the targets for DNN training. Subword classes in speech recognition systems correspond to context-dependent tied states or senones. The present work addresses some limitations of GMM-HMM senone alignments for DNN training. We hypothesize...

chapter

Structure-aware classification using supervised dictionary learning

Yael Yankelevsky, Michael Elad

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4421 - 4425

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we propose a supervised dictionary learning algorithm that aims to preserve the local geometry in both dimensions of the data. A graph-based regularization explicitly takes into account the local manifold structure of the observations. A second graph regularization gives similar treatment to the feature domain and helps in learning a more robust dictionary. Both graphs can be constructed...

chapter

Active learning for low-resource speech recognition: Impact of selection size and language modeling data

Ali Raza Syed, Andrew Rosenberg, Michael Mandel

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5315 - 5319

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Active learning aims to reduce the time and cost of developing speech recognition systems by selecting for transcription highly informative subsets from large pools of audio data. Previous evaluations at OpenKWS and IARPA BABEL have investigated data selection for low-resource languages in very constrained scenarios with 2-hour data selections given a 1-hour seed set. We expand on this to investigate...

chapter

Learning discriminative features from electroencephalography recordings by encoding similarity constraints

Sebastian Stober

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6175 - 6179

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper introduces a pre-training technique for learning discriminative features from electroencephalography (EEG) recordings using deep neural networks. EEG data are generally only available in small quantities, they are high-dimensional with a poor signal-to-noise ratio, and there is considerable variability between individual subjects and recording sessions. Similarity-constraint encoders as...

chapter

Fast HEVC intra coding algorithm based on machine learning and Laplacian Transparent Composite Model

Yi Shan, En-hui Yang

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2642 - 2646

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Compared with H.264, High Efficient Video Coding (HEVC) improves the coding efficiency by 50% at the price of significant increase in encoding time, due to Rate Distortion Optimization (RDO) on large variations of block sizes and prediction modes. In this paper, a fast intra coding algorithm is proposed to alleviate the high computational complexity of HEVC intra-frame coding. The proposed algorithm...

chapter

Face recognition in real-world images

Xavier Fontaine, Radhakrishna Achanta, Sabine Susstrunk

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1482 - 1486

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Face recognition systems are designed to handle well-aligned images captured under controlled situations. However real-world images present varying orientations, expressions, and illumination conditions. Traditional face recognition algorithms perform poorly on such images. In this paper we present a method for face recognition adapted to real-world conditions that can be trained using very few training...

chapter

A fast and ultra low power time-based spiking neuromorphic architecture for embedded applications

Tao Liu, Wujie Wen

2017 18th International Symposium on Quality Electronic Design (ISQED) > 19 - 22

2017 18th International Symposium on Quality Electronic Design (ISQED)

Time-based Spiking Neural Network (SNN) has recently received increased attentions in neuromorphic computing system designs due to more bio-plausibility and better energy-efficiency. However, unleashing its potentials in realistic cognitive applications is facing significant challenges such as inefficient information representations and impractical learnings. In this work, we aim for exploring a practical...

chapter

Sparse representation classification based language recognition using elastic net

Om Prakash Singh, Rohit Sinha

2017 4th International Conference on Signal Processing and Integrated Networks (SPIN) > 380 - 384

2017 4th International Conference on Signal Processing and Integrated Networks (SPIN)

In our earlier work, we have explored the sparse representation classification (SRC) for language recognition (LR) task. In those works, the orthogonal matching pursuit (OMP) algorithm was used for sparse coding. In place of l₀-norm minimization in the OMP algorithm, one could also use l_l-norm minimization based sparse coding such as the least absolute shrinkage and selection operator (LASSO). Though...

chapter

Language recognition via sparse coding over learned dictionary

Om Prakash Singh, Rohit Sinha

2017 4th International Conference on Signal Processing and Integrated Networks (SPIN) > 494 - 497

2017 4th International Conference on Signal Processing and Integrated Networks (SPIN)

In this work, we explore the use of sparse features derived using a learned dictionary for language recognition (LR). These sparse features are referred to as s-vector and are derived by sparse coding of the commonly used low-dimensional i-vector based representation of speech utterances over the learned dictionary. The orthogonal matching pursuit (OMP), least absolute shrinkage and selection operator...

chapter

Spreadsheet testing in practice

Sohon Roy, Felienne Hermans, Arie van Deursen

2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering (SANER) > 338 - 348

2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering (SANER)

Despite being popular end-user tools, spreadsheets suffer from the vulnerability of error-proneness. In software engineering, testing has been proposed as a way to address errors. It is important therefore to know whether spreadsheet users also test, or how do they test and to what extent, especially since most spreadsheet users do not have the training, or experience, of software engineering principles...

chapter

The importance of program Design Patterns training

Viggo Holmstedt, Shegaw A. Mengiste

2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering (SANER) > 559 - 560

2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering (SANER)

Design Patterns for Object Oriented Systems constitute an important tool for improving software quality by providing reusable design. Many academic institutions believe in their relevance, and do courses accordingly. This paper explores practitioners' perception of the relevance their patterns knowledge has for their work. The paper also assesses how managers' perception of pattern knowledge conforms...

chapter

SCNN: An accelerator for compressed-sparse convolutional neural networks

Angshuman Parashar, Minsoo Rhu, Anurag Mukkara, Antonio Puglielli, more

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) > 27 - 40

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA)

Convolutional Neural Networks (CNNs) have emerged as a fundamental technology for machine learning. High performance and extreme energy efficiency are critical for deployments of CNNs, especially in mobile platforms such as autonomous vehicles, cameras, and electronic personal assistants. This paper introduces the Sparse CNN (SCNN) accelerator architecture, which improves performance and energy efficiency...

chapter

Code-division multiplexed resistive pulse sensor networks for spatio-temporal detection of particles in microfluidic devices

Ningquan Wang, Ruxiu Liu, Roozbeh Khodambashi, Norh Asmare, more

2017 IEEE 30th International Conference on Micro Electro Mechanical Systems (MEMS) > 362 - 365

2017 IEEE 30th International Conference on Micro Electro Mechanical Systems (MEMS)

Spatial separation of suspended particles based on contrast in their physical or chemical properties forms the basis of various biological assays performed on lab-on-a-chip devices. To electronically acquire this information, we have recently introduced a microfluidic sensing platform, called Microfluidic CODES, which combines the resistive pulse sensing with the code division multiple access in multiplexing...

chapter

Comparative analysis of the classical and nonclassical artificial neural networks

Daria Lisitsa, Anton A. Zhilenkov

2017 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus) > 922 - 925

2017 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus)

A lot of artifiicial neural networks were proposed by scientists over the last time. Each of them can cope with the tasks of limited difficulty level, determined by their properties and capabilities. The aim of this paper is to outline difference of them and to define their positive and negative sites in different tasks of identification and control.

chapter

Gated factored 3-way RBM for image transformation

Lei Xia, Amit Yadav, Ning Duobiao

2016 13th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP) > 150 - 153

2016 13th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)

The Factored 3-way Restricted Boltzmann Machine has encoded the image transformation successfully. But when utilize the code to unknown image, the result was much affected by the feature of training samples. Based on the model, we separated the transformation feature out of the hidden representation and designed a new probabilistic model with gate for learning distributed representations of image...

chapter

Sparse coding with sparse dictionaries for credit risk classification

Xueyan Mei

2016 International Conference on Progress in Informatics and Computing (PIC) > 23 - 26

2016 International Conference on Progress in Informatics and Computing (PIC)

Credit risk analysis seeks to determine whether a customer is likely to default on the financial obligation, which is a very important problem in finance. In this paper, we will present a machine learning framework to deal with this problem by formulating it as a binary classification problem. The framework consists of two parts: dictionary learning and classifier training. Firstly, we introduce a...

chapter

Mutually incoherent pose bases for Action recognition

Yinzhong Qian, Wenbin Chen, I-fan Shen

2016 23rd International Conference on Pattern Recognition (ICPR) > 823 - 828

2016 23rd International Conference on Pattern Recognition (ICPR)

We propose mutually incoherent pose bases for action recognition in static image, each of which implicitly represents co-occurrence of poselets. First of all, action specific poselets are trained. To suppress the ambiguity of detection, we cluster poselet activations by the overlap of predicted torso bound of each poselet. Then pose feature of an action person can be extracted which is a vector composed...

chapter

Deep Sparse-coded Network (DSN)

Youngjune Gwon, Miriam Cha, H. T. Kung

2016 23rd International Conference on Pattern Recognition (ICPR) > 2610 - 2615

2016 23rd International Conference on Pattern Recognition (ICPR)

We present Deep Sparse-coded Network (DSN), a deep architecture based on multilayer sparse coding. It has been considered difficult to learn a useful feature hierarchy by stacking sparse coding layers in a straightforward manner. The primary reason is the modeling assumption for sparse coding that takes in a dense input and yields a sparse output vector. Applying a sparse coding layer on the output...

chapter

BeamECOC: A local search for the optimization of the ECOC matrix

Cemre Zor, Berrin Yanikoglu, Erinc Merdivan, Terry Windeatt, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 198 - 203

2016 23rd International Conference on Pattern Recognition (ICPR)

Error Correcting Output Coding (ECOC) is a multi-class classification technique in which multiple binary classifiers are trained according to a preset code matrix such that each one learns a separate dichotomy of the classes. While ECOC is one of the best solutions for multi-class problems, one issue which makes it suboptimal is that the training of the base classifiers is done independently of the...

INFONA - science communication portal

Search results

Atrous Faster R-CNN for Small Scale Object Detection

Low-rank and sparse soft targets to learn better DNN acoustic models

Structure-aware classification using supervised dictionary learning

Active learning for low-resource speech recognition: Impact of selection size and language modeling data

Learning discriminative features from electroencephalography recordings by encoding similarity constraints

Fast HEVC intra coding algorithm based on machine learning and Laplacian Transparent Composite Model

Face recognition in real-world images

A fast and ultra low power time-based spiking neuromorphic architecture for embedded applications

Sparse representation classification based language recognition using elastic net

Language recognition via sparse coding over learned dictionary

Spreadsheet testing in practice

The importance of program Design Patterns training

SCNN: An accelerator for compressed-sparse convolutional neural networks

Code-division multiplexed resistive pulse sensor networks for spatio-temporal detection of particles in microfluidic devices

Comparative analysis of the classical and nonclassical artificial neural networks

Gated factored 3-way RBM for image transformation

Sparse coding with sparse dictionaries for credit risk classification

Mutually incoherent pose bases for Action recognition

Deep Sparse-coded Network (DSN)

BeamECOC: A local search for the optimization of the ECOC matrix

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options