Deep multi-layer neural networks are generally trained using variants of gradient-descent-based algorithms. However, these algorithms usually suffer from a series of shortcomings, such as low training efficiency, local minima, difficult control-parameter tuning, and vanishing or exploding gradients. Moreover, for a specific application, how to design the structure of the network, that is, how...
Stochastic gradient algorithms are the workhorse of large-scale optimization problems and have led to important successes in the recent advancement of deep learning. The convergence of SGD depends on a careful choice of learning rate and on the amount of noise in the stochastic estimates of the gradients. In this paper, we propose an adaptive learning rate algorithm, which utilizes stochastic...
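Since the abstract is truncated before the proposed rule, the sketch below shows only the generic idea of an adaptive learning rate in SGD, using an AdaGrad-style accumulator; the interface and constants are assumptions, not the paper's method:

```python
import numpy as np

def adaptive_sgd(grad_fn, w0, lr=0.1, eps=1e-8, steps=1000):
    """Generic adaptive-step SGD sketch (AdaGrad-style accumulator).
    grad_fn(w) returns a stochastic estimate of the gradient at w."""
    w = np.array(w0, dtype=float)
    g2_sum = np.zeros_like(w)      # running sum of squared gradients
    for _ in range(steps):
        g = grad_fn(w)
        g2_sum += g * g
        # per-coordinate step size shrinks as observed gradient energy grows
        w -= lr * g / (np.sqrt(g2_sum) + eps)
    return w
```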
Despite the importance of distributed learning, few fully distributed support vector machines exist. In this paper, not only do we provide a fully distributed nonlinear SVM, but we also propose the first distributed constrained-form SVM. In the fully distributed setting, a dataset is distributed among networked agents that cannot divulge their data, let alone centralize it, and can only communicate with...
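As a rough illustration of this setting (private local data, communication restricted to neighbors), here is a minimal consensus-plus-subgradient sketch for a linear SVM; it is not the paper's constrained-form algorithm, and all names and parameters are assumptions:

```python
import numpy as np

def distributed_linear_svm(local_X, local_y, mixing, lam=0.01, lr=0.05, rounds=200):
    """Each agent holds private data (local_X[i], local_y[i]), labels in {-1,+1}.
    'mixing' is a row-stochastic matrix encoding the communication graph.
    Agents exchange only their weight vectors, never their data."""
    n_agents = len(local_X)
    dim = local_X[0].shape[1]
    W = np.zeros((n_agents, dim))
    for _ in range(rounds):
        W = mixing @ W                         # consensus step with neighbors
        for i in range(n_agents):
            X, y = local_X[i], local_y[i]
            margins = y * (X @ W[i])
            g = lam * W[i]                     # gradient of the l2 regularizer
            viol = margins < 1                 # hinge-loss violators
            if viol.any():
                g -= (y[viol, None] * X[viol]).mean(axis=0)
            W[i] = W[i] - lr * g
    return W.mean(axis=0)                      # agents agree approximately
```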
We consider supervised learning problems over training sets in which both the number of training examples and the dimension of the feature vectors are large. We focus on the case where the loss function defining the quality of the parameter we wish to estimate may be non-convex but includes a convex regularizer. We propose a Doubly Stochastic Successive Convex approximation scheme (DSSC) able...
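A minimal sketch of this problem class (non-convex loss plus convex regularizer), shown with a plain stochastic proximal-gradient step and an l1 regularizer rather than the DSSC scheme itself:

```python
import numpy as np

def prox_sgd_step(w, stoch_grad, lam, lr):
    """One stochastic proximal-gradient step for min_w f(w) + lam*||w||_1:
    a gradient step on the (possibly non-convex) loss f, then the
    closed-form prox of the convex l1 regularizer (soft-thresholding)."""
    z = w - lr * stoch_grad(w)
    return np.sign(z) * np.maximum(np.abs(z) - lr * lam, 0.0)
```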
Short text classification uses a supervised learning process, which needs a huge amount of labeled data for training and therefore consumes substantial human effort. In traditional supervised learning problems, active learning can reduce the number of samples that must be labeled manually. It achieves this goal by selecting the samples that best represent the whole training set. Uncertainty...
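The abstract breaks off at "Uncertainty", but uncertainty-based selection is the most common active-learning criterion; a minimal sketch (assuming a scikit-learn-style classifier exposing predict_proba; names are illustrative) follows:

```python
import numpy as np

def uncertainty_query(model, pool_X, batch_size=10):
    """Select the pool samples the current model is least sure about,
    using the margin between its top two predicted class probabilities."""
    proba = np.sort(model.predict_proba(pool_X), axis=1)
    margin = proba[:, -1] - proba[:, -2]       # small margin = high uncertainty
    return np.argsort(margin)[:batch_size]     # indices to label manually
```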
Many research works have successfully extended algorithms such as evolutionary algorithms, reinforcement learning agents, and neural networks using "opposition-based learning" (OBL). Two types of "opposites" have been defined in the literature, namely type-I and type-II. The former are linear in nature and applicable to the variable space, hence easy to calculate. On the other hand, type-II opposites capture...
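For reference, the type-I opposite of a point x over an interval [a, b] is the standard reflection a + b - x, which is what makes it easy to calculate:

```python
def type1_opposite(x, a, b):
    """Type-I opposite of x over the interval [a, b]."""
    return a + b - x

# e.g. over [0, 10], the opposite of 2 is 8
assert type1_opposite(2, 0, 10) == 8
```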
Data augmentation is the process of generating samples by transforming training data, with the goal of improving the accuracy and robustness of classifiers. In this paper, we propose a new automatic and adaptive algorithm for choosing the transformations of the samples used in data augmentation. Specifically, for each sample, our main idea is to seek a small transformation that yields maximal classification...
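A simplified rendering of that idea: from a fixed pool of small candidate transforms, pick the one that currently maximizes the classifier's loss on the sample (names and the pool are illustrative; the paper's actual search over transformations is truncated):

```python
import numpy as np

def pick_augmentation(loss_fn, x, y, candidate_transforms):
    """Among a pool of small candidate transforms, return the one that
    currently yields the largest classification loss on sample (x, y)."""
    losses = [loss_fn(t(x), y) for t in candidate_transforms]
    return candidate_transforms[int(np.argmax(losses))]
```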
Classifier competence is critically important for classifier ensembles. This study formulates an optimization problem on the neighborhood graph of the data and develops an iterative algorithm to learn the competences of classifiers. The learned competences not only reflect the competitiveness of the classifiers but also vary smoothly over neighboring data. Experimental results on five different...
In this paper we propose a novel algorithm that improves the one-stage dictionary learning (OS-DL) algorithm by imposing an l2-norm constraint on the update of the atoms. Our contribution starts from the OS-DL algorithm and incorporates the well-known proximal point method from convex optimization into it. Experimental results on recovering a known dictionary and sparsely...
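To make the norm-constrained atom update concrete, here is a generic sketch: a gradient step on the Frobenius-norm fit followed by projection onto the unit l2 ball, a proximal-style treatment of the constraint. It is not the OS-DL update itself:

```python
import numpy as np

def constrained_atom_step(D, X, Z, j, step=0.1):
    """Gradient step on atom j (column of D) for the fit ||X - D @ Z||_F^2,
    followed by projection onto the unit l2 ball (constant factors are
    absorbed into the step size)."""
    residual = X - D @ Z
    grad = -residual @ Z[j]                    # gradient w.r.t. column j
    d = D[:, j] - step * grad
    D[:, j] = d / max(1.0, np.linalg.norm(d))  # enforce ||d||_2 <= 1
    return D
```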
An efficient algorithm is presented for calculating the approximate Hessian matrix used by the Levenberg-Marquardt (LM) optimization algorithm when training a single-hidden-layer feedforward network with linear outputs. The algorithm avoids explicit calculation of the Jacobian matrix and computes the gradient vector and approximate Hessian matrix directly. It requires approximately 1/N the floating...
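For context, the quantities involved are the approximate Hessian J^T J and the gradient J^T e of the squared-error objective; the conventional, explicitly Jacobian-forming step that the paper's algorithm avoids looks like this (a reference sketch, not the paper's implicit computation):

```python
import numpy as np

def lm_step(J, e, mu):
    """Standard LM step: solve (J^T J + mu*I) dw = J^T e for the weight
    update dw, where J is the Jacobian of the residual vector e."""
    H = J.T @ J                                # approximate Hessian
    g = J.T @ e                                # gradient of 0.5*||e||^2
    return np.linalg.solve(H + mu * np.eye(H.shape[0]), g)
```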
This paper presents a pruned sparse extreme learning machine (PS-ELM) algorithm, which can generate a compact single-hidden-layer neural network (SLNN) by automatically pruning the number of hidden nodes while keeping high accuracy. In the PS-ELM algorithm, the input connections between the input and hidden layers are basis vectors, which sparsely map the input features into the hidden layer by using gradient...
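For context, a plain (unpruned) ELM trains in one shot: random hidden-layer weights followed by a closed-form least-squares solve for the output weights. A minimal sketch under those standard assumptions, omitting the PS-ELM sparse mapping and pruning steps:

```python
import numpy as np

def elm_fit(X, T, n_hidden, seed=0):
    """Plain ELM: random input weights and biases, sigmoid hidden layer,
    output weights solved in closed form by least squares."""
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((X.shape[1], n_hidden))
    b = rng.standard_normal(n_hidden)
    H = 1.0 / (1.0 + np.exp(-(X @ W + b)))     # hidden-layer activations
    beta, *_ = np.linalg.lstsq(H, T, rcond=None)
    return W, b, beta
```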
Dictionary learning for sparse representations is traditionally approached with sequential atom updates, in which an optimized atom is used immediately for the optimization of the next atoms. We propose instead a Jacobi version, in which groups of atoms are updated independently, in parallel. Extensive numerical evidence for sparse image representation shows that the parallel algorithms, especially...
Several distributed coordinated precoding methods relying on over-the-air (OTA) iterations in time-division duplex (TDD) networks have recently been proposed. Each OTA iteration incurs overhead, which reduces the time available for data transmission. In this work, we therefore propose an algorithm which reaches good sum rate performance within just a few OTA iterations, partially due to...
A new challenge for learning algorithms in cyber-physical network systems is the distributed solution of big-data classification problems, i.e., problems in which both the number of training samples and their dimension are high. Motivated by several problem set-ups in machine learning, in this paper we consider a special class of quadratic optimization problems involving a “large” number of input data,...
Least squares support vector machine (LS-SVM) has been successfully applied in many classification and regression tasks. The main drawback of the LS-SVM algorithm is its lack of sparseness. Combining the primal least squares twin support vector machine (LS-TSVM) and the sparse LS-SVM with L0-norm minimization, a new sparse least squares support vector regression algorithm with L0-norm in primal space (L...
Particle swarm optimisation has been successfully applied as a neural network training algorithm before, often outperforming traditional gradient-based approaches. However, recent studies have shown that particle swarm optimisation does not scale very well, and performs poorly on high-dimensional neural network architectures. This paper hypothesises that hidden layer saturation is a significant factor...
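For reference, the canonical PSO update that such training relies on moves each particle toward its personal best and the swarm's global best; a minimal sketch (the inertia and acceleration constants are standard illustrative values, not the paper's settings):

```python
import numpy as np

def pso_step(pos, vel, pbest, gbest, w=0.7, c1=1.4, c2=1.4, rng=None):
    """One canonical PSO update. pos, vel, pbest have shape
    (n_particles, dim); gbest has shape (dim,)."""
    rng = rng or np.random.default_rng()
    r1, r2 = rng.random(pos.shape), rng.random(pos.shape)
    vel = w * vel + c1 * r1 * (pbest - pos) + c2 * r2 * (gbest - pos)
    return pos + vel, vel
```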
Deep neural networks (DNN) are typically optimized with stochastic gradient descent (SGD) using a fixed learning rate or an adaptive learning rate approach such as ADAGRAD. In this paper, we introduce a new learning rule for neural networks that is based on an auxiliary-function technique and requires no parameter tuning. Instead of minimizing the objective function directly, a quadratic auxiliary function is recursively...
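As a sketch of the auxiliary-function idea: if a quadratic function majorizes the objective at the current iterate, minimizing it has a closed form. This illustrates the mechanism only; the paper's recursive, tuning-free construction is truncated and not reproduced here:

```python
def mm_step(w, grad, L):
    """Minimizing the quadratic auxiliary function
        q(w) = f(w_t) + grad . (w - w_t) + (L/2) * ||w - w_t||^2,
    which upper-bounds f when L is large enough, gives the closed-form
    update w_{t+1} = w_t - grad / L."""
    return w - grad / L
```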
Dual decomposition methods are the current state-of-the-art for training multiclass formulations of Support Vector Machines (SVMs). At every iteration, dual decomposition methods update a small subset of dual variables by solving a restricted optimization problem. In this paper, we propose an exact and efficient method for solving the restricted problem. In our method, the restricted problem is reduced...
A Semi-supervised Segmentation Fusion algorithm is proposed using consensus and distributed learning. The aim of Unsupervised Segmentation Fusion (USF) is to achieve a consensus among the segmentation outputs of different segmentation algorithms by computing an approximate solution to the underlying NP-hard consensus problem at lower computational cost. Semi-supervision is incorporated in USF using...
Iterative learning control (ILC) algorithms are typically used to iteratively refine the feed-forward control input to a system to achieve an optimized performance objective. Because of its ease of implementation and robustness, ILC has found widespread use in a variety of industrial applications. However, a key limitation of ILC is the requirement that learning has to be re-initiated for each new...
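For reference, the simplest first-order ILC law adds a scaled copy of the previous trial's tracking error to the feed-forward input; a minimal sketch (the gain value is illustrative):

```python
import numpy as np

def ilc_update(u, e, gain=0.5):
    """First-order ILC law: u_{k+1}(t) = u_k(t) + gain * e_k(t), where
    e_k is trial k's tracking error over the whole trajectory."""
    return np.asarray(u) + gain * np.asarray(e)
```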