Search results

Items from 1 to 20 out of 206 results

chapter

Reset-free guided policy search: Efficient deep reinforcement learning with stochastic initial states

William Montgomery, Anurag Ajay, Chelsea Finn, Pieter Abbeel, more

2017 IEEE International Conference on Robotics and Automation (ICRA) > 3373 - 3380

2017 IEEE International Conference on Robotics and Automation (ICRA)

Autonomous learning of robotic skills can allow general-purpose robots to learn wide behavioral repertoires without extensive manual engineering. However, robotic skill learning must typically make trade-offs to enable practical real-world learning, such as requiring manually designed policy or value function representations, initialization from human demonstrations, instrumentation of the training...

chapter

Policy optimization of dialogue management in spoken dialogue system for out-of-domain utterances

Yuhong Xu, Peijie Huang, Jiecong Tang, Qiangjia Huang, more

2016 International Conference on Asian Language Processing (IALP) > 10 - 13

2016 International Conference on Asian Language Processing (IALP)

This paper addresses the policy optimization of a dialogue management scheme based on partially observable Markov decision processes (POMDP), which is designed for out-of-domain (OOD) utterances processing in spoken dialogue system. First, POMDP-Based DM Modeling for OOD Utterances is proposed, together with detail of some principal elements. Then, joint state transition exploration and dialogue policy...

chapter

Continuous Action-Space Reinforcement Learning Methods Applied to the Minimum-Time Swing-Up of the Acrobot

Barry D. Nichols

2015 IEEE International Conference on Systems, Man, and Cybernetics > 2084 - 2089

2015 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

Here I apply three reinforcement learning methods to the full, continuous action, swing-up acrobot control benchmark problem. These include two approaches from the literature, CACLA and NM-SARSA, and a novel approach which I refer to as Nelder Mead-SARSA. Nelder Mead-SARSA, like NM-SARSA, directly optimises the state-action value function for action selection, in order to allow continuous action reinforcement...

chapter

Using machines to learn method-specific compilation strategies

R N Sanchez, J N Amaral, D Szafron, M Pirvu, more

International Symposium on Code Generation and Optimization (CGO 2011) > 257 - 266

2011 9th Annual IEEE/ACM International Symposium on Code Generation and Optimization (CGO 2011)

Support Vector Machines (SVMs) are used to discover method-specific compilation strategies in Testarossa, a commercial Just-in-Time (JiT) compiler employed in the IBM® J9 Java™ Virtual Machine. The learning process explores a large number of different compilation strategies to generate the data needed for training models. The trained machine-learned model is integrated with the compiler to predict...

chapter

Boosted metric learning for 3D multi-modal deformable registration

F Michel, M Bronstein, A Bronstein, N Paragios

2011 IEEE International Symposium on Biomedical Imaging: From Nano to Macro > 1209 - 1214

2011 8th IEEE International Symposium on Biomedical Imaging: From Nano to Macro (ISBI 2011)

Defining a suitable metric is one of the biggest challenges in deformable image fusion from different modalities. In this paper, we propose a novel approach for multi-modal metric learning in the deformable registration framework that consists of embedding data from both modalities into a common metric space whose metric is used to parametrize the similarity. Specifically, we use image representation...

chapter

Multilevel dictionary learning for sparse representation of images

J J Thiagarajan, K N Ramamurthy, A Spanias

2011 Digital Signal Processing and Signal Processing Education Meeting (DSP/SPE) > 271 - 276

2011 Digital Signal Processing and Signal Processing Education Meeting (DSP/SPE)

Adaptive data-driven dictionaries for sparse approximations provide superior performance compared to predefined dictionaries in applications involving representation and classification of data. In this paper, we propose a novel algorithm for learning global dictionaries particularly suited to the sparse representation of natural images. The proposed algorithm uses a hierarchical energy based learning...

chapter

Localized support vector machines using Parzen window for incomplete sets of categories

K L Veon, M H Mahoor

2011 IEEE Workshop on Applications of Computer Vision (WACV) > 448 - 454

2011 IEEE Workshop on Applications of Computer Vision (WACV)

This paper describes a novel approach to pattern classification that combines Parzen window and support vector machines. Pattern classification is usually performed in universes where all possible categories are defined. Most of the current supervised learning classification techniques do not account for undefined categories. In a universe that is only partially defined, there may be objects that...

chapter

Training MT Model Using Structural SVM

Tiansang Du, Baobao Chang

2010 International Conference on Asian Language Processing > 249 - 252

2010 International Conference on Asian Language Processing (IALP 2010)

This paper presents a training method of log-linear model for statistical machine translation based on structural support vector machine. This method is designed to directly optimize parameters with respect to translation quality. By adopting maximum-margin principle of SVM, the MT model can learn from training samples with generalization capability. Experiments are carried out on a hierarchical phrase-based...

chapter

Spatial Based Feature Generation for Machine Learning Based Optimization Compilation

A M Malik

2010 Ninth International Conference on Machine Learning and Applications > 925 - 930

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

Modern compilers provide optimization options to obtain better performance for a given program. Effective selection of optimization options is a challenging task. Recent work has shown that machine learning can be used to select the best compiler optimization options for a given program. Machine learning techniques rely upon selecting features which represent a program in the best way. The quality...

chapter

Homotopy Regularization for Boosting

Zheng Wang, Yangqiu Song, Changshui Zhang

2010 IEEE International Conference on Data Mining > 1115 - 1120

2010 10th IEEE International Conference on Data Mining (ICDM 2010)

In this paper, we present a homotopy regularization algorithm for boosting. We introduce a regularization term with adaptive weight into the boosting framework and compose a homotopy objective function. Optimization of this objective approximately composes a solution path for the regularized boosting. Following this path, we can find suitable solution efficiently using early stopping. Experiments...

chapter

Learning a Bi-Stochastic Data Similarity Matrix

Fei Wang, Ping Li, Arnd Christian Konig

2010 IEEE International Conference on Data Mining > 551 - 560

2010 10th IEEE International Conference on Data Mining (ICDM 2010)

An idealized clustering algorithm seeks to learn a cluster-adjacency matrix such that, if two data points belong to the same cluster, the corresponding entry would be 1; otherwise the entry would be 0. This integer (1/0) constraint makes it difficult to find the optimal solution. We propose a relaxation on the cluster-adjacency matrix, by deriving a bi-stochastic matrix from a data similarity (e.g...

chapter

A New SVM Approach to Multi-instance Multi-label Learning

Nam Nguyen

2010 IEEE International Conference on Data Mining > 384 - 392

2010 10th IEEE International Conference on Data Mining (ICDM 2010)

In this paper, we address the problem of multi-instance multi-label learning (MIML) where each example is associated with not only multiple instances but also multiple class labels. In our novel approach, given an MIML example, each instance in the example is only associated with a single label and the label set of the example is the aggregation of all instance labels. Many real-world tasks such as...

chapter

Integer Programming for Multi-class Active Learning

D Yankov, S Rajan, A Ratnaparkhi

2010 IEEE International Conference on Data Mining Workshops > 1257 - 1264

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

Active learning has been demonstrated to be a powerful tool for improving the effectiveness of binary classifiers. It iteratively identifies informative unlabeled examples which after labeling are used to augment the initial training set. Adapting the procedure to large-scale, multi-class classification problems, however, poses certain challenges. For instance, to guarantee improvement by the method...

chapter

ALPOS: A Machine Learning Approach for Analyzing Microblogging Data

Dan Zhang, Yan Liu, R D Lawrence, V Chenthamarakshan

2010 IEEE International Conference on Data Mining Workshops > 1265 - 1272

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

With the development of the Internet, the increasing volume of information posted on micro-blogging sites like Twitter necessitates efficient information filtering. In conventional text classification problems, it is assumed that the feature vectors extracted from the available documents are sufficient to learn good classifiers. However, this conventional approach is not likely to work for...

chapter

Efficient Additive Models via the Generalized Lasso

D Semenovich, N Morioka, A Sowmya

2010 IEEE International Conference on Data Mining Workshops > 1228 - 1233

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

We propose a framework for learning generalized additive models at very little additional cost (a small constant) compared to some of the most efficient schemes for learning linear classifiers such as linear SVMs and regularized logistic regression. We achieve this through a simple feature encoding scheme followed by a novel approach to regularization which we term "generalized lasso". Additive models...

chapter

Research on Stage Classification of Flight Parameter Based on PTSVM

Hui Lu, Kefei Mao

2010 13th IEEE International Conference on Computational Science and Engineering > 55 - 63

2010 IEEE 13th International Conference on Computational Science and Engineering (CSE 2010)

Flight parameter stage classification is the premise of fault diagnosis and trend forecasting based on flight parameters. Stage classification belongs to the classification optimization problem of multi-attribute data through analysis of the flight data. This paper carried out the research for the two-class classification based on the semi-supervised learning methods PTSVM (Progressive Transductive...

chapter

The Influence Machine: Nonnegative Instance-Space Learning with Differentiated Regularization

Jian Zhang

2010 Ninth International Conference on Machine Learning and Applications > 861 - 866

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

We introduce a new method for classification called the influence machine. The influence machine assigns influence powers to the instances in the training sample so that they can apply their influence to other instances through the connections between the instances specified by a connection matrix. A new instance is classified to be positive if the overall influence it receives is positive and vice...

chapter

Learning from images and speech with Non-negative Matrix Factorization enhanced by input space scaling

Joris Driesen, Hugo Van hamme, W Bastiaan Kleijn

2010 IEEE Spoken Language Technology Workshop > 1 - 6

2010 IEEE Spoken Language Technology Workshop (SLT 2010)

Computational learning from multimodal data is often done with matrix factorization techniques such as NMF (Non-negative Matrix Factorization), pLSA (Probabilistic Latent Semantic Analysis) or LDA (Latent Dirichlet Allocation). The different modalities of the input are to this end converted into features that are easily placed in a vectorized format. An inherent weakness of such a data representation...

chapter

Evaluation of Genetic Algorithms for tuning SVM parameters in multi-class problems

F Samadzadegan, A Soleymani, R A Abbaspour

2010 11th International Symposium on Computational Intelligence and Informatics (CINTI) > 323 - 328

2010 11th International Symposium on Computational Intelligence and Informatics (CINTI 2010)

Support Vector Machine (SVM) is a useful technique for data classification with successful applications in different fields of bioinformatics, image segmentation, data mining, etc. A key problem of these methods is how to choose an optimal kernel and how to optimize its parameters in the learning process of SVM. The objective of this study is to propose a Genetic Algorithm approach for parameter optimization...

chapter

On a Multiobjective Training Algorithm for RBF Networks Using Particle Swarm Optimization

G R L Silva, D A G Vieira, A C Lisboa, Vasile Palade

2010 22nd IEEE International Conference on Tools with Artificial Intelligence > 2 > 282 - 285

2010 22nd International Conference on Tools with Artificial Intelligence (ICTAI 2010)

This paper presents a novel algorithm for multiobjective training of Radial Basis Function (RBF) networks based on least-squares and Particle Swarm Optimization methods. The formulation is based on the fundamental concept that supervised learning is a bi-objective optimization problem, in which two conflicting objectives should be minimized. The objectives are related to the empirical training error...

Keywords:
TRAINING
OPTIMIZATION
LEARNING (ARTIFICIAL INTELLIGENCE)

Content availability

Available (203)
None (3)

Keywords

SUPPORT VECTOR MACHINES (89)
OPTIMISATION (57)
KERNEL (53)
ARTIFICIAL NEURAL NETWORKS (47)
MACHINE LEARNING (46)
PATTERN CLASSIFICATION (43)
CLASSIFICATION ALGORITHMS (40)
DATA MINING (37)
ACCURACY (33)
SUPPORT VECTOR MACHINE (25)
ALGORITHM DESIGN AND ANALYSIS (21)
FEATURE EXTRACTION (21)
NEURAL NETS (20)
TRAINING DATA (19)
GENETIC ALGORITHMS (17)
IMAGE CLASSIFICATION (17)
PARTICLE SWARM OPTIMISATION (17)
REGRESSION ANALYSIS (16)
SVM (16)
PARTICLE SWARM OPTIMIZATION (15)
CONVERGENCE (14)
CONVEX PROGRAMMING (14)
DATA MODELS (14)
MATHEMATICAL MODEL (13)
SUPERVISED LEARNING (13)
BENCHMARK TESTING (11)
BOOSTING (11)
DATABASES (11)
FACE RECOGNITION (11)
QUADRATIC PROGRAMMING (11)
APPROXIMATION METHODS (10)
CLASSIFICATION (10)
COMPLEXITY THEORY (10)
COMPUTATIONAL COMPLEXITY (10)
APPROXIMATION ALGORITHMS (9)
EVOLUTIONARY COMPUTATION (9)
FACE (9)
GENETIC ALGORITHM (9)
LEARNING SYSTEMS (9)
PREDICTION ALGORITHMS (9)
TESTING (9)
COMPUTATIONAL MODELING (8)
DISTANCE MEASUREMENT (8)
HEURISTIC ALGORITHMS (8)
ITERATIVE METHODS (8)
MACHINE LEARNING ALGORITHMS (8)
NEURONS (8)
OBJECT RECOGNITION (8)
PREDICTIVE MODELS (8)
PROBABILITY DENSITY FUNCTION (8)
SUPPORT VECTOR MACHINE CLASSIFICATION (8)
TEXT ANALYSIS (8)
CORRELATION (7)
HIDDEN MARKOV MODELS (7)
LEARNING ALGORITHM (7)
RADIAL BASIS FUNCTION NETWORKS (7)
SEARCH PROBLEMS (7)
VIDEO SIGNAL PROCESSING (7)
ARTIFICIAL NEURAL NETWORK (6)
BIOLOGICAL CELLS (6)
COVARIANCE MATRIX (6)
EQUATIONS (6)
ESTIMATION (6)
FEATURE SELECTION (6)
FUZZY NEURAL NETS (6)
LEARNING (6)
LEAST SQUARES APPROXIMATIONS (6)
MINIMISATION (6)
OBJECT DETECTION (6)
SIGNAL PROCESSING ALGORITHMS (6)
STATISTICAL LEARNING THEORY (6)
ADAPTATION MODEL (5)
APPROXIMATION THEORY (5)
COMPUTER VISION (5)
DICTIONARIES (5)
GALLIUM (5)
GENERALISATION (ARTIFICIAL INTELLIGENCE) (5)
GENERALIZATION (5)
GRADIENT METHODS (5)
HISTOGRAMS (5)
IMAGE RECOGNITION (5)
IMAGE REPRESENTATION (5)
IMAGE RETRIEVAL (5)
LABELING (5)
LINEAR PROGRAMMING (5)
NATURAL LANGUAGE PROCESSING (5)
NEAREST NEIGHBOR SEARCHES (5)
NEURAL NETWORKS (5)
NEUROCONTROLLERS (5)
NOISE (5)
PROTOTYPES (5)
STATISTICAL ANALYSIS (5)
VISUALIZATION (5)
ADABOOST (4)
BAYESIAN METHODS (4)
CLUSTERING ALGORITHMS (4)
COMPUTER ARCHITECTURE (4)
