Statistical learning plays a key role in many areas of science, finance and industry. Here are some examples of learning problems: Predict whether a patient, hospitalized due to a heart attack, will have a second heart attack. The prediction is to be based on demographic, diet and clinical measurements for that patient. Predict the price of a stock in 6 months from now, on the basis of company...
The first three examples described in Chapter 1 have several components in common. For each there is a set of variables that might be denoted as inputs, which are measured or preset. These have some influence on one or more outputs. For each example the goal is to use the inputs to predict the values of the outputs. This exercise is called supervised learning.
A linear regression model assumes that the regression function $E(Y|X)$ is linear in the inputs $X_1, \ldots, X_p$. Linear models were largely developed in the precomputer age of statistics, but even in today’s computer era there are still good reasons to study and use them. They are simple and often provide an adequate and interpretable description of how the inputs affect the output. For prediction purposes...
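As a concrete illustration, here is a minimal least-squares fit in Python; the synthetic data, true coefficients, and seed are made up for the example:

```python
import numpy as np

# Least-squares fit of a linear model E(Y|X) = b0 + b1*x1 + ... on synthetic data.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))               # 100 observations, p = 3 inputs
beta_true = np.array([1.0, -2.0, 0.5])      # hypothetical true coefficients
y = 3.0 + X @ beta_true + rng.normal(scale=0.1, size=100)

Xd = np.column_stack([np.ones(len(X)), X])  # prepend an intercept column
beta_hat, *_ = np.linalg.lstsq(Xd, y, rcond=None)
print(beta_hat)                             # approximately [3.0, 1.0, -2.0, 0.5]
```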
In this chapter we revisit the classification problem and focus on linear methods for classification. Since our predictor $G(x)$ takes values in a discrete set $\mathcal{G}$, we can always divide the input space into a collection of regions labeled according to the classification. We saw in Chapter 2 that the boundaries of these regions can be rough or smooth, depending on the prediction function. For an important...
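A minimal sketch of one such linear method, two-class linear discriminant analysis on synthetic Gaussian data (the class means, simple pooled-covariance estimate, and equal priors below are illustrative choices, not prescriptions from the text):

```python
import numpy as np

# Two-class LDA: the decision boundary {x : delta_0(x) = delta_1(x)} is linear in x.
rng = np.random.default_rng(1)
X1 = rng.normal(loc=[0, 0], size=(50, 2))
X2 = rng.normal(loc=[2, 2], size=(50, 2))
X = np.vstack([X1, X2])
y = np.array([0] * 50 + [1] * 50)

mu = [X[y == k].mean(axis=0) for k in (0, 1)]           # class means
centered = np.vstack([X[y == 0] - mu[0], X[y == 1] - mu[1]])
Sinv = np.linalg.inv(np.cov(centered.T))                # pooled covariance, inverted

def discriminant(x, k, prior=0.5):
    # delta_k(x) = x^T S^-1 mu_k - (1/2) mu_k^T S^-1 mu_k + log(prior)
    return x @ Sinv @ mu[k] - 0.5 * mu[k] @ Sinv @ mu[k] + np.log(prior)

pred = np.array([int(discriminant(x, 1) > discriminant(x, 0)) for x in X])
print("training accuracy:", np.mean(pred == y))
```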
We have already made use of models linear in the input features, both for regression and classification. Linear regression, linear discriminant analysis, logistic regression and separating hyperplanes all rely on a linear model.
In this chapter we describe a class of regression techniques that achieve flexibility in estimating the regression function $f(X)$ over the domain $\mathbb{R}^p$ by fitting a different but simple model separately at each query point $x_0$.
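One simple member of this class is the Nadaraya–Watson kernel smoother, which fits a kernel-weighted constant at each query point; the Gaussian kernel, bandwidth, and synthetic data below are illustrative assumptions:

```python
import numpy as np

# Nadaraya-Watson kernel smoother: at each query point x0, fit the simplest
# possible "local model" (a constant) using Gaussian kernel weights.
rng = np.random.default_rng(2)
x = np.sort(rng.uniform(0, 2 * np.pi, 80))
y = np.sin(x) + rng.normal(scale=0.2, size=80)

def f_hat(x0, bandwidth=0.4):
    w = np.exp(-0.5 * ((x - x0) / bandwidth) ** 2)  # kernel weights K((x - x0)/h)
    return np.sum(w * y) / np.sum(w)                # weighted average near x0

grid = np.linspace(0, 2 * np.pi, 5)
print([round(f_hat(g), 2) for g in grid])           # roughly tracks sin(x)
```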
The generalization performance of a learning method relates to its prediction capability on independent test data. Assessment of this performance is extremely important in practice, since it guides the choice of learning method or model, and gives us a measure of the quality of the ultimately chosen model.
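A common way to assess this performance in practice is K-fold cross-validation; the sketch below estimates test mean squared error for a least-squares fit on synthetic data (the fold count, data, and seed are arbitrary choices for the example):

```python
import numpy as np

# K-fold cross-validation: estimate test MSE by holding out each fold in turn.
rng = np.random.default_rng(3)
X = rng.normal(size=(100, 5))
y = X[:, 0] - 2 * X[:, 1] + rng.normal(size=100)

def cv_mse(X, y, K=5):
    idx = rng.permutation(len(y))
    errs = []
    for fold in np.array_split(idx, K):
        train = np.setdiff1d(idx, fold)
        Xd = np.column_stack([np.ones(len(train)), X[train]])
        beta, *_ = np.linalg.lstsq(Xd, y[train], rcond=None)   # fit on K-1 folds
        pred = np.column_stack([np.ones(len(fold)), X[fold]]) @ beta
        errs.append(np.mean((y[fold] - pred) ** 2))            # error on held-out fold
    return np.mean(errs)

print("5-fold CV estimate of test MSE:", round(cv_mse(X, y), 3))
```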
For most of this book, the fitting (learning) of models has been achieved by minimizing a sum of squares for regression, or by minimizing cross-entropy for classification. In fact, both of these minimizations are instances of the maximum likelihood approach to fitting.
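To make the equivalence explicit (in generic notation, not necessarily this text's): if $Y = f_\theta(X) + \varepsilon$ with $\varepsilon \sim N(0, \sigma^2)$, the log-likelihood of the training data is
$$\ell(\theta) = -\frac{N}{2}\log(2\pi\sigma^2) - \frac{1}{2\sigma^2}\sum_{i=1}^{N}\big(y_i - f_\theta(x_i)\big)^2,$$
so maximizing $\ell(\theta)$ over $\theta$ is the same as minimizing the residual sum of squares. Likewise, for a binary outcome with $\Pr(Y = 1 \mid X) = p_\theta(X)$, the log-likelihood
$$\ell(\theta) = \sum_{i=1}^{N}\big[y_i \log p_\theta(x_i) + (1 - y_i)\log\big(1 - p_\theta(x_i)\big)\big]$$
is the negative of the cross-entropy, so minimizing cross-entropy is again maximum likelihood.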
In this chapter we begin our discussion of some specific methods for supervised learning. These techniques each assume a (different) structured form for the unknown regression function, and by doing so they finesse the curse of dimensionality.
Boosting is one of the most powerful learning ideas introduced in the last twenty years. It was originally designed for classification problems, but as will be seen in this chapter, it can profitably be extended to regression as well. The motivation for boosting was a procedure that combines the outputs of many “weak” classifiers to produce a powerful “committee.” From this perspective boosting bears...
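A compact sketch of the committee idea, AdaBoost with decision stumps as the weak classifiers (the 1-D synthetic data, 20 boosting rounds, and exhaustive stump search are illustrative simplifications):

```python
import numpy as np

# AdaBoost with decision stumps; labels in {-1, +1}. The target |x| > 0.5
# cannot be fit by any single stump, but the weighted committee can fit it.
rng = np.random.default_rng(4)
x = rng.uniform(-1, 1, 200)
y = np.where(np.abs(x) > 0.5, 1, -1)

def fit_stump(x, y, w):
    best = None
    for s in np.unique(x):                          # try every threshold
        for sign in (1, -1):
            pred = sign * np.where(x > s, 1, -1)
            err = np.sum(w * (pred != y))           # weighted error
            if best is None or err < best[0]:
                best = (err, s, sign)
    return best

w = np.full(len(x), 1 / len(x))                     # start with uniform weights
stumps = []
for _ in range(20):
    err, s, sign = fit_stump(x, y, w)
    alpha = 0.5 * np.log((1 - err) / max(err, 1e-12))
    pred = sign * np.where(x > s, 1, -1)
    w *= np.exp(-alpha * y * pred)                  # upweight misclassified points
    w /= w.sum()
    stumps.append((alpha, s, sign))

F = sum(a * sgn * np.where(x > s, 1, -1) for a, s, sgn in stumps)
print("committee training accuracy:", np.mean(np.sign(F) == y))
```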
In this chapter we describe a class of learning methods that was developed separately in different fields—statistics and artificial intelligence—based on essentially identical models. The central idea is to extract linear combinations of the inputs as derived features, and then model the target as a nonlinear function of these features.
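The structure (though not the training) of such a model fits in a few lines: derived features $Z_m = \sigma(\alpha_m^T X)$, followed by an output built from those features. The weights below are random, purely to exhibit the forward pass:

```python
import numpy as np

# Single-hidden-layer network: derived features are linear combinations of the
# inputs passed through a sigmoid; the output is nonlinear in the original x.
rng = np.random.default_rng(5)
p, M = 4, 8                                 # p inputs, M hidden units
alpha = rng.normal(size=(M, p + 1))         # hidden-layer weights (with bias)
beta = rng.normal(size=M + 1)               # output-layer weights (with bias)

def sigma(v):
    return 1.0 / (1.0 + np.exp(-v))         # sigmoid activation

def forward(x):
    z = sigma(alpha @ np.append(1.0, x))    # derived features Z_1, ..., Z_M
    return beta @ np.append(1.0, z)         # output: nonlinear function of x

print(forward(rng.normal(size=p)))
```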
In this chapter we describe generalizations of linear decision boundaries for classification. Optimal separating hyperplanes are introduced in Chapter 4 for the case when two classes are linearly separable. Here we cover extensions to the nonseparable case, where the classes overlap.
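One standard formulation for the nonseparable case is the soft-margin objective, here minimized directly by subgradient descent on the hinge loss (the penalty `lam`, learning rate, iteration count, and overlapping synthetic classes are all illustrative assumptions):

```python
import numpy as np

# Soft-margin linear SVM sketch: minimize
#   (1/N) sum_i max(0, 1 - y_i (w.x_i + b)) + lam * ||w||^2
# by subgradient descent; labels y in {-1, +1}, classes overlap.
rng = np.random.default_rng(6)
X = np.vstack([rng.normal([-1, -1], 1.0, (60, 2)),
               rng.normal([+1, +1], 1.0, (60, 2))])
y = np.array([-1] * 60 + [1] * 60)

w, b, lam, lr = np.zeros(2), 0.0, 0.01, 0.1
for _ in range(500):
    mask = y * (X @ w + b) < 1                                # margin violators
    gw = 2 * lam * w - (y[mask, None] * X[mask]).sum(axis=0) / len(y)
    gb = -y[mask].sum() / len(y)
    w -= lr * gw
    b -= lr * gb

print("training accuracy:", np.mean(np.sign(X @ w + b) == y))
```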
In this chapter we discuss some simple and essentially model-free methods for classification and pattern recognition. Because they are highly unstructured, they typically are not useful for understanding the nature of the relationship between the features and class outcome.
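k-nearest neighbours is the canonical example; a minimal sketch on synthetic two-class data (the value of k, the Euclidean metric, and the data are arbitrary choices):

```python
import numpy as np

# k-nearest-neighbour classification: no model is fit; a query point simply
# takes the majority class among its k closest training points.
rng = np.random.default_rng(7)
X = np.vstack([rng.normal([0, 0], 1.0, (50, 2)),
               rng.normal([2, 2], 1.0, (50, 2))])
y = np.array([0] * 50 + [1] * 50)

def knn_predict(x0, k=5):
    dist = np.linalg.norm(X - x0, axis=1)      # Euclidean distances to x0
    nearest = y[np.argsort(dist)[:k]]          # labels of the k closest points
    return np.bincount(nearest).argmax()       # majority vote

print(knn_predict(np.array([1.8, 2.1])))       # expected: 1
```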
The previous chapters have been concerned with predicting the values of one or more outputs or response variables $Y = (Y_1, \ldots, Y_m)$ for a given set of input or predictor variables $X^T = (X_1, \ldots, X_p)$. Denote by $x_i^T = (x_{i1}, \ldots, x_{ip})$ the inputs for the i...
Bagging or bootstrap aggregation (section 8.7) is a technique for reducing the variance of an estimated prediction function. Bagging seems to work especially well for high-variance, low-bias procedures, such as trees. For regression, we simply fit the same regression tree many times to bootstrap-sampled versions of the training data, and average the result. For classification, a committee of trees each...
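A minimal regression sketch, using scikit-learn's DecisionTreeRegressor as the base learner and B = 50 bootstrap samples (the library choice, B, and the synthetic data are assumptions for the example):

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

# Bagging: fit the same high-variance regression tree to B bootstrap samples
# and average the predictions; the average has lower variance than any one tree.
rng = np.random.default_rng(8)
x = np.sort(rng.uniform(0, 2 * np.pi, 100))[:, None]
y = np.sin(x.ravel()) + rng.normal(scale=0.3, size=100)

B, preds = 50, []
for _ in range(B):
    idx = rng.integers(0, len(y), len(y))       # bootstrap sample (with replacement)
    tree = DecisionTreeRegressor().fit(x[idx], y[idx])
    preds.append(tree.predict(x))
bagged = np.mean(preds, axis=0)

truth = np.sin(x.ravel())                       # noise-free target for comparison
print("single-tree MSE:", round(np.mean((preds[0] - truth) ** 2), 3))
print("bagged MSE:     ", round(np.mean((bagged - truth) ** 2), 3))
```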
The idea of ensemble learning is to build a prediction model by combining the strengths of a collection of simpler base models. We have already seen a number of examples that fall into this category.
A graph consists of a set of vertices (nodes), along with a set of edges joining some pairs of the vertices. In graphical models, each vertex represents a random variable, and the graph gives a visual way of understanding the joint distribution of the entire set of random variables. They can be useful for either unsupervised or supervised learning. In an undirected graph, the edges have no directional...
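For the Gaussian case, the correspondence between graph structure and the joint distribution can be shown directly: a missing edge corresponds to a zero in the precision (inverse covariance) matrix, i.e. conditional independence given the remaining variables. The 3-node precision matrix below is made up for illustration:

```python
import numpy as np

# Gaussian graphical model on 3 nodes: edges 0-1 and 1-2, but no edge 0-2,
# encoded as Theta[0, 2] = 0 in the precision matrix.
Theta = np.array([[1.0, 0.4, 0.0],
                  [0.4, 1.0, 0.4],
                  [0.0, 0.4, 1.0]])
Sigma = np.linalg.inv(Theta)     # the marginal covariance is dense:
print(np.round(Sigma, 2))        # nodes 0 and 2 are marginally correlated,
                                 # yet conditionally independent given node 1.
```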
In this chapter we discuss prediction problems in which the number of features p is much larger than the number of observations N, often written p ≫ N. Such problems have become of increasing importance, especially in genomics and other areas of computational biology. We will see that high variance and overfitting are a major concern in this setting. As a result, simple, highly regularized approaches...
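Ridge regression is one such highly regularized approach; the sketch below solves the penalized normal equations on a made-up p = 200, N = 20 problem (the penalty `lam` and the sparse true coefficients are illustrative assumptions):

```python
import numpy as np

# Ridge regression with p >> N: least squares is ill-posed with 200 features
# and 20 observations, but beta = (X^T X + lam*I)^{-1} X^T y is well defined.
rng = np.random.default_rng(9)
N, p = 20, 200
X = rng.normal(size=(N, p))
beta_true = np.zeros(p)
beta_true[:3] = [2.0, -1.0, 1.5]            # only 3 features actually matter
y = X @ beta_true + rng.normal(scale=0.1, size=N)

lam = 10.0                                  # heavy regularization
beta_hat = np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)
print("largest |coef| indices:", np.argsort(-np.abs(beta_hat))[:3])  # often 0, 1, 2
```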