Journal of the Royal Statistical Society: Series B (Statistical Methodology)

article

Data envelope fitting with constrained polynomial splines

Abdelaati Daouia, Hohsuk Noh, Byeong U. Park

Journal of the Royal Statistical Society: Series B (Statistical Methodology) > 78 > 1 > 3 - 30

Estimation of support frontiers and boundaries often involves monotone and/or concave edge data smoothing. This estimation problem arises in various unrelated contexts, such as optimal cost and production assessments in econometrics and master curve prediction in the reliability programmes of nuclear reactors. Very few constrained estimators of the support boundary of a bivariate distribution have...

article

Lasso regression: estimation and shrinkage via the limit of Gibbs sampling

Bala Rajaratnam, Steven Roberts, Doug Sparks, Onkar Dalal

Journal of the Royal Statistical Society: Series B (Statistical Methodology) > 78 > 1 > 153 - 174

The application of the lasso is espoused in high dimensional settings where only a small number of the regression coefficients are believed to be non‐zero (i.e. the solution is sparse). Moreover, statistical properties of high dimensional lasso estimators are often proved under the assumption that the correlation between the predictors is bounded. In this vein, co‐ordinatewise methods, which are the...

article

A tilting approach to ranking influence

Marc G. Genton, Peter Hall

Journal of the Royal Statistical Society: Series B (Statistical Methodology) > 78 > 1 > 77 - 97

We suggest a new approach, which is applicable for general statistics computed from random samples of univariate or vector‐valued or functional data, to assessing the influence that individual data have on the value of a statistic, and to ranking the data in terms of that influence. Our method is based on, first, perturbing the value of the statistic by ‘tilting’, or reweighting, each data value,...

article

Variable selection for support vector machines in moderately high dimensions

Xiang Zhang, Yichao Wu, Lan Wang, Runze Li

Journal of the Royal Statistical Society: Series B (Statistical Methodology) > 78 > 1 > 53 - 76

The support vector machine (SVM) is a powerful binary classification tool with high accuracy and great flexibility. It has achieved great success, but its performance can be seriously impaired if many redundant covariates are included. Some efforts have been devoted to studying variable selection for SVMs, but asymptotic properties, such as variable selection consistency, are largely unknown when...

article

Methodology for non‐parametric deconvolution when the error distribution is unknown

Aurore Delaigle, Peter Hall

Journal of the Royal Statistical Society: Series B (Statistical Methodology) > 78 > 1 > 231 - 252

In the non‐parametric deconvolution problem, to estimate consistently a density or distribution from a sample of data contaminated by additive random noise, it is often assumed that the noise distribution is completely known or that an additional sample of replicated or validation data is available. Methods also have been suggested for estimating the scale of the error distribution, but they require...

article

An M‐estimator of spatial tail dependence

John H. J. Einmahl, Anna Kiriliouk, Andrea Krajina, Johan Segers

Journal of the Royal Statistical Society: Series B (Statistical Methodology) > 78 > 1 > 275 - 298

Tail dependence models for distributions attracted to a max‐stable law are fitted by using observations above a high threshold. To cope with spatial, high dimensional data, a rank‐based M‐estimator is proposed relying on bivariate margins only. A data‐driven weight matrix is used to minimize the asymptotic variance. Empirical process arguments show that the estimator is consistent and asymptotically...

article

Hypothesis testing for automated community detection in networks

Peter J. Bickel, Purnamrita Sarkar

Journal of the Royal Statistical Society: Series B (Statistical Methodology) > 78 > 1 > 253 - 273

Community detection in networks is a key exploratory tool with applications in a diverse set of areas, ranging from finding communities in social and biological networks to identifying link farms in the World Wide Web. The problem of finding communities or clusters in a network has received much attention from statistics, physics and computer science. However, most clustering algorithms assume knowledge...

article

Non‐parametric estimation of finite mixtures from repeated measurements

Stéphane Bonhomme, Koen Jochmans, Jean‐Marc Robin

Journal of the Royal Statistical Society: Series B (Statistical Methodology) > 78 > 1 > 211 - 229

This paper provides methods to estimate finite mixtures from data with repeated measurements non‐parametrically. We present a constructive identification argument and use it to develop simple two‐step estimators of the component distributions and all their functionals. We discuss a computationally efficient method for estimation and derive asymptotic theory. Simulation experiments suggest that our...

article

Report of the Editors—2015

Journal of the Royal Statistical Society: Series B (Statistical Methodology) > 78 > 1 > 1 - 2

article

The lasso for high dimensional regression with a possible change point

Sokbae Lee, Myung Hwan Seo, Youngki Shin

Journal of the Royal Statistical Society: Series B (Statistical Methodology) > 78 > 1 > 193 - 210

We consider a high dimensional regression model with a possible change point due to a covariate threshold and develop the lasso estimator of regression coefficients as well as the threshold parameter. Our lasso estimator not only selects covariates but also selects a model between linear and threshold regression models. Under a sparsity assumption, we derive non‐asymptotic oracle inequalities for...

article

Non‐parametric inference for density modes

Christopher R. Genovese, Marco Perone‐Pacifico, Isabella Verdinelli, Larry Wasserman

Journal of the Royal Statistical Society: Series B (Statistical Methodology) > 78 > 1 > 99 - 126

We derive non‐parametric confidence intervals for the eigenvalues of the Hessian at modes of a density estimate. This provides information about the strength and shape of modes and can also be used as a significance test. We use a data‐splitting approach in which potential modes are identified by using the first half of the data and inference is done with the second half of the data. To obtain valid...

article

Using post‐outcome measurement information in censoring‐by‐death problems

Fan Yang, Dylan S. Small

Journal of the Royal Statistical Society: Series B (Statistical Methodology) > 78 > 1 > 299 - 318

Many clinical studies on non‐mortality outcomes such as quality of life suffer from the problem that the non‐mortality outcome can be censored by death, i.e. the non‐mortality outcome cannot be measured if the subject dies before the time of measurement. To address the problem that this censoring by death is informative, it is of interest to consider the average effect of the treatment on the non‐mortality...

article

Semiparametric estimation in the secondary analysis of case–control studies

Yanyuan Ma, Raymond J. Carroll

Journal of the Royal Statistical Society: Series B (Statistical Methodology) > 78 > 1 > 127 - 151

We study the regression relationship between covariates in case–control data: an area known as the secondary analysis of case–control studies. The context is such that only the form of the regression mean is specified, so that we allow an arbitrary regression error distribution, which can depend on the covariates and thus can be heteroscedastic. Under mild regularity conditions we establish the theoretical...

article

Optimal designs for the prediction of individual parameters in hierarchical models

Maryna Prus, Rainer Schwabe

Journal of the Royal Statistical Society: Series B (Statistical Methodology) > 78 > 1 > 175 - 191

Characterizations of optimal designs are derived for the prediction of individual response curves within the framework of hierarchical linear mixed models. It is shown that the so‐obtained optimal designs may differ substantially from those propagated in the literature so far and that the latter may become useless in terms of their performance.

article

Statistics of heteroscedastic extremes

John H. J. Einmahl, Laurens Haan, Chen Zhou

Journal of the Royal Statistical Society: Series B (Statistical Methodology) > 78 > 1 > 31 - 51

We extend classical extreme value theory to non‐identically distributed observations. When the tails of the distribution are proportional much of extreme value statistics remains valid. The proportionality function for the tails can be estimated non‐parametrically along with the (common) extreme value index. For a positive extreme value index, joint asymptotic normality of both estimators is shown;...

INFONA - science communication portal

Journal of the Royal Statistical Society: Series B (Statistical Methodology)

Data envelope fitting with constrained polynomial splines

Lasso regression: estimation and shrinkage via the limit of Gibbs sampling

A tilting approach to ranking influence

Variable selection for support vector machines in moderately high dimensions

Methodology for non‐parametric deconvolution when the error distribution is unknown

An M‐estimator of spatial tail dependence

Hypothesis testing for automated community detection in networks

Non‐parametric estimation of finite mixtures from repeated measurements

Report of the Editors—2015

The lasso for high dimensional regression with a possible change point

Non‐parametric inference for density modes

Using post‐outcome measurement information in censoring‐by‐death problems

Semiparametric estimation in the secondary analysis of case–control studies

Optimal designs for the prediction of individual parameters in hierarchical models

Statistics of heteroscedastic extremes

Filter options

Publication date

Keywords

INFONA - science communication portal

Journal of the Royal Statistical Society: Series B (Statistical Methodology)

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options