Multiple regression models are often tested by making plots of predicted against measured values. In these plots, all observations are supposed to fall on the diagonal. Points not positioned on the diagonal show unmodeled behaviour. Some of these deviations are caused by random noise. Environmental data have quite some measurement and sampling noise and one is not supposed to model or predict this noise. However, there can also be a systematic variation, a bias. This bias is often expressed as systematically low predictions for high values. The high values fall below the diagonal in the plot. A kind of bias is a contraction around the diagonal. The high values are predicted too low and the low values are predicted too high: the predictions are contracted around the center of the data set. One factor contributing to bias or contraction is nonlinearities in the true physical relationship. The data set consists of hourly ozone measurements and parallel measurements of nitrogen oxides, temperature, UV radiation and more than 50 organic chemicals. The measurements were made on surface air in an urban environment. It may be assumed that the ozone concentrations are influenced by all the other variables, so a multivariate regression model may be made with 57 predictor variables and ozone concentration as the response variable. Because of expected collinearities and large noise, a partial least squares (PLS) regression model is chosen. The total set of 717 objects is split into a calibration and a test set of 358 and 359 objects, respectively. The data are noisy and the relationship is very nonlinear. It is shown how contraction and prediction bias occur and how extra steps of reducing and nonlinearizing the data remove these effects until only a substantial random noise is left.