Multiple linear regression (MLR) is a commonly used statistical technique to predict future values. In this paper, we examine the situation in which a given time series dataset contains numerous observations of important predictor variables that can effectively be classified into groups based on their values. In such situations, cluster analysis is often employed to improve the MLR models predictive accuracy, usually by creating separate regressions for each cluster. We introduce a novel approach in which we use the clusters and cluster centroids as input data for the predictor variables to improve the predictive accuracy of the MLR model. We illustrate and test this approach with a real dataset on fleet maintenance.

You do not currently have access to this chapter.
Don't already have an account? Register

Purchased this content as a guest? Enter your email address to restore access.

Please enter valid email address.
Email address must be 94 characters or fewer.