The linear regression algorithm used in RSM analysis minimizes residuals at the inference space design sites. A common assumption is that residuals at these sites describe the unexplained variance that would be associated with general model preduictions elsewhere in the design space. That tends to be less true when substantial multicollinearity is in play. Under those circumstances, the prediction errors can be substantially larger at off-design points, where the model might be applied for general response prediction. For this reason, we now include as a matter of course a number of confirmation points distributed randomly within the test matrix. These points are comprised of randomly selected factor levels within the ranges tested, and are not fitted to produce the response model, but instead are used to test it. We often find significant differences in residual variance at the confirmation points relative to the design points, especially when the RSM data are overfitted, or when variance inflation factors are high. We use this in conjunction with subject-matter expertise to try to reformulate our RSM models by attempting to eliminate selected high-order terms that display high multicollinearity (as evidenced by the VIF numbers). This generally leads to a more robust model -- one with a more uniform distribution of actual prediction errors as quantified directly via the confirmation points.
MEMO TO STATEASE: The existing point-prediction capability in DX7 is really great for individual points, but some of our people are going so far as to replicate the rest of the DX7 functionality in MATLAB in order to obtain relief from the onerous chore of examining multiple prediction errors one point at a time (OPAT?

) for large numbers of confirmation points. It is not uncommon to have scores of such points in a large-scale wind tunnel test. Would you PLEASE consider a batch-mode point-prediction capability in which multiple confirmation runs might be pasted into a spreadsheet along with their measured responses, to be compared in one fell swoop with model predictions for those points? A report that shows the same information as the current point-prediction, but for multiple points, would be just the ticket. A count of residual magnitudes less than the 95% prediction interval half-width would be a valuable extra. Thanks!