Research Article Volume 5 Issue 3
A general overview of response surface methodology
Andre I Khuri
Regret for the inconvenience: we are taking measures to prevent fraudulent form submissions by extractors and page crawlers. Please type the correct Captcha word to see email ID.
Department of Statistics, University of Florida, USA
Correspondence: Andre I. Khuri, Department of Statistics, University of Florida, USA
Received: October 29, 2017 | Published: March 10, 2017
Citation: Khuri AI. A general overview of response surface methodology. Biom Biostat Int J. 2017;5(3):87-93. DOI: 10.15406/bbij.2017.05.00133
Download PDF
Response surface methodology
Response surface methodology (RSM) is a collection of statistical and mathematical techniques used for the purpose of
- Setting up a series of experiments (design) for adequate predictions of a response y.
- Fitting a hypothesized (empirical) model to data obtained under the chosen design.
- Determining optimum conditions on the model's input (control) variables that lead to maximum or minimum response within a region of interest.
Formal work on RSM began with the publication of the article On the Experimental Attainment of Optimum Conditions by Box et al.1
RSM review articles
- Hill and Hunter2
- Mead and Pike3
- Myers, Khuri, and Carter4
See also the article by Steinberg and Hunter.5
RSM books include
- Myers and Montgomery6
- Khuri and Cornell7
- Box and Draper8
The present article provides a general overview of response surface methodology (RSM).
Notation
Let
denote the mean of a response variable
. Let
denote a set of input, or control, variables. Then,
For example, we have the first-order model
or the second-order model
where
and
Let
denote the design setting of variable
at the uth experimental run
. Let
denote the corresponding response value. By definition, the design matrix
is the
matrix
We have the linear model
where
is
of rank p and
is a vector of unknown parameters.
The least-squares estimator of
is
where
is a vector of order
of the same form as a row of
, but
is evaluated at the point
For a first-order model, the predicted response is
and for a second-order model we have
The prediction variance is given by
Choice of a response surface design
Some important properties of a response surface design (Box and Draper)8:
- Generation of a satisfactory distribution of information throughout the region of interest,
.
- “Closeness" of
to
over
.
- Good detectibility of lack of fit.
- Insensitivity (robustness) to extreme observations and to violations of the usual normal theory assumptions.
- Ability to perform experiments in blocks.
- Extendibility to a higher-order design.
- Requiring a small number of experimental runs.
Variance-related design criteria
- D-optimality: Maximization of
is propotional to the volume of an ellipsoidal confidence region on
.
- A-optimality: Minimization of
- E-optimality: Minimization of the largest eigenvalue of
- G-optimality: Minimization of
- The above are referred to as alphabetic optimality criteria. Some authors, including Box [9], have questioned the applicability of alphabetic optimality theory to response surface experiments, since such optimal designs are sensitive to the form of the model.
- Rotatability: A design D is rotatable if
is constant at all points that are equidistant from the design center (Box and Hunter (1957)).
- A rotatable design has the uniform precision property (UP) if the prediction variance at
is equal to the prediction variance at a distance
from the origin. A design is rotatable if and only if the
matrix has a particular form.
- Rotatability is a desirable property to have, especially when there is a need to optimize
over the surface of a hypersphere, as is the case in ridge analysis.
- Orthogonality: A design is orthogonal if it can provide independent information about the effects of the various terms in the model.
Examples:
-
is orthogonal, but is not rotatable.
- Central composite design can be made rotatable and either have the
- UP property or the orthogonality property.
Bias-related design criteria
The fitted model is
The “True" model is
Integrated Mean Squared Error Criterion:10,11 This amounts to the minimization of J, where
is the reciprocal of the volume of
.
where
is the alias matrix, and
is the matrix corresponding to
The Box and Draper criterion for the minimization of
is:
Designs that minimize J have characteristics similar to those of designs which minimize
alone (all bias designs).
Design robustness
A design is robust if it helps reduce the impact of nonideal conditions on data analysis. Some of these conditions include
- Nonnormal errors
- Missing observations
- Outliers
Box12 introduced the word “robust" when examining the effect of departure from normality on the analysis of variance. Some relevant articles are:13,15,16,48
Designs for the slope
These are designs for the estimation of the partial derivatives of the mean response
with respect
Myers and Lahoda17 extended the Box-Draper integrated mean squared error criterion to find appropriate designs for the joint estimation of the partial derivatives of
.
Hader and Park18 introduced the concept of design rotatability for second-order models. Under this design criterion,
is constant at all points that are equidistant from the design center
Park19 considered “slope rotatability over all directions":
where
is a unit vector.
where
denotes the surface of a hypersphere of unit radius,
is the reciprocal of the surface of
.
A design
is slope rotatable over all directions if
is constant at all points equidistant from the design center.
Park gave necessary and sufficient conditions for a design to be slope rotatable over all directions for second-order models.
Related articles
- Huda and Mukerjee20
- Mukerjee and Huda,21
- Park22
Designs to increase the power of the lack of fit test
These are designs that induce a certain degree of sensitivity to possible inadequacy of the fitted model.
Construction of such designs is done by the maximization of
optimality: Maximize
, the minimum of
over a specified region
in the
space.
optimality: Maximize
, the average value of
over the boundary of
.
Response surface analysis
Estimation of Optimum Conditions.
The Method of Steepest Ascent This is a maximum-region-seeking procedure introduced by Box and Wilson1 for RSM: To maximize
subject to the constraint
where
is a unit vector in the direction of
A related article is Myers and Khuri.23
Ridge analysis
Related articles include Hoerl,24 Draper25
To maximize (minimize)
subject to the constraint
where
is a Lagrange multiplier.
For a maximum,
should be larger than
.
For a minimum,
should be smaller than
.
By selecting several values of
as shown above, we obtain plots of
max.
vs. r and plots of
vs.
for
Khuri and Myers26 introduced a certain modification to the method of ridge analysis in cases in which the design is not rotatable and may even be ill conditioned. In such cases,
can vary appreciably on a hypersphere
centered at the origin inside a region of interest. The proposed modification optimizes
on
subject to maintaining certain a constraint on the size of
where
is the number of parameters in the model,
are orthonor-mal eigenvectors of
, and
are the corresponding eigenvalues of
. Let
be the smallest eigenvalue of
and let
be the corresponding eigenvector. To optimize
subject to the constraints
where
is a small positive constant, or equivalently,
Confidence intervals and confidence regions associated with the optimum response
The estimated optimum of a mean response and the location of the optimum are obtained by using
. They are therefore random variables. In order to have a better assessment of optimum conditions, confidence intervals on the optimum of the mean response as well as confidence regions on the location of the optimum are needed. Relevant references include
- Box and Hunter27
- Stablein, Carter, and Wampler28
- Carter, Chinchilli, Campbell, and Wampler29
- Carter, Chinchilli, Myers, and Campbell30
Measure of rotatability31
A design is rotattable if and only if the
matrix has a particular form, which we denote by
. For any given design
, the “closeness" of the corresponding
matrix to
can be determined. On this basis, a measure
, which quantifies the percent rotatability that is inherent in
, can be derived.
design
is rotatable if and only if
.
For example, the
factorial design is 93.08% rotatable.
Properties of
:
- It is invariant to a change of scale (same change for all the input variables).
- It is invariant to the addition of experimental runs at the design center (design augmentation at the center).
- It applies to any response surface model and design.
- It is not invariant to design rotation.
Draper and Pukelsheim32 introduced a similar measure that is invariant to design rotation, but it applies only to second-order models.
Advantages of a measure of rotatability
- Can be used to compare designs with respect to their degree of rotatability.
- Can be used to construct a design that satisfies a certain desirable criterion, such as a bias or a variance criterion, in addition to having a relatively high percent rotatability.
- Can be used to “repair" rotatability by the addition of properly chosen experimental runs to a given design.
Variance dispersion graphs
Giovannitti-jensen and myers (1989)
They plotted maximum and minimum values of
, on a hyperspher
, against
.
Relevant Reference are Khuri33 used the measure of rotatability
to derive several upper bounds on the range of
. These bounds are easy to compute since they only require determining eigenvalues and traces associated with the matrices
and
, where
and
is the “rotatable" portion of
.
Measuring slope rotatability
Park and Kim34 defined a measure
to quantify the degree of slope rotatability over axial directions18 for second-order models. It takes the value zero if and only if the design is slope rotatable. It can also be used to “repair" slope rotatability by design augmentation.
Jang and Park35 considered the maximum,
, and minimum,
, values of
over a hypersphere
, where
The quantity
is zero if and only if the design is slope rotatable over all directions. Jang and Park plotted
and
against
(slope variance dispersion graphs).
Achievement of target conditions
Khuri and vining36
Specification Problem: To determine conditions on the input variables in a process that cause the mean response
to fall within specified bounds, e.g.,
, where a and b are given, with a specified degree of confidence.
Procedure
Start with an initial n-point design
Calculate
Compute
Where
If there is a point
such that
, then
has a probability of at least
of satisfying
is a solution to the specification problem.
Khuri and Vining presented a sequential procedure of adding points to
, if necessary, until a solution can be found to the inequalities
for a sufficiently large
.
Multiresponse experiments
An experiment in which a number of responses can be measured for each setting of a group of input variable,
, is called a multiresponse experiment.
When several responses are considered simultaneously, any statistical investigation of the responses should take into consideration the multi- variate nature of the data. The response variables should not be analyzed individually or independently of one another. This is particularly true when the response variables are correlated.
Traditional response surface techniques that apply to single response models are, in general, not adequate to analyze multiresponse models.
Linear multiresponse models
In particular, if the individual response models are linear, then
Box draper37 determinant criterion
An estimate of
is obtained by minimizing the determinant of
Relevant articles include Box, Hunter, MacGregor, and Erjavec;38 Khuri.39
Estimation for linear multiresponse models
The r linear response models,
can be represented as a single linear multiresponse model
Where
X is a block-diagonal matrix
,
and,
The variance-covariance matrix of is given by the direct (Kronecker) product
If
is unknown, an estimate
can be used, where
and
Zelner40
In particular, if
, then,
Designs for multiresponse models
Draper and Hunter,41 Fedorov):42 D-optimal designs.
must be known. Wijesinha and Khuri43 used an estimate of
. Wijesinha and Khuri44 considered designs that maximize the power of the multi variate lack fit test.
Khuri:45 Multiresponse rotatability for linear multirespose models.
A design is multiresponse rotatable if
is constant at all points x that are equidistant from the origin, where
is the vector of r predicted responses.
This design property can be achieved if and only if the design is rotatable for a single-response model having the highest degree among all the r response models.
Lack of fit of a multiresponse model
Khuri46
The models
can be written as
where
Assume that replicated observations are available on all the responses at some points in a region of interest.
The above model suffers from lack of fit if and only if there exists a linear combination of the responses that suffers from lack of fit.
The multivariate lack of fit test is Roy's largest-root test,
Where
(
repeated observations are taken at the
th design point
).
Large values of the test statistic are significant.
Note: The test may be significant even though the response models do not individually exhibit a significant lack of fit.
A numerical example
Richert, et al.47 investigated the effects of heating temperature
, pH level
, redox potential
, sodium oxalate
, and sodium lauryl sulfate
on foaming properties of whey protein concentrates. The responses are
=whipping time
=maximum overrun
=percent soluble protein.
A second-order model was assumed for each response.
Multiresponse optimization
Khuri and Conlon48
The purpose of a multiresponse optimization technique is to find operating conditions on the input variables that lead to optimal, or near optimal, values of several response functions.
In this case, an unbiased estimate of
, the variance-covariance matrix of ther response variables, is given by
Let
Let
be the optimum value of
optimized individually over a region of interest R
.
An “ideal” optimum occurs when
, for
, attain their individual optima at the same set of conditions.
To find “compromise conditions" that are somewhat favorable to all the responses, we consider the metric (distance function)
Multiresponse optimization is then reduced to minimizing
with respect to
over
.
Future directions in RSM
- Research workers in RSM should provide the means to facilitate the use of RSM by data analysts, practitioners, and experimenters.
- Research workers in RSM need feedback from experimenters. This can help create methodology that applies to realistic situations.
- Better software support. The software currently available for RSM is limited. This is particularly true in the multiresponse area.
- More research work is needed in the multiresponse case, especially in the design area.
- Multiresponse techniques should be modified to allow the presence of a block effect in the model.
- There is a need to explore new RSM techniques suitable for more general models under less restrictive assumptions (generalized linear models).
- Develop designs which are robust with respect to several criteria.
- There is a need for nonparametric techniques in RSM (model-free techniques).
Acknowledgments
Conflicts of interest
References
- Box GEP, Wilson KB. On the experimental attainment of optimum conditions. Journal of the Royal Statistical Society B. 1951;13:1–45.
- Hill WJ, Hunter WG. A review of response surface methodology: A literature survey. Technometrics. 1996;8(4):571–590.
- Mead R, Pike DJ. A review of response surface methodology from a biometric point of view. Biometrics. 1975;31(4):803–851.
- Myers RH, Khuri AI, Carter WH. Response surface methodology: 1966-1988. Technometrics. 1989;31(2):137–157.
- Steinberg DM, Hunter WG (1984) Experimental design: review and comment: (with Discussion and Response). Technometrics 26(2):105–109.
- Myers RH, Montgomery DC. Response surface methodology: process and product optimization using designed experiments. Wiley, New York; 1995.
- Khuri AI, Cornell JA. Response surfaces. 2nd ed. Dekker, New York; 1996.
- Box GEP, Draper NR. Response surfaces, mixtures, and ridge analyses. 2nd ed. Wiley, Hoboken, New Jersey; 2007.
- Box GEP. Choice of response surface design and alphabetic optimality. Utilitas Mathematica. 1982;21B:11–55.
- Box GEP, Draper NR. A basis for the selection of a response surface design. Journal of the American Statistical Association. 1959;54(287):622–654.
- Box GEP, Draper NR. The choice of a second order rotatable design. Biometrika. 1963;50(3/4):335–352.
- Box GEP. Non-normality and tests on variances. Biometrika. 1953;40(3/4):318–335.
- Herzberg AM, Andrews DF. Some considerations in the optimal design of experiments in non-optimal situations. Journal of the Royal Statistical Society. 1976;38(3):284–289.
- Khuri AI, Conlon M. Simultaneous optimization of multiple responses represented by polynomial regression functions. Technometrics. 1981;23(4):363–375.
- McKee B, Kshirsagar AM. Effect of missing plots in some response surface designs. Communications in Statistics - Theory and Methods. 1982;11(14):1525–1549.
- Akhtar M, Prescott P. Response surface designs robust to missing observations. Communications in Statistics - Simulation and Computation. 1986;15(2):345–363.
- Myers RH, Lahoda SJ. A generalization of the response surface mean square error criterion with a specific application to the slope. Technometrics. 1975;17(4):481–486.
- Hader RJ, Park SH. Slope rotatable central composite designs. Technometrics. 1978;20(4):413–417.
- Park SH. A class of multifactor designs for estimating the slope of response surfaces. Technometrics. 1987;29(4):449–453.
- Huda S, Mukerjee R. Minimizing the maximum variance of the difference between two estimated responses. Biometrika. 1984; 71(2):381–385.
- Mukerjee R, Huda S. Minimax second- and third- order designs to estimate the slope of a response surface. Biometrika. 1985; 72(1):173–178.
- Park JY. Designs for estimating the difference between two responses. Communications in Statistics - Theory and Methods. 1990;19(12):4773–4787.
- Khuri AI, Myers RH. Modified ridge analysis. Techno-metrics. 1979;21(4):467–473.
- Hoerl AE. Optimum solution of many variables equations. Chemical Engineering Progress. 1953;55:67–78.
- Draper NR. “Ridge analysis” of response surfaces. Technometrics. 1963;5(4):469–479.
- Myers RH, Khuri A. A new procedure for steepest ascent. Communications in Statistics - Theory and Methods. 1979;8(14): 1359–1376.
- Box GEP, Hunter JS. A confidence region for the solution of a set of simultaneous equations with an application to experimental design. Biometrika. 1954;41(1/2):190–199.
- Stablein DM, Carter WH, Wampler GL. Confidence regions for constrained optima in response-surface experiments. Biometrics. 1983;39(3):759–763.
- Carter WH, Chinchilli VM, Campbell ED, et al. Confidence interval about the response at the stationary point of a response surface, with an application to preclinical cancer therapy. Biometrics. 1984;40(4):1125–1130.
- Carter WH, Chinchilli VM, Myers RH, et al. Confidence intervals and improved ridge analysis of response surfaces. Technometrics. 1986;28(4):339–346.
- Khuri AI. A measure of rotatability for response surface designs. Technometrics. 1988;30:95–104.
- Draper NR, Pukelsheim F. Another look at rotatability. Technometrics. 1990;32(2):195–202.
- Khuri AI. Response surface models with random block effects. Technometrics. 1992;34(1):26–37.
- Park SH, Kim HJ. A measure of slope-rotatability for second order response surface experimental designs. Journal of Applied Statistics. 1992;19(3):391–404.
- Jang DH, Park SH. A measure and a graphical method for evaluating slope rotatability in response surface designs. Communications in Statistics - Theory and Methods. 1993;22:1849–1863.
- Khuri AI, Vining GG. Conditions required for the mean response to fall within specified bounds. Journal of Statistical Planning and Inference. 1991;28(1):125–136.
- Box GEP, Draper NR. The Bayesian estimation 0f common parameters from several responses. Biometrika. 1965;52(3/4):355–365.
- Box GEP, Hunter WG, MacGregor JF, et al. Some problems associated with the analysis of multiresponse data. Technometrics. 1973;15(1):33–51.
- Khuri AI. Multiresponse rotatability. Journal of Statistical Plan- ning and Inference.1990;25(1):1–6.
- Zelner A. An efficient method of estimating seemingly unrelated regressions and tests for aggregation bias. Journal of the American Statistical Association. 1962;57:348–368.
- Draper NR, Hunter WG. Design of experiments for parameter estimation in multiresponse situations. Biometrika. 1966;53(3/4): 525–533.
- Fedorov VV. Theory of optimal experiments. Academic Press, New York; 1972.
- Wijesinha MC, Khuri AI. The sequential generation of multiresponse D-optimal designs when the variance-covariance matrix is not known. Communications in Statistics-Simulation and Computation. 1987;16(1):239–259.
- Wijesinha MC. Construction of optimal designs to increase the power of the multiresponse lack of fit test. Journal of Statistical Planning and Inference. 1987;16:179–192.
- Khuri AI. The analysis of multiresponse experiments: A review. in statistical design and analysis of industrial experiments, In: Subir Ghosh, editor. New York; 1990;231–246.
- Khuri AI. A test for lack of fit of a linear multiresponse model. Technometrics. 1985;27(3):213–218.
- Richert SH, Morr CV, Conney CM. Effect of heat and other factors upon foaming properties of whey protein concentrates. Journal of Food Science. 1974;39(1):42–48.
- Khuri AI, Myers RH. Design related robustness of tests in regression models. Communications in Statistics - Theory and Methods. 1981;10(3):223–235.
©2017 Khuri. This is an open access article distributed under the terms of the,
which
permits unrestricted use, distribution, and build upon your work non-commercially.