Research Article Volume 11 Issue 5
An analytical study of the critical values of response
rate in single-arm phase II clinical trial designs
Yi Liu,1 Sin-Ho Jung2
Regret for the inconvenience: we are taking measures to prevent fraudulent form submissions by extractors and page crawlers. Please type the correct Captcha word to see email ID.
1Department of Statistics, North Carolina State University, USA
2Department of Biostatistics and Bioinformatics, Duke University, USA
Correspondence: Sin-Ho Jung, Department of Biostatistics and Bioinformatics, Duke University, USA
Received: November 07, 2022 | Published: December 30, 2022
Citation: Yi L, Sin-Ho J. An analytical study of the critical values of response rate in single-arm phase II clinical trial designs. Biom Biostat Int J. 2022;11(5):178-183. DOI: 10.15406/bbij.2022.11.00374
Download PDF
Abstract
A single-arm phase II clinical trial is usually conducted for finding an appropriate dose-level and testing toxicity for an experimental cancer therapy in comparison to some historical controls, and is usually the most doable trial type due to the feasibility under limited budget, patient pool and medical conditions that can be met. We considered the standard setting of a single-arm two-stage phase II clinical trial, and investigated the patterns of critical values and sample sizes at both two stages of minimax and optimal designs under different design parameters, i.e., under different response rates of control and treatment, significant level of the test, and statistical power. We provided analytic derivations to the patterns we are interested under large sample approximation, and investigated them under finite ans small sample via a numerical study by considering extensively possible design parameters over a fine grid. We finally concluded that the critical values at different stages of the test are related to the sample sizes at different stages in a similar way for both optimal and minimax designs, but they also reveal some differences and reflected the nature of these two types of design.
Keywords: single-arm two-stage trial, optimal design, minimax design, critical value, response rate
Introduction
A phase II clinical trial is often conducted to find an appropriate dose-level and test toxicity for experimental cancer therapies in compare to some historical controls. This kind of trials usually requires small sample sizes due to ethical considerations, before proceeding to the subsequent phase of the trial for assessing the efficacy, outcomes and adverse effect in a larger group of subjects.1,2 At the same time, a single-arm design is often the most doable trial due to the feasibility under limited budget, patient pool and medical conditions that can be met. The number of written requests reported on 2021 of single-arm trials issued by the Food and Drug Administration (FDA) of the United States is an essential volume.3
Statistically, the most simple and traditional single-arm phase II clinical trial design-the single-stage design, is specified as follows. Given the values of design parameters
, that is, statistical significance level, power of the test, null response rate, and treatment response rate, respectively, we want to find a pair of positive integers
to specify the sample size and rejection boundary (or critical value, for rejecting
) of the phase II trial under some constraints, and usually we also have a upper bound of
specified subjectively, such as
to reflect the small-sample characteristic of a phase II trial.4 Then, starting from
for a fixed
(the minimal sample size needed), we can search over a grid the smallest integer
such that based on level
, we reject
if
and otherwise fail to reject
, that is,
where
is the CDF of
. Then if the power
where
is the CDF of
, the pair
is said to be the optimal design. If there is no
meet these conditions simultaneously, we continue to
. Thus, the optimal design minimizes the sample size among all designs satisfies the conditions. It is also easy to know that the optimal single-stage design under a parameter setting is unique.
In some scenario, two-stage designs are more likely to be considered. In a two-stage design, the phase II clinical trial is conducted via two stages sequentially, and whether to proceed to the second stage depends on the results from the first stage. A desired design under the specifications of some parameters is given by finding appropriate sample sizes and critical values of the response rate of both two stages. The two-stage design is more economical and ethical, which makes the trial stop earlier if there is no efficacy tested, and the data collected is more informative than that from a single-stage trial if we proceed to phase III.4 Similar to single-stage design, the two-stage design is a mathematically easy problem since it can be done over a finite grid once the parameters are specified. In this paper, the problem is stated and discussed under the design of a single-arm two-stage phase II clinical trial.
The remainder of the paper is organized as follows. Section 4 reviewed the statistical setup of the single-arm two-stage phase II trials and described the scientific question of interest. We reviewed the classical optimal and minimax designs and discussed an issue we are concerned about involved in these two designs, and briefly stated our hunch. Section 5 introduced a numerical analysis to confirm our hunch. Section 6 concluded the paper.
Single-Arm two-stage Phase II clinical trials
Setup
Consider the test
vs.
, where
is the response rate of a historical therapy,
is the response rate of the experimental therapy needs to be tested, and under our context,
for some
as a clinical meaningful difference. We conduct the test by two stages. Denote
with
as the sample size of the test at stage
. Denote
and
under
.
and
are independent.
We consider two types of stopping criteria when the phase II trial needs to be stopped after stage 1. First, we consider the futility (lower) stop only, which means we stop the trial after stage 1 only if we find the treatment is ineffective based on the result from stage 1, in other words, this happens when we find the positive response of stage 1
for a critical value
that requires us to reject the null hypothesis
under specified design parameters. Second, we consider both futility and superiority (upper) stops. The superiority stop means we also stop the trial when
for a critical value
that specifies the enough effectiveness of the treatment, i.e., because of the enough positive responses to the treatment, we can stop and claim the effectiveness of the treatment, and there is no need to conduct stage 2.
Specifically, the framework of finding specific designs of a single-arm two-stage phase II clinical trial is specified as follows. Under futility stop with design parameters
:
- Calculate
,
- Define probability of early termination (PET) under
:
and expected sample size (EN) under
:
- Among designs with
and
:
- Optimal design
minimizes EN
- Minimax design
minmizes
where
is the probability measure under
. The significant level and power can be calculate by
where
for
and
for
,
is the probability mass of
and
is the corresponding CDF,
. The algorithm of finding an optimal or minimax design is easy to implement via greedy searching over possible choices of
. Starting from a
, for
, the probabilities and EN aforementioned can be calculated, and so the corresponding designs can be found.
Under both futility and superiority stops with design parameters
:
- Calculate
- Define probability of early termination under
, and expected sample size (EN) under
,
, and then
- Among designs with
and
:
- Optimal design
minimizes EN
- Minimax design
minmizes
A user-friendly software for finding optimal and minimax designs of two-stage phase II clinical trials can be found at http://www2.cscc.unc.edu/impact7/CTDSystems.
What are we looking for?
Following the notations in Section 4.1, we are interested if there is any pattern(s) among the
values we found from the desired designs given the constrains on level, power, etc. In fact, Jung4 found a phenomenon that
,
and
by a few examples from the output of optimal and minimax trial designs, but has not investigated this in more details. We wanted to provide a formal analysis in the relationship of these numbers that an optimal or minimax design returns, given any specific setting of the design parameters, which we believe can provide some prior knowledge about what we can expect from a specific design, help us to understand more on the difference and similarity of minimax and optimal designs, and look at them from a new perspective.
First, we proved that under some regularity conditions and constraints by phase II clinical trials, and using large sample approximation,
,
, and
for some
, and
is a parameter that can be related at least to the response rates of control and treatment, i.e.,
and
. In other words, we may expect that the rejection criteria, i.e., the critical values
and
of the test are mainly determined by the response rates of control and treatment when the sample size is large. Details can be found in Appendix 7.1 for single-stage design and 7.2 for two-stage designs. Second, we used an extensive numerical study in Section 5 to confirm our hunch when the sample size is finite and small, and based on results from optimal and minimax designs.
Numerical analysis
Setup
We investigated the patterns of
over a fine grid of parameters
. We set
ranged from
with a small increment
, and
, where
. We consider
and
for all of these
and
values. We restricted the maximum of the total sample size
by 55 to reflect the small-sample-size characteristic of phase II trials. Finally, we considered both optimal and minimax designs under two stop criteria, i.e., (1) with futility stop only; (2) with both futility and superiority stops.
Under a parameter setting, we searched the (minimax or optimal) design
if the trial stops at futility stop of stage 1,
if the trial is allowed to stop at both futility and superiority stops of stage 1.
Based on the theoretical results in Appendix 7, we investigated: (1) under futility stop only, the trends of
(and
) for stage 1, and
(and
) for stage 2, i.e. we choose
only in our numerical experiment (based on some previous experience and we would like to confirm how if
is a good candidate for
); (2) under both futility and superiority stops, the trends of
(and
) and
(and
) for stage 1, and
(and
) for stage 2.
Results
For simplicity and because of the similarity of our findings in multiple cases, we only present the results under
, where
, as a representative, under both stop criteria and both designs. The complete results under all parameters we considered can be found in Appendix 8.
Figures 1& 2 show that for different
, the trends of ratio differences we are interested, i.e.,
,
and
. They indicate that overall, these differences are close to 0, and the difference is smaller on stage 2, i.e.,
, under both stopping criteria, which means
is a good approximation to
over the
we considered under this parameter setting. At the same time, we can observe that, the other two differences are also close to 0, but the differences under optimal design are overall closer to 0 than those of minimax design. The results for all other cases in Figure 3 to Figure 12 in Appendix 8.1 and 8.2 are similar to this case. We plotted all trends including differences in ratio and frequency. They also result in the same conclusion.
Figure 1 Trends of
and
under both optimal and minimax designs, when
and there is futility stop only.
Figure 2 Trends of
,
and
under both optimal and minimax designs, when
and there are both futility and superiority stops.
Figure 3 The trends of
over different choices of
, under both optimal and minimax designs, when there is futility stop only.
Table 1 calculates the summary statistics of the trends in Figures 1 &2 over different
(in total, there are 131
’s in each figure). We found from the mean that, these differences are very close to 0 on average, and thus the approximation we use is good. We found from the interquartile range (IQR) and standard deviation (SD) that they are larger for minimax design than optimal design for a same difference, thus the variation of the approximation on optimal design is smaller. Table 2 further provides all numerical results under this setting, i.e.
, including all design outputs, ratios and differences of interest. We present results over
but with increment
compared to
for the figures in order to maintain the concision and information provided by the table.
Concluding remarks
In this paper, we focused on single-arm two-stage phase II clinical trials, which is widely needed in reality. We specifically take the trends of critical values for rejecting null hypothesis at both two stages in a phase II clinical trial into consideration. Using both theoretical derivations under large-sample and numerical studies under finite and small sample, we confirmed that the critical values can be approximated and dominated by the total sample sizes and response rates of different stages. We also found that, this approximation is more obvious in optimal design than that in minimax design under the same design parameter setting. Our finding indicates that although minimax design provides less total sample size needed for a phase II trial, the property of an optimal design is closer to what we expect under large sample, and overall the optimal designs have more stable patterns on their relationships among critical values, sample sizes of both stages and response rates under null and alternative hypotheses.
Appendix: technical proofs
The numerical study in Section 5 confirms our hunch in Section 4.2 under finite (and actually small) sample. In this section, we provide some analytic proofs about the relationship between
and
when the total sample size
is large, using normal approximation, based on some regular assumptions (specified later).
Large sample approximation of single-stage phase II trials
In a single-arm single-stage trial, given a large
, we can find the approximate value of
using large sample approximation. We assume that there exist
and
such that
, which means the response rates satisfy the “positivity”, i.e., bounded away from 0 and 1. In reality, this means we assume that a treatment cannot be effective or ineffective to all subjects in the target population. Next, denote
, then under
:
(7.1.1)
as
, where
,
. Denote
by
. Given
,
and
satisfy
and
. So, under a large sample,
(7.1.2)
and
(7.1.3)
where
is the
-quantile of the standard normal distribution. From (7.1.2) and (7.1.3), we know
or
By plugging this in (7.1.2) or (7.1.3), we have
(7.1.4)
with
.
Note that both
and
are finite, and usually, in phase II clinical trials, we choose
so that
. Hence, if
, the weights in (7.1.4) are closer to
.
Large sample approximation of two-stage phase II trials
In the case of two-stage trials, we investigated in Section 5 by numerical studies under finite and small sample. Now, we provide an analytic proof assuming some regularity conditions about under large sample. Following notations in Section 4.1, note that under
:
(7.2.1)
as
, where
.
Under futility stop only
Under a large
and assume
, a constant, as
, the condition
is equivalent to
,
where
is the density of
, and
is the survival function of
under
. Note that
where
is the CDF of a standard normal distribution. Let
,
Note that there exists an
such that
since the lower limit of the above integral
.
Then,
(7.2.2)
Therefore, by (7.2.1) and (7.2.2), we know
(7.2.3)
where
is the
-quantile of the standard normal distribution. Thus, if we choose
(achievable usually since
should be small in practice), then
is finite, and thus
as
.
Next, we show the relationship among
and
. Let
and by independence,
under
. Thus, (7.1.1) holds for
. We know
and
.
One the one hand,
, and let
On the other hand,
and let
We note that there exist an
such that
for both
, thus the right hand side of the above two probabilities are conservatively equal to
under large sample. Now,
Thus,
(7.2.4)
with
. By choosing
, we have
and
, so both
and
are finite. Whether the weights in (7.2.4) are closer to 1/2 is a more complicated problem here than that in Section 7.1.
Under both futility and superiority stops
Following the same notations and assumptions made in Section 7.2.1, note that similar to the thoughts in Section 7.2.1, condition
is equivalent to
where
Thus, by choosing
, we can similarly show that
(7.2.5)
Thus,
.
Second, we note that
. Hence, the condition
is equivalent to
, thus
where
Then, by choosing
, we know that
(7.2.6)
Thus,
.
Finally, denote
, and we have that
,
and
. Let
and
where
is the CDF of the standard normal distribution, and thus choosing
and
, similar to the proof in Section 7.2.1, we have
(7.2.7)
where
.
Appendix: complete results of the numerical study
We provide figures of two types of trends. That is, trends in frequency (Figures 3 & 4) for investigating the trends of and , and trends in difference of ratio (Figures 5 & 6) for investigating the trends of
and
.
Figure 4 The trends of
over different choices of
, under both optimal and minimax designs, when there is futility stop only.
Figure 5 The trends of
(ratio) over different choices of
, under both optimal and minimax designs, when there is futility stop only.
Figure 6 The trends of
(ratio) over different choices of
, under both optimal and minimax designs, when there is futility stop only.
Table 3– Table 8 present all numerical results, including the returns of designs, ratios, and differences of interest under all parameter settings in this numerical study. Specifically, we only present under
with an increment
compared to
in the figures, for maintaining the information and concision provided by the tables simultaneously. Each table represents a case of
.
Results under both futility and superiority stops
Same as Section 8.1, we provide figures of trends in frequency (Figure 7-Figure 9) for investigating the trends of
,
and
, and trends in difference of ratio (Figure 10-Figure 12) for investigating the trends of
,
and
.
Figure 7 The trends of
over different choices of
, under both optimal and minimax designs, when there are both futility and superiority stops.
Figure 8 The trends of
over different choices of
, under both optimal and minimax designs, when there are both futility and superiority stops.
Figure 9 The trends of
over different choices of
, under both optimal and minimax designs, when there are both futility and superiority stops.
Figure 10 The trends of
(ratio) over different choices of
, under both optimal and minimax designs, when there are both futility and superiority stops.
Figure 11 The trends of
over different choices of
, under both optimal and minimax designs, when there are both futility and superiority stops.
Figure 12 The trends of
(ratio) over different choices of
, under both optimal and minimax designs, when there are both futility and superiority stops.
Table 9-Table 14 present all numerical results, including the returns of designs, ratios, and differences of interest under all parameter settings in this numerical study. Specifically, we only present under
with an increment
compared to
in the figures, for maintaining the information and concision provided by the tables simultaneously. Each table represents a case of
.
Acknowledgments
Conflicts of interest
The authors declared that there are no conflicts of interest.
References
©2022 Yi, et al. This is an open access article distributed under the terms of the,
which
permits unrestricted use, distribution, and build upon your work non-commercially.