On estimating flexible weibull parameters with type I progressive interval censoring with random removal using data of cancerous tumors in blood

doi:10.15406/bbij.2016.04.00108

eISSN: 2378-315X

Biometrics & Biostatistics International Journal

Research Article Volume 4 Issue 5

Afify WM

Department of Head of Statistics, Mathematics & Insurance, Kafr El-sheikh University, Egypt

Correspondence: Afify WM, Department of Head of Statistics, Mathematics & Insurance, Kafr El-sheikh University, Faculty of Commerce, Egypt

Received: September 09, 2016 | Published: October 14, 2016

Citation: Afify WM. On estimating flexible weibull parameters with type I progressive interval censoring with random removal using data of cancerous tumors in blood. Biom Biostat Int J. 2016;4(5):208-216. DOI: 10.15406/bbij.2016.04.00108

Abstract

In this paper, the maximum likelihood and Bayes estimators of the two unknown parameters of the flexible Weibull distribution are obtained under a progressive type-I interval censoring scheme with binomial random removal. Point estimates and confidence intervals based on maximum likelihood and the bootstrap method are also proposed. A Bayesian approach is developed that uses the Markov chain Monte Carlo (MCMC) method to generate samples from the posterior distributions and, in turn, to compute the Bayes estimators. The proposed methods are illustrated with an example based on real data. Finally, the two approaches are compared through a simulation study of the maximum likelihood estimators with the bootstrap method and the different Bayes estimators obtained by MCMC.

Keywords: flexible weibull distribution, progressive interval type-I censoring, random removal, percentile bootstrap, bayesian and non-bayesian approach, markov chain monte carlo (MCMC)

Introduction

Censoring has been very common in life tests over the past several decades, since the experimenter may be unable to obtain complete information on the failure times of all experimental items. For this reason, Aggarwala¹ suggested a useful type of censoring, namely progressive Type I interval censoring, which is a union of Type I interval censoring and progressive censoring. This method of lifetime data collection can be useful to a biological experimenter, particularly when the experimental units are humans, as continuous monitoring is often not possible to implement and withdrawal rates from such studies may be high.

In progressive censoring, the number of units removed from the test at each stage may be random. For example, the number of patients who drop out of a clinical test at each stage is random and cannot be predetermined. This motivates a more general censoring scheme, called progressive Type I interval censoring with random removal. It can be described as follows: suppose $n$ units are put on life test at time $T_{0} = 0$ and are inspected at $m$ pre-specified times $T_{1} < T_{2} < ... < T_{m}$, where $T_{m}$ is the scheduled time to terminate the experiment. The number, $k_{i}$, of failures within $(T_{i - 1}, T_{i}]$ is recorded and $r_{i}$ surviving items are randomly removed from the life test at the $i$th inspection time, $T_{i}$, for $i = 1, 2, ..., m$. Since the number, $Y_{i}$, of surviving items is a random variable and the number of items withdrawn cannot be greater than $Y_{i}$ at time $T_{i}$, the $r_{i}$'s are random. Such a censoring mechanism is termed progressive interval type-I censoring with random removal. If we assume that the probability of removal at every stage is $π$ for each unit, then $r_{i}$ can be considered to follow a binomial distribution, i.e., $r_{i} \sim B (n - m - \sum_{j = 0}^{i - 1} r_{j}, π)$ for $i = 1, 2, ..., m$, with $r_{0} = 0$. The main difference between progressive interval type I censoring with fixed removal and progressive interval type I censoring with random removals is that the removals are predetermined in the former case while they are random in the latter case. Note that $m$ is pre-determined in both cases. However, many practical applications suggest that it is more flexible to let the removals be random, to accommodate the unexpected drop-out of experimental subjects.

Although progressive censoring occurs frequently in many applications, there are relatively few works on it. Some early works can be found in Cohen,² and readers can refer to the book by Balakrishnan & Aggarwala³ for more details on the methods and applications of this topic. However, all these works assumed that the number of units removed from the test is fixed in advance. In practice, it is often impossible to pre-determine the removal pattern. Thus, Yuen & Tse⁴ and Yang et al.⁵ considered the estimation problem when lifetimes are collected under Type II progressive censoring with random removals, and Kendell & Anderson⁶ considered the expected duration under grouped data. Progressive type-I interval censored sampling is an important practical problem that has received considerable attention in the past several years. Based on progressive type-I interval censored sampling, Ashour & Afify⁷ derived the maximum likelihood estimators of the parameters of the exponentiated Weibull family and their asymptotic variances under random removal. Lin et al.⁸ determined optimally spaced inspection times for the log-normal distribution, while Ng & Wang⁹ and Chen & Lio¹⁰ compared three classical estimation methods (maximum likelihood, the method of moments and the probability plot method) for the Weibull and generalized exponential distributions, respectively.

In the Bayesian approach, the integrals over the posterior distribution are usually impossible to evaluate analytically. The MCMC methodology provides a convenient and efficient way to sample from complex, high-dimensional statistical distributions. Recently, the application of the MCMC method to the estimation of parameters or other properties of statistical models has become very common. Green et al.¹¹ used the MCMC method to estimate the three-parameter Weibull distribution and showed that the MCMC method is better than the ML method when a proper prior distribution of the parameters is given. As a generalization of the two-parameter Weibull model, Gupta et al.¹² gave a complete Bayesian analysis of the Weibull extension model using MCMC simulation and a complete sample. Lin & Lio¹³ discussed Bayesian inference under progressive type I interval censoring using MCMC.

A random variable $X$ is said to have a flexible Weibull distribution with parameters $λ, β > 0$ if its probability density function, cumulative distribution function and survival function are given, for $x > 0$, by

$f (x; λ, β) = (λ + \frac{β}{x^{2}}) e^{λ x - \frac{β}{x}} e^{- e^{λ x - \frac{β}{x}}}$ (1)

$F (x; λ, β) = 1 - e^{- e^{λ x - \frac{β}{x}}}$ (2)

$\bar{F} (x; λ, β) = e^{- e^{λ x - \frac{β}{x}}}$ (3)

respectively. The hazard function is the ratio $f / \bar{F} = (λ + \frac{β}{x^{2}}) e^{λ x - \frac{β}{x}}$.
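As a quick numerical illustration of (1)-(3), the following minimal sketch evaluates the density, distribution, survival and hazard functions in plain Python. The function names (`fw_u`, `fw_pdf`, `fw_cdf`, `fw_sf`, `fw_hazard`) are hypothetical helpers introduced here, and the hazard is taken as the ratio of density to survival function.

```python
import math

def fw_u(x, lam, beta):
    """Inner term u(x) = exp(lam*x - beta/x), defined for x > 0."""
    return math.exp(lam * x - beta / x)

def fw_pdf(x, lam, beta):
    """Density (1): (lam + beta/x^2) * u(x) * exp(-u(x))."""
    u = fw_u(x, lam, beta)
    return (lam + beta / x**2) * u * math.exp(-u)

def fw_cdf(x, lam, beta):
    """Distribution function (2): 1 - exp(-u(x)), computed with expm1 for accuracy."""
    return -math.expm1(-fw_u(x, lam, beta))

def fw_sf(x, lam, beta):
    """Survival function (3): exp(-u(x))."""
    return math.exp(-fw_u(x, lam, beta))

def fw_hazard(x, lam, beta):
    """Hazard as density over survival: (lam + beta/x^2) * u(x)."""
    return (lam + beta / x**2) * fw_u(x, lam, beta)
```

A simple sanity check is that `fw_cdf` and `fw_sf` sum to one and that a finite-difference derivative of `fw_cdf` agrees with `fw_pdf`.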

In this paper we consider Bayesian inference for the parameters under progressive interval type-I censored data when both parameters are unknown. We assume that the parameters $λ$ and $β$ have independent gamma priors. As expected, the Bayes estimates cannot be obtained in closed form in this case. We propose to generate MCMC samples using Gibbs sampling with Metropolis–Hastings steps, and then to obtain the Bayes estimates of the unknown parameters from these samples. We perform simulation experiments to examine the behavior of the proposed Bayes estimators and to compare their performance with that of the maximum likelihood estimators.

The rest of the paper is organized as follows. In the next section, the ML estimators of the unknown parameters and approximate confidence intervals are presented. The corresponding parametric bootstrap confidence intervals for the parameters are given in Section 3. In Section 4, we cover Bayes estimates and construction of credible intervals using the MCMC techniques. In Section 5, for illustrative purposes, we performed a real data analysis. Comparisons among estimators are investigated through Monte Carlo simulations in Section 6. Finally, conclusions appear in Section 7.

Classical estimation and percentile bootstrap algorithm (Boot-p)

Classical estimation (maximum likelihood estimators) of the unknown parameters and approximate confidence intervals are presented. Also, the corresponding parametric bootstrap confidence intervals using percentile bootstrap Algorithm (Boot-p) for the parameters are given in this section.

Classical estimation

Suppose a progressively Type-I interval censored sample is collected as described above, beginning with a random sample of $n$ units with a continuous lifetime distribution $F (x)$, and let $k_{1}, k_{2}, ..., k_{m}$ denote the number of units known to have failed in the intervals $(0, T_{1}], (T_{1}, T_{2}], ..., (T_{m - 1}, T_{m}]$, respectively. Then, based on these observed data, the likelihood function is (Aggarwala¹)

$L_{1} (X; λ, β \mid R) = C \prod_{i = 1}^{m} {[F (T_{i}; λ, β) - F (T_{i - 1}; λ, β)]}^{k_{i}} {[1 - F (T_{i}; λ, β)]}^{r_{i}}$ (4)

where $C$ is a constant. Clearly, if $r_{i} = 0$ for $i = 1, 2, ..., m - 1$ and $r_{m} = n - k$, equation (4) reduces to the likelihood function for interval type I censored data:

$L (X; λ, β) = C {(1 - F (T_{m}; λ, β))}^{n - k} \prod_{i = 1}^{m} {(F (T_{i}; λ, β) - F (T_{i - 1}; λ, β))}^{k_{i}}$

where $k = \sum_{i = 1}^{m} k_{i}$ is the total number of observed failures and $k_{1}, k_{2}, ..., k_{m}$ are the numbers of units known to have failed in the intervals $(0, T_{1}], (T_{1}, T_{2}], ..., (T_{m - 1}, T_{m}]$, respectively.

For type I progressive interval censoring, suppose that $r_{i}$ is independent of $X_{i}$ for all $i$. Wu & Chang¹⁴ suggested the following likelihood function for progressive interval censoring with binomial removals:

$L (X, R; λ, β) = L_{1} (X; λ, β \mid R = r) \times P (R)$ (5)

where $L_{1} (X; λ, β \mid R = r)$ is the likelihood function for progressive type I interval censoring with fixed removal (4) and $P (R)$ is

$P (R) = P (R_{m - 1} = r_{m - 1} \mid R_{m - 2} = r_{m - 2}, ..., R_{1} = r_{1}) P (R_{m - 2} = r_{m - 2} \mid R_{m - 3} = r_{m - 3}, ..., R_{1} = r_{1}) \cdots P (R_{2} = r_{2} \mid R_{1} = r_{1}) P (R_{1} = r_{1})$

that is,

$P (R) = \frac{(n - m)!}{\prod_{i = 1}^{m} r_{i}! (n - m - \sum_{j = 1}^{m - 1} r_{j})!} π^{\sum_{j = 1}^{m - 1} r_{j}} {(1 - π)}^{(m - 1) (n - m) - \sum_{j = 1}^{m - 1} (m - j) r_{j}}$ (6)

and $f (.), F (.)$ are the same as defined before in (1) and (2) respectively. The log likelihood function with random removal can be written as

$\begin{array}{l} \log L (X, R; λ, β) = \log C + \sum_{i = 1}^{m} k_{i} \log [F (T_{i}) - F (T_{i - 1})] + \sum_{i = 1}^{m} r_{i} \log [1 - F (T_{i})] + \\ \sum_{j = 1}^{m - 1} r_{j} \log π + [(m - 1) (n - m) - \sum_{j = 1}^{m - 1} (m - j) r_{j}] \log (1 - π) \end{array}$ (7)
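The log-likelihood can be evaluated directly once the inspection times, failure counts and removals are known. The sketch below is one possible implementation under stated assumptions: the additive constant $\log C$ is dropped, the exponent of $(1 - π)$ is taken to be the whole quantity $(m - 1)(n - m) - \sum_{j = 1}^{m - 1} (m - j) r_{j}$ as in (6), and the survival function at $T_{0} = 0$ is set to 1. The function name `log_likelihood` is hypothetical.

```python
import math

def log_likelihood(lam, beta, pi, T, k, r, n):
    """Log-likelihood with random removal, up to the constant log C.

    T: inspection times T_1..T_m (all > 0)
    k: failures per interval, r: removals per inspection time
    n: initial sample size
    """
    m = len(T)
    surv_prev = 1.0          # survival at T_0 = 0 (limit of the survival function)
    ll = 0.0
    for Ti, ki, ri in zip(T, k, r):
        u = math.exp(lam * Ti - beta / Ti)
        surv = math.exp(-u)
        # k_i * log[F(T_i) - F(T_{i-1})] + r_i * log[1 - F(T_i)]
        ll += ki * math.log(surv_prev - surv) - ri * u
        surv_prev = surv
    # Binomial removal part, following equation (6)
    sum_r = sum(r[:m - 1])
    expo = (m - 1) * (n - m) - sum((m - j) * r[j - 1] for j in range(1, m))
    ll += sum_r * math.log(pi) + expo * math.log(1.0 - pi)
    return ll
```

Because the removal part depends only on $π$, changing $π$ while holding $λ$ and $β$ fixed shifts the value by exactly the difference of the binomial terms.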

The maximum likelihood estimates of $λ$ and $β$ are the simultaneous solutions of the following likelihood equations:

$\sum_{i = 1}^{m} k_{i} \frac{\frac{\partial F (T_{i})}{\partial λ} - \frac{\partial F (T_{i - 1})}{\partial λ}}{F (T_{i}) - F (T_{i - 1})} + \sum_{i = 1}^{m} r_{i} \frac{\frac{\partial}{\partial λ} [1 - F (T_{i})]}{[1 - F (T_{i})]} = 0$ (8)

$\sum_{i = 1}^{m} k_{i} \frac{\frac{\partial F (T_{i})}{\partial β} - \frac{\partial F (T_{i - 1})}{\partial β}}{F (T_{i}) - F (T_{i - 1})} + \sum_{i = 1}^{m} r_{i} \frac{\frac{\partial}{\partial β} [1 - F (T_{i})]}{[1 - F (T_{i})]} = 0$ (9)

Note that $P (R)$ does not involve the parameters $λ$ and $β$. Therefore, the MLE $\hat{π}$ of $π$ can be found by maximizing $P (R)$ directly, that is, by solving

$\frac{1}{\hat{π}} \sum_{j = 1}^{m - 1} r_{j} - \frac{1}{1 - \hat{π}} ((m - 1) (n - m) - \sum_{j = 1}^{m - 1} (m - j) r_{j}) = 0$

Therefore, the maximum likelihood estimate of $π$ is given by

$\hat{π} = \frac{\sum_{j = 1}^{m - 1} r_{j}}{(m - 1) (n - m) - \sum_{j = 1}^{m - 1} (m - j - 1) r_{j}}$ (10)
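Since (10) is available in closed form, it can be computed in a few lines. The sketch below is illustrative only; the function name `pi_mle` and the toy removal pattern in the usage note are assumptions and are not taken from the paper's data.

```python
def pi_mle(r, n):
    """Closed-form MLE of the removal probability, following equation (10).

    r: removals r_1..r_m at the m inspection times
    n: initial sample size
    """
    m = len(r)
    sum_r = sum(r[:m - 1])
    denom = (m - 1) * (n - m) - sum((m - j - 1) * r[j - 1] for j in range(1, m))
    return sum_r / denom
```

For a hypothetical pattern `r = [2, 1, 0]` with `n = 12`, the numerator is 3 and the denominator is 16, and the resulting value also satisfies the score equation for $π$ given above.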

It may be noted that (8) and (9) cannot be solved simultaneously to provide a closed form for the estimators. Therefore, we propose to use a fixed point iteration method for solving these equations. The Fisher information matrix $I (\hat{λ}, \hat{β}, \hat{π})$ given in the Appendix and the asymptotic normality of the maximum likelihood estimators can be used to compute approximate confidence intervals (ACI) for the parameters $λ$, $β$ and $π$. The $(1 - γ) 100 \%$ confidence intervals for $λ$, $β$ and $π$ are

$\hat{λ} \pm Z_{γ / 2} \sqrt{\mathrm{Var} (\hat{λ})}$ , $\hat{β} \pm Z_{γ / 2} \sqrt{\mathrm{Var} (\hat{β})}$ and $\hat{π} \pm Z_{γ / 2} \sqrt{\mathrm{Var} (\hat{π})}$

where $Z_{γ / 2}$ is the percentile of the standard normal distribution with right-tail probability $γ / 2$.
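The interval above is a standard normal-approximation (Wald) interval. A minimal sketch of the computation, assuming the estimate and its estimated variance are already available, could look as follows; the function name `wald_interval` is hypothetical.

```python
import math
from statistics import NormalDist

def wald_interval(estimate, variance, gamma=0.05):
    """Approximate (1 - gamma)100% interval: estimate +/- Z_{gamma/2} * sqrt(variance)."""
    z = NormalDist().inv_cdf(1.0 - gamma / 2.0)   # upper gamma/2 percentile
    half_width = z * math.sqrt(variance)
    return estimate - half_width, estimate + half_width
```

Such an interval is not constrained to the parameter space, so for a positive parameter or a probability the lower limit can fall below zero when the variance is large relative to the estimate.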

Data algorithm

The data generation is based on the algorithm proposed by Aggarwala¹ to simulate the numbers, $k_{i}$, of failed items in each subinterval $(T_{i - 1}, T_{i}], i = 1, \dots, m,$ from an initial sample of size $n$ put on life test at time 0. This algorithm, which is an extension of the procedure developed by Kemp & Kemp¹⁵ for the multinomial distribution, involves generating $m$ binomial random variables. A procedure to generate progressively type I interval censored data with random removal, $(k_{i}, r_{i}, T_{i}), i = 1, \dots, m,$ from the flexible Weibull distribution can be described briefly as follows. Let $k_{0} = 0$ and $r_{0} = 0$, and for $i = 1, \dots, m$:

Step 1 Set $i = 0$ and let $\text{ksum} = \text{rsum} = 0$.

Step 2 Set $i = i + 1$.

Using the initial value of $π$, generate a sample $R = (r_{1}, \dots, r_{m})$ from the binomial distribution, where $r_{1}$ follows the binomial $(n - m, π)$ distribution and the variables $r_{i} \mid r_{1}, r_{2}, \dots, r_{i - 1}$ follow the binomial $(n - m - \sum_{j = 1}^{i - 1} r_{j}, π)$ distribution for $i = 2, 3, \dots, m - 1$.

Set $r_{m} = \begin{cases} n - m - \sum_{j = 1}^{m - 1} r_{j} & \text{if } n - m - \sum_{j = 1}^{m - 1} r_{j} > 0 \\ 0 & \text{otherwise} \end{cases}$

Generate $k_{i}$ as a binomial random variable with parameters $n - \text{ksum} - \text{rsum}$ and $p = \frac{F (T_{i}) - F (T_{i - 1})}{1 - F (T_{i - 1})} = (e^{- e^{λ T_{i - 1} - \frac{β}{T_{i - 1}}}} - e^{- e^{λ T_{i} - \frac{β}{T_{i}}}}) / e^{- e^{λ T_{i - 1} - \frac{β}{T_{i - 1}}}}$, where $F (T_{0}) = 0$.

Step 3 Set $\text{ksum} = \text{ksum} + k_{i}$ and $\text{rsum} = \text{rsum} + r_{i}$.

Step 4 If $i < m$, go to Step 2; otherwise, stop.
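The steps above can be sketched in plain Python as follows. This is an illustrative variant rather than the paper's exact procedure, with three assumptions made explicit: failures in an interval are drawn before the removals at its end point, each binomial removal draw is capped at the number of survivors still on test, and all remaining survivors are removed at the final inspection time. The names `rbinom`, `survival` and `generate_sample`, and the parameter values in the usage note, are hypothetical.

```python
import math
import random

def rbinom(rng, size, prob):
    """Binomial draw as a sum of Bernoulli trials (adequate for modest sizes)."""
    return sum(1 for _ in range(size) if rng.random() < prob)

def survival(t, lam, beta):
    """Flexible Weibull survival function; equals 1 at t = 0 by continuity."""
    if t <= 0:
        return 1.0
    return math.exp(-math.exp(lam * t - beta / t))

def generate_sample(n, T, lam, beta, pi, seed=None):
    """Return (k, r): failures per interval and removals per inspection time."""
    rng = random.Random(seed)
    m = len(T)
    k, r = [], []
    at_risk = n
    budget = max(n - m, 0)        # n - m - sum of earlier removals
    t_prev = 0.0
    for i, t in enumerate(T, start=1):
        s_prev = survival(t_prev, lam, beta)
        s_cur = survival(t, lam, beta)
        # Conditional failure probability in (T_{i-1}, T_i]
        p = (s_prev - s_cur) / s_prev if s_prev > 0 else 1.0
        k_i = rbinom(rng, at_risk, p)
        survivors = at_risk - k_i
        if i < m:
            # Assumption: cap the binomial draw at the survivors on test
            r_i = min(rbinom(rng, budget, pi), survivors)
        else:
            # Assumption: everything still on test is removed at T_m
            r_i = survivors
        budget -= min(r_i, budget)
        at_risk = survivors - r_i
        k.append(k_i)
        r.append(r_i)
        t_prev = t
    return k, r
```

For example, `generate_sample(228, [16, 31, 46, 61, 76, 91, 106, 121, 136, 151, 166, 181], 0.01, 20.0, 0.05, seed=1)` returns two lists of length 12 whose entries add up to the initial sample size; the parameter values in this call are arbitrary.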

Percentile bootstrap algorithm (Boot-p)

A parametric bootstrap interval provides more information about the population value than a point estimate does. We propose to use confidence intervals based on the parametric bootstrap, using the percentile bootstrap algorithm (Boot-p) based on the idea of Efron.¹⁶

The algorithm for estimating the confidence intervals is illustrated as follows:

Before progressing further, we first describe how we generate progressively interval Type I censored data with binomial random removals. The following algorithm is followed to obtain these samples.

Step 1 Specify the values of $n$, $m$ and $T$.
Step 2 Specify the values of $λ$, $β$ and $π$.
Step 3 Using the data algorithm, generate a sample and compute the maximum likelihood estimates $\hat{λ}$, $\hat{β}$ and $\hat{π}$ by solving the likelihood equations (8), (9) and (10).
Step 4 Use $\hat{λ}$, $\hat{β}$ and $\hat{π}$ to generate a bootstrap sample $k^{*}$ with the same values of $r_{i}$ $(i = 1, 2, …, m)$ and $m$, using the algorithm presented in Balakrishnan & Sandhu.¹⁷
Step 5 As in Step 3, based on $k^{*}$, compute the bootstrap estimates of $λ$, $β$ and $π$, say $\hat{λ}^{*}$, $\hat{β}^{*}$ and $\hat{π}^{*}$.
Step 6 Repeat Steps 4-5 $B$ times to obtain $B$ bootstrap maximum likelihood estimates of $λ$, $β$ and $π$ based on $B$ different bootstrap samples.
Step 7 Arrange all the $\hat{λ}^{*}$'s, $\hat{β}^{*}$'s and $\hat{π}^{*}$'s in ascending order to obtain the bootstrap sample $(φ_{l}^{[1]}, φ_{l}^{[2]}, \dots, φ_{l}^{[B]}), l = 1, 2, 3$ (where $φ_{1} \equiv \hat{λ}^{*}$, $φ_{2} \equiv \hat{β}^{*}$ and $φ_{3} \equiv \hat{π}^{*}$).

Let $G (z) = P (φ_{l} \leq z)$ be the cumulative distribution function of $φ_{l}$. Define $φ_{l, \text{boot}} (z) = G^{- 1} (z)$ for given $z$. The approximate bootstrap $100 (1 - 2 γ) \%$ confidence interval (ABCI) of $φ_{l}$ is given by $[φ_{l, \text{boot}} (γ), φ_{l, \text{boot}} (1 - γ)]$.
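The final percentile step can be sketched as below, assuming the $B$ bootstrap estimates of one parameter are already collected in a list. The empirical quantile is taken as an order statistic, which is one common convention among several; the function name `percentile_interval` is hypothetical.

```python
import math

def percentile_interval(boot_estimates, gamma):
    """Percentile bootstrap 100(1 - 2*gamma)% interval from B bootstrap estimates."""
    ordered = sorted(boot_estimates)
    B = len(ordered)
    # Order-statistic positions for the gamma and (1 - gamma) empirical quantiles;
    # rounding guards against floating-point noise such as 94.99999999999999.
    lo_idx = max(math.ceil(round(B * gamma, 9)) - 1, 0)
    hi_idx = min(math.ceil(round(B * (1.0 - gamma), 9)) - 1, B - 1)
    return ordered[lo_idx], ordered[hi_idx]
```

The same routine can be applied separately to the bootstrap estimates of $λ$, $β$ and $π$.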

Bayesian estimation and MCMC technique

In this section, we focus on a Bayesian approach that uses the Markov chain Monte Carlo (MCMC) method to generate samples from the posterior distributions and, in turn, to compute the Bayes estimators.

Bayesian estimation

In Bayesian scenario, we need to assume the prior distribution of the unknown model parameters to take into account uncertainty of the parameters. The informative prior densities for $λ$ and $β$ are given as

$g_{1} (λ) \propto λ^{b - 1} e^{- λ a}, a, b, λ > 0$ ,

$g_{2} (β) \propto β^{d - 1} e^{- β c}, c, d, β > 0$ ,

and $π$ has the beta prior

$g_{3} (π) \propto π^{A - 1} {(1 - π)}^{B - 1}, 0 < π < 1; A, B > 0$

The parameters $λ$, $β$ and $π$ are assumed to be independent random variables. The joint informative prior probability density function of $λ$, $β$ and $π$ is

$g (λ, β, π) \propto g_{1} (λ) \times g_{2} (β) \times g_{3} (π)$

$g (λ, β, π) \propto λ^{b - 1} e^{- λ a} β^{d - 1} e^{- β c} π^{A - 1} {(1 - π)}^{B - 1}$ (11)

where $a, b, c, d, A$ and $B$ are assumed to be known and are chosen to reflect prior knowledge about $λ$, $β$ and $π$.

Note that when $a = b = c = d = A = B = 0$ (we call it prior 0), the priors reduce to non-informative priors for $λ$, $β$ and $π$, respectively.

It follows from (4), (6) and (11) that the joint posterior density function of $λ$ , $β$ and $π$ given x is thus

$π^{*} (λ, β, π) = \frac{L_{1} (X; λ, β \mid R = r) \times P (R) \times g (λ, β, π)}{\int_{0}^{\infty} \int_{0}^{\infty} \int_{0}^{1} L_{1} (X; λ, β \mid R = r) \times P (R) g (λ, β, π) d π d β d λ}$

$π^{*} (λ, β, π) \propto \left[ \prod_{i = 1}^{m} {[1 - \frac{e^{- u (T_{i})}}{e^{- u (T_{i - 1})}}]}^{k_{i}} e^{- \{k_{i} u (T_{i - 1}) + r_{i} u (T_{i})\}} \right] π^{\sum_{j = 1}^{m - 1} r_{j}} {(1 - π)}^{(m - 1) (n - m) - \sum_{j = 1}^{m - 1} (m - j) r_{j}} \times λ^{b - 1} e^{- λ a} β^{d - 1} e^{- β c} π^{A - 1} {(1 - π)}^{B - 1}$ (12)

where $u (T_{i}) = e^{λ T_{i} - \frac{β}{T_{i}}}$ and $u (T_{i - 1}) = e^{λ T_{i - 1} - \frac{β}{T_{i - 1}}}$, with $u (T_{0}) = 0$.

It is not possible to evaluate the joint posterior (12) analytically, because its normalizing integrals are usually intractable, and numerical methods may fail. The MCMC method provides an alternative for parameter estimation. In the following subsections, we propose using the MCMC technique to obtain Bayes estimates of the unknown parameters and to construct the corresponding credible intervals.

MCMC technique

Computer simulation of Markov chains in the parameter space relies on Markov chain Monte Carlo (MCMC) methods (Gilks et al.¹⁸). The Markov chains are defined in such a way that the posterior distribution in the given statistical inference problem is their asymptotic distribution. However, the posterior usually does not have a closed form for given progressively type-I interval-censored data, and numerical integration cannot easily be applied in this situation. Several standard approaches for constructing such Markov chains exist, including Gibbs sampling, Metropolis-Hastings (M-H) and reversible jump. The M-H algorithm is a very general MCMC method first developed by Metropolis et al.¹⁹ and later extended by Hastings.²⁰ These algorithms make posterior simulation possible in essentially any problem that allows pointwise evaluation of the prior distribution and likelihood function. The M-H algorithm can be used to obtain random samples from any arbitrarily complicated target distribution of any dimension that is known up to a normalizing constant. In fact, the Gibbs sampler is just a special case of the M-H algorithm.

To use the MCMC method for estimating the parameters of the flexible Weibull distribution and the random removal, namely $λ$, $β$ and $π$, we consider independent priors as in (11). The full conditional distribution for any parameter can be obtained, to within a constant, by factoring out from the likelihood function $L (X, R; λ, β)$ any terms containing the relevant parameter and multiplying by its prior. The full posterior conditional distribution for $λ$ is proportional to

$π^{*} (λ \mid x, β, π) \propto \prod_{i = 1}^{m} {[1 - \frac{e^{- u (T_{i})}}{e^{- u (T_{i - 1})}}]}^{k_{i}} e^{- \{k_{i} u (T_{i - 1}) + r_{i} u (T_{i})\}} \times λ^{b - 1} e^{- λ a}$ (12)

Also, the full posterior conditional distribution for $β$ is proportional to

$π^{*} (β \mid x, λ, π) \propto \prod_{i = 1}^{m} {[1 - \frac{e^{- u (T_{i})}}{e^{- u (T_{i - 1})}}]}^{k_{i}} e^{- \{k_{i} u (T_{i - 1}) + r_{i} u (T_{i})\}} \times β^{d - 1} e^{- β c}$ (13)

Similarly, the full conditional posterior density of $π$ is proportional to

$π^{*} (π \mid x, λ, β) \propto π^{A + \sum_{j = 1}^{m - 1} r_{j} - 1} {(1 - π)}^{B + (m - 1) (n - m) - \sum_{j = 1}^{m - 1} (m - j) r_{j} - 1}$ (14)

It is noted that the posterior distribution of $π$ is beta with parameters $A^{*}$ and $B^{*}$, where $A^{*} = A + \sum_{j = 1}^{m - 1} r_{j}$ and $B^{*} = B + (m - 1) (n - m) - \sum_{j = 1}^{m - 1} (m - j) r_{j}$, and therefore samples of $π$ can be easily generated using any beta generating routine. However, the conditional posterior distributions of $λ$ and $β$ in equations (12) and (13), respectively, cannot be reduced analytically to well-known distributions, and therefore it is not possible to sample from them directly by standard methods. Plots of these densities show that they are similar to the normal distribution, so to generate random numbers from them we use the M-H method with a normal proposal distribution.

MCMC process

Now, we propose the following scheme to generate $λ$ , $β$ and $π$ from density functions and in turn obtain the Bayes estimates and the corresponding credible intervals.

Step 1 Start with $λ^{(0)} = \hat{λ}$, $β^{(0)} = \hat{β}$ and $M =$ burn-in.
Step 2 Set $t = 1$.
Step 3 Generate $π^{(t)}$ from the beta distribution $π^{*} (π \mid x, λ, β)$.
Step 4 Using the M-H algorithm (Metropolis et al.¹⁹), generate $λ^{(t)}$ from $π^{*} (λ \mid x, β, π)$ with the $N (λ^{(t - 1)}, σ_{λ}^{2})$ proposal distribution, where $σ_{λ}^{2}$ is the variance of $λ$ obtained from the variance-covariance matrix; similarly, generate $β^{(t)}$ from $π^{*} (β \mid x, λ, π)$ with the $N (β^{(t - 1)}, σ_{β}^{2})$ proposal distribution, where $σ_{β}^{2}$ is the variance of $β$ obtained from the variance-covariance matrix.
Step 5 Compute $λ^{(t)}$, $β^{(t)}$ and $π^{(t)}$.
Step 6 Set $t = t + 1$.
Step 7 Repeat Steps 3-6 $N$ times.
Step 8 Obtain the Bayes estimates of $λ$, $β$ and $π$ with respect to the squared error loss function as $\hat{E} (λ \mid x) = \frac{1}{N - M} \sum_{i = M + 1}^{N} λ_{i}$, $\hat{E} (β \mid x) = \frac{1}{N - M} \sum_{i = M + 1}^{N} β_{i}$ and $\hat{E} (π \mid x) = \frac{1}{N - M} \sum_{i = M + 1}^{N} π_{i}$.
Step 9 To compute the credible intervals of $λ$, $β$ and $π$, order $λ_{1}, ..., λ_{N - M}$, $β_{1}, ..., β_{N - M}$ and $π_{1}, ..., π_{N - M}$ as $λ_{(1)} < ... < λ_{(N - M)}$, $β_{(1)} < ... < β_{(N - M)}$ and $π_{(1)} < ... < π_{(N - M)}$. Then the $100 (1 - γ) \%$ symmetric credible intervals (SCI) of $λ$, $β$ and $π$ become:

$[λ_{((N - M) γ / 2)}, λ_{((N - M) (1 - γ / 2))}]$, $[β_{((N - M) γ / 2)}, β_{((N - M) (1 - γ / 2))}]$ and $[π_{((N - M) γ / 2)}, π_{((N - M) (1 - γ / 2))}]$
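The scheme above can be sketched as a Metropolis-within-Gibbs sampler in plain Python. This is a minimal illustration under assumptions, not the implementation used for the reported results: the proposal standard deviations are passed in by the caller rather than derived from the variance-covariance matrix, $u (T_{0})$ is taken as 0, and the names `log_cond`, `mh_step` and `run_chain` are hypothetical.

```python
import math
import random

def log_cond(lam, beta, T, k, r, shape, rate, which):
    """Log kernel shared by the conditionals of lam and beta, plus the
    gamma log-prior (shape, rate) of the parameter named by `which`."""
    if lam <= 0 or beta <= 0:
        return -math.inf
    try:
        total, u_prev = 0.0, 0.0              # u(T_0) = 0 since survival at 0 is 1
        for Ti, ki, ri in zip(T, k, r):
            u = math.exp(lam * Ti - beta / Ti)
            if ki > 0:
                # k_i * { log[1 - exp(-(u_i - u_{i-1}))] - u_{i-1} }
                total += ki * (math.log(-math.expm1(-(u - u_prev))) - u_prev)
            total -= ri * u
            u_prev = u
    except (OverflowError, ValueError):
        return -math.inf
    x = lam if which == "lam" else beta
    return total + (shape - 1) * math.log(x) - rate * x

def mh_step(rng, current, sd, logp):
    """One random-walk Metropolis step with a normal proposal."""
    proposal = rng.gauss(current, sd)
    log_ratio = logp(proposal) - logp(current)
    if math.log(1.0 - rng.random()) < log_ratio:
        return proposal
    return current

def run_chain(T, k, r, n, lam0, beta0, sd_lam, sd_beta, prior, N, M, seed=None):
    """Return posterior means and the retained draws (lam, beta, pi)."""
    a, b, c, d, A, B = prior
    rng = random.Random(seed)
    m = len(T)
    A_star = A + sum(r[:m - 1])
    B_star = B + (m - 1) * (n - m) - sum((m - j) * r[j - 1] for j in range(1, m))
    lam, beta, draws = lam0, beta0, []
    for t in range(N):
        pi = rng.betavariate(A_star, B_star)      # exact draw for pi
        lam = mh_step(rng, lam, sd_lam,
                      lambda v: log_cond(v, beta, T, k, r, b, a, "lam"))
        beta = mh_step(rng, beta, sd_beta,
                       lambda v: log_cond(lam, v, T, k, r, d, c, "beta"))
        if t >= M:                                # discard the burn-in draws
            draws.append((lam, beta, pi))
    means = tuple(sum(col) / len(draws) for col in zip(*draws))
    return means, draws
```

The retained draws of each parameter can then be sorted to read off the symmetric credible interval limits, and the proposal standard deviations can be tuned so that a reasonable share of proposals is accepted.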

Real data analysis

A study was conducted at the Institute of Oncology in Tanta, Egypt. This study is concerned with the treatment of cancerous tumors in blood and their impact on the overall health of the patient. A total of 228 patients with varying degrees of disease took part in the study. Patients were examined every 15 days for 6 consecutive months. There were also cases of withdrawal (death, or interruption of treatment for different reasons).

As is well known, one cannot make a general statement regarding the behavior of the proposed estimators on the basis of a single sample; we therefore present a simulation study of the behavior of the estimators in the next section.

Simulation

The simulation is conducted using R version 3.2.2 (for more information about R programming, the reader may refer to the R manual, version 3.3.0 under development (2015-10-30), Copyright 2000–2015 R Core Team). The simulation setup parallels the real data given in (Table 1). To be specific, each replication of the simulation generates progressively type-I interval-censored data within twelve subintervals with pre-specified inspection times (half a month apart),

$T_{0} = 0, T_{1} = 16, T_{2} = 31, T_{3} = 46, T_{4} = 61, T_{5} = 76, T_{6} = 91, T_{7} = 106, T_{8} = 121, T_{9} = 136, T_{10} = 151, T_{11} = 166 \text{ and } T_{12} = 181.$

The last inspection time, $T_{12}$, is the scheduled time to terminate the experiment. The lifetime distribution is flexible Weibull with parameters $λ$ and $β$, where the simulation input parameters are selected close to the maximum likelihood estimates of the flexible Weibull parameters for the real data in (Table 1). The performance of parameter estimation under progressive type I interval censoring with random removal is compared across the maximum likelihood, bootstrap and MCMC procedures developed in this paper. The summary for 1000 simulation runs is shown in (Tables 2-5). For the Bayes estimates of $λ$, $β$ and $π$ using the MCMC method, we assume informative priors ($a = 2, b = 3, c = 4, d = 2, A = 2$ and $B = 3$) on $λ$, $β$ and $π$ in (Table 4). Bayes estimates under non-informative priors using the MCMC procedure are given in (Table 5).

	Cases of Withdrawal
$i$	Interval in Days	Number at Risk	Number of Failures $k_{i}$	Number of Random Removals $r_{i}$
1	[0,16)	228	25	2
2	[16,31)	201	39	2
3	[31,46)	160	25	1
4	[46,61)	134	20	3
5	[61,76)	111	11	1
6	[76,91)	99	14	2
7	[91,106)	83	11	3
8	[106,121)	69	17	0
9	[121,136)	52	6	2
10	[136,151)	44	31	1
11	[151,166)	12	6	1
12	[166,181)	5	5	0

Table 1 Patients examined every 15 days

	Different Parameters
	$λ$	$β$	$π$
Average	6.441	0.0841	0.6312
MSE	0.0134	0.0743	0.0484
Bias	0.0231	0.1073	0.0094
Variance	0.0129	0.0627	0.0483
ACI	[5.0132,7.9801]	[-0.1736,0.0901]	[0.4421,0.7782]
Length ACI	2.9669	0.2637	0.3361

Table 2 Progressively type I interval censored with random removal via maximum likelihood

	Different Parameters
	$λ$	$β$	$π$
Average	5.966	0.099	0.7058
MSE	0.1174	0.0984	0.0487
Bias	0.0346	0.1764	0.0109
Variance	0.1162	0.0673	0.0486
ABCI	[5.0117,8.0412]	[-0.1811,0.0884]	[0.4434,0.7992]
Length ABCI	3.0295	0.2695	0.3558

Table 3 Progressively type I interval censored with random removal via the bootstrap method

	Different Parameters
	$λ$	$β$	$π$
Average	4.902	0.0083	0.5118
MSE	0.0035	0.0277	0.04804
Bias	0.0049	0.0833	0.0017
Variance	0.0035	0.0207	0.04803
SCI	[5.1023,7.6421]	[-0.1075,0.0826]	[0.4927,0.6524]
Length SCI	2.5398	0.1901	0.1597

Table 4 Progressively type I interval censored with random removal via the MCMC procedure developed (Informative Priors)

	Different Parameters
	$λ$	$β$	$π$
Average	4.671	0.0398	0.6501
MSE	0.0673	0.0559	0.0656
Bias	0.0174	0.0304	0.0093
Variance	0.067	0.0549	0.0655
SCI	[4.8821,8.0307]	[-0.1010,0.0721]	[0.3881,0.7061]
Length SCI	3.1486	0.1731	0.318

Table 5 Progressively type I interval censored with random removal via the MCMC procedure developed (Non-Informative Priors)

The density functions of $π^{*} (λ \mid x, β, π)$ and $π^{*} (β \mid x, λ, π)$ can both be approximated by normal distribution functions, whereas the density function of $π^{*} (π \mid x, λ, β)$ is beta, as mentioned in subsection (3.3); these are plotted in (Figure 1). The chains of MCMC outputs of $λ$, $β$ and $π$, using 100 000 MCMC samples, are shown in (Figure 2). The results were obtained with 1000 bootstrap samples and 100 000 MCMC samples, discarding the first 50 000 values as 'burn-in'. The Bayes estimators are seen to have smaller risks than the classical estimators in all the cases considered. It may also be noted that the Bayes estimators obtained under informative priors are more efficient than those obtained under non-informative priors. This indicates that the Bayesian procedure with accurate prior information provides more precise estimates. Also, the length of the SCI (using informative priors) is smaller than the lengths of the ACI and ABCI.

Figure 1 Posterior density function of $λ$, $β$ and $π$

Figure 2 Chain of MCMC outputs of $λ$, $β$ and $π$

Conclusion

The methodology developed in this paper will be useful to researchers, engineers and statisticians, and in the medical field where this type of life test is needed, especially where the Weibull distribution is used. We have considered the problem of estimation for the flexible Weibull distribution in the presence of progressive Type-I interval censored samples with binomial removals. The scope of this censoring scheme in clinical trials has been discussed. We have found that the Bayesian procedure provides estimates of the unknown parameters of the flexible Weibull model with smaller MSE. The length of the SCI is smaller than that of the ACI and ABCI. Applying the MCMC process, through the M-H algorithm, to Bayesian estimation for other lifetime distributions under type I progressive interval censoring with random removal could be a fruitful direction for future research.

Appendix

The asymptotic variance-covariance matrix of the maximum likelihood estimators of the parameters $λ$, $β$ and $π$ is given by the elements of the inverse of the Fisher information matrix with random removal,

$I (\hat{λ}, \hat{β}, \hat{π}) = [\begin{matrix} I_{1} (\hat{λ}, \hat{β}) & 0 \\ 0 & I_{2} (\hat{π}) \end{matrix}]$ ,

Unfortunately, the exact mathematical expressions for the expectations involved are very difficult to obtain. Therefore, we give the approximate (observed) asymptotic variance-covariance matrix for the maximum likelihood estimators, which is obtained by dropping the expectation operator $E$, where

$I_{1}^{- 1} (\hat{λ}, \hat{β}) = {[\begin{matrix} (- \frac{\partial^{2} \ln L}{\partial λ^{2}}) & (- \frac{\partial^{2} \ln L}{\partial λ \partial β}) \\ (- \frac{\partial^{2} \ln L}{\partial λ \partial β}) & (- \frac{\partial^{2} \ln L}{\partial β^{2}}) \end{matrix}]}^{- 1}_{λ = \hat{λ}, β = \hat{β}} = [\begin{matrix} V (\hat{λ}) & Cov (\hat{λ}, \hat{β}) \\ Cov (\hat{λ}, \hat{β}) & V (\hat{β}) \end{matrix}]$

The second partial derivatives needed for $I_{1} (\hat{λ}, \hat{β})$ are given at the end of this Appendix.

$I_{2} (\hat{π}) = E (- \frac{\partial^{2} \ln L (π)}{\partial π^{2}})$ ,

and

$\frac{\partial^{2} \ln P (R)}{\partial π^{2}} = \frac{- 1}{π^{2}} \sum_{j = 1}^{m - 1} r_{j} - \frac{1}{{(1 - π)}^{2}} [(m - 1) (n - m) - \sum_{j = 1}^{m - 1} (m - j) r_{j}] .$
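Dropping the expectation, the observed information for $π$ is the negative of this second derivative evaluated at $\hat{π}$, and its reciprocal gives an approximate variance. A minimal sketch follows; the function name `pi_variance` and the toy input in the usage note are assumptions.

```python
def pi_variance(pi_hat, r, n):
    """Approximate Var(pi_hat) as the inverse of the observed information for pi."""
    m = len(r)
    sum_r = sum(r[:m - 1])
    D = (m - 1) * (n - m) - sum((m - j) * r[j - 1] for j in range(1, m))
    # Negative second derivative of ln P(R) with respect to pi
    info = sum_r / pi_hat**2 + D / (1.0 - pi_hat)**2
    return 1.0 / info
```

With the hypothetical pattern `r = [2, 1, 0]`, `n = 12` and the corresponding estimate 0.1875 from (10), the observed information simplifies to $16 / (\hat{π} (1 - \hat{π}))$.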

Numerical techniques are needed to obtain the Fisher information matrix and the variance-covariance matrix. Note that, under both fixed and random removal, the estimates here are based on intervals of equal length, so that monitoring and censoring occur periodically, say $T_{i} = i t$.

We determine the second partials by differentiating the first partials, equations (8) and (9), obtaining

$\frac{\partial^{2} \log L}{\partial λ \partial β} = \sum_{i = 1}^{m} k_{i} \left\{ \frac{\frac{\partial^{2} F (T_{i})}{\partial λ \partial β} - \frac{\partial^{2} F (T_{i - 1})}{\partial λ \partial β}}{F (T_{i}) - F (T_{i - 1})} - \frac{[\frac{\partial F (T_{i})}{\partial λ} - \frac{\partial F (T_{i - 1})}{\partial λ}] [\frac{\partial F (T_{i})}{\partial β} - \frac{\partial F (T_{i - 1})}{\partial β}]}{{[F (T_{i}) - F (T_{i - 1})]}^{2}} \right\} + \sum_{i = 1}^{m} r_{i} \left\{ \frac{\frac{\partial^{2}}{\partial λ \partial β} [1 - F (T_{i})]}{[1 - F (T_{i})]} - \frac{[\frac{\partial}{\partial λ} [1 - F (T_{i})]] [\frac{\partial}{\partial β} [1 - F (T_{i})]]}{{[1 - F (T_{i})]}^{2}} \right\}$

$\frac{\partial^{2} \log L}{\partial λ^{2}} = \sum_{i = 1}^{m} k_{i} \left\{ \frac{\frac{\partial^{2} F (T_{i})}{\partial λ^{2}} - \frac{\partial^{2} F (T_{i - 1})}{\partial λ^{2}}}{F (T_{i}) - F (T_{i - 1})} - \frac{{[\frac{\partial F (T_{i})}{\partial λ} - \frac{\partial F (T_{i - 1})}{\partial λ}]}^{2}}{{[F (T_{i}) - F (T_{i - 1})]}^{2}} \right\} + \sum_{i = 1}^{m} r_{i} \left\{ \frac{\frac{\partial^{2}}{\partial λ^{2}} [1 - F (T_{i})]}{[1 - F (T_{i})]} - \frac{{[\frac{\partial}{\partial λ} [1 - F (T_{i})]]}^{2}}{{[1 - F (T_{i})]}^{2}} \right\}$

$\frac{\partial^{2} \log L}{\partial β^{2}} = \sum_{i = 1}^{m} k_{i} \left\{ \frac{\frac{\partial^{2} F (T_{i})}{\partial β^{2}} - \frac{\partial^{2} F (T_{i - 1})}{\partial β^{2}}}{F (T_{i}) - F (T_{i - 1})} - \frac{{[\frac{\partial F (T_{i})}{\partial β} - \frac{\partial F (T_{i - 1})}{\partial β}]}^{2}}{{[F (T_{i}) - F (T_{i - 1})]}^{2}} \right\} + \sum_{i = 1}^{m} r_{i} \left\{ \frac{\frac{\partial^{2}}{\partial β^{2}} [1 - F (T_{i})]}{[1 - F (T_{i})]} - \frac{{[\frac{\partial}{\partial β} [1 - F (T_{i})]]}^{2}}{{[1 - F (T_{i})]}^{2}} \right\}$

$F (T_{i}) = 1 - e^{- e^{λ T_{i} - \frac{β}{T_{i}}}}$

$\frac{\partial F (T_{i})}{\partial λ} = T_{i} e^{λ T_{i} - \frac{β}{T_{i}}} e^{- e^{λ T_{i} - \frac{β}{T_{i}}}} = T_{i} e^{λ T_{i} - \frac{β}{T_{i}}} [1 - F (T_{i})]$

$\frac{\partial F (T_{i})}{\partial β} = \frac{- 1}{T_{i}} e^{λ T_{i} - \frac{β}{T_{i}}} e^{- e^{λ T_{i} - \frac{β}{T_{i}}}} = \frac{- 1}{T_{i}} e^{λ T_{i} - \frac{β}{T_{i}}} [1 - F (T_{i})]$

$\frac{\partial}{\partial λ} [1 - F (T_{i})] = - \frac{\partial F (T_{i})}{\partial λ}$

$\frac{\partial}{\partial β} [1 - F (T_{i})] = - \frac{\partial F (T_{i})}{\partial β}$

$\frac{\partial^{2} F (T_{i})}{\partial λ^{2}} = T_{i} e^{λ T_{i} - \frac{β}{T_{i}}} \{T_{i} [1 - F (T_{i})] - \frac{\partial F (T_{i})}{\partial λ}\}$

$\frac{\partial^{2} F (T_{i})}{\partial β^{2}} = \frac{1}{T_{i}} e^{λ T_{i} - \frac{β}{T_{i}}} \{\frac{1}{T_{i}} [1 - F (T_{i})] + \frac{\partial F (T_{i})}{\partial β}\}$

$\frac{\partial^{2} F (T_{i})}{\partial λ \partial β} = - e^{λ T_{i} - \frac{β}{T_{i}}} \{[1 - F (T_{i})] + T_{i} \frac{\partial F (T_{i})}{\partial β}\}$
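Derivatives of this kind are easy to get wrong by a sign or a factor of $T_{i}$, so a finite-difference check is a useful safeguard. The sketch below differentiates $F (T) = 1 - e^{- u}$ with $u = e^{λ T - β / T}$ directly, which gives $\partial F / \partial λ = T u e^{- u}$ and $\partial F / \partial β = - u e^{- u} / T$, and codes the second partials in terms of these. All function names are hypothetical.

```python
import math

def F(T, lam, beta):
    """Flexible Weibull distribution function at T > 0."""
    return -math.expm1(-math.exp(lam * T - beta / T))

def dF_dlam(T, lam, beta):
    u = math.exp(lam * T - beta / T)
    return T * u * math.exp(-u)

def dF_dbeta(T, lam, beta):
    u = math.exp(lam * T - beta / T)
    return -(1.0 / T) * u * math.exp(-u)

def d2F_dlam2(T, lam, beta):
    u = math.exp(lam * T - beta / T)
    S = math.exp(-u)                      # 1 - F(T)
    return T * u * (T * S - dF_dlam(T, lam, beta))

def d2F_dbeta2(T, lam, beta):
    u = math.exp(lam * T - beta / T)
    S = math.exp(-u)
    return (1.0 / T) * u * (S / T + dF_dbeta(T, lam, beta))

def d2F_dlam_dbeta(T, lam, beta):
    u = math.exp(lam * T - beta / T)
    S = math.exp(-u)
    return -u * (S + T * dF_dbeta(T, lam, beta))

def central_diff(f, x, h=1e-6):
    """Symmetric finite-difference approximation to f'(x)."""
    return (f(x + h) - f(x - h)) / (2.0 * h)
```

Each analytic expression can be compared with `central_diff` applied to the function one order lower, for instance `central_diff(lambda b: dF_dlam(T, lam, b), beta)` against `d2F_dlam_dbeta(T, lam, beta)`.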