Research Article Volume 8 Issue 1
Department of Statistics, College of Science, Eritrea
Correspondence:
Received: December 28, 2018 | Published: February 26, 2019
Citation: Abebe B, Shukla Kk. A discrete Pranav distribution and its applications. Biom Biostat Int J. 2019;8(1):33-37. DOI: 10.15406/bbij.2019.08.00267
In the recent decades, the discretization of continuous distribution has been attracting to the attention of researchers because it generates distributions that can be used for strictly discrete data. In this paper, a discrete Pranav distribution, which is a discrete analogue of continuous Pranav distribution, has been carried out. It’s important properties including coefficient of variation, skewness, kurtosis and index of dispersion have been obtained and discussed graphically. The method of maximum likelihood estimation has been used for estimating its parameter. The goodness of fit of the proposed distribution have been illustrated using some real count datasets and it was found better fit as compared to other one parameter discrete distributions.
Keywords: Pranav distribution, discretization, moment generating function, moments, estimation, goodness of fit
In recent past years, the use of discrete analogue of a continuous distribution avoids the use of a continuous distribution in the case of strictly available discrete data.
In many cases, it is not easy to get samples from continuous distributions. The observed values, in the most of cases, are collected actually discrete in nature for the reason that they are measured to only finite number of decimal places and cannot completely presents all points in a continuum. According to Lai,1 discretization of a continuous lifetime model is an appealing approach to derive a discrete lifetime model corresponding to the continuous one. Therefore, it is reasonable and convenient to model the situation by an appropriate discrete distribution generated from the underlying continuous distribution preserving one or more important characteristics including probability density function (pdf), mean residual life function etc. and important statistical properties of the distribution.
In Statistics literature, different researchers have used different methods of discritization to propose a discrete type of distribution analogues of continuous distribution. In this study, one of the discretization methods has been used to find discrete analogous of continuous Pranav distribution introduced by Shukla.2 Infinite series method has been used to find pmf of Pranav distribution, appropriate definition is given in the next paragraph. This method was firstly used by Good3 who proposed the discrete Good distribution to model for the frequencies of species. It was given as follows.
A random variable is said to have a discrete Good distribution if its pmf can be expressed as
(1)
whereThe method of infinite series is formulated by the definition which is given as below:
Definition: Let be a continuous random variable having pdf with support on . Then the corresponding discrete random variable has pmf given by
(1.2)
where may be the vector of parameters indexing the distribution of .
This method has been used by many researchers to derive discretization of continuous distribution, such as, Kulasekara & Tonkyn,4 Doray & Luong,5 Sato et al.,6 Nekoukhou et al.,7 are some among others, who proposed a version of the method when the continuous random variable of interest is defined on . Thus, if the random variable is defined on , the pmf of can be defined as
(1.3)
Josmar et al.,8 using infinite series method has derived a discrete Shanker distribution (DSD) with parameter and having pmf
(1.4)
They have discussed its various statistical properties including its applications to model count datasets in their paper. Which is a discrete analogue of continuous Shanker distribution introduced by Shanker9 having pdf(1.5)
Using same method of discretization, the pmf of discrete Lindley distribution (DLD) proposed by Berhane & Shanker10 is given by(1.6)
where the parameter .
They have discussed its important statistical properties including estimation of parameter of DLD and applied on some count datasets from engineering and biology in their paper. They showed its superiority over other discrete one parameter distributions such as Poisson Lindley distribution (PLD) proposed by Shankar,11 Poisson Akash distribution (PAD) proposed by Shanker,12 and DSD proposed by Josmer et al. ,8As mentioned above, DLD is a discrete analogue of continuous Lindley distribution introduced by Lindley13 having pdf
(1.7)
Recently, Berhane & Shanker,14 proposed a discrete Akash distribution (DAD) using infinite series method, the pmf of a discrete Akash distribution is given as(1.8)
They have discussed its important statistical properties including estimation of method and applied on some count datasets, and showed its superiority over DSD, DLD, PLD and PAD in their paper. Which is the discrete analogue of a continuous Akash distribution introduced by Shanker,15 its pdf is given as:(1.9)
Shanker12 proposed PAD, a Poisson mixture of Akash distribution, having pmf(1.10)
He has discussed its important statistical properties including estimation of parameter along with applications of PAD in his paper. PAD was applied to count datasets and showed its superiority with PLD and other distribution of one parameter.
The PLD is a Poisson mixture of Lindley distribution introduced by Sankaran11 and is defined by its pmf
(1.11)
The main objective of this paper is to propose a discretization of Pranav distribution for the reason being that it was observed, Pranav distribution gives better fit than one parameter continuous Akash, Shaker, Sujatha, Lindley and Exponential distributions. Keeping this view in mind, it is hoped that it would be better than discrete Akash, discrete Shanker and discrete Lindley distributions and other one parameter discrete distributions.
The main objective of this paper is to propose a discretization of Pranav distribution for the reason being that it was observed, Pranav distribution gives better fit than one parameter continuous Akash, Shaker, Sujatha, Lindley and Exponential distributions. Keeping this view in mind, it is hoped that it would be better than discrete Akash, discrete Shanker and discrete Lindley distributions and other one parameter discrete distributions.
(2.1)
(2.2)
Shukla2 has discussed its various statistical properties including moments based measures, hazard rate function, and other important properties along with Bonferroni and Lorenz curves and stress-strength reliability. Pranav distribution applied for modeling lifetime data from biomedical sciences and engineering and explained its superiority over Akash, Shanker, Ishita, Sujatha, and exponential distributions.
Using the above definition, the pmf of the discrete random variable , corresponding to a continuous random variable follows Pranav distribution (2.1) with parameter , can be obtained as
(2.3)
We would call this distribution, a discrete Pranav distribution (DPD). The nature and behavior of DPD for varying values of its parameter has been shown graphically in Figure 1. From the figure it was observed that pmf of DPD is increasing as increased values of (Figure 1).
The survival function, and the cumulative distribution function (cdf), of DPD can be obtained as
(2.5)
cdf graphs of DPD are presented in Figure 2.
Since is a decreasing function of , is log-concave and therefore, the DPD has an increasing hazard rate. Further, for , which implies unimodality, by theorem 3 of Keilson & Gerber.16 The detailed about interrelationship between log-concavity, unimodality and increasing hazard rate of discrete distributions can be shown in Grandell.17
and
The first four moments about origin of DPD can thus be obtained as
Using the relationship between central moments and moments about origin, the central moments of DPD are derived as
The coefficient of variation (C.V), coefficient of skewness , coefficient of kurtosis and index of dispersion of DPD can be obtained using the relationships below
Table 1 exhibits the nature and behavior of coefficient of variation (C.V), coefficient of skewness, coefficient of kurtosis and index of dispersion (ID) for varying values of the parameter (Figure 3).
Figure 3 The plot of measures of descriptive statistics of DPD for varying values of the parameter .
|
Values of descriptive statistics |
|||||
Mean |
Variance |
C.V |
Skewness |
Kurtosis |
ID |
|
0.5 |
7.915 |
16.3833 |
0.5114 |
0.95 |
4.4328 |
2.0699 |
1 |
3.2844 |
5.288 |
0.7001 |
0.6689 |
3.6405 |
1.61 |
1.5 |
1.1918 |
2.2352 |
1.2545 |
1.3138 |
4.5122 |
1.8755 |
2 |
0.4147 |
0.7018 |
2.0197 |
2.4253 |
9.6591 |
1.692 |
2.5 |
0.1712 |
0.2418 |
2.8725 |
3.4854 |
17.7216 |
1.4125 |
3 |
0.0825 |
0.1013 |
3.8558 |
4.479 |
27.2341 |
1.2272 |
3.5 |
0.0438 |
0.0492 |
5.0675 |
5.5959 |
39.358 |
1.1239 |
4 |
0.0245 |
0.0261 |
6.6069 |
7.0201 |
57.6163 |
1.0682 |
Table 1 Values of descriptive statistics of DPD for varying values of
It is clear from Table 1 that the mean, variance, and index of dispersion of DPD are decreasing as increased values of the parameter , whereas coefficient of variation, coefficient of skewness and coefficient of kurtosis of DPD are increasing as increased values of parameter . , indicates that DPD can be a suitable model for over-dispersed data.
and its log likelihood function is
Differentiating above equation with respect , we have
Above equation can be simplified to solve the value of parameter. In this paper, R-software is used to estimate for value of .
In this section, the goodness of fit of the DPD has been discussed with two count datasets. The dataset in Table 2 has been taken from Kemp & Kemp18 and dataset in Table 3 has been taken from Beall,19 detailed about the datasets can been shown in their paper. The proposed model is compared with DSD, DLD, PLD, PAD and DAD (Figure 4&5).20
No. of error per group |
Observed |
Expected frequency |
|||||
Frequency |
PLD |
PAD |
DLD |
DSD |
DAD |
DPD |
|
0 |
35 |
33.1 |
33.5 |
31 |
31.7 |
33.2 |
36 |
1 |
11 |
15.3 |
14.7 |
17.4 |
16.9 |
14.2 |
10.6 |
2 |
8 |
6.7 |
6.6 |
7.4 |
7.2 |
7.6 |
7.1 |
3 |
4 |
2.9 |
3 |
2.8 |
2.7 |
3.3 |
3.9 |
4 |
2 |
2 |
2.2 |
1.4 |
1.5 |
1.7 |
2.4 |
Total |
60 |
60 |
60 |
60 |
60 |
60 |
60 |
|
|
1.7434 |
2.078 |
1.2678 |
1.2276 |
1.5404 |
1.689 |
1.8141 |
1.4185 |
3.3667 |
2.9963 |
1.0398 |
0.1712 |
||
d.f. |
1 |
2 |
1 |
1 |
2 |
2 |
|
p-value |
0.178 |
0.492 |
0.066 |
0.0837 |
0.595 |
0.919 |
Table 2 Distribution of mistakes in copying groups of random digits
No. of insects |
Observed |
Expected frequency |
|||||
Frequency |
PLD |
PAD |
DLD |
DSD |
DAD |
DPD |
|
0 |
33 |
31.5 |
32 |
29.6 |
30.3 |
31.6 |
34.4 |
1 |
12 |
14.2 |
13.6 |
16.2 |
15.6 |
13.2 |
9.8 |
2 |
6 |
6.1 |
6 |
6.6 |
6.4 |
6.9 |
6.4 |
3 |
3 |
2.5 |
2.6 |
2.4 |
2.4 |
2.9 |
3.4 |
4 |
1 |
1 |
1.1 |
0.8 |
0.8 |
1 |
1.4 |
5 |
1 |
0.7 |
0.7 |
0.4 |
0.5 |
0.4 |
0.6 |
Total |
56 |
56 |
56 |
56 |
56 |
56 |
56 |
|
1.8115 |
2.1446 |
1.2993 |
1.2535 |
1.5686 |
1.7122 |
|
|
0.4598 |
0.2541 |
1.5422 |
1.1516 |
0.1747 |
0.6055 |
|
d.f. |
1 |
1 |
1 |
1 |
1 |
2 |
|
p-value |
0.498 |
0.614 |
0.215 |
0.283 |
0.676 |
0.739 |
Table 3 Observed and expected frequencies for distribution of Pyrausta nublilalis in 1937
In this section, the goodness of fit of the DPD has been discussed with two count datasets. The dataset in Table 2 has been taken from Kemp & Kemp18 and dataset in Table 3 has been taken from Beall,19 detailed about the datasets can been shown in their paper. The proposed model is compared with DSD, DLD, PLD, PAD and DAD (Figure 4&5).20
In this paper, a discrete Pranav distribution (DPD) has been proposed. Its moment generating function, moments and moments based measures including statistical constants have been derived and their nature and behavior has been discussed numerically and graphically. The method of maximum likelihood estimation has been discussed for estimating its parameter. The goodness of fit of DPD has been explained using two real count datasets. The DPD gives better fit as compared to PLD, PAD, DLD, DSD and DAD in the presented datasets.
None.
Author declares that there is no conflict of interest.
©2019 Abebe, et al. This is an open access article distributed under the terms of the, which permits unrestricted use, distribution, and build upon your work non-commercially.
2 7