
 
 
Research Article Volume 4 Issue 7
     
 
	Garima distribution and its application to model behavioral science data
 Rama Shanker
   
    
 
   
    
    
  
    
    
   
      
      
        
        Regret for the inconvenience: we are taking measures to prevent fraudulent form submissions by extractors and page crawlers. Please type the correct Captcha word to see email ID.
        
        
 
 
 
          
     
    
    
    
    
    
        
        
       
     
   
 
    
    
  
Department of Statistics, Eritrea Institute of Technology, Eritrea
Correspondence: Rama Shanker, Department of Statistics, Eritrea Institute of Technology, Asmara, Eritrea
Received: September 30, 2016 | Published: December 9, 2016
Citation: Shanker R. Garima distribution and its application to model behavioral science data. Biom Biostat Int J. 2016;4(7):275-281.  DOI: 10.15406/bbij.2016.04.00116
 Download PDF
        
       
Abstract
  In this  paper a continuous distribution named “Garima distribution” has been suggested  for modeling data from behavioral science. The important properties including  its shape, moments, skewness, kurtosis, hazard rate function, mean residual  life function, stochastic ordering, mean deviations, order statistics, Bonferroni and Lorenz curves,  entropy measure, stress-strength reliability have been discussed. The condition  under which Garima distribution is over-dispersed, equi-dispersed, and  under-dispersed are presented along with other one parameter continuous  distributions. The estimation of its parameter has been discussed using maximum  likelihood estimation and method of moments. The application of the proposed  distribution has been explained using a numerical example from behavioral  science and the fit has been compared with other one parameter continuous  distributions.
  Keywords: lifetime distribution, moments, hazard rate function, mean residual life  function, mean deviations, order statistics, estimation of parameter, goodness of fit
 
Introduction
  The  modeling and analyzing lifetime data are crucial in many applied sciences  including behavioral science, medicine, engineering, insurance and finance,  amongst others.  There are a number of  continuous distributions for modeling lifetime data such as exponential,  Lindley, gamma, lognormal, and Weibull and their generalizations. The  exponential, Lindley and the Weibull distributions are more popular than the  gamma and the lognormal distributions because the survival functions of the  gamma and the lognormal distributions cannot be expressed in closed forms and  both require numerical integration. Though each of exponential and Lindley  distributions has one parameter, the Lindley distribution has one advantage  over the exponential distribution that the exponential distribution has  constant hazard rate whereas the Lindley distribution has monotonically  decreasing hazard rate.
  Recently  Shanker1–4 has introduced new lifetime  distributions, namely Shanker, Akash, Aradhana, and Sujatha distributions for  modeling lifetime data from biomedical sciences, engineering and behavioral  sciences and showed its superiority over Lindley5  and exponential distributions. The probability density function (p.d.f.) and  the cumulative distribution function (c.d.f.) of Sujatha, Aradhana, Akash,  Shanker, Lindley and exponential distributions are presented in Table 1.
  
  
  
    
      Distributions  | 
      Pdf  | 
      Cdf  | 
    
    
      Sujatha  | 
      
  | 
      
  | 
    
    
      Aradhana  | 
      
  | 
      
  | 
    
    
      Akash  | 
      
  | 
      
  | 
    
    
      Shanker  | 
      
  | 
      
  | 
    
    
      Lindley  | 
      
  | 
      
  | 
    
    
      Exponential  | 
      
  | 
      
  | 
    
  
  Table 1 pdf and cdf of Sujatha,4 Aradhana,3  Akash,2 Shanker,1  Lindley5 and exponential distributions
 
 
 
  
  
  
  
  
  
  
  
  
 
A new lifetime distribution
  The  probability density function (p.d.f.) of a new lifetime distribution can be  introduced as
    
                         (2.1)                                           
   We would call this distribution, “Garima  distribution”. This distribution can be easily expressed as a mixture of  exponential 
 and gamma 
 with mixing  proportion 
. We have
   
                                            (2.2)
    where 
.
    The  corresponding cumulative distribution function (c.d.f.) of (2.1) is given  by          
    
;       
                              (2.3)
  The  graphs of the p.d.f. and the c.d.f. of Garima distributions for different  values of 
 are shown in Figure 1.
 
 
  Figure 1 Graphs of the pdf and cdf of  Garima distribution for various values of the parameter θ.
 
 
 
 
 
 
 
 
 
 
Moments and related measures
  The 
 the moment about origin of Garima distributon (2.1)  has been obtained as
    
  
  and so  the first four moments about origin as
    
,        
,          
,         
  Using  the relationship between central moments and the moments about origin, the  central moments of Garima distribution are obtained as
    
  
 
  
 
  
  Thus  the coefficient of variation 
, coefficient of skewness 
, coefficient of kurtosis 
 and index of dispersion 
 of Garima  distribution are obtained as
    
  
  
  
 
    
 
    The  condition under which Garima distribution is over-dispersed 
, equi-dispersed 
 and under-dispersed 
 are presented in Table 2  along with other lifetime distributions.
 
 
    
      Lifetime 
        Distributions  | 
      Over-Dispersion 
          
  | 
      Equi-Dispersion 
        
        
      
        | 
      Under-Dispersion 
        
   | 
    
    
      Garima  | 
      
  | 
      
  | 
      
  | 
    
    
      Sujatha  | 
      
          
   | 
      
          
   | 
      
          
   | 
    
    
      Aradhana  | 
      
          
   | 
      
          
   | 
      
          
   | 
    
    
      Akash  | 
      
          
   | 
      
          
   | 
      
          
   | 
    
    
      Shanker  | 
      
          
   | 
      
          
   | 
      
          
   | 
    
    
      Lindley  | 
      
          
   | 
      
          
   | 
      
          
   | 
    
    
      Exponential  | 
      
          
   | 
      
          
   | 
      
          
   | 
    
  
  Table 2 Over-dispersion, equi-dispersion  and under-dispersion of Garima, Sujatha,4 Aradhana,3 Akash,2 Shanker,1 Lindley,5 and exponential  distributions for varying values of their parameter θ
 
 
 
 
 
 
 
 
 
 
 
 
 
Generating functions
  The  moment generating function 
, characteristic function 
, and cumulant generating function 
of Garima distribution (1.3) are given by 
    
  
 
                  
 
  
 
    Using  the expansion 
, we get
  
  
  
  Thus  the 
th cumulant of Garima distribution is given  by
    
= coefficient of 
 in 
                     
    
  This  gives
   
  
  
  
 
    Which the  same are as obtained earlier.
 
Hazard rate function and mean residual life function
  Let 
 be a continuous random variable with pdf 
 and cdf 
. The hazard rate function (also known as the failure  rate function) and the mean residual life function of 
 are respectively defined as 
    
                                        (5.1)
    and 
                             (5.2)
    The  hazard rate function, 
 and the mean residual life function,
 of Garima  distribution are given by 
  
                                                                   (5.3)
    and                   
                                                                (5.4)
    
    It can  be easily verified that 
 and 
.It is also obvious from the graphs of 
 and 
 that 
 is an  increasing or decreasing function of 
, and 
, where as 
 is a decreasing function of 
, and 
. The graph of the hazard rate function and mean  residual life function of Garima distribution are shown in Figures 2 & 3.          
  Figure 2 Graph of hazard rate function of Garima distribution for different values of parameter θ.
 
 
 
 
  Figure 3 Graph of mean residual life function of Garima distribution for different values of parameter θ.
 
 
 
 
 
Stochastic orderings
  Stochastic  ordering of positive continuous random variables is an important tool for  judging their comparative behavior. A random variable 
 is said to be smaller than a random variable 
 in the 
 
 
 
 
  
    - stochastic order 
 if 
 for all 
 
    - hazard rate order 
 if 
 for all 
 
    - mean residual life order 
 if 
 for all 
 
    - likelihood ratio order 
 if 
 decreases in 
.
 
  
  The  following results due to Shaked & Shanthikumar [6]  are well known for establishing stochastic ordering of distributions
  
   
                                            (6.1)
   
    The  Garima distribution is ordered with respect to the strongest ‘likelihood ratio’  ordering as shown in the following theorem:
  
    - Theorem: Let 
 Garima distributon 
 and 
 Garima distribution 
. If 
, then 
 and hence 
, 
 and 
.
 
    - Proof: We  have 
 
  
  
  
    Now 
  
 
    This gives       
     Thus for 
, 
. This means that 
 and hence 
, 
 and 
.
 
Mean deviations
  The  amount of scatter in a population is measured to some extent by the totality of  deviations usually from mean and median. These are known as the mean deviation  about the mean and the mean deviation about the median defined by
    
 and 
, respectively, where 
 and 
. The measures 
 and 
 can be calculated using the relationships
  
 
  
 
 
 
 
                                                            (7.1)
    and 
  
 
  
 
  
 
 
                                                                    (7.2)
    Using p.d.f.  (2.1) and expression for the mean of Garima distribution, we get
  
                           (7.3)
  
                           (7.4)
    Using  expressions from (7.1), (7.2), (7.3), and (7.4), the mean deviation about mean,
 and the mean  deviation about median, 
 of Garima  distribution are obtained as
  
                                                                        (7.5)
    
                                  (7.6)
 
Order statistics
  Let 
 be a random  sample of size 
 from Garima  distribution (2.1). Let 
 denote the corresponding order statistics. The p.d.f.  and the c.d.f. of the 
th order statistic, say 
 are given by
    
  
 
    and 
  
 
  
,
    respectively, for 
.
   Thus,   the p.d.f. and the c.d.f of 
th  order  statistics of Garima distribution are given by
   
 
    and 
  
 
Bonferroni and lorenz curves
  The  Bonferroni and Lorenz curves7 and Bonferroni  and Gini indices have applications not only in economics to study income and  poverty, but also in other fields like reliability, demography, insurance and  medicine. The Bonferroni and Lorenz curves are defined as
    
        (9.1)
    and 
         (9.2)
    respectively or equivalently 
 
                                                                      (9.3)
    and    
                                                                      (9.4)
    respectively, where 
and 
.
    The Bonferroni and Gini indices are thus  defined as
  
                                                                        (9.5)
    and 
                                                               (9.6)
    respectively.
    Using p.d.f. (2.1), we get 
  
                                                   (9.7)
    Now  using equation (8.7) in (8.1) and (8.2), we get 
 
                                                (9.8)
    and  
                                                       (9.9)
    Now using equations (9.8) and (9.9) in (9.5)  and (9.6), the Bonferroni and Gini indices of Garima distribution (2.1) are  obtained as
  
                                                       (9.10)
  
                                                  (9.11)
 
Renyi entropy
  Entropy  of a random variable 
 is a measure of variation of uncertainty. A popular  entropy measure is Renyi entropy [8]. If 
 is a continuous random variable having probability  density function 
, then Renyi entropy is defined as
    
 
    where 
.
    Thus,  the Renyi entropy for the Garima distribution (2.1) is obtained as
  
 
  
  
  
  
  
 
 
Stress-strength reliability
  The  stress- strength reliability describes the life of a component which has random  strength 
 that is subjected to a random stress 
. When the stress applied to it exceeds the strength,  the component fails instantly and the component will function satisfactorily  till 
. Therefore, 
 is a measure of component reliability and in  statistical literature it is known as stress-strength parameter. It has wide  applications in almost all areas of knowledge especially in engineering such as  structures, deterioration of rocket motors, static fatigue of ceramic  components, aging of concrete pressure vessels etc.
    Let 
 and 
 be independent strength and stress random variables  having Garima distribution (2.1) with parameter 
 and 
 respectively.  Then the stress-strength reliability 
 of Garima distribution can be obtained as
 
 
 
  
.
 
Estimation of parameter
  Maximum  likelihood estimates (MLE)   
  Let 
 be a random  sample from Garima distribution (2.1). The likelihood function, 
 of (2.1) is given by
  
    
 
    The natural log likelihood  function is thus obtained as
  
 
    Now        
    where 
 is the sample mean.
    The  maximum likelihood estimate,
 of 
 is the solution  of the equation 
 and so it can  be obtained by solving the following non-linear equation 
  
                                          (12.1.1)
Method  of moment estimates (MOME)
Equating  the population mean of the Garima distribution to the corresponding sample  mean, the method of moment estimate (MOME)
, of 
 can be obtained as
   
                  (12.2.1)
 
A numerical example
  In this  section the goodness of fit of the Garima distribution has been discussed with  an example from behavioral science. The data is related with behavioral  sciences, collected by Balakrishnan N et al. [9].  The scale “General Rating of Affective Symptoms for Preschoolers (GRASP)”  measures behavioral and emotional problems of children, which can be classified  with depressive condition or not according to this scale. A study conducted by  the authors in a city located at the south part of Chile has allowed collecting  real data corresponding to the scores of the GRASP scale of children with  frequency in parenthesis, which are:
    
        19(6)  | 
        20(15)  | 
        21(14)  | 
        22(9)  | 
        23(12)  | 
        24(10)  | 
      
      
        25(6)  | 
        26(9)  | 
        27(8)  | 
        28(5)  | 
        29(6)  | 
        30(4)  | 
      
      
        31(3)  | 
        32(4)  | 
        33  | 
        34  | 
        35(4)  | 
        36(2)  | 
      
      
        37(2)  | 
        39  | 
        42  | 
        44  | 
         | 
         | 
      
    
   
  In  order to compare distributions, 
    
, AIC (Akaike Information Criterion), AICC (Akaike  Information Criterion Corrected), BIC (Bayesian Information Criterion),K-S  Statistics ( Kolmogorov-Smirnov Statistics)   for above data set have been computed and presented in Table 3.  The  formulae for computing AIC, AICC, and BIC are as follows: 
    
,    
,    
 
  The  best distribution is the distribution which corresponds to lower values of 
, AIC, AICC, and BIC.
  It can  be easily seen from above table that the Garima distribution is better than  Aradhana, Sujatha, Akash, Shanker, Lindley and exponential distributions  for modeling behavioral science data and thus  Garima distribution should be preferred over Aradhana, Sujatha, Akash, Shanker,  Lindley and exponential distributions   for modeling behavioral science data.
  
  
    
      Model  | 
        ML Estimate  | 
      
   | 
      AIC  | 
      AICC  | 
      BIC  | 
    
    
      Garima  | 
      0.05317  | 
      188.32  | 
      190.32  | 
      190.35  | 
      193.23  | 
    
    
      Aradhana  | 
      0.11557  | 
      989.49  | 
      991.49  | 
      991.52  | 
      994.40  | 
    
    
      Sujatha  | 
      0.11745  | 
      985.69  | 
      987.69  | 
      987.72  | 
      990.60  | 
    
    
      Akash  | 
      0.11961  | 
      981.28  | 
      983.28  | 
      983.31  | 
      986.18  | 
    
    
      Shanker  | 
      0.07974  | 
      1033.10  | 
      1035.10  | 
      1035.13  | 
      1037.99  | 
    
    
      Lindley  | 
      0.07725  | 
      1041.64  | 
      1043.64  | 
      1043.68  | 
      1046.54  | 
    
    
      Exponential  | 
      0.04006  | 
      1130.26  | 
      1132.26  | 
      1132.29  | 
      1135.16  | 
    
  
  Table 3 MLE’s,-2ln L, AIC, AICC, and  BIC of Garima, Aradhana, Sujatha [4], Akash [2], Shanker [1],  Lindley [5] and exponential distributions
 
 
 
  
  
  
  
  
  
  
  
  
  
  
 
 
Conclusion
   A one  parameter lifetime distribution named, “Garima distribution” has been proposed  and studied. Its mathematical properties including shape, moments, skewness,  kurtosis, hazard rate function, mean residual life function, stochastic  ordering, mean deviations, order statistics, Bonferroni and Lorenz curves,  Renyi entropy and stress-strength reliability   have been discussed. The condition under which Garima distribution is  over-dispersed, equi-dispersed, and under-dispersed are presented along with  the conditions under which Sujatha, Aradhana, Akash, Shanker, Lindley and  exponential distributions are over-dispersed, equi-dispersed and  under-dispersed. The method of moments and the method of maximum likelihood  estimation have also been discussed for estimating its parameter. Finally, a  numerical example from behavioral science has been considered for the goodness  of fit of Garima distribution and the fit has been compared with Sujatha,  Aradhana, Akash, Shanker, Lindley and exponential distributions. The goodness  of fit of the Garima distribution shows that it is an important model for  modeling behavioral science data.  
  NOTE: The paper is named in  loving memory of my niece Garima Satypriya, daughter of my respected brother  Prof. Uma Shanker, Department of Mathematics, K.K College of Engineering &  Management, Biharsharif, Nalanda, India.
 
Acknowledgments
 Conflicts of interest
References
  
  
  ©2016 Shanker. This is an open access article distributed under the terms of the, 
 which 
permits unrestricted use, distribution, and build upon your work non-commercially.