
 
 
Research Article Volume 7 Issue 2
     
 
	On the variation of the probability distribution of the future life–time: a case of the kenyan mortality experience
 Richard Onyino Simwa  
    
 
   
    
    
  
    
    
   
      
      
        
        Regret for the inconvenience: we are taking measures to prevent fraudulent form submissions by extractors and page crawlers. Please type the correct Captcha word to see email ID.
        
        
 
 
 
          
     
    
    
    
    
    
        
        
       
     
   
 
    
    
  
University of Nairobi, Kenya
Correspondence: Richard Onyino Simwa, University of Nairobi, School of Mathematics, PO Box 3019700100, Nairobi, Kenya, Tel 2547 2277 1902
Received: March 09, 2018 | Published: April 9, 2018
Citation: Simwa RO. On the variation of the probability distribution of the future life–time: a case of the kenyan mortality experience. Biom Biostat Int J. 2018;7(2):141-145.  DOI: 10.15406/bbij.2018.07.00202
 Download PDF
        
       
 
 
Abstract
  A  life table is an essential tool for valuing life insurance policies and it presents  the probability distribution of the future life–time of a group of lives at the  various ages. They are developed by the experts with actuarial knowledge. The  life table will vary with the group of lives considered in the mortality  investigation. Further the variation may also prevail when the same group of  lives is investigated at different time periods, due to the effect of  generational change in mortality. In this paper we apply statistical inference  on published life tables for the Kenyan mortality experience for the mortality investigations  performed during two separate disjoint time periods, to investigate significance  of the variation in the mortality as the periods of the investigation vary. It  is shown that the variation in the probability distribution of the future life–time  for the Kenyan mortality experience is significant. Thus we confirm, as known  in practice by the actuaries, that there is a need for continuous mortality  investigations and the construction of the corresponding life tables, every  after some time interval, to account for the variation in mortality as  generations vary.
  Keywords: life table,  curtate future life–time, probability mass function, cumulative distribution  function, goodness–of–fit tests, kenyan mortality experience
  
Introduction
  Consider  a life aged x and let K(x) denote the curtate future life–time of the life,  that is the time left before he or she dies, in completed years, see Gerber.1 Then K(x) is a discrete random  variable with K(x)= 0, 1, ..  where 
 is the assumed limiting age, such that is the maximum age a life can live and the operation  [.] refers to 'integer part' of the object. The probability mass function for  K(x), P(K(x)=k), k=0, 1,2,… ,.. can be derived from a life table as noted in Section  2. Let (r) denotes the corresponding distribution function.  To compare two separate mortality experiences, letdenote  the distribution function corresponding to the other experience. Then to  compare the distributions, the hypotheses are given by.
  
  Versus
  
  The  non–parametric goodness–of–fit tests can be used to carry out the test given  appropriate data, Gibbon.2 However, as in Benjamin  et al.3 given the relevant life tables, we have the  equivalent hypothesis test, namely to test
  The  Life table corresponding to (r) is the same as the Life table corresponding to (for  all real values, r)
  Versus
  
    Tests  for versus can  be performed using available published life tables for a mortality experience of  a given population, Benjamin et al.3  and Scott.4  These are goodness–of–fit tests, which include the chi–square test, and the  other specific goodness–of–fit tests that address the limitations encountered  when the former test is applied.3,4  The specific tests include the standardized deviation test, the ordinary signs test and the cumulative deviations test. In the methodology section these tests are discussed. Results on the goodness–of–fit tests then follow. Conclusion and recommendation from the study are discussed in the paper.
  
	
Methodology
  Consider  a life table, Gerber.1 The table has  two main columns: ‘Age, x’ (column 1) and ‘Number of lives aged x,'(column  2). A third column for ‘Number of deaths aged x ’,  may also be included in the table. However the  third column can be derived from the second column. Thus the life table  functions, and  form the basis of a life table. Note  that the and  the  functions of the age x, are standard life  table discrete functions applied in actuarial science to denote the number of  lives and deaths at age x, respectively, in a life table.
  Let  K(x) denote the curtate (completed years) future life – time for a life aged x  , last birthday. Then clearly K(x) = 0, 1, 2, …. Which is a random variable, at any age x, with the probability  mass function given by Gerber.1
   
  Thus  the cumulative distribution function for K(x), say(y) is given by 
  
  
  The null hypothesis
  For  two cohorts of lives their corresponding mortality experiences can be  represented by the random variables K(x) and K’(x) with cumulative  distributions,(y) and (y)  respectively. To test for equality of the distributions, the null hypothesis is  given by
   ,  for all values of y.
  We  note, from Equation (2.2), that this hypothesis is equivalent to the hypothesis
  The  life table for K(x) is the same as the life table for K’(x)
  To  testagainst  an alternative we use the approach for comparing a standard life table with  another life table as in Benjamin et al.3  and Scott.4
  Note  that we can have the null hypothesis stated as follows;
   Underlying  mortality rates of each age x for the experience is the same as the corresponding  rates in the standard table.
  The  alternative hypothesis is given by
  NOT 
  The  following assumptions are made in the study:
  
    - The  tests are carried out on data for the age range 24 to 65 years in order to make  the analysis consistent with the nature of the data. This is based on the  general assumption that a majority of the population only purchase life  assurance products after 24 years, the average age at entry into employment,  and before 65 years, which is the average retirement age.5
 
    - We  test the hypothesis at 5% level of significance.
 
  
 Statistical tests for goodness–of–fit 
  The  differences in mortality experiences can be investigated by carrying out the  following statistical for a given level of significance. 
  
    - The  Chi–Square Test
 
    - The  Individual Standardized Deviations (ISD) Test
 
    - The  Ordinary Signs Test
 
    - Cumulative  Deviation Test
 
  
 The chi–square  test: The chi–square  statistic also known as the overall test may be defined as:  
  
  This  is the sum of the squares of the standardized deviations of the probability of  death at each age x. 
  
    - Lives  (lx) are independent with respect to  mortality experience
 
    - The  expected number of deaths at each age is large (>5)
 
  
  Where  Actual, denoted by A = the number of deaths, dx,  in the table 2007–2010. Expected, denoted by E =the number of deaths, dx, in the table 2000–2003. The chi–square  statistic has n degrees of freedom, where n is the total number of ages which  in this case n=42. Let the confidence interval be 95%, therefore, the null  hypothesis is rejected if the calculated value of the test statistic is greater  than the tabulated chi–square value at level of significanceIf  the computed chi–square statistic is large, we conclude that there is  significant difference between the observed and the expected number of deaths. The  chi square test has various limitations.3,4  It fails to detect the following.
  
    - A  number of excessively large of deviations neutralized by an excessively large  number of small deviations. This can be remedied by the individual standardized  deviations test.
 
    - The  signs of the deviations which can be remedied by doing a signs test
 
    - Any  patterns of deviation signs. This can be remedied by the runs test.
 
    - A  large cumulative deviation over the age range. This can be remedied by the  cumulative deviations test.
 
  
  The  following discussion on these specific goodness–of–fit tests follow, as in Benjamin et al.3 and Scott.4
  Individual  standardized deviations (ISD) test: The ISD test is  used to correct the defect of the Chi–Square Test whereby it fails to detect a  number of excessively large deviations. Moreover, the ISD test seeks to  determine whether the observed pattern of the individual standardized  deviations is consistent with a standard normal distribution. The assumption  made when using the ISD test is that the normal approximation is a suitable  approximation at all ages. We test the null hypothesis that the pattern of the  ISDs follows the standard normal distribution. First, the standardized  deviations Zx for each age  x were obtained, where each Zx =  (A – E) /E, where A and E are as defined in Section 4.2.1, at age x. The real  line is divided into eight intervals:
  (–,  –3), (–3, –2), (–2, –1), (–1, 0), (0, 1), (1, 2), (2, 3), (3,)
  The  count of the number of standardized deviations, ,  which fall into each interval is noted. To obtain the number of deviations  which should fall into the “expected” category, we use a general rule for the  ISD test, that the expected number of deviations should not be less than 5 in  each interval. One is therefore required to sometimes pool data of adjacent  cells in the table to achieve this requirement (Table 1).
  The  statistic can be obtained as follows.
    
      Interval  | 
      (-, -3)  | 
      (-3, -2)  | 
      (-2, -1)  | 
      (-1, 0)  | 
      (0, 1)  | 
      (1, 2)  | 
      (2, 3)  | 
      (3,)  | 
    
    
      % of Expected Values  | 
      0  | 
      2  | 
      14  | 
      34  | 
      34  | 
      14  | 
      2  | 
      0  | 
    
  
  Table 1  Standard normal distribution, area (in percentage) under the curve for given intervals
 
 
 
 
  This  statistic will have (r – 1) degrees of freedom where r = number of groups.  There ought to be a nearly equal number of positive and negative values. An  excess of either positive or negative values shows that the data is skewed and  has a positive bias or a negative bias, respectively. Further, if the  standardized deviations do not adhere to a normal distribution, then the  mortality experiences are different.
  Ordinary signs  test: The ordinary  signs test is an overall test for bias to test whether the mortality rates are  too high or too low. It deals with the defect of chi–square test where it fails  to detect an imbalance between the number of positive and negative deviations. We  expect that if the mortality rates of the two tables are consistent then  roughly, the signs of the deviations have a binomial distribution with  parameters n (the total number of ages being considered) and 0.5. This is a two–sided  test since we consider both the negative and positive signs. If the number of  deviations of either sign is very large compared to the other, then we conclude  that the rates are biased. Let K be the number of positive deviations. Under  the null hypothesis K~ binomial (n, 0.5). 
  If  n (the total number of ages being considered) is large (>20), we use the  approximation
  K~ normal (n/2, n/4)
  The  limitation of signs test is that it is qualitative and not quantitative.
  Cumulative  deviations test: This method  deals with the failure of the chi–square test to detect a large cumulative  deviation over the age range. It detects overall bias or log runs of deviations  that have the same sign. The assumption of this test is that the normal  approximation is reasonable at all ages. The null hypothesis is that the  mortality rates are not biased or that the variance is not greater than  expected.
  Test Statistic:
   
  This  is a two–tailed test. We test at 5% level of significance. If the absolute  value of the calculated statistic is greater than 1.96, we reject the null  hypothesis and conclude that the mortality rates are such the two sets of data  are significantly different.
  
	
Data, computations and results
  Data  
  The  data are the secondary data which appear in the published Kenya Mortality  Tables, namely the 2000–2003 Kenyan Mortality Life Table and the 2007–2010  Kenyan Mortality Life Table appear in Appendix A1, as given in AKI (2018). Published  life tables for a given group of lives are readily available in the public  domain in most economies where they exist. They are useful in actuarial and  demographic applications, among others.
  Exploratory data  analysis 
  The  graphs in Figure 1 and Figure 2 reveal existence of some differences in the mortality rate for the two  separate mortality investigations. The significance of the difference is  studied in Section 5.2 (Figure 1).
  Figure 1  shows that there is a general shift of the curve to the right as we move from  the 2000–2003 life table mortality to the 2007–2010 life table mortality and as  we move from men mortality to female mortality rates. This implies an  improvement in mortality, that on average, lives have a higher chance of  surviving at each age, for the age groups affected. The mortality for men in  the 2000–2003 life table remained the higher across all the ages and increases  sharply at age 60, compared to the 2007–2010 life table mortality experience.  The mortality for women in years 2007–2010 was higher than that of 2007–2010  life table male mortality and 2000–2003 life table female mortality between  ages 56 and 80 but lower from age 80. The graphs in Figure 2 further confirm the observations from Figure 1,  emphasizing the age group 20 to 60 years (Figure 2).  We test the significance of these differences using the statistical tests in  the following section.
  
  
Figure 1 A plot of mortality rate against age for male and female population, age range 20-100.
 
 
  
Figure 2 A plot of mortality rate against age for male and female population, age range 20 to 60.
 
 
 
  
  
  
  Results on the goodness–of–fit tests
  Results  are based on the application of the approaches in Section 2 on the Kenyan  mortality experience secondary data available in form of the published  mortality life tables, AKI (2018). Four test statistics, namely the chi–square  test, the individual standardized deviation (ISD) test, the ordinary sign test  and the cumulative deviations test are applied. These tests are considered  in Section 5.2.1, Section 5.2.2, Section 5.3.3 and Section 5.3.4 respectively.  We consider, in each case, the male lives and the female lives mortality  experience separately.
  Chi–square test: The tabulated value of the Chi–Square Test  (CST) statistic at 0.05 level of significance when the degrees of freedom  (n=42) is The  computed values are as determined in from Table A1–1  and Table A1–2 in the Appendix.
  Mortality  experience for male lives (CST): From Table A1–1,  we can easily compute
  =  2272.17
  Hence  computed  value = 2272.17
  Mortality  experience for female lives (CST): From Table A1–2,  we can easily compute  
  =  3223.32
  Hence  computed  value = 3223.32. The computed  value for both males and females  mortality data are greater than the tabulated value. Hence there is sufficient  evidence to reject the null hypothesis and thus there could be a significant  difference between the two mortality experiences for either male and female  lives.
  Individual standardized  deviations (ISD) Test
  Mortality  experience for male lives (ISD) (Table 2)
    
      Range  | 
      Actual  | 
      Combined actual (A)  | 
      Expected  | 
      Combined expected (E)  | 
      A-E  | 
      (A-E)²   | 
      {(A-E)²/E}  | 
    
    
      - to -3  | 
      5  | 
       | 
      0  | 
       | 
       | 
       | 
       | 
    
    
      -3 to -2  | 
      2  | 
       | 
      1.6  | 
       | 
       | 
       | 
       | 
    
    
      -2 to -1  | 
      1  | 
      8  | 
      11.2  | 
      12.8  | 
      -4.8  | 
      23.04  | 
      1.8  | 
    
    
      -1 to 0  | 
      2  | 
      2  | 
      27.2  | 
      27.2  | 
      -25.2  | 
      635.04  | 
      23.3471  | 
    
    
      0 to 1  | 
      1  | 
      1  | 
      27.2  | 
      27.2  | 
      -26.2  | 
      686.44  | 
      25.2368  | 
    
    
      1 to 2  | 
      1  | 
      31  | 
      11.2  | 
      12.8  | 
      18.2  | 
      331.24  | 
      25.8781  | 
    
    
      2 to 3  | 
      2  | 
       | 
      1.6  | 
       | 
       | 
       | 
       | 
    
    
      3 to  | 
      28  | 
       | 
      0  | 
       | 
       | 
       | 
      ( TOTAL)  | 
    
    
       | 
       | 
       | 
       | 
       | 
       | 
       | 
      76.2619  | 
    
  
  Table 2  Computation of the  statistic for ISD test, male lives
 
 
 
  Mortality  experience for female lives (ISD): The computed value of the Chi–square  statistic for the Males is 76.2619. The computed value of the Chi–square  statistic for the Females is 87.40625. The computed values in both cases exceed  the upper 95% of the tabulated value of a Chi–Square distribution with 3  degrees of freedom, which are 7.815. Thus, we can conclude that the data does  not conform to the standard normal distribution. Hence there is sufficient  evidence to reject the null hypothesis and thus there is evidence of significant  difference between the two mortality experiences for either male and female  lives (Table 3). 
    
      Range  | 
      Actual  | 
      Combined actual (A)  | 
      Expected  | 
      Combined expected (E)  | 
      A-E  | 
      (A-E)²   | 
      {(A-E)²/E}  | 
    
    
      -to -3  | 
      9  | 
       | 
      0  | 
       | 
       | 
       | 
       | 
    
    
      -3 to -2  | 
      0  | 
       | 
      1.6  | 
       | 
       | 
       | 
       | 
    
    
      -2 to -1  | 
      0  | 
      9  | 
      11.2  | 
      12.8  | 
      -3.8  | 
      14.44  | 
      1.12813  | 
    
    
      -1 to 0  | 
      0  | 
      0  | 
      27.2  | 
      27.2  | 
      -27.2  | 
      739.84  | 
      27.2  | 
    
    
      0 to 1  | 
      0  | 
      0  | 
      27.2  | 
      27.2  | 
      -27.2  | 
      739.84  | 
      27.2  | 
    
    
      1 to 2  | 
      1  | 
      33  | 
      11.2  | 
      12.8  | 
      20.2  | 
      408.04  | 
      31.8781  | 
    
    
      2 to 3  | 
      0  | 
       | 
      1.6  | 
       | 
       | 
       | 
       | 
    
    
      3 to   | 
      32  | 
       | 
      0  | 
       | 
       | 
       | 
      (TOTAL)  | 
    
    
       | 
       | 
       | 
       | 
       | 
       | 
       | 
      87.4063  | 
    
  
  Table 3  Computation of the 
 statistic for ISD test, female lives
 
 
 
  Ordinary signs  test
  The  ordinary signs test (OST) addressed the defect of the Chi–Square test where it  failed to detect an imbalance between the number of positive and negative  deviations observed in the mortality data for assured lives in Kenya. Since the  total number of ages under observation (n=42) is large, a normal distribution  approximation is assumed for the number of positive deviation, to be denoted by  K. Thus, K ~ Normal (21, 10.5)3,4. That is the statistic K has a normal  distribution with mean = 21, and variance = 10.5.
  Mortality  experience for male lives (OST): The number of positive deviations, from Table A1–1, we note that K =32.
  To  standardize this value:
  
  
      Mortality experience  for female lives (OST): The number of  positive deviations from Table A1–2, can easily  be computed to obtain K =33.
  To  standardize this value:
  
  The  ordinary signs test is a two–tailed test. This means that the tabulated value  required is the table value for the standardized normal distribution, at a  level of significance of 0.025 since the level of significance from  the standard normal tables, the table value, denoted by ,  is given by .  Thus by comparison, the Z values computed for both Males and Females mortality  data are larger than the tabulated value at 0.05 level of significance. There  is therefore sufficient evidence to reject the null hypothesis. This means that  there is an imbalance between positive and negative deviations. It can be  concluded, from these results, that the mortality rates being compared are  significantly different in each case for males and females. 
  Cumulative  deviations test
  The  cumulative deviations test (CDT) corrects the failure of the chi–square test to  detect a large cumulative deviation over the age range. Moreover, the test is a  good measure of overall bias.
  Mortality  experience for male lives(CDT): From Table A1–1,  we compute the standardized cumulative deviation value for males data,
   
Mortality  experience for female lives (CDT): From Table A1–2,  we compute the standardized cumulative deviation value for females data, 
  
  The  cumulative deviations test is a two–tailed test. This means that the tabulated  value required is the table value for the standardized normal distribution , at  a level of significance of 0.025 since the level of significance α = 0.05. From  the standard normal tables, the table value, denoted by ,  is given by . Thus  by comparison, the computed value for both mortality for male and mortality for  female lives are larger than the tabulated value at 0.05 level of significance.  There is therefore sufficient evidence to reject the null hypothesis. This  means that there is an imbalance between positive and negative deviations. Thus  the mortality rates being compared are significantly different.
  
	
Conclusion and recommendation
  All  the tests applied in Section 3 lead to rejection of the null hypothesis that  the 2000–2003 Kenya mortality life table and the 2007–2010 Kenya mortality life  table are similar, and this imply that there is significant variation in the  mortality experience underlying the two life tables. Hence the distribution of  the future life–time varies with the period of the mortality investigation for the  Kenyan mortality experience. Thus we confirm and recommend that for the life  tables to remain relevant they should be developed continuously by having mortality  investigations carried out every after some appropriate interval of time.6
  
	
Acknowledgement 	
 
	
Conflict of interest statement
  Author declares that there is no conflict of  interest.
  
	
References
  
      - Gerber  HU. Life Insurance Mathematics. 3rd ed. Springer: Swiss Association of  Actuaries; 1997:18‒20.
 
      - Gibbons  JD. Non-Parametric Statistical Inference.  New York: Graw-Hill Book Company; 1971:(4):68‒73.
 
      - Benjamin  B, Pollard JH. The Analysis of Mortality and Other Actuarial Statistics. 2nd ed. London: William Heinemann Ltd; 1980:222‒236.
 
      - WF Scott. Mortality  Studies. Aberdeen: University of Aberdeen; 2000:81‒83.
 
      - Brown  RL. Issues in the Modelling of Mortality at Advanced Ages. Institute of  Insurance and Pension Research, University of Waterloo, Canada; 1997.
 
     - Report  on Kenyan Mortality Life Tables. Association of Kenya Insurers (AKI); 2018.
 
 
     
  
 
 
  
  ©2018 Simwa. This is an open access article distributed under the terms of the, 
 which 
permits unrestricted use, distribution, and build upon your work non-commercially.