Research Article Volume 8 Issue 2
Department of Statistics, Jimma University, Ethiopia
Correspondence: Reta Habtamu Bacha, Department of Statistics, Jimma University, Jimma, Ethiopia
Received: December 29, 2018 | Published: April 30, 2019
Citation: Bacha RH, Tadesse M. Bayesian generalized linear model for identifying predictors of child nutritional status in Ethiopia. Biom Biostat Int J. 2019;8(2):65-74. DOI: 10.15406/bbij.2019.08.00273
Background: In Ethiopia, child malnutrition is one of the most serious public health problem and the highest in the world. The overall prevalence of underweight among children under five years old in Ethiopia was 25% and 7% are severely and moderate underweight in 2014 respectively. Malnutrition in sub-Saharan Africa contributes to high rates of childhood morbidity and mortality. However, little information on the nutritional status of children is available from informal settlements. The primary aim of this study was then to determine the determinants of children malnutrition in Ethiopia using Bayesian generalized linear model. The overall prevalence of underweight among children in Ethiopia was 36.4.
Methods: Data was obtained from 2011 Ethiopian Demographic and Health Survey (EDHS). Bayesian Generalized regression model was used to identify the effects of selected socioeconomic, demographic, health and environmental covariates. Bayesian approach with Markov Chain Monte Carlo (MCMC) technique was used.
Results: The analysis result revealed that out of the 11, 654 number cases examined in this study 32.8% of male children were underweight and 33.6% of female children were underweight. It was found that the covariates succeeding birth interval, sex of child, child by choice not by chance, vaccination and cough were the most important determinants of children nutritional status in Ethiopia.
Conclusion: Inference is the fully Bayesian and classical generalized linear model based on recent Markov chain Monte Carlo techniques.Some of the socioeconomic, demographic and environmental determinants included in the study were found to be statistically significant. The result was suggested that for reducing childhood malnutrition, due emphasis should be given in improving the knowledge and practice of parents on appropriate young child feeding practice and frequent growth monitoring together with appropriate and timely interventions.
Keywords: nutrition, weight-for-age, malnutrition, anthrometric measurement, mcmc, bayesian generalized linear model
Source of data
This study is based on the 2011 Ethiopian Demographic and Health Survey (EDHS) data obtained from the Central Statistics Agency of Ethiopia. In this study, to assess the nutritional status of children, we have three anthropometric indices: weight-for-age, height-for-age and weight-for-height which are considered as the indicators of nutritional status.
Variables of the study
In this study socio-economic, demographic, health and environmental characteristics are to be proximate determinants of nutritional status. In our application on childhood nutritional status the variables are Weight-for-age (underweight) which is measured by Z-score and body mass index(BMI). Body mass index is a measure of weight adjusted for height. It is calculated as weight in kilograms divided by the square of height in meters.8 These variables are continuous. Since the nutritional status of children(weight-for age) and body mass index are affected by a number of factors, the predictor variables to be analyzed in this study as determinants of nutritional status of children are grouped as socio-economic, demographic, health and environmental factors. These factors include the child"s age, sex of child, birth order, preceding birth interval, duration of breast feeding, region, wealth index, mother education level, types of place of residence, number of household member, source of drinking water, ever had vaccination and children by choice not by chance.
UV-Vis analysis of products generated after Tubantin Brown GGL treatment
Decolorization percentage of TB GGL after treatment by binary mixture (H. desertis G11 and M. trichothecenolyticum TL13) was analyzed by UV-Vis spectrophotometer (Shimadzu UV-1800 PC model Kyoto, Japan) by scanning centrifuged cell-free supernatant samples in the range 200–800 nm. The supernatant used was obtained after centrifugation 10000 rpm for 15 min at 4°C.
As _1,_2..._j p(y)= is fixed and constant.
Thus the posterior distribution can be expressed as:
(1)
Where is likelihood function, is posterior distribution, is prior distribution in this, we will analyze the effects of the different types of covariates on the response variables body mass index and nutritional status by using weight-for-age (underweight) Z-score as continuous variable with the assumption that each of the covariates has a linear effect on the response variable. Since we assume each covariate has a linear effect the approach we follow in this step is Bayesian Gaussian linear regression parametric approach.Bayesian generalized linear regression model
When the effect of covariates on the response variables is linear we follow Bayesian Gaussian linear regression parametric approach. One of the most popular and useful tools in data analysis is the linear regression analysis model. Consider the normal linear regression model in which a response variableis related to one or more explanatory or predictor variables . For a random sample of n individuals, the model becomes:
(2)
where and . We typically assume that is normally distributed with mean 0 and variance , and and are independent of one another for i.e . In the Bayesian approach probability distributions are used to quantify uncertainty. Thus, in contrast to the frequentest approach, a joint probability model for response and parameters is specified. The distribution of the dependent variable is specified conditional on the parameters ( and )[8]. The analysts certainty or uncertainty about the parameter before the data analysis is represented by the prior distribution for the parameters . After observing the sample data , the prior distribution is updated by the empirical data applying Bayes theorem, as in equation (1) and regression model in (2) we obtain equation (3).(3)
Yielding the so called posterior distribution of the parameters . For our specific application to multiple linear regression, we use a noncommittal, vague prior. For each regression coefficient, the prior distribution is a very broad normal distribution, with a mean of zero and a standard deviation that is large enough 1000.Prior distribution
In Bayesian inference, the unknown function , j =1,...,p , the fixed effects parameters ,aswell as the variance parameterar are considered as random variables and have to be supplemented by appropriate prior assumptions. In the absence of any prior knowledge we assume independent diffuse priors , j=1,...,r for the parameters of fixed effects. In this study, for the continuous covariates child age in month and Mother age at first birth those assumed to have a possibly nonlinear effect on the Z-score are modeled non-parametrically as cubic P-spline with second order random walk priors.
The general form of the priors
Suppose thatis the vector of corresponding function evaluations at observed values of x. Then, the general form of the prior for is
(4)
where, is a penalty matrix that penalizes too abrupt jumps between neighboring parameters. In most cases will be rank deficient; therefore the prior for would be improper. We choose an inverse gamma distribution with hyperparamters a and b, i.e.with probability density function given by:
Common choices for and are and (or ). Alternatively, one may take .
Priors for fFixed effects
As indicated above for the parameter vector of fixed effects we choose a diffuse prior , j=1,...,r Another choice would be to work with a multivariate Gaussian distribution . In this study, diffuse priors will be used for the fixed effects.
Priors for continuous effects under linear setup
Priors for the unknown functions depend on the type of the covariates and on prior beliefs about the smoothness of . In the following we express the vector of function evaluations of a function as the matrix product of a design matrix and a vector of unknown parameters s", i.e .First and second order random walk priors
Let us consider the case of a continuous covariate x with equally spaced observations , i = 1,..., m, , then, defines the ordered sequence of distinct covariate values. Here m denotes the number of different observations for x in the data set. A common approach in dynamic or state space models is to estimate one parameter for each distinct ; i.e., define, and let denote the vector of function evaluation.Bayesian P-splines
Any smoother depends heavily on the choice of smoothing parameter, and for P spline in a mixed (fixed and continuous) framework. A closely related approach for continuous covariates is based on the P-splines approach introduced by Eilers and Marx (1996). The basic assumption of this approach is that the unknown functionca be approximated by a spline of degree l with equally spaced knots within the domain of .
Posterior inference
In this case, the specific form of the posterior depends only on the parameterization of the regression terms in the model. Then, we use Markov Chain Monte Carlo (MCMC) simulations to draw samples from the posterior and statistical inference is done by means of Markov Chain Monte Carlo techniques in a full Bayesian setting. Now we restrict the presentation to models with predictor. Full Bayesian inference is based on the entire posterior distribution. Letbe the vector of all unknown parameters, then the posterior is given by.
In a fully Bayesian approach, parameter estimates are generated by drawing random samples from the posterior via MCMC simulation techniques. The variance parameters can be estimated simultaneously with the regression coefficients by assigning additional hyper-prior to them. The most common assumption is, that the are independently inverse gamma distributed, i.e. , with hyper-parameters and specified a prior. A standard choice is to use
Model diagnostic
Once a model has been developed, we now would like to know how effective the model is in describing the outcome. This is referred to as goodness of fit. The most common ways of checking goodness of fit are: diagnosis for convergence and mixing and posterior-predictive check. For convergence tests, the researcher used time series plot.
Model selection and comparison
Model selection is to select the best model among several choices based on evaluation of the performance of the models. One immediate question is how to measure goodness of fit? There are many choices. The simplest one is residual sum of squares.9
Where is the observed responses and is the fitted value. This measure is also called mean squared error, and commonly used as the default measure. For GLM, deviance is a quality of fit for a model that depends on the likelihood. The researcher skip the details and the underlying theory of the definition, but pointing out that the statistical software can compute the quantity automatically. The next question is why we should consider model selection? There are several reasons. First, people tend to believe or can understand simpler models with fewer predictors and less complicated structure. Second, one can certainly add more and more features into the model without screening and get better and better fit, till perfect fit, but the problem is over fitting. Remember that the researcher want to find the best-predicting model not the best fitting model. In this study, AIC(Akaike Information Criterion), BIC(Bayesian Information Criterion), CV(Cross-Validation) and DIC (The deviance information criterion) are used to compare the linear frequent and the linear Bayesian approach with their smallest values to be better model. In this study, authors used R software(version 3.4.0) to analyze the determinant of factors under five years children nutritional status in Ethiopia.
Descriptive analysis of the data
As one can see from Table 1, among the cases examined in this study of male children were underweight and of female children were underweight. The highest prevalence of child malnutrition was observed among children whose preceding birth interval was less than 24 months ( 35%) unlike to the lowest prevalence of child malnutrition which was recorded from children whose preceding birth interval is 48 and above (31.9% ). The proportion of underweight children as we can seen from Table 1, it is also differs by type of place of residence: urban and rural. Accordingly, higher numbers of underweight children (33.6% )) reside in rural areas, and relatively less numbers of underweight children (31.2% ) reside in urban centers. The evidence given in Table 1 shows that the percentage nutritional statuses of children who had diarrhea recently seem to be high probability of underweight than those children which had no diarrhea recently. Likewise, one can observe that the percentages nutritional status of children who had fever recently seemed high probability of underweight than those children who have no fever recent (33.9% ,33.1% , respectively). The evidence given in Table 1 shows that the percentages nutritional status of children in households who had used other water sources seemed to be more underweight than those children in households which used water from public source. The family education level has also it"s effect on the nutritional status of children that a mother with higher educational level(father education level) had a child with better nutritional status. Since the percentages of underweight is decreasing from no education to secondary and above education level. Children who had disease are affected by malnutrition e.g child who had cough is more affected than child who had not cough as we observe from Table 1.
Covariates |
Categories |
Stunted |
Wasted |
Underweight |
Region |
Tigray |
379(34.4%) |
351(31.8%) |
373(33.8%) |
Affar |
358(36.5%) |
272(27.6%) |
355(36.0%) |
|
Amhara |
420(35.6%) |
361(30.6%) |
398(33.8%) |
|
Oromia |
540(34.2%) |
492(31.2%) |
546(34.6%) |
|
Somali |
377(42.3%) |
243(27.3%) |
271(30.4%) |
|
Benishangul-Gumuz |
294(32.8%) |
304(34.0%) |
297(33.2%) |
|
SNNP |
516(35.5%) |
466(32.0%) |
473(32.5%) |
|
Gambela |
273(37.4%) |
228(31.3%) |
228(31.3%) |
|
Harari |
241(40.3%) |
176(29.4%) |
181(30.3%) |
|
Addis Ababa |
143(39.0%) |
103(28.1%) |
121(33.0%) |
|
Dire Dawa |
226(35.8%) |
191(30.2%) |
215(34.0%) |
|
Place of residence |
Urban |
697(39.8%) |
507(29.0%) |
547(31.2%) |
Wealth index |
Poorest |
1172(36.4%) |
950(29.5%) |
1100(34.1%) |
Poorer |
663(35.1%) |
607(32.1%) |
621(32.8%) |
|
Middle |
565(33.4%) |
536(31.7%) |
589(34.9%) |
|
Richer |
602(35.7%) |
537(31.8%) |
548(32.5%) |
|
Richest |
765(39.8%) |
557(29.0%) |
600(31.2%) |
|
Sex |
Male |
1918(36.2%) |
1646(31.0%) |
1741(32.8%) |
Female |
1849(36.3%) |
1541(30.2%) |
1717(33.6%) |
|
Children by choice |
No |
3027(36.1%) |
2559(30.5%) |
2798(33.4%) |
Yes |
735(36.4%) |
628(31.1%) |
659(32.6%) |
|
Mother edu.level |
No education |
2665(36.6%) |
2187(30.0%) |
2427(33.3%) |
Primary |
900(34.5%) |
833(31.9%) |
875(33.1%) |
|
Secondary and above |
202(38.5%) |
167(31.8%) |
156(29.7%) |
|
Father edu.level |
No education |
1903(35.8%) |
1604(30.1%) |
1816(34.1%) |
Primary |
1339(35.5%) |
1193(31.6%) |
1244(32.9%) |
|
Secondary and above |
442(39.2%) |
338(30.0%) |
347(30.8%) |
|
Drinking water |
Public |
1607(36.7%) |
1324(30.2%) |
1447(33.1%) |
Others |
2158(35.8%) |
1862(30.9%) |
2010(33.3%) |
|
Ever had vaccination |
No |
881(38.9%) |
675(29.8%) |
708(31.3%) |
Yes |
1990(35.6%) |
1693(30.3%) |
1912(34.2%) |
|
Rural |
3070(35.4%) |
2680(30.9%) |
2911(33.6%) |
|
Had diarrhea recently |
No |
3221(36.6%) |
2709(30.8%) |
2867(32.6%) |
Yes |
530(33.3%) |
475(29.8%) |
588(36.9%) |
|
Had child fever |
No |
3028(36.3%) |
2548(30.6%) |
2757(33.1%) |
Yes |
719(35.3%) |
627(30.8%) |
691(33.9%) |
|
Had cough recently |
No |
3029(36.5%) |
2507(30.2%) |
2757(33.2%) |
Yes |
723(34.6%) |
674(32.2%) |
695(33.9%) |
|
Preceding birth interval |
less than 24 |
1022(36.6%) |
825(29.5%) |
969(35.0%) |
24-48 |
999(37.0%) |
839(31.1%) |
945(33.8%) |
|
Above 48 |
946(34.2%) |
853(30.8%) |
861(31.9%) |
|
Breast feeding |
Never breastfed |
61(39.1%) |
45(28.8%) |
50(35.1%) |
Ever breastfed(not cur) |
1983(36.0%) |
1689(30.7%) |
1832(32.3%) |
|
Still breast feeding |
1700(36.1%) |
1442(30.6%) |
1570(30.3%) |
Table 1 Summary of covariates with the response variable (weight-for-age)
From Table 2 the highest malnourished region are SNNP, Oromia, Gambela, Affar and Tigray. From all of these region the smallest malnourished region is Harari. The place of residence has also its impact as result the highest malnourished lives in rural comparing with urban.
Covariates |
Categories |
malnourished |
well-nourished |
Region |
Tigray |
550(49.7%) |
556(50.3%) |
Affar |
494(49.8%) |
498(50.2%) |
|
Amhara |
558(47.3%) |
622(52.7%) |
|
Oromia |
795(50.1%) |
793(49.9%) |
|
Somali |
423(47.3%) |
471(52.7%) |
|
Benishangul-Gumuz |
472(52.6%) |
426(47.4%) |
|
SNNP |
782(53.5%) |
680(46.5%) |
|
Gambela |
329(44.9%) |
403(55.1%) |
|
Harari |
249(41.6%) |
349(58.4%) |
|
Addis Ababa |
157(42.7%) |
211(57.3%) |
|
Dire Dawa |
311(49.0%) |
324(51.0%) |
|
Place of residence |
Urban |
802(45.6%) |
955(54.4%) |
Rural |
4318(49.7%) |
4378(50.3%) |
|
Wealth index |
Poorest |
1610(49.8%) |
1621(50.2%) |
Poorer |
925(48.5%) |
982(51.5%) |
|
Middle |
867(51.1%) |
830(48.9%) |
|
Richer |
834(49.4%) |
855(50.6%) |
|
Richest |
884(45.8%) |
1045(54.2%) |
|
Sex |
Male |
2654(49.8%) |
2664(50.2%) |
Female |
2463(48.0%) |
2673(52%) |
|
Source of drinking water |
public |
2141(48.7%) |
2252(51.3%) |
others |
2975(49.1%) |
3081(50.9%) |
|
Had diarrhea |
No |
4290(48.6%) |
4543(51.4%) |
Yes |
51.5(823%) |
775(48.5%) |
|
Had child fever |
No |
4064(48.6%) |
26644301(51.4%) |
Yes |
1037(50.7%) |
1009(49.3%) |
|
Had cough recently |
No |
4037(48.5%) |
4285(51.5%) |
Yes |
1076(51.1%) |
1028(48.9%) |
|
preceding birth interv |
less than 24 |
1407(50.1%) |
1400(49.9%) |
24-48 |
1319(48.8%) |
1386(51.2%) |
|
above 48 |
1394(46.1%) |
8613(53.9%) |
|
Duration of bf |
never breastfed |
81(50.9%) |
78(49.1%) |
ever bf, not currently |
2634(47.6%) |
2897(52.4%) |
|
still bf |
2281(5.8%) |
2441(54.2%) |
Table 2 Summary of Covariates with the response variable(Body mass index)
Bayesian generalized linear regression model
Results of generalized linear regression analysis weight-for-age revealed that, the intercept is estimated to be -692.832. It can be interpreted as the value you would predict for weight for age if all predictor variables are set to minimum. We would expect an average Z-sore value of approximately -692.832 for weight for age (under weight). Results of generalized linear model analysis weight for age shows that sex of child is significant(estim -145.487, p-value 0.0315)which indicate that, Weight-for-age(underweight) of a female is lowered by 145.487 as compared to a male child. Children by choice not by chance is significant( estim -214.050, p-value 0.0432) that is Underweight of children by choice not by chance is lowered by 214.0.050 as compared to children by chance. Mother education level at secondary and above is significant level. And it is positively related to weight-for-age of children (estim 838.0805, p-value 0.0873). Weight-for-age declines with increasing mother education level. Ever had vaccination is significant(estim -194.487, p-value 0.0270). Weight-for-age of children who had ever vaccination is lowered by 194.487 as compared to no had never vaccination. Had cough is significant. Weight-for-age of children who had cough is increased by 282.807 as compared to who had no cough. Duration of breast feeding who had ever breasted, not currently breastfeeding is positively related to weight-for-age (etim 314.196, p-value=0.0655), significant at level. Weight-for-age declines with increase in duration of breast feeding of a child. Results of generalized linear model for frequentist approach for body mass index shows that the intercept value is approximately -553.208 , the y-intercept, can be interpreted as the value you would predict for body mass index if all predictor variables are zero. We would expect an average value of approximately -553.208 for body mass index (malnourished). However, this is only a meaningful interpretation if it is reasonable that all the predictors equal to zero, and if the data set actually included values for all predictors that were near 0. If neither of these conditions are true, then really has no meaningful interpretation. It just shift the regression line in the right place. The current study revealed that, Child age in month is significant at level. Body mass index of child is negatively related to child age(estm 6.340, p-value 0.0812). Which implies as child age increases the body mass index is deceased. Succeeding birth interval is positively related to body mass index (etim 6.558, p-value=0.0722), significant at level. Body mass index increase with increasing in succeeding birth interval of a child. Sex of child (gender) is significant. Body mass index of a female is lowered by 140. as compared to a male child. Children by choice not chance is significant. Body mass index of child is negatively related to children by choice not chance. Child ever had vaccination is significant. Body mass index of child is negatively related to ever had vaccination. Child who had cough is significant at . Body mass index of child is positively related to child who had cough. Duration of beast feeding is significant. Body mass index of child is positively to duration of breast feeding.
Table 3 gives results for the fixed effects (categorical covariates) on the nutritional status of children under age five in Ethiopia. The output gives posterior means along with their standard deviations and credible intervals. Since the confidence interval do not include zero, sex of child, child by choice not by chance, ever had vaccination and had cough were found statistically significant at significance level. But, regions, mother education level, source of drinking water and duration of breast feeding were found statistically insignificant. One can observe that child who had vaccinated contributes to better nourishment for children under five age. In this study, the relative chance to be underweight for the children was found to decrease with the increase of vaccination for children. The children who had no vaccination were more likely to be malnourished as compared to children ever had vaccination. And also the analysis shows that female children are better nourished than male children. On the other hand, one can observe that underweight is higher for children who had cough than child who had no cough. So children who had cough should be treated at early stage. We can see the output matches up with frequentest(traditional) approach. As the sample size increase the result should converge to the same values. One of the benefits of the Bayesian perspective (for any analysis) is that it allows us to make credible interval statements. Credible intervals are similar to confidence intervals, but in the Bayesian framework, the interval REALLY IS believed to contain the true population parameter. For instance: a credible interval for a parameter being estimated is interpreted as, there is a probability that the actual parameter is included in that interval. This is because the interval is based on information from the posterior distribution, of for instance, one of the predictor"s coefficient posterior distribution. For example, we can say there is a probability that the true population value of the Child"s sex (Female) coefficient is between -271.255 and -11.280.
(Intercept) |
Mean |
Sd |
2.50% |
50% |
97.50% |
-37.017 |
172.848 |
-380.801 |
-35.513 |
296.638 |
|
as.factor (Region1) Tigray |
-235.028 |
136.785 |
-498.682 |
-236.257 |
35.935 |
as.factor (Schild) Female |
-140.934 |
66.113 |
-271.255 |
-141.446 |
-11.28 |
as.factor (Childbychoice) Yes |
-238.769 |
100.714 |
-443.776 |
-236.86 |
-47.297 |
as.factor (MEDL) Secondary and above |
829.133 |
443.42 |
-26.276 |
832.555 |
1689.62 |
as.factor (SDW) Others |
103.306 |
67.58 |
-26.675 |
103.055 |
236.557 |
as.factor (EV) Yes |
-180.055 |
80.879 |
-336.825 |
-181.197 |
-21.63 |
as.factor (HC)Yes |
217.882 |
93.165 |
38.254 |
219.401 |
397.688 |
as.factor (DBFa) Ever breastfed, not current |
276.466 |
164.331 |
-39.01 |
275.631 |
598.303 |
Table 3 Summary of Bayesian generalized linear model fixed effects estimation results of weight-for-age
Scale estimate: 2178410
N=1969 DIC=1986.847 pd=8.922508 AIC=28753.8
method=STEP family=gaussian iterations=10000 step=2
From Table 4 we also observe that number of significant variables are the same for weight for age and as the result those variables are affect body mass index. Body mass index of child sex(female) is lowered by -138.5870 comparing to male child which indicate the decrement of body mass index is highly observed in female children that can make female more susceptible to illness, impair the growth and development, cause infertility and the raise for different disease including death. Likewise from Table 4 child who had cough faces high body mass index comparing to child who had not cough and child ever vaccinated faces low body mass index. Which indicate only vaccination is not enough as result children should get sufficient macro nutrients. Generally, the Bayesian generalized linear model satisfy the normal assumption of the residual. the models in Bayesian approach MCMC post estimation diagnosis, it is also possible to extract sampling paths of parameters with function samples(), or to plot the samples directly. For instance, coefficient sampling paths for term sx(SBI) are displayed which is important to check the model convergence in the time series plot.
(Intercept) |
Mean |
Sd |
2.50% |
50% |
97.50% |
167.046 |
222.901 |
-262.467 |
168.528 |
601.062 |
|
Child age |
-6.1193 |
3.5448 |
-13.1032 |
-6.1044 |
0.7738 |
SBI |
7.0981 |
3.5063 |
0.3645 |
7.0083 |
14.1063 |
as.factor (Region1) Tigray |
-199.706 |
135.52 |
-475.312 |
-199.459 |
69.1336 |
as.factor (Region1) SNNP |
141.205 |
89.4776 |
-34.7017 |
143.12 |
317.365 |
as.factor (Schild) Female |
-138.587 |
65.7587 |
-268.593 |
-137.592 |
-11.8307 |
as.factor (Childbychoice) Yes |
-220.97 |
99.9777 |
-413.363 |
-221.502 |
-24.937 |
as.factor (MEDL ) Secondary and above |
772.072 |
441.918 |
-111.42 |
781.612 |
1616.64 |
as.factor (EV) Yes |
-163.949 |
80.9812 |
-327.144 |
-163.562 |
-2.8443 |
as.factor (HC) Yes |
207.696 |
92.5048 |
25.2276 |
207.318 |
389 |
as.factor (DBFa) Ever breastfed, not current |
268.439 |
163.619 |
-50.7223 |
266.995 |
591.912 |
Table 4 Summary of Bayesian generalized linear model fixed effects estimation results: for BMI
Scale estimate: 2126770
N=1969 DIC=1990.781 pd=10.88936 AIC=28710.6
method=STEP family=gaussian iterations=10000 step=2
Model comparison
We analyzed the DHS data set using two different approaches. It is reasonable to ask which type of model is better. The two models are listed in the following:
Model 1 = Generalized linear regression model in frequentest
Model 2 = Generalized linear regression model in Bayesian Model selection is an important part of statistical analysis, and indeed is a central to the pursuit of science in general. Many authors have examined this question, rom both frequentist and Bayesian perspective, and many tools o selecting the "better model". The models proposed in this study are quite general and the model building process can be quite challenging. Currently, an automated procedure for Bayesian model selection is not available. However, a few recommendations are possible. Users should try to incorporate everything that is theoretically possible and different Bayesian models could be compared via the DIC.10 From Table 5, when we compare Model 1 and Model 2 via AIC, the most interesting fact here is that the better models in terms of AIC. Model 2 (Generalized linear regression model in Bayesian approach) since they have less value (28753.8) and (28710.6) and they are chosen by the data analyst. The point is that models 2 are a better models than models 1 (Gaussian linear regression model in frequentest approach).
AIC |
GCV |
DIC |
AIC |
GCV |
DIC |
||
Model 1 |
34378 |
- |
- |
Model 1 |
34334 |
- |
- |
Model 2 |
28753.8 |
- |
1986.847 |
Model 2 |
28710.6 |
- |
1990.781 |
Table 5 Summary result of models comparison criteria for weight-for-age on the left and body mass index on right hand side
The authors gratefully acknowledge Central Statistics Agency for providing the data. This work was financially supported by the College of Natural Science, Hawassa University.
Author declares that there is no conflicts of interest.
©2019 Bacha, et al. This is an open access article distributed under the terms of the, which permits unrestricted use, distribution, and build upon your work non-commercially.
2 7