Research Article Volume 9 Issue 1
Department of Statistics, Jimma University, Ethiopia
Correspondence: Reta Habtamu Bacha, Department of Statistics, Jimma University, Jimma, Ethiopia
Received: May 12, 2019 | Published: February 28, 2020
Citation: Bacha RH. Identifying prognosticators covariates of child nutritional status in ethiopia: A bayesian generalized additive modelling approach. Biom Biostat Int J. 2020;9(1):28-42. DOI: 10.15406/bbij.2020.09.00297
Malnutrition among children under age five is the major public health delinquent issue in the developing world, particularly in Ethiopia. This study aimed to figure out determinants of Ethiopian children malnutrition by applying Bayesian approach with Markov chain Monte Carlo (MCMC) techniques on the 2011 EDHS data. The preliminary analysis indicated that the overall prevalence of underweight among children in Ethiopia is found 36.4%. Bayesian generalized additive regression model applied to flexibly estimate effects of socio-economic, demographic, health and environmental covariates. The estimation result showed that covariates succeeding birth interval, gender of child, child by choice not by chance, vaccination and cough are significantly affect the children nutritional status in Ethiopia. The effect of child age, mother’s age at child birth, succeeding birth intervals, number of household member and birth order were also explored non-parametrically as determinants of children nutritional status. Based up on this biometric analysis, concerned governmental and non-governmental bodies should give emphasis on the significant covariates to improve the children nutritional status of the country.
Keywords: children nutrition, malnutrition, arthrometric measurement, MCMC, generalized bayesian additive model and ethiopia
Malnutrition stays one of the most commonplace reasons of morbidity and mortality amongst children less than five years antique kids all through the World.1 Worldwide over 10 million children beneath the age of 5 years die every year from preventable and treatable ailments regardless of effective health interventions. At least half of these deaths are because of malnutrition. Malnourished children have reduced resistance to infection; consequently, they’re much more likely to die from common youth illnesses inclusive of diarrheal illnesses and respiratory infections. In addition, malnourished children that live on are likely to suffer from frequent infection, which adversely affects their nutritional repute and locks them into a vicious cycle of recurring illness, In growing international locations, malnutrition is a primary health hassle.2
The burden of malnutrition is much higher in South Asia compared to that in Africa and other parts of the world. The prevalence of underweight and stunting in South Asia has been recorded as 46 and 44%, respectively.3 Chronic malnutrition has been a persistent problem for young kids in Sub-Saharan Africa. An excessive percentage of those children fail to reach the everyday worldwide popular height for their age, this is, they may be stunted. In contrast, the percentage of children stunted in Southeast Asia decreased from 52 percent to 42% between 1990 and 2006. The variety of undernourished (low weight for age)people of every age in sub Saharan Africa accelerated from about 90 million in 1970 to 225 million in 2008, and became projected to add some other 100 million by 2015, even earlier than the present day international food rate hikes.
A study conducted on influence of socio-economic factors on nutritional status of children in a rural community of Osun state, Nigeria revealed that 9 the prevalence rates of underweight, wasting and stunting were 23.1%, 9% and 26.7% respectively and also prevalence and determinants of malnutrition among Under-five Children of Farming Households in Kwara State, Nigeria results indicate that 23.6%, 22% and 14.2% of the sample children have been stunted, underweight and wasted, respectively.5
Nowadays, nearly one in three persons globally suffers from at least one form of malnutrition: wasting, stunting, vitamin and mineral deficiency, overweight or obesity and diet-related NCDs(Non communicable diseases). In 2014, about 462 million adults worldwide have been underweight, at the equal time 1.9 billion had been either overweight or obese.6 In 2016, an estimated 41 million children under the age of 5 years were overweight or obese, while 155 million were chronically undernourished.7 Nutrition-related factors contribute to approximately 45% of deaths in children aged under 5 years (mainly due to undernutrition), while low- and middle-income countries are now witnessing a simultaneous rise in childhood overweight and obesity.8 A study done on malnutrition among under-five children in Bangladesh revealed that the high prevalence of stunting and underweight, for instance, 42% and 40% of under-five children were stunted and underweighted, respectively.9 Also study conducted nutritional status of under- five children in Mongolia also showed that the prevalence of stunting, wasting and underweight were 15.6%, 1.7% and 4.7%, respectively.
In Ethiopia, malnutrition is a major contributor to under-five mortality. According to the 2016 Ethiopia Demographic and Health Survey report, 12% of under-five children had a diarrheal episode inside the 2 weeks before the survey.10 More than half of under-five child deaths are resulting from illnesses that are without difficulty preventable and treatable via simple, value-effective, and less costly interventions. Strengthening fitness systems to offer such interventions to all kids will probably save many children’s lives
The problem
Malnutrition continues to be a main public health problem in developing countries. In Ethiopia, morbidity reports and community based studies have shown that malnutrition is a major public health problem that causes excess morbidity and mortality in children.10 In Ethiopia, infant malnutrition is one of the most critical public problems and the highest in the world.11 The overall prevalence of underweight among children in Ethiopia was 25%.12 In addition to this nutritional status of children in Ethiopia were not such much studied using Bayesian generalized additive models since it is the new field.
The foremost objective of this article is to identify predictors of children’s nutritional status in Ethiopia using the Bayesian generalized additive model. Specifically, we try to analyze the effect of continuous and categorical variables on nutritional status using the Bayesian generalized additive model. The reason why we want to use a generalized additive model: interpretability, flexibility/automation, and regularization. Hence, when our model contains nonlinear effects, GAM provides a regularized and interpretable solution while other methods generally lack at least one of these three features. In other words, GAMs strikes a nice balance between the interpretable, yet biased, linear model, and the extremely flexible, "black box" learning algorithms. In this case, some variables like Child Age (Childage), Age of Mothers at first Birth (AMB), Birth Order (BOD), Succeeding Birth Interval (SBI) and number of household member (Nhme) have a nonlinear effect on the response variables (i.e., weight for age and body mass index). For example, as we have seen from Figure 1, a number of household members (Nhme) has a nonlinear effect on the weight for age and which is difficult to interpret. So, smoothing is required to overcome this problem, which in turn included in the theory of generalized additive model as discussed above.
Figure 1 Relationship between number of household member (Nhme) as a nonlinear effect on the weight for age.
The output of this analysis will provide useful information on determinants of predictors of children’s nutritional status in Ethiopia, as well as concerned bodies, can take measurements on it.
Overview of burden child nutritional status
Child as well as infant mortality remains higher in developing despite a considerable decline in mortality levels in the last decades. According to the latest State of the World Children Report, approximately 26,000 children are dying each day around the world mostly from preventable causes and one third of these deaths occur in the first month of life. Nearly all of these deaths occur in 60 developing countries. Moreover, nearly half of the under-five deaths are due to under-nutrition. Sub-Saharan Africa is the region worst hit by higher under-five mortality followed by South Asia. Similarly, under-nutrition remains to be a grave public health problem in the developing world with South Asia having the most affected region followed by Sub-Saharan Africa.13 Chronic malnutrition has been a chronic problem for children in sub-Saharan Africa. A high percentage of these children fail to reach the normal international standard height for their age that is they are the "stunted". In assessment, the percentage of children stunted in Southeast Asia dropped from 53 percent to 42 percent between 1990 and 2006. The range of undernourished (low weight for age) people of every age in sub-Saharan Africa elevated from about 90 million in 1970 to 225 million in 2008 and was projected to add every other one hundred million by 2015, even earlier than the present-day world food price hikes.12
Childhood under-nutrition is amongst the most serious health issues facing developing countries It is an intrinsic indicator of well-being, however, it’s also related to morbidity, mortality, impaired youth development, and reduced exertions productiveness. Much of the burden of deaths resulting from malnutrition, estimated to be over half of childhood deaths in developing countries, can be attributed to just mild and moderate under-nutrition. Underweight children are particularly vulnerable to increased risk of death from infectious illnesses such as diarrhea and pneumonia.14 For those children that do not survive, the impact of chronic malnutrition in the first few years of life are long lasting and can lead to cognitive and physical developmental deficits, higher levels of chronic illness and disability in adult life (resulting in reduced work capacity), as well as adverse pregnancy outcomes (low birth).
Factors associated with Children’s Malnutrition
Important determinants of undernutrition include the education, income, and nutritional situation of the parents, access to clean water and sanitation, and access to primary health care and sex and age of child.15,16 Factors that are contributing to malnutrition may differ among regions, communities and over time. Identifying the underlying causes of malnutrition in a particular locality is important to solve the nutritional problems. Approximately 10 percent of children born in Ethiopia die before their first birthday and 17 percent will die before their fifth birthday.10 According to formulas developed by 57 percent Of under-five mortality in Ethiopia is related to intense and severe to moderate malnutrition.17 The consequences of malnutrition in children additionally consist of bad physical development and confined intellectual talents that diminish their running capability during maturity. Some of the socioeconomic and demographic elements explaining child nutrition in line with studies done in different places are reviewed follows.
Education of mother: Education of women is believed to exert an impact on health and nutritional status of children since it provides the mother with the necessary skills for child care, increase awareness of nutritional needs and preference of modern health facilities as well as change of traditional beliefs about diseases causation, and use of contraceptives for birth spacing. Studies in Ethiopia,15,16 indicate that the relationship between chronic nutritional status of children and maternal education.
Source of water and availability of toilet facility: Unfavorable health environment because of inadequate water and sanitation can boom the chance of infectious illnesses and circuitously motive sure forms of malnutrition. A com- parative study in some developing countries and in Jimma, Ethiopia18 showed that unprotected water source and non-availability of latrine were associated with low child stature.
Age of child: A cumulative indicator of growth retardation (height-for-age) in children is positively associated with age. Local studies in Ethiopia have also shown an increase in malnutrition with increase in age of the child.15 Children in the age group 0-5 months were found to be at significantly lower risk of stunting as compared with children in the age group 6-11 months.17 The prevalence of diseases and stunting rises with age.19
Gender of Child: In sub-Saharan Africa, male children under five years of age are more likely to become stunted than females.20 A number of studies in Africa suggest that rates of malnutrition among.21
Birth Order: It is expected that parents give less attention to older children when a new child is born because the new born needs much attention and care. One study showed that stunting is rare in birth orders 2-3 and higher birth order (5+) is positively associated with child malnutrition. The risk of stunting was also 1.3 times higher for children of first birth order as compared with children of birth order six or more.22
Birth interval of the child: Closely spaced pregnancies are often associated with the mother having little time to regain lost fat and nutrient stores.23 Studies in developing countries showed that children born after a short birth interval (less than 24 months) have better tiers of stunting in maximum countries in which DHS surveys were conducted. It was also observed that as the preceding birth interval of the child decreases, the likelihood of being stunted increases.22
Environmental factor: The impact of access to basic environmental services, such as water and sanitation on the probability children are stunted and underweight is examined in detail in Ethiopia and found as very significant.24
The dData
This analysis is based on the 2011 Demographic and Health Survey (EDHS) data, which obtained from the Central Statistics Agency of Ethiopia. The 2011 EDHS sample was selected using a stratified; two-stage cluster design and census enumeration areas (EAs) were the sample units for the first stage. The sample included 62 EAs, 187 in urban areas and 37 in rural areas. Households comprised the second stage of sampling. An entire list of households was performed in each of the 624 selected EAs. A total of approximately 11,654 children under the age of 60 months were identified in over 17 thousand households of the selected clusters and then anthropometric measurements such as height, weight, and ages were made. In this analysis, we consider social-demographic and socioeconomic variables as independent variables, while body mass index and weight-for-age were taken as the dependent variables.
The model
In this study, the authors used the Bayesian approach to predict the nutritional status of children. There have been tons of latest hobby in Bayesian inference for generalized additive and associated models. It is a method of statistical inference that allows one to combine prior information about a population parameter with the information contained in a sample.
Bayesian generalized additive regression model
Generalized Additive Model (GAM) is a method developed and popularized by Hastie, 2000.25 It provides a powerful class of models for modelling nonlinear effects of continuous covariates in regression models. Generalized additive models extend traditional linear models in another way, namely by allowing for a link between and the expected value of as (Gelman, et. al.,26 Bolstad;27).
(1)
Where are smooth functions. These functions are not given a parametric form but instead are estimated by nonparametric methods. Generalized additive models consist of a random component, an additive component, and a link function relating these two components. It assume that, the response Y, the random component, has density in the exponential family.22
(2)
Where:-
Moreover, the conditional expectation and with link function we have . Where is the design matrix, g is the link function. The quantity in the generalized additive models can be expressed as.28 i.e . Where, are smooth functions that define the additive component. Finally, the relationship between the mean of the response variable and is defined by a link function.
Generalized additive models focus on exploring data none parametrically. It is more suitable for exploring the data set and visualizing the relationship between the dependent variable and the independent variables.29
The smooth functions are typically represented in terms of linear combinations of smooth basis functions, for example, splines, and thus, can be added as columns to the model matrix. With a greater number of basis terms, more complex variations of the dependent variable can be described up to the limit of having the predicted curve pass through every point. Such a curve that describes the trend as well as the noise in the data will, in general, be poor at describing new samples from the same population. To control for this, a penalty for under smoothing is introduced during the optimization. The penalty is usually implemented as a proportion, λ of the integrated square of the second derivative of f (related to its curvature). Larger contributions of the term result in less smooth models 30,31. The choice of degree of penalization (or smoothness) is controlled by minimizing a criterion related to prediction error (i.e. fitting some of the data and calculating the error on the remaining portion) called generalized cross validation (GCV).
The cubic spline smoother imposes smoothness on . We seek the function that minimizes[21]:
Notice that measures the wiggliness of the function : linear have while nonlinear produce values bigger than zero. is a non negative smoothing parameter that must be chosen by the data analyst. It governs the trade-off between the goodness of fit to the data and (as measured by and wiggliness of the function. Larger values of force to be smoother.
For any value of the solution to 28 is a cubicspline, i.e, a piecewise cubic polynomial with pieces joined at the unique observed values of in the data set. Fast and stable numerical procedures are available for computation of the fitted curve.
Bayesian approaches are currently either based on regression splines second order random walk with adaptive knot selection32 or on smoothness priors25 and own work. Inference is based on Markov chain Monte Carlo inference techniques.33–35
Prior distribution
In Bayesian inference, the unknown function , j =1,...,p , the fixed effects parameters as well as the variance parameter are considered as random variables and have to be supplemented by appropriate prior assumptions. In the absence of any prior knowledge we assume independent diffuse priors , j=1,...,r for the parameters of fixed effects27-30.
Suppose that is the vector of corresponding function evaluations at observed values of x. Then, the general form of the prior for is
(3)
where, k is a penalty matrix that penalizes too abrupt jumps between neighboring parameters. In most cases K will be rank deficient; therefore the prior for f would be improper. We choose an inverse gamma distribution with hyper-parameters We choose an inverse gamma distribution with hyperparamters a and b, i.e.
with probability density function given by:
Common choices for a and b are a =1 and b =0.005 (or b =0.0005 ). Alternatively, one may take a=b=0.001
As indicated above for the parameter vector of fixed effects we choose a diffuse prior , j=1,...,r Another choice would be to work with a multivariate Gaussian distribution . In this study, diffuse priors will be used for the fixed effects.
Priors for the unknown functions depend on the type of the covariates and on prior beliefs about the smoothness of . In the following we express the vector of function evaluations of a function as the matrix product of a design matrix Xj and a vector of unknown parameters s’, i.e
Let us consider the case of a continuous covariate X with equally spaced observations , i = 1,..., m, , then, defines the ordered sequence of distinct covariate values. Here m denotes the number of different observations for X in the data set. A common approach in dynamic or state space models is to estimate one parameter for each distinct ; i.e., define, and let denote the vector of function evaluation. Then a first order random walk prior for is defined by:
(4)
A second order random walk is given by
(5)
with Gaussian errors and diffuse priors and , for initial values, respectively.
Any smoother depends heavily on the choice of smoothing parameter, and for P spline in a mixed (fixed and continuous) framework. A closely related approach for continuous covariates is based on the P-splines approach introduced by Eilers and Marx (1996). The basic assumption of this approach is that the unknown function can be approximated by a spline of degree l with equally spaced knots within the domain of .
Posterior Inference
When performing Bayesian inference, all inferential conclusions are based on the posterior of the model. In an empirical Bayes approach to structured additive regression, no hyper-prior are assigned to the hyper-parameters, i.e. the variances are treated as fixed. In this case, the specific form of the posterior depends only on the parameterization of the regression terms in the model. Then, we use Markov Chain Monte Carlo (MCMC) simulations to draw samples from the posterior and statistical inference is done by means of Markov Chain Monte Carlo techniques in a full Bayesian setting. Now we restrict the presentation to models with predictor. Full Bayesian inference is based on the entire posterior distribution. Let be the vector of all unknown parameters, then the posterior is given by.
In a fully Bayesian approach, parameter estimates are generated by drawing random samples from the posterior via MCMC simulation techniques. The variance parameters can be estimated simultaneously with the regression coefficients by assigning additional hyper-prior to them. The most common assumption is, that the are independently inverse gamma distributed, i.e. , with hyper-parameters and specified a prior. A standard choice is to use .36,32
Model Diagnostic
Once a model has been developed, we now would like to know how effective the model is in describing the outcome. This is referred to as goodness of fit. The most common ways of checking goodness of fit are: diagnosis for convergence and mixing and posterior-predictive check. In this study, Time series plots were used.37
Comparison of the model
There are several methods of model selection. In this analysis, AIC (Akaike information criterion), DIC (Deviance information criterion) and GCV (Generalized cross validation) are used to compare the linear frequent and the linear Bayesian approach. The model with the smallest AIC, DIC and GCV value considered a better fit.31
Overall procedures
The authors estimated a model parameters that mainly focuses on factors that are underlying determinants of nutritional status of children. And apply Generalized linear additive model to investigate linear and non-linear effects of covariates more flexibly than most previous works.
Descriptive analysis of the data
The result displayed on Table 1 shows the percentages and counts of underweight status of children with respect to the categorical explanatory variables. As one can see from Table 1, among the cases examined in this study of male children were underweight and of female children were underweight. The highest prevalence of child malnutrition was observed among children whose preceding birth interval was less than 24 months () unlike to the lowest prevalence of child malnutrition which was recorded from children whose preceding birth interval is 48 and above (31.9%).
Covariates |
Categories |
Stunted |
Wasted |
Underweight |
Region |
Tigray |
379(34.4%) |
351(31.8%) |
373(33.8%) |
|
Affar |
358(36.5%) |
272(27.6%) |
355(36.0%) |
|
Amhara |
420(35.6%) |
361(30.6%) |
398(33.8%) |
|
Oromia |
540(34.2%) |
492(31.2%) |
546(34.6%) |
|
Somali |
377(42.3%) |
243(27.3%) |
271(30.4%) |
|
Benishangul-Gumuz |
294(32.8%) |
304(34.0%) |
297(33.2%) |
|
SNNP |
516(35.5%) |
466(32.0%) |
473(32.5%) |
|
Gambela |
273(37.4%) |
228(31.3%) |
228(31.3%) |
|
Harari |
241(40.3%) |
176(29.4%) |
181(30.3%) |
|
Addis Ababa |
143(39.0%) |
103(28.1%) |
121(33.0%) |
|
Dire Dawa |
226(35.8%) |
191(30.2%) |
215(34.0%) |
Place of residence |
Urban |
697(39.8%) |
507(29.0%) |
547(31.2%) |
|
Rural |
3070(35.4%) |
2680(30.9%) |
2911(33.6%) |
Wealth index |
Poorest |
1172(36.4%) |
950(29.5%) |
1100(34.1%) |
|
Poorer |
663(35.1%) |
607(32.1%) |
621(32.8%) |
|
Middle |
565(33.4%) |
536(31.7%) |
589(34.9%) |
|
Richer |
602(35.7%) |
537(31.8%) |
548(32.5%) |
|
Richest |
765(39.8%) |
557(29.0%) |
600(31.2%) |
Sex |
Male |
1918(36.2%) |
1646(31.0%) |
1741(32.8%) |
|
Female |
1849(36.3%) |
1541(30.2%) |
1717(33.6%) |
Children by choice |
No |
3027(36.1%) |
2559(30.5%) |
2798(33.4%) |
|
Yes |
735(36.4%) |
628(31.1%) |
659(32.6%) |
Mother edu.level |
No education |
2665(36.6%) |
2187(30.0%) |
2427(33.3%) |
|
Primary |
900(34.5%) |
833(31.9%) |
875(33.1%) |
|
Secondary and above |
202(38.5%) |
167(31.8%) |
156(29.7%) |
Father edu.level |
No education |
1903(35.8%) |
1604(30.1%) |
1816(34.1%) |
|
Primary |
1339(35.5%) |
1193(31.6%) |
1244(32.9%) |
|
Secondary and above |
442(39.2%) |
338(30.0%) |
347(30.8%) |
Drinking water |
Public |
1607(36.7%) |
1324(30.2%) |
1447(33.1%) |
|
Others |
2158(35.8%) |
1862(30.9%) |
2010(33.3%) |
Ever had vaccination |
No |
881(38.9%) |
675(29.8%) |
708(31.3%) |
|
Yes |
1990(35.6%) |
1693(30.3%) |
1912(34.2%) |
Had diarrhea recently |
No |
3221(36.6%) |
2709(30.8%) |
2867(32.6%) |
|
Yes |
530(33.3%) |
475(29.8%) |
588(36.9%) |
Had child fever |
No |
3028(36.3%) |
2548(30.6%) |
2757(33.1%) |
|
Yes |
719(35.3%) |
627(30.8%) |
691(33.9%) |
Had cough recently |
No |
3029(36.5%) |
2507(30.2%) |
2757(33.2%) |
|
Yes |
723(34.6%) |
674(32.2%) |
695(33.9%) |
Preceding birth interval |
less than 24 |
1022(36.6%) |
825(29.5%) |
969(35.0%) |
|
24-48 |
999(37.0%) |
839(31.1%) |
945(33.8%) |
|
Above 48 |
946(34.2%) |
853(30.8%) |
861(31.9%) |
Breast feeding |
Never breastfed |
61(39.1%) |
45(28.8%) |
50(35.1%) |
|
Ever breastfed(not cur) |
1983(36.0%) |
1689(30.7%) |
1832(32.3%) |
|
Still breast feeding |
1700(36.1%) |
1442(30.6%) |
1570(30.3%) |
Table 1 Summary of covariates with the response variable(weight-for-age)
The proportion of underweight children as we can seen from Table 1, it is also differs by type of place of residence: urban and rural. Accordingly, higher numbers of underweight children (33.6%)) reside in rural areas, and relatively less numbers of underweight children (31.2%) reside in urban centers.
The evidence given in Table 1 shows that the percentage nutritional statuses of children who had diarrhea recently seem to be high probability of underweight than those children which had no diarrhea recently.
Likewise, one can observe that the percentages nutritional status of children who had fever recently seemed high probability of underweight than those children who have no fever recent (33.9%, 33.1%, respectively).
The evidence given in Table 1 shows that the percentages nutritional status of children in households who had used other water sources seemed to be more underweight than those children in households which used water from public source.
The family education level has also it’s effect on the nutritional status of children that a mother with higher educational level(father education level) had a child with better nutritional status. Since the percentages of underweight is decreasing from no education to secondary and above education level. Children who had disease are affected by malnutrition e.g child who had cough is more affected than child who had not cough as we observe from Table 1.
From Table 2 the highest malnourished region are SNNP, Oromia, Gambela, Affar and Tigray. From all of these region the smallest malnourished region is Harari. The place of residence has also its impact as result the highest malnourished lives in rural comparing with urban.
Covariates |
Categories |
malnourished |
well-nourished |
Region |
Tigray |
550(49.7%) |
556(50.3%) |
|
Affar |
494(49.8%) |
498(50.2%) |
|
Amhara |
558(47.3%) |
622(52.7%) |
|
Oromia |
795(50.1%) |
793(49.9%) |
|
Somali |
423(47.3%) |
471(52.7%) |
|
Benishangul-Gumuz |
472(52.6%) |
426(47.4%) |
|
SNNP |
782(53.5%) |
680(46.5%) |
|
Gambela |
329(44.9%) |
403(55.1%) |
|
Harari |
249(41.6%) |
349(58.4%) |
|
Addis Ababa |
157(42.7%) |
211(57.3%) |
|
Dire Dawa |
311(49.0%) |
324(51.0%) |
Place of residence |
Urban |
802(45.6%) |
955(54.4%) |
|
Rural |
4318(49.7%) |
4378(50.3%) |
Wealth index |
Poorest |
1610(49.8%) |
1621(50.2%) |
|
Poorer |
925(48.5%) |
982(51.5%) |
|
Middle |
867(51.1%) |
830(48.9%) |
|
Richer |
834(49.4%) |
855(50.6%) |
|
Richest |
884(45.8%) |
1045(54.2%) |
Sex |
Male |
2654(49.8%) |
2664(50.2%) |
|
Female |
2463(48.0%) |
2673(52%) |
Source of drinking water |
public |
2141(48.7%) |
2252(51.3%) |
|
others |
2975(49.1%) |
3081(50.9%) |
Had diarrhea |
No |
4290(48.6%) |
4543(51.4%) |
|
Yes |
51.5(823%) |
775(48.5%) |
Had child fever |
No |
4064(48.6%) |
26644301(51.4%) |
|
Yes |
1037(50.7%) |
1009(49.3%) |
Had cough recently |
No |
4037(48.5%) |
4285(51.5%) |
|
Yes |
1076(51.1%) |
1028(48.9%) |
preceding birth interv |
less than 24 |
1407(50.1%) |
1400(49.9%) |
|
24-48 |
1319(48.8%) |
1386(51.2%) |
|
above 48 |
1394(46.1%) |
8613(53.9%) |
Duration of bf |
never breastfed |
81(50.9%) |
78(49.1%) |
|
ever bf, not currently |
2634(47.6%) |
2897(52.4%) |
|
still bf |
2281(5.8%) |
2441(54.2%) |
Table 2 Summary of Covariates with the response variable(Body mass index)
Inferential part
Under this, we use Gaussian linear regression model to analyze the determinants that have linear effects and Gaussian additive model on its part which can be used to analyze variables that have a non linear effects. The semi parametric regression model that includes the linear effects categorical coariates as well as the non linear of continuous variables.
Bayesian generalized additive regression model: Under this we use Gaussian additive model which can be used to analyse variables that have a non-linear effect. In this case, the main interest is to model the dependence of weight for age of under five years old children and body mass index on continuous covariates including child’s age in month (Childage), Age of mother at first birth(AMB), birth order(BOD), succeeding birth interval(SBI) and Number of household member(Nhme).
Based on above concept, for these continuous covariates in the data set, the assumption of a strictly linear effect on the response y may not be appropriate. Generally, it will be difficult to model the possibly nonlinear effect of such covariates through a parametric functional form, which has to be linear in the parameters, prior to any data analysis. To overcome these difficulties, we replace the strictly linear predictor through an additive predictor, leading to the additive regression model.
The first thing to note is, from Table 3 and Table 4 the output is separated into parametric and smooth, or nonparametric parts. In this additive Gaussian regression model, from Table 3 of the parametric component, succeeding birth interval, sex of child, children by choice not by chance, ever had vaccination and had cough recently are significant affect on underweight status of children under age five years. The variables like duration of beast feeding, source of drinking water and number of household member(Nhme) are significant at level and can effect weight for age of child.
Estimate |
Std. Error |
t value |
P-value |
|
(Intercept) |
-771.31 |
1098.594 |
-0.702 |
0.4827 |
Wealth index |
7.047 |
30.589 |
0.23 |
0.8178 |
PBI |
38.632 |
40.169 |
0.962 |
0.3363 |
as.factor(Region1)Tigray |
290.657 |
1086.516 |
0.268 |
0.7891 |
as.factor(Region1)Affar |
501.397 |
1081.432 |
0.464 |
0.643 |
as.factor(Region1)Amhara |
580.181 |
1085.5 |
0.534 |
0.5931 |
as.factor(Region1)Oromiya |
452.41 |
1080.257 |
0.419 |
0.6754 |
as.factor(Region1)Somali |
516.865 |
1078.889 |
0.479 |
0.6319 |
as.factor(Region1)Benishangul-Gumuz |
520.576 |
1083.424 |
0.48 |
0.6309 |
as.factor(Region1)SNNP |
588.474 |
1081.436 |
0.544 |
0.5864 |
as.factor(Region1)Gambela |
539.967 |
1086.277 |
0.497 |
0.6192 |
as.factor(Region1)Harari |
346.372 |
1086.566 |
0.319 |
0.7499 |
as.factor(Region1)Dire Dawa |
395.183 |
1086.129 |
0.364 |
0.716 |
as.factor(TPR)Rural |
137.094 |
143.649 |
0.954 |
0.34 |
as.factor(Schild)Female |
-152.48 |
67.214 |
-2.269 |
0.0234* |
as.factor(FEDL)Primary |
36.687 |
94.918 |
0.387 |
0.6992 |
as.factor(MEDL)Secondary and above |
766.417 |
489.34 |
1.566 |
0.1175 |
as.factor(FEDL)Primary |
7.253 |
78 |
0.093 |
0.9259 |
as.factor(FEDL)Secondary and above |
120.491 |
168.106 |
0.717 |
0.4736 |
as.factor(SDW)Others |
132.4 |
71.213 |
1.859 |
0.0631. |
as.factor(EV)Yes |
-186.228 |
87.525 |
-2.128 |
0.0335* |
as.factor(HDR)Yes |
35.472 |
126.786 |
0.28 |
0.7797 |
as.factor(HF)Yes |
-183.462 |
120.232 |
-1.526 |
0.1272 |
as.factor(HC)Yes |
272.905 |
111.027 |
2.458 |
0.0141* |
as.factor(DBFa)Ever breastfed, not current |
312.43 |
169.797 |
1.84 |
0.0659. |
as.factor(DBFa)Still breastfeeding |
67.38 |
762.159 |
0.088 |
0.9296 |
Table 3 Results of parametric coefficients in generalized additive model of weight-for-age
edf |
Ref.df |
F |
p-value |
|
s(Childage) |
2.223 |
2.806 |
2.279 |
0.0610. |
s(AMB) |
1.658 |
2.065 |
0.562 |
0.5535 |
s(BOD) |
3.479 |
4.343 |
1.738 |
0.1445 |
s(SBI) |
8.263 |
8.806 |
2.099 |
0.0494* |
s(Nhme) |
7.925 |
8.709 |
1.693 |
0.0564. |
Table 4 Summary approximate significance of smooth terms of weight-for-age
Signif. codes: 0 ’***’ 0.001 ’**’ 0.01 ’*’ 0.05 ’.’ 0.1 ’ ’ 1
R-sq.(adj) = 0.0234 Deviance explained = 4.8%
GCV = 2.2178e+06 Scale est. = 2.1609e+06 n = 1969
The GCV, or generalized cross validation score can be taken as an estimate of the mean square prediction error based on a leave-one-out cross validation estimation process. However, the GCV score is an efficient measure of this concept that doesn’t actually require fitting all those models and overcomes other issues. The GCV, doesn’t give as what we require since its value is largest.
From Table 4 results child’s age in month, number of household member and succeeding birth interval significant at 10% and significant respectively, and affect weight for age of child. From Table 4 if the effective degrees of freedom for the smooth term is high and we observe that p-value is small or become significant and vice versa. So the corresponding variables become important affect on the response variables.
From Table 5 and 6 had cough, sex of child and children by choice not chance are significant and ever had vaccination and duration of breast feeding are significant at 10% level. And affect body mass index. Child’s age in month, succeeding birth interval and number of household member and are significant at 10% level. And affect body mass index of child.
Estimate |
Std. Error |
t value |
P-value |
|
(Intercept) |
-547.0614 |
1087.3824 |
-0.503 |
0.615 |
Wealth index |
10.2716 |
30.2544 |
0.34 |
0.7343 |
PBI |
37.1769 |
39.7694 |
0.935 |
0.35 |
Region1 Trigray |
184.1945 |
1075.1586 |
0.171 |
0.864 |
as.factor(Region1)Affar |
407.869 |
1070.1503 |
0.381 |
0.7031 |
as.factor(Region1)Amhara |
455.3451 |
1074.1781 |
0.424 |
0.6717 |
as.factor(Region1)Oromiya |
338.447 |
1068.9486 |
0.317 |
0.7516 |
as.factor(Region1)Somali |
368.3175 |
1067.5394 |
0.345 |
0.7301 |
as.factor(Region1)Benishangul-Gumuz |
431.5764 |
1071.9779 |
0.403 |
0.6873 |
as.factor(Region1)SNNP |
485.9827 |
1070.1282 |
0.454 |
0.6498 |
as.factor(Region1)Gambela |
335.3584 |
1074.9784 |
0.312 |
0.7551 |
as.factor(Region1)Harari |
218.0806 |
1075.1869 |
0.203 |
0.8393 |
as.factor(Region1)Dire Dawa |
281.1018 |
1074.6055 |
0.262 |
0.7937 |
as.factor(TPR)Rural |
158.2916 |
142.1675 |
1.113 |
0.2657 |
as.factor(Schild)Female |
-144.7441 |
66.5314 |
-2.176 |
0.0297* |
as.factor(Childbychoice)Yes |
-232.4889 |
104.4726 |
-2.225 |
0.0262* |
as.factor(MEDL)Primary |
36.6039 |
93.9137 |
0.39 |
0.6968 |
as.factor(MEDL)Secondary and above |
729.6677 |
484.3697 |
1.506 |
0.1321 |
as.factor(FEDL)Primary |
-0.2891 |
77.2075 |
-0.004 |
0.997 |
as.factor(FEDL)Secondary and above |
112.5688 |
166.2867 |
0.677 |
0.4985 |
as.factor(SDW)Others |
112.5903 |
70.4067 |
1.599 |
0.11 |
as.factor(EV)Yes |
-168.4981 |
86.5547 |
-1.947 |
0.0517. |
as.factor(HDR)Yes |
56.2634 |
125.449 |
0.448 |
0.6538 |
as.factor(HF)Yes |
-174.5133 |
118.95 |
-1.467 |
0.1425 |
as.factor(HC)Yes |
267.2227 |
109.8629 |
2.432 |
0.0151* |
as.factor(DBFa)Ever breastfed, not current |
284.9502 |
168.016 |
1.696 |
0.0901. |
as.factor(DBFa)Still breastfeeding |
158.8417 |
754.1461 |
0.211 |
0.8332 |
Table 5 Results of parametric coefficients in generalized additive model for BMI
edf |
Ref.df |
F |
p-value |
|
(Childage) |
2.321 |
2.926 |
2.371 |
0.0556. |
s(AMB) |
1.719 |
2.146 |
0.611 |
0.5204 |
s(BOD) |
3.224 |
4.037 |
1.618 |
0.1703 |
s(SBI) |
5.65 |
6.762 |
1.823 |
0.0843. |
s(Nhme) |
7.889 |
8.69 |
1.621 |
0.0690. |
Table 6 Summary approximate significance of smooth terms: for BMI
Signif. codes: 0 ’***’ 0.001 ’**’ 0.01 ’*’ 0.05 ’.’ 0.1 ’ ’ 1
R-sq.(adj) = 0.0211 Deviance explained = 4.44%
GCV = 2.1717e+06 Scale est. = 2.1189e+06 n = 1969
Mean |
Sd |
2.50% |
50% |
97.50% |
|
(Intercept) |
245.513 |
185.566 |
-123.042 |
246.533 |
615.123 |
as.factor(Region1)Tigray |
-213.347 |
136.051 |
-483.753 |
-209.979 |
54.242 |
as.factor(Schild)Female |
-145.889 |
66.049 |
-278.356 |
-146.296 |
-13.876 |
as.factor(Childbychoice)Yes |
-246.234 |
100.74 |
-440.729 |
-246.892 |
-47.235 |
as.factor(MEDL)Secondary and above |
811.363 |
442.597 |
-53.229 |
809.279 |
1674.61 |
as.factor(SDW)Others |
114.572 |
67.69 |
-15.373 |
113.176 |
246.921 |
as.factor(EV)Yes |
-175.298 |
81.266 |
-334.197 |
-174.001 |
-17.416 |
as.factor(HC)Yes |
196.325 |
92.622 |
17.874 |
197.192 |
376.061 |
as.factor(DBFa)Ever breastfed,not current |
272.941 |
159.247 |
-38.655 |
274.046 |
582.874 |
Table 7 Summary result of bayesian generalized additive model fixed effects estimation results of Weight-for-age
From Figure 2 Diagnostic Plots for the Generalized Additive Model
Here we can see the negative, positive and constant slight effect of the continuous variables on child nutritional status. From Figure 3 of Childage (months) versus s(Childage) shows the relationship between Childage and Wfa (weight for age) holding constant the BOD (Birth order), SBI (Succeeding birth interval), Nhme (Number of household member)and AMB (Age of mother’s at first birth) variables in the model. And, this interpretation is the same as true for the variables BOD (Birth order), SBI (Succeeding birth interval), Nhme (Number of household member)and AMB (Age of mother’s at first birth).
Since Wfa (weight for age) is expressed in mean deviation form, the smooth term s(Childage), s(AMB), s(BOD), s(SBI) and s(Nhme)are also centered and thus each plot represents how Wfa (weight for age) changes relative to its mean with changes in Childage, AMB, BOD, SBI, and Nhme respectively.
One can observe that from Figure 3 the influence of a child’s age on its nutritional status is considerably high in the age range between the ages of 0-26 months with decreasing trend; and then stabilizes.
As suggested by the nutritional literature, one can able to distinguish the continuous worsening of the nutritional status up until about 26 months of age. This deterioration set in right after birth and continues, more or less linearly, until 26 months. After 42 months the effect of age on underweight stabilizes at a low level. Through reduced growth and the waning impact of infections, children were apparently able to reach a low-level equilibrium that allows their nutritional status to stabilize.
It is also evident from Figure 3 we analyze that underweight status of children is some what stable until 21 years for age of mother at first birth and after 21 it becomes slightly increase. That is younger mothers tend to have stable underweighted children than older mothers.
As we can see in Table 8, is a non negative smoothing parameter that must be chosen by the data analyst. It governs the trade off between the goodness of fit to the data and the wiggleness of the function. When goes to infinity, the penalty term becomes more important, forcing thus the result is the least square line. While when goes to zero, the penality becomes unimportant, thus a solution is the second derivative function. The larger value of (Table 13, sx(Childage)=7500.000) produce smoother curves while the smallest values (Table, sx(Nhme)=10.924) produces more wiggly curve than the others.
In fact degree of freedom of smoother is inconvenient to express the desired smoothness of functions in terms of . Degree of freedom of a smoother, sometimes called effective number of parameters, is an indication of the amount of smoothing. The smoothing parameter is the major determinant of the degree of freedom. More smoothing means fewer degree of freedom (Table 8, sx(Childage)=1.9720) or higher span. The degree of freedom is not necessary an integer. The higher value of means lower value of degree of freedom and vise versa (Table 8).
|
lambda |
df |
sx(Childage) |
7500.000 |
1.9720 |
sx(SBI) |
166.975 |
5.0346 |
sx(Nhme) |
10.924 |
8.0457 |
Table 8 Summary result of bayesian generalized additive model smooth terms results of weight-for-age
Scale estimate: 2135160
N = 1969 DIC = 2017.825 pd = 24.43678 GCV = 2188290
method = STEP family = gaussian iterations = 10000 step = 2
From Table 9 sex of child, children by choice not by chance and ever had vaccination are significant. This can be seen as body mass index of female children is lowered by compared to male children. In similar way, body mass index of child who had ever vaccinated is lowered by comparing to not vaccinated, which indicate only vaccination is not enough for child.
Mean |
Sd |
2.50% |
50% |
97.50% |
|
(Intercept) |
397.852 |
181.139 |
39.4558 |
396.599 |
749.69 |
as.factor(Region1)SNNP |
147.32 |
90.1965 |
-25.4252 |
146.964 |
332.266 |
as.factor(Schild)Female |
-142.632 |
66.0772 |
-268.15 |
-143.985 |
-14.616 |
as.factor(Childbychoice)Yes |
-236.923 |
101.263 |
-437.785 |
-238.694 |
-35.788 |
as.factor(MEDL)Secondary and above |
760.134 |
435.53 |
-97.3098 |
759.167 |
1613.5 |
as.factor(EV)Yes |
-169.015 |
78.187 |
-317.646 |
-168.556 |
-18.669 |
as.factor(HC)Yes |
180.663 |
93.7865 |
-1.0511 |
180.554 |
369.391 |
as.factor(DBFa)Ever breastfed,not current |
250.279 |
157.265 |
-67.8082 |
251.921 |
552.87 |
Table 9 Summary result of bayesian generalized additive aodel fixed effects estimation results: for BMI
lambda |
df |
|
sx(Childage) |
7500 |
1.972 |
sx(SBI) |
166.975 |
5.0346 |
sx(Nhme) |
10.924 |
8.0457 |
Table 10 Summary result of bayesian generalized additive model smooth terms results: for BMI
Scale estimate: 2092320
N = 1969 DIC = 2016.043 pd = 23.5268 GCV = 2142190
method = STEP family = gaussian iterations = 10000 step = 2
The model in Bayesian approach for Bayesian generalized additive model for MCMC post estimation diagnosis, it is also possible to extract sampling paths of parameters with function samples or to plot the samples directly. For instance, coefficient sampling paths for term sx(SBI) are displayed which is important to check the model convergence so as we observe from Figure 4.
Model comparison
We analysed the DHS data set using four different approaches. It is reasonable to ask which type of model is better. The four models are listed in the following: Model 1 = Generalized additive model in frequentist Model 2 = Generalized additive model in Bayesian Model selection is an important part of statistical analysis, and indeed is a central to the pursuit of science in general. Many authors have examined this question, from both frequentist and Bayesian perspective, and many tools of selecting the "better model".
The models proposed in this study are quite general and the model building process can be quite challenging. Currently, an automated procedure for Bayesian model selection is not available. However, a few recommendations are possible. 1. Users should try to incorporate everything that is theoretically possible. 2. Different Bayesian models could be compared via the DIC.38
From Table 11, When we compare models 1 model 2, from Table 11 by GCV we easily see that models 2 have less GCV values, which means Bayesian Generalized additive models are clearly much better than models 1 (Generalized additive model in frequentist). Since, models 2(Generalized Additive model in Bayesian) has less GCV values (2188290) and (2142190) than models 1 (2.2178e+6) and (2.1717e+06) respectively.
AIC |
GCV |
DIC |
|
AIC |
GCV |
DIC |
|
Model 1 |
- |
2.22E+06 |
- |
Model 3 |
- |
2.17E+06 |
- |
Model 2 |
- |
2188290 |
2017.825 |
Model 4 |
- |
2142190 |
2016.043 |
Table 11 Summary result of models comparison criteria for weight-for-age on the left and body mass index on right hand side
Discussion
This study was intended to identify the determinants of the weight for age and body mass index of nutritional status of children under five years old in Ethiopia based on EDHS 2016 data. The nutritional status was measured by the weight-for-age and body mass index is measured by weight/height.
In the study, the effect of the age of the child is obviously nonlinear and decreasing between birth and an age of about 30 months and then stabilizes. That means the underweight of children increases until 30 months. This continuous worsening of the nutritional status may be caused by the fact that most of the children obtain liquids other than breast milk already shortly after birth. After 30 months a relatively stable, low level is reached. However, it reaches its minimum level between ages 40-50 months, then rises again and stabilizes thereafter at a middle level with a bump till 5 years. Previous studies also conform this (RE Black, 2013,39 Mohammad, 2018,40)
In the study, the effect of the age of the child is obviously nonlinear and decreasing between birth and an age of about 26 months and then stabilizes. That means the underweight of children increases until 26 months. This continuous worsening of the nutritional status may be caused by the fact that most of the children obtain liquids other than breast milk already shortly after birth. Between 30-40 months a relatively decreases and 41-50 months become stable, low level is reached. However, it reaches its minimum level between ages 30-50 months, then rises again thereafter at a middle level with a bump till 5 years. Previous studies also conform this (Mohammad, 2018,40Loader, 1999.41)
Conclusions
The results found that children living in the Afar and Oromia region of Ethiopia are associated with weight for age (underweight). While the children which are living in Harari and Somali are less associated with underweight comparing with other regions.
Determinants that affect levels of nutritional status of children have been explored using different models like generalized additive model in frequentest and Bayesian approach. By using model comparison methods, surprisingly generalized additive model in Bayesian approaches is the preferable one.
The analyses shows that child sex, children by choice not by chance, vaccination, source of drinking water, duration of breast feeding and cough important fixed factors which affect the weight for age of children in Ethiopia. Variables like succeeding birth interval, child gender, children by choice not by chance, vaccination, and cough are also affect body mass index.
Concerning the non-linear effects, a major finding is that the malnutrition status worsens until 26 months of age of their life and between 30-40 and 41-50 months become decrease and stable respectively and again increase. The effect of AMB (Age mothers at first birth) is slight on childhood malnutrition until 22 months but effect of underweight seems increase above 22 months. In contrast the effects of succeeding birth interval on underweight are also increase above 45 months and slightly change/fluctuate below 45 months.
Mother education level, which is related to wealth index, is a determinant of good child care knowledge and practices. In this study, the education attainment of mothers does not have a significant effect on the underweight of a child in Ethiopia in separate analysis. This result is against the suggestion that an educated mother assumes the responsibility of taking a sick child to receive care health. Mother education level, which is related to succeeding birth interval have also determine the body mass index since they are significant effect.
Recommendations
In addition to the efforts being made to reduce malnutrition in general, specific attention should be given to reduce the prevalence of malnutrition by taking the following in to consideration. The concerned body:
Limitations of the study
Spatial information can be analyzed in Bayesian approach and result in very important information to identify the prevalence of stunting region-wise and/or district-wise. Because of lack of having complete spatial information, in this study we have not analyzed the prevalence of child malnutrition in-depth spatially.
The data, we used in this study is EDHS 2011. The researcher find any option to get recent data. Since latest data is not released till now and we haven’t any option to do more. The only option we have is only doing on the EDHS 2011 data. Thus, the results may not necessarily reflect the current situation of Child’s nutritional status of children.
Under this study the regional differences are not considered either we observe which regional has highest prevalence of underweight or low prevalence of underweight.
©2020 Bacha. This is an open access article distributed under the terms of the, which permits unrestricted use, distribution, and build upon your work non-commercially.
2 7