Submit manuscript...
eISSN: 2378-315X

Biometrics & Biostatistics International Journal

Research Article Volume 6 Issue 1

Identification of the risk factors associated with ICU mortality

Nasser Abdullah K Alghamdi,1 Munni Begum2

1Mathematics department, Albaha university, Saudi Arabia
2Department of Mathematical Sciences, Ball State University, USA

Correspondence: Munni Begum, Department of Mathematical Sciences, Ball State University, Muncie, IN 47306, USA

Received: May 21, 2016 | Published: June 15, 2017

Citation: Alghamdi NAK, Begum M. Identification of the risk factors associated with icu mortality. Biom Biostat Int J. 2017;6(1):278-287. DOI: 10.15406/bbij.2017.06.00157

Download PDF

Abstract

The objective of this study is to identify the risk factors that influence the surgical and medical Intensive Care Unit (ICU) mortality. We considered data that was collected at Bay State Medical Center in Springfield, Massachusetts.1 We developed statistical models that identify the risk factors associate with ICU mortality. In order to identify the risk factors without subjective bias, we explored multiple variable selection methods. We explored several methods including what we call manually picked best model, forward selection, backward elimination, and the least absolute shrinkage and selection operator (LASSO). We applied 5-fold cross validation on the final model of manually picked best model, forward selection and backward elimination and applied both validation set approach and 5-fold cross validation on LASSO to create confusion matrices and calculate the error rate of each method. Finally, we recommended the model for predicting ICU mortality with lowest misclassification error rate.

Introduction

The care and treatment of critically ill patients in special rooms with life-saving technology is the major component of modern medical science. The diagnosis and treatment of the patients in critical conditions is highly dependent on invasive diagnostic as well as therapeutic procedures. However, the main disruption of host defense mechanisms comes from the life support systems.

According to Morandi, Jackson, and Wesley,2 there are ICU-acquired infections that is responsible for the high mortality rate of the ICU patients. The researchers’ study offers a useful information concerning the topic. The aim of their study was to determine the epidemiology as well as the risk factors for nosocomial infections and the mortality rate in the ICU. Due to the variations of the study methods, the infection rates from different ICUs are difficult to compare.

Girard, Pandharipande, & Ely3 show that delirium, a fluctuating disturbance of cognition, is a sign of acute brain dysfunction in the patients with critical illnesses in the ICU. The patients with critical illnesses are more likely to have delirium. Also, Morandi, Jackson, & Wesley Ely2 show that delirium can be related to the cognitive impairment that persists for an extended period after discharge. Techniques of treating delirium at the ICU has been the subject of investigations in the recent past.

Age is also suggested as one of the predictors of mortality in the ICU. The number of elderly patients who are being admitted to the ICU has been increased, not only in the USA but also internationally (Belayachi et al.4). There are few studies that have been conducted to link old age with the ICU mortality rate. The current research includes age as one of the risk factors for ICU mortality.

Background

The ICU mortality rate in any hospital is the highest compared to other units. The United States of America has an approximate 4 million ICU admissions annually and the mortality rate of 500,000 deaths every year. The medical errors occur in any unit of the hospital, but it more likely to occur in the ICU since the ICU patients undergo complex interventions.

A study on the mortality risk factors and validation of severity scoring systems in the critically ill patients with acute renal failure was conducted to identify the determinants for improving patient care. Renal failure has a high prevalence in the ICU and associated with high mortality rates. Identification of the mortality risk factors helps to address intervention to these risk factors and improves patient care (Lima, Zanetta, Castro & Yu).5

Iwuafor, et al.6 conducted a study sought to determine the prevalence, risk factors, clinical outcome, and the microbiological profile of the hospital-acquired infections in the ICU of a Nigerian hospital. Infections commonly affect critically ill patients and have a high association with mortality. The study identified blood stream infections and the urinary tract infections as a significant risk factors associated with the ICU mortality.

Data and variable description

We considered data collected at Bay State Medical Center in Springfield, Massachusetts that can be downloaded from University of Massachusetts, Amherst website.1 The dataset consists of 200 observations with 20 variables. The response variable (STA), vital status is categorical. The other categorical predictor variables are: Gender, Race, SER (Service at ICU Admission), CAN (Cancer Part of Present Problem), CRN (History of Chronic Renal Failure), INF (Infection Probable at ICU Admission), CPR (CPR Prior to ICU Admission), PRE (Previous Admission to an ICU within 6 Months), TYP (type of admission), FRA (Long Bone, Multiple, Neck, Single Area, or Hip Fracture), PO2 (PO2 from Initial Blood Gases), PH (PH from Initial Blood Gases), PCO (PCO2 from initial Blood Gases), BIC (Bicarbonate from Initial Blood Gases), CRE (Creatinine from Initial Blood Gases) and LOC (Level of Consciousness at ICU Admission). The continuous predictor variables are Age, SYS (Systolic Blood Pressure at ICU Admission) and HRA (Heart Rate at ICU Admission).

Table 1 shows the total number of ICU patients according to vital status. It shows 80% of patients survived and 20% died. Figure 1 shows the graphical representation of these statistics.

Status frequency

Status percentage

Survived

Died

Total

Survived (%)

Died (%)

Total (%)

160

40

200

80

20

100

Table 1 Vital status of ICU patients

Figure 1 Number of ICU patients according to vital status.

The table above shows that, 24 males died (12%) out of 124 total males and 16 females died (8%) out of 76 females at the ICU. From these results, it clearly demonstrates that males are more vulnerable to the ICU mortality as compared to females. The study also sought to determine whether ethnicity is associated with mortality. From the table above, 37 (18.5%) of 175 Whites died, 1 (0.5%) out of 15 Blacks died, and 2 (1%) out of 10 others died. The survival rate for the black patients and other races was less (7% and 4% respectively) compared to that of the white patients (69%). However, there wasn’t sufficient data in the Black and Other category to make decisive comparison. We can see 46.5% of patients were medically treated at ICU compared to 53.5% of patients who were treated surgically. There was no incidence of cancer in 90% of ICU patients. Out of those, 18% died. In addition, 2% of patients who had cancer died. The above results demonstrate that the cancer is a low predictor of ICU mortality for the critically ill patients admitted to the ICU. According to these results, the presence of cancer can lead to the survival of the patient in the ICU. We notice that only 9.5% of patients had chronic renal failure compared to 95.5% who did not have that failure. The results above show that the history of chronic renal disease is not a risk factor for mortality of the critically ill patients admitted to the ICU. For the patients with a history of chronic renal failure, the mortality was 4%, which is four times lower compared to those who did not have a history of chronic renal failure which was 16%. We notice that 12% of patients who had infection at ICU admission died compared to 8% without infection who died. Infection at ICU admission is a useful predictor of mortality in the intensive care unit with those infected having a higher likelihood of death compared to the uninfected ones. Also 6.5% of ICU patients had CPR prior to ICU admission. The percentage of patients with CPR prior to the ICU admission who died is low compared to the percentage of patients without CPR. It can be seen 15% of patients had previous admission to an ICU within 6 months. The previous admission to the intensive care unit is not a predictive factor for the mortality rate in the ICU. The percentage of people with previous admissions to the ICU who died is low compared to those who had not been admitted before. We can see that there are more deaths associated with emergency admission compared to the elective admission. From the above results, we can see that the mortality rate is 19% for the emergency admission as compared to the 1% of elective admission. The only 15 patients had fracture and 3 of them died. In contrast 37 patients out of 185 who did not have fracture died. The majority of patients at ICU had PO2 from initial blood gases greater than 60. The mortality rate for patients whose PO2 is greater than 60 is 17.5%. Patients whose initial blood gases’ PH was higher than 7.25 showed higher mortality rate that was twice that of the patients whose initial blood gases’ PH was below 7.25. In addition, the mortality rate for patients whose PCO2 from initial blood gases less than 45 is higher than those whose PCO2 is greater than 45. The mortality rate of patients whose Bicarbonate from initial blood gases was greater than 18 is seven times the mortality rate of patients whose Bicarbonate from initial blood gases was less than 18. It can be seen the ICU patients whose creatinine level from initial blood gases was greater than 2.0 have a higher mortality rate. The results above show that the mortality rate of ICU patients who had deep stupor or coma was high. Patients who had no coma survived with a probability of more than 85%.

The results in Table 2 demonstrate that age plays a significant role in the admission of patients to the ICU.

Table 3 and Figure 2 show that patients’ age is between 16 – 92 years old and the majority of ICU patients are between 40 – 80 years old. In addition, the boxplot shows the mean age of died patients is 70.

Status frequency

Status percentage

Survived

Died

Total

Survived (%)

Died (%)

Total (%)

Gender

Male

100

24

124

50

12

62

Female

60

16

76

30

8

38

Race

White

138

37

175

69

18.5

87.5

Black

14

1

15

7

0.5

7.5

Other

8

2

10

4

1

5

Service

Medical

67

26

93

33.5

13

46.5

Surgical

93

14

107

46.5

7

53.5

Cancer

No

144

36

180

72

18

90

Yes

16

4

20

8

2

10

Chronic

No

149

32

181

74.5

16

90.5

Yes

11

8

19

5.5

4

9.5

Infection

No

100

16

116

50

8

58

Yes

60

24

84

30

12

42

CPR

No

154

33

187

77

16.5

93.5

Yes

6

7

13

3

3.5

6.5

PRE

No

137

33

170

68.5

16.5

85

Yes

23

7

30

11.5

3.5

15

Type

Elective

51

2

53

25.5

1

26.5

Emergency

109

38

147

54.5

19

73.5

Fracture

No

148

37

185

74

18.5

92.5

Yes

12

3

15

6

1.5

7.5

PO2

> 60

149

35

184

74.5

17.5

92

< 60

11

5

16

5.5

2.5

8

PH

> 7.25

151

36

187

75.5

4.5

80

< 7.25

9

4

13

18

2

20

PCO2

< 45

144

36

180

72

18

90

> 45

16

4

20

8

2

10

BIC

> 18

150

35

185

75

17.5

92.5

< 18

10

5

15

5

2.5

7.5

CRE

< 2.0

155

35

190

77.5

17.5

95

> 2.0

5

5

10

2.5

2.5

5

LOC

No coma

158

27

185

79

13.5

92.5

Deep stupor

0

5

5

0

2.5

2.5

Coma

2

8

10

1

4

5

Table 2 Categorical variables and vital status of ICU patients

Summary

Min.

1st Qu.

Median

Mean

3rd Qu.

Max.

Age

16

46.75

63

57.54

72

92

Systolic blood pressure

36

110

130

132.3

150

256

Heart rate

39

80

96

98.92

118.2

192

Table 3 Continuous variables summary

Figure 2 Age and vital status.

Figure 3 shows that the majority of patients were at risk who had systolic blood pressure between 120 to 140. The mean of systolic blood pressure of the patients at ICU is 132.3 mmHg. The boxplot shows a patient Figure 4 who had 256 mmHg died.

Figure 3 Systolic blood pressure and vital status.

Figure 4 Heart rate at ICU admission and vital status.

Objective identification of risk factors of ICU mortality

Since the response variable in our data (vital status) is binary, binary logistic regression is an appropriate model to consider. Logistic regression is a predictive analysis technique used to illustrate the relationship between a binary response variable and the predictors using the regression on the logarithm of the odds of having a response:7,8

Let Y i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaamywaKGbaoaaBaaajuaGbaqcLbmacaWGPbaajuaGbeaaaaa@3B00@ be the binary response variable for i th MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaamyAaKGbaoaaCaaajuaGbeqaaKqzadGaamiDaiaadIgaaaaa aa@3B7B@ patient, with
Y i =0 MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaamywaKGbaoaaBaaajuaGbaqcLbmacaWGPbaajuaGbeaacqGH 9aqpcaaIWaaaaa@3CBF@ , the patient survived
Y i =1 MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaamywaKGbaoaaBaaajuaGbaqcLbmacaWGPbaajuaGbeaacqGH 9aqpcaaIXaaaaa@3CC0@ , the patient died
then, logit ( π i )= log( π i 1  π i ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeWaaeWaaeaacqaHapaCpaWaaSbaaeaapeGaamyAaaWdaeqaaaWd biaawIcacaGLPaaacqGH9aqpcaGGGcGaciiBaiaac+gacaGGNbWaae WaaeaadaWcaaWdaeaapeGaeqiWda3damaaBaaabaWdbiaadMgaa8aa beaaaeaapeGaaGymaiabgkHiTiaacckacqaHapaCpaWaaSbaaeaape GaamyAaaWdaeqaaaaaa8qacaGLOaGaayzkaaaaaa@4A5E@
β 0 + β 1 x i1 + + β k x ik MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeqOSdiwcga4damaaBaaabaWdbiaaicdaa8aabeaajuaGcqGH RaWkpeGaeqOSdiwcga4damaaBaaabaWdbiaaigdaa8aabeaajuaGpe GaamiEa8aadaWgaaqcgayaa8qacaWGPbGaaGymaaqcfa4daeqaaiab gUcaR8qacqGHMacVcqGHMacVcqGHMacVcaGGGcGaey4kaSIaeqOSdi 2damaaBaaajyaGbaWdbiaadUgaaKqba+aabeaapeGaamiEaKGba+aa daWgaaqaa8qacaWGPbGaam4AaaWdaeqaaaaa@5245@
 Xβ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeyisISRaaiiOamrr1ngBPrwtHrhAXaqeguuDJXwAKbstHrhA G8KBLbacfaGae83fXJLaeqOSdigaaa@461F@
Where,
π i = e Xβ 1+  e Xβ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeqiWdaxcga4damaaBaaabaWdbiaadMgaa8aabeaajuaGcqGH 9aqppeWaaSaaa8aabaWdbiaadwgapaWaaWbaaeqajyaGbaWefv3ySL gznfgDOfdaryqr1ngBPrginfgDObYtUvgaiuaapeGae83fXJLaeqOS digaaaqcfa4daeaapeGaaGymaiabgUcaRiaacckacaWGLbWdamaaCa aabeqcgayaa8qacqWFxepwcqaHYoGyaaaaaaaa@530E@

We consider multiple variable selection methods in order to identify risk factors in an objective manner. These are manually picked best model, forward selection, backward elimination and least absolute shrinkage and selection operator (LASSO). We discuss these methods briefly as follows.

Manually picked best model

In order to identify the risk factors for the vital status at ICU, which is a binary response variable, we started with the process of manually picked best model. To correctly leverage this method, we begin with binary logistic regression model. First, we fit a model with all predictor variables. Next we pick the significant variables (in this case risk factors) with the smallest P-value (P <.10) manually and remove all the factors which are insignificant. We then refit the model with all the significant variables in the model.

Forward selection

For forward selection, a null model (a model with no predictors), serves as the starting point. We add one variable at a time to the null model and refitted the model including the added variables. The idea was to keep it if the variable that had been added was significant and then add the next variable. If not, we eliminated it and added the next variable. We refitted the model using the same procedure until the stopping rule was satisfied (all the variables in the model are significant).

Backward selection

Backward elimination method starts with a full model that contains all the predictors in the model. The least significant risk factors; that is, the ones having the largest P value (greater than 10%) are eliminated, and the model is then refitted. Each step removes the least significant variable from the model until the remaining variables have their P values smaller than the specified 0.10.

Least Absolute Shrinkage and Selection Operator (LASSO)

With this method, there is an automatic selection of predictors of the target variable from the large set of potential predictors. By doing so, the method will return the coefficients of the irrelevant variables to zero thereby performing an automatic selection of variables. The LASSO formulates a curve fitting as a quadratic programming problem with the objective function that penalizes the absolute size of the coefficients based on a value of a tuning parameter, say λ  MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeq4UdWMaaiiOaaaa@3911@ . The method, therefore, shrinks the size of the nonzero coefficients and ends up with the most useful variables.

 logit ( π i )= log(   π i 1  π i ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeWaaeWaaeaacqaHapaCjyaGpaWaaSbaaeaapeGaamyAaaWdaeqa aaqcfa4dbiaawIcacaGLPaaacqGH9aqpcaqGGaGaamiBaiaad+gaca WGNbWaaeWaaeaacaGGGcWaaSaaa8aabaWdbiabec8aW9aadaWgaaqc gayaa8qacaWGPbaajuaGpaqabaaabaWdbiaaigdacqGHsislcaGGGc GaeqiWdaxcga4damaaBaaabaWdbiaadMgaa8aabeaaaaaajuaGpeGa ayjkaiaawMcaaaaa@4E59@
β 0 + β 1 x i1 + + β k x ik + LASSO penalty MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeqOSdiwcga4damaaBaaabaWdbiaaicdaa8aabeaajuaGcqGH RaWkpeGaeqOSdi2damaaBaaajyaGbaWdbiaaigdaaKqba+aabeaape GaamiEaKGba+aadaWgaaqaa8qacaWGPbGaaGymaaWdaeqaaKqbakab gUcaR8qacqGHMacVcqGHMacVcqGHMacVcaGGGcGaey4kaSIaeqOSdi 2damaaBaaajyaGbaWdbiaadUgaaKqba+aabeaapeGaamiEa8aadaWg aaqaa8qacaWGPbqcgaOaam4Aaaqcfa4daeqaa8qacqGHRaWkcaqGGa GaamitaiaadgeacaWGtbGaam4uaiaad+eacaqGGaGaamiCaiaadwga caWGUbGaamyyaiaadYgacaWG0bGaamyEaaaa@5FC5@
 Xβ + LASSO penalty MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeyisISRaaiiOamrr1ngBPrwtHrhAXaqeguuDJXwAKbstHrhA G8KBLbacfaGae83fXJLaeqOSdiMaaiiOaiabgUcaRiaabccacaWGmb GaamyqaiaadofacaWGtbGaam4taiaabccacaWGWbGaamyzaiaad6ga caWGHbGaamiBaiaadshacaWG5baaaa@5425@
LASSO penalty =λ  j=1 p | β j | MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaamitaiaadgeacaWGtbGaam4uaiaad+eacaqGGaGaamiCaiaa dwgacaWGUbGaamyyaiaadYgacaWG0bGaamyEaiaabccacqGH9aqpcq aH7oaBcaGGGcWaaybCaeqajyaGpaqaa8qacaWGQbGaeyypa0JaaGym aaWdaeaapeGaamiCaaqcfa4daeaapeGaeyyeIuoaamaaemaapaqaa8 qacqaHYoGypaWaaSbaaKGbagaapeGaamOAaaqcfa4daeqaaaWdbiaa wEa7caGLiWoaaaa@54B6@
Where,
π i  =  e Xβ+ λ  j=1 p | β j | 1+  e Xβ+ λ  j=1 p | β j | MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeqiWdaxcga4damaaBaaabaWdbiaadMgaa8aabeaajuaGpeGa aeiOaiabg2da9iaabckadaWcaaWdaeaapeGaamyza8aadaahaaqabe aatuuDJXwAK1uy0HwmaeHbfv3ySLgzG0uy0Hgip5wzaGqbaKGba+qa cqWFxepwcqaHYoGyjuaGcqGHRaWkcaqGGcGaeq4UdWMaaiiOamaava dabeqcga4daeaapeGaamOAaiabg2da9iaaigdaa8aabaWdbiaadcha aKqba+aabaWdbiabggHiLdaadaabdaWdaeaapeGaeqOSdi2damaaBa aajyaGbaWdbiaadQgaaKqba+aabeaaa8qacaGLhWUaayjcSdaaaaWd aeaapeGaaGymaiabgUcaRiaacckacaWGLbWdamaaCaaabeqaaKGba+ qacqWFxepwcqaHYoGyjuaGcqGHRaWkcaqGGcGaeq4UdWMaaiiOamaa vadabeqcga4daeaapeGaamOAaiabg2da9iaaigdaa8aabaWdbiaadc haaKqba+aabaWdbiabggHiLdaadaabdaWdaeaapeGaeqOSdi2damaa BaaajyaGbaWdbiaadQgaaKqba+aabeaaa8qacaGLhWUaayjcSdaaaa aaaaa@7C55@
π i > 0.5 ,  y i = 1 ( dead ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeqiWdaxcga4damaaBaaabaWdbiaadMgaa8aabeaajuaGpeGa eyOpa4JaaeiiaiaaicdacaGGUaGaaGynaiaabccacaGGSaGaaiiOai qadMhagaWeaKGba+aadaWgaaqaa8qacaWGPbaapaqabaqcfa4dbiab g2da9iaabccacaaIXaGaaeiia8aadaqadaqaa8qacaWGKbGaamyzai aadggacaWGKbaapaGaayjkaiaawMcaaaaa@4C8F@

With the predicted probability of the binary response, π i   MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeqiWdaxcga4damaaBaaabaWdbiaadMgaa8aabeaajuaGpeGa aeiOaaaa@3B83@  we can predict the response itself using the above cut point.
Thus, if the predicted probability of dying is greater than 0.5, we coded the predicted response as 1 (died) and 0 (survived) otherwise.

Validation and k-fold cross validation approach

Validation/cross validation approach is an objective methodology to select an optimal strategy. To select the best model from the variable selection models selected by four strategies, we implement validation and cross validation. We applied k-fold cross validation approach on the final model of manually picked best model, forward selection and backward elimination to create the confusion matrices and calculate the error rate of each method. Since we have 200 observations, we decided to use 5 folds which split the data into two; training and testing datasets. Training set had 160 observations and testing set had 40 observations (total 5 sets each having 40 observations). k-fold cross-validation approach is applicable where the original sample is partitioned at random into k subsamples and one is left out in every iteration step. Let k parts be C 1 , C 2 ., C K , MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaam4qaKGba+aadaWgaaqcfayaaKqzadWdbiaaigdaaKqba+aa beaapeGaaiilaiaadoeapaWaaSbaaeaajugWa8qacaaIYaaajuaGpa qabaWdbiabgAci8kaac6cacaGGSaGaam4qa8aadaWgaaqaaKqzadWd biaadUeaaKqba+aabeaacaGGSaaaaa@4686@ where C i MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaam4qaKGba+aadaWgaaqcfayaaKqzadWdbiaadMgaaKqba+aa beaaaaa@3B18@ denotes the indices of the observations in part i. We have the following formula to estimate error rate:

CV=  i=1 K n i n  ( MS E i ) MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaam4qaiaadAfacqGH9aqpcaGGGcWaaybCaeqapaqaaKqzadWd biaadMgacqGH9aqpcaaIXaaajuaGpaqaaKqzadWdbiaadUeaaKqba+ aabaWdbiabggHiLdaadaWcaaWdaeaapeGaamOBa8aadaWgaaqaaKqz adWdbiaadMgaaKqba+aabeaaaeaapeGaamOBaaaacaGGGcWaaeWaa8 aabaWdbiaad2eacaWGtbGaamyra8aadaWgaaqaaKqzadWdbiaadMga aKqba+aabeaaa8qacaGLOaGaayzkaaaaaa@50F4@

Where MS E k  =  i C i ( y i   y i ) 2  / n i MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaamytaiaadofacaWGfbqcga4damaaBaaajuaGbaqcLbmapeGa am4Aaaqcfa4daeqaa8qacaqGGcGaeyypa0JaaeiOamaawafabeWdae aapeGaamyAaiabgIGiolaadoeajyaGpaWaaSbaaKqbagaajugWa8qa caWGPbaajuaGpaqabaaapeqab8aabaWdbiabggHiLdaadaqadaWdae aapeGaamyEaKGba+aadaWgaaqcfayaaKqzadWdbiaadMgaaKqba+aa beaapeGaeyOeI0IaaiiOa8aadaWfGaqaa8qaceWG5bGbambajyaGpa WaaSbaaKqbagaajugWa8qacaWGPbaajuaGpaqabaaabeqaaaaaa8qa caGLOaGaayzkaaqcga4damaaCaaajuaGbeqaaKqzadWdbiaaikdaaa qcfaOaaeiOaiaac+cacaWGUbqcga4damaaBaaajuaGbaqcLbmapeGa amyAaaqcfa4daeqaaaaa@62FF@ , y i MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGabmyEayaataqcga4damaaBaaajuaGbaqcLbmapeGaamyAaaqc fa4daeqaaaaa@3B67@  is the fit for observation i, and n i =  n K MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaamOBaKGba+aadaWgaaqcfayaaKqzadWdbiaadMgaaKqba+aa beaacqGH9aqppeGaaiiOamaaliaapaqaa8qacaWGUbaapaqaa8qaca WGlbaaaaaa@3F8F@ . For this study, n=200, K= 5. So, there are 200/5 parts of 40. The data is split to two groups of testing and training: testing = 40 and training = 160. We fit logistic models on the training data sets and calculate misclassification error rate on the test data. In addition, we conducted k- fold cross validation and validation set on LASSO and compared the results of each method. Since we have 200 observations, we decided to use validation set which split the data into two; training and testing datasets. Training set had 100 observations and testing set had 100 observations (total 2 sets each having 100 observations).

The validation set error rate is determined using:

CV=  1 n   i=1 n [ e i 1 h i ] 2 MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaam4qaiaadAfacqGH9aqpcaGGGcWaaSaaa8aabaWdbiaaigda a8aabaWdbiaad6gaaaGaaiiOamaawahabeWdaeaajugWa8qacaWGPb Gaeyypa0JaaGymaaqcfa4daeaajugWa8qacaWGUbaajuaGpaqaa8qa cqGHris5aaWaamWaa8aabaWdbmaaliaapaqaa8qacaWGLbqcga4dam aaBaaajuaGbaqcLbmapeGaamyAaaqcfa4daeqaaaqaa8qacaaIXaGa eyOeI0IaamiAaKGba+aadaWgaaqcfayaaKqzadWdbiaadMgaaKqba+ aabeaaaaaapeGaay5waiaaw2faaKGba+aadaahaaqcfayabeaajugW a8qacaaIYaaaaaaa@5824@

Where e i MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaamyzaKGba+aadaWgaaqcfayaaKqzadWdbiaadMgaaKqba+aa beaaaaa@3B39@ is the residual obtained from fitting a model to all the n observations.

Results and discussion

Manually picked best model

In this method, we fit a model with all predictor variables. We pick the significant variables (in this case risk factors) with the smallest P-value (P < 0.10) manually and remove all the factors which are insignificant. We then refit the model with all the significant variables in the model. Table 4 shows the significant and insignificant predictors of the manually picked best model.

Estimate

Std. error

z value

P-value

Significance

(Intercept)

-6.75128

1.29885

-5.198

2.02E-07

 

AGE

0.04018

0.01311

3.066

0.00217

Significant

CAN

2.14668

0.84582

2.538

0.01115

Significant

TYP

2.81592

0.89512

3.146

0.00166

Significant

PH

1.7683

0.85459

2.069

0.03853

Significant

PCO

-2.13254

0.98844

-2.157

0.03097

Significant

LOC

2.3089

0.57504

4.015

5.94E-05

Significant

Table 4 Model 1 (all variables)

We refit the model with the significant predictors (risk factors in this case) and eliminate the insignificant predictors all together.

Table 5 below shows the final model with all significant variables. It shows that the risk factors age, cancer part of present problem, type of admission, PH from Initial blood gases, PCO2 from initial blood gases and Level of consciousness at ICU admission as the risk factors.9,10

Estimate

Std. error

z value

P-value

Significance

(Intercept)

-6.75128

1.29885

-5.198

2.02E-07

 

AGE

0.04018

0.01311

3.066

0.00217

Significant

CAN

2.14668

0.84582

2.538

0.01115

Significant

TYP

2.81592

0.89512

3.146

0.00166

Significant

PH

1.7683

0.85459

2.069

0.03853

Significant

PCO

-2.13254

0.98844

-2.157

0.03097

Significant

LOC

2.3089

0.57504

4.015

5.94E-05

Significant

Table 5 Model 2 (final model of manually picked best model)

From Table 5, our model equation can be written as:
logit ( π i )= log( π i 1  π i ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeWaaeWaaeaacqaHapaCpaWaaSbaaeaapeGaamyAaaWdaeqaaaWd biaawIcacaGLPaaacqGH9aqpcaGGGcGaciiBaiaac+gacaGGNbWaae WaaeaadaWcaaWdaeaapeGaeqiWda3damaaBaaabaWdbiaadMgaa8aa beaaaeaapeGaaGymaiabgkHiTiaacckacqaHapaCpaWaaSbaaeaape GaamyAaaWdaeqaaaaaa8qacaGLOaGaayzkaaaaaa@4A5E@
= β 0 + β 1 x 1 +  β 2 x 2 β 3 x 3 + β 4 x 4 + β 5 x 5 + β 6 x 6  Xβ MathType@MTEF@5@5@+= feaagKart1ev2aqatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeyypa0JaeqOSdi2damaaBaaabaqcLbmapeGaaGimaaqcfa4d aeqaaiabgUcaR8qacqaHYoGypaWaaSbaaKqbGeaapeGaaGymaaWdae qaaKqba+qacaWG4bWdamaaBaaajuaibaWdbiaaigdaa8aabeaajuaG cqGHRaWkpeGaaiiOaiabek7aI9aadaWgaaqcfasaa8qacaaIYaaapa qabaqcfa4dbiaadIhapaWaaSbaaKqbGeaapeGaaGOmaaqcfa4daeqa aiabgkHiT8qacqaHYoGypaWaaSbaaKqbGeaapeGaaG4maaqcfa4dae qaa8qacaWG4bWdamaaBaaajuaibaWdbiaaiodaaKqba+aabeaacqGH RaWkpeGaeqOSdi2damaaBaaajuaibaWdbiaaisdaaKqba+aabeaape GaamiEa8aadaWgaaqcfasaa8qacaaI0aaajuaGpaqabaGaey4kaSYd biabek7aI9aadaWgaaqcfasaa8qacaaI1aaajuaGpaqabaWdbiaadI hapaWaaSbaaKqbGeaapeGaaGynaaWdaeqaaKqbakabgUcaR8qacqaH YoGypaWaaSbaaKqbGeaapeGaaGOnaaWdaeqaaKqba+qacaWG4bWdam aaBaaajuaibaWdbiaaiAdaa8aabeaajuaGpeGaeyisISRaaiiOamrr 1ngBPrwtHrhAXaqeguuDJXwAKbstHrhAG8KBLbacfaGae83fXJLaeq OSdigaaa@77F0@
                = 6.75128 + 0.04018 AGE + 2.14668 CAN + 2.81592 TYP  + 1.76830 PH + ( 2.13254 ) PCO+ 2.30890 LOC Xβ MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOabaeqabaqcfaieaa aaaaaaa8qacqGH9aqpcaqGGaGaeyOeI0IaaGOnaiaac6cacaaI3aGa aGynaiaaigdacaaIYaGaaGioaiaabccacqGHRaWkcaqGGaGaaGimai aac6cacaaIWaGaaGinaiaaicdacaaIXaGaaGioaiaabccacaWGbbGa am4raiaadweacaqGGaGaey4kaSIaaeiiaiaaikdacaGGUaGaaGymai aaisdacaaI2aGaaGOnaiaaiIdacaqGGaGaam4qaiaadgeacaWGobGa aeiiaiabgUcaRiaabccacaaIYaGaaiOlaiaaiIdacaaIXaGaaGynai aaiMdacaaIYaGaaeiiaiaadsfacaWGzbGaamiuaaGcbaqcfaOaaeii aiabgUcaRiaabccacaaIXaGaaiOlaiaaiEdacaaI2aGaaGioaiaaio dacaaIWaGaaeiiaiaadcfacaWGibGaaeiiaiabgUcaRiaabccapaWa aeWaaeaapeGaeyOeI0IaaGOmaiaac6cacaaIXaGaaG4maiaaikdaca aI1aGaaGinaaWdaiaawIcacaGLPaaapeGaaeiiaiaadcfacaWGdbGa am4taiabgUcaRiaabccacaaIYaGaaiOlaiaaiodacaaIWaGaaGioai aaiMdacaaIWaGaaeiiaiaadYeacaWGpbGaam4qaiabgIKi7kaaccka tuuDJXwAK1uy0HwmaeHbfv3ySLgzG0uy0Hgip5wzaGqbaiab=Dr8yj abek7aIbaaaa@8DF1@  
π i =  e Xβ 1+  e Xβ MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeqiWdaxcga4damaaBaaajuaGbaqcLbmapeGaamyAaaqcfa4d aeqaaiabg2da9KGba+qacaGGGcqcfa4aaSaaa8aabaWdbiaadwgapa WaaWbaaeqabaWefv3ySLgznfgDOfdaryqr1ngBPrginfgDObYtUvga iuaajugWa8qacqWFxepwcqaHYoGyaaaajuaGpaqaa8qacaaIXaGaey 4kaSIaaiiOaiaadwgajyaGpaWaaWbaaKqbagqabaqcLbmapeGae83f XJLaeqOSdigaaaaaaaa@5965@
Considering the median age =63, presence of cancer =1, type of admission was elective =0, had no coma or stupor =0, and keep the rest factors fixed, we found:
π i =  e Xβ 1+  e Xβ  = 11.2% MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeqiWdaxcga4damaaBaaajuaGbaqcLbmapeGaamyAaaqcfa4d aeqaaiabg2da9KGba+qacaGGGcqcfa4aaSaaa8aabaWdbiaadwgapa WaaWbaaeqabaWefv3ySLgznfgDOfdaryqr1ngBPrginfgDObYtUvga iuaajugWa8qacqWFxepwcqaHYoGyaaaajuaGpaqaa8qacaaIXaGaey 4kaSIaaiiOaiaadwgajyaGpaWaaWbaaKqbagqabaqcLbmapeGae83f XJLaeqOSdigaaaaajyaGcaGGGcqcfaOaeyypa0tcgaOaaiiOaKqbak aaigdacaaIXaGaaiOlaiaaikdacaGGLaaaaa@627A@ chance of mortality.
Considering the same as above except the type of admission was emergency =1, we found: π i =  e Xβ 1+  e Xβ  = 67.8% MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeqiWdaxcga4damaaBaaajuaGbaqcLbmapeGaamyAaaqcfa4d aeqaaiabg2da9KGba+qacaGGGcqcfa4aaSaaa8aabaWdbiaadwgapa WaaWbaaeqabaWefv3ySLgznfgDOfdaryqr1ngBPrginfgDObYtUvga iuaajugWa8qacqWFxepwcqaHYoGyaaaajuaGpaqaa8qacaaIXaGaey 4kaSIaaiiOaiaadwgajyaGpaWaaWbaaKqbagqabaqcLbmapeGae83f XJLaeqOSdigaaaaajuaGcaGGGcGaeyypa0JaaiiOaiaaiAdacaaI3a GaaiOlaiaaiIdacaGGLaaaaa@60DF@ chance of mortality.

In addition, if type of admission is emergency =1, had coma =2, and keep the remaining factors fixed we found the chance of mortality increased from 67.8% to 99.5%.

Forward selection model

For forward selection, a null model (a model contains no predictors), served as the starting point. We added one variable at a time to the null model and refitted the model including the added variables. The idea was to keep it if the variable that had been added was significant and then add the next variable. If not, we eliminated it and added the next variable. We refitted the model using the same procedure until the stopping rule was satisfied (all the variables in the model are significant meeting the level of 10%).

The table above shows the final model of forward selection method. It shows that the risk factors age, type of admission and level of consciousness at ICU admission statistically significant for the ICU status.
From Table 6, our model equation can be written as:

Estimate

Std. error

z value

P-value

Significance

(Intercept)

-5.52063

1.09373

-5.048

4.48E-07

 

AGE

0.03291

0.01179

2.791

0.005247

Significant

TYP

2.18842

0.76276

2.869

0.004117

Significant

LOC

1.83445

0.51609

3.555

0.000379

Significant

Table 6 Final model of forward selection

logit ( π i )= log( π i 1  π i ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeWaaeWaaeaacqaHapaCpaWaaSbaaeaapeGaamyAaaWdaeqaaaWd biaawIcacaGLPaaacqGH9aqpcaGGGcGaciiBaiaac+gacaGGNbWaae WaaeaadaWcaaWdaeaapeGaeqiWda3damaaBaaabaWdbiaadMgaa8aa beaaaeaapeGaaGymaiabgkHiTiaacckacqaHapaCpaWaaSbaaeaape GaamyAaaWdaeqaaaaaa8qacaGLOaGaayzkaaaaaa@4A5E@
= β 0 + β 1 x 1 +  β 2 x 2 + β 3 x 3 MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeyypa0JaeqOSdi2damaaBaaabaqcLbmapeGaaGimaaqcfa4d aeqaaiabgUcaR8qacqaHYoGypaWaaSbaaKqbGeaapeGaaGymaaWdae qaaKqba+qacaWG4bWdamaaBaaajuaibaWdbiaaigdaa8aabeaajuaG cqGHRaWkpeGaaiiOaiabek7aI9aadaWgaaqcfasaa8qacaaIYaaapa qabaqcfa4dbiaadIhapaWaaSbaaKqbGeaapeGaaGOmaaqcfa4daeqa aiabgUcaR8qacqaHYoGypaWaaSbaaKqbGeaapeGaaG4maaqcfa4dae qaa8qacaWG4bWdamaaBaaajuaibaWdbiaaiodaaKqba+aabeaaaaa@5253@
= 5.52063 + 0.03291 AGE + 2.18842 TYP + 1.83445 LOC  Xβ MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOabaeqabaqcfaieaa aaaaaaa8qacqGH9aqpcaqGGaGaeyOeI0IaaGynaiaac6cacaaI1aGa aGOmaiaaicdacaaI2aGaaG4maiaabccacqGHRaWkcaqGGaGaaGimai aac6cacaaIWaGaaG4maiaaikdacaaI5aGaaGymaiaabccacaWGbbGa am4raiaadweacaqGGaGaey4kaSIaaeiiaiaaikdacaGGUaGaaGymai aaiIdacaaI4aGaaGinaiaaikdacaqGGaGaamivaiaadMfacaWGqbGa aeiiaiabgUcaRiaabccacaaIXaGaaiOlaiaaiIdacaaIZaGaaGinai aaisdacaaI1aGaaeiiaiaadYeacaWGpbGaam4qaaqcgayaaKqbakab gIKi7kaacckatuuDJXwAK1uy0HwmaeHbfv3ySLgzG0uy0Hgip5wzaG qbaiab=Dr8yjabek7aIbaaaa@6E1C@  
Considering the median age =63, type of admission was elective=0, had no coma or stupor=0, and keep the rest factors fixed, we found:
π i =  e Xβ 1+  e Xβ  = 3.09% MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeqiWdaxcga4damaaBaaajuaGbaqcLbmapeGaamyAaaqcfa4d aeqaaiabg2da98qacaGGGcWaaSaaa8aabaWdbiaadwgapaWaaWbaae qabaWefv3ySLgznfgDOfdaryqr1ngBPrginfgDObYtUvgaiuaajugW a8qacqWFxepwcqaHYoGyaaaajuaGpaqaa8qacaaIXaGaey4kaSIaai iOaiaadwgapaWaaWbaaeqabaqcLbmapeGae83fXJLaeqOSdigaaaaa juaGcaGGGcGaeyypa0JaaiiOaiaaiodacaGGUaGaaGimaiaaiMdaca GGLaaaaa@5E9C@ chance of mortality.
Considering the same as above except the type of admission was emergency =1, we found: π i =  e Xβ 1+  e Xβ  = 22.12% MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeqiWdaxcga4damaaBaaajuaGbaqcLbmapeGaamyAaaqcfa4d aeqaaiabg2da98qacaGGGcWaaSaaa8aabaWdbiaadwgapaWaaWbaae qabaWefv3ySLgznfgDOfdaryqr1ngBPrginfgDObYtUvgaiuaajugW a8qacqWFxepwcqaHYoGyaaaajuaGpaqaa8qacaaIXaGaey4kaSIaai iOaiaadwgapaWaaWbaaeqabaqcLbmapeGae83fXJLaeqOSdigaaaaa juaGcaGGGcGaeyypa0JaaiiOaiaaikdacaaIYaGaaiOlaiaaigdaca aIYaGaaiyjaaaa@5F51@ chance of mortality.

Backward elimination model

In this method, we began with the full model which includes all predictors in the model and eliminate variables one at a time. The least significant risk factors; that is, the ones having the largest P value (greater than 10%) are eliminated, and the model is then refitted.

Table 7 shows the final model of backward elimination method. It shows that the risk factors age, cancer part of present problem, systolic blood pressure at ICU admission, type of admission, PH from initial blood gases, PCO2 from initial blood gases, and level of consciousness at ICU admission statistically significant for the ICU status.

From Table 7, backward elimination model equation can be written as:

 

Estimate

Std. error

z value

P-value

Significance

(Intercept)

-5.27888

1.55063

-3.404

0.000663

 

AGE

0.040425

0.013084

3.09

0.002004

Significant

CAN

2.16474

0.853723

2.536

0.011224

Significant

SYS

-0.01099

0.006753

-1.628

0.103512

Significant

TYP

2.75305

0.909096

3.028

0.002459

Significant

PH

1.809602

0.874858

2.068

0.038598

Significant

PCO

-2.29744

1.027075

-2.237

0.025294

Significant

LOC

2.343905

0.618393

3.79

0.00015

Significant

Table 7 Final model of backward elimination method

logit ( π i )= log( π i 1  π i ) MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeWaaeWaaeaacqaHapaCpaWaaSbaaeaapeGaamyAaaWdaeqaaaWd biaawIcacaGLPaaacqGH9aqpcaGGGcGaciiBaiaac+gacaGGNbWaae WaaeaadaWcaaWdaeaapeGaeqiWda3damaaBaaabaWdbiaadMgaa8aa beaaaeaapeGaaGymaiabgkHiTiaacckacqaHapaCpaWaaSbaaeaape GaamyAaaWdaeqaaaaaa8qacaGLOaGaayzkaaaaaa@4A5E@
= β 0 + β 1 x 1 +  β 2 x 2 + β 3 x 3 + β 4 x 4 + β 5 x 5 + β 6 x 6 + β 7 x 7 MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeyypa0JaeqOSdi2damaaBaaabaqcLbmapeGaaGimaaqcfa4d aeqaaiabgUcaR8qacqaHYoGypaWaaSbaaKqbGeaapeGaaGymaaWdae qaaKqba+qacaWG4bWdamaaBaaajuaibaWdbiaaigdaa8aabeaajuaG cqGHRaWkpeGaaiiOaiabek7aI9aadaWgaaqcfasaa8qacaaIYaaapa qabaqcfa4dbiaadIhapaWaaSbaaKqbGeaapeGaaGOmaaqcfa4daeqa aiabgUcaR8qacqaHYoGypaWaaSbaaKqbGeaapeGaaG4maaqcfa4dae qaa8qacaWG4bWdamaaBaaajuaibaWdbiaaiodaaKqba+aabeaacqGH RaWkpeGaeqOSdi2damaaBaaajuaibaWdbiaaisdaaKqba+aabeaape GaamiEa8aadaWgaaqcfasaa8qacaaI0aaajuaGpaqabaGaey4kaSYd biabek7aI9aadaWgaaqcfasaa8qacaaI1aaajuaGpaqabaWdbiaadI hapaWaaSbaaKqbGeaapeGaaGynaaWdaeqaaKqbakabgUcaR8qacqaH YoGypaWaaSbaaKqbGeaapeGaaGOnaaWdaeqaaKqba+qacaWG4bWdam aaBaaajuaibaWdbiaaiAdaa8aabeaajuaGcqGHRaWkpeGaeqOSdi2d amaaBaaajuaibaWdbiaaiEdaa8aabeaajuaGpeGaamiEa8aadaWgaa qcfasaaiaaiEdaaeqaaaaa@6E7A@
= 5.278880 + 0.040425 AGE + 2.164740 CAN + ( 0.010994 ) SYS + 2.753050 TYP  + 1.809602 PH + ( 2.297444 ) PCO + 2.343905 LOC  Xβ MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOabaeqabaqcfaieaa aaaaaaa8qacqGH9aqpcaqGGaGaeyOeI0IaaGynaiaac6cacaaIYaGa aG4naiaaiIdacaaI4aGaaGioaiaaicdacaqGGaGaey4kaSIaaeiiai aaicdacaGGUaGaaGimaiaaisdacaaIWaGaaGinaiaaikdacaaI1aGa aeiiaiaadgeacaWGhbGaamyraiaabccacqGHRaWkcaqGGaGaaGOmai aac6cacaaIXaGaaGOnaiaaisdacaaI3aGaaGinaiaaicdacaqGGaGa am4qaiaadgeacaWGobGaaeiiaiabgUcaRiaabccapaWaaeWaaeaape GaeyOeI0IaaGimaiaac6cacaaIWaGaaGymaiaaicdacaaI5aGaaGyo aiaaisdaa8aacaGLOaGaayzkaaWdbiaabccacaWGtbGaamywaiaado facaqGGaGaey4kaSIaaeiiaiaaikdacaGGUaGaaG4naiaaiwdacaaI ZaGaaGimaiaaiwdacaaIWaGaaeiiaiaadsfacaWGzbGaamiuaaqaai aacckacqGHRaWkcaqGGaGaaGymaiaac6cacaaI4aGaaGimaiaaiMda caaI2aGaaGimaiaaikdacaqGGaGaamiuaiaadIeacaqGGaGaey4kaS Iaaeiia8aadaqadaqaa8qacqGHsislcaaIYaGaaiOlaiaaikdacaaI 5aGaaG4naiaaisdacaaI0aGaaGinaaWdaiaawIcacaGLPaaapeGaae iiaiaadcfacaWGdbGaam4taiaabccacqGHRaWkcaqGGaGaaGOmaiaa c6cacaaIZaGaaGinaiaaiodacaaI5aGaaGimaiaaiwdacaqGGaGaam itaiaad+eacaWGdbaakeaajuaGcqGHijYUcaGGGcWefv3ySLgznfgD Ofdaryqr1ngBPrginfgDObYtUvgaiuaacqWFxepwcqaHYoGyaaaa@A22F@
Considering the median age =63, presence of cancer =1, median systolic blood pressure =130, type of admission is elective =0, had no coma or stupor=0, and keep the rest factors fixed, we found:
π i =  e Xβ 1+  e Xβ  = 11.96%  MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeqiWdaxcga4damaaBaaajuaGbaqcLbmapeGaamyAaaqcfa4d aeqaaiabg2da98qacaGGGcWaaSaaa8aabaWdbiaadwgapaWaaWbaae qabaWefv3ySLgznfgDOfdaryqr1ngBPrginfgDObYtUvgaiuaajugW a8qacqWFxepwcqaHYoGyaaaajuaGpaqaa8qacaaIXaGaey4kaSIaai iOaiaadwgapaWaaWbaaeqabaqcLbmapeGae83fXJLaeqOSdigaaaaa juaGcaGGGcGaeyypa0JaaiiOaiaaigdacaaIXaGaaiOlaiaaiMdaca aI2aGaaiyjaiaacckaaaa@607F@ chance of mortality.
Considering the same as above except the type of admission was emergency =1, we found: π i =  e Xβ 1+  e Xβ  = 68.06% MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbacbaaaaaaa aapeGaeqiWdaxcga4damaaBaaajuaGbaqcLbmapeGaamyAaaqcfa4d aeqaaiabg2da98qacaGGGcWaaSaaa8aabaWdbiaadwgapaWaaWbaae qabaWefv3ySLgznfgDOfdaryqr1ngBPrginfgDObYtUvgaiuaajugW a8qacqWFxepwcqaHYoGyaaaajuaGpaqaa8qacaaIXaGaey4kaSIaai iOaiaadwgapaWaaWbaaeqabaqcLbmapeGae83fXJLaeqOSdigaaaaa juaGcaGGGcGaeyypa0JaaiiOaiaaiAdacaaI4aGaaiOlaiaaicdaca aI2aGaaiyjaaaa@5F5E@ chance of mortality.

Least absolute shrinkage and selection operator (LASSO)

We fit LASSO with validation set approach. The results are presented in Table 8.

Table 8 shows the final model of LASSO (validation set). We can see type of admission, and level of consciousness at ICU admission as the risk factors.

Coefficient

(Intercept)

0.031276

AGE

0.001951

GENDER

 

RACE

 

SER

 

CAN

0.001493

CRN

0.017163

INF

0.018822

CPR

 

SYS

-0.00054

HRA

 

PRE

 

TYP

0.111793

FRA

 

PO2

 

PH

 

PCO

 

BIC

 

CRE

0.020118

LOC

0.277094

Table 8 Final model of LASSO (validation set)

We also fitted LASSO with applying 5-fold cross validation approach. Table 9 has the results of this approach.
Table 9 shows the final model of LASSO (5-fold cross validation). Cancer part of present problem, previous admission to an ICU within 6 months, type of admission, PH from initial blood gases, PCO2 from initial blood gases, and level of consciousness at ICU admission are identified as the risk factors by this approach.

coefficient

(Intercept)

-0.09257

AGE

0.003733

GENDER

-0.02657

RACE

 

SER

-0.02175

CAN

0.178869

CRN

0.016181

INF

0.02257

CPR

0.036143

SYS

-0.00092

HRA

 

PRE

0.068878

TYP

0.191538

FRA

0.019172

PO2

0.006562

PH

0.137662

PCO

-0.15182

BIC

 

CRE

0.035652

LOC

0.329276

Table 9 Final model of LASSO (5-fold cross validation)

Misclassification error rate

The cross-validation approach allows to compute misclassification error rate by calculating the confusion matrix. Tables 10 – 14 present confusion matrices for manually picked best model, forward selection model, backward elimination and LASSO.

From the confusion matrix in Table 10, we calculate the misclassification error rate of manually picked best model as (   7+68 200 )*100=37.5% MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbaoaabmaaba aeaaaaaaaaa8qacaGGGcWaaSaaa8aabaWdbiaaiEdacqGHRaWkcaaI 2aGaaGioaaWdaeaapeGaaGOmaiaaicdacaaIWaaaaaWdaiaawIcaca GLPaaacaGGQaGaaGymaiaaicdacaaIWaGaeyypa0JaaG4maiaaiEda caGGUaGaaGynaiaacwcaaaa@4613@ .

Actual

Predicted

0

1

0

92

7

1

68

33

Table 10 Manually picked best model confusion matrix

From the confusion matrix in Table 11, we calculate the misclassification error rate of forward selection model as (   4+80 200 )*100=42.0% MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbaoaabmaaba aeaaaaaaaaa8qacaGGGcWaaSaaa8aabaWdbiaaisdacqGHRaWkcaaI 4aGaaGimaaWdaeaapeGaaGOmaiaaicdacaaIWaaaaaWdaiaawIcaca GLPaaacaGGQaGaaGymaiaaicdacaaIWaGaeyypa0JaaGinaiaaikda caGGUaGaaGimaiaacwcaaaa@4601@ .

Actual

Predicted

0

1

0

80

4

1

80

36

Table 11 Forward selection confusion matrix

From the confusion matrix in Table 12, we calculate the misclassification error rate of backward elimination model as (   7+68 200 )*100=37.5% MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbaoaabmaaba aeaaaaaaaaa8qacaGGGcWaaSaaa8aabaWdbiaaiEdacqGHRaWkcaaI 2aGaaGioaaWdaeaapeGaaGOmaiaaicdacaaIWaaaaaWdaiaawIcaca GLPaaacaGGQaGaaGymaiaaicdacaaIWaGaeyypa0JaaG4maiaaiEda caGGUaGaaGynaiaacwcaaaa@4613@ .

Actual

Predicted

0

1

0

92

7

1

68

33

Table 12 Backward elimination confusion matrix

From the confusion matrix in Table 13, we calculate the misclassification error rate of LASSO under validation set approach as (   30+2 200 )*100=16.0% MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbaoaabmaaba aeaaaaaaaaa8qacaGGGcWaaSaaa8aabaWdbiaaiodacaaIWaGaey4k aSIaaGOmaaWdaeaapeGaaGOmaiaaicdacaaIWaaaaaWdaiaawIcaca GLPaaacaGGQaGaaGymaiaaicdacaaIWaGaeyypa0JaaGymaiaaiAda caGGUaGaaGimaiaacwcaaaa@45FB@ .

Actual

Predicted

0

4

0

158

30

1

2

10

Table 13 LASSO confusion matrix “validation set”

From the confusion matrix in Table 14, we calculate the misclassification error rate of LASSO under 5-fold cross-validation as (   27+2 200 )*100=14.5% MathType@MTEF@5@5@+= feaagKart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2CaerbuLwBLn hiov2DGi1BTfMBaeXatLxBI9gBaerbd9wDYLwzYbItLDharqqtubsr 4rNCHbGeaGqiVu0Je9sqqrpepC0xMi=hEeeu0xXdbba9frFj0=OqFf ea0dXdd9vqaq=JfrVkFHe9pgea0dXdar=Jb9hs0dXdbPYxe9vr0=vr 0=vqpWqaaeaabiGaciaacaqabeaadaqaaqaaaOqaaKqbaoaabmaaba aeaaaaaaaaa8qacaGGGcWaaSaaa8aabaWdbiaaikdacaaI3aGaey4k aSIaaGOmaaWdaeaapeGaaGOmaiaaicdacaaIWaaaaaWdaiaawIcaca GLPaaacaGGQaGaaGymaiaaicdacaaIWaGaeyypa0JaaGymaiaaisda caGGUaGaaGynaiaacwcaaaa@4604@ .

Actual

Predicted

0

1

0

158

27

1

2

13

Table 14 LASSO confusion matrix “5-fold”

From the results above we conclude that, LASSO (with applying 5-fold cross validation) is the best model for identifying the risk factors associated with the ICU mortality with the lowest error rate (14.5%). We can see cancer part of present problem, previous admission to an ICU within 6 months, type of admission, PH from initial blood gases, PCO2 from initial blood gases, and level of consciousness at ICU admission as the risk factors.11,12

Conclusion

The major objective of this study is to identify the risk factors associated with medical and surgical ICU mortality. In order to identify the risk factors without subjective bias, we considered different variable selection methods and recommended the method that had the lowest misclassification error rate. The variable selection methods considered in this study were manually picked best model, forward selection, backward elimination and least absolute shrinkage and selection operator (LASSO). Cross validation and validation set approach are applied to the final model of manually picked best model, forward selection, backward elimination, and conducted both validation set and 5-fold cross validation on LASSO. Validation set and 5-fold cross validation approaches allow us to calculate the misclassification error rates for each method and finalize the decision by choosing the model with the lowest misclassification error rate. The procedure determines a reliable model that would identify the risk factors associated with ICU mortality, in an objective manner.

From the results obtained in this study we recommend LASSO (with applying 5-fold cross validation) as the best model that identifies the risk factors associated with the ICU mortality since it has the lowest error rate (14.5%). The model identified cancer part of present problem, previous admission to an ICU within 6 months, type of admission, PH from initial blood gases, PCO2 from initial blood gases, and level of consciousness at ICU admission as the risk factors. One limitation of this study is that the methodology is applied to a limited publicly available data on ICU mortality from a single hospital. In order to confirm the results of this study an elaborative study on ICU mortality should be performed on a randomly selected hospitals throughout the country.

Acknowledgements

None.

Conflicts of interest

None.

References

  1. https://www.umass.edu/statdata/statdata/data
  2. Morandi A, Jackson JC, Wesley Ely E. Delirium in the intensive care unit. International Review of Psychiatry. 2009;21(1):43‒58.
  3. Girard TD, Pandharipande PP, Ely EW. Delirium in the intensive care unit. Critical Care. 2008;12(S3).
  4. Belayachi J, Dendane T, Madani N, et al. Factors predicting mortality in elderly patients admitted to a Moroccan medical intensive care unit. Southern African Journal of Critical Care. 2012;28(1):22‒27.
  5. Lima EQ, Zanetta DMT, Castro I, et al. Mortality risk factors and validation of severity scoring systems in critically ill patients with acute renal failure. Renal failure. 2005;27(5):547‒556.
  6. Iwuafor AA, Ogunsola FT, Oladele RO, et al. Incidence, Clinical Outcome and Risk Factors of Intensive Care Unit Infections in the Lagos University Teaching Hospital (LUTH), Lagos, Nigeria. PLoS One. 2016;11(10):e0165242.
  7. Bursac Z, Gauss CH, Williams DK, et al. Purposeful selection of variables in logistic regression.Source code for biology and medicine. 2008;3(1):17.
  8. Hosmer DW, Lemeshow S, Sturdivant RX. Applied Logistic Regression: Third Edition. 2013;528.
  9. http://www.mayoclinic.org/healthy‒lifestyle/fitness/expert‒answers/heart‒rate/faq‒20057979
  10. https://www.cdc.gov/bloodpressure/measure.htm
  11. Leong IYO, Tai DYH. Is Increasing Age Associated with Mortality in the Critically iII Elderly. Singapore Med J. 2002;43(1):33‒36.
  12. Toufen C, Franca SA, Okamoto VN, et al. Infection as an independent risk factor for mortality in the surgical intensive care unit. Clinics. 2013;68(8):1103‒1108.
Creative Commons Attribution License

©2017 Alghamdi, et al. This is an open access article distributed under the terms of the, which permits unrestricted use, distribution, and build upon your work non-commercially.