Patient length of stay and mortality prediction: A survey

Abstract

Over the past few years, there has been increased interest in data mining and machine learning methods to improve hospital performance, in particular hospitals want to improve their intensive care unit statistics by reducing the number of patients dying inside the intensive care unit. Research has focused on prediction of measurable outcomes, including risk of complications, mortality and length of hospital stay. The length of stay is an important metric both for healthcare providers and patients, influenced by numerous factors. In particular, the length of stay in critical care is of great significance, both to patient experience and the cost of care, and is influenced by factors specific to the highly complex environment of the intensive care unit. The length of stay is often used as a surrogate for other outcomes, where those outcomes cannot be measured; for example as a surrogate for hospital or intensive care unit mortality. The length of stay is also a parameter, which has been used to identify the severity of illnesses and healthcare resource utilisation. This paper examines a range of length of stay and mortality prediction applications in acute medicine and the critical care unit. It also focuses on the methods of analysing length of stay and mortality prediction. Moreover, the paper provides a classification and evaluation for the analytical methods of the length of stay and mortality prediction associated with a grouping of relevant research papers published in the years 1984 to 2016 related to the domain of survival analysis. In addition, the paper highlights some of the gaps and challenges of the domain.

Keywords

critical care data-driven approach length of stay prediction mortality prediction multi-stage models statistical methods

Introduction

Healthcare expenditure constitutes a significant share of the gross domestic product (GDP) of many countries. For example, in 2012 healthcare spending in the UK reached nearly a tenth (9.3%) of GDP.^1,2 Government funding in many countries has fallen behind patient care costs, leaving healthcare institutions to face the growing number of patients.³ Accordingly, cost containment has become one of the most critical challenges in healthcare today. Hospitalisation constitutes the principal cost of patient care and is therefore a main focus in healthcare management.^4,5

Patient hospital length of stay typically refers to the number of days that an inpatient stays in a healthcare facility during a single admission.⁶ It is considered one of the major indicators for the consumption of hospital resources.^7,8 It also provides a better understanding of the flow of patients through a healthcare system which is essential for evaluating both the operational and clinical functions of such systems. Previous research has attempted to group patients by their medical condition, assuming that each disease, illness, or procedure is associated with a recommended length of stay (LOS).⁹ Grubinger et al. in 2010 refer to these systems as diagnosis-related-group (DRG) systems.¹⁰ In addition, a relative value, namely case mix index (CMI) can be assigned to a DRG of patients in a medical care environment used in determining the allocation of resources to care for and/or treat the patients in the group. However, both studies assumed that all patients who fall within the same diagnosis-related-group are the same. However, the LOS is a complex metric affected by other factors including each individual’s demographics, treatment complexity, complications and discharge planning which may stretch the LOS beyond the target range.

A model that helps to predict a patient's LOS during a single visit – the time from hospital admission until discharge – can be an effective tool for health care providers to plan for preventive interventions and to improve the utilisation of hospital resources.¹¹ Moreover, usually caregivers maintain an overall assessment of their patients based on important observations and trends over the first few days of admission. Some research demonstrates a strong correlation between LOS and mortality;^12,13 however, other experts/ intensivists consider LOS highly unregarded as a determinant of mortality as it is subject to influences, which may not bear upon the outcome of real interest. For example, a diagnosis which simply requires a prolonged period of hospital care, but which confers a low risk of hospital mortality, would bias the use of LOS as a surrogate for mortality; a condition which requires a series of complex treatment interventions might prolong the stay, without necessarily conferring a high mortality risk. Also, conversely, presentations of high severity of illness scores, as quantified by acute physiology and chronic health evaluation (APACHE) II,¹⁴ Intensive Care National Audit and Research Centre (ICNARC) score,¹⁵ Sequential Organ Failure Assessment (SOFA),¹⁶ or mortality prediction model (MPM),¹⁷ might be associated with a short LOS, because of an early decease, but also a high mortality, further undermining the correlation between LOS and mortality. In contrast, the work by Vincent and Singer linked LOS to mortality.^12,13 Results showed significantly greater intensive care unit (ICU), hospital and long-term mortality in patients with an ICU stay longer than than 3 days, in comparison with those who have a stay of 3 days or less.

This paper reviews LOS and mortality applications in acute medicine and critical care units and the correlation among them. Moreover, the paper classifies and evaluates the analytical methods available in the literature over the past three decades. In addition, the paper highlights some of the gaps and challenges of the domain.

The rest of this paper is organised as follows; the ‘Applications of the LOS’ section provides a survey of the different applications of the LOS in the health care domain. The ‘Analytical methods for LOS prediction’ section provides a classification of the analytical methods for LOS prediction associated with a grouping of relevant research papers published in the domain. The ‘Applications in mortality prediction’ section provides a survey of the different applications in mortality prediction, the ‘Methods to mortality prediction’ section provides a classification for the methods of mortality prediction, the ‘Applications in concurrent prediction of LOS and mortality’ section provides some examples of applications in previous literature that predicts both the LOS and mortality, and the ‘Measuring the performance of mortality and LOS prediction models’ discusses performance evaluation of the different LOS and mortality prediction models. Finally, the conclusion is drawn in the ‘Conclusion’ section.

Applications of the LOS

LOS is often used as a surrogate for other outcomes in research, where those outcomes cannot be measured; for example as a surrogate for hospital mortality or ICU mortality. LOS is also a parameter, which has been used to identify the severity of illnesses and healthcare resource utilisation.^18–20 As a surrogate outcome measure it is sometimes not highly regarded, as it is subject to influences, which may not bear upon the outcome of real interest. For example, a diagnosis which simply requires a prolonged period of hospital care, but which confers a low risk of hospital mortality, would bias the use of LOS as a surrogate for mortality; a condition which requires a series of complex treatment interventions might prolong the stay, without necessarily conferring a high mortality risk. Also, conversely, presentations of high severity of illness scores, as quantified by APACHE II,¹⁴ ICNARC score,¹⁵ SOFA,¹⁶ or MPM,¹⁷ might be associated with a short LOS, because of an early decease, but also a high mortality, further undermining the correlation between LOS and mortality.

Moreover, the LOS in a hospital may be affected by factors unrelated to the disease, such as the availability of social care or community nursing support. There is an analogous effect upon discharge from critical care to the ward, in the event that there are insufficient ward beds for timely ICU discharge.^21,22 Finally, LOS may also be influenced by characteristics of the organisation including hospital management style.^23,24

As LOS is an important determinant of both healthcare costs and patient experience, it is a high priority for it to be optimal; therefore it is also significant to identify any factors which affect it. The following two subsections will examine LOS applications in acute medicine and critical care, highlighting factors affecting LOS prediction.

LOS in acute medicine

This section presents previous studies on the different applications used in modelling LOS and its association with influencing factors with respect to patient flow. Patient flow typically refers to the progressive movement of a patient through a sequence of processes.²⁵ Reducing delays and making sure that the patient receives the right care at the right time will have a significant beneficial effect on the quality of service. In turn, this will improve patient outcomes and reduce the cost of care.

In 2012, Freitas et al. studied variables associated with high LOS outliers, together with some hospital characteristics (administrative, economic and teaching characteristics).²⁶ Results show that age, type of admission and hospital type were significantly associated with high LOS outliers. Moreover the study conducted by Caetano et al. showed that the top three influential input attributes were the hospital episode type, the physical service where the patient is hospitalised and the associated medical speciality.²⁷ However, hospital related factors on their own are not sufficient to accurately predict the LOS.

An important variable associated with LOS prediction and common is several studies is the nutritional status of a patient prior to admission. Previous research has examined the effect of the variable malnutrition on patient LOS.^28–33 In the study of Robinson et al. in 1987,²⁸ on average LOS was 15.6 days for a malnourished patient group versus 10 days for the well nourished group. However, in 1997 Chima et al. showed that the LOS for the two patient groups was six days for the at-risk for malnutrition population and four days for the not-at-risk for malnutrition population.²⁹ The significant decrease in LOS may reflect the fact that all health institutions are under pressures of payment and reviews by government and other third-party payers. In addition, according to Correia et al.,³⁰ the length of hospital stay is shorter in the well-nourished patients, with a median of six days versus nine days for the malnourished. Warnold and Lundhon,³² studied the clinical significance of preoperative nutritional status in 215 non-cancer patients. The variables investigated included weight loss, weight-for-height index, serum protein levels (serum albumin, transferrin, prealbumin, retinol-binding protein), delayed hypersensitivity skin testing, arm circumference and triceps skinfold thickness. Of the markers evaluated, weight-for-height index, arm muscle circumference, serum albumin level and weight loss correlated significantly to the post-surgery outcome. In addition, Epstein et al. also emphasised that underweight patients have 40% higher LOS than normal weight patients.³³ Also according to Burritt et al.,³¹ a low serum albumin level is the most sensitive single nutrition-related variable in the prediction of complications and LOS.

Another important variable in a different clinical domain that was also associated with an increase in LOS was serum creatinine (SCr). Chertow et al.³⁴ evaluated the marginal effects of acute kidney injury (AKI) on mortality, LOS, and costs. Changes in serum creatinine (SCr) was used as a determinant for adverse outcomes. Results show that AKI was consistently associated with an independent increase in LOS. Larger increases in SCr were associated with longer relative increases in hospital LOS.

LOS in critical care

There are significant potential benefits from quantification and optimisation of LOS in critical care: Specifically, these relate to cost containment and clinical quality. The provision of critical care is of necessity expensive, deploying complex interventions and requiring a high intensity of clinician input to a relatively small group of patients. Greater LOS requires more critical care resource and greater cost. As critical care facilities experience increasing pressure and economic resources are more constrained, the priority given to improvements in the timeliness and efficiency of critical care, is rising.³⁵ Clinical quality in the critical care unit may also be affected by extended LOS. Prolonged LOS gives rise to capacity pressure; this may lead to the cancellation of elective surgery, which is both costly and harmful; it may increase the pressure to decline or delay emergency admission, which could potentially have an adverse effect upon outcome; it may dilute the attention given to the most seriously sick individuals.³⁶

The critical care unit is also an environment which is well suited to exploiting data for mathematical modelling and prediction, both because of analytical experience and data availability. There are well developed methodologies for performance benchmarking. The increasing use of electronic clinical information systems, means that computer analysis can now be performed directly on the patient record, rather than after specific-to-purpose hand data extraction; the physiological and laboratory data sets are relatively large by comparison with other patient groups.

In England, Wales and Northern Ireland, the benchmarking of critical care unit performance is conducted by the intensive care national audit and research centre (ICNARC), by means of its case mix programme (CMP).¹⁵ The CMP uses rigorous methods to ensure data are complete, valid and reliable;^37,38 admissions are scored for severity using an in house scoring system and also the APACHE II model, and then a predicted hospital mortality for admissions is calculated. Comparison is made with actual mortality and a standardised mortality ratio is generated quarterly.^37,38 Another example of a non-commercial database of this kind is that held by the Australia and New Zealand intensive care society, which contains data on over 900,000 ICU stays.³⁹

Several research groups have investigated LOS in the ICU as it has been felt to be a suitable target for improvement.^12,13 LOS has been linked to mortality; research in 2006 showed significantly greater ICU, hospital and long-term mortality in patients with an ICU stay longer than than three days, in comparison with those who have a stay of three days or less. Others have sought to develop models which predict LOS; Buchman et al predicted chronicity in a surgical intensive care unit by classifying patients LOS in accordance with a seven day norm.³⁸ Levin et al. developed a model to produce real-time, updated forecasts of patients intensive care LOS using naturally generated provider orders.⁴⁰ The model was designed to be integrated within a computerised decision support system to improve patient flow management. The study compared the predicted LOS to the actual LOS based on fixed variables, such as age, source of admission and readmission status; temporal variables, such as current LOS, day of the week, time of the day, and order-based predictor variables grouped by medication, ventilation, laboratory, diet, activity, foreign body and extra-corporeal membrane oxygenation.

LOS prediction would help with capacity planning. At present, LOS prediction tools are not used in mainstream critical care practice. Surges in demand are managed reactively, requiring considerable staffing flexibility and variability in the balance between demand and capacity. It is possible that accurate prediction of LOS would help to align these quantities in critical care, and improve resource allocation, in particular staffing resource. According to Celi et al,³⁶ healthcare delivery has worked as well as it has to date because clinicians are bright, hard-working and well-intentioned, not because systems are well designed nor data systematically harnessed.

It follows that the presence of complete, highly detailed critical care databases is essential, if the potential benefits of modelling and prediction are to be fully realised.^12,41 Several commercial ICU databases have been developed, archiving patient demographics and aggregating information such as underlying disease, severity of illness and hospital-specific information such as LOS, mortality and readmission. For example, among the commercial ICU databases is APACHE Outcomes, created at Cerner by merging APACHE with Project IMPACT,^14,42 and includes data from about 150,000 ICU stays since 2010. The commercial Philips eICU, a telemedicine intensive care support provider, archives data from participating ICUs; Philips eICU is estimated to maintain a database of over 1.5 million ICU stays, and is adding 400,000 patient records per year from over 180 subscribing hospitals in the US. More ambitious still is the multiparameter intelligent monitoring in intensive care (MIMIC) II database established in October 2003. Developed by an interdisciplinary team from academia (MIT), industry (Philips Medical Systems) and clinical medicine (Beth Israel Deaconess Medical Center), the database incorporates two different types of medical data: Clinical data is stored in a relational database and bedside monitoring is stored in flat binary files. There are over 25,000 patients in the MIMIC II relational database, which permits the systematic capture, analysis and integration of information contained within the massive quantity of data generated by each critical care admission. Clearly these kinds of datasets and the several listed earlier could be used to investigate LOS as they provide well-structured high quality data. They could also be exploited for broader research activity.

Analytical methods for LOS prediction

This section explores the methods used in the field of calculating and predicting patient LOS. After surveying the previous literature, LOS prediction methods were categorised into four subgroups as shown in Figure 1. A classification of the reviewed papers based on LOS prediction methods is shown in Table 1.

Figure 1

Classification of LOS & mortality prediction methods.

Table 1

A summary of research papers grouped by analytical methods for LOS and mortality prediction.

	Method	References	Applications	Common evaluation methods
Length of stay	Data-driven & data mining	5,10,27,35,46, 61,63,72,76	Stroke unit, intensive medicine, appendectomy, general	Sensitivity, specificity, harmonic measure (F-measure, precision and recall average) and RBF kernel
	Statistical methods	26,29,30,34,40, 44,134	Acute care, intensive care, stroke patients, ICU cardiac surgery, nutrition and acute kidney	Mean, median, std. deviation, skewness, kurtosis, minmax, 25th percentile, 75th percentile, confidence interval and area under the receiver-operating characteristic curve (AUROC).
	Multi-stage methods	49,60,86	Healthcare of elderly patients, obstetric unit (pregnancy and childbirth) and emergency departments	Sensitivity analysis, simulation models, generalized Erlang, hyperexponential and Coxian.
Mortality prediction	Scoring systems	12,87,88,93, 109–124	Critical care	Confidence interval and area under the receiver-operating characteristic curve (AUROC)
	Data mining	92,93,95–97, 105–108, 125–127	Critical care	Confidence interval and area under the receiver-operating characteristic curve (AUROC)
LOS + Mortality	Arithmetic, statistical (regression & scoring systems) & data mining methods	128–131	Acute medicine, critical care (trauma & surgical patients)	Mean, confidence interval, AUROC, C-statistic

Arithmetic and statistical approaches to LOS prediction

Despite the complex nature of the metric LOS, simple arithmetic methods still exist for the calculation of LOS.⁴³ Arithmetic methods usually compute the average length of stay or the median as shown. However, this is a very simple way to measure the LOS as it assumes that the LOS is normally distributed, typically the LOS has an exponential distribution. Also Vasilakis et al. 2003 illustrates how average LOS can be a misleading measure;⁴⁴ the research proposes alternative statistical techniques survival analysis, on stroke patients, aging 65 years and over. Survival analysis is a branch of statistics that typically uses LOS data to study the effect of different patient attributes on survival time.⁴⁵

In addition, Figure 1 highlights a special type of statistical method which includes the analysis of covariates regression analysis. Covariates are defined in the context of LOS as the patient’s characteristics and external factors which possibly predict LOS. Within this type are found linear regression and logistic regression, which is a special case of survival models.⁴³ The models developed often include the patient’s diagnoses, procedures, gender and age.^26,46 Moreover, Freitas et al. used regression models to examine the association of some administrative variables from inpatient episodes in public acute care hospitals in the Portuguese National Health Service with high LOS outliers.²⁶ The variables include year of discharge, comorbidities, age, adjacent DRG complexity (ADRG), readmission, admission and DRG type, discharge status, distance from residence to hospital and hospital type. Results show that age, type of admission and hospital type were significantly associated with high LOS outliers.

A hospital is a complex stochastic system, therefore simple deterministic approaches for planning and managing such a system is considered inadequate to provide a complete and accurate analysis;^4,47,48 also, the resulting models which are mostly based on simple rules modelled with regression trees, are usually further adjusted manually according to medical knowledge, decreasing the predictive accuracy of successive models.^49–52 Grubinger et al. argue that any minor change in the data of such simple models can lead to a completely different tree, although all of these trees can be statistically accurate.¹⁰ As a result, the work presented in their research used the bootstrap-based model method bumping to build diverse regression tree models through systematic re-sampling (uniform randomness) of the data.⁵³ Bootstrap methods are most commonly based on the idea of combining and averaging models to reduce prediction error. Examples of such methods include bagging,⁵³ boosting,⁵⁴ and random forests.⁵⁵

A data-driven approach,^56–59 which will be discussed thoroughly in the ‘A data-driven approach to LOS prediction’ subsection, can be used to predict which patients seem likely to experience an extended LOS by analysing survival data using decision trees (also called survival trees), artificial neural networks, ensemble methods, etc. Usually these approaches are used to predict categorical survival outcomes (dead or alive) for a given set of patient attributes, or used to measure patient length of stay above or below a certain threshold.

In contrast to the data-driven approaches listed above,^56–59 Caetano et al. do not perform a classification task to LOS,²⁷ instead a more information pure regression approach is adopted which predicts the actual number of LOS days and not classes. The study describes 14 input covariates to the LOS target variable. Six regression techniques were tested and compared: Average prediction (AP), multiple regression (MP), decision trees (DTs), artificial neural network (ANN) ensemble, support vector machines (SVMs) and random forests (RFs). The best results were obtained by the RF model to reveal high impact of inpatient clinical process attributes, instead of the patient’s characteristics. Effective predictions can aid healthcare institutions and clinicians to improve their decisions about patient managements and resource allocations.^60–63

Despite such attempts, Marshall et al. and Garg et al. argue that data-driven methods among the other statistical models fail to address the inherent uncertainty, complexity and heterogeneity in health processes.^5,7 To address such issues, a more reliable way is to model patient flow as it presents the temporal dimension as well as the structural dimension of the system.⁷ Numerous probabilistic models have been proposed to address the issue of LOS, namely Markov models, phase-type distributions, conditional phase-type distributions, compartmental and simulation modelling.^{5,46,49,60,64} Such models may be used for planning health services for both acute and chronic patients. These models will be discussed thoroughly in the ‘Markov model and phase-type distributions’, ‘Compartmental modelling’ and ‘Simulation modelling’ subsections.

A data-driven approach to LOS prediction

Whereas most previous research examines LOS numerically,^65–67 several studies take a data-driven approach to LOS prediction. A data-driven approach refers to a predictive model that is based on data-mining techniques, such as classification, clustering, etc. Such techniques are used to discover useful patterns in large datasets by showing novel and interesting relationships among data variables. Data mining techniques facilitate the creation of knowledge and support clinical decision making, in what is known as medical data mining.^68,69

The data-driven approach classification is used to generate early alerts with respect to a target LOS range for a specific diagnosis related group (DRG). For example, Buchman et al. predict chronicity in a surgical intensive care unit by classifying patients LOS in accordance with a recommended seven-day norm.⁷⁰ In response to the need for effective resource planning and cost containment, Mobley et al. predict the LOS of patients receiving post-coronary care over the range of 1–120 days.⁶⁶ Frye et al. use a technique to predict whether the LOS of patients suffering from burns will fall within a one-week period.⁷¹ Cheng et al. in 2009 introduce a study that examines the LOS management of appendectomy patients by building and empirically evaluating an automatic prediction system to identify those patients whose LOS will likely exceed the recommended five-day period.⁷²

Hachesu et al. apply three classification algorithms namely, DT, SVMs and ANN to draw an accurate model to predict the LOS of heart patients.⁶¹ To predict the target variable LOS 36 input variables were used. The findings demonstrated that the SVM was the best fit. There was a significant tendency for LOS to be longer in patients with lung or respiratory disorders and high blood pressure. One of the interesting findings was that most single patients (64.3%) had a LOS less than or equal to five days, whereas 41.2% of married patients had a LOS greater than 10 days. The most significant variables affecting LOS were drug categories, such as nitrates and anticoagulants as well as coronary artery disease (CAD) diagnosis. Comorbidity is also a strong predictor of prolonged LOS. Comorbidity is the presence of one or more additional diseases or disorders co-occurring with a primary disease or disorder. There was a significant tendency for LOS to be longer in patients with lung or respiratory disorders and high blood pressure. Gender was significant in predicting LOS, since men had longer LOS than women. Age played a notable role as well since analysis revealed that patients aged less than 50 and greater than or equal 80 statistically had increased mean LOS.

Rowan et al. implemented a software package demonstrating that artificial neural networks (ANNs) could be used as an effective LOS stratification instrument in postoperative cardiac patients.⁷³ In the work by Azari et al.,¹¹ an approach for predicting hospital length of stay using a multi-tiered data mining approach is proposed. They form training sets, using groups of similar claims identified by k-means clustering and perform classification using 10 different classifiers. They consistently found that using clustering as a precursor to form the training set gives better prediction results as compared to non-clustering based training sets. Binning the LOS to three groups of short, medium and long stays, their method identifies patients who need aggressive or moderate early interventions to prevent prolonged stays.

Liu et al.⁶³ applied two classifiers: DT C4.5 & its successor R-C4.5s, naive Bayesian classifier (NBC) and its successor NBCs to a geriatric hospital dataset, called Clinics Dataset, containing 4722 patient records including patient demographic details, admission reasons, discharge details, outcome and LOS, to predict inpatient LOS for long stay patients. According to Lim et al.,⁷⁴ C4.5 is one of the classifiers, which has the best combinations in terms of error rate and speed. Also, R-C4.5s combines branches with little classification contribution and thus resulted in building more robust and smaller trees.⁷⁵ In addition, NBC is robust and insensitive to missing data as stated in the work of Liu et al.⁶³

In addition, phase-type survival trees and mixed distribution survival trees are used to cluster stroke-related patients into clinically meaningful groups with respect to LOS where partitioning is based on covariates, such as gender, age at time of admission, primary diagnosis code, treatment outcome and discharge destination.^5,46 Moreover, Kudyba et al. in 2010 utilise the method of neural networks to analyze data describing inpatient cases to examine the effect of the independent variables of patient demographics, primary payer, admission and discharge dates, physician specialty, and detailed radiology procedural variables (including the sum of radiology hours) on the dependent variable of length of stay excess per patient case for a major New Jersey based healthcare provider.⁷⁶ Also, ANNs, DTs and ensemble methods are used in developing an intelligent decision support system, INTCare, for intensive medicine in the ICU of the hospital Santo Antonio (HAS) in Porto, Portugal.³⁵ In addition, the bootstrap-based method bumping is used by Grubinger et al. to build diverse and more accurate regression tree models for DRG systems in Austria.¹⁰ Eight datasets are used consisting of patient’s main diagnosis, secondary diagnoses, procedures, number of diagnoses, number of procedures, gender and age as well as patients’ LOS.

Markov model and phase-type distributions

Markov and semi-Markov chain models are models that assume sub-groups of patients are homogeneous and events occur at equally spaced intervals of time; queueing models and deterministic models of the transition of patients between states. These techniques are useful for examining patient flow in large population groups where Markov assumptions can be made.²⁵ Phase-type distributions describe the time to absorption of a finite Markov chain in continuous time when there is a single absorbing state and the stochastic process starts in a transient state.⁴³

The first probabilistic approach describes a special type of Markov model known as the Coxian phase-type distribution and its further development into the conditional phase-type distribution. The Coxian phase-type distribution, allows the representation of the continuous duration of stay of patients in hospital as a series of sequential phases, which the patients progress through until they leave the hospital completely.⁷⁷

It is possible to expand the theory of Coxian phase-type distributions to include a network of additional interrelated variables (such as patient characteristics) that may interact to influence patient LOS conditional phase-type distribution. This approach allows the incorporation of discrete and continuous variables representing causality. Marshall et al. uses conditional phase-type distribution to model the LOS of elderly patients in hospital.^7,78–80 The approach illustrates data on hospital processes for a number of geriatric patients along with personal details, admissions reasons, dependency levels and destination (the causal network). The final model represents patient LOS in terms of five of the most significant patient variables in the dataset, namely patient age, gender, admission method into hospital, Barthel grade (dependency score) and destination on departure from hospital.

Compartmental modelling

The second general approach described is the compartmental model. Compartmental modelling of patient flow is a type of mathematical model used for describing the way patients are transmitted among the compartments of a healthcare system. Each compartment is assumed to be a homogeneous entity within which the entities being modelled are equivalent. For instance, in a pharmaceutical model, the compartments may represent different sections of a body within which the concentration of a drug is assumed to be uniformly equal. Another example, in a healthcare facility, the compartments may represent the different stages that patient goes through- acute, long-stay and death.

Haigeng Xie et al. present a model-based approach to extract from an administrative social care dataset, high-level length of stay patterns of residents in long-term care (LTC).⁸¹ A continuous-time Markov model, a residents stay in both residence care (RC) and nursing care (NC) is modelled as consisting of a short-stay and a long-stay phase, was used to show the flow of residents within and between RC and NC, as well as discharge from RC and NC. The model has been extended to incorporate residents’ features, such as gender. The final model showed that gender has a significant influence on transition rates.

Irvine et al. describes the development of a two-stage continuous-time Markov model that describes the movement of patients through geriatric hospitals.⁸² Patients are initially admitted to the acute state from which they transfer to the long-stay state or leave the hospital completely through discharge or death state. McClean et al. extends the stochastic Markov model presented in the work by Irvine et al. to a three-stage one and attaches different costs to each stage thus taking cost into account.^82,83 Taylor et al. use a continuous time Markov model and apply it to the case of a four compartmental model, where the four stages are acute, long-stay, community and dead.⁸⁴ The model estimates the expected number of patients at any time t in each stage. Taylor et al. extend these models to contain six stages.⁸⁵ Garg et al. proposed a novel distribution, multi-absorbing state phase-type distribution, as a generalisation of the single absorbing state Coxian phase-type distribution for representing a Markov process having more than one absorbing state.⁵ The approach effectively forecasts the bed requirements in a care unit considering the effect of several factors, such as patient demography – age and gender, as well as treatment outcome based on diagnosis and patient’s expected destination after discharge, which may also affect a patient’s LOS in hospital.

Simulation modelling

Simulation-based models simulate scenarios which replicate real life in an attempt to understand the complex health processes and their interactions.⁵ Vasilakis et al. illustrate how the average LOS can be a misleading measure.⁴⁴ The research proposes alternative statistical techniques, such as survival analysis, the application of mixed exponential and phase-type distributions demonstrated in two dynamic models of patient flow – compartmental model (small, medium and long stay) and discrete event simulation model, introducing capacity constraints in the various stages of the model, such as bed blockage and refuse-admission rates.

Griffin et al. developed a simulation model using a path-based approach for an obstetric unit to study tradeoffs in blocking and system efficiency.⁶⁰ The model focuses on patient flow, considering patient classification, blocking effects, time dependent arrival and departure patterns and statistically supported distributions for LOS. Moreover, the study conducted by Wang et al. in the emergency department at a community hospital, Entral Baptist Hospital in Lexington, KY, uses a discrete-event simulation model to evaluate patient outcome, identify the impact of critical resources and procedures, conduct “What if” analysis for various staffing and operational scenarios and provide recommendations for hospital management.⁸⁶

Discrete event simulation models allow patients to have individual attributes and to interact with resource provision but they are more time consuming to test and run. They are particularly suitable for models of systems of patient care where the constraints on resource availability are important. They may also be used on unconstrained population models with several thousands of patients. A significant development in simulation is the facility to model entities so that they can participate in more than one activity simultaneously and interrupt each other. The credibility of any model is dependent on reliable data which are not always readily available in the British health service.²⁵

Applications in mortality prediction

The primary concern of any healthcare system is to relieve the patient symptoms, prevent complications and prolong the patient’s life. In order to achieve these goals, it is crucial in the ICU to provide the correct treatment and to predict clinical deterioration early enough so preventive or curative actions can be taken in time. Extensive bedside monitoring in hospital ICUs has resulted in complex data-intensive environment regarding patient physiology, which presents a rich context for clinical data analysis. The majority of mortality prediction research has focused on severity of illness scoring systems designed for risk estimation at 24 h after ICU admission or data-mining algorithms that help predict mortality. The following two subsections will illustrate the use of different scoring systems and data-mining algorithms in predicting mortality in critical care.

Scoring systems in critical care mortality prediction

A number of researchers have explored using daily severity of illness scores. In 1993, Le Gall et al. suggested that despite being too time-consuming for most ICUs, daily scores would be the most efficient way to evaluate the progression of the risk of death.⁸⁷ Rue et al. found that the mortality prediction on the current-day was the most informative – in fact, the mortality probability at admission and on previous days did not improve performance from the current days score.⁸⁸ The importance of the current-day mortality prediction that Rue et al. observed confirms Lemeshow et al.’s finding that the most important features change between the admission MPM model and the 24, 48 and 72 h MPM models. The logistic regression equation also changes between 24 h intervals to reflect an increasing probability of mortality.⁸⁹ From their observations, Lemeshow et al. make the general observation that a patient in the ICU with a “steady” clinical profile is actually getting worse. In addition, others have confirmed the usefulness of daily severity scores; Wagner et al. showed strong results looking at daily risk predictions based on the APACHE III score and several additional variables such as the primary reason for ICU admission and treatment before ICU admission. Wagner et al.’s study relied on over 17,440 patients from 40 US hospitals.⁹⁰

Data mining in critical care mortality prediction

Until today, a standard statistical method such as logistic regression used by the scoring systems has been well received by critical care professionals to predict the risk of mortality or adverse events for patients with critical illnesses or injuries admitted to an ICU. Despite warnings from many of the original researchers and several studies,⁹¹ many caregivers have come to expect the availability of a severity score to assist them in treating individual patients. However, these predictions are not accurate enough for individual patients and no tools exist to reliably predict an individual patient’s progress on a critical care condition in a timely manner.^92,93 As a result, local customised mortality prediction models could perform better as compared to the corresponding current standard severity scoring system. The study conducted by Celi et al.⁹⁴ revealed better results for all three subsets of patients: Patients with acute kidney injury (AUC = 0.875 for ANN, vs. SAPS, AUC = 0.642), patients with subarachnoid hemorrhage (AUC = 0.958 for BN, vs. SAPS, AUC = 0.84) and elderly patients undergoing open heart surgery (AUC = 0.94 for ANN, vs. EuroSCORE, AUC = 0.648). Moreover, studies performed in some research concluded that more flexible nonparametric approaches based on data mining techniques, such as ANN, SVMs and DTs, might perform at least as well, if not better, than standard logistic regression in ICU mortality prediction.^{92,93,95–104} Also the use of untransformed explanatory variables resulted in better results than those transformed using scores/weights.⁹³

In 1996, Dybowski et al.,⁹⁵ reported a significantly improved area under the receiver-operating characteristic curve (AUROC) using artificial neural networks as compared to standard logistic regression. Also research in the work by Nimgaonkar and Sudarshan reported better performance of ANN over APACHE III.¹⁰⁵ However, other research found that logistic regression and neural networks performed similarly for ICU mortality prediction.^96,106,107 Such conflicting results on the performance of different prediction tools reveal that no single algorithm invariably outperforms all others; it depends on the underlying population being tested, the set of explanatory variables available and the outcome of interest. Contradicting results were reported for other techniques as well. For instance, in 2011 Ribas et al. showed that the use of SVMs resulted in increased prediction accuracy as compared to the APACHE II score.⁹⁷ Likewise, the study conducted by Kim et al.⁹² compared the predictive accuracy of ANN, SVM and DT derived from the University of Kentucky Hospital’s ICU patients’ data with the APACHE III scoring system. Results showed that the best performing model is the Clementine’s C5.0 algorithm (DT) followed by SVM, APACHE III and ANN.

As mentioned earlier, there is no single algorithm that outperforms others; it depends on the population of interest, the variables measured and the outcome being tested. However, some models reveal strengths over others in certain aspects. For example, the major advantage for the use of DTs over other models lies in its descriptive modelling as it explains hidden clinical implications unlike ANNs which lacks logic between input and output nodes. From another perspective, DT, RF, ANN, Bayesian networks and kernel methods such as SVM can handle large size data samples and integrate background knowledge into analysis.¹⁰⁸

Methods to mortality prediction

This section explores the methods used in predicting patient mortality in critical care. After surveying the previous literature, critical care mortality prediction methods were categorised into two subgroups as shown in Figure 1. A classification of the reviewed papers based on mortality prediction methods is shown in Table 1.

Scoring systems for mortality prediction

Scoring systems can be divided into two categories, those that assess disease severity on admission and use it to predict outcome, for example, acute physiology and chronic health evaluation (APACHE),¹⁰⁹ simplified acute physiology score (SAPS),⁸⁷ mortality probability model (MPM),¹¹⁰ and those scores that assess the presence and severity of organ dysfunction, for example, sequential organ failure assessment (SOFA).¹¹¹ The SOFA score is limited to six organs by looking at respiration, coagulation, liver, cardiovascular, central nervous system, and renal measurements. For each organ, the score provides an assessment of derangement between 0 (normal) and 4 (highly deranged).

Several works in the literature have discussed and compared mortality prediction models in intensive care that rely on a panel of experts or statistical models, namely logistic regression.^{12,87,109,110,112–115} Scoring systems, such as APACHE and SAPS assess disease severity on admission and use it to predict outcome.^87,109 The objective of these models is to compare groups of patients and characterise disease severity from patient demographics and physiological variables obtained within the first 24 hours after ICU admission. However, despite their simplicity it is claimed that these models are not reliable enough for prediction of individual patients since they provide a value that can be averaged for a group of patients.¹¹⁶ To this day, SAPS II and APACHE II remain the most widely used scores in clinical practice despite attempts for their modification,^{12,87,93,109,115} specifically tailored for other populations, such as France, Southern Europe and Mediterranean countries, and to Central and Western Europe.^117–120

In 1985, the original model of the APACHE scoring system (1981) was revised and simplified to create APACHE II,¹²¹ now the worlds most widely used severity of illness score.¹² The score relies on a panel of experts for variable selection and weights. In APACHE II, there are just 12 physiological variables, compared to 34 in the original score. The effects of age and chronic health status are incorporated directly into the model, weighted according to their relative impact, to give a single score with a maximum of 71. The worst value recorded during the first 24 h of a patients admission to the ICU is used for each physiological variable. The score is not recalculated during the stay; it is by definition an admission score. If a patient is discharged from the ICU and readmitted, a new APACHE II score is calculated. However, many researchers have validated the use of severity of illness scores in settings that deviate from their original design. Alternative settings have included populations such as coronary care patients or subarachnoid hemorrhage patients or days subsequent to the initial 24 h after admission.^88,122–124 APACHE III was developed in 1991 and in 2002/2003 APACHE IV was developed, which provides length of stay prediction equations.^12,117 A more detailed comparison of the current scoring systems is available in the work by Vincent and Singer.¹²

Like the APACHE scores, SAPS was calculated from the worst values obtained during the first 24 h of ICU admission. In 1993, Le Gall et al. used logistic regression analysis to develop SAPS II, which includes 17 variables: 12 physiological variables, age, type of admission and three variables related to underlying disease.⁸⁷ The SAPS II score was validated using data from consecutive admissions to 137 ICUs in 12 countries⁸⁷. Research by Le Gall et al. introduced an expanded SAPS II by adding six admission variables: Age, gender, length of pre-ICU hospital stay, patient location before ICU, clinical category and whether drug overdose was present.¹¹⁸ Results show that the expanded SAPS II performed better than the original and a customised SAPS II, with an AUROC (area under the receiver operating characteristic curve) of 0.879. A study conducted by Gilani et al.,¹¹⁵ showed that the prognostic accuracy of APACHE II was excellent for (AUC: 0.828) score and acceptable for APACHE III (AUC: 0.782) and SAPS II (AUC: 0.778) scores. According to the clincial review conducted by Vincent et al.,¹² the different types of scores should be seen as complementary, rather than competitive and mutually exclusive.

Data mining techniques for mortality prediction

Various authors have advocated the use of machine learning techniques for predicting ICU mortality over the use of logistic regression methods. Research by Dybowski et al. and Nimgaonkar and Sudarshan have reported better performance of ANNs over logistic regression.^95,105 However, other research found that logistic regression and neural networks performed similarly for ICU mortality prediction.^96,106,107 Others found that DTs and SVMs performed better.^92,97–127 In 2011, Ribas et al. showed that the use of SVMs resulted in increased prediction accuracy as compared to the APACHE II score.⁹⁷ Likewise, the study conducted in the work by Kim et al. compared the predictive accuracy of ANN, SVM and DT derived from the University of Kentucky Hospital’s ICU patients’ data with the APACHE III scoring system.⁹² Results showed that the best performing model is the Clementine’s C5.0 algorithm (DT) followed by SVM, APACHE III and ANN. These results confirm earlier findings in work by Delen et al.,¹²⁶ which also reported that C5.0 was the best predictor with the highest accuracy of 93.6% in predicting breast cancer survivability. In addition, Crawford et al. concluded that a decision tree used in their study provided a clinically acceptable mining result in predicting susceptibility of prostate carcinoma patients at low risk for lymph node spread.¹²⁷ On the other hand, Ramon et al. reported that the AUCs of DT based algorithms (DT learning, 65%; first order RFs, 81%) yielded smaller areas compared to those of naive Bayesian networks (AUC, 85%) and tree-augmented naive Bayesian networks (AUC, 82%) in their study on a small dataset containing 1548 mechanically ventilated ICU patients.¹⁰² Similarly Pirracchio et al. reported that Bayesian additive regression trees (BARTs) is the best candidate when using transformed variables, while RFs outperformed all other candidates when using untransformed variables.⁹³ Other authors achieved improved mortality prediction using a method based on SVMs.¹²⁵

Such conflicting results on the performance of different prediction tools reveal that no single algorithm invariably outperforms all others; it depends on the population of interest, the variables measured and the outcome being tested. However, some models reveal strengths over others in certain aspects. For example, the major advantage for the use of DTs over other models lies in its descriptive modelling as it explains hidden clinical implications unlike ANNs which lacks logic between input and output nodes. From another perspective, DT, RF, ANN, Bayesian networks and kernel methods such as SVM can handle large size data samples and integrate background knowledge into analysis.¹⁰⁸

Applications in concurrent prediction of LOS and mortality

There are several works in the literature that handle both LOS and mortality prediction concurrently. The prediction models include arithmetic models, such as the mean and median; statistical models, such as regression analysis and data-driven models, such as Bayesian Network. The following are some examples of applications that attempt to concurrently predict patient LOS and mortality.

The work by Peterson et al. aims to assess the impact of the introduction of an early warning scoring system (SEWS) on physiological observations and patient outcomes in acute admissions at the point of entry to care.¹²⁸ The admission of the early warning score correlated both with in-hospital mortality (P < 0.001) and length of stay (P = 0.001). Moreover, Clark et al. developed a method for predicting concurrently both hospital survival and LOS for seriously ill patients from three trauma centres in Maine, with particular attention to the competing risks of death or discharge alive as determinants of LOS.¹²⁹ Poisson regression was used to develop a model for each type of terminal event, with risk factors on admission contributing proportionately to the subsequent rates for each outcome in each interval. Mean LOS and cumulative survival were calculated from a combination of the resulting piecewise exponential models.¹²⁹ Similarly, risk stratification indices (RSIs) for length of stay and mortality endpoints were derived from aggregate risk associated with individual diagnostic and procedure codes. Results showed that the RSI is a broadly applicable and robust system for assessing hospital LOS and mortality for groups of surgical patients based solely on administrative data.¹³⁰

Cai et al. built a Bayesian network model to estimate the probability of a hospitalised patient being “at home” in the hospital, or dead for each of the next seven days.¹³¹ Electronic health records from 32,634 patients admitted to a Sydney metropolitan hospital via the emergency department from July 2008 through December 2011 were used. The model achieved an average daily accuracy of 80% and AUROC of 0.82. The models predictive ability was highest within 24 h from prediction (AUROC = 0.83) and decreased slightly with time. Death was the most predictable outcome with a daily average accuracy of 93% and AUROC of 0.84.¹³¹

Measuring the performance of mortality and LOS prediction models

Mortality prediction is considered a binary classification problem where a classifier attempts to identify whether a patient will live or die. Evaluating classifier performance shows how well a method improves classification. Traditionally classification accuracy is used to measure classifier performance. The classification accuracy gives a good idea of classifier performance when the dataset is balanced, however when the dataset suffers from âclass imbalance (i.e the number of instances belonging to one class outnumbers that of any other class(es)) some problems emerge. For example, in a binary classification problem, if the majority class outnumbers the minority class 9:1, and all instances were classified as the majority class, the classifier would have an accuracy of 90%, despite 0% of the minority class being classified correctly. In a binary dataset, the classification accuracy shows the number of correctly classified minority instances (true positives), incorrectly classified minority instances (false positives), correctly classified majority instances (true negatives) and incorrectly classified majority instances (false negatives) as follows

ACC = \frac{TP + TN}{P + N}

(1)

For this reason, better evaluative measures that are independent of the class imbalance ratio and sufficiently recognises the minority class are preferred. One of such measures used in this paper is the geometric mean which calculates the geometric mean between the sensitivity or recall (true positive rate) and specificity (true negative rate)

Sensitivity = \frac{TP}{TP + FN}

(2)

Specificity = \frac{TN}{TN + FP}

(3)

The G-mean is an effective evaluative criterion as it is not dependent on the data distribution. The G-mean is defined as¹³²

G - mean = \sqrt{Sensitivity * Specificity}

(4)

It accentuates the balancing between the specificity and sensitivity while maximising the recognition between the minority and the majority class. Other evaluative measures used in the paper are precision (also called positive predictive value) which is the fraction of retrieved instances that are relevant. Sensitivity and precision can be combined using a metric known as the F-measure¹³³

Precision = \frac{TP}{TP + FN}

(5)

F - Measure = \frac{2 . Recall . Precision}{Recall + Precision}

(6)

In addition, the AUROC is also a widely used measure of classification performance in mortality prediction. It is a graphical plot that illustrates the performance of a binary classifier system. The curve is created by plotting the true positive rate (TPR) against the false positive rate (FPR) at various threshold settings.

There are several quantitative methods for measuring the performance of the different LOS prediction models. Measuring the performance of the statistical methods for LOS prediction include several approaches, such as the mean, median, standard deviation, kurtosis, min-max, confidence and AUROC are the most commonly used in literature.^{26,29,30,34,40,44,134} As for the data-driven and data mining methods for LOS prediction, there are several evaluation approaches in literature, such as sensitivity, specificity, F-measure or the harmonic mean of precision and recall.^{5,10,27,35,46,61,63,72,76} On the other hand, sensitivity analysis, simulation models, generalised Erlang, hyper-exponential and Coxian models are the most widely used in measuring the performance of multi-stage models for LOS prediction.^49,60,86

Conclusion

This paper has presented a comprehensive review of the methods and applications of LOS and mortality prediction in acute medicine and critical care.

An introduction to LOS theory and mortality prediction and the main drivers behind the interest in such research was given in the opening section. In addition, several applications for LOS and mortality prediction were demonstrated both in acute medicine and in the intensive care unit environment in particular, highlighting the challenges facing both physicians and information engineers today. However, only a small sample of such applications considered the prediction of both LOS and mortality concurrently. We see this as a limitation in our paper that could be enhanced in future work. An analysis of the various LOS and mortality prediction methods were presented and compared. Moreover, the paper provides a classification for some state-of-the-art literature based on each paper’s analytical method utilised to predict LOS and mortality. The four main categories include:

arithmetic methods;

statistical methods;

data-driven methods;

multi-stage methods.

The classification presents a brief summary of the analytical method, dataset and the evaluation method utilised. Given that LOS and mortality are relatively complex matrices as they are influenced by various external uncontrollable factors, there is no one good-for-all technique that serves its prediction. At present, in most cases several algorithms are tested, tweaked based on some domain knowledge or some performance criteria to enhance the accuracy of prediction. This is considered another limitation as there is no one reliable technique for prediction; it all depends on the situation at hand. However, it is clear that much research remains to be done, especially that the physiological and laboratory datasets in both acute medicine and the critical care environment are relatively large and well-structured in commercial and non-commercial databases.

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

References

Wang

McPherson

Marsh

. Health and economic burden of the projected obesity trends in the USA and the UK. Lancet 2011; 378(9793): 815–825.

Gori C and Di Maio A. European study of long-term care expenditure. PSSRU Bulletin 14. 2003.

Roberts

Marshall

Charlesworth

. A decade of austerity? The funding pressures facing the NHS from 2010/11 to 2021/22, London: Nuffield Trust, 2012.

Bain

Taylor

McDonnell

. Myths of ideal hospital occupancy. Med J Aust 2010; 193(5): 311–311.

Garg

McClean

Barton

. Intelligent patient management and resource planning for complex, heterogeneous, and stochastic healthcare systems. IEEE Trans Syst Man Cybern A Syst Hum 2012; 42(6): 1332–1345.

Huntley

Cho

Christman

. Predicting length of stay in an acute psychiatric hospital. Psychiatr Serv 1998.

Marshall

McClean

Shapcott

. Developing a Bayesian belief network for the management of geriatric hospital care. Health Care Manag Sci 2001; 4(1): 25–30.

Marshall

Vasilakis

El-Darzi

. Length of stay-based patient flow models: Recent developments and future directions. Health Care Manag Sci 2005; 8(3): 213–220.

Shea

Sideli

DuMouchel

. Computer-generated informational messages directed to physicians: Effect on length of hospital stay. J Amer Med Info Assoc 1995; 2(1): 58–64.

10.

Grubinger

Kobel

Pfeiffer

. Regression tree construction by bootstrap: Model search for DRG-systems applied to Austrian health-data. BMC Med Inform Decis Mak 2010; 10(1): 9–9.

11.

Azari

Janeja

Mohseni

. Healthcare data mining: Predicting hospital length of stay (PHLOS). International Journal of Knowledge Discovery in Bioinformatics (IJKDB) 2012; 3(3): 44–66.

12.

Vincent

Singer

. Critical care: Advances and future perspectives. Lancet 2010; 376(9749): 1354–1361.

13.

Vincent

. Is the current management of severe sepsis and septic shock really evidence based? PLoS Med 2006; 3(9): e346–e346.

14.

Abbott

Setter

Chan

. Apache ii: Prediction of outcome of 451 icu oncology admissions in a community hospital. Ann Oncol 1991; 2(8): 571–574.

15.

Harrison

Parry

Carpenter

. A new risk prediction model for critical care: The intensive care national audit & research centre (ICNARC) model*. Crit Care Med 2007; 35(4): 1091–1098.

16.

Vincent

Moreno

Takala

. The sofa (sepsis-related organ failure assessment) score to describe organ dysfunction/failure. Intensive Care Med 1996; 22(7): 707–710.

17.

Lemeshow

Teres

Klar

. Mortality probability models (mpm ii) based on an international cohort of intensive care unit patients. JAMA 1993; 270(20): 2478–2486.

18.

Lim

Tongkumchum

. Methods for analyzing hospital length of stay with application to inpatients dying in southern Thailand. Glob J Health Sci 2009; 1(1): 27–38.

19.

Chang

Tseng

Weng

. Prediction of length of stay of first-ever ischemic stroke. Stroke 2002; 33(11): 2670–2674.

20.

Jiang

Davis

. Using data mining to analyze patient discharge data for an urban hospital. In: DMIN 2010, pp. 139–144.

21.

Appelros

. Prediction of length of stay for stroke patients. Acta Neurol Scand 2007; 116(1): 15–19.

22.

Robinson

Davis

Leifer

. Prediction of hospital length of stay. Health Serv Res 1966; 1(3): 287–287.

23.

McMullan

Silke

Bennett

. Resource utilisation, length of hospital stay, and pattern of investigation during acute medical hospital admission. Postgrad Med J 2004; 80(939): 23–26.

24.

Vahidi

Kushavar

Khodayari

. Factors affecting coronary artery patients hospital length of stay of tabriz madani hospital 2005-2006. J Health Adm 2006; 9(25): 63–68.

25.

Davies

. Modelling patient flows and resource provision in health systems. Omega 1994; 22(2): 123–131.

26.

Freitas

Silva-Costa

Lopes

. Factors influencing hospital high length of stay outliers. BMC Health Serv Res 2012; 12(1): 265–265.

27.

Caetano N, Laureano R and Cortez P. A data-driven approach to predict hospital length of stay: a Portuguese case study. In: Proceedings of 16th International Conference on Enterprise Information Systems–ICEIS 2014, 2014, pp.407–414.

28.

Robinson GE, Goldstein M and Levine GM. Impact of nutritional status on DRG length of stay. J Parenter Enteral Nutr 1987; 11(1): 49–51.

29.

Chima

Barco

La Dewitt

. Relationship of nutritional status to length of stay, hospital costs, and discharge status of patients hospitalized in the medicine service. J Am Diet Assoc 1997; 97(9): 975–978.

30.

Isabel

Correia

. The impact of malnutrition on morbidity, mortality, length of hospital stay and costs evaluated through a multivariate model analysis. Clin Nutr 2003; 22(3): 235–239.

31.

Anderson

Moxness

Meister

. The sensitivity and specificity of nutrition-related variables in relationship to the duration of hospital stay and the rate of complications. Mayo Clin Proc 1984; 59(7): 477–483.

32.

Warnold

Lundholm

. Clinical significance of preoperative nutritional status in 215 non cancer patients. Ann Surg 1984; 199(3): 299–299.

33.

Epstein

Read

Hoefer

. The relation of body weight to length of stay and charges for patients undergoing elective surgery: as study of two procedures. Am J Public Health 1987; 77(8): 993–997.

34.

Chertow

Burdick

Honour

. Acute kidney injury, mortality, length of stay, and costs in hospitalized patients. J Am Soc Nephrol 2005; 16(11): 3365–3370.

35.

Portela

Santos

Silva

. Adoption of pervasive intelligent information systems in intensive medicine. Procedia Technol 2013; 9: 1022–1032.

36.

Anthony Celi

Mark

Stone

. “Big data” in the intensive care unit. Closing the data loop. Am J Respir Crit Care Med 2013; 187(11): 1157–1160.

37.

Harrison

Brady

Rowan

. Case mix, outcome and length of stay for admissions to adult, general critical care units in England, Wales and Northern Ireland: The intensive care national audit & research centre case mix programme database. Crit Care 2004; 9(Suppl 3): S1–S1.

38.

Nolan

Laver

Welch

. Outcome following admission to uk intensive care unit after cardiac ICNARC case mix programme database*. Anaesthesia 2007; 62(12): 1207–1216.

39.

Stow

Hart

Higlett

. Development and implementation of a high-quality clinical database: The australian and New Zealand intensive care society adult patient database. J Crit Care 2006; 21(2): 133–141.

40.

Levin

Harley

Fackler

. Real-time forecasting of pediatric intensive care unit length of stay using computerized provider orders. Crit Care Med 2012; 40(11): 3058–3064.

41.

Gardner

Hawley

East

. Real time data acquisition: recommendations for the medical information bus (MIB). Int J Clin Monit Comput 1991; 8(4): 251–258.

42.

Cook

Visscher

Hobbs

. Project impact: Results from a pilot validity study of a new observational database. Crit Care Med 2002; 30(12): 2765–2770.

43.

Guzman Castillo M. Modelling patient length of stay in public hospitals in Mexico. PhD Thesis, University of Southampton, UK, 2012.

44.

Marshall

Vasilakis

El-Darzi

. Length of stay-based patient flow models: Recent developments and future directions. Health Care Manag Sci 2005; 8(3): 213–220.

45.

Cox

. Analysis of survival data, UK: Chapman & HAll, 1984.

46.

Garg

Mcclean

Meenan

. Phase-type survival trees and mixed distribution survival trees for clustering patients’ hospital length of stay. Informatica 2011; 22(1): 57–72.

47.

McCarthy

. Hospital capacity: What is the measure and what is the goal? Med J Aust 2010; 193(5): 252–253.

48.

Jones

. Myths of ideal hospital size. Med J Aust 2010; 193(5): 298–300.

49.

Fackrell

. Modelling healthcare systems with phase-type distributions. Health Care Manag Sci 2008; 12(1): 11–26.

50.

Garg

McClean

Meenan

. A non-homogeneous discrete time Markov model for admission scheduling and resource planning in a care system. Health Care Manag Sci 2010; 13(2): 155–169.

51.

Garg

McClean

Meenan

. Non-homogeneous Markov models for sequential pattern mining of heatlhcare data. IMA J Management Math 2009; 20(4): 327–344.

52.

Garg L, McClean S, Barton M, et al. Forecasting hospital bed requirements and cost of care using phase type survival trees. In: 2010 5th IEEE International Conference Intelligent Systems (IS), 7 July 2010, pp.185–190. IEEE.

53.

Breiman

. Bagging predictors. Mach Learn 1996; 24(2): 123–140.

54.

Friedman

. Greedy function approximation: A gradient boosting machine. Ann Stat 2001; 1189–1232.

55.

Breiman

. Random forests. Mach Learn 2001; 45(1): 5–32.

56.

Vapnik

. The nature of statistical learning theory, Springer science & business media, 2013, 2013.

57.

Kim

Kil

Kang

. Prediction on Lengths of Stay in the Postanesthesia Care Unit Following General Anesthesiai Preliminary Study of the Neural. J Korean Med Sci 2000; 15: 25–30.

58.

Pofahl

Walczak

Rhone

. Use of an artificial neural network to predict length of stay in acute pancreatitis. Am Surg 1998; 64(9): 868–868.

59.

Azari A, Janeja VP and Mohseni A. Predicting hospital length of stay (phlos): A multi-tiered data mining approach. In: 2012 IEEE 12th International Conference on Data Mining Workshops (ICDMW), 10 December 2012, pp.17–24. IEEE.

60.

Griffin

Xia

Peng

. Improving patient flow in an obstetric unit. Health Care Manag Sci 2012; 15(1): 1–14.

61.

Hachesu

Ahmadi

Alizadeh

. Use of data mining techniques to determine and predict length of stay of cardiac patients. Healthc Inform Res 2013; 19(2): 121–129.

62.

Rowan

Ryan

Hegarty

. The use of artificial neural networks to stratify the length of stay of cardiac patients based on preoperative and initial postoperative factors. Artif Intell Med 2007; 40(3): 211–221.

63.

Liu P, Lei L, Yin J, et al. Healthcare data mining: Prediction inpatient length of stay. In: 2006 3rd International IEEE Conference on Intelligent Systems, 4 September 2006, pp.832–837. IEEE.

64.

Vasilakis

Marshall

. Modelling nationwide hospital length of stay: Opening the black box**. J Oper Res Soc 2005; 56(7): 862–869.

65.

Grigsby

Kooken

. Simulated neural networks to predict outcomes, costs, and length of stay among orthopedic rehabilitation patients. Arch Phys Med Rehabil 1994; 75(10): 1077–1081.

66.

Mobley

Leasure

. Artificial nerual network predictions of lengths of stay in a post-coronary care unit. Heart Lung 1995; 24(3): 251–256.

67.

Zernikow

Holtmannspotter

Michel

. Predicting length of stay in preterm neonates. Eur J Pediatr 1999; 158(1): 59–62.

68.

Silva

Cortez

Santos

. Mortality assessment in intensive care units via adverse events using artificial neural networks. Artif Intell Med 2006; 36(3): 223–234.

69.

Silva

Cortez

Santos

. Rating organ failure via adverse events using data mining in the intensive care unit. Artif Intell Med 2008; 43(3): 179–193.

70.

Buchman

Kubos

Seidler

. A comparison of statistical and connectionist models for the prediction of chronicity in a surgical intensive care unit. Crit Care Med 1994; 22(5): 750–762.

71.

Frye

Izenberg

Williams

. Simulated biologic intelligence used to predict length of stay and survival of burns. J Burn Care Rehabil 1996; 17(6): 540–546.

72.

. A data-driven approach to manage the length of Stay for Appendectomy Patients. IEEE Trans Syst Man Cybern A Syst Hum 2009; 39(6): 1339–1347.

73.

Rowan

Ryan

Hegarty

. The use of artificial neural networks to stratify the length of stay of cardiac patients based on preoperative and initial postoperative factors. Artif Intell Med 2007; 40(3): 211–221.

74.

Lim

Loh

Shih

. A comparison of prediction accuracy, complexity, and training time of thirty-three old and new classification algorithms. Mach Learn 2000; 40(3): 203–228.

75.

Yao Z, Liu P, Lei L, et al. R-C4. 5 Decision tree model and its applications to health care dataset. InServices Systems and Services Management. In: 2005 International Conference on Proceedings of ICSSSM’05, 13 June 2005, Vol. 2, pp.1099–1103. IEEE.

76.

Kudyba

Gregorio

. Identifying factors that impact patient length of stay metrics for healthcare providers with advanced analytics. Health Informatics J 2010; 16(4): 235–45.

77.

Faddy

McClean

. Analysing data on lengths of stay of hospital patients using phase-type distributions. Appl Stoch Model Bus 1999; 15(4): 311–317.

78.

Marshal

McClean

Shapcott

. Modeling patient duration of stay to facilitate resource management of geriatric hospitals. Health Care Manag Sci 2002; 5(4): 313–319.

79.

Marshal

McClean

. Conditional phase-type distributions for modelling patient length of stay in hospital. Int Trans Oper Res 2003; 10(6): 565–576.

80.

Golüke

Huibers

Stalpers

. An observational, retrospective study of the length of stay, and its influencing factors, among elderly patients at the emergency department. Eur Geriatr Med 2015; 6(4): 331–335.

81.

Xie

Chaussalet

Millard

. A model-based approach to the analysis of patterns of length of stay in institutional long-term care. IEEE Trans Inf Technol Biomed 2006; 10(3): 512–518.

82.

Irvine

McClean

Millar

. Stochastic models for geriatric in-patient behaviour. Math Med Biol 1994; 11(3): 207–216.

83.

McClean

McAlea

Millard

. Using a Markov reward model to estimate spend-down costs for a geriatric department. J Oper Res Soc 1998; 49(10): 1021–1025.

84.

Taylor

McClean

Millar

. Continuous-time Markov model for geriatric patient behaviour. Appl Stoch Model D A 1998; 13(3–4): 315–323.

85.

Taylor

McClean

Millard

. Stochastic models of geriatric patient bed occupancy behaviour. J R Stat Soc Ser A Stat Soc 2000; 163(1): 1–10.

86.

Wang

Tussey

. Reducing length of stay in emergency department: A simulation study at a community hospital. IEEE Trans Syst Man Cybern A Syst Hum 2012; 42(6): 1314–1322.

87.

Le Gall

Lemeshow

Saulnier

. A new simplified acute physiology score (saps ii) based on a European/North American multicenter study. JAMA 1993; 270(24): 2957–2963.

88.

Rué

Quintana

Álvarez

. Daily assessment of severity of illness and mortality prediction for individual patients. Crit Care Med 2001; 29(1): 45–50.

89.

Lemeshow

Klar

Teres

. Mortality probability models for patients in the intensive care unit for 48 or 72 hours: A prospective, multicenter study. Crit Care Med 1994; 22(9): 1351–1358.

90.

Wagner

Knaus

Harrell

. Daily prognostic estimates for critically ill adults in intensive care units: Results from a prospective, multicenter, inception cohort analysis. Crit Care Med 1994; 22(9): 1359–1372.

91.

Schaufer

Maurer

Jochimsen

. Outcome prediction models on admission in a medical intensive care unit: Do they predict individual outcome? Crit Care Med 1990; 18(10): 1111–1118.

92.

Kim

Park

. A comparison of intensive care unit mortality prediction models through the use of data mining techniques. Healthc Inform Res 2011; 17(4): 232–243.

93.

Pirracchio

Petersen

Carone

. Mortality prediction in intensive care units with the super ICU learner algorithm (sicula): A population-based study. Lancet Respir Med 2015; 3(1): 42–52.

94.

Celi

Galvin

Davidzon

. A database-driven decision support system: Customized mortality prediction. J Pers Med 2012; 2(4): 138–148.

95.

Dybowski

Gant

Weller

. Prediction of outcome in critically ill patients using artificial neural network synthesised by genetic algorithm. Lancet 1996; 347(9009): 1146–1150.

96.

Clermont

Angus

DiRusso

. Predicting hospital mortality for patients in the intensive care unit: A comparison of artificial neural networks with logistic regression models. Crit Care Med 2001; 29(2): 291–296.

97.

Ribas VJ, López JC, Ruiz-Sanmartín A, et al. Severe sepsis mortality prediction with relevance vector machines. In: Engineering in medicine and biology society, EMBC, 2011 annual international conference of the IEEE, 30 August 2011, pp.100–103. IEEE.

98.

Foltran

Berchialla

Giunta

. Using vlad scores to have a look insight ICU performance: Towards a modelling of the errors. J Eval Clin Pract 2010; 16(5): 968–975.

99.

Gortzis

Sakellaropoulos

Ilias

. Predicting ICU survival: A meta-level approach. BMC Health Serv Res 2008; 8(1): 157–157.

100.

Lucas

. Bayesian analysis, pattern analysis, and data mining in health care. Curr Opin Crit Care 2004; 10(5): 399–403.

101.

Sierra

Serrano

LarrañAga

. Using Bayesian networks in the construction of a bi-level multi-classifier. A case study using intensive care unit patients data. Artif Intell Med 2001; 22(3): 233–248.

102.

Ramon

Fierens

Güiza

. Mining data from intensive care patients. Adv Eng Inform 2007; 21(3): 243–256.

103.

Silva I, Moody G, Scott DJ, et al. Predicting in-hospital mortality of icu patients: The physionet/computing in cardiology challenge 2012. In: Computing in Cardiology (CinC), 9 September 2012, pp.245–248. IEEE.

104.

Silva

Cortez

Santos

. Mortality assessment in intensive care units via adverse events using artificial neural networks. Artif Intell Med 2006; 36(3): 223–234.

105.

Nimgaonkar

Sudarshan

. Predicting hospital mortality for patients in the intensive care unit: A comparison of artificial neural networks with logistic regression models. Intensive Care Med 2004; 30: 248–253.

106.

Wong

Young

. A comparison of icu mortality prediction using the apache ii scoring system and artificial neural networks. Anaesthesia 1999; 54(11): 1048–1054.

107.

Doig GS, Inman KJ, Sibbald WJ, et al. Modeling mortality in the intensive care unit: comparing the performance of a back-propagation, associative-learning neural network with multivariate logistic regression. In: Proceedings of the Annual Symposium on Computer Application in Medical Care, 1993, p.361. American Medical Informatics Association.

108.

Meyfroidt

Güiza

Ramon

. Machine learning techniques to examine large patient databases. Best Pract Res Clin Anaesthesiol 2009; 23(1): 127–143.

109.

Knaus

Draper

Wagner

. Apache ii: A severity of disease classification system. Crit Care Med 1985; 13(10): 818–829.

110.

Lemeshow

Teres

Klar

. Mortality probability models (mpm ii) based on an international cohort of intensive care unit patients. JAMA 1993; 270(20): 2478–2486.

111.

Vincent

De Mendonça

Cantraine

. Use of the sofa score to assess the incidence of organ dysfunction/failure in intensive care units: Results of a multicenter, prospective study. Crit Care Med 1998; 26(11): 1793–1800.

112.

Le Gall

Loirat

Alperovitch

. A simplified acute physiology score for ICU patients. Crit Care Med 1984; 12(11): 975–977.

113.

Poole

Rossi

Latronico

. Comparison between saps ii and saps 3 in predicting hospital mortality in a cohort of 103 italian ICUs. is new always better? Intensive Care Med 2012; 38(8): 1280–1288.

114.

Rosenberg

. Recent innovations in intensive care unit risk-prediction models. Curr Opin Crit Care 2002; 8(4): 321–330.

115.

Gilani

Razavi

Azad

. A comparison of simplified acute physiology score ii, acute physiology and chronic health evaluation ii and acute physiology and chronic health evaluation iii scoring system in predicting mortality and length of stay at surgical intensive care unit. Niger Med J 2014; 55(2): 144–144.

116.

Hug C. Detecting hazardous intensive care patient episodes using real-time mortality models. PhD Thesis, Doctoral dissertation, Massachusetts Institute of Technology, 2009.

117.

Knaus

Wagner

Draper

. The apache iii prognostic system. risk prediction of hospital mortality for critically ill hospitalized adults. Chest 1991; 100(6): 1619–1636.

118.

Le Gall

Neumann

Hemery

. Mortality prediction using saps ii: An update for french intensive care units. Crit Care 2005; 9(6): R645–R645.

119.

Metnitz

Schaden

Moreno

. Austrian validation and customization of the saps 3 admission score. Intensive Care Med 2009; 35(4): 616–622.

120.

Moreno

Metnitz

Almeida

. Saps 3âfrom evaluation of the patient to evaluation of the intensive care unit. Part 2: Development of a prognostic model for hospital mortality at ICU admission. Intensive Care Med 2005; 31(10): 1345–1355.

121.

Knaus

Draper

Wagner

. Apache ii: A severity of disease classification system. Crit Care Med 1985; 13(10): 818–829.

122.

Schuster

Ritschel

. The ability of the simplified acute physiology score (saps ii) to predict outcome in coronary care patients. Intensive Care Med 1997; 23(10): 1056–1061.

123.

Hekmat

Kroener

Stuetzer

. Daily assessment of organ dysfunction and survival in intensive care unit cardiac surgical patients. Ann Thorac Surg 2005; 79(5): 1555–1562.

124.

Schuiling

de Weerd

Dennesen

. The simplified acute physiology score to predict outcome in patients with subarachnoid hemorrhage. Neurosurgery 2005; 57(2): 230–236.

125.

Citi

Barbieri

. Physionet 2012 challenge: Predicting mortality of icu patients using a cascaded svm-glm paradigm. In: Computing in cardiology (CinC) 9 September 2012, pp. 257–260. IEEE.

126.

Walker

Kadam

. Predicting breast cancer survivability: A comparison of three data mining methods. Artif Intell Med 2005; 34(2): 113–127.

127.

Crawford

Batuello

Snow

. The use of artificial intelligence technology to predict lymph node spread in men with clinically localized prostate carcinoma. Cancer 2000; 88(9): 2105–2109.

128.

Paterson

MacLeod

Thetford

. Prediction of in-hospital mortality and length of stay using an early warning scoring system: Clinical audit. Clin Med 2006; 6(3): 281–284.

129.

Clark

Ryan

. Concurrent prediction of hospital mortality and length of stay from risk factors on admission. Health Serv Res 2002; 37(3): 631–645.

130.

Sessler

Sigl

Manberg

. Broadly applicable risk stratification system for predicting duration of hospitalization and mortality. The Journal of the American Society of Anesthesiologists 2010; 113(5): 1026–1037.

131.

Cai

Perez-Concha

Coiera

. Real-time prediction of mortality, readmission, and length of stay using electronic health record data. J Am Med Inform Assn 2016; 23(3): 553–561.

132.

Kubat

Matwin

. Addressing the curse of imbalanced training sets: One-sided selection. ICML Nashville, TN, 8 July 1997; Vol. 97: 179–186.

133.

Bader-El-Den M, Teitei E and Adda M. Hierarchical classification for dealing with the Class imbalance problem. In: 2016 International Joint Conference on Neural Networks (IJCNN), 24 July 2016, pp. 3584–3591. IEEE.

134.

Hein

Birnbaum

Wernecke

. Prolonged intensive care unit stay in cardiac surgery: Risk factors and long-term-survival. Ann Thorac Surg 2006; 81(3): 880–885.