This article has Open Peer Review reports available.
Use of an electronic administrative database to identify older community dwelling adults at high-risk for hospitalization or emergency department visits: The elders risk assessment index
© Crane et al; licensee BioMed Central Ltd. 2010
Received: 1 December 2009
Accepted: 13 December 2010
Published: 13 December 2010
The prevention of recurrent hospitalizations in the frail elderly requires the implementation of high-intensity interventions such as case management. In order to be practically and financially sustainable, these programs require a method of identifying those patients most at risk for hospitalization, and therefore most likely to benefit from an intervention. The goal of this study is to demonstrate the use of an electronic medical record to create an administrative index which is able to risk-stratify this heterogeneous population.
We conducted a retrospective cohort study at a single tertiary care facility in Rochester, Minnesota. Patients included all 12,650 community-dwelling adults age 60 and older assigned to a primary care internal medicine provider on January 1, 2005. Patient risk factors over the previous two years, including demographic characteristics, comorbid diseases, and hospitalizations, were evaluated for significance in a logistic regression model. The primary outcome was the total number of emergency room visits and hospitalizations in the subsequent two years. Risk factors were assigned a score based on their regression coefficient estimate and a total risk score created. This score was evaluated for sensitivity and specificity.
The final model had an AUC of 0.678 for the primary outcome. Patients in the highest 10% of the risk group had a relative risk of 9.5 for either hospitalization or emergency room visits, and a relative risk of 13.3 for hospitalization in the subsequent two-year period.
It is possible to create a screening tool which identifies an elderly population at high risk for hospital and emergency room admission using clinical and administrative data readily available within an electronic medical record.
The aging of the United States population represents a demographic imperative for innovation in the provision of healthcare to older Americans. Those aged 65 and older represented 12.4% of the total U.S. population in 2005, but this number is projected to double in the next twenty-five years. Accordingly, the population of older adults at high risk for hospitalization, nursing home placement or functional decline is also increasing, creating an enormous financial and capacity burden on the health care system.
Multiple interventions, such as case management and transition management programs, have targeted the prevention of recurrent hospitalizations among community-dwelling older adults, and are under great scrutiny in the arenas of research and policy development [2–5]. The complexity and cost of many of these interventions, combined with the demographic challenges, require that these resources be invested in the patient population that is most likely to benefit. In order to identify those patients, health care providers require some form of risk assessment to focus their efforts, recognizing that the elderly population is very heterogeneous in function and disease burden.
These challenges have led to the need for a predictive instrument that is accurate, easy to calculate, inexpensive, and does not require patient completion. Our group hypothesized that we could identify older adults at high risk for hospitalization or emergency department visits using only information readily available from a centralized electronic health record, without taking time away from staff and patients. This model is becoming increasingly feasible as national policy continues to strongly encourage the creation and use of electronic medical records. Hospitalization and emergency department encounters were chosen as independent outcomes, as both events are associated with premature institutionalization and high resource utilization [6, 7]. The primary aim of this study was to demonstrate that readily accessible information available in a provider's electronic medical record could be used to identify a population of community-dwelling older adults at high risk for hospitalization or emergency room utilization.
The study was a retrospective cohort of all patients aged 60 and older who were impaneled on January 1, 2005, in the Division of Primary Care Internal Medicine (PCIM) at Mayo Clinic in Rochester, MN. This division of the Department of Medicine serves local residents, Mayo Clinic employees, and their dependents. Rochester is a city of approximately 100,000 and is surrounded by small rural communities. There are only two other major primary care providers for older adults in this community: the Department of Family Medicine at Mayo Clinic and the Olmsted Medical Group.
All adults age 60 and older, assigned to a PCIM primary care provider on January 1, 2005, were included in the analysis. All subjects were community dwelling or lived in an assisted living facility within Olmsted County, MN.
Patients who were residing within a skilled nursing facility on January 1, 2005, were excluded from the study. Patients who did not give consent for their medical chart review were also excluded from analysis, in accordance with Minnesota state law.
Information was electronically abstracted from the electronic medical record and administrative databases within Mayo Clinic's health records system. Mayo Clinic maintains all electronic medical record information within one system, including hospital, emergency room, nursing home, and clinic-visit information. No individual chart abstraction was performed.
The demographic predictor variables collected included: date of birth, gender, marital status, race, and the number of hospital admission days in the prior two years (January 1, 2003 to December 31, 2004). Hospital days were stratified into two risk groups: one to five and six or more. Age was stratified into categories of 60 to 69, 70 to 79, 80 to 89, and 90 or older.
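The stratification described above can be sketched in a few lines of Python. This is an illustrative sketch only, not the study's abstraction code: the function names and category labels are hypothetical, and the "0" label for patients with no hospital days is an assumption (the text names only the two risk groups, which implies a zero-day reference group).

```python
from datetime import date

INDEX_DATE = date(2005, 1, 1)  # cohort index date stated in the text


def age_category(birth: date, index_date: date = INDEX_DATE) -> str:
    """Return the age stratum (in completed years) on the index date."""
    had_birthday = (index_date.month, index_date.day) >= (birth.month, birth.day)
    age = index_date.year - birth.year - (0 if had_birthday else 1)
    if age < 60:
        raise ValueError("cohort is restricted to patients aged 60 and older")
    if age < 70:
        return "60-69"
    if age < 80:
        return "70-79"
    if age < 90:
        return "80-89"
    return "90+"


def hospital_days_category(days: int) -> str:
    """Stratify prior two-year hospital days; '0' is an assumed reference label."""
    if days < 0:
        raise ValueError("hospital days cannot be negative")
    if days == 0:
        return "0"
    return "1-5" if days <= 5 else "6+"
```

For example, a patient born on January 2, 1940 is 64 on the index date and falls in the "60-69" stratum.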
Comorbid medical illnesses included the presence or history of diabetes mellitus, coronary artery disease (CAD), congestive heart failure (CHF), stroke, chronic obstructive pulmonary disease (COPD), history of cancer, history of hip fracture, and dementia. History of cancer excluded non-melanomatous skin cancers. Diagnoses were identified using ICD-9 billing codes entered by physicians during both inpatient and outpatient encounters. These comorbidities were chosen via consensus discussion based on their known risk for recurrent hospitalizations and greater complexity of care.
The primary outcome variable was the total number of hospitalizations or emergency room visits measured from the date of January 1, 2005, through December 31, 2006. Emergency room visits resulting in a direct hospital admission were recorded as a single outcome event. The total number of hospital admissions and admission days during the same two-year period were collected as secondary outcome measures.
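The counting rule for the primary outcome can be illustrated as follows. This is a minimal sketch under stated assumptions: the `Encounter` record and its `led_to_admission` flag are hypothetical (the source does not describe how encounters are stored), and the sketch assumes the resulting hospitalization appears as its own record starting inside the same follow-up window.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Encounter:
    kind: str                       # "ER" or "HOSP" (hypothetical coding)
    start: date
    led_to_admission: bool = False  # only meaningful for ER visits


def count_primary_outcome(encounters,
                          window_start=date(2005, 1, 1),
                          window_end=date(2006, 12, 31)):
    """Count ER visits plus hospitalizations in the follow-up window.

    An ER visit that led directly to admission is skipped so that the
    pair (ER visit, hospitalization) contributes a single outcome event.
    """
    total = 0
    for enc in encounters:
        if not (window_start <= enc.start <= window_end):
            continue
        if enc.kind == "ER" and enc.led_to_admission:
            continue  # the paired hospitalization record carries the event
        total += 1
    return total
```

With one stand-alone ER visit, one ER visit that led to an admission, and one further admission, the function returns 3 rather than 4.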
Predictor variables for the primary outcome of the total number of hospitalizations or emergency room visits were screened for further analysis using univariate regression models and 1-way ANOVA. The variables with a p-value greater than 0.05 were discarded. A final multivariable regression model using stepwise elimination was then constructed with only those significant predictors identified by the univariate stage. The category of "unknown" race was a significant univariate predictor, but was not included in the final model as the category was not large enough (5%) to statistically influence the final multivariable model and it proved difficult to act upon prospectively in identifying new, at-risk patients.
A total risk score for each individual was calculated based on the significant risk factors using regression estimates multiplied by ten in order to generate manageable scores. The scores were divided into quartiles, and the top quartile was further divided into the top 10% and the next 15% (75% to 90%), giving five risk categories. This split was chosen in an attempt to create categories in the highest risk groups with small enough populations to enable focused future interventions.
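The scoring and grouping steps can be sketched as below. The coefficient values are hypothetical placeholders, not the published estimates. Rounding each scaled coefficient to a whole number is an assumption, as is the nearest-rank rule used for the percentile cut points. Because scores are discrete and ties are common, the resulting groups will generally not contain exactly 25%, 25%, 25%, 15%, and 10% of patients.

```python
# Hypothetical coefficients for illustration only (not the study's estimates).
EXAMPLE_COEFFICIENTS = {
    "age_70_79": 0.12,
    "history_of_dementia": 0.48,
    "prior_hosp_days_6_plus": 0.93,
}

LABELS = ["lowest quartile", "second quartile", "third quartile",
          "75% to 90%", "top 10%"]


def points(coefficient: float) -> int:
    """Scale a regression estimate by ten; rounding is an assumption."""
    return round(coefficient * 10)


def risk_score(present_factors, coefficients) -> int:
    """Sum the points for every risk factor the patient has."""
    return sum(points(coefficients[name]) for name in present_factors)


def assign_categories(scores):
    """Label each score with one of five groups using nearest-rank cut points."""
    ordered = sorted(scores)
    n = len(ordered)
    cuts = []
    for pct in (25, 50, 75, 90):
        rank = max(1, -(-pct * n // 100))  # integer ceiling of pct * n / 100
        cuts.append(ordered[rank - 1])
    # A score's group index is the number of cut points it exceeds.
    return [LABELS[sum(score > cut for cut in cuts)] for score in scores]
```

Under these placeholder values, a patient aged 70 to 79 with a history of dementia would score 1 + 5 = 6 points.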
To estimate the precision of the score assignment, bootstrapping was used to draw 450 random samples from the original 12,650 patients with replacement. This method provides robust estimates of the standard error of a population parameter such as a regression coefficient. For every sample, a regression model was run using the same predictive variables. The estimate of each predictor in the validation model was the mean of the regression coefficients of each predictor from 450 runs. The standard error was obtained from the standard error of the mean estimates.
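The resampling procedure can be sketched generically. This sketch makes several assumptions: each resample is taken to be the same size as the original data set, the standard error is taken as the standard deviation of the bootstrap estimates (the usual bootstrap convention), and a single-predictor least-squares slope stands in for the multivariable model, which the authors fitted in SAS. The function names are hypothetical.

```python
import random
import statistics


def ols_slope(rows):
    """Least-squares slope of y on x for (x, y) pairs; a stand-in estimator."""
    xs = [x for x, _ in rows]
    ys = [y for _, y in rows]
    mean_x = statistics.fmean(xs)
    mean_y = statistics.fmean(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("predictor has no variation in this sample")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in rows)
    return sxy / sxx


def bootstrap_coefficient(rows, fit=ols_slope, n_resamples=450, seed=0):
    """Return (mean estimate, bootstrap standard error) over the resamples."""
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    n = len(rows)
    estimates = []
    for _ in range(n_resamples):
        # Draw n rows with replacement from the original data.
        resample = [rows[rng.randrange(n)] for _ in range(n)]
        estimates.append(fit(resample))
    return statistics.fmean(estimates), statistics.stdev(estimates)
```

Any estimator that maps a list of rows to a single coefficient can be passed as `fit`; a close match between the bootstrap mean and the original estimate, together with a small standard error, suggests the point assignment is stable.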
One-way ANOVA for means, Wilcoxon rank sum tests for medians, and Pearson chi-square tests for frequencies were used to compare variables across the five score categories. Hospitalizations and emergency visits within two years were compared across score categories using logistic regression analysis to provide odds ratios. Receiver operating characteristic (ROC) curves were developed to show the sensitivity and specificity of the risk score for hospitalization or emergency visits within two years.
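The discrimination measures named above can be sketched as follows. The function names are hypothetical, and the code illustrates the general definitions rather than reproducing the SAS analysis. The AUC is computed as the probability that a randomly chosen patient with the outcome has a higher score than one without it (ties count one half). The odds ratio uses a 2x2 table with a Woolf-type 95% confidence interval, which for a single unadjusted category contrast gives the same point estimate as a logistic regression with that category as an indicator.

```python
import math


def auc(scores, outcomes):
    """Area under the ROC curve via pairwise comparison (quadratic time)."""
    pos = [s for s, y in zip(scores, outcomes) if y]
    neg = [s for s, y in zip(scores, outcomes) if not y]
    if not pos or not neg:
        raise ValueError("need at least one patient with and without the outcome")
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def sensitivity_specificity(scores, outcomes, threshold):
    """Treat score >= threshold as 'flagged high risk'."""
    tp = sum(1 for s, y in zip(scores, outcomes) if y and s >= threshold)
    fn = sum(1 for s, y in zip(scores, outcomes) if y and s < threshold)
    tn = sum(1 for s, y in zip(scores, outcomes) if not y and s < threshold)
    fp = sum(1 for s, y in zip(scores, outcomes) if not y and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


def odds_ratio(events, non_events, ref_events, ref_non_events, z=1.96):
    """Odds ratio versus a reference category, with a Woolf 95% interval."""
    estimate = (events * ref_non_events) / (non_events * ref_events)
    se = math.sqrt(1 / events + 1 / non_events + 1 / ref_events + 1 / ref_non_events)
    lower = math.exp(math.log(estimate) - z * se)
    upper = math.exp(math.log(estimate) + z * se)
    return estimate, lower, upper
```

Sweeping `threshold` over the observed scores and plotting sensitivity against 1 minus specificity traces the ROC curve. For a cohort of this size, a rank-based AUC computation would be faster than the pairwise loop shown here.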
All information was directly entered via electronic abstraction into a Microsoft Excel (version 2003, Microsoft, Redmond, WA) spreadsheet for data entry, data retrieval, and analysis. The investigators analyzed the final information using SAS 9.1 (Cary, NC).
The Mayo Clinic Institutional Review Board (IRB) reviewed and approved the protocol. All aspects of the research on this project were conducted in accordance with the principles of the Declaration of Helsinki. The investigators also adhered to Minnesota state statutes regarding medical record use and privacy.
Regression Estimates and Scoring of Predictive Risk Factors: Original Model and Bootstrapping Validation Model
Age 90 or more
1-5 hosp days in 2003 or 2004
6 or more hosp days in 2003 or 2004
History of Diabetes
History of CAD/MI/CHF
History of Stroke
History of COPD
History of Cancer
History of Dementia
Characteristics of the Population by Quartile and top 10%
N = 2106 | N = 4114 | N = 3115 | N = 2129 | N = 1186
Age (± SD)
Age, n (%)
• Age 60-69
• Age 70-79
• Age 80-89
• Age >90
Female, n (%)
Stayed in a hospital (2003-2004),
Total hospital days (2003-2004), Median (Min, Max): 0 (0, 4) | 0 (0, 5) | 0 (0, 49) | 2 (0, 123) | 9.5 (0, 153)
Lived in a NH (2003-2004),
Previous history ever of NH stay,
History of Diabetes,
History of CAD/MI/CHF,
History of Stroke,
History of COPD,
History of Cancer,
History of Hip Fracture,
History of Dementia,
Total Number and Relative Risk of Total Emergency Room Visit and Hospital Stay, Emergency Room Visit Alone and Hospital Stay Alone by Risk Category in Two Years Follow-Up (2005-2006)
Relative Risk of ER Visit or Hospital Stay
OR (95% CI)
Relative Risk of ER Visit
OR (95% CI)
Relative Risk of Hospital Visit
Total Number of ER Visits/Hospital Admissions and Hospital Days By Risk Category in Two Years Follow-Up (2005-2006)
# of admissions (2 yrs), by risk category: 0.4 ± 0.8 | 0.7 ± 1.2 | 1.1 ± 1.6 | 1.6 ± 2.2 | 2.6 ± 2.9
# of hospital days (2 yrs), by risk category: 0.6 ± 3.5 | 1.4 ± 5.2 | 2.4 ± 6.3 | 4.1 ± 9.0 | 8.0 ± 13.3
In this study, a prognostic index was developed and validated, based on a scoring system that derived information from community-dwelling elderly patients' electronic medical records. The Elders Risk Assessment (ERA) index accurately identified older adults at high risk of emergency department encounters and hospitalization, two outcomes that can lead to significant morbidity, functional decline, and institutionalization.
Previous authors have developed screening instruments aimed at identifying high risk populations of older adults. The ERA was developed to address and overcome a number of barriers that are typically associated with these instruments.
One of the primary barriers is the requirement for patient self-reporting of information. The best validated self-administered prognostic index is the Probability of Repeated Admissions (PRA) [9, 10]. This eight-item tool has been widely used by managed care organizations to prospectively identify enrollees at risk for repeated hospital admissions and health care resource utilization. This instrument has been shown to have good discriminating ability for one-year risk of hospitalization, with reported areas under the ROC curve ranging from 0.620 to 0.696, depending on the validation population and setting [11–13]. Similarly, the Community Assessment Risk Screen (CARS) index identifies those older adults at increased risk of hospitalization or emergency department visits with self-reported information about medical conditions, medication use, and health service utilization. Utilizing this risk classification, Shelton and colleagues found the area under the ROC curve to be 0.74 for hospitalization or emergency department visits. Mazzaglia and colleagues utilized self-reported data (functional status, sensory impairment, unintentional weight loss, and use of home care services) from community-dwelling older adults in Florence, Italy, to create a risk score that was also found to be predictive of hospitalization (in the subsequent 15 years) with an AUC of 0.68. Unfortunately, low response rates, recall bias, literacy requirements, time, and cost have proven to be significant barriers to widespread use of self-reported instruments. Response rates for the PRA have ranged from 50% to 60% in the managed care setting [13, 17]. A major advantage of the ERA index is that it uses administrative data, which are unaffected by these limitations intrinsic to self-reported data.
The ERA also performed favorably when compared with the administrative or "proxy" PRA. The administrative PRA model derives information from a health plan's multiple databases, including a pharmacy database, chronic disease registries, billing data, and utilization data registries, to calculate a risk score which performs similarly to the original self-reported PRA (AUC 0.694 vs. 0.696) in predicting hospitalization. While undoubtedly useful in the managed care setting, this proxy model is challenging to adopt in traditional fee-for-service medical practices like ours, which serve patients who utilize a multitude of pharmacies and supplemental insurance carriers, thus limiting access to those database sources.
Combined hospitalization and emergency room visits were chosen as the primary outcome because they are early precursors to the functional decline and institutionalization that it is our goal to prevent. They also often result from acute changes in chronic conditions such as COPD, where early intervention by an outpatient provider may prevent recurrent admissions. In an effort to improve the primary care physician's awareness of these risks, we have subsequently adapted the ERA index for real-time use among our primary care providers in our electronic environment with a software system called Generic Disease Management Systems (GDMS). GDMS is a web-based application developed by Mayo Clinic and the Netherlands-based Noaber Foundation, which uses GE Web Services and a MSQweb.net platform to retrieve patient vital statistics such as blood pressure, weight, body mass index, age, demographic information, prior diagnoses, allergies, prior radiology diagnostic tests, and previous preventive services (e.g., immunizations, cancer and metabolic screenings, laboratory test results pertaining to diabetes, coronary artery disease, asthma, and depression) from different clinical information systems. The ERA score is now calculated in real time based on the scoring system described in this article and displayed on the GDMS printout that we include in the rooming packet for all our patient visits. This allows our providers to easily identify at-risk elders and to pay special attention to the patient if clinically needed.
This ability to measure ERA scores in real time is now being further developed into a registry which allows us to identify these high-risk patients as a unique population, similar to the population-based systems used to manage diabetics. Currently, this real-time registry is allowing the implementation and measurement of interventions such as transitions programs, discussions regarding goals of care, appointment access prioritization, and accelerated triage aimed at preventing recurrent admissions and secondary functional decline.
This study is not without methodological limitations. First, the patient information obtained from administrative databases was recorded prior to the outcome of interest for purposes other than investigation of our hypothesis. Coding data were utilized to identify whether individuals had been diagnosed with any of the six predictor comorbid conditions. Coding data may underestimate secondary diagnoses; however, other authors have found that administrative data such as ICD-9 codes typically correlate well with patient chart diagnoses.
Second, this study was a retrospective cohort analysis. This creates the possibility of underreported risk factors, as well as outcomes. Although most patients receive both their acute and chronic care from Mayo Clinic, as their primary provider, it is certainly possible that they could have hospitalizations or chronic diagnoses which are identified elsewhere and of which our electronic medical record is therefore unaware. Although the outcome data requires further prospective validation, the retrospective collection of risk factor variables is an essential component of the model design and one of the factors this hypothesis was designed to examine.
Third, we did not include functional-status measures in our initial predictive modeling. Functional-status measures are known to be independently associated with hospitalization and emergency department visits; however, functional-status data depend on patient-provided history or clinician-administered performance testing and are neither routinely collected nor easily extractable from administrative data [19–21]. Additionally, self-reported information such as functional status and medications fluctuates throughout an individual's life, further challenging the accurate collection and maintenance of these data. Although functional status was not utilized in our final model, the ERA index compared favorably with the aforementioned indices in which it was included.
Despite these limitations, the results of this study suggest that the ERA index is an effective, inexpensive, electronic risk identification model that can identify populations of older, community-dwelling adults who are at increased risk for hospitalization and emergency department encounters. Administrative and clinical data modeling may afford busy primary care practices or payor organizations the opportunity to identify high-risk populations so that they may effectively allocate resources and evidence-based preventive interventions to those individuals with the greatest need and greatest potential to benefit.
- US Census Bureau: [http://factfinder.census.gov]
- Coleman E, Parry C, Chalmers S, Min S: The Care Transitions Interventions: Results of a randomized controlled trial. Archives of Internal Medicine. 2006, 166: 1822-1828. 10.1001/archinte.166.17.1822.
- Caplan G, Williams A, Daly B, Abraham K: A randomized controlled trial of comprehensive geriatric assessment and multidisciplinary intervention after discharge of the elderly from the emergency department - the DEED II study. Journal of the American Geriatrics Society. 2004, 52: 1417-1423. 10.1111/j.1532-5415.2004.52401.x.
- CMMS: CMMS RFP for Care Management for High-Cost Beneficiaries (CMS-5015-N). 2004.
- Counsell S, Callahan C, Clark D, Tu W, Buttar A, Stump T, Ricketts G: Geriatric care management for low-income seniors: a randomized controlled trial. Journal of the American Medical Association. 2007, 298: 2623-2633. 10.1001/jama.298.22.2623.
- Shelton P, Sagar M, Schraeder C: Identifying elderly persons at risk for hospitalization or emergency department visit. American Journal of Managed Care. 2000, 40: 925-933.
- Miller E, Weissert W: Predicting elderly people's risk for nursing home placement, hospitalization, functional impairment and mortality: a synthesis. Med Care Res Rev. 2000, 57: 259-297. 10.1177/107755870005700301.
- McGeechan K, Macaskill P, Irwig L, Liew G, Wong T: Assessing New Biomarkers and Predictive Models for Use in Clinical Practice. Archives of Internal Medicine. 2008, 168: 2304-2310. 10.1001/archinte.168.21.2304.
- Boult C, Dowd B, McCaffrey D, Boult L, Hernandez R, Krulewitch H: Screening elders at risk for hospital admission. Journal of the American Geriatrics Society. 1993, 41: 811-817.
- Pacala J, Boult C, Boult L: Predictive validity of a questionnaire that identifies older persons at risk for hospital admission. Journal of the American Geriatrics Society. 1995, 42: 374-377.
- Coleman E, et al: Predicting hospitalization and functional decline in health plan enrollees: are administrative data as accurate as self report?. JAGS. 1998, 46: 419-425.
- Wagner J, Bachmann L, Boult C, Harari D, von Renteln-Kruse W, et al: Predicting the risk of hospital admission in older persons-validation of a brief self-administered questionnaire in three European countries. J Am Geriatr Soc. 2006, 54: 1271-1276. 10.1111/j.1532-5415.2006.00829.x.
- Pacala J, Boult C, Reed R, Aliberti E: Predictive validity of the Pra Instrument among older recipients of managed care. J Am Geriatr Soc. 1997, 45: 614-617.
- Mazzaglia G, et al: Screening of older community dwelling people at risk for death and hospitalization. J Am Geriatr Soc. 2007, 55: 1955-1960. 10.1111/j.1532-5415.2007.01446.x.
- Hertzog A, WL R: Age and response rates to interview sample surveys. J Gerontol. 1988, 43S: 200-205.
- Root J, Stablesford S: Easy to read consumer communications: a missing link in Medicaid managed care. J Health Polit Policy Law. 1999, 24: 1-26.
- Vojta C, Vojta D, TenHave T, Amaya M, Lavizzo-Mourey R, Asch D: Risk screening in a Medicare/Medicaid population. J Gen Internal Med. 2001, 16: 525-530. 10.1046/j.1525-1497.2001.016008525.x.
- Quan H, Parsons G, Ghali W: Validity of information on comorbidity derived from ICD-9-CCM administrative data. Med Care. 2002, 40: 675-685. 10.1097/00005650-200208000-00007.
- Fried L, Guralnik N: Disability in older adults: evidence regarding significance, etiology, and risk. J Am Geriatr Soc. 1997, 45: 92-100.
- Fried L, Kronmal R, Newman A, et al: Risk factors for 5 year mortality in older adults: the Cardiovascular Health Study. JAMA. 1998, 279: 585-592. 10.1001/jama.279.8.585.
- Bogardus S, Towle V, Williams C, Desai M, Inouye S: What does the medical record reveal about functional status. J Gen Internal Med. 2001, 16: 728-836. 10.1111/j.1525-1497.2001.00625.x.
- The pre-publication history for this paper can be accessed here: http://0-www.biomedcentral.com.brum.beds.ac.uk/1472-6963/10/338/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.