Predictive Analytics Solutions

Defining Predictive Analytics in Healthcare

Predictive analytics and machine learning in healthcare are rapidly becoming some of the most-discussed, perhaps most-hyped topics in healthcare analytics. Machine learning is a well-studied discipline with a long history of success in many industries. Healthcare can learn valuable lessons from this previous success to jumpstart the utility of predictive analytics for improving patient care, chronic disease management, hospital administration, and supply chain efficiencies. The opportunity that currently exists for healthcare systems is to define what “predictive analytics” means to them and how can it be used most effectively to make improvements.

However, predictions made solely for the sake of making a prediction are a waste of time and money. In healthcare and other industries, prediction is most useful when that knowledge can be transferred into action. The willingness to intervene is the key to harnessing the power of historical and real-time data. Importantly, to best gauge efficacy and value, both the predictor and the intervention must be integrated within the same system and workflow where the trend occurs.

The following Health Catalyst® paper, “Using Predictive Analytics in Healthcare: Technology Hype vs Reality” is a good summary of both the hype and hope of predictive analytics in healthcare


How To Get Started With Predictive Analytics and Machine Learning

Given the many pitfalls to avoid in healthcare predictive analytics, then where do you get started?  The most important starting point is to establish a fundamental data and analytic infrastructure upon which to build. Deliberately but quickly move your organization up the levels of the Healthcare Analytics Adoption Model. This model draws upon lessons learned from the HIMSS EHR Adoption Model and describes a similar approach for assessing the adoption of analytics in healthcare. This model starts a level 1 foundation of an integrated, enterprise data warehouse combined with a basic set of foundational and discovery analytic applications.

1.  Start With an Integrated Data Warehouse and Analytics Platform

gearsEnterprise Data Warehouse

You need data across the entire continuum of care to manage patient populations. This requires an enterprise data warehouse (EDW) platform. An EDW is the central platform upon which you can build a scalable analytics approach to systematically integrate and make sense of the data.

Health Catalyst® deploys a unique Late-Binding™ Data Warehouse that enables healthcare organizations to automate extraction, aggregation, and integration of clinical, financial, administrative, patient experience, and other relevant data and apply advanced analytics to organize and measure clinical, patient safety, cost, and patient satisfaction processes and outcomes.

2.  Use the Three Basic Steps of Predictive Modeling


The following is a simple schematic of the predictive modeling process.  For predictive analytics to be effective, Lean practitioners must truly “live the process” to best understand the type of data, the actual workflow, the target audience and what action will be prompted by knowing the prediction.

  1. The first step is to carefully define the problem you want to address, then gather the initial data necessary and evaluate several different algorithm approaches.
  2. Step two refines this process by selecting one of the best performing models and testing with a separate data set to validate the approach.
  3. The final step is to run the model in a real world setting.

The more specific term is prescriptive analytics, which includes evidence, recommendations and actions for each predicted category or outcome.  Specifically, prediction should link carefully to clinical priorities and measurable events such as cost effectiveness, clinical protocols or patient outcomes. Finally, these predictor-intervention sets are best evaluated within that same data warehouse environment.

So many options exist when it comes to developing predictive algorithms or stratifying patient risk. This presents a daunting challenge to health care personnel tasked with sorting through all the buzzwords and marketing noise.  Healthcare providers need to partner with groups that have a keen understanding of the leading academic and commercial tools, and the expertise to develop appropriate prediction models.

Follow 4 Key Lessons Learned for Adopting Predictive Analytics and Machine Learning in Healthcare

Given that predictive analytics are listed as level 7 out of the 8 possible levels on the Healthcare Analytics Adoption Model, there are many keys and pitfalls that can occur at such a level if not properly prepared. Fortunately for healthcare, there are numerous existing models from other industries that can be combined with past healthcare examples to ease some of the potential pains and pitfalls. Highlights of some those key lessons include:

  1. Don’t confuse more data with more insight: While many solid scientific findings may be interesting, they do little to significantly improve current clinical outcomes.
  2. Don’t confuse insight with value: While many solid scientific findings may be interesting, they do little to significantly improve current clinical outcomes.
  3. Don’t overestimate the ability to interpret the data: Sometimes even the best data may afford only limited insight into clinical health outcomes.
  4. Don’t underestimate the challenge of implementation: Leveraging large data sets successfully requires a health system to be prepared to embrace new methodologies; this, however, may require a significant investment of time and capital and alignment of economic interests.

The following Health Catalyst Executive Report, “4 Essential Lessons for Adopting Predictive Analytics in Healthcare”  expounds more in detail around each of these 4 lessons:


In order to be successful, we feel that clinical event prediction and subsequent intervention should be both content driven and clinician driven.  Importantly, the underlying data warehouse platform is key to gathering rich data sets necessary for training and implementing predictors.  Notably, prediction should be used in the context of when and where needed—with clinical leaders that have the willingness to act on appropriate intervention measures.

In the end, the overall goal is to leverage historical patient data to improve current patient outcomes. Predictive analytics is a powerful tool in this regard.

Health Catalyst Predictive Analytics and Machine Learning Solutions

Join Health Catalyst’s Machine Learning Community

Learn from your fellow citizen data scientists about how to use to start using machine learning within your health system.

Receive weekly notifications, learning tips and live broadcasts.

Health Catalyst not only has the expertise to develop machine learning models, but our underlying healthcare analytics platform is key to gathering the rich data sets necessary for training and implementing predictors. Notably, our prediction is only used “in context”—meaning when and where needed, with clinical leaders that have the willingness to act on appropriate intervention measures. Most important, however, these predictor-intervention sets can best be monitored and measured within that same data warehouse environment where otherwise not possible. Health Catalyst’s new machine learning solution makes machine learning in healthcare routine, actionable, and pervasive through three avenues:

  •™—our machine learning models and strategy for building machine learning into all Health Catalyst products.
  •™—our way of stimulating the adoption of machine learning in healthcare through free, open-source machine learning software that democratizes machine learning by lowering barriers to entry.
  • Healthcare analytics platform—the second-to-none backbone (foundation) for machine learning.

Within Health Catalyst, data modeling and algorithm development is performed using industry leading tools for data mining and supervised machine learning via our open-source R and Python packages. Ongoing efforts include classification models for a generalized predictor of hospital readmissions, heart failure, length of stay, and clustering of patient outcomes to historical cohorts at time of admit. Most importantly, we have internal access to millions of de-identified hospital records in both the inpatient and outpatient settings and adult and pediatric populations. This training data is crucial to addressing the predictive analytics and machine learning demands of clients and site customization.

We have a number of analytic applications that can be used in predictive analytics and machine learning initiatives, including CLABSI, Labor Management Explorer, COPD, Patient Flow Explorer.  So, when your request comes—whether it involves classification or clustering or feature selection—Health Catalyst has the tools and the data and the expertise to successfully deliver top performing predictive analytics. If you have interest or questions on any of these applications, feel free to contact us or schedule a demo by filling out our online form.

Health Catalyst Predictive Analytics and Machine Learning Products and Services

View Webinar on Machine Learning Using On-Demand

Levi Thatcher and his data science team hosted a webinar titled “Machine Learning Using A hands-on Learning Session” with several learning objectives:

  • Describe and install
  • Build and evaluate a machine learning model
  • Deploy interpretable predictions to SQL Server
  • Discuss the process of deploying into a live analytics environment


Speak with Someone at Health Catalyst About Using Machine Learning

If you’re interested in learning more about using predictive analytics and machine learning to improve outcomes, contact the Health Catalyst Data Science team.


Read More About Predictive Analytics and Machine Learning in Healthcare

4 Essential Lessons for Adopting Predictive Analytics in Healthcare
David K. Crocket, Ph.D.

Predictive Analytics: Healthcare Hype or Reality?
David K. Crocket, Ph.D.

What is Data Mining and its Use for Predictive Analytics in Healthcare?
David K. Crocket, Ph.D.

3 Reasons Why Comparative Analytics, Predictive Analytics, and NLP Won’t Solve Healthcare’s Problems
Dale Sanders, Vice President

How Makes Machine Learning Accessible to Everyone in Healthcare
Levi Thatcher, Director of Data Science Blog
Our blog focuses on healthcare data science, including machine learning, visualization, R, Python, the predictive packages, as well as using these tools to understand and improve population health outcomes.

Join our growing community of healthcare leaders

Stay informed by getting the latest news and exclusive updates from Health Catalyst.

This site uses cookies

We take pride in providing you with relevant, useful content. May we use cookies to track what you read? We take your privacy very seriously. Please see our privacy policy for details and any questions.