Truveta Studio provides health data and analytics


BELLEVUE, Wash., Nov. 02, 2022 (GLOBE NEWSWIRE) -- Today, Truveta announced the availability of Truveta Studio, which brings together unprecedented health data and analytics so that researchers can study patient care and outcomes for any condition, drug, or medical device. Offering the latest, most comprehensive, and highest quality US health data, Truveta Studio is an integrated solution that combines data and analytics to accelerate real-time learning.

Healthcare is profoundly challenged by inaccessible, fragmented and unstructured data. Clinical trials take many years, are very expensive, and lack appropriate diversity, leaving critical evidence gaps in medicine. Motivated by the lack of useful data on how best to respond during COVID-19, 25 healthcare systems providing 16% of all healthcare in the United States formed Truveta.

Today, Truveta introduces Truveta Studio, an unprecedented health analytics solution based on an exabyte of health data. Data from Truveta members is normalized, anonymized, and made available for research daily, enabling researchers to uncover insights today into care delivered yesterday.

“Truveta Studio brings together unprecedented health data and analytics to study patient care and outcomes like never before,” said Terry Myerson, CEO of Truveta. “Humanity will benefit as healthcare advances with new insights made possible by AI, providing answers to complex medical questions in days, not years.”

Truveta Data is unprecedented

Today, most research is conducted on outdated claims data that does not include critical information, such as symptoms or lab test results that led to the diagnosis. When clinical data is accessible, it is unstructured and not useful for analysis. Truveta Studio is the first solution to make massive streams of daily clinical data useful for analysis through the integration of natural language processing and AI-based de-identification. Truveta Data is:

  • Timely: Updated daily from the care of Truveta’s 25 health system members, allowing researchers to learn in real time from the most current view of health in the United States.
  • Representative: Truveta members provide patient care in 43 states where 97% of the US population resides. Truveta Data covers the full diversity of the United States in terms of age, geography, race, ethnicity, and gender.
  • Complete: Truveta’s breadth of data is matched by unparalleled depth, including medical records with full diagnoses, vital signs, lab tests, clinical notes and images. Truveta data is linked across providers and with daily mortality data and comprehensive social factors from LexisNexis health data. Insurance claims complete the patient journey when medical records are not available. The result is a complete and anonymized longitudinal course for each patient.
  • Normalized: Unstructured medical record content is mapped to clinical ontology standards, such as LOINC for laboratory testing and GUDID for medical devices.
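To give a sense of what mapping to a standard such as LOINC means, here is a minimal Python sketch. It is an illustration only, not Truveta's method: the release describes normalization driven by natural language processing, whereas this uses a simple lookup table. The local lab names, the `LOCAL_TO_LOINC` table, and the `normalize_lab_name` function are hypothetical; the LOINC codes shown are real published codes.

```python
# Illustrative only: a tiny lookup from free-text lab names to LOINC codes.
# The local names and this table are hypothetical, not Truveta's mapping.
LOCAL_TO_LOINC = {
    "glucose": "2345-7",         # Glucose [Mass/volume] in Serum or Plasma
    "blood sugar": "2345-7",
    "hba1c": "4548-4",           # Hemoglobin A1c/Hemoglobin.total in Blood
    "hemoglobin a1c": "4548-4",
    "creatinine": "2160-0",      # Creatinine [Mass/volume] in Serum or Plasma
}


def normalize_lab_name(raw):
    """Return the LOINC code for a free-text lab name, or None if unknown."""
    # Lowercase and collapse repeated whitespace before the lookup.
    key = " ".join(raw.lower().split())
    return LOCAL_TO_LOINC.get(key)


if __name__ == "__main__":
    for name in ["  Blood  Sugar ", "HbA1c", "unknown test"]:
        print(repr(name), "->", normalize_lab_name(name))
```

Once different spellings resolve to the same code, results recorded differently by different health systems can be analyzed together.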

Today, data providers do not share the sources of their data. Committed to earning trust, Truveta provides a comprehensive fact sheet on the national population represented in Truveta data and on each population studied. This fact sheet includes patient counts, diversity statistics, completeness and timeliness, and sources for all data.

Truveta Studio enables fast and transparent clinical analyses

Today, researchers face frustrating delays of months in assessing the feasibility of generating a representative population for analysis, and then further delays in setting up the secure data analysis infrastructure. Fragmented and limited tools slow research, increase costs, and limit transparency and confidence in study findings. Today, Truveta Studio eliminates these problems with several industry firsts:

Truveta Prose makes medical concepts computable

Today, individual research projects define medical concepts with custom, opaque, and limited expressions. Truveta Prose is the first language to express computable medical concepts combining events from a patient’s longitudinal history, including diagnoses, labs, procedures, drugs, vaccinations, devices, or any concept found in a clinical note. Just as Google searches the internet, researchers can search Truveta data for any population defined by Prose in seconds.

“Researchers often spend countless hours trying to stratify and define the patient populations they seek to study before they can even begin their analysis,” said Eric Eskioglu, MD, MBA, executive vice president and chief medical and scientific officer at Novant Health. “Truveta not only provides consistency and transparency between different clinical concepts and outcomes, but also fundamentally reduces the cost and increases the speed of research, enabling scientists to gain insights faster to save more lives.”

For example, defining a patient hospitalized with COVID-19 involves an important medical nuance. Some researchers may rely on diagnostic codes for inpatient encounters; however, such a definition will also capture patients who had COVID but were not necessarily hospitalized because of it. Other researchers may use logic such as COVID-specific drug use, oxygenation status, need for intubation, or specific lab markers. To help researchers navigate this complexity, Truveta Prose allows medical concepts such as “COVID Hospitalization from Diagnostic Criteria” or “COVID Hospitalization from Medication Use” to be defined in a computable form as Truveta definitions, which can be used to analyze Truveta data.

In fact, Truveta Research recently used the “COVID Hospitalization with Diagnostic Criteria” definition to explore potential racial and ethnic disparities in COVID hospitalizations during different time periods throughout the pandemic. The results are available now as a preprint and summary on the Truveta Research blog.

Or, if a researcher wanted to study people with diabetes who had kidney transplants within two years of being diagnosed with diabetes, the researcher would use the Truveta definitions of both “diabetes” and “kidney transplant” and easily specify the two-year time interval.

“Yesterday I found about 20,000 anonymized patients in Truveta Studio with this combination of criteria in just a few seconds,” said Michael Simonov, MD, vice president of clinical informatics at Truveta. “Finally, I can express complex clinical concepts in a computable and logical format, combine complex clinical definitions, and quickly visualize population size and demographics. I love that I can run feasibility studies and generate hypotheses at incredible speed.”

Truveta Prose enables unprecedented transparency in how medical concepts are computed in a study, helping to build confidence in that study’s conclusions.
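The diabetes and kidney transplant example above can be sketched in Python with pandas. This is not Prose syntax, which this release does not show, and it does not reflect Truveta’s data model: the `events` table, its column names, and the `transplant_within_window` function are hypothetical, made up to illustrate what a time-constrained combination of two definitions looks like.

```python
import pandas as pd

# Hypothetical, hand-made event table; real Truveta data has its own schema.
events = pd.DataFrame(
    {
        "patient_id": [1, 1, 2, 2, 3, 4, 5, 5],
        "concept": [
            "diabetes", "kidney_transplant",   # patient 1: transplant ~17 months after diagnosis
            "diabetes", "kidney_transplant",   # patient 2: transplant ~4 years after diagnosis
            "diabetes",                        # patient 3: diagnosis only
            "kidney_transplant",               # patient 4: transplant only
            "kidney_transplant", "diabetes",   # patient 5: transplant before diagnosis
        ],
        "date": pd.to_datetime(
            [
                "2018-01-10", "2019-06-01",
                "2015-03-05", "2019-04-01",
                "2017-07-20",
                "2020-02-02",
                "2016-01-01", "2017-01-01",
            ]
        ),
    }
)


def transplant_within_window(events, years=2):
    """Return sorted patient IDs with a kidney transplant on or after the
    first diabetes diagnosis and no later than `years` years after it."""
    # Earliest diabetes diagnosis per patient.
    first_dx = (
        events[events["concept"] == "diabetes"]
        .groupby("patient_id")["date"]
        .min()
        .rename("dx_date")
        .reset_index()
    )
    # All transplant events, one row each.
    transplants = events.loc[
        events["concept"] == "kidney_transplant", ["patient_id", "date"]
    ].rename(columns={"date": "tx_date"})
    # Keep only patients who have both a diagnosis and a transplant.
    merged = transplants.merge(first_dx, on="patient_id", how="inner")
    window_end = merged["dx_date"] + pd.DateOffset(years=years)
    in_window = (merged["tx_date"] >= merged["dx_date"]) & (
        merged["tx_date"] <= window_end
    )
    return sorted(merged.loc[in_window, "patient_id"].unique().tolist())


if __name__ == "__main__":
    print(transplant_within_window(events))
```

Writing the time window as an explicit parameter makes the cohort definition easy to inspect and to vary, which is the kind of transparency the release attributes to computable definitions.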

The Truveta Library accelerates collaboration and learning

Truveta definitions can be shared within the Truveta Library to facilitate the creation of study populations and accelerate the accumulation of computable medical knowledge. The Truveta Library already contains thousands of Truveta definitions provided by experienced clinical informaticians.

As an example of simplified complexity, the Truveta definition of hospitalization for heart failure contains dozens of diagnostic codes and combines data on encounters, medications administered, and lab results, all tied to complex time constraints.

“For researchers, this is really exciting,” said Ari Robicsek, MD, director of medical analytics and senior vice president of research at Providence. “Truveta Studio offers a huge, comprehensive and up-to-date data set. And the Truveta Library facilitates critical documentation and communication of how we define our cohorts.”

Truveta Notebooks enable convenient analytics

Today, individual research projects require custom data infrastructure, which leads to delays, expense, and privacy and security risks, and limits the ability to share underlying statistics seamlessly. Truveta Studio includes an integrated Jupyter notebook atop a serverless SQL experience, pre-installed with the latest medical statistics and visualization libraries, including pandas, NumPy, Matplotlib, SciPy, Tidyverse, Arrow, and dplyr, with full support for R and Python. Built-in analytics allow distributed research and data science teams to investigate daily updated Truveta data populations hassle-free in Studio and to share underlying statistics seamlessly, building confidence in a study’s conclusions.

Unlimited Discovery

Truveta offers the best real-world data value with subscriptions that include daily data and unlimited analytics by an unlimited number of users. A Truveta subscription supports unlimited use across quality of care, health equity, comparative effectiveness, safety, label expansion, AI training, regulatory filings, and publications. To learn more and schedule a demo, visit the Truveta website or contact us at [email protected].

About Truveta

Truveta was formed and is governed by US health systems with a shared vision of saving lives through data. Truveta now offers the world’s first health data and analytics solution to study patient care and outcomes. To learn more, follow us on LinkedIn and visit the Truveta website.

About Truveta Members

Truveta’s 25 members provide 16% of patient care in the United States at more than 20,000 clinics and 700 hospitals. Anonymized data from this care is provided daily to Truveta. Truveta membership includes Providence, Advocate Aurora Health, Trinity Health, Tenet Healthcare, Northwell Health, AdventHealth, Baptist Health of Northeast Florida, Baylor Scott & White Health, Bon Secours Mercy Health, Centura Health, CommonSpirit Health, Hawaii Pacific Health, Henry Ford Health System, HonorHealth, MedStar Health, Memorial Hermann Health System, MetroHealth, Novant Health, Ochsner Health, Saint Luke’s Health System, Sentara Health, Texas Health Resources, UnityPoint Health, Virtua Health, and WellSpan Health.

  • Truveta Studio COVID-19 Dashboard

  • The 25 members of Truveta’s health system

