By Richard Gliklich, MD, CEO, OM1 | May 22, 2023
Integrated evidence generation (IEG) is a framework for generating evidence to support decision-making in healthcare. It involves the integration of multiple sources of real-world data (RWD), including electronic health records (EHR), claims data, and patient-generated data, to provide a more comprehensive and accurate view of the effectiveness, safety, and value of healthcare interventions.
To make these data fit for purpose, the data used and analyses generated must meet the evidentiary standard for the particular use case. For example, a regulatory use case will have a different evidentiary bar for fit than a peer-reviewed publication use case.
FDA Guidance Documents
It’s important to set the table with Considerations for the Use of Real-World Data and Real-World Evidence to Support Regulatory Decision-Making for Drug and Biological Products: Guidance for Industry, a guidance document from the FDA. It defines RWD as “data relating to patient health status and/or the delivery of health care routinely collected from a variety of sources”. It also goes to great lengths to define an interventional study, (or clinical trial) as a study in which participants are “assigned to one or more interventions, according to a study protocol, to evaluate the effects of those interventions on subsequent health-related biomedical or behavioral outcomes”, from a non-interventional study, (or observational study) as a study in which “patients received the marketed drug of interest during routine medical practice and are not assigned to an intervention according to a protocol”.
Transparency regarding data collection and analysis is where making decisions regarding RWD for a regulatory purpose versus a research purpose is essential. One suggestion the document makes is, if certain RWD are owned by third parties, have agreements in place to ensure that all relevant patient-level data can be provided to FDA, and that source data necessary to verify the RWD are made available for inspection. This suggests that the FDA intends to inspect sites that are participating in the provision of RWD and want it to be auditable. The source data needs to be available to be compared to the data within the database.
Technology Platforms & Specialized Networks
We think about this technology in three categories:
- OM1 Origin™: connecting and sourcing platform
This technology connects to the source record systems such as EMR systems, labs, and existing registries. This meets the FDA requirement for maintaining the reliability of the RWD and data integrity, beginning with extraction of the data from its origin (i.e., data accrual). Automation is a game changer in these programs because it reduces burden on the practices and lowers cost.
- OM1 EMR Engine™: data processing and enrichment platform
The Engine brings in data from Origin to aggregate and de-identify so we can follow each person as a unique patent within the system while reducing the risk of exposure. Once this is done, additional data can be derived from it including unstructured data and linking to additional data sources. These additional sources are not primary to the study but should be disclosed in the protocol with the FDA. While they may add additional value, they may not be auditable like the EMR data. If this is done, both the source of data and the linkage parameters that will be used need to be included in the protocol as well. Data curation, transformation, and reporting of results is a requirement for FDA submission.
- OM1 Apps: advanced analytics platform
We leverage the clinical narrative in two ways. One is where we use medical language processing (MLP) to convert unstructured data to structured data to extract variables and identify key outcomes within a disease. Our process is validated against the ‘gold standard’, clinician abstraction.
The second use of unstructured data is the closest to magic we’re ever seen, by taking a clinical narrative and estimating or imputing an outcome measure that may not have been assessed during the evaluation. For example, EDSS, SLEDAI, CDAI, BASDAI, PHQ9I, and NYHA.
Automated data collection: Reducing Site Burden & Decreasing Costs
To create reusable sites for evidence generation, OM1 Origin™ connects directly to site EMR and other systems. We then utilize our extensive library of adapters specific to each site’s EMR instance to identify a sub-network within a network for a specific disease. The sites would then sign off on a protocol, if it were a regulatory study, bringing to light that they will be inspected by the FDA and have monitoring set-up.
On top of this, some programs require ancillary data collection including clinical outcomes assessments, patient reported outcomes (PRO), images or biospecimens, coupled with automated EMR extraction. According to the guidance documents, these are not considered clinical investigations under part 312. Applicable requirements for protection of human subjects must be met.
Regulatory-focused Research using Automated Networks
Automation is changing the paradigm by reducing site burden, reducing cost and waiving informed consent. The traditional model is to build a protocol, identify a site, recruit, go through contracting and IRB, implement systems, collect the data, lock the data and close out the sites. In a reusable infrastructure this can be done, but, new protocols can be added so you can keep adding to the network to do additional studies or sub-studies. This saves an enormous cost and achieves much greater efficiency when large parts of the data collection process have been automated and rely far less on work by investigators and research staff.