Clinical Trial Harmonization

Harmonize your entire trial portfolio into proprietary, analysis-ready data assets

Tremendous value lies dormant in completed clinical trial datasets, but accessing cross-trial insights depends on the ability to accurately and efficiently harmonize trial results to unified SDTM, ADaM, or other formats. Cornerstone’s AI-powered clinical trial harmonization tool accelerates research by maximizing the use of both existing internal and external data assets. Leverage pooled datasets to inform future trial design, create benchmarks for monitoring live trials, and more with Cornerstone.

Use pooled trial data from live and/or completed studies to train internal AI models

Improve new protocol designs using what has worked and what hasn’t.

Turn heterogenous data sources into large-scale, analysis-ready resources

The Challenge

SDTM format is focused on submission to the FDA, and while strict in some areas, it allows for flexibility across trials. For example, laboratory results, tumor response, and other key domains are not required to be standardized by the FDA. These domains may not be well standardized within any one trial and are rarely well-standardized across trials.
‍
As a result, traditional trial harmonization approaches are resource- and time-intensive.

One top-10 sponsor estimated that historically it took 1 - 3 months of technical employee time to manually standardize a single trial. In addition to this direct economic cost, the opportunity cost associated with the time it takes to complete vs. work on other higher-value projects is substantial and often underlooked.

‘Event’ Tables Dictionaries

Domain

Ontology

Challenges

Adverse Events

MedDRA (Preferred Term)

Lacking consistency across trials

Concomitant Medications

WHODrug + CDISC Frequency/Route

Lacking consistency across trials

Medical History

MedDRA

Poorly Standardized

Procedures

CPT

Poorly Standardized

‘Test’ Tables Dictionaries

Domain

Ontology

Challenges

Labs

CDISC Lab Test/Unit

CDISC is limited, noncommon labs may not be standardized and require other dictionaries (eg LOINC)

Disposition, Response

Not Standardized to Consistent Ontology

Custom dictionaries are needed for test name, test category, and result

Our Product

With Cornerstone, teams can access fully and consistently standardized SDTM, ADaM, or EHR data across multiple study datasets.

Unlock the potential of aggregated data assets

AI-powered platform that ingests, maps, and processes any dataset

Cornerstone can ingest any native SDTM, ADaM, or other data schema and produce consistent outputs across tables and fields.

Add standardizations where none exist across clinical domains

Cornerstone’s NLP allows for automated standardization of critical endpoints that may not be standardized within or across clinical trials, such as clinical lab names, units, and values. In addition to supporting industry-standard terminologies, Cornerstone supports custom dictionaries to meet the needs of every dataset.

Validate the consistency of existing standardizations from verbatims

Where fields such as adverse events and concomitant medications have been previously standardized by each trial team, Cornerstone produces an independent prediction and can validate the original standardization or flag terms which have been inaccurately or inconsistently standardized.

Reduce time to harmonized data by weeks with full human traceability

Human-in-the-loop processing with a dedicated user interface allows technical teams to review AI predictions and make additional adjustments if desired. All transformations are tracked in an automated, exportable audit trail.

Clinical Trial Harmonization In Action

SDTM does not require standardization of many tables and fields, leading to trial-specific naming conventions. Cornerstone adds lab, adverse event, and medication data that is consistent and comprehensive, while maintaining the original data for traceability and auditability.

The Problem

Test names, units and results vary significantly across clinical trials, making it impossible to confidently group the same tests and measurements across trials.

The Reason

The FDA only requires AE and CM to be standardized. As a result, lab tests are standardized to suit a particular study’s objective, but rarely standardized in a consistent way across trials.

The Solution

Cornerstone re-evaluates the original test name, specimen (not shown in below tables), unit, and result, and assigns standard test names as well as harmonized units.

Before

Study

LBTEST

LBORRES

LBORRESU

D-Dimer

3310

ng/ml FEU

DDIMER FEU

3.51

ng/ml FEU

Fibrin D-Dimer

1280

ng/ml

D Dimer

1640

ug/L

DDIMER

1.66

ug/mL

fibrin ddimer

1.8

ug/L

The original test names, results, and units vary substantially across trials.

With Cornerstone

Study

LBTEST

LBORRES

LBORRESU

D Dimer

3310

ng/ml FEU

D Dimer

3510

ng/ml FEU

D Dimer

2560

ng/ml FEU

D Dimer

3280

ng/ml FEU

D Dimer

3320

ng/ml FEU

D Dimer

3600

ng/ml FEU

Cornerstone harmonizes test results for consistency & comparability.

Before

Study

LBTEST

LBORRES

LBORRESU

D-Dimer

3310

ng/ml FEU

DDIMER FEU

3.51

ng/ml FEU

Fibrin D-Dimer

1280

ng/ml

D Dimer

1640

ug/L

DDIMER

1.66

ug/mL

fibrin ddimer

1.8

ug/L

The original test names, results, and units vary substantially across trials.

With Cornerstone

Study

LBTEST

LBORRES

D Dimer

3310

ng/ml FEU

D Dimer

3510

ng/ml FEU

D Dimer

2560

ng/ml FEU

D Dimer

3280

ng/ml FEU

D Dimer

3320

ng/ml FEU

D Dimer

3600

ng/ml FEU

Cornerstone harmonizes test results for consistency & comparability.

The Problem

Adverse events and concomitant medications may not have standard terms or codes present in the study data. In the example below, records containing oncology regimen information have not been coded.

The Reason

Coding may be absent due to data format (EDC vs. SDTM), trial status (discontinued or live), and/or because the study team included concepts which require additional processing before being mapped to WHODrug, like in the example below.

The Solution

Cornerstone supports mapping to WHODrug and MedDRA via NLP. In the oncology example below the regimen must first be standardized to the NCI Cancer regimen (see After-CMDETAIL), expanded to a single record per drug, and then standardized to WHODrug.

Before

USUBJID

CMTRT

CMDECOD

ACETAMINOPHEN

3310

ENOXAPARIN

3.51

RCHOP 8 Cycles

1.8

Cancer regimen not in WHODrug and has not been standardized in CMDECOD.

With Cornerstone

USUBJID

CMTRT

CMDETAIL

CMDECOD

ACETAMINOPHEN

Paracetamol

ENOXAPARIN

Enoxaparin

RCHOP 8 CYCLES

R-CHOP

Rituximab

R-CHOP

Cyclophosphamide

R-CHOP

Doxorubicin

R-CHOP

Vincristine

R-CHOP

Predinisone

Cornerstone standardizes ‘RCHOP 8 CYCLES’ to the appropriate term, creates a record per drug in the regimen, and maps the individual drugs to WHO Drug.

Before

USUBJID

CMTRT

CMDECOD

ACETAMINOPHEN

3310

ENOXAPARIN

3.51

RCHOP 8 Cycles

1.8

Cancer regimen not in WHODrug and has not been standardized in CMDECOD.

With Cornerstone

USUBJID

CMTRT

CMDETAIL

CMDECOD

ACETAMINOPHEN

Paracetamol

ENOXAPARIN

Enoxaparin

RCHOP 8 CYCLES

R-CHOP

Rituximab

R-CHOP

Cyclophosphamide

R-CHOP

Doxorubicin

R-CHOP

Vincristine

R-CHOP

Predinisone

Cornerstone standardizes ‘RCHOP 8 CYCLES’ to the appropriate term, creates a record per drug in the regimen, and maps the individual drugs to WHO Drug.

The Problem

It is common for the same or clinically close terms to be coded in different ways in different studies, introducing variability in the pooled data despite consistency within each trial.

The Reason

This variation is due to a combination of the complexity of the terminologies and the fact that trials are managed and coded by separate teams with distinct study aims.

The Solution

Cornerstone supports coding to MedDRA and WHO Drug via NLP to ensure that the source values are systematically and consistently coded across trials.

Before

Study

AETERM

AEDECOD

TRANSAMENITIS

hypertransaminasaemia

TRANSAMENITIS

transaminases increased

TRANSAMENITIS

transaminitis

The term ‘transaminitis’ is a lower-level term, and not a preferred term, and transaminases increases is related but not as precise as hypertransaminasemia.

With Cornerstone

Study

AETERM

AEDECOD

TRANSAMENITIS

hypertransaminasaemia

TRANSAMENITIS

hypertransaminasaemia

TRANSAMENITIS

hypertransaminasaemia

Cornerstone produces consistently and accurate standardized terms.

Before

Study

AETERM

AEDECOD

TRANSAMENITIS

hypertransaminasaemia

TRANSAMENITIS

transaminases increased

TRANSAMENITIS

transaminitis

The term ‘transaminitis’ is a lower-level term, and not a preferred term, and transaminases increases is related but not as precise as hypertransaminasemia.

With Cornerstone

Study

AETERM

AEDECOD

TRANSAMENITIS

hypertransaminasaemia

TRANSAMENITIS

hypertransaminasaemia

TRANSAMENITIS

hypertransaminasaemia

Cornerstone produces consistently and accurate standardized terms.

The Problem

Clinical trial data contains data with spelling errors, abbreviations, and other sources of variation introduced by clinical sites.

The Reason

It is common for electronic data capture of clinical trial data to allow for free-text entry. While fields supporting safety analysis are likely to be standardized, many other free text field are likely to remain unstandardized by the trial team.

The Solution

Cornerstone supports standardization to CDISC terminology as well as the creation of custom terminologies where standards may not be available.

Before

HOTERM

HODECOD

Study drug therapy

Study drug administration

Study therapy

Study cycle 1 administration

progression

progressive disease

disease progression

The original, verbatim terms contain significant variation but the standard SDTM column has not been populated in the study.

With Cornerstone

HOTERM

HODECOD

Study drug therapy

Study treatment administration

Study drug administration

Study treatment administration

Study therapy

Study treatment administration

Study cycle 1 administration

Study treatment administration

progression

Disease progression

progressive disease

Disease progression

disease progression

Disease progression

Cornerstone produces consistently and accurate standardized terms.

Before

HOTERM

HODECOD

Study drug therapy

Study drug administration

Study therapy

Study cycle 1 administration

progression

progressive disease

disease progression

The original, verbatim terms contain significant variation but the standard SDTM column has not been populated in the study.

With Cornerstone

HOTERM

HODECOD

Study drug therapy

Study treatment administration

Study drug administration

Study treatment administration

Study therapy

Study treatment administration

Study cycle 1 administration

Study treatment administration

progression

Disease progression

progressive disease

Disease progression

disease progression

Disease progression

Cornerstone produces consistently and accurate standardized terms.

The Problem

Records can be mislocated in the incorrect domain. In the example below, cytokine test results have been included in the LB domain instead of IS.

The Reason

The SDTM criteria change over time and are open to interpretation by different trial teams to suit the needs of the study.

The Solution

Cornerstone systematically scans domains likely to contain mislocated data, such as LB and CM, to identify which records are mislocated and then moves them to the appropriate domain.

Before

LB Domain

LBTEST

LBCAT

LBORRES

LBORRESU

Platelet Count

HEMATOLOGY

410

10^9/L

Interleukin 10

CYTOKINE

5.4

pg/mL

Glucose

CHEMISTRY

102

mg/dL

Interleukin 2

CYTOKINE

15.2

pg/mL

White Blood Cell Count

HEMATOLOGY

3.2

10^9/L

Interleukin 1 Beta

CYTOKINE

22.1

pg/mL

Hemoglobin

HEMATOLOGY

13.8

g/dL

Interleukin 6

CYTOKINE

48.7

pg/mL

Cytokine results have been mistakenly placed within the LB domain.

With Cornerstone

LB Domain

LBTEST

LBCAT

LBORRES

Glucose

CHEMISTRY

102

mg/dL

Hemoglobin

HEMATOLOGY

13.8

g/dL

White Blood Cell Count

HEMATOLOGY

3.2

10^9/L

Platelet Count

HEMATOLOGY

410

10^9/L

IS Domain

LBTEST

LBCAT

LBORRES

Interleukin 2

CYTOKINE

15.2

pg/mL

Interleukin 6

CYTOKINE

48.7

pg/mL

Interleukin 10

CYTOKINE

5.4

pg/mL

Interleukin 1 Beta

CYTOKINE

22.1

pg/mL

Cornerstone identifies mislocated data, creates the IS domain, and relocates the records to the correct domain.

Before

LB Domain

LBTEST

LBCAT

LBORRES

LBORRESU

Platelet Count

HEMATOLOGY

410

10^9/L

Interleukin 10

CYTOKINE

5.4

pg/mL

Glucose

CHEMISTRY

102

mg/dL

Interleukin 2

CYTOKINE

15.2

pg/mL

White Blood Cell Count

HEMATOLOGY

3.2

10^9/L

Interleukin 1 Beta

CYTOKINE

22.1

pg/mL

Hemoglobin

HEMATOLOGY

13.8

g/dL

Interleukin 6

CYTOKINE

48.7

pg/mL

Cytokine results have been mistakenly placed within the LB domain.

With Cornerstone

LB Domain

LBTEST

LBCAT

LBORRES

Glucose

CHEMISTRY

102

mg/dL

Hemoglobin

HEMATOLOGY

13.8

g/dL

White Blood Cell Count

HEMATOLOGY

3.2

10^9/L

Platelet Count

HEMATOLOGY

410

10^9/L

IS Domain

LBTEST

LBCAT

LBORRES

Interleukin 2

CYTOKINE

15.2

pg/mL

Interleukin 6

CYTOKINE

48.7

pg/mL

Interleukin 10

CYTOKINE

5.4

pg/mL

Interleukin 1 Beta

CYTOKINE

22.1

pg/mL

Cornerstone identifies mislocated data, creates the IS domain, and relocates the records to the correct domain.

Case Study

Customer

Top 10 Pharma company

The Need

The company indicated that an optimal solution would help them save time by reaching high thresholds of standardization and accuracy for automatically assigned mappings across SDTM tables. In addition, they wanted an intuitive user interface (UI) to allow for straightforward expert review of the output.

Impact

Cornerstone AI capabilities were evaluated on three blood cancer clinical trials, completing the pilot in a matter of days.

‘Event’ Tables Dictionaries

Domain

Sponsors' Success Criteria

Results Across 3 Trials

Standardize Fields Using Raw Terms

>75% auto-standardization rate
>90% accuracy rate for auto-standardizations (assessed via comparison to original study standardized terms + manual adjudication)
Intuitive and efficient UI for manual standardization

93-99% auto-standardization rate
98-99% accuracy rate
Custom dictionaries created
Intuitive and efficient UI for manual standardization

Harmonize Numeric Results

Consistent naming, units, values

99% test name consistency across studies
100% accuracy in unit harmonization

Reduce Overall Time to Harmonize Clinical Trials

Complete within ~6 weeks (the company’s internal harmonization benchmark)

Completed in 3 days

The Challenge

‘Event’ Tables Dictionaries

‘Test’ Tables Dictionaries

With Cornerstone, teams can access fully and consistently standardized SDTM, ADaM, or EHR data across multiple study datasets.

Clinical Trial Harmonization In Action

Case Study

Customer

The Need

Impact

Turn disparate trial datasets into aggregated, AI-ready assets today