Bayesian Historical Borrowing
Technical documentation for incorporating historical control data into current trial design with appropriate discounting. This module implements Power Priors, Commensurate Priors, and Meta-Analytic Predictive (MAP) Priors for external data synthesis.
Contents
1. Overview & Motivation
Historical borrowing leverages data from prior studies to strengthen inference in the current trial. When historical and current populations are similar, borrowing can reduce sample size requirements while maintaining statistical rigor.
Key Benefits
Smaller Trials
Reduce required sample size by 20-40% when historical data is highly relevant
Ethical
Fewer patients randomized to control when effect is well-established
Efficiency
Faster trials with preserved statistical precision
The Exchangeability Assumption
Borrowing is valid only when historical and current populations areexchangeable—meaning they can be treated as samples from the same underlying distribution. Key similarity dimensions:
- Patient Population: Same disease stage, demographics, prior treatments
- Endpoints: Identical definitions and assessment methods
- Standard of Care: Similar background therapies
- Time Period: No temporal drift in outcomes
Critical Warning
Inappropriate borrowing (from dissimilar populations) can inflate Type I error or bias treatment effect estimates. Always use conflict diagnostics and consider discounting when similarity is uncertain.
2. Power Prior Method
The power prior (Ibrahim & Chen, 2000) discounts historical likelihood by raising it to a power :
For Beta-Binomial models with historical data and base prior :
Effective Sample Size
The ESS from the power prior is:
| Discount Factor | Interpretation | When to Use |
|---|---|---|
| Full borrowing (100% weight) | Identical population, same sponsor's prior trial | |
| Skeptical borrowing (50% weight) | Similar population, minor protocol differences | |
| Conservative borrowing (20% weight) | Different indication, mechanism-based only | |
| No borrowing (ignore historical) | Populations clearly different |
Choosing the Discount Factor
The discount factor should be pre-specified in the protocol based on clinical judgment about similarity. A common approach: start with as a “skeptical default” and adjust based on formal similarity assessment.
3. Commensurate Prior Method
The commensurate prior (Hobbs et al., 2011) uses a hierarchical model where a between-source variance parameter controls borrowing strength:
Under the original Hobbs parameterization, small pins close to the historical estimate (strong borrowing), and large lets the current parameter drift freely (weak borrowing).
Zetyra's implementation
For computational efficiency Zetyra exposes a single borrowing-strength input (the API field is called commensurability_parameter) and maps it to a power-prior discount via:
Conventions are reversed relative to the Hobbs : here is a borrowing-strength knob, so larger means more borrowing.
(no borrowing)
(balanced)
(strong borrowing)
(full borrowing)
4. Meta-Analytic Predictive (MAP) Prior
When multiple historical studies are available, the MAP prior (Schmidli et al., 2014) synthesizes them using random-effects meta-analysis:
Where is the pooled effect and captures between-study heterogeneity.
Heterogeneity Assessment (I²)
The calculator reports the I² statistic to quantify heterogeneity:
| I² Range | Interpretation | Recommendation |
|---|---|---|
| 0-25% | Low heterogeneity | Full borrowing appropriate |
| 25-75% | Moderate heterogeneity | Use robust MAP |
| >75% | High heterogeneity | Borrow cautiously |
Robust MAP Component
To protect against prior-data conflict, the robust MAP mixes the informative MAP prior with a vague component:
Where is typically 0.1–0.2 (10–20% vague component).
5. Prior-Data Conflict Diagnostics
Critical: prior-data conflict can inflate Type I error by 208% in composed adaptive designs
When a MAP prior is combined with other adaptive mechanisms (sequential monitoring, sample-size re-estimation, response-adaptive randomization) that share interim information sets, even a 2-percentage-point drift between the historical prior mean and the current control rate — well within typical inter-study variation in oncology (Viele et al. 2014; Tang et al. 2010) — can drive pipeline-level Type I error to 0.0771 (+208% above α=0.025) even when each component individually passes its validation check (Qian 2026, JSM). Component-level calibration guarantees do not transfer under composition; the failure is invisible without pipeline-level evaluation.
Practical implications: (1) the conflict diagnostics on this page are necessary but not sufficient when MAP is combined with adaptive monitoring/SSR/RAR; (2) protocols using historical borrowing alongside other adaptive components should simulate operating characteristics at the full pipeline level under prior-data conflict scenarios (e.g. ±2pp and ±5pp drift from prior mean) and pre-specify a maximum acceptable T1E; (3) the FDA January 2026 draft guidance on Bayesian methodology requires this kind of composed evaluation under realistic perturbations.
The calculator assesses whether current trial data conflicts with the historical prior using a prior predictive check.
Conflict Detection Algorithm
Given current data and effective prior :
- 1.Compute current rate:
- 2.Compute prior mean:
- 3.Compute predictive variance (includes sampling variability)
- 4.Calculate z-score and two-tailed p-value
| P-value | Conflict Level | Action |
|---|---|---|
| > 0.10 | None | Proceed with borrowing |
| 0.01–0.10 | Moderate | Consider reducing discount (δ × 0.5) |
| < 0.01 | Severe | Minimal borrowing (δ ≤ 0.2) or none |
Regulatory Requirement
The FDA guidance recommends pre-specifying how prior-data conflict will be handled in the Statistical Analysis Plan (SAP). Document the conflict detection criteria and fallback procedures.
6. Sample Size Impact
The calculator compares sample size requirements with and without historical borrowing to quantify the efficiency gain.
Comparison Framework
With Borrowing
Use effective prior derived from historical data
Without Borrowing
Use uninformative prior
For each scenario, the calculator finds the minimum achieving target power (80%) and Type I error control (5%).
Sample Size Reduction
Typical reductions range from 15–40% depending on historical data quality and discount factor.
7. Regulatory Considerations
FDA Bayesian Guidance Section V.D.4
“When utilizing external data, sponsors should describe methods for assessing the similarity of external data to trial data, including approaches for adjusting the degree of borrowing if inconsistencies are identified.”
Documentation Requirements
- Historical Data Source: Study ID, publication reference, patient population, endpoints, and quality assessment
- Similarity Justification: Explicit comparison of inclusion/exclusion criteria, endpoints, and standard of care
- Borrowing Method: Power prior, commensurate, or MAP with parameter specifications
- Conflict Handling: Pre-specified criteria and fallback procedures
- Operating Characteristics: Type I error and power under various scenarios
- Sensitivity Analysis: Results under alternative discount factors and prior specifications
8. API Quick Reference
Key Parameters
| Parameter | Type | Description |
|---|---|---|
| method | string | "power_prior" | "commensurate_prior" | "map_prior" |
| historical_events, historical_n | int | Historical study data (power/commensurate) |
| discount_factor | float | Power prior δ ∈ [0, 1] (default: 0.5) |
| studies | array | List of studies for MAP prior (min 2) |
| robust_weight | float | MAP robust component weight (default: 0.1) |
Key Response Fields
effective_prior— Resulting Beta(α, β) parametersess— Effective sample size breakdowncomparison— Sample size with/without borrowingconflict_assessment— Prior-data conflict analysis
9. References
- Ibrahim JG, Chen MH. Power prior distributions for regression models. Statistical Science. 2000;15(1):46-60.
- Schmidli H, et al. Robust meta-analytic-predictive priors in clinical trials with historical control information. Biometrics. 2014;70(4):1023-1032.
- Morita S, Thall PF, Müller P. Determining the effective sample size of a parametric prior. Biometrics. 2008;64(2):595-602.
- Spiegelhalter DJ, Abrams KR, Myles JP. Bayesian Approaches to Clinical Trials and Health-Care Evaluation. Wiley; 2004.
- Berry SM, Carlin BP, Lee JJ, Muller P. Bayesian Adaptive Methods for Clinical Trials. CRC Press; 2010.
- U.S. Food and Drug Administration. Use of Bayesian Methodology in Clinical Trials of Drug and Biological Products: Draft Guidance for Industry. January 12, 2026.
- European Medicines Agency. Guideline on Adjustment for Baseline Covariates in Clinical Trials. EMA/CHMP/295050/2013. February 2015.
Last updated: May 2026
Related Documentation
Prior Elicitation
Define interpretable Bayesian priors with ESS and credible bounds.
Bayesian Sample Size
Single-arm sample size determination with operating characteristics.
Two-Arm Bayesian Design
Randomized treatment vs. control with superiority or non-inferiority testing.
Bayesian Predictive Power
Estimate trial success probability given interim data and a prior.
Ready to incorporate historical data?
Use our Bayesian Borrowing Calculator for power-prior, commensurate-prior, and MAP approaches with ESS-based diagnostics — potential 20-40% sample-size savings.
Open Bayesian Borrowing Calculator