The Role of the Time Machine in Adaptive Platform Trials
Platform trials enroll multiple experimental arms simultaneously, typically randomized against a common control, with arms entering and leaving the study over the course of months to years. This evolution of the platform over time creates a rich history of past data, both on past experimental arms and on a continuing control. Can we use this past data to make better decisions in the present? If an active arm enrolled in the years 2022-2024, and the trial has been running since 2010, do we make better decisions by looking only at 2022-2024, or at the totality of the evidence going back to 2010?
Though it may not seem comparable at first, this relates to a time-honored question in sports. How do we compare players from different eras, for example Hank Aaron and Babe Ruth? Taking raw numbers (home runs, batting averages, etc.) may be quite misleading, given differences between the eras in which they played. However, while the two never played at the same time, many players overlapped with Hank Aaron earlier in his career. Those players overlapped with still earlier players, and so on, until we reach players who overlapped with Babe Ruth. While we never saw player A compared to B directly, we did see A and C together, then C and D together, then D and B together, and so on. This applies to thousands of players going back in time.
In the 1990s, Scott Berry and colleagues (https://www.tandfonline.com/doi/abs/10.1080/01621459.1999.10474163) proposed a model to formalize these comparisons in sports (hockey, baseball, and golf). This model specified era effects going back in time and used the large number of overlapping players to estimate these era effects and compare players. Essentially, the model gives an estimate of how Hank Aaron would have performed in 1920 (or Babe Ruth in 1970): the player can be moved through time via “the time machine”.
Moving forward to the 2010s, platform trials create a seemingly different but mathematically similar structure. Therapies move in and out of trials like players move in and out of major sports, and we may use their overlapping times to increase the precision of our estimates, under certain assumptions.
Why use a time machine?
The goal behind any use of past data is to increase precision. In a platform trial, patients who were concurrently randomized to control and treatment are clearly relevant for estimating the treatment effect. However, we typically also have control data extending back in time, prior to the activation of the current arm of interest, as well as data on other arms. This information may yield more precise estimates of the control arm, and better decisions (though it carries a risk of bias, discussed below). In many ways the platform situation is easier than the sports example above. Treatments may have different effectiveness over time, but we usually expect those differences to be smaller than the aging process of a major sports player. More importantly, the control is often present throughout the course of the study (imagine a single player staying in baseball throughout the history of the league).
The time machine and related methods use a model to incorporate this past information, increasing the precision of the treatment effect subject to a key assumption – that treatment effects between arms are constant over time. When this assumption is satisfied, substantial gains in precision can be obtained. This allows for quicker and/or better decisions. When the assumption is not satisfied (the assumption may be checked by testing for a time by treatment interaction), biases can occur that negatively impact decision making, in the form of either inflated type 1 error or reduced power.
What does this look like in practice?
When analyzing a platform trial, we divide time into “bins”. Often these bins are defined by calendar time, for example every 3-month period might be a bin. Statistically, a new bin should begin whenever the randomization changes, for example when an arm enters or exits the trial. Thus, if interims occur every 3 months and arms may be stopped or added around those interims, then this is a natural choice for the bins.
We then fit a model of the form:
Outcome = Intercept + Arm effects + Time effects + error
with the details depending on the outcome type and potentially including other covariates as desired. Thus, each time bin has its own effect, indicating how outcomes in that bin might be higher or lower than in other bins. These effects might reflect changes in the underlying disease, for example shifts in the patient population over time.
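As a minimal sketch of what this looks like for a continuous outcome (the data are simulated; the arm labels, bin counts, and effect sizes below are invented for illustration), the model with fully separate time-bin effects can be fit as an ordinary linear model with dummy-coded arm and bin terms:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical platform: control plus arms A and B, all enrolling across
# four time bins, 20 patients per arm per bin (labels and sizes invented).
arms = ["ctrl", "A", "B"]
bins = [0, 1, 2, 3]
records = [(a, t) for t in bins for a in arms for _ in range(20)]

def design_row(arm, t):
    # Dummy coding: intercept, arm effects (vs control), bin effects (vs bin 0)
    return ([1.0]
            + [1.0 if arm == a else 0.0 for a in arms[1:]]
            + [1.0 if t == b else 0.0 for b in bins[1:]])

X = np.array([design_row(a, t) for a, t in records])

# Simulate outcomes: true arm effects plus a drifting time trend
true_arm = {"ctrl": 0.0, "A": 0.5, "B": 0.3}
true_bin = {0: 0.0, 1: 0.2, 2: 0.4, 3: 0.3}
y = np.array([true_arm[a] + true_bin[t] for a, t in records])
y = y + rng.normal(0.0, 1.0, size=len(y))

beta, *_ = np.linalg.lstsq(X, y, rcond=None)
print("estimated effect of arm A vs control:", round(beta[1], 3))
print("estimated effect of arm B vs control:", round(beta[2], 3))
```

Because each bin carries its own effect, the arm estimates are recovered even though outcomes drift over time.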
We might fit this model with fully separate effects in each time bin (called the “time categorical” model in some papers). The model usually called “the time machine” smooths the time effects via a normal dynamic linear model, a model akin to a spline in terms of smoothing, reflecting an assumption that changes over time should occur gradually. The choice of separate versus smoothed estimates is a key user choice. In many situations, there may be little difference in actual performance (the time categorical model is easier to communicate). However, if many bins are used, the time machine smoothing may result in improved performance, like using a dose response model when multiple doses are considered in a dose finding trial.
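The normal dynamic linear model itself is beyond a short sketch, but its smoothing behavior can be roughly mimicked (this is my analogy, not the published model) by penalizing first differences between adjacent bin effects, a random-walk-style penalty implemented here as a ridge-type fit on simulated data:

```python
import numpy as np

rng = np.random.default_rng(1)

# Illustrative data: control plus one arm over 8 bins, 10 patients per
# arm per bin, with a smoothly drifting time trend (all values invented).
n_bins, n_per = 8, 10
true_bin = 0.3 * np.sin(np.arange(n_bins) / 2.0)
rows, y = [], []
for t in range(n_bins):
    for arm in (0, 1):            # 0 = control, 1 = treatment
        for _ in range(n_per):
            rows.append((arm, t))
            y.append(0.4 * arm + true_bin[t] + rng.normal(0.0, 1.0))
y = np.array(y)

# Design: intercept, treatment indicator, bin dummies (bin 0 = reference)
X = np.zeros((len(rows), 2 + n_bins - 1))
for i, (arm, t) in enumerate(rows):
    X[i, 0] = 1.0
    X[i, 1] = float(arm)
    if t > 0:
        X[i, 1 + t] = 1.0

# Rough analogue of the NDLM's smoothing: penalize first differences
# between adjacent bin effects so changes over time occur gradually.
D = np.zeros((n_bins - 2, X.shape[1]))
for j in range(n_bins - 2):
    D[j, 2 + j], D[j, 3 + j] = -1.0, 1.0

lam = 5.0  # smoothing strength; the Bayesian model estimates this from data
beta_smooth = np.linalg.solve(X.T @ X + lam * (D.T @ D), X.T @ y)
beta_cat, *_ = np.linalg.lstsq(X, y, rcond=None)
print("treatment effect, smoothed bins:   ", round(beta_smooth[1], 3))
print("treatment effect, categorical bins:", round(beta_cat[1], 3))
```

With many bins and few patients per bin, the smoothed bin effects are less noisy than the fully separate ones, which is where the time machine can outperform the time categorical model.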
At heart, then, this model is essentially a standard linear model which blocks over time. We can compute the amount of weight given both to concurrent data (time periods when control and the arm of interest are both enrolling) and to data going back in time. As would be hoped, the concurrent data receives the most weight, with past data receiving less and less weight the farther back in time you go. The speed at which these weights decrease depends on the amount of overlap between the arms.
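Because the fit is linear, these weights can be read off directly: the arm-vs-control estimate is c'(X'X)^{-1}X'y for a contrast vector c, so the row vector c'(X'X)^{-1}X' holds the weight each observation receives. A sketch under a hypothetical enrollment pattern (invented for illustration), in which a bridging arm A links the arm of interest E back to earlier control data:

```python
import numpy as np

# Hypothetical pattern: control enrolls in bins 0-3, a bridging arm A in
# bins 0-2, and the arm of interest E only in bins 2-3 (so bins 0-1 hold
# nonconcurrent control data relative to E).
cells = ([("ctrl", t) for t in range(4)]
         + [("A", t) for t in range(3)]
         + [("E", t) for t in (2, 3)])
n_per = 25
rows = [(a, t) for a, t in cells for _ in range(n_per)]

X = np.zeros((len(rows), 1 + 2 + 3))   # intercept, A, E, bins 1-3
for i, (a, t) in enumerate(rows):
    X[i, 0] = 1.0
    if a == "A":
        X[i, 1] = 1.0
    if a == "E":
        X[i, 2] = 1.0
    if t > 0:
        X[i, 2 + t] = 1.0

# Per-observation weights in the E-vs-control estimate
c = np.zeros(X.shape[1])
c[2] = 1.0
w = c @ np.linalg.inv(X.T @ X) @ X.T

for t in range(4):
    idx = [i for i, (a, tb) in enumerate(rows) if a == "ctrl" and tb == t]
    print(f"control data, bin {t}: total |weight| = {np.abs(w[idx]).sum():.3f}")
```

The concurrent control bins (2 and 3) receive by far the most weight, while the nonconcurrent bins receive a smaller, nonzero weight that flows indirectly through the bridging arm A.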
Note the assumption here allows for changes over time but not changes in treatment effects over time (there is no time by treatment interaction in the model). It is perfectly acceptable for outcomes to get better or worse over time, as long as the differences between arms remain constant. This assumption may also be satisfied after including desired covariates in the model, for example covariates which explain differences in the population over time. The prototypical violation of this assumption is infectious disease, where treatments might be differentially effective against variants of a disease, working well for some and poorly for others. Time by treatment interactions are simply very difficult problems to address, with or without modeling. If we truly expect a treatment might be effective in 2023 and 2025, but not 2024 (or simply differentially effective), what is our basis for approval and labelling of a therapy going forward? Meta-analyses make similar assumptions of equal treatment effects (in fact the time machine can be shown to be analogous to a meta-analysis over the individual time bins), and most current protocols refer to “the treatment effect” repeatedly, implicitly assuming it is singular and unchanging.
Quantifying the benefits
The value of the time machine depends on the number of overlapping arms and the sample sizes within those overlaps. As with all statistical methods, larger sample sizes result in increased precision.
Overlap refers to the number of arms that continue enrolling across consecutive time bins. For example, suppose four consecutive time bins contain
(Ctrl, A, B, C), (Ctrl, B, C, D), (Ctrl, B, C, D), (Ctrl, C, D, E)
From time bin to time bin, multiple arms continue. Many comparisons, for example between arms C and D, occur in multiple time bins. If we were analyzing arm E, there are direct comparisons to arms C and D, and one-step indirect comparisons to arms A and B. This is a higher amount of overlap than if the four time bins contained
(Ctrl, A, B, C), (Ctrl, C, D, E), (Ctrl, E, F, G), (Ctrl, G, H, I)
In this second example the trial is almost fully resetting with different active arms in each time bin. The net result is that the weight assigned to past data in the former example, with high overlap, is greater and results in larger efficiency gains.
Depending on the sample sizes and degree of overlap, the time machine can achieve 20-50% effective sample size increases, allowing for smaller trials with similar accuracy (again when the required assumption is satisfied).
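These gains can be made concrete with a small calculation (the enrollment pattern below is the high-overlap example above, with invented cell sizes): compare the variance of the E-vs-control contrast when the model uses all bins versus only the bin(s) in which E enrolled. The variance ratio is the effective sample size multiplier:

```python
import numpy as np

def contrast_var(cells, n_per, arm):
    """Variance (in sigma^2 units) of the arm-vs-control contrast under
    the time-categorical linear model, for a given set of (arm, bin) cells."""
    arms = sorted({a for a, _ in cells if a != "ctrl"})
    bins = sorted({t for _, t in cells})
    rows = [(a, t) for a, t in cells for _ in range(n_per)]
    p = 1 + len(arms) + len(bins) - 1
    X = np.zeros((len(rows), p))
    for i, (a, t) in enumerate(rows):
        X[i, 0] = 1.0
        if a != "ctrl":
            X[i, 1 + arms.index(a)] = 1.0
        if t != bins[0]:
            X[i, len(arms) + bins.index(t)] = 1.0
    c = np.zeros(p)
    c[1 + arms.index(arm)] = 1.0
    return c @ np.linalg.inv(X.T @ X) @ c

# High-overlap pattern from the text; arm E enters only in the last bin.
high = [("ctrl", 0), ("A", 0), ("B", 0), ("C", 0),
        ("ctrl", 1), ("B", 1), ("C", 1), ("D", 1),
        ("ctrl", 2), ("B", 2), ("C", 2), ("D", 2),
        ("ctrl", 3), ("C", 3), ("D", 3), ("E", 3)]
concurrent_only = [cell for cell in high if cell[1] == 3]

v_full = contrast_var(high, 25, "E")
v_conc = contrast_var(concurrent_only, 25, "E")
print(f"variance, full model:        {v_full:.4f}")
print(f"variance, concurrent only:   {v_conc:.4f}")
print(f"effective sample size ratio: {v_conc / v_full:.2f}")
```

With this pattern the ratio comes out around 1.3, i.e. roughly a 30% effective sample size gain for arm E, squarely in the 20-50% range; the low-overlap pattern would yield less.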
What are the risks?
Fundamentally, inferences based on a time machine do not rest on a fully randomized comparison. While the concurrent period is randomized, the past data will contain the control arm but not the treatment arm of interest. Patients were randomized to different arms, typically met the same inclusion/exclusion criteria, and were investigated within the same protocol at the same sites, but the past data do not form a direct randomized comparison. As such, there are risks of bias if the modeling assumption (constant treatment effects) is violated.
When the assumption is violated, we expect estimates of the control arm to be biased. Similarly, when making inferences, we may see either inflated type 1 error or reduced power, depending on the direction of the interaction. This is analogous to the effect of drift in historical borrowing. The magnitude of the bias depends on the size of the interaction. Therapies which work incredibly well in some time periods but are “nulls” in others will produce large biases, while smaller interactions will produce smaller negative effects.
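The bias can be computed exactly, since the expected estimate is just the observation weights applied to the true means. A sketch (same hypothetical enrollment pattern as earlier, with an invented interaction): a bridging arm whose true effect changes over time contaminates the estimate for the arm of interest, even though that arm's own effect is constant.

```python
import numpy as np

# Hypothetical pattern (invented): control in bins 0-3, bridging arm A in
# bins 0-2, arm of interest E in bins 2-3, 30 patients per cell.
cells = ([("ctrl", t) for t in range(4)]
         + [("A", t) for t in range(3)]
         + [("E", t) for t in (2, 3)])
rows = [(a, t) for a, t in cells for _ in range(30)]

X = np.zeros((len(rows), 1 + 2 + 3))   # intercept, A, E, bins 1-3
for i, (a, t) in enumerate(rows):
    X[i, 0] = 1.0
    if a == "A":
        X[i, 1] = 1.0
    if a == "E":
        X[i, 2] = 1.0
    if t > 0:
        X[i, 2 + t] = 1.0

c = np.zeros(X.shape[1])
c[2] = 1.0                              # E-vs-control contrast
w = c @ np.linalg.inv(X.T @ X) @ X.T    # per-observation weights

# True means that VIOLATE the assumption: A's effect drops from 1.0 to 0.0
# over time (a time-by-treatment interaction), while E's true effect is a
# constant 0.3 and outcomes drift upward by 0.1 per bin.
a_eff = {0: 1.0, 1: 1.0, 2: 0.0}
mu = np.array([0.1 * t
               + (a_eff[t] if a == "A" else 0.0)
               + (0.3 if a == "E" else 0.0) for a, t in rows])

print("true effect of E:              0.3")
print("expected estimate under model:", round(w @ mu, 3))
```

The benign drift in outcomes (0.1 per bin) is absorbed by the bin effects and causes no bias; the time-by-treatment interaction on the bridging arm is what biases the estimate away from 0.3.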
Thus, any use of nonconcurrent data should examine the potential for time by treatment interactions and include a “concurrent controls only” analysis as a sensitivity analysis. We would also recommend sponsors consider choosing randomization ratios that provide reasonable power for these sensitivity analyses. In other words, instead of trying to use all the increased efficiency of a platform to reduce sample size, some of that increased efficiency should be used to support robust sensitivity analyses.
Summary
The time machine and similar models offer the potential to use the full dataset in a platform trial to increase precision of estimates and produce better decisions. This efficiency is gained through dividing time into bins and then estimating time effects going back in time. When there are overlapping arms going back in time, precision may be increased by 20-50% depending on the degree of overlap. This efficiency comes with a key assumption. While arms can change over time, the model requires that differences between arms remain constant. If this assumption is violated, biases and degraded inferences can result. This assumption should be monitored, and the time machine should be supplemented with “concurrent only” sensitivity analyses.