Alpha Allocation in Adaptive Clinical Trials: Misconceptions and Scientific Consequences
Technical Foundations: Defining Alpha
In phase 3, pivotal, or adequate and well-controlled superiority trials, the role of type I error is standard: a one-sided allocation of 2.5%. This threshold means the trial has a 2.5% chance of concluding the treatment is superior when it is in fact equal to the control arm. We refer to the type I error threshold, against which a p-value is typically compared, by the Greek letter alpha. A standard fixed sample size trial conducts the primary hypothesis test at a single time point. The structure is simple: if the calculated p-value falls below the alpha threshold of 0.025, superiority is claimed. The probability of this occurring under the null hypothesis, that the treatment and placebo are in fact equal, is the defined type I error of 2.5%.
When trials employ adaptive sample size approaches, with multiple time points evaluating superiority, the alpha level at each time point is adjusted so that the probability of concluding superiority at any time point in the trial is limited to 2.5%. To maintain an overall trial-level 2.5% type I error, each individual test uses an adjusted nominal alpha level. The rule for this adjustment across the multiple time points is commonly referred to as the alpha-spending function.
A source of widespread confusion is the entrenched belief that introducing interim analyses “costs” alpha; that is, the assumption that interim adaptations erode the available alpha and require the sponsor to “pay a penalty.” This notion also feeds the myth that the mere act of “looking at data” at an interim analysis is bad and costs alpha. Scott Berry, in this week's episode of "In the interim…", is explicit in rejecting this characterization: “You haven’t lost anything.” The word “penalty” predominates in the literature and in industry dialogue, but, as Berry argues, it is semantically pejorative and functionally misleading.
Interim Analyses and Alpha: The Statistical Reality
Group sequential adaptive sample size designs often introduce interim analyses at pre-specified enrollment counts, such as after 200 and 300 patients in a trial with a planned maximum sample size of 400 patients. These provide opportunities to assess superiority earlier than the maximum sample size. If superiority analyses are specified at these interim analyses, the technical approach utilizes so-called spending functions or boundaries, which control the overall type I error by assigning explicit, lower, nominal alpha values to each analysis.
In the above illustration, using O’Brien-Fleming group sequential boundaries, the nominal alpha-level at each analysis time point would be:
● Interim at 200 patients; nominal alpha = 0.0031
● Interim at 300 patients; nominal alpha = 0.0092
● Final at 400 patients; nominal alpha = 0.0213
These nominal values are each less than 0.025; they represent the threshold at each look for stopping. The overall type I error for the trial remains 2.5%. The alpha level at the final analysis is 0.0213, less than the 0.025 that would apply with no interim analyses. This smaller nominal alpha at the 400-patient analysis fuels the misconception that the final analysis is “penalized”; Berry states, “You’ve just allocated it over the three analyses. You haven’t lost anything.” Spreading alpha over multiple analyses neither diminishes the overall error rate allowance nor requires any form of compensation beyond careful allocation.
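To make this concrete, here is a minimal Monte Carlo sketch (an illustration, not from the podcast), assuming normally distributed outcomes with unit variance and a one-sided two-sample z-test at each look. It checks that, under the null, the probability of crossing any of the three nominal boundaries above stays near the planned overall 2.5%:

```python
import numpy as np
from statistics import NormalDist

rng = np.random.default_rng(2024)
n_sims = 200_000
looks = [100, 150, 200]                    # patients per arm at 200, 300, 400 total
alphas = [0.0031, 0.0092, 0.0213]          # nominal one-sided alphas from the text
z_bounds = [NormalDist().inv_cdf(1 - a) for a in alphas]

# Simulate per-arm stage sums under the null (no treatment effect),
# then accumulate them into the running totals seen at each look.
stage = np.diff([0] + looks)               # patients added per arm at each stage
trt = rng.normal(0.0, np.sqrt(stage), (n_sims, 3)).cumsum(axis=1)
ctl = rng.normal(0.0, np.sqrt(stage), (n_sims, 3)).cumsum(axis=1)

rejected = np.zeros(n_sims, dtype=bool)
for k in range(3):
    # z statistic at look k: mean difference over its standard error
    z = (trt[:, k] - ctl[:, k]) / looks[k] / np.sqrt(2 / looks[k])
    rejected |= z > z_bounds[k]            # superiority claimed at this look

print(f"overall type I error: {rejected.mean():.4f}")
```

The union of the three crossing events, accounting for the correlation between looks, lands near 0.025: the alpha is allocated across the analyses, not lost.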
Historically, group sequential designs like the one presented here were the only type of adaptive design. In these designs, superiority actions were specified at each of the analyses, so every interim analysis conducted required an alpha allocation. This history leads to the misconception that doing an interim analysis, that is, looking at data, itself causes the need for alpha allocation. This further leads to the idea that any type of interim, even one without a superiority analysis, causes alpha allocation. The natural conclusion is that looking at data must be penalized and is bad. The adherence to the “penalty” vernacular has stifled efficient adaptive clinical trial designs.
Consequence for Power and Sample Size: Quantitative Examples
Consider the example above from the podcast:
● A fixed sample 400-patient trial (single final look) is powered at 85% to detect an effect size of 0.3.
● A group sequential design with interim analyses (using the boundaries above) reduces power from 85% to 84% but reduces the mean sample size from 400 to 313.
This small power reduction is not the result of alpha reduction from interim analyses per se, but from distributing decision-making across smaller, potentially less informative sample sizes earlier in the accrual process. Berry clarifies, “You haven’t lost any alpha. You don’t lose power because you pay a penalty in alpha. It’s because you distributed some of that alpha to smaller sample sizes.”
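A Monte Carlo sketch (mine, not from the podcast) can reproduce both numbers, assuming normal outcomes with unit variance, a standardized effect of 0.3, and a one-sided two-sample z-test. The fixed design's 85% power follows analytically, and the group sequential version distributes alpha across the three looks:

```python
import numpy as np
from statistics import NormalDist

rng = np.random.default_rng(7)
n_sims = 200_000
looks = [100, 150, 200]                    # per-arm sizes at 200/300/400 total
z_bounds = [NormalDist().inv_cdf(1 - a) for a in (0.0031, 0.0092, 0.0213)]
effect = 0.3

# Fixed 400-patient design: power = P(Z > z_0.025 - effect / sqrt(2/200))
fixed_power = 1 - NormalDist().cdf(
    NormalDist().inv_cdf(0.975) - effect / (2 / 200) ** 0.5)

# Per-arm stage sums under the alternative; treatment mean shifted by the effect.
stage = np.diff([0] + looks)
trt = rng.normal(effect * stage, np.sqrt(stage), (n_sims, 3)).cumsum(axis=1)
ctl = rng.normal(0.0, np.sqrt(stage), (n_sims, 3)).cumsum(axis=1)

win = np.zeros(n_sims, dtype=bool)
n_enrolled = np.full(n_sims, 400)
for k in (2, 1, 0):                        # last-to-first so the earliest stop wins
    z = (trt[:, k] - ctl[:, k]) / looks[k] / np.sqrt(2 / looks[k])
    crossed = z > z_bounds[k]
    win |= crossed
    n_enrolled[crossed] = 2 * looks[k]     # enrollment halts at the crossing look

print(f"fixed power: {fixed_power:.3f}")
print(f"group sequential power: {win.mean():.3f}, mean N: {n_enrolled.mean():.0f}")
```

Power drops only slightly (about 85% to 84%) while the mean sample size falls to roughly 313, matching the figures quoted above.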
Expanding the design above to allow for up to 500 patients, with appropriate reallocation of alpha, can have positive effects. Taking O’Brien-Fleming boundaries at 200, 300, 400, and a final potential sample size of 500 leads to:
● Power increases from 85% to 91%
● The average sample size is 368, still below the 400 patients of the original fixed design
Berry points out that this flexibility results in both higher power and a reduction in the average number of enrolled patients; “By allowing flexibility, my average sample size is smaller than 400. My power is greater, going from 85% to 91%.” These are the operational gains of adaptive design—the “penalty” is a misapplied term. The fear of spending alpha may push one to run the fixed sample size 400-patient trial, when the ability to distribute alpha can create more powerful, more efficient trial designs.
Action versus Observation: When is Alpha Adjustment Required?
Simply viewing interim data does not consume alpha or demand adjustment. It is not inspection of the data, but potential trial actions at each look that may require type I error adjustment. Futility analyses conducted at interim timepoints—where there is no possibility of early declaration of superiority—do not require alpha adjustment. Berry states, “You could do 100 interims for futility in your trial and your final analysis used an alpha of 0.025... because no action we took during the trial increased the probability of making a type I error.”
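A small simulation illustrates the point (an illustrative sketch with an assumed futility rule, not the podcast's design): with futility-only looks and no early superiority claims, the final analysis keeps the full 0.025, and the overall type I error cannot exceed it, because futility stopping can only remove trials that might otherwise have rejected:

```python
import numpy as np

rng = np.random.default_rng(11)
n_sims = 200_000
looks = [100, 150, 200]        # per-arm sizes; futility-only looks at the first two
stage = np.diff([0] + looks)

# Null hypothesis: no treatment effect in either arm.
trt = rng.normal(0.0, np.sqrt(stage), (n_sims, 3)).cumsum(axis=1)
ctl = rng.normal(0.0, np.sqrt(stage), (n_sims, 3)).cumsum(axis=1)
z = [(trt[:, k] - ctl[:, k]) / looks[k] / np.sqrt(2 / looks[k]) for k in range(3)]

# Assumed futility rule: stop (with no superiority claim) if an interim z < 0.
survives = (z[0] >= 0) & (z[1] >= 0)
type1 = survives & (z[2] > 1.95996)        # final look still uses the full 0.025

print(f"type I error with futility-only interims: {type1.mean():.4f}")
```

The simulated rate sits at or below 2.5%: the interim looks took no action that could increase the probability of a false superiority claim, so no alpha adjustment is required.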
Berry discusses the SEPSIS-ACT trial as an example; in a design with more than 20 planned interim analyses, with no superiority stops, the trial maintained a final nominal alpha of 0.025 and was approved under a Special Protocol Assessment after strict regulatory scrutiny. At the interim analyses response adaptive randomization and a potential shift to phase 3 could occur – and these interims required no alpha adjustment.
Adjustment becomes necessary when interim analyses create pathways that can increase type I error probabilities. One example is making a dose selection in a seamless phase 2/3 trial and carrying that data through to the final analysis of the phase 3 portion. Suppose a phase 2 trial enrolls 30:30:30 to placebo and two experimental doses. The best dose is selected, the trial then enrolls 90 per arm on the selected dose and placebo, and the phase 2 data (30 on the selected dose and 30 on placebo) are included in the phase 3 analysis. In such a case, as detailed in the podcast, because data from the selected phase 2 dose and placebo are carried forward and pooled with subsequent phase 3 accrual, the final alpha is adjusted downward (e.g., to 0.01693) to maintain proper type I error control. This is not bad: the power is higher by including the phase 2 data and adjusting alpha than in the stand-alone 90-versus-90 portion of the trial using an unadjusted 0.025. Again, it is an example where allocating alpha to allow inclusion of phase 2 data improves power. It is a good thing to do, certainly not a penalty.
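The inflation, and the fix, can be checked by simulation. The sketch below is illustrative, assuming normal outcomes with unit variance and selection of the dose with the higher phase 2 mean (the selection mechanics are my assumption, not a detail from the podcast); under the null, the unadjusted 0.025 threshold rejects too often, while the adjusted 0.01693 threshold restores a rate near 2.5%:

```python
import numpy as np
from statistics import NormalDist

rng = np.random.default_rng(3)
n_sims = 200_000

# Phase 2 under the null: arm means from 30 patients per arm, outcomes N(0, 1).
pbo2 = rng.normal(0.0, 1 / np.sqrt(30), n_sims)
doseA = rng.normal(0.0, 1 / np.sqrt(30), n_sims)
doseB = rng.normal(0.0, 1 / np.sqrt(30), n_sims)
sel2 = np.maximum(doseA, doseB)            # carry the better-looking dose forward

# Phase 3: 90 more per arm on the selected dose and placebo; pool all 120 per arm.
trt3 = rng.normal(0.0, 1 / np.sqrt(90), n_sims)
pbo3 = rng.normal(0.0, 1 / np.sqrt(90), n_sims)
z = ((30 * sel2 + 90 * trt3) / 120
     - (30 * pbo2 + 90 * pbo3) / 120) / np.sqrt(2 / 120)

naive = (z > NormalDist().inv_cdf(0.975)).mean()        # unadjusted alpha = 0.025
adjusted = (z > NormalDist().inv_cdf(1 - 0.01693)).mean()
print(f"unadjusted 0.025 threshold: {naive:.4f} (inflated)")
print(f"adjusted 0.01693 threshold: {adjusted:.4f}")
```

Selecting the best of two doses biases the pooled estimate upward under the null, which is why the final alpha must be allocated downward; the adjusted threshold brings the error rate back to about 2.5%.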
Conclusion
Alpha allocation in adaptive designs is not a penalty; as Berry emphasizes, “You haven’t lost anything.” Observing data at interim analyses, without the possibility of making a claim of superiority, does not consume or require adjustment of alpha. Efficient trial designs benefit from the use, not the avoidance, of pre-specified interim analyses. Embracing well-designed, pre-specified adaptive designs, with careful allocation of the allotted alpha, is essential to advancing efficient clinical trials.