Blog

May 30, 2025

No items found.

Regression-to-the-Mean: Insights for Drug Developers from the Sports World

No items found.

Regression-to-the-mean offers critical insights for drug development, like sports statistics, emphasizing the importance of understanding data variability across multiple levels of inference.

Understanding Regression-to-the-Mean

The concept of regression-to-the-mean, often misunderstood, is crucial for both sports enthusiasts and drug developers. This phenomenon explains why extreme performances or conditions often return to a more typical state over time. In sports, for example, a player early in a season performing extraordinarily high or low rarely maintains that level throughout a season. While this outcome may be familiar, why it happens may be misunderstood. Frequently in sports reasons are attached to why a player does not continue the extreme performance (pressure, media scrutiny, etc.). The belief in these external myths, rather than an understanding of regression-to-the-mean, creates poor decision making in sports. The same scenario and logic apply to clinical trials, where equally misunderstood attribution of the why creates poor decision making and has massive negative effects in the drug development world.

Regression-to-the-mean can be observed in various sports scenarios. In baseball, for instance, players who start a season with exceptional stats often see their performance normalize as the season progresses. This was famously seen with Rod Carew’s and George Foster's 1977 baseball season, Foster projected to break the single season home run record at mid-season, and Rod Carew on a pace to hit the magical 0.400 batting average, finished below mid-season projections due to this statistical phenomenon. Similarly, in golf, a player's performance can vary significantly from one day to the next, purely due to statistical fluctuations rather than actual changes in skill. Understanding the performance of a player in the context of the population of players is critical to projecting the future performance. By modeling the population of players as well as the random outcomes of the player creates a recognition that players in the tails of the distribution are rare, by definition – it is more likely random outcomes that place the player the tail then in truth the player’s true skill is in the tail of the population of players. Hence any estimation of the true skill of the player will be “shrunk” to the middle of the distribution of players. Recognizing these patterns and adjusting for them appropriately greatly improves predictions, estimation, and prevents the creation of mythical attribution.

Parallel Lessons for Clinical Trials

In drug development, regression-to-the-mean is equally prevalent. Clinical trial participants often join studies at a low point in their health journey. This results in a natural improvement over the course of the trial, which can occur without any intervention. This improvement, while real, is often misattributed to the "placebo effect," when it is frequently a manifestation of regression-to-the-mean. In trial design and data analysis, it becomes critical to distinguish genuine therapeutic outcomes from these natural statistical shifts. Think of the number of clinical trials where the investigators are shocked by the size of the placebo effect. When the trial design – inclusion and exclusion criteria, focuses more and more on the tail outcomes at baseline in the trial, this will create a regression-to-the-mean for these patients – without any intervention, sham, or placebo. By believing the effect is one of a “placebo” leads investigators to make trial decisions to address potential placebo effects, when the reason for the improvement may largely be regression-to-the-mean and not a psychological placebo effect.

Drug developers must apply this understanding to avoid attributing real-world decisions to this statistical phenomenon. Take an example a single-arm trial where all patients receive the experimental intervention and there is strong improvement in the patients – is it the intervention or is this regression-to-the-mean? This understanding ensures that decision-making is grounded in a realistic expectation of outcomes over time, thus preventing costly missteps driven by misinterpreted data.

Hierarchies in Sports and Trials

The hierarchical nature of data – individual outcomes and the broader population of players, patients, subgroups, endpoints, etc. – is a key consideration when predicting future performances, whether for athletes or medical treatments. In sports, the variability between and within players influences projections. Similarly, hierarchical models in clinical trials account for variability among patient populations and responses.

In the 2017 US Open, golfers who performed poorly on day one typically improved on day two, while those who initially excelled saw declines—an illustration of regression-to-the-mean. In clinical settings, understanding these statistical principles aids in differentiating between genuine effects of a treatment and natural variability. Think about a trial with 5 subpopulations of patients – and the results in one of the subsets looks very promising. If one estimates the effect in the best performing subset only from the data in that subset they are at huge risk when they make development decisions assuming that effect is real. There are very powerful approaches to creating estimates of one unit from the results across all units that account for the regression-to-the-mean effect. Bayesian hierarchical models or frequentists random effects models allow researchers to integrate variability across the hierarchical structure, providing a more accurate estimate of treatment efficacy. The challenge is in understanding and recognizing the need for these models.

For drug development, hierarchical models can be particularly useful. They allow for between-subject differences and improve prediction accuracy in clinical trials. These models consider individual patient variability as well as overall population variability, leading to a more nuanced understanding of how a treatment might perform across diverse groups. This approach enhances decision-making, leading to more effective trial designs and patient outcomes.

Implications for Decision-Making

Understanding regression-to-the-mean is crucial not only for evaluating trial results but also for strategic decisions based on these results. Misinterpretation of isolated outcomes can result in significant downstream effects, including poor go/no-go decisions, poorly designed trials, financial loss, and opportunity costs. Mis-estimating a treatment's efficacy based on isolated results leads to inflated projections and misguided investments.

As an example, in oncology, basket trials often show considerable variability in outcomes across patient subgroups. Recognizing regression-to-the-mean provides critical context in assessing which results hold promise and what are the best estimates in the subgroups. There are many other ways this effect is manifested in drug development – the results across 10 different endpoints – how should one estimate the effect in the best or worst results across those endpoints? What if 5 different doses are used in a phase 2 trial – should one estimate the effect of one dose when ignoring the result across the other doses? When a sponsor runs 20 phase 2 trials – should they estimate the effect on the best of those trials without putting it in context of all 20 trials?

Real-World Applications and Industry Perception

The pharmaceutical industry's success relies heavily on understanding regression-to-the-mean. When preparing for regulatory submissions or deciding on continuing, modifying, or halting development, companies must factor in this statistical reasoning. It’s not natural for scientists to incorporate the results from group A to better estimate group B, but it’s critical to good decision making.

The sports analogy serves well in communicating the subtleties of these statistical phenomenon. Ideas that seem common sense in sports, seem initially awkward to drug developers. Presenting complex clinical data through familiar paradigms aids in broader comprehension among executives, stakeholders, and regulators.

Focus on Genuine Advancements

Ultimately, regression-to-the-mean underscores the importance of statistical reasoning in interpreting data. For drug developers, leveraging these insights can refine clinical trial designs and improve decision-making. By learning from parallels in sports, they can better navigate the complexities of interpreting clinical trial results. The regression-to-the-mean phenomenon may the single most important statistical concept for drug developers.

Download PDF

View

Back to All Blogs

Other Blogs

View All Blogs

Debunking the Interim Analysis Penalty Myth in Adaptive Clinical Trials

Not all interim analyses in clinical trials require alpha adjustment; penalties depend entirely on the pre-specified adaptive action, not the simple act of reviewing data.

July 31, 2026

The Revised FDA Draft Guidance on Master Protocols

In today's blog, Dr. Kert Viele breaks down the most crucial updates of the draft guidance on "Master Protocols for Drug and Biological Product Development," specifically focusing on the new details surrounding basket trials, control group requirements, and the heavy integration of the FDA's draft Bayesian guidance.

July 1, 2026

Response Adaptive Randomization

In this week's blog, Dr. Kert Viele discusses his thoughts on the RAR literature and the current state of RAR in practical clinical trials. He focuses on general principles that have emerged in the literature, the nuts and bolts of constructing and tuning RAR designs, and operational issues.

June 19, 2026

Fighting Time in Adaptive Clinical Trials with Longitudinal and Predictive Modeling

Longitudinal modeling and predictive modeling are the foundation of learning efficiently in adaptive clinical trials, allowing real-time learning from early data. Bayesian models enable faster, more informed decisions even when primary endpoints are distant.

June 5, 2026

How a Multi-Platform Randomized Clinical Trial Impacted the COVID-19 Pandemic

REMAP-CAP, ATTACC, and ACTIV-4a unified under a multi-platform randomized clinical trial with a joint Bayesian adaptive design, rapidly determining that therapeutic anticoagulation benefits moderate, but not severe, COVID-19 patients.

May 22, 2026

Regulatory Guidance, Adaptive Trials, and the Misconception of Efficiency

ICH-E20’s regulatory caution towards adaptive designs is often misapplied, resulting in inefficient or unrealistic alternatives for sponsors and patients. Operational casework demonstrates that so-called “complexity” in adaptive design is frequently misunderstood and that regulatory “false choices” undermine trial effectiveness.

April 24, 2026

Enhancing Phase 3 Trials Through Bayesian Borrowing

Bayesian borrowing in Phase 3 trials formally combines prior evidence with the new trial data to enhance development efficiency and regulatory decision making. This approach requires rigorous statistical modeling, careful selection of historical information, and detailed regulatory dialogue.

March 13, 2026

Technical Realities of Ordinal Endpoint Analysis in Clinical Trials

A rigorous review of ordinal endpoint analyses, showing every approach—utility weighting, proportional odds, dichotomization, or non-parametric—inevitably assigns relative weights to outcome states. Berry Consultants’ mathematical demonstration reveals how proportional odds analysis embeds prevalence-based weights, underscoring the need for transparency and clinical input in trial design.

February 27, 2026

Guide to the Draft FDA Bayesian Guidance 2026

The FDA Draft Bayesian guidance is a dramatic leap forward for Bayesian clinical trials and regulatory science. In this blog, Dr. Kert Viele outlines the key concepts and motivations for the guidance recommendations.

January 30, 2026

The Rumored Shift to a One-Trial Standard for FDA Substantial Evidence

In recent public discussion, FDA leaders have indicated a possible shift from the longstanding two-trial requirement for substantial evidence of drug efficacy to acceptance of a single, highly stringent trial. Dr. Scott Berry and Dr. Kert Viele, on the "In the Interim..." podcast, analyze the statistical, regulatory, and scientific implications and highlight those in this blog.

January 2, 2026

Administrative Analyses for Funding Decisions in Adaptive Clinical Trials

Seamless adaptive trials can deliver higher statistical power with fewer patients and shorter timelines, yet practical funding hurdles persist. Objective administrative (financial) analyses—often Bayesian-driven—can define funding triggers without compromising trial integrity.

December 12, 2025

Digital Googols and the Future of Clinical Learning

Digital twins in clinical research generate discussion and controversy, but current use is limited by lack of rich data sets. The potential is great for modeling counterfactual outcomes in clinical research, and we will get there.

October 31, 2025

Navigating the Moving Standards and Scrutiny of Novel Trial Design

Novel clinical trial designs are often subject to heightened scrutiny for statistical risks that persist in standard methods, revealing inconsistencies in regulatory and scientific expectations. When evaluation of a novel design is done there are hurdles or criticism of the novel approach that already exist with the standard approach and many times are higher risk than in the novel approach.

October 17, 2025

Berry Consultants Provides Comments on the Draft ICH E20 Harmonised Guideline

With the ICH E20 Draft Guideline currently open for comments, Berry Consultants shares its current thoughts, general comments, and specific suggestions of the document in this blog.

October 15, 2025

Promising Zone Adaptive Designs in Phase III Trials

Promising zone adaptive sample size designs appear compelling in theory, but all simulated and practical evidence demonstrates that group sequential trials outperform these methods in terms of efficiency and power for confirmatory trials.

October 3, 2025

Clinical Trial Simulation and the Art of Adaptive Design Optimization

Clinical trial simulation is the core engine for creating adaptive designs. This approach enables careful performance evaluation and iterative improvement of trial designs before a single patient enrolls. The result: more efficient, mathematically rigorous, and stakeholder-aligned clinical trials.

September 6, 2025

A Bayesian Framework for Modern Trial Design

Bayesian statistics enables efficient, inclusive, and compliant clinical trial designs by rigorously updating evidence, supporting adaptation, and enabling comprehensive analysis across complex data landscapes.

August 9, 2025

The Role of the Time Machine in Adaptive Platform Trials

The “time machine” enables rigorous, unbiased comparisons in adaptive platform trials by modeling era effects and overlapping treatments, improving resource allocation while ensuring accurate estimation.

August 1, 2025

ICH E20 Reactions: Group Sequential Designs

With the ICH E20 draft guidance now released and entering the public comment stage, Dr. Kert Viele of Berry Consultants begins a new blog series on this topic. Dr. Viele provides explainers for the designs and principles under discussion, and some initial reactions to the draft ICH E20 related to Group Sequential Designs in this blog.

July 11, 2025

Goldilocks Designs – If Bayesians had conceived Group Sequential Designs

A Goldilocks trial design is an adaptive clinical trial methodology developed to optimize the sample size dynamically during the course of a trial. Its name references the "just right" principle from the Goldilocks fairy tale—neither too large nor too small. Goldilocks designs seek balance between flexibility and efficiency.

June 16, 2025

Alpha Allocation in Adaptive Clinical Trials: Misconceptions and Scientific Consequences

A source of widespread confusion is the entrenched belief that introducing interim analyses “costs” alpha; that is, the assumption that interim adaptations erode the available alpha and require the sponsor to “pay a penalty.” This notion also leads to the myth that just the action of “looking at data” at an interim analysis is bad and costs alpha.

June 13, 2025

Regression-to-the-Mean: Insights for Drug Developers from the Sports World

Regression-to-the-mean offers critical insights for drug development, like sports statistics, emphasizing the importance of understanding data variability across multiple levels of inference.

May 30, 2025

Navigating the Complex Role of DSMBs in Adaptive Clinical Trials

Understanding DSMBs is pivotal in ensuring trial integrity, safety, and success.

May 23, 2025

Implementing Adaptive Trials: A Comprehensive Exploration

A discussion on the intricacies of implementing adaptive clinical trials, their operational processes, and how Berry ensures timely execution.

May 9, 2025

Revisiting a Seamless 2/3 Trial: The Amazing Journey of a GLP-1 Agonist

Explore the intricacies of the AWARD-5 trial for Eli Lilly's dulaglutide, from complex trial design to the transformation of pharmaceutical development timelines.

May 2, 2025

The Role of Innovative Trial Designs in Transforming Clinical Research

Explore how adaptive platform trials like I-SPY2 and GBM AGILE are revolutionizing clinical research to accelerate drug development.

April 25, 2025

Longitudinal Modeling in Clinical Trial Design: Methodological Advantages & Challenges

Longitudinal models can improve the efficiency in clinical trial decision making. So are we taking full advantage of this opportunity? This blog discusses methodological advantages and challenges.

April 21, 2025

Integrating External Data in Clinical Trials

Exploring the use of external data in clinical trials and its implications for clinical trial design and analyses.

April 18, 2025

The Art and Slog of Innovation in Clinical Trials

Innovation in drug development requires perseverance, strategic thinking, and a shift in traditional practices to innovate clinical trials and improve drug development.

April 4, 2025

Navigating Controversy: Ordinal Outcomes in Clinical Trials

We explore the history and modern complexities of ordinal outcomes in clinical trials, discussing their importance and the contentious debates surrounding their analysis.

March 28, 2025

The HEALEY ALS Platform Trial: Revolutionizing Clinical Trials

An in-depth exploration of the HEALEY ALS Platform Trial's innovative design and impact on clinical trials.

March 21, 2025

New Release of FACTS Enhancing Trial Simulation

Discover how the latest release of FACTS enhances clinical trial simulations with greater complexity, flexibility, and usability.

March 14, 2025

When Should You Use Adaptive Design Clinical Trials?

Adaptive design clinical trials offer flexibility, efficiency, and improved outcomes in medical research, but when should you explore their usage?

March 12, 2025

Precision Promise Adaptive Platform Trial update

The Precision Promise platform trial is an adaptive study exploring multiple potential therapies for pancreatic cancer, with pamrevlumab recently advancing to the next stage after demonstrating a predictive probability of at least 35% for improved overall survival, highlighting the trial's innovative approach to efficiently identify effective treatments in a field with limited options.

January 26, 2024

Comments on the draft FDA master protocol guidance

Kert Viele's blog discusses the FDA's draft guidance on master protocols for drug and biological product development, highlighting key sections on trial design, randomization, control groups, informed consent, and regulatory considerations, while encouraging feedback from experts to enhance the guidance's effectiveness before the comment deadline of February 22.

January 11, 2024

Is early stopping biased? Maybe, maybe not….

The blog by Kert Viele discusses the potential biases in clinical trial results, particularly focusing on early stopping trials and the implications of only publishing successful outcomes, emphasizing that while biases exist, their significance varies based on the true response rate and the context of the trial.

November 7, 2023

Time Trends in Clinical Trials (related to 2023 ASA Biopharm panel)

On September 28, 2023, Kert Viele will moderate a panel at the ASA Biopharmaceutical Section Regulatory-Industry Statistics Workshop, discussing time trends in ongoing platform trials using real interim analyses from the PRINCIPLE and REMAP-CAP trials, focusing on their impact on clinical trial analyses, modeling adjustments, and the complexities of additive versus interactive time trends.

September 27, 2023

Prior Practicum: Interpretable Priors for CRM Designs

Joe Marion's blog discusses the challenges of designing phase I dose-finding studies in oncology using the Continual Reassessment Method (CRM) and Bayesian approaches, emphasizing the importance of selecting appropriate prior distributions to balance patient safety and effective dose escalation, while suggesting that re-parameterizing models can simplify the design process.

October 14, 2023

If Bayesian inference doesn’t depend on the experimental design, then why does “Bayesian optimal design” exist?

In his blog, Kert Viele discusses the importance of trial design in Bayesian analysis, emphasizing that while conclusions drawn from completed experiments remain consistent regardless of interim analyses, the design of the trial significantly impacts expected utilities, and optimal designs can enhance trial performance.

September 14, 2023

The use of synthetic or external data in clinical trials

The blog by Kert Viele discusses the tradeoffs of using external or synthetic data in clinical trials, highlighting how aggressive use can save patient resources but risks scientific robustness, and emphasizes the importance of understanding the agreement between synthetic and actual trial data to optimize inferential performance while minimizing patient enrollment.

August 28, 2023

HOW TO GET CONTROL? CONCURRENT VS CONTEMPORARY VS HISTORICAL VS SYNTHETIC CONTROLS

The discussion highlights the growing role of real-world evidence in clinical trials, particularly as a potential substitute for control arms, while emphasizing the need to address biases associated with various control methods and advocating for a future dominated by platform trials that balance cost savings with reduced bias risks.

February 27, 2019

WHEN SHOULD YOU BORROW HISTORICAL DATA (OR REAL-WORLD EVIDENCE)?

Kert Viele discusses the concept of historical borrowing in clinical trials, highlighting its potential benefits and risks, particularly in relation to FDA guidance and the importance of assessing "drift" to determine when it is appropriate to utilize historical control data for improving trial efficiency and accuracy.

November 8, 2019

IMPROVING PROGRAM RESULTS THROUGH BETTER PHASE 1 AND 2 TRIALS

Kert Viele discusses the challenges and probabilities of success in a drug development program, highlighting that a standard approach often leads to a high rate of failure due to poor dose selection in early trials, but suggests that a revised strategy of continuous patient allocation and dose escalation can significantly improve the chances of successfully bringing an effective therapy to market.

November 15, 2019

HYPOTHESIS TESTING, CLINICALLY IMPORTANT EFFECTS, AND DO WE PAY TOO MUCH FOR CLINICAL TRIAL INSURANCE?

Highly powered clinical trials are costly and often yield statistically significant but clinically meaningless results due to large sample sizes designed to mitigate random errors, suggesting the need for alternative approaches like flexible sample sizes and group sequential designs to optimize resource use and improve trial efficiency.

December 7, 2019

DESIGNING A COLLECTION OF TRIALS

The article emphasizes the importance of optimizing clinical trial designs by investigating multiple therapies simultaneously and utilizing strategies like Bayesian thinking and platform trials to significantly reduce the time and resources needed to identify effective treatments for difficult medical conditions.

January 10, 2020

Some Intuition Behind Hierarchical Modeling

Hierarchical modeling is an advanced statistical approach used in clinical trials to make inferences across multiple patient groups, enhancing power and reducing sample sizes while requiring careful implementation to account for variability and potential biases in observed data.

November 13, 2017

Should I use a Bayesian trial?

This week, we published an article in JAMA titled “Bayesian Analysis: Use of Prior Information in Clinical Trials,” which explores the nuances of Bayesian analysis in clinical trials, emphasizing the importance of transparency and community consensus when using informative priors to avoid bias and enhance trial efficiency.

October 27, 2017

Todd Graves: let me introduce myself

Todd Graves, who joined Berry Consultants in January 2012, plans to regularly blog about innovative clinical trial designs and his statistical modeling for college football team ratings, sharing insights and updates on both topics.

September 5, 2013

Jason Connor's Upcoming Events

Jason Connor will be participating in several upcoming events, including presentations on Bayesian adaptive trials and findings at various conferences and teaching a class at Johns Hopkins School of Public Health from May 31 to June 28.

May 24, 2013

ASA's new section for Medical Devices and Diagnostics

SIGMEDD, a statistics interest group within the American Statistical Association focused on medical devices and diagnostics, is seeking ASA members' support through signatures for its transition to a full section, with the current Chair-Elect encouraging participation via a petition.

October 17, 2012

SIGMEDD - Statistical Interest Group

As the Chair of the Statistical Interest Group in Medical Devices and Diagnostics (SIGMEDD), I am seeking support from 100 ASA members to transition our group to a Section, and invite interested members to sign our petition via the provided Survey Monkey link.

March 29, 2013

November Webinars and Conferences

Join us for three upcoming events this month focused on clinical trial simulation and adaptive trial design, including two webinars on November 14 and 15, and a conference on November 29-30, where experts will share insights and benefits of these innovative approaches.

November 12, 2012