This guide explains how to create a holdout experiment to measure the overall effectiveness of your Experimentation program.
As you begin planning your Experimentation program, you may want to track how much of an impact your experiments have over time. Will there be any measurable differences in behavior between the end users you include in experiments, and those you do not? Which group of end users will spend more money, sign up for services, or affect other metrics at higher rates? Holdout groups can help you answer these questions.
Holdouts let you exclude a percentage of your audience from your Experimentation program. This enables you to see the overall effect of your experiments on your customer base, and helps determine how effective the experiments you’re running are. If, after a set period of time, such as a month or quarter, there are no measurable differences between the two groups you may want to reconsider the number, scope, and design of the experiments you’re running.
In this guide, you will:
To learn more about LaunchDarkly’s Experimentation offering, read Experimentation.
To complete this tutorial, you must have the following prerequisites:
To complete this guide, you should understand the following concepts:
First, decide what metric you want to measure. Choose a metric that aligns with the same KPIs or goals you want to experiment on, such as average revenue per customer, or percentage of customers that sign up for your service.
In this example, you’re in charge of your organization’s growth program. You might be using a variety of metrics to measure things like total sign-ups, revenue per customer, or revenue per cart. Here, your main metric measuring the success of your organization’s Experimentation program will be new, completed account sign-ups.
To create your metric:

Sending custom events to LaunchDarkly requires a unique event key. You can set the event key to anything you want. Adding this event key to your codebase lets your SDK track actions customers take in your app as events. To learn more, read Sending custom events.
LaunchDarkly also automatically generates a metric key when you create a metric. You only use the metric key to identify the metric in API calls. To learn more, read Creating and managing metrics.
You can create and add secondary metrics to your holdout if needed.
Before you create a holdout, you must decide the following:
To create the holdout:
user to Randomize by.Here is what your holdout will look like:

You can add experiments to your holdout when you create them. When deciding which experiments to add to a holdout, think about whether you want to measure the effectiveness of your Experimentation program as a whole, or only for a subset of your business.
In this example, you are adding only experiments related to front-end changes in your app such as pop-ups, different versions of copy, or different sign-up flows. You are not including experiments related to back-end changes that don’t have a direct affect on how customers interact with your app.
To learn how to add experiments to your holdout when you create them, read Creating experiments.
At the end of the quarter, you can analyze your holdout experiments to understand how much of an impact your Experimentation program is having on your chosen metrics.
On the holdout details page, the “Results” section displays the results of the metric for two variations:
You can see which variation is performing better if at least one of the experiments in the holdout is running. However, we don’t recommend making any decisions about your holdout experiment until all of the experiments within the holdout are finished and you have shipped winning variations for all of them. To learn how, read Analyzing experiments.

When you’re confident you have recorded enough data, you can stop the holdout and analyze its results. To do this, navigate to the holdout details page and click End. The contexts that were in the holdout will no longer be excluded from future experiments.
You can then make a decision about the results of your holdout based on which variation is performing better:
In this guide you learned how to create a holdout experiment using a prerequisite flag to measure the overall impact of your Experimentation program. By assessing the impact of your experiments as a whole, you can fine-tune your audiences and the metrics you’re measuring, and ensure you’re getting the most value out of LaunchDarkly Experimentation.