# Data Confidence

<figure><img src="https://d33v4339jhl8k0.cloudfront.net/docs/assets/5f61d6774cedfd00173b8695/images/602fc9768502d1120e9093a5/file-xUCHDjlkKt.png" alt=""><figcaption></figcaption></figure>

### What is Data Confidence?

Data Confidence measures the trustworthiness of the results from our experiments. This is an important metric that determines the likelihood that the impact of your experiment is accurate and not caused by random chance. To test the data confidence yourselves, try the [A/B Test Calculator](https://abtestguide.com/calc/).

The Data Confidence indicates how certain you can be that there is a significant impact of your test. This confidence is based on a combination of the amount of change (impact) and number of trials. So, if you have a very small measured impact, you need to run the test longer for more trials to get a significant result.

Our data bars go from red (not enough data to be trustworthy) to yellow (slightly trustworthy) to green (trustworthy). In general, the longer you run the test, the higher the Data Confidence. So, if your results are currently red, you should give the campaign more time to collect data or adjust the timeframe in which you are viewing the data.

However, AB test results can be difficult to read sometimes. For example:

1. A 10% measured impact with 95% confidence does not mean that you are 95% confident that you have a 10% change. It means that the change is big enough to be 95% confident that there is a significant change.
2. Because of some seasonal change or any other influence, the measured impact reverses (gets smaller) while your number of trials increases. In this case the significance can go down, and you need to keep the test running.
3. An unexpected event happened that influenced your measurements, in which case you should rerun the test.
4. You have so many trials, that even a minute measured impact of, for example, 0.02% is calculated with 100% confidence. The small impact is more likely to result from other unknown small effects than a causal connection of your test.

### How do we measure Data Confidence?

We measure Data Confidence based on statistical significance using a [null hypothesis test](https://simple.wikipedia.org/wiki/Null_hypothesis). This tests the difference between the performance of the Crobox group vs. Control group. In the Crobox group, users are exposed to our messages. In the Control group, users are exposed to invisible messages so that we can test the difference between the two.

The null hypothesis test determines whether or not the use of Crobox’s Campaigns has any impact on your KPIs.

There are two metrics of importance when determining statistical significance: P Value and Power. The P Value in your performance tracking is how we measure statistical significance. [Power](https://www.statisticsteacher.org/2017/09/15/what-is-power/) is the probability that a test of significance will pick up on an effect that is present.

A P Value of 0.05 (5%) means you can say with 95% confidence that there is a difference between the Crobox group vs. Control group. We always make sure our experiments have at least 95% statistical significance.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.crobox.com/how-to-guides/analytics/data-confidence.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
