TURF stands for Total Unduplicated Reach and Frequency. It is a technique that came into prominence during the 1950s in the space of media planning. In this post, we are covering only the “reach” component of this technique.
Reach is the percentage of respondents for whom at least one of the claims in a particular combination is their most preferred claim. That is, it is a measure of how many respondents can be “activated” by a combination of claims. Importantly, claims are tested one-by-one (not in combinations). Therefore combinations analysis through TURF is not the optimal way to assess preferences for combinations of claims.
Spot On AI-empowered analysis is an automated tool that helps you to automatically find top claim combinations.
Imagine you are launching a new brand of πΉ vegetable juices. As you are preparing for the launch, you want to have a range of flavors that will appeal to (“reach”) the largest number of potential customers. But what flavors should you offer if your budget allows only two flavors?
Let’s start by listing all possibilities:
What do you do? One way to solve this problem is to run a Product Variant Selector study, which will help you rank flavors by consumer preference. The same study will also give you individual-level preferences, such as:
Respondent ID | π₯ | π₯ | π₯ | π½ | πΆ | π₯ | π₯¦ | π | π° |
---|---|---|---|---|---|---|---|---|---|
1 | 4.9 | 1.5 | 0.6 | -0.8 | 1.2 | 1.8 | 3.9 | -8.5 | 4.0 |
2 | 4.5 | 0.8 | -1.3 | 0.1 | -0.2 | 1.7 | 3.8 | -8.4 | 2.0 |
3 | 5.5 | -1.0 | 0.3 | -0.3 | 1.4 | 0.7 | 3.6 | -7.3 | 2.9 |
4 | 5.7 | -1.2 | -1.0 | 6.0 | 0.0 | 0.1 | 5.2 | -8.9 | 3.1 |
5 | 4.3 | -1.5 | -0.3 | -1.5 | -0.8 | 0.8 | 5.9 | -7.8 | 3.7 |
6 | 3.2 | 0.8 | -1.0 | -0.2 | 0.2 | 0.2 | 3.7 | -8.5 | 3.5 |
7 | 3.1 | 0.2 | 1.2 | -0.2 | -0.9 | 0.5 | 3.1 | -7.5 | 1.5 |
8 | 5.1 | 0.5 | -0.1 | -1.4 | -1.4 | 0.3 | 3.1 | -6.5 | 3.8 |
9 | 4.6 | 0.9 | -0.9 | -1.4 | 1.0 | 0.1 | 3.4 | -8.0 | 1.2 |
10 | 3.2 | 0.9 | 0.5 | -1.0 | 0.4 | 0.3 | 3.6 | -7.8 | 1.7 |
11 | 3.4 | -0.5 | -1.4 | 5.0 | 0.2 | 1.5 | 5.8 | -7.9 | 2.2 |
12 | 4.8 | -0.9 | 0.5 | 0.3 | -1.0 | 2.9 | 3.0 | -7.2 | 3.7 |
Each cell shows the part-worth utility of a certain flavor for a particular respondent. We can then assume that if a particular flavor is among the top two most liked by a person, then we call it appealing to them β. Therefore, these scores can be used to identify which flavor will be the most or the second most liked by each respondent:
Respondent ID | π₯ | π₯ | π₯ | π½ | πΆ | π₯ | π₯¦ | π | π° |
---|---|---|---|---|---|---|---|---|---|
1 | β | β | |||||||
2 | β | β | |||||||
3 | β | β | |||||||
4 | β | β | |||||||
5 | β | β | |||||||
6 | β | β | |||||||
7 | β | β | |||||||
8 | β | β | |||||||
9 | β | β | |||||||
10 | β | β | |||||||
11 | β | β | |||||||
12 | β | β | |||||||
… | … | … | … | … | … | … | … | … | … |
Next, we assemble several possible combinations of flavors and calculate a couple of metrics:
Respondent ID | π₯ + π½ | π₯ + π₯¦ | π₯ + π° | π½ + π₯¦ | π½ + π° | π₯¦ + π° |
---|---|---|---|---|---|---|
1 | β | β | ββ | β | β | |
2 | β | ββ | β | β | β | |
3 | β | ββ | β | β | β | |
4 | ββ | β | β | β | β | |
5 | β | ββ | β | β | β | |
6 | β | β | β | β | ββ | |
7 | β | ββ | β | β | β | |
8 | β | β | ββ | β | β | |
9 | β | ββ | β | β | β | |
10 | β | ββ | β | β | β | |
11 | β | β | ββ | β | β | |
12 | β | β | ββ | β | β | |
… | … | … | … | … | … | … |
Reach | 11 | 12 | 11 | 9 | 6 | 11 |
Reach % | 92% | 100% | 92% | 75% | 50% | 92% |
Frequency | 1.1 | 1.5 | 1.3 | 1.1 | 1.0 | 1.1 |
As you see, everyone likes at least one of π₯ avocado + π₯¦ broccoli (Reach = 100%). This combination is a winner!
If you have more budget to launch three combinations, you can do the same analysis with three-way combinations:
Respondent ID | π₯ + π½ + π₯¦ | π₯ + π½ + π° | π₯ + π₯¦ + π° | π½ + π₯¦ + π° |
---|---|---|---|---|
1 | β | ββ | ββ | β |
2 | ββ | β | ββ | β |
3 | ββ | β | ββ | β |
4 | ββ | ββ | β | β |
5 | ββ | β | ββ | β |
6 | β | β | ββ | ββ |
7 | ββ | β | ββ | β |
8 | β | ββ | ββ | β |
9 | ββ | β | ββ | β |
10 | ββ | β | ββ | β |
11 | ββ | β | β | ββ |
12 | β | ββ | ββ | β |
… | … | … | … | |
Reach | 12 | 12 | 12 | 12 |
Reach % | 100% | 100% | 100% | 100% |
Frequency | 1.7 | 1.3 | 1.8 | 1.2 |
This time, all combinations have equally good reach. Now you need to look at frequency. If you offer the combination of π₯ avocado + π₯¦ broccoli + π° chestnut, for an average consumer, there will 1.8 liked flavors from your brand. This is the way to go.
NB: This is not an analysis of mixing flavors in one juice product. If you want to look into combinations of features in one product, you are better off with a Claims Combination Test or another type of conjoint analysis.
TURF analysis can be used to determine an efficient number of SKUs to launch for your range. For example, based on your internal finance calculations, you can determine that you must sell at least 100,000 items for any single SKU to recoup advertising investment and that you generally want to have only a handful of SKUs to avoid destocking by your retail channels.
Say, your total market is 2M units a year and you can sell 1 unit per consumer a year. That means, that any new SKU launched must reach 5% of the market.
Now, we need to model a few scenarios:
This type of analysis is called the TURF ladder:
In this example, it is efficient to launch six SKUs because the additional share of the seventh SKU is below 5%. Adding it means you will not recoup advertising investment.
This analysis is most helpful for most FMCG/CPG situations because:
Theoretically, it is possible to test a very large number of items (SKUs, products, flavors). For example, even 1000, as long as you have information about relative preferences for these 1000 SKUs across your consumers.
As TURF is performed on survey data, it is not often realistic to ascertain relative preferences for more than 50 items for each individual. That’s why Spot On is configured to limit the TURF analysis to 50 items.
TURF is commonly performed on a set of your own flavors, SKUs, or product variants. If you do not also consider your competition, this could result in optimization for your range only.
There are two ways to avoid this pitfall:
If you see low reach percentages for top combinations, it means that respondents’ preferences are non-homogeneous. That is different respondents like different things.
For example, if the best two-way combination of items reaches only 10% of the sample, it implies that the other 90% of samples like something else, and there is not a clear winning combination.
Generally, the greater the number of items in the test, the lower the TURF reach percentage. However, what level of low threshold to use depends on your objectives.