How can each brand’s buyer base buy the category at above the average rate?

Abstract

An apparent anomaly in brand metrics data is that the aggregated buyer base of each brand appears to buy the category at above the average rate. This seems arithmetically impossible. The effect is real, but it has a simple explanation. It is caused by the fact that heavy category buyers buy more brands than light buyers do. They therefore appear in the buyer base of multiple brands, and so inflate the category buying rate for each brand to above the overall average rate. This study documents what it calls the ‘category purchase rate anomaly’ in empirical data and demonstrates how it occurs via a simulated dataset.

Keywords

brand performance metrics, brand loyalty, category purchasing, Dirichlet model

Introduction

The purpose of this paper is to investigate and explain an apparent anomaly that occurs in brand performance data derived from consumer purchase panels. The anomaly is that each brand’s buyer base appears to buy the category at above the average rate. At face value this seems arithmetically impossible, but it is shown to arise because (a) the distribution of category purchase frequency is highly skewed (many infrequent buyers, very few frequent ones) and (b) the frequent buyers buy more brands than the infrequent ones do. Frequent buyers therefore appear in the buyer bases of several brands and so inflate the category buying rate of each of them.

Consumer Panels and Panel Data Metrics

Brand managers and consumer insights specialists in hundreds of consumer-packaged goods companies subscribe to the data and reporting provided by consumer panel companies such as GfK, Kantar and Nielsen. These research corporations run large-scale panels of households in dozens of countries, in which shoppers scan the codes of the products they buy on a daily or weekly basis, and the results are collated into large databases. Subscribing client companies access the resulting information, often in the form of ready-designed brand metrics reports, via an electronic interface.

Key metrics provided to managers and researchers from consumer panel data include:

Brand penetration – the proportion of households that purchase a brand in a time period, such as a quarter or a year.

Brand Purchase Occasions – the number of occasions the brand is bought in a time period.

Category purchase occasions – overall – the average number of occasions the category is bought by households that buy it.

Category purchase occasions – each brand’s buyers – the average number of occasions the buyers of each brand buy the category.

Share of Category Requirements – the average proportion of total category purchasing that is allocated to each brand by its buyers.

And of course,

Market Share – measured as the brand’s share of total units, volume, revenue or purchase occasions.
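To illustrate how these metrics are derived from raw panel records, the following minimal Python sketch computes them for a tiny invented panel. The `panel` dictionary, its household numbers and the function name `brand_metrics` are hypothetical and are not taken from any panel provider’s data or software. Share of category requirements is calculated here as a ratio of totals across the brand’s buyers, which is one common convention and an assumption on our part.

```python
# Hypothetical panel: purchase occasions per household and brand in one year.
# Household 5 is on the panel but did not buy the category at all.
panel = {
    1: {"A": 2, "B": 1},
    2: {"A": 1},
    3: {"B": 1},
    4: {"A": 1, "B": 2},
    5: {},
}


def brand_metrics(panel, brand):
    """Return the basic brand performance metrics for one brand."""
    buyers = [h for h in panel.values() if h.get(brand, 0) > 0]
    brand_occasions = sum(h[brand] for h in buyers)
    # Category occasions made by this brand's buyers (all brands counted).
    buyer_category_occasions = sum(sum(h.values()) for h in buyers)
    total_occasions = sum(sum(h.values()) for h in panel.values())
    return {
        "penetration": len(buyers) / len(panel),
        "brand_purchase_frequency": brand_occasions / len(buyers),
        "category_purchase_frequency": buyer_category_occasions / len(buyers),
        # Assumption: ratio of totals rather than a mean of household ratios.
        "share_of_category_requirements": brand_occasions / buyer_category_occasions,
        "market_share": brand_occasions / total_occasions,
    }


metrics_a = brand_metrics(panel, "A")
print(metrics_a)
```

For brand A in this invented panel, three of the five households buy the brand, so penetration is 60%, and the brand accounts for four of the eight category purchase occasions, a 50% share.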

Metrics such as these are indispensable to consumer goods companies. They are used to monitor market share, to assess competitor activity and growth, and to evaluate the company’s own actions or interventions. They also allow managers to understand the composition of their market share, which is the number of brand buyers multiplied by how much each of them bought. The metrics also allow for ‘pattern spotting’ to discern, for example, how the brands in a category follow the famous double jeopardy pattern, whereby small brands not only have fewer buyers, but also get somewhat less loyalty from those customers (e.g. Graham et al. 2017). A frequent point of interest is whether certain brands receive unusually high loyalty or unusually low loyalty, given their penetration level (Pare & Dawes, 2011). Likewise, managers may be interested to see the extent to which their brand appeals to lighter or heavier buyers of the category (e.g. Ehrenberg et al. 2004; Romaniuk & Wight, 2015; Stern & Hammond, 2004). At face value, appealing more to heavy category buyers appears desirable, since they offer more potential volume (Hallberg, 1995). That said, brand growth arises from attracting new buyers to the brand, most of whom will be light (Sharp, 2010; Dawes, 2016).

Table 1 shows two typical examples of Brand Performance Measures, also referred to as BPMs, from data supplied by NielsenIQ¹. The categories are yoghurt and light duty detergent. The top 10 national brands (names masked) are arranged in descending order of market share. The metrics included are the penetration and overall purchase frequency of the category, the penetration of each brand, brand purchase frequency, and the average number of occasions each brand’s buyer base buys the category.

Table 1.

Brand performance metrics.

Yoghurt				Light duty Detergent
	Penetration (% buying in a year)	Brand purchase frequency (occasions in a year)	Category purchase frequency (occasions in a year)	Penetration (% buying in a year)	Brand purchase frequency (occasions in a year)	Category purchase frequency (occasions in a year)
Category	86	—	12	78	—	3.4
Brands
1	62	5.5	14	59	2.4	3.6
2	57	6.1	15	28	2.3	4.6
3	41	4.7	16	17	2.4	5.2
4	22	3.4	15	8	1.8	5.5
5	12	3.8	17	7	1.6	4.8
6	8	3.4	16	3	2.0	4.2
7	8	3.3	19	2	1.8	6.0
8	6	3.4	18	2	2.0	5.6
9	5	3.4	16	1	2.8	5.2
10	4	4.2	17	1	1.8	5.8

An Apparent Anomaly in Category Purchasing

The focal issue examined in this study is that consumer panel data of the type used in Table 1 consistently produces an anomalous result. That is, the rate at which each brand’s buyers buy the category is almost always higher than the overall average rate. For example, in Table 1 the average number of purchase occasions per year for Yoghurt is 12. However, every brand’s buyer base purchases the category at above this average rate, from 14 for the largest brand to 17 for the 10th ranked. Likewise, the average purchase rate for detergent is 3.4, but we again see that all the brands’ buyer bases purchase at above this average rate, ranging from 3.6 to 6.0 occasions per year.

Basic arithmetic tells us that if we create an average from a set of numbers, those numbers cannot all be above the average. But the anomaly is plainly in evidence, and it may lead marketers or research analysts to question their data or the way they interpret it, since it seems so obviously wrong.

The answer was explained to this author by the late Mr John Scriven of London South Bank University (a colleague of the legendary researcher Andrew Ehrenberg). Unfortunately, this author only thought he understood the explanation at the time. It became clear only years later. The explanation is that categories have many light buyers and few frequent buyers, and the frequent category buyers buy more brands. They therefore appear in the buyer base of multiple brands, and so inflate the category buying rate for each brand.
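The mechanism can also be illustrated with a small simulation. The sketch below is a simplified illustration under stated assumptions. It is not the Dirichlet model and not the simulation procedure of any published study: the brand names, the choice probabilities in `SHARES`, the number of households and the way category purchase counts are drawn are all hypothetical. Each household draws a skewed number of category purchases and then chooses a brand independently on each occasion with the same fixed probabilities.

```python
import random

random.seed(1)  # fixed seed so the illustration is repeatable

# Assumed probability of choosing each brand on any purchase occasion.
SHARES = {"A": 0.4, "B": 0.3, "C": 0.2, "D": 0.1}
N_HOUSEHOLDS = 20_000


def simulate_household():
    """Return (category purchases, set of brands bought) for one household."""
    # Skewed category rate: most households buy once or twice, few buy often.
    n = 1 + int(random.expovariate(0.5))
    brands = random.choices(list(SHARES), weights=list(SHARES.values()), k=n)
    return n, set(brands)


households = [simulate_household() for _ in range(N_HOUSEHOLDS)]

# Overall average category purchase rate across all households.
overall_rate = sum(n for n, _ in households) / N_HOUSEHOLDS

# Average category purchase rate among the buyers of each brand.
rate_by_brand = {}
for brand in SHARES:
    buyer_rates = [n for n, bought in households if brand in bought]
    rate_by_brand[brand] = sum(buyer_rates) / len(buyer_rates)

print(f"overall: {overall_rate:.2f}")
for brand, rate in rate_by_brand.items():
    print(f"buyers of {brand}: {rate:.2f}")
```

Because a household with more purchase occasions has more chances to buy any given brand, heavier buyers are over-represented in every brand’s buyer base, and each value in `rate_by_brand` should exceed `overall_rate` under these assumptions.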

To make the explanation clearer, Table 2 presents a worked example of typical consumer panel data for a packaged good over a 12-month period. We show 20 hypothetical households and four brands. The households differ in the rate at which they buy the category and the brands in it. For example, Household 1 bought the category only once, and happened to buy brand C. Household 13 bought the category three times: brand A once and brand C twice. The average rate of buying the category across the 20 households is 2.9, as shown in the bottom row of the ‘Number of Category Purchases’ column. In other words, these households bought the category on 2.9 occasions on average over the time period. However, in the right-side columns we also see that the category purchase rate for each brand’s buyer base is considerably above 2.9. It is 3.3 for brand A, 3.8 for B, 4.0 for C and 4.8 for D. The question arises: how can each brand’s buyers buy the category at above the average rate?

Table 2.

Worked example of category and brand buying rates.

Household	No. purchases of Brand A	No. purchases of Brand B	No. purchases of Brand C	No. of purchases of Brand D	Number of Category purchases		Category purchase frequency of brand A buyers	Category purchase frequency of brand B buyers	Category purchase frequency of brand C buyers	Category purchase frequency of brand D buyers
1	0	0	1	0	1				1
2	1	0	0	0	1		1
3	1	0	0	0	1		1
4	0	0	1	0	1				1
5	1	0	0	0	1		1
6	1	0	0	0	1		1
7	0	1	0	0	1			1
8	0	1	0	1	2			2		2
9	0	1	1	0	2			2	2
10	1	1	0	0	2		2	2
11	1	0	0	1	2		2			2
12	1	1	0	0	2		2	2
13	1	0	2	0	3		3		3
14	1	1	1	0	3		3	3	3
15	2	1	1	0	4		4	4	4
16	2	2	1	0	5		5	5	5
17	2	1	2	0	5		5	5	5
18	1	2	1	2	6		6	6	6	6
19	3	1	2	1	7		7	7	7	7
20	2	3	1	1	7		7	7	7	7
Total No. HH’s buying brand at all	15	12	11	5
Average No. purchases of brand	1.4	1.3	1.3	1.2	Average number of category purchases = 2.9	Average number purchases of category by each brand’s buyers	3.3	3.8	4.0	4.8

We see that the overall average category purchase rate of 2.9 is formed from the number of category purchases made by all 20 households. Most of these buy only once or twice; a couple of them buy seven times. But if we look across to the columns that show the category purchase frequency of each brand’s buyers, we notice that the lightest category buyers appear in only one brand’s buyer base, whereas the heaviest category buyers appear in the buyer bases of multiple brands. Households 19 and 20, for example, appear in the buyer bases of all four brands. Consequently their seven purchase occasions are counted once for every brand and so ‘overweight’ the category purchase rate for all of them. This is the explanation: the heaviest category buyers buy multiple brands while the lighter ones buy, say, only one, so the relative proportion of heavy category buyers is higher in each brand’s buyer base than it is in the category overall.
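The figures in Table 2 can be reproduced with a few lines of Python. The sketch below transcribes the brand purchase columns of Table 2; the variable names are our own. It recomputes the overall category purchase rate and the rate for each brand’s buyer base, and also counts how many brands the lightest and heaviest households buy.

```python
# Purchases of brands A, B, C, D for households 1-20, transcribed from Table 2.
PURCHASES = [
    (0, 0, 1, 0), (1, 0, 0, 0), (1, 0, 0, 0), (0, 0, 1, 0), (1, 0, 0, 0),
    (1, 0, 0, 0), (0, 1, 0, 0), (0, 1, 0, 1), (0, 1, 1, 0), (1, 1, 0, 0),
    (1, 0, 0, 1), (1, 1, 0, 0), (1, 0, 2, 0), (1, 1, 1, 0), (2, 1, 1, 0),
    (2, 2, 1, 0), (2, 1, 2, 0), (1, 2, 1, 2), (3, 1, 2, 1), (2, 3, 1, 1),
]
BRANDS = "ABCD"

# Category purchases per household, and the overall average across all 20.
category = [sum(row) for row in PURCHASES]
overall = sum(category) / len(PURCHASES)

# Average category purchases among the households that bought each brand.
by_brand = {}
for j, brand in enumerate(BRANDS):
    buyer_rates = [n for row, n in zip(PURCHASES, category) if row[j] > 0]
    by_brand[brand] = sum(buyer_rates) / len(buyer_rates)

# Number of brand buyer bases in which each household is counted.
appearances = [sum(1 for x in row if x > 0) for row in PURCHASES]
light = [a for a, n in zip(appearances, category) if n <= 2]
heavy = [a for a, n in zip(appearances, category) if n >= 5]
brands_per_light = sum(light) / len(light)
brands_per_heavy = sum(heavy) / len(heavy)

print(f"overall category rate: {overall:.2f}")
for brand, rate in by_brand.items():
    print(f"buyers of {brand}: {rate:.2f}")
print(f"brands bought, light vs heavy: {brands_per_light:.2f} vs {brands_per_heavy:.2f}")
```

The computed rates correspond, after rounding, to the 2.9, 3.3, 3.8, 4.0 and 4.8 shown in Table 2. Households that buy the category once or twice buy about 1.4 brands on average, whereas those that buy it five or more times buy 3.6 brands on average, which is why the latter are counted in so many buyer bases.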

Therefore, if a researcher, insights analyst or marketer notices that their brand’s buyers are buying the category at higher than the average rate, two things follow: (1) their data, or their method of extracting it, is not necessarily incorrect; and (2) the result does not by itself mean their brand is ‘different’ in the sense that it really does skew to heavy category buyers. The category purchase rate anomaly they are seeing is a form of statistical selection effect, whereby heavy category buyers are more likely to be in any brand’s customer base relative to their incidence in the category as a whole. The data output is likely to be correct, although it can be puzzling without an explanation.
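The selection effect can be written down compactly. The notation here is introduced for illustration and does not appear in the cited sources: among N category-buying households, n_i is the number of category purchases made by household i, b_ij equals 1 if household i bought brand j at least once and 0 otherwise, and c_j is the average category purchase rate of brand j’s buyers.

```latex
\bar{n} = \frac{1}{N}\sum_{i=1}^{N} n_i, \qquad
\bar{b}_j = \frac{1}{N}\sum_{i=1}^{N} b_{ij}, \qquad
c_j = \frac{\sum_{i=1}^{N} n_i\, b_{ij}}{\sum_{i=1}^{N} b_{ij}}
    = \bar{n} + \frac{\operatorname{Cov}(n_i, b_{ij})}{\bar{b}_j},
\quad\text{where}\quad
\operatorname{Cov}(n_i, b_{ij}) = \frac{1}{N}\sum_{i=1}^{N} n_i\, b_{ij} - \bar{n}\,\bar{b}_j .
```

The rate c_j therefore exceeds the overall average whenever buying brand j is positively associated with category purchase frequency, and this can hold for every brand at once because the buyer bases overlap rather than partition the households. For brand D in Table 2, the overall average is 2.85, a quarter of the households buy the brand and the covariance is 0.4875, giving 2.85 + 0.4875/0.25 = 4.8.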

As a final note, we observe in Table 1 that the category purchase rate becomes larger as the size of the brand gets smaller. In other words, buyers of smaller brands tend to purchase the category more often, whereas buyers of bigger brands tend, on average, to be less frequent buyers of the category. This is a manifestation of what is called the ‘Natural Monopoly’ effect (e.g. Dawes, 2020) and, in turn, helps explain Double Jeopardy (e.g. Ehrenberg et al. 1990). The buyers of bigger brands tend to be somewhat less frequent or less knowledgeable category buyers; they buy the brands they know, which tend to be the market leaders. By contrast, heavy or more knowledgeable category buyers, while obviously still buying market-leading brands, also buy the smaller brands; because they know more about the brands in the market, they buy more of them. The outcome is that buyers of small brands buy more other brands, and therefore the small brands all get somewhat lower loyalty.
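The pattern can be checked directly from the figures in Table 1. The sketch below, with variable and function names of our own choosing, computes the Pearson correlation between brand penetration and the category purchase frequency of each brand’s buyers. With only ten brands per category it should be read as a descriptive check rather than a statistical test.

```python
from math import sqrt

# Brand penetration (%) and category purchase frequency of each brand's
# buyers (occasions per year), transcribed from Table 1, brands 1-10.
YOGHURT = {
    "penetration": [62, 57, 41, 22, 12, 8, 8, 6, 5, 4],
    "category_rate": [14, 15, 16, 15, 17, 16, 19, 18, 16, 17],
}
DETERGENT = {
    "penetration": [59, 28, 17, 8, 7, 3, 2, 2, 1, 1],
    "category_rate": [3.6, 4.6, 5.2, 5.5, 4.8, 4.2, 6.0, 5.6, 5.2, 5.8],
}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


r_yoghurt = pearson(YOGHURT["penetration"], YOGHURT["category_rate"])
r_detergent = pearson(DETERGENT["penetration"], DETERGENT["category_rate"])

print(f"yoghurt: r = {r_yoghurt:.2f}")
print(f"light duty detergent: r = {r_detergent:.2f}")
```

Both correlations are negative (roughly −0.7 in each category), consistent with buyers of smaller brands being heavier category buyers on average.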

In conclusion, an apparent anomaly that occurs in brand performance data derived from consumer panels has a straightforward explanation. Knowing this explanation may save effort by analysts or marketers querying why it occurs, or wondering if their data is incorrect.

Footnotes

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

John G Dawes

Note

References

Dawes (2016). Brand growth in packaged goods markets: Ten cases with common patterns. Journal of Consumer Behaviour, 15(5), 475–489. https://doi.org/10.1002/cb.1595

Dawes (2020). The natural monopoly effect in brand purchasing: Do big brands really appeal to lighter category buyers? Australasian Marketing Journal (AMJ), 28(2), 90–99. https://doi.org/10.1016/j.ausmj.2020.01.006

Ehrenberg, Goodhardt G., Barwise T. P. (1990). Double jeopardy revisited. Journal of Marketing, 54(3), 82–91. https://doi.org/10.1177/002224299005400307

Ehrenberg, Uncles M. D., Goodhardt G. J. (2004). Understanding brand performance measures: Using Dirichlet benchmarks. Journal of Business Research, 57(12), 1307–1325. https://doi.org/10.1016/j.jbusres.2002.11.001

Graham, Bennett, Franke, Henfrey C. L., Nagy-Hamada (2017). Double jeopardy–50 Years on. Reviving a forgotten tool that still predicts brand loyalty. Australasian Marketing Journal (AMJ), 25(4), 278–287. https://doi.org/10.1016/j.ausmj.2017.10.009

Hallberg (1995). All customers are not created equal. John Wiley & Sons.

Pare V., Dawes (2011). The persistence of excess brand loyalty over multiple years. Marketing Letters, 21(2), 163–175. https://doi.org/10.1007/s11002-011-9144-3

Romaniuk J., Wight (2015). The stability and sales contribution of heavy buying households. Journal of Consumer Behaviour, 14(1), 13–20. https://doi.org/10.1002/cb.1490

Sharp (2010). How brands grow. Oxford University Press.

Stern P., Hammond (2004). The relationship between customer loyalty and purchase incidence. Marketing Letters, 15(1), 5–19. https://doi.org/10.1023/b:mark.0000021967.80283.c8