Data trending

In summary

This section provides a step-by-step explanation of how to use microbiological indicator test results to identify historical trends and improve environmental hygiene management. It introduces several statistical methods including arithmetic mean, geometric mean, rolling geometric mean, cumulative sums (cusums).

I. Arithmetic mean

Also known as a simple average or log arithmetic mean, this method involves adding together all counts and dividing by the number of samples.

Example:

If you collected 5 samples and the test results (cfu/100 ml) were:

30
1,000
2,650
150,000
25

To calculate the arithmetic mean:

Add 230 + 1,000 + 2,650 + 150,000 + 25 = 153,905

Average = 153,905 ÷ 5 = 30,781 cfu/100 ml

The arithmetic mean of test results over time can be plotted as shown in figure 1.

Trend of E. coli in water

Considerations

This method gives disproportionate weight to extreme values. In this case, one very high result (150,000) distorts the average, which otherwise would be below 3,000. This can give a misleading impression of overall hygiene levels.

Taking the log of the arithmetic mean (log₁₀ 30,781 = 4.49) is optional and can make plotting trends easier. This approach is also called a log arithmetic mean, which is less influenced by extreme values.

II. Geometric Mean

This is also known as the mean log. You calculate a geometric mean by taking logs before you take any averages.

To calculate the geometric or mean log of the same five samples:

Count (cfu/100 ml)	Log10 Value
230	2.36
1000	3.00
2650	3.42
150,000	5.18
25	1.40

Calculation:

Add logs = 2.36 + 3.00 + 3.42 + 5.18 + 1.40 = 15.36

Average = 15.36 ÷ 5 = 3.08

Again, mean log values can be plotted over time to show a trend.

How to interpret the graph: Note the zero line in the centre of the graph on the y-axis. If an upward trend is present, the results are worse than the baseline, conversely if a downward trend is observed, results are better than the baseline. A flat line indicates that results are consistent with the baseline. In the above graph the samples for October 2008 get progressively worse over the course of the month (above the baseline determined for that business), but then improve over the course of November. There is a temporary increase for the 28th November testing, but then a steady decrease until early January. If, in this example, a new measure was introduced at the end of October, then this graph would demonstrate that this new measure was working well to improve the microbial cleanliness in the business.

Insight: Cusums amplify small changes in performance, making them valuable when indicator counts usually fall within a narrow range.

Log trend of E. coli in water

Date	Estimated log E. coli (cfu/100 ml)
01-Jan	4.2
01-Feb	3.7
01-Mar	4.1
01-Apr	3.8
01-May	4.2
01-Jun	3.7
01-Jul	4.1
01-Aug	4.0
01-Sep	3.7
01-Oct	4.1
01-Nov	4.2
01-Dec	3.9

Log trend of E. coli in water

III. Rolling Geometric mean

The rolling geometric mean is calculated in a similar way to the geometric mean. As with the geometric mean you start off by taking the logs of your indicator counts, add them together and divide by the number of samples taken. The main difference between a geometric mean and a rolling geometric mean is that all of the results from several weeks of testing are used to calculate a rolling geometric mean, i.e. this method averages test results over a rolling period, updating weekly to create a smoothed long-term trend. In microbiology it is common to use the results of six weeks of testing for the rolling geometric mean. It is important to note that the six weeks do not have to be consecutive.

How it works: Calculate the geometric mean for weeks 1–6 by taking the log of each result, add them all up and divide the sum by the total number of samples (6 in this case). For week 7, drop week 1 and include week 7 (i.e. weeks 2-7). Continue this pattern for each subsequent week.

Six week rolling geometric mean of E. coli in water versus log trend

IV. Cumulative Sums (Cusums)

Cusum charts track whether test results are improving or deteriorating over time by comparing weekly geometric means to a baseline average which is generated from your own data. So, essentially, you are comparing how your current samples are comparing against their historical selves. Cusums tend to exaggerate trends in data so are particularly useful in data sets with a narrow range of values.

Step-by-step calculation:

Choose a baseline period (often13 weeks which is three months’ worth of data) during which you would consider your sampling to be representative of your business. Calculate the baseline geometric mean from those weeks, i.e. take the log of each result, add them up, and divide the sum by the total number of results, 13 in this case – this is your baseline. For each new week calculate the week’s geometric mean then subtract it from the baseline and add this difference to a running tally. An example of the running tally can be seen below. The explanation may seem a bit complicated, but if you have a look at the table below, and then re-read the above paragraph, it will (hopefully) all become a bit clearer.

Date	Weekly Geometric Mean	Baseline Average	Difference	Running Tally
10/10/24	1.9	1.8	0.1	0.1
17/10/24	2.2	1.8	0.4	0.5
24/10/24	2.3	1.8	0.5	1
31/10/24	2.4	1.8	0.6	1.6
07/11/24	0.8	1.8	-1.0	0.6
14/11/24	1.1	1.8	-0.7	-0.1
21/11/24	1.3	1.8	-0.5	-0.6
28/11/24	2.4	1.8	0.6	0
05/12/24	0.5	1.8	-1.3	-1.3
12/12/24	1.6	1.8	-0.2	-1.5
19/12/24	1.4	1.8	-0.4	-1.9
26/12/24	1.9	1.8	0.1	-1.8
02/01/25	1.8	1.8	0	-1.9

A cumulative sum of E. coli numbers

Did you find this helpful? We would love to hear from you.

Yes

Your feedback

Thank you for your feedback!

Data trending

In summary

Trending Microbiological Indicator Results

I. Arithmetic mean

Trend of E. coli in water

Here is a visual only interactive chart of: ddd

Considerations

II. Geometric Mean

Log trend of E. coli in water

Here is a visual only interactive chart of: x

Log trend of E. coli in water

Here is a visual only chart of: Chart showing the geometric trend of E. coli in water over time using mean log values

III. Rolling Geometric mean

Six week rolling geometric mean of E. coli in water versus log trend

IV. Cumulative Sums (Cusums)

Step-by-step calculation:

A cumulative sum of E. coli numbers