Posted in Uncategorized

Margins of error in opinion polls

Hmmm: what’s the fuss about an opinion poll?

The Guardian published an article with the headline “Labour lead falls as Greens hit 20-year high in Guardian/ICM poll”; but can this headline really be supported by the evidence they supply?

According to the footnote at the bottom of the piece: ICM interviewed a random sample of 1,002 adults aged 18+ by telephone on 16–19 January 2015. Interviews were conducted across the country and the results have been weighted to the profile of all adults. ICM is a member of the British Polling Council and abides by its rules.

Why do many national opinion polls use about 1,000 respondents?

The margin of error for support of a party – or how close we expect our sample to be to the true value (which we don’t know), depends on the percentage of people (p) expressing support for a party and the sample size (n) used.  More precisely, we use a 95% Confidence Interval (so if we were able to calculate this interval many times, 95% of the time the true level of support would be within these intervals – but in any given interval, we aren’t certain if the true (but unknown) value is actually contained in the interval).

It is: \pm 1.96\times \sqrt{\frac{p(100-p)}{n}}

The closer the value of p is to 50%, then the higher this value will be for any given n. The worst case scenario (in terms of the widest margin of error) occurs at 50%, so let’s examine that case:

\pm 1.96\times \sqrt{\frac{50(100-50)}{n}} = \pm 1.96\times \sqrt{\frac{50\times 50}{n}} = \pm 1.96\times 50\times \sqrt{\frac{1}{n}} = \pm 98 \sqrt{\frac{1}{n}}

The 40% case (which has the same results as the 60% case) is

\pm 1.96\times \sqrt{\frac{40(100-40)}{n}} = \pm 1.96\times \sqrt{\frac{40\times 60}{n}} = \pm 39.2 \sqrt{\frac{6}{n}}

The 30% case (which has the same results as the 70% case) is

\pm 1.96\times \sqrt{\frac{30(100-30)}{n}} = \pm 1.96\times \sqrt{\frac{30\times 70}{n}} = \pm 19.6 \sqrt{\frac{21}{n}}

pollSampleSize

The graph illustrates this quite neatly – the margin of error at a sample size of 500 is \pm4.0, at 1000 it is \pm2.8, while at 1500 it is \pm2.3. Once you get a sample of about 1000, any additional gain in terms of reducing your margin of error is hard won.

So, how does all this actually reflect on the current party standings?

The current poll reports the following:

  • Labour Party: 33% no change
  • Conservative Party: 30% +2
  • Liberal Democrats: 11% -3
  • United Kingdom Independence Party (UKIP) 11% -3
  • Green Party 9% +4
  • Other 7% +1

Using the formula introduced above, the 95% confidence intervals for percentage support for each party was calculated and included on the graph below (as dashed lines).

ICM_GuardianPoll_withErrorsIf ICM have managed to randomly sample from the voting population and appropriately weight the results to match the profile of all adults then the current state of affairs can be viewed as a snapshot of the support at the moment (with some wiggle room as illustrated by the graph above). However, that’s the biggest if of the lot!! Hopefully, I’ll be returning to that issue in the run up to the election in May.

Unknown's avatar

Author:

I was previously an academic applied statistician (based in the University of the West of England, Bristol) with a variety of interests. This blog reflects that variety! I now work in official statistics - which will not be covered at all here.

Leave a comment