A public opinion poll is a questionnaire used to measure public opinion, or the collective attitudes held by a population. Because of the impracticality of administering a questionnaire to all of a large population's members, public opinion polls assess the opinions of the total population by surveying a sample that is sufficiently large and representative of the population as a whole to produce statistically valid results.
Polling is by far the predominant means for measuring public opinion in this day and age and poll administration practices have grown increasingly sophisticated and rigorous since the 1930s inception of the enterprise. Nevertheless, it remains an imperfect instrument, the accuracy of which is frequently compromised by a variety of factors over which even the most diligent pollsters exert limited control.
While straw polling, or the estimation of public opinion based on informal sampling and survey procedures, dates back at least to the early nineteenth century, the emergence of scientific public opinion polling is a much more recent development.
Prior to the 1930s, the most impressive attempt at public opinion polling was the Literary Digest poll, a national-scale poll conducted by the now-defunct Literary Digest magazine as a means for forecasting U.S. presidential election results. Its first run in 1916 correctly predicted Woodrow Wilson's reelection based on a simple tabulation of the returns of millions of postcard questionnaires that had been sent to Americans listed in telephone directories and state automobile registries. The exercise was repeated to good effect in the next four election cycles, in each case accurately predicting the presidential victor based on the returns of extraordinarily large numbers of questionnaires from Americans nationwide.
In the meantime, several up-and-coming public opinion researchers, including Archibald Crossley, Claude Robinson, Elmo Roper, and George Gallup, were underway with major breakthroughs in the development of statistical sampling methodology.
In 1935, Gallup founded the American Institute of Public Opinion, precursor to the still-extant Gallup Organization, and proposed a challenge to the Literary Digest poll's preeminece as a means for establishing the legitimacy of his quota sampling technique. Leading up to the 1936 presidential election, Literary Digest conducted its usual postcard survey. While its 2.3 million respondents constituted an extraordinarily large sample, relying on telephone and automobile listings — especially during the Great Depression — yielded results that did not reflect the voting intentions of the public at large. A week before election day, Literary Digest predicted that Alf Landon, the Republican Party candidate, would win the election by a large margin. Gallup, on the other hand, conducted his own survey with a much smaller but demographically representative sample. Gallup correctly predicted Roosevelt's landslide victory. The Literary Digest went out of business soon afterwards, while the polling industry started to take off.
Gallup launched a subsidiary in Britain, where it correctly predicted Labour's victory in the 1945 general election, in contrast with virtually all other commentators, who expected the Conservative Party, led by Winston Churchill, to win easily.
Soon after the 1948 election, the Social Science Research Council (SSRC) formed an independent, academic Committee on the Analysis of Pre-election Polls and Forecasts to investigate the pollsters' methods and pinpoint why they failed to predict Truman's victory.
By the 1950s, polling had spread to most democracies. Nowadays they reach virtually every country, although in more autocratic societies they tend to avoid sensitive political topics. In Iraq, surveys conducted soon after the 2003 war helped measure the true feelings of Iraqi citizens to Saddam Hussein, post-war conditions and the presence of US forces.
For many years, opinion polls were conducted mainly face-to-face, either in the street or in people's homes. This method remains widely used, but in some countries it has been overtaken by telephone polls, which can be conducted faster and more cheaply. Because of the common practice of telemarketers to sell products under the guise of a telephone survey and due to the proliferation of residential call screening devices and use of cell phones, response rates for phone surveys have been plummeting. Mailed surveys have become the data collection method of choice among local governments that conduct a citizen survey to track service quality and manage resource allocation. In recent years, Internet and short message service (SMS, or text) surveys have become increasingly popular, but most of these draw on whomever wishes to participate rather than a scientific sample of the population, and are therefore not generally considered accurate.
Designing a public opinion poll entails several steps, from determining the population of interest and adopting an appropriate method for recruiting a representative sample of that population to developing a questionnaire that is well-suited to obtaining unbiased results.
Public opinion polls may be self-administered, in which case respondents read and complete the questionnaire on their own. Internet polls are nearly always self-administered. Polls can also be administered by an interviewer, or an individual who reads the questionnaire to respondents and records their responses on their behalf. Interviewer-administered polling can take place in person, that is, in the format of face-to-face conversations between the interviewer and respondents. More commonly, though, interviewer-administered polls are conducted by telephone.
Various factors affect public opinion poll results' validity and reliability -- that is, the extent to which they accurately reflect the true population opinions that they are intended to measure. These factors might be sorted into three broad categories: those stemming from the use of a sample to estimate the opinions of a whole population; those stemming from the design of the questionnaire used to gather information from respondents; and those stemming from the way in which the questionnaire is administered.
All polls administered to population samples are subject to sampling error, which refers to the extent to which the opinions expressed by the surveyed sample do not reflect the opinions of the population as a whole. Sampling error is typically expressed as a confidence interval of plus or minus some number of percentage points associated with a statistical confidence level. For example, the maximum sampling error (MSE) for a sample of 1050 drawn from a population of 1,000,000 is +/-3 percentage points at the 95% confidence level; this means that there is a 95 percent chance that the results of a survey administered to that sample fall within a 6-point range around the true opinion of the population as a whole.
Pollsters can reduce sampling error by administering a poll to a larger sample. For example, a sample of 10,000 drawn from the 1,000,000-member population would yield an MSE of +/-1% at the 95% confidence level, and a sample of 100,000 would reduce the MSE to just +/-0.3%. In practice, however, increasing a sample size enough to reduce sampling error substantially usually entails undue financial and logistical costs.
Sampling error does not reflect other sampling-related sources of inaccuracy, including sampling bias, which comes about when a poll is administered to a sample, however large, that is not representative of the population as a whole. A form of selection bias, sampling bias can be the result of a variety of factors, including convenience sampling, the use of an inappropriate sampling frame, and non-response bias.
Convenience sampling is the practice of administering a poll to individuals who are easiest to recruit regardless of their representativeness of the population whose opinions are intended to be measured.
A sampling frame is a defined set of individuals within a population from which a sample is to be drawn. It may but does not necessarily consist of a literal list of all of the population's members. In fact, exhaustive population lists often do not exist or cannot be readily obtained by pollsters. When this is the case, pollsters use some sort of proxy frame, which may consist of a literal list, such as a directory of listed telephone numbers, or a figurative one, as in the case of random digit dialing, which samples from a hypothetical "list" of all possible telephone number permutations. Coverage error is the discrepancy between such non-exhaustive sampling frames and the full population. To the extent that a sampling frame's non-coverage of the population systematically excludes some segments of the population, poll results will suffer from coverage bias. For example, random digit dialing excludes those population members who do not have telephones, a group that is not evenly distributed within the population since it is most likely comprised of individuals at the lower end of the socioeconomic spectrum. A telephone directory sampling frame yields still more coverage error and bias, since it excludes not only those population members who do not have telephones, but also those who do but have unlisted numbers. This second excluded group is also not likely to be evenly distributed within the population; for example, the burgeoning "cell-phone only" sector, whose phone numbers are unlisted by default, draws disproportionatly from the younger segments of the population.
While coverage bias is typically associated with undercoverage, or the exclusion of one or more segments of the population, it is also possible for a sample to suffer from overcoverage -- that is, the inclusion of individuals who do not strictly belong in the population of interest. For example, pre-election polls in the United States sometimes use a sampling frame that includes all adult Americans regardless of whether they're registered or likely to vote. To the extent that the opinions of non-voters, who are disproportionately young, less-educated and non-affluent, differ systematically from voters', their inclusion skews the results and makes it difficult to use the for election forecasting and campaign strategy purposes. To avoid this problem, many polling organizations limit their pre-election poll samples to either registered voters or, increasingly, to "likely voters," whom they typically identify with a battery of questions at the start of the poll about past voting behavior and levels of political interest.[1]
Whereas coverage bias stems from pollsters' sampling frame choices, non-response bias is caused by respondents' decision whether or not to participate in polls. Since some people do not answer calls from strangers or refuse to respond to polls, samples may lack population representativeness despite pollsters' best efforts to construct them appropriately. As with those excluded from participation due to the use of inappropriate sampling frames, the characteristics of the people who agree to be interviewed may be systematically different from those who decline. To the extent that this is the case, non-response bias ensues and contributes to inaccurate polling results.
Also known as "pseudo-opinions," nonattitudes refer to the propensity for respondents to express an opinion despite not actually having one. First identified by political scientist Philip Converse in 1964,[2] the problem of nonattitudes is a constant source of vexation for public opinion researchers.
There are a variety of steps pollsters can take to minimize error stemming from nonattitudes. The simplest fix is to offer a "no opinion" option among the questionnaire's response alternatives. Pollsters might also include screening questions at the start of a poll to gauge which respondents are unlikely to have a true attitude based on their self-reported level of passion or knowledge about the subject of the poll. Or, they might ask open-ended follow-up questions that push respondents to explain why they favor or oppose it. Finally, they can construct a "mushiness index" based on respondents' answers to a battery of questions regarding their level of interest, information, and opinion stability regarding the poll's subject.
A related source of inaccuracy in public opinion polling is respondent insincerity, or the expression of opinions that are not sincerely held. Often, this takes the form of social desirability response bias (SDRB), which refers to respondents' tendency to provide answers that, true or not, present them in the most socially acceptable light.
SDRB is frequently cited as a factor in the Bradley effect (also known as the Wilder effect or Bradley-Wilder effect) that is sometimes evident in elections featuring a black candidate running against a white opponent.[3]
Another potential source of inaccuracy in public opinion polling is the content of the questionnaire itself. Specifically, the wording of questions, the order in which they are asked, and the response alternatives that are made available to respondents can all influence the results of public opinion polls.
There are various ways in which the wording of a poll's questions can affect the accuracy of its results.
Perhaps the most blatant way in which question wording may induce inaccurate polling results is through the inclusion of leading questions, or those whose wording leads respondents to answer a certain way regardless of their true opinions.
Even when all of a poll's questions are optimally worded, it is possible for the order in which they are asked to bias its results. In the typical question order bias scenario, respondents' answers to questions early on in the poll affect their answers to subsequent questions, often because they want to avoid coming across as inconsistent or hypocritical.
Poll results might also be skewed by the method used to administer the poll.
Polls that are not self-administered -- that is, those in which responses are recorded by an interviewer rather than the respondent himself -- are also subject to interviewer effects, or inaccurate results due to the tendency for respondents to tailor their responses based on their perception of the interviewer's race, gender, or age.
An oft-quoted example of opinion polls succumbing to errors was the British election of 1992. Despite the polling organisations using different methodologies virtually all the polls in the lead up to the vote (and exit polls taken on voting day) showed a lead for the opposition Labour party but the actual vote gave a clear victory to the ruling Conservative party.
In their deliberations after this embarrassment the pollsters advanced several ideas to account for their errors, including:
The relative importance of these factors was, and remains, a matter of controversy, but since then the polling organisations have adjusted their methodologies and have achieved more accurate predictions in subsequent elections.
One controversial aspect of public opinion polling, especially when it comes to pre-election polling, is that by promulgating information about a population's voting intentions, polls can influence voting behavior. There are two principal ways in which this occurs: bandwagon effects and strategic voting.
A bandwagon effect occurs when a poll prompts voters to back the candidate who appears to be in the lead. The idea that voters are susceptible to such effects is old, stemming at least from 1884; Safire (1993: 43) reported that it was first used in a political cartoon in the magazine Puck in that year. It has also remained persistent in spite of a lack of empirical corroberation until the late 20th century. George Gallup spent much effort in vain trying to discredit this theory in his time by presenting empirical research. A recent meta-study of scientific research on this topic indicates that from the 1980's onward the Bandwagon effect is found more often by researchers (Irwin & van Holsteyn 2000).
The opposite of the bandwagon effect is the Underdog effect. It is often mentioned in the media. This occurs when people vote, out of sympathy, for the party perceived to be 'losing' the elections. There is less empirical evidence for the existence of this effect than there is for the existence of the Bandwagon effect (Irwin & van Holsteyn 2000).
The second category of theories on how polls directly affect voting is called strategic or tactical voting. This theory is based on the idea that voters view the act of voting as a means of selecting a government. Thus they will sometimes not choose the candidate they prefer on ground of ideology or sympathy, but another, less-preferred, candidate from strategic considerations. An example can be found in the general election of 1997. Then Cabinet Minister, Michael Portillo's constituency of Enfield was believed to be a safe seat but opinion polls showed the Labour candidate Stephen Twigg steadily gaining support, which may have prompted undecided voters or supporters of other parties to support Twigg in order to remove Portillo. Another example is the Boomerang effect where the likely supporters of the candidate shown to be winning feel that s/he is "home and dry" and that their vote is not required, thus allowing another candidate to win.
These effects only indicate how opinion polls directly affect political choices of the electorate. Other effect can be found on journalists, politicians, political parties, civil servants etc. in, among other things, the form of media framing and party ideology shifts.