Fisher introduced the null hypothesis by an example, the now famous Lady tasting tea experiment, as a casual wager. She claimed the ability to determine the means of tea preparation by taste. Fisher proposed an experiment and an analysis to test her claim. She was to be offered 8 cups of tea, 4 prepared by each method, for determination. He proposed the null hypothesis that she possessed no such ability, so she was just guessing. With this assumption, the number of correct guesses (the test statistic) formed a hypergeometric distribution. Fisher calculated that her chance of guessing all cups correctly was 1/70. He was provisionally willing to concede her ability (rejecting the null hypothesis) in this case only. Having an example, Fisher commented:[6]
"...the null hypothesis is never proved or established, but is possibly disproved, in the course of experimentation. Every experiment may be said to exist only in order to give the facts a chance of disproving the null hypothesis."
"...the null hypothesis must be exact, that is free from vagueness and ambiguity, because it must supply the basis of the 'problem of distribution,' of which the test of significance is the solution."
"We may, however, choose any null hypothesis we please, provided it is exact."
Regarding an alternative non-directional significance test of the Lady tasting tea experiment:
"For this purpose the new test proposed would be entirely inappropriate, and no experimenter would be tempted to employ it. Mathematically, however, it is as valid as any other, in that with proper randomisation it is demonstrable that it would give a significant result with known probability, if the null hypothesis were true."
Regarding which test of significance to apply:
"The notion that different tests of significance are appropriate to test different features of the same null hypothesis presents no difficulty to workers engaged in practical experimentation, but has been the occasion of much theoretical discussion among statisticians."
On selecting the appropriate experimental measurement and null hypothesis:
"This question, when the answer to it is not already known, can be fruitfully discussed only when the experimenter has in view, not a single null hypothesis, but a class of such hypotheses, in the significance of deviations from each of which he is equally interested."
^Stanley, Julian C. (1966). "The Influence of Fisher's "The Design of Experiments" on Educational Research Thirty Years Later". American Educational Research Journal. 3 (3): 223–229. doi:10.3102/00028312003003223. JSTOR1161806. S2CID145725524.
^OED, "null hypothesis," first usage: 1935 R. A. Fisher, The Design of Experiments ii. 19, "We may speak of this hypothesis as the 'null hypothesis'...the null hypothesis is never proved or established, but is possibly disproved, in the course of experimentation."
^The Design of Experiments (2 ed.). Edinburgh: Oliver and Boyd. 1937. The book was published in 9 editions from 1935 to 1971. The last two editions were published posthumously. The publisher of the 8th edition of 1966 was Hafner of Edinburgh. The publisher of the 9th edition of 1971 was Macmillan with an ISBN of 0-02-844690-9. A more recent publication was as part of Statistical methods, experimental design, and scientific inference by the Oxford University Press in 1990 with an ISBN of 0198522290. While pagination was inconsistent among editions, Fisher maintained consistent section numbering where feasible. The most relevant sections of the text are (Chapter II: The Principles of Experimentation, Illustrated by a Psycho-physical Experiment, Section 8: The Null Hypothesis) and (Chapter X: The Generalization of the Null Hypothesis).