When small data beats big data

Abstract

Small data is sometimes preferable to big data. A high quality small sample can produce superior inferences to a low quality large sample. Data has acquisition, computation and privacy costs which require costs to be balanced against benefits. Statistical inference works well on small data but not so well on large data. Sometimes aggregation into small datasets is better than large individual-level data. Small data is a better starting point for teaching of Statistics.

Publication
Statistics & Probability Letters
Julian Faraway
Julian Faraway
Professor of Statistics

Professor of Statistics at the University of Bath

Related