10182

Automatically Selecting Histogram Bins

Choosing the bin sizes for a histogram can be surprisingly tricky. If there are too few bins, it is hard to pick out the underlying distribution of the data. If there are too many bins, the result is either unpleasant to look at because the bins have deteriorated into sticks or noise in the data is not sufficiently averaged out, also making it hard to see the underlying distribution. Here we present several methods for selecting (uniform-width) bins for a histogram.
Fixed number of bins: always use the same number of bins, regardless of the data.
Sturges: the number of bins grows with the log of the size of the data.
Scott: the bin width is proportional to the standard deviation of the values divided by the cube root of the size of the data.
Freedman–Diaconis: the bin width is proportional to the interquartile range of the data divided by the cube root of the size of the data.
Wand: the bin width is chosen to minimize the mean integrated squared error.

DETAILS

By default Mathematica rounds bin widths to "nice" values, minimizing some of the differences between the various binning methods.
D. Freedman and P. Diaconis, "On the Histogram as a Density Estimator: Theory," Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebeite, 57, 1981 pp. 453–476.
D. W. Scott, "On Optimal and Data-Based Histograms," Biometrika, 66(3), 1979 pp. 605–610.
H. A. Sturges, "The Choice of a Class Interval," Journal of the American Statistical Association, 21(153), 1926 pp. 65–66.
M. P. Wand, "Data-Based Choice of Histogram Bin Width," The American Statistician, 51(1), 1997 pp. 59–64.

PERMANENT CITATION

 Share: Embed Interactive Demonstration New! Just copy and paste this snippet of JavaScript code into your website or blog to put the live Demonstration on your site. More details » Download Demonstration as CDF » Download Author Code »(preview ») Files require Wolfram CDF Player or Mathematica.

Related Topics

 RELATED RESOURCES
 The #1 tool for creating Demonstrations and anything technical. Explore anything with the first computational knowledge engine. The web's most extensive mathematics resource. An app for every course—right in the palm of your hand. Read our views on math,science, and technology. The format that makes Demonstrations (and any information) easy to share and interact with. Programs & resources for educators, schools & students. Join the initiative for modernizing math education. Walk through homework problems one step at a time, with hints to help along the way. Unlimited random practice problems and answers with built-in Step-by-step solutions. Practice online or make a printable study sheet. Knowledge-based programming for everyone.