Automatically Selecting Histogram Bins

Initializing live version

Choosing the bin sizes for a histogram can be surprisingly tricky. If there are too few bins, it is hard to pick out the underlying distribution of the data. If there are too many bins, the result is either unpleasant to look at because the bins have deteriorated into sticks or noise in the data is not sufficiently averaged out, also making it hard to see the underlying distribution. Here we present several methods for selecting (uniform-width) bins for a histogram.

[more]

Contributed by: Brett Champion (December 2008)
Open content licensed under CC BY-NC-SA

Snapshots

Details

By default Mathematica rounds bin widths to "nice" values, minimizing some of the differences between the various binning methods.

D. Freedman and P. Diaconis, "On the Histogram as a Density Estimator: Theory," Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebeite, 57, 1981 pp. 453–476.

D. W. Scott, "On Optimal and Data-Based Histograms," Biometrika, 66(3), 1979 pp. 605–610.

H. A. Sturges, "The Choice of a Class Interval," Journal of the American Statistical Association, 21(153), 1926 pp. 65–66.

M. P. Wand, "Data-Based Choice of Histogram Bin Width," The American Statistician, 51(1), 1997 pp. 59–64.

Permanent Citation

Brett Champion "Automatically Selecting Histogram Bins"
http://demonstrations.wolfram.com/AutomaticallySelectingHistogramBins/
Wolfram Demonstrations Project
Published: December 6 2008


Feedback (field required)

Email (field required)	Name

Occupation	Organization

Note: Your message & contact information may be shared with the author of any specific Demonstration for which you give feedback. Send

Automatically Selecting Histogram Bins

Snapshots

Details

Related Links

Permanent Citation