Visualizing R-Squared in Statistics

Requires a Wolfram Notebook System

Interact on desktop, mobile and cloud with the free Wolfram CDF Player or other Wolfram Language products.

Requires a Wolfram Notebook System

Edit on desktop, mobile and cloud with any Wolfram Language product.

In general, is often referred to as the coefficient of determination and its verbal interpretation is that it is the fraction of variation explained by the model. This in fact is illustrated in our Demonstration in the case of simple linear regression.

[more]

A random sample of size is generated from a bivariate normal distribution with correlation parameter , means 0, and variances 1. The first plot, graph 1, shows the data and the fitted regression line. Next, graph 2 shows the data and the fitted points. In graph 3, a rug is added on each of the axes. The axes on the left with the blue rug show the values; the axes on the right with the red rug show the fitted values.

The plot label shows , where is the correlation coefficient, is the sum of the squared deviations of the 's from their mean, and is the sum of the squared deviations of the 's from their mean. In graph 3, the rugs provide a visualization of the spread of the 's and 's.

In statistics courses not based on calculus, it is often mentioned when discussing simple linear regression that and that is the fraction of variation explained by the model. This Demonstration is especially useful for explaining the concept to these students.

Notice how the plot changes dramatically as the parameter is changed. You may also wish to experiment with different sample sizes by changing or different random samples by changing the seed.

[less]

Contributed by: Ian McLeod (March 2011)
Open content licensed under CC BY-NC-SA

Permanent Citation

Ian McLeod

 Feedback (field required) Email (field required) Name Occupation Organization Note: Your message & contact information may be shared with the author of any specific Demonstration for which you give feedback. Send