Visualizing R-Squared in Statistics
In general, $R^2$ is often referred to as the coefficient of determination, and its verbal interpretation is that it is the fraction of variation explained by the model. This Demonstration illustrates that interpretation in the case of simple linear regression.
A random sample of size $n$ is generated from a bivariate normal distribution with correlation parameter $\rho$, means 0, and variances 1. The first plot, graph 1, shows the data and the fitted regression line. Next, graph 2 shows the data and the fitted points. In graph 3, a rug is added on each of the axes. The axis on the left with the blue rug shows the $y$ values; the axis on the right with the red rug shows the fitted values $\hat{y}$.
The plot label shows $R^2 = r^2 = S_{\hat{y}} / S_y$, where $r$ is the correlation coefficient, $S_{\hat{y}}$ is the sum of the squared deviations of the $\hat{y}$'s from their mean, and $S_y$ is the sum of the squared deviations of the $y$'s from their mean. In graph 3, the rugs provide a visualization of the spread of the $y$'s and $\hat{y}$'s.
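The identity behind the plot label can be checked numerically. The following is a minimal sketch, not the Demonstration's own code: it assumes the sample is built by mixing two independent standard normals, and the helper names (`bivariate_normal_sample`, `sum_sq_dev`, `r_squared_two_ways`) and the values `n=50`, `rho=0.7`, `seed=1` are hypothetical choices made for illustration.

```python
import math
import random


def bivariate_normal_sample(n, rho, seed):
    """Draw n pairs (x, y) with means 0, variances 1, and correlation rho."""
    rng = random.Random(seed)
    xs, ys = [], []
    for _ in range(n):
        z1, z2 = rng.gauss(0, 1), rng.gauss(0, 1)
        xs.append(z1)
        # Mixing two independent standard normals gives corr(x, y) = rho.
        ys.append(rho * z1 + math.sqrt(1 - rho**2) * z2)
    return xs, ys


def sum_sq_dev(values):
    """Sum of squared deviations of the values from their mean."""
    m = sum(values) / len(values)
    return sum((v - m) ** 2 for v in values)


def r_squared_two_ways(xs, ys):
    """Return (r**2, S_fit / S_y) for the least-squares line of y on x."""
    n = len(xs)
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx, s_yy = sum_sq_dev(xs), sum_sq_dev(ys)
    slope = s_xy / s_xx
    intercept = y_bar - slope * x_bar
    fitted = [intercept + slope * x for x in xs]  # the fitted values
    r = s_xy / math.sqrt(s_xx * s_yy)             # sample correlation
    return r**2, sum_sq_dev(fitted) / s_yy


xs, ys = bivariate_normal_sample(n=50, rho=0.7, seed=1)
r2, ratio = r_squared_two_ways(xs, ys)
print(f"r^2 = {r2:.6f}, S_fit / S_y = {ratio:.6f}")
```

The two printed numbers should agree up to floating-point rounding, which is the numerical counterpart of comparing the spread of the red rug with the spread of the blue rug.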
In statistics courses not based on calculus, it is often mentioned when discussing simple linear regression that $R^2 = r^2$ and that $R^2$ is the fraction of variation explained by the model. This Demonstration is especially useful for explaining the concept to these students.
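For readers who want to see why the squared correlation equals the fraction of variation explained, a short algebraic sketch follows. The notation is an assumption introduced here, not taken from the Demonstration: $b$ is the least-squares slope, $S_{xx} = \sum (x_i - \bar{x})^2$, $S_{xy} = \sum (x_i - \bar{x})(y_i - \bar{y})$, $S_y = \sum (y_i - \bar{y})^2$, $S_{\hat{y}}$ is the sum of squared deviations of the fitted values from their mean, and $r = S_{xy} / \sqrt{S_{xx} S_y}$. It also uses the fact that the fitted values of a least-squares line with an intercept have mean $\bar{y}$.

```latex
\begin{align*}
\hat{y}_i &= \bar{y} + b\,(x_i - \bar{x}), \qquad b = \frac{S_{xy}}{S_{xx}}, \\
S_{\hat{y}} &= \sum_{i=1}^{n} (\hat{y}_i - \bar{y})^2 = b^2 S_{xx} = \frac{S_{xy}^2}{S_{xx}}, \\
R^2 &= \frac{S_{\hat{y}}}{S_y} = \frac{S_{xy}^2}{S_{xx}\,S_y} = r^2 .
\end{align*}
```

Only algebra is needed, so the argument is accessible in a course that does not use calculus.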
Notice how the plot changes dramatically as the parameter $\rho$ is changed. You may also wish to experiment with different sample sizes by changing $n$, or with different random samples by changing the seed.
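The same experiment can be approximated without the interactive controls. The sketch below is an illustration under stated assumptions rather than the Demonstration's implementation: the function name `sample_r_squared`, the grid of correlation values, the sample size of 50, and the seeds 0 through 4 are all hypothetical choices.

```python
import math
import random


def sample_r_squared(n, rho, seed):
    """Sample r**2 from n bivariate normal pairs with correlation rho."""
    rng = random.Random(seed)
    xs = [rng.gauss(0, 1) for _ in range(n)]
    # y = rho*x + sqrt(1 - rho^2)*z has variance 1 and correlation rho with x.
    ys = [rho * x + math.sqrt(1 - rho**2) * rng.gauss(0, 1) for x in xs]
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    s_yy = sum((y - y_bar) ** 2 for y in ys)
    return s_xy**2 / (s_xx * s_yy)


# Sweep the correlation parameter; repeat each setting with several seeds.
for rho in (0.0, 0.3, 0.6, 0.9):
    values = [sample_r_squared(n=50, rho=rho, seed=s) for s in range(5)]
    print(f"rho = {rho:.1f}, rho^2 = {rho**2:.2f}, sample R^2:",
          " ".join(f"{v:.2f}" for v in values))
```

Across seeds, the sample values scatter around the squared correlation parameter, and the scatter narrows as the sample size grows.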