Median Split

Splitting a continuous variable into two groups at its median is sometimes used in data analysis. On average, splitting a predictor variable is equivalent in correlation and regression to replacing all the values with either the mean value for the low group or the high group, as appropriate. Each data value is replaced by the weighted average , where is the mean for the group containing and is a weight value with . Moving the weight slider to the right moves the values toward a median split. Observe the decrease in , the -statistic, its -value, and the predictor's variance as the data is moved towards the equivalent of a median split. Much is lost and nothing gained by a median split.

It is interesting to observe the movement of points near each other and near the median. Moving the slider from left to right exaggerates the difference between those observations. At the same time, extreme observations are grouped together with observations near the median as the slider moves from left to right. Exaggerating the difference between observations that were originally close together while at the same time minimizing the differences between observations that were originally very far apart cannot possibly be a useful strategy for data analysis.
For further consideration of this example and other negative consequences of dichotomizing continuous variables, see:
J. R. Irwin and G. H. McClelland, "Negative Consequences of Dichotomizing Continuous Predictor Variables," Journal of Market Research, 40(3), 2003 pp. 366–371.
R. C. MacCallum, S. Zhang, K. J. Preacher and D. D. Rucker, "On the Practice of Dichotomization of Quantitative Variables," Psychological Methods, 7(1), 2002 pp. 19–40.
comments
 
Powered by Wolfram Mathematica
Give us your feedback
Give us your feedback

Source page:




 often  occasionally  never

Note: Please do not include anything you consider confidential or proprietary. Your message and contact information may be shared with the author of any specific Demonstration for which you give feedback, but will not otherwise be published or distributed.
Privacy Policy »

Note: To run this Demonstration you need the free
Mathematica Player
or Mathematica 7+
Download or upgrade to Mathematica Player 7
I already have Mathematica Player or Mathematica 7+