What is the difference between covariance and correlation?

Covariance and correlation both measure how two random variables behave in conjunction with one another, but correlation is a normalized measure scaled to the interval [-1, 1], while covariance is unbounded. Positive covariance or correlation implies that as the value of one random variable increases, the values of the other also increase. Negative implies that as the value of one random variable increases, those of the other decrease, or vice versa.

For two random variables X and Y, Covariance is found by:

where and correspond to the sample means of X and Y, respectively.

Correlation is then found by:

where and refer to the standard deviations of X and Y, respectively. 

Author

Help us improve this post by suggesting in comments below:

– modifications to the text, and infographics
– video resources that offer clear explanations for this question
– code snippets and case studies relevant to this concept
– online blogs, and research publications that are a “must read” on this topic

Leave the first comment

Partner Ad
Find out all the ways that you can
Contribute