What is the Pearson Correlation Coefficient? Pearson Correlation Coefficient Explained
The Pearson correlation coefficient, also known as Pearson’s r or simply correlation coefficient, is a statistical measure that quantifies the linear relationship between two variables. It measures the strength and direction of the linear association between two continuous variables.
The Pearson correlation coefficient, denoted by the symbol “r," ranges between -1 and +1. The value of r indicates the following:
Positive Correlation: If r is close to +1, it indicates a strong positive correlation. This means that as one variable increases, the other variable tends to increase as well. The variables move in the same direction.
Negative Correlation: If r is close to -1, it indicates a strong negative correlation. This means that as one variable increases, the other variable tends to decrease. The variables move in opposite directions.
No Correlation: If r is close to 0, it indicates no or very weak linear correlation between the variables. The variables do not show a clear linear relationship.
The calculation of the Pearson correlation coefficient involves the following steps:
Data Preparation: Ensure that both variables are continuous and properly scaled. If needed, standardize the variables to have zero mean and unit variance.
Compute Means: Calculate the means (average values) of both variables.
Compute Deviations: Calculate the deviations of each data point from its mean for both variables.
Compute Covariance: Multiply the deviations of the two variables and take their average (covariance). Covariance measures how the variables vary together.
Compute Standard Deviations: Calculate the standard deviations of both variables.
Compute Correlation Coefficient: Divide the covariance by the product of the standard deviations to obtain the Pearson correlation coefficient.
The formula for the Pearson correlation coefficient (r) is:
r = (covariance of X and Y) / (standard deviation of X * standard deviation of Y)
The Pearson correlation coefficient is widely used in various fields, including statistics, social sciences, finance, and data analysis. It helps in understanding the relationship between variables, assessing the strength of associations, and identifying potential dependencies.
It’s important to note that the Pearson correlation coefficient measures only the linear relationship between variables. It may not capture nonlinear associations between variables. In such cases, alternative correlation measures, such as Spearman’s rank correlation or Kendall’s tau, may be more appropriate.
SoulPage uses cookies to provide necessary website functionality, improve your experience and analyze our traffic. By using our website, you agree to our cookies policy.
This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.