Open In App

How to Calculate Point-Biserial Correlation in Excel?

Last Updated : 11 Sep, 2025
Comments
Improve
Suggest changes
Like Article
Like
Report

The Point-Biserial Correlation Coefficient is a statistical measure that evaluates the strength and direction of the relationship between a binary variable and a continuous variable. This correlation is particularly useful when you want to understand how a dichotomous classification(like gender, success/failure, yes/no) relates to a measurable quantity (like test scores or revenue).

Point-Biserial Correlation Coefficient

The point-biserial correlation coefficient, like the Pearson correlation coefficient, has a value between -1 and 1 where:

  • A correlation between two variables that is entirely negative is represented by the number -1.
  • 0 means that there is no connection between the two variables.
  • A correlation coefficient of 1 denotes a totally positive relationship between two variables.

This will demonstrate how to compute the point-biserial correlation between two variables. It only accepts two value ranges as arguments.

= CORREL ( Variable1, Variable2 )

Variables 1 and 2 are the two variables for which you wish to compute the Point-Biserial Correlation.

Example 1: Assume we have a binary variable, x, and a continuous variable, y:

dataset
Binary and continuous variables used for point-biserial correlation.

We can easily use the =CORREL() method to determine the point-biserial correlation between x and y.

Using-correl-function
=CORREL() formula used to compute correlation in Excel

The point-biserial correlation between x and y is 0.242811. Although this number is positive, it implies that when the variable x is set to "1," the variable y tends to take on greater values than when the variable x is set to "0." This is simply demonstrated by computing the average value of y when x is 0 and when x is 1.

using-averageif-function
Point-biserial correlation result: 0.242811.

The average value of y for x = 0 is 14.6. The average value of y for x = 1 is 17.75. This confirms that the two variables' point-biserial correlation should be positive.

Example 2: Assume we have a continuous variable y, and a binary variable x:

dataset
Average y-values for x = 0 and x = 1.

We can simply find the point-biserial correlation between x and y using the =CORREL() method:

using-correl-function
Point-biserial correlation output

The point-biserial correlation between x and y is 0.38833. Although this number is positive, it means that when variable x is set to "1," variable y tends to take on larger values than when variable x is set to "0." Simply compute the average value of y when x is 0 and when x is 1 to illustrate this:

using-averageif-function
Computed correlation: 0.38833 using =CORREL().

For x = 0, the average value of y is 23.5. For x = 1, the average value of y is 35.983. This shows that the point-biserial correlation between the two variables should be positive.


Explore