Statistics
COVAR_SAMP
Overview
The COVAR_SAMP()
aggregate function calculates the sample covariance between two sets of number pairs. This function measures how changes in one variable relate linearly to changes in another variable within a sample dataset.
Syntax
The syntax for this function is as follows:
Parameters
y
: variable being predictedx
: variable used for prediction
Example
For the needs of this section, we’re going to use a simplified version of the film
table from the Pagila database, containing only the title
, length
and rating
columns. The complete schema for the film
table can be found on the
Pagila database website.
The query below query uses the COVAR_SAMP()
function to calculate the sample covariance between film length
and rating
where rating
is greater than or equal to 4:
By running the above query will get the following output: