How to obtain the variance over a specified axis in pandas

Overview

The var() function in pandas computes the variance of the values along a specified axis of a DataFrame.

Mathematically, variance measures how far the values of a data set are spread out from their mean.

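The sample variance is given by the formula below:

$S^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n-1}$

Where:

$S^2$: The sample variance.
$x_i$: The i-th value in the data set.
$\bar{x}$: The mean of the values.
$n$: The number of values.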

Equivalently, the variance of a dataset is the square of its standard deviation. That is, the standard deviation is the square root of the variance.
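The formula and its relationship to the standard deviation can be checked by hand. The following is a minimal sketch that uses a small made-up Series (the values are hypothetical and chosen only for illustration) and compares a manual calculation against Series.var() and Series.std():

```python
import pandas as pd

# Hypothetical sample values, chosen only for illustration.
values = pd.Series([2.0, 4.0, 4.0, 6.0])

n = len(values)
mean = values.sum() / n

# Sum of squared deviations from the mean, divided by n - 1.
manual_variance = ((values - mean) ** 2).sum() / (n - 1)

pandas_variance = values.var()  # sample variance (ddof=1 by default)
pandas_std = values.std()       # sample standard deviation

print(manual_variance)
print(pandas_variance)
print(pandas_std ** 2)
```

All three printed values should agree up to floating-point rounding, since squaring the standard deviation recovers the variance.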

The var() function takes the following syntax:

DataFrame.var(axis=0, skipna=True, ddof=1, numeric_only=False, **kwargs)

The var() function takes the following optional parameter values:

axis: This represents the axis along which the variance is computed. A value of 0 or 'index' (the default) returns one variance per column, and a value of 1 or 'columns' returns one variance per row.
skipna: This takes a boolean value indicating whether NA or null values are to be excluded. It defaults to True.
ddof: This takes an int that represents the delta degrees of freedom. The divisor used in the calculation is N - ddof, where N is the number of values. It defaults to 1, which gives the sample variance.
numeric_only: This takes a boolean value indicating whether to include only float, int, or boolean columns.
**kwargs: These are additional keyword arguments that can be passed to the function.
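To see how these parameters change the result, here is a short sketch on a small made-up DataFrame. The column names a and b and their values are assumptions for illustration only:

```python
import numpy as np
import pandas as pd

# Hypothetical data for illustration only; column "b" has one missing value.
df = pd.DataFrame({
    "a": [1.0, 2.0, 3.0, 4.0],
    "b": [2.0, 4.0, np.nan, 8.0],
})

col_var = df.var()                 # axis=0: one variance per column
row_var = df.var(axis=1)           # axis=1: one variance per row
pop_var = df.var(ddof=0)           # divide by N instead of N - 1
strict_var = df.var(skipna=False)  # missing values are not skipped

print(col_var)
print(row_var)
print(pop_var)
print(strict_var)
```

With skipna=False, the variance of column b is NaN because the missing value is no longer excluded. With ddof=0, each result is the population variance, which is smaller than the default sample variance.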

When called on a DataFrame, the var() function returns a Series holding one variance per column (or one per row when axis is 1).
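The returned object can be inspected directly. The sketch below uses a hypothetical DataFrame with one text column (the names name, height, and weight are made up) and passes numeric_only=True so that only the numeric columns are considered:

```python
import pandas as pd

# Hypothetical data for illustration only; "name" is a non-numeric column.
df = pd.DataFrame({
    "name": ["x", "y", "z"],
    "height": [1.0, 2.0, 3.0],
    "weight": [10.0, 20.0, 40.0],
})

# Restrict the calculation to numeric columns.
result = df.var(numeric_only=True)

print(type(result).__name__)  # class name of the returned object
print(list(result.index))     # labels of the columns that were included
print(result["height"])       # variance of a single column
```

The index of the result holds the labels of the numeric columns, so an individual variance can be looked up by column name.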
