In deep learning models, convolutional filters are used to extract spatial features from the data. Odd-sized convolutional filters (such as 3×3 or 5×5) are the norm in these models, and most state-of-the-art architectures follow this convention. In this answer, we will dive into the “why” of this convention.
In deep learning, a convolution operation is used to encode the information of the input matrix in terms of filters or kernels. The mathematics behind the operation is simple: the convolutional filter is multiplied element-wise with the input matrix, and the products are summed to produce a single output value. The following figure highlights the working of the convolution operation.
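To complement the figure, here is a minimal NumPy sketch of a single convolution step; the patch and kernel values are illustrative, not from the original answer:

```python
import numpy as np

# A 3x3 input patch and a 3x3 filter (illustrative values).
patch = np.array([[1, 2, 3],
                  [4, 5, 6],
                  [7, 8, 9]])
kernel = np.array([[ 1, 0, -1],
                   [ 1, 0, -1],
                   [ 1, 0, -1]])

# Element-wise multiplication followed by a sum gives one output value.
output = np.sum(patch * kernel)
print(output)  # -6
```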
The same operation can be extended to a larger input matrix. In that case, however, the convolution operation cannot be applied all at once. Instead, the convolutional filter is applied over a series of windows that slide across the input.
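The sketch below illustrates this windowed application, assuming a stride of 1 and no padding; the `convolve2d` helper and its inputs are illustrative assumptions, not part of the original answer:

```python
import numpy as np

def convolve2d(image, kernel):
    """Slide the kernel over every window of the input (stride 1,
    no padding) and sum the element-wise products per window."""
    kh, kw = kernel.shape
    oh = image.shape[0] - kh + 1  # output height
    ow = image.shape[1] - kw + 1  # output width
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            out[i, j] = np.sum(image[i:i+kh, j:j+kw] * kernel)
    return out

image = np.arange(25).reshape(5, 5)  # a 5x5 input matrix
kernel = np.ones((3, 3))             # a 3x3 filter
print(convolve2d(image, kernel))     # a 3x3 output matrix
```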
We have briefly discussed the convolution operation and how it can be applied to matrices of different sizes. Now, we will talk about why odd-sized filters are preferred, which comes down to two properties: simplicity and symmetry.
Simplicity: The use of padding is very common in deep learning architectures. When adding padding to keep the output the same size as the input, the amount of padding required depends on the filter size.

Assuming that the stride is 1, the padding needed on each side of the input is:

padding_size = (filter_size - 1) / 2
From the above equation, we can see that even-sized filters result in fractional (non-integer) padding sizes, therefore causing aliasing errors. The code below can be used to test this.
```python
# Filter size is even.
filter_size = 6
padding_size = (filter_size - 1) / 2.0
print("Padding Size : " + str(padding_size))
```
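For contrast, the same computation with an odd filter size (here 5, as an illustrative value) produces a whole-number padding size:

```python
# Filter size is odd.
filter_size = 5
padding_size = (filter_size - 1) / 2.0
print("Padding Size : " + str(padding_size))  # Padding Size : 2.0
```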
Note: To get a better idea of padding, check out another Educative Answer on subsequent convolution layers.
Symmetry: Odd-sized filters are symmetric around a central element, and therefore, each filter can be centered on an anchor point in the input.
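A quick way to see this is to check which index sits at the filter's center: an odd size yields a whole-number center index, while an even size does not. The loop below is a small sketch of this index arithmetic (the sizes are illustrative):

```python
# For an odd size, (size - 1) / 2 is a whole number, so the
# filter has a single central element to anchor on.
for size in (3, 4, 5, 6):
    center = (size - 1) / 2
    print(f"size={size}: center index {center}, "
          f"unique center: {center == int(center)}")
```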
The use of odd-sized convolutional filters keeps the padding simple (an integer number of pixels on each side) and keeps the filter symmetric around its anchor point.