International Journal of Computer Vision | 2019

Representation Learning on Unit Ball with 3D Roto-translational Equivariance

Abstract

Convolution is an integral operation that defines how the shape of one function is modified by another function. This powerful concept forms the basis of hierarchical feature learning in deep neural networks. Although performing convolution in Euclidean geometries is fairly straightforward, its extension to other topological spaces—such as a sphere ( $$\\mathbb {S}^2$$ S 2 ) or a unit ball ( $$\\mathbb {B}^3$$ B 3 )—entails unique challenges. In this work, we propose a novel ‘ volumetric convolution ’ operation that can effectively model and convolve arbitrary functions in $$\\mathbb {B}^3$$ B 3 . We develop a theoretical framework for volumetric convolution based on Zernike polynomials and efficiently implement it as a differentiable and an easily pluggable layer in deep networks. By construction, our formulation leads to the derivation of a novel formula to measure the symmetry of a function in $$\\mathbb {B}^3$$ B 3 around an arbitrary axis, that is useful in function analysis tasks. We demonstrate the efficacy of proposed volumetric convolution operation on one viable use case i.e., 3D object recognition.

Volume 128

International Journal of Computer Vision | 2019

Representation Learning on Unit Ball with 3D Roto-translational Equivariance

Abstract

Volume 128

Pages 1612-1634

DOI 10.1007/s11263-019-01278-x

Language English

Journal International Journal of Computer Vision

Full Text