International Journal of Computer Vision | 2019

Representation Learning on Unit Ball with 3D Roto-translational Equivariance

 
 
 
 

Abstract


Convolution is an integral operation that defines how the shape of one function is modified by another function. This powerful concept forms the basis of hierarchical feature learning in deep neural networks. Although performing convolution in Euclidean geometries is fairly straightforward, its extension to other topological spaces—such as a sphere ( $$\\mathbb {S}^2$$ S 2 ) or a unit ball ( $$\\mathbb {B}^3$$ B 3 )—entails unique challenges. In this work, we propose a novel ‘ volumetric convolution ’ operation that can effectively model and convolve arbitrary functions in $$\\mathbb {B}^3$$ B 3 . We develop a theoretical framework for volumetric convolution based on Zernike polynomials and efficiently implement it as a differentiable and an easily pluggable layer in deep networks. By construction, our formulation leads to the derivation of a novel formula to measure the symmetry of a function in $$\\mathbb {B}^3$$ B 3 around an arbitrary axis, that is useful in function analysis tasks. We demonstrate the efficacy of proposed volumetric convolution operation on one viable use case i.e., 3D object recognition.

Volume 128
Pages 1612-1634
DOI 10.1007/s11263-019-01278-x
Language English
Journal International Journal of Computer Vision

Full Text