We replace the conventional isotropic
Gaussian diffusion training and sampling procedure with a novel nonisotropic formulation that accounts for joint relations directly in the
generation process.
More in detail, in denoising diffusion models the forward diffusion process employed during training adds Gaussian noise to the data at each diffusion timestep. In conventional diffusion approaches, such noise is sampled from a Gaussian distribution with diagonal covariance Σt, hence the process is defined as isotropic.
The isotropic formulation does not take into account that the HMP problem is defined by the skeleton kinematic graph (given by the adjacency matrix A).
We exploit this knowledge to define a fixed, non-diagonal noise covariance for the diffusion process, based on correlations
ΣN extracted from the adjacency matrix.
From here we derive all necessary equations for diffusion training and sampling. To the best of our knowledge, this is the first nonisotropic formulation for a structured problem.
The correlated noise easies the generation task for the denoiser network: body joints are not assumed independet from each other (i.i.d) and the noise suggests that connected joints should be diffused similarly.
As shown in our experiments, the nonisotropic formulation achieves better performance than the isotropic approach, requires fewer parameters and comes at no extra computational cost.