I am trying to export to ONNX format a model who's traversed computation graph depends on certain calculations done on the data. The model is a variation of the Visual Transformer (VIT); In essence, ...
I have been playing around with converting diffmpm from the difftaichi package into a jax version, and while the forward pass has been working wonderfully, the backward pass has been using way too ...