The main novelty seems to be an extra layer of indirection with the prior network (whether it is an autoregressive transformer or a diffusion network), which predicts an image embedding based on the ...
This repository contains the implementation for the paper: MeshGPT: Generating Triangle Meshes with Decoder-Only Transformers by Yawar Siddiqui, Antonio Alliegro, Alexey Artemov, Tatiana Tommasi, ...
We propose DPCrossU-Net, a dual-branch parallel encoder–decoder network that integrates convolutional and Vision Transformer representations. The encoder employs parallel CNN and ViT branches with a ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results