> Stable Video 3D (SV3D) is a generative model based on Stable Video Diffusion that takes in a still image of an object as a conditioning frame, and generates an orbital video of that object.
So can it actually output a 3D model? Or just images of what it thinks the object would look like from other angles?
The reference video (https://youtu.be/Zqw4-1LcfWg) says they use a NeRF / structure from motion, then create a mesh from the generated radiance field with marching cubes. This is also how most state-of-the-art text-to-object generators work now.
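
For anyone wondering what the "radiance field to mesh" step looks like, here is a rough Python sketch of the idea. It is not SV3D's code and not full marching cubes: it only does the vertex-placement part (find grid edges where the density crosses a threshold and linearly interpolate the crossing point) and skips the triangle lookup table. The `density` function is a made-up stand-in for a trained NeRF (a unit sphere), and `surface_points`, the grid bounds, and the threshold are all illustrative assumptions.

```python
import math


def density(x, y, z):
    # Hypothetical stand-in for a trained radiance field's density:
    # positive inside a unit sphere, negative outside.
    return 1.0 - math.sqrt(x * x + y * y + z * z)


def surface_points(field, lo=-1.5, hi=1.5, n=16, level=0.0):
    """Sample `field` on an (n+1)^3 grid and return the points where the
    `level` iso-surface crosses a grid edge (linear interpolation).
    Marching cubes places its mesh vertices the same way, then connects
    them into triangles using a per-cell lookup table (omitted here)."""
    step = (hi - lo) / n
    coords = [lo + i * step for i in range(n + 1)]

    # Evaluate the field once at every grid corner.
    values = {}
    for i, x in enumerate(coords):
        for j, y in enumerate(coords):
            for k, z in enumerate(coords):
                values[(i, j, k)] = field(x, y, z)

    points = []
    for (i, j, k), a in values.items():
        # Check the three edges leaving this corner in +x, +y, +z.
        for di, dj, dk in ((1, 0, 0), (0, 1, 0), (0, 0, 1)):
            b = values.get((i + di, j + dj, k + dk))
            if b is None:
                continue  # edge would leave the grid
            if (a - level) * (b - level) < 0:  # sign change: surface crosses
                t = (level - a) / (b - a)  # fraction along the edge
                points.append((
                    coords[i] + t * step * di,
                    coords[j] + t * step * dj,
                    coords[k] + t * step * dk,
                ))
    return points


points = surface_points(density)
print(len(points), "surface points found")
```

In a real pipeline you would query the fitted NeRF for density instead of the toy sphere, and use a complete marching cubes implementation to get actual triangles rather than a point set.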