> Stable Video 3D (SV3D) is a generative model based on Stable Video Diffusion t...

krebby · on March 19, 2024

The reference video (https://youtu.be/Zqw4-1LcfWg) says they use a NeRF / structure from motion and then create a mesh with marching cubes from the generated radiance field. This is how most state-of-the-art text-to-object generators work now as well.
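
For anyone wondering what the marching cubes step looks like in practice, here is a rough sketch, not SV3D's actual code. It assumes you already have some function that returns density at 3D points; `toy_density` below is a made-up stand-in (a soft sphere) for a trained radiance field, and `extract_mesh` and `to_obj` are hypothetical helper names. The only real library call is `skimage.measure.marching_cubes` from scikit-image.

```python
import numpy as np
from skimage import measure  # scikit-image


def toy_density(points):
    """Hypothetical stand-in for a radiance field's density query.

    Returns values near 1 inside a sphere of radius 0.5 and near 0 outside.
    """
    r = np.linalg.norm(points, axis=-1)
    return 1.0 / (1.0 + np.exp(20.0 * (r - 0.5)))


def extract_mesh(density_fn, resolution=64, bound=1.0, level=0.5):
    """Sample density on a regular grid in [-bound, bound]^3 and run marching cubes."""
    axis = np.linspace(-bound, bound, resolution)
    grid = np.stack(np.meshgrid(axis, axis, axis, indexing="ij"), axis=-1)
    volume = density_fn(grid)  # shape: (resolution, resolution, resolution)

    step = 2.0 * bound / (resolution - 1)
    verts, faces, _normals, _values = measure.marching_cubes(
        volume, level=level, spacing=(step, step, step)
    )
    verts = verts - bound  # shift from grid coordinates back to world coordinates
    return verts, faces


def to_obj(verts, faces):
    """Serialize the mesh as Wavefront OBJ text (face indices are 1-based)."""
    lines = [f"v {x:.6f} {y:.6f} {z:.6f}" for x, y, z in verts]
    lines += [f"f {a + 1} {b + 1} {c + 1}" for a, b, c in faces]
    return "\n".join(lines) + "\n"


verts, faces = extract_mesh(toy_density)
obj_text = to_obj(verts, faces)
```

With a real model you would swap `toy_density` for a density query against the trained field and pick `level` by inspection, since the right threshold depends on how the field was trained.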

2StepsOutOfLine · on March 19, 2024

I'm also struggling to find any examples of how to actually get a 3D model output. Very few references to this capability outside of the blog post.