amram_art's comments

amram_art · 2025-12-21T16:00:00 1766332800

The problem is not in the image models rather the training data and its context. "British museum" for MJ is the image source, "British museum" is the setting for Nano Banana.

amram_art · 2025-07-26T14:53:56 1753541636

https://lore.kernel.org/lkml/20250725175358.1989323-2-sashal... part2 https://lore.kernel.org/lkml/20250725175358.1989323-3-sashal... part 3

amram_art · on Nov 27, 2024

It is licensed under Apache 2.0 license. The model is capable of generating 6-second videos at 720p resolution and 15 FPS based on prompt and image. Architecture is a 175M parameter VideoVAE and a 2.8B parameter VideoDiT model, which uses only 9.3 GB of GPU memory in BF16 mode with CPU offloading.