mattnewton a day ago

Hi HN, we're releasing weights for our latest text to image model and publishing this writeup on how we trained it in quite a bit of depth.

I hope there is something in the report for everyone, we included a fair bit on the actual training and data infrastructure usually not written about much, that I think will be interesting to people here. There's more that didn't fit, happy to answer questions!

  • ttul 3 hours ago

    This is a massive technical report for an open weights image gen model. As someone who has followed this space closely, it’s really cool to read about the behind-the-scenes experimentation and effort that went into the final product. I hope you will release some of the find tuning tools so the community can experiment with them as well and really push what the model’s capable of.

    • mattnewton 5 minutes ago

      You can find some links and details in the GitHub readme for finetuning / LoRA support. Ostiris, musubi tuner, fal and hugging face diffusers are all day-0 supported :) https://github.com/krea-ai/krea-2

      We recommend training off the undistilled, Raw checkpoint, and then applying the LoRA to the Turbo model for inference.

BoredPositron 40 minutes ago

It's a good model sadly the use of the qwen vae is a bit of a downer.

  • mobiuscog 39 minutes ago

    It's been mentioned by some that using the wan2.1 vae instead solves this. I haven't personally had time to try yet.