toebee · on April 22, 2025

Thanks for the kind words :)))

toebee · on April 22, 2025

We'll try to give a high-level overview when we publish the technical report!

toebee · on April 22, 2025

Thank you for the kind words! We don't have plans for that yet, but you can always open an issue or PR on GitHub.

toebee · on April 22, 2025

We have a ZeroGPU Space provided by Hugging Face up and running! Try it now at https://huggingface.co/spaces/nari-labs/Dia-1.6B

daemonologist · on April 22, 2025

The examples on your site are impressive, but I'm having trouble getting good results on HF - it's generating a lot of near-silence (often nothing but) and when it does produce speech it bears no resemblance to the audio prompt and only produces parts of the text prompt. Would you suggest any adjustments to the default parameters to improve adherence, or might I expect better results running locally? Thanks!

toebee · on April 22, 2025

Thank you for the contribution! We'll be merging PRs and cleaning code up very soon :)

toebee · on April 22, 2025

Sorry for the confusion. The license is plain Apache 2.0, and we changed the wording to "intended for research and educational use." The point was that users are free to use it for their own use cases; just don't do shady stuff with it.

Thanks for the feedback :)

crooked-v · on April 22, 2025

So is that actually part of the license (making it non-Apache 2.0), or not?

toebee · on April 22, 2025

Not part of the license!

toebee · on April 21, 2025

We are in the process of fixing it! Thanks for letting us know :)

toebee · on April 21, 2025

We use the Descript Audio Codec (DAC)! I’m not sure if DAC works on iOS…

toebee · on April 21, 2025

Thank you for the kind words! Dia wasn’t fine-tuned on a specific speaker, so you will get a random voice every time you run it unless you add a prompt or fix the seed.

The outputs are a bit unstable; we might need to add cleaner training data and train for longer. Hopefully we can do something like OpenAI's Whisper and release better-performing checkpoints over time!

toebee · on April 21, 2025

Thank you!! Indeed, the script was inspired by a scene in The Office.