> most models I've seen are 300GB+ and require significant computational resourc...

inhumantsar · on May 17, 2023

Even if you're a more lightweight model, it's still not very practical to require a dedicated 24GB GPU for every active gamer, whether local or cloud hosted.

For all intents and purposes, it's as much of a non-starter in a production game as the multiple A100 scenario.

Of course that isn't going to remain the case for long as the recent advancements in optimization make their way into live systems, but still.

theaiquestion · on May 17, 2023

> it's still not very practical to require a dedicated 24GB GPU

totally agreed, you could get away with 12GB too which is in the midrange.

That said yeah it's still not something you could make a game with yet, I'm just pointing out 300GB+ of VRAM isn't the bar for entry here, it is reachable for medium-high end consumers but that's not really including the games resources either, and most gamers aren't medium-high end so...

nm980 · on May 18, 2023

> EDIT: It actually says in the readme they plan to support StableLM which is interesting because at least at the moment that's not a well performing model

I chose StableLM because that's the only other model I knew of besides ChatGPT. I'm open to adding support for other models after I fix some bugs first.

theaiquestion · on May 18, 2023

You might consider supporting ooba's api which would give you a lot of support for different things really quickly.

https://github.com/oobabooga/text-generation-webui/

nullsense · on May 18, 2023

Yeah, I second this. I use this frequently and lots of models downloaded that I test out with it. I'm keen to see a more API led approach.

AgentK20 · on May 17, 2023

Oh, fair enough. I hadn't been keeping up too much but hadn't realized they had progressed that far. I'll have to do some tinkering this evening.