oobabooga-text-generation-webui now supports EXLlama

Blaed@lemmy.world · edit-2 1 year ago

oobabooga-text-generation-webui now supports EXLlama

ArkyonVeil@lemmy.world · 1 year ago

It would be absolutely awesome, with infinite context length that would mean a much greater ease when it comes to handling models. I can be lazy and instead of creating a LORA, just use an entire book’s style as a reference right there in the prompt.

For programmers, just dump the entire codebase, or Documentation.

Of course, all this is only possible if VRAM is less of a bottleneck than it currently is, as well as the fact that it can reliably reference information on an arbitrarily large context. (Not much use having huge context if performance degrades, it loses its marbles or forgets key pieces of information along the way)

Blaed@lemmy.world · edit-2 1 year ago

I’m with you there. I love how Mosaic just fed the entire Great Gatsby to StoryWriter. This is the sort of context length I need in my life. Would make my projects so much easier. I don’t think we’re too far from having it on consumer hardware.

You should check out my latest post - which ironically addresses parts of your first comment, but you still need a lot of VRAM… 6000+ tokens context is now possible with ExLlama.

It’s crazy to see how fast these developments are happening!