• Viri4thus@feddit.org
    link
    fedilink
    English
    arrow-up
    2
    ·
    5 hours ago

    Ty. I’ll try ollama with the Q-4-M quantization. I wouldn’t expect to see a difference between ollama and SGlang.