I want to host some LLM’s locally and use more advanced models. Since new hardware is out of the question, I think I should be able to pull something off buying some yesteryear equipment on ebay etc. Did anybody attempt such a project? Does it scale horizontally? (I.e. can I connext two boxes to overcome single box slowness?)

  • civ@lemmy.civl.cc
    link
    fedilink
    English
    arrow-up
    5
    ·
    18 hours ago

    I’m running Qwen 3.6 35B A3B on my Ryzen 8700g and it runs pretty well, but the bigger problem there is probably the cost of RAM