I want to host some LLMs locally and run more advanced models. Since new hardware is out of the question, I think I should be able to pull something off by buying some yesteryear equipment on eBay etc. Has anybody attempted such a project? Does it scale horizontally? (I.e., can I connect two boxes to overcome single-box slowness?)

  • tal@lemmy.today
    15 hours ago

    If you can wait until 2028, when memory prices are expected to drop, and you’d be open to buying new hardware at that point, I’d seriously consider waiting. There will probably also be better hardware and better models by then.