Hardware for local inference?

droopy4096@lemmy.ca · 2 months ago

Hardware for local inference?

ryokimball@infosec.pub · 2 months ago

How old we talking? I personally wouldn’t go further back than 2000 series rtx. A friend has had good luck with Intel GPUs for ‘cheap’.

No, you absolutely cannot scale horizontally for speed. VRAM is king, with local RAM being swappable with major speed penalties. SSD is even slower than that and all those are orders of magnitude faster than ant Ethernet you’ll be connecting boxes together with. That’s not to say clustering isn’t an option, just that speed is going to be worse the more you scale out like that.

droopy4096@lemmy.ca · 2 months ago

I’ve got some circa 2010 cards laying about with a 32Gb server that already has 8Gb carved out for TrueNAS, so essentially I could squeeze 16-24Gb out of it, but it’s an older i5 Intel CPU

robber@lemmy.ml · 2 months ago

Your biggest issue with 2010 cards will be software (inference engine) support, I assume.

ffhein@lemmy.world · 2 months ago

2010 is ancient technology, according to wikipedia Nvidia released the 600 series in 2012… Even if there was some inference engine supporting it then lack of computational speed and memory bandwidth would probably make it not worth the effort.