LocalLLaMA Reddit12/8|研究・論文プロダクト・サービス

1年かけてGPUを追加、ついに完成したローカルLLMシステム - 8x3090 (192GB VRAM) 64コアEPYC Milan 250GB RAM

Redditorが1年かけて8基のNVIDIA RTX 3090 GPUを組み込んだ大規模なローカルLLMシステムを完成させた。合計192GBのVRAMと64コアのEPYC Milanプロセッサを搭載し、約8万ドルをかけて構築した。GLM 4.5 AIモデルを使ってテストした結果、49 t/sの推論速度を記録した。今後は電力制限の調整やより高度なAWQモデルの検証を行う予定。

Save

Read original

Cached

Comments

No comments yet

Be the first to comment

Got lots of VRAM? Want to help a developer refine methods a…

1年かけてGPUを追加、ついに完成したローカルLLMシステム - 8x3090 (192GB VRAM) 64コアEPYC Milan 250GB RAM

Dive deeper

Related Articles

Got lots of VRAM? Want to help a developer refine methods a…

Devstral 123Bの調整可能化を要望

Best coding and agentic models - 96GB

Heretic Versions of TheDrummer AI Models Released

VRAM Advice? 24GB or 32GB for Starters

シャオミのMiMo-V2-Flashモデル(309B)が大手に躍進

Realistic Entry Point for a Good Local LLM Experience in 20…

Nvidia Introduces 'NitroGen': A Foundation Model for Genera…

Kimi k2 thinking vs GLM 4.6

Strix Halo with eGPU

AI Curator

Ask me anything about AI