๐Ÿ“š ์„ธํ˜„'s Vault

๐ŸŒ ๋„๋ฉ”์ธ

  • ๐Ÿ”ฎ3D-Vision
  • ๐ŸŽจRendering
  • ๐Ÿค–Robotics
  • ๐Ÿง LLM
  • ๐Ÿ‘๏ธVLM
  • ๐ŸŽฌGenAI
  • ๐ŸฅฝXR
  • ๐ŸŽฎSimulation
  • ๐Ÿ› ๏ธDev-Tools
  • ๐Ÿ’ฐCrypto
  • ๐Ÿ“ˆFinance
  • ๐Ÿ“‹Productivity
  • ๐Ÿ“ฆ๊ธฐํƒ€

๐Ÿ“„ Papers

  • ๐Ÿ“š์ „์ฒด ๋…ผ๋ฌธ172
Home

โฏ

bookmarks

โฏ

serve 1000s of llms on a single gpu lorax by predibase allows users to serve

serve-1000s-of-llms-on-a-single-gpu-lorax-by-predibase-allows-users-to-serve

2025๋…„ 6์›” 29์ผ1 min read

  • LLM
  • AR
  • fine-tuning
  • inference

Akshay ๐Ÿš€ (@akshay_pachaar)

2025-06-29 | โค๏ธ 740 | ๐Ÿ” 152


Serve 1000s of LLMs on a Single GPU!

LoRAX by Predibase allows users to serve thousands of fine-tuned models on one GPU, reducing costs without compromising speed or performance.

(100% open-source) https://x.com/akshay_pachaar/status/1939300362495938779/photo/1


๐Ÿ”— Related

See similar notes in domain-llm, domain-dev-tools, domain-ai-ml

Tags

type-resource domain-llm, domain-dev-tools, domain-ai-ml


๊ทธ๋ž˜ํ”„ ๋ทฐ

  • Akshay ๐Ÿš€ (@akshay_pachaar)
  • ๐Ÿ”— Related
  • Tags

๋ฐฑ๋งํฌ

  • domain-LLM

Created with Quartz v4.5.2 ยฉ 2026

  • GitHub
  • Sehyeon Park