📚 세현's Vault

🌍 도메인

🔮3D-Vision
🎨Rendering
🤖Robotics
🧠LLM
👁️VLM
🎬GenAI
🥽XR
🎮Simulation
🛠️Dev-Tools
💰Crypto
📈Finance
📋Productivity
📦기타

📄 Papers

📚전체 논문172

❯

❯

llama 32 1b in 4 bit runs at 60 tokssec with mlx swift on my iphone 15 pro its

llama-32-1b-in-4-bit-runs-at-60-tokssec-with-mlx-swift-on-my-iphone-15-pro-its

2024년 9월 26일1 min read

LLM
inference

Awni Hannun (@awnihannun)

2024-09-26 | ❤️ 2732 | 🔁 240

Llama 3.2 1B in 4-bit runs at ~60 toks/sec with MLX Swift on my iPhone 15 pro.

It’s quite good and easily runs on-device: https://x.com/awnihannun/status/1839330067039887622/video/1

미디어

video

Tags

그래프 뷰

Awni Hannun (@awnihannun)
미디어
Tags

백링크

domain-LLM

Created with Quartz v4.5.2 © 2026

GitHub
Sehyeon Park