⚡ AI Lessons

Dev.to · Vincent Du
🧠 Large Language Models
5mo ago
How to Run MLPerf Llama 2 70B Training on AMD MI325X Without SLURM
A practical guide to running MLPerf Training v5.1 on a multi-node AMD MI325X cluster without SLURM, achieving near-linear scaling.

Dev.to · Vincent Du
☁️ DevOps & Cloud
5mo ago
Building a File Copier 4x Faster Than cp Using io_uring
How I used Linux io_uring to build a file copier that's 4.2x faster than cp for ML datasets, with lessons on when async I/O helps and when it doesn't.
DeepCamp AI