📰 Dev.to · Papers Mache
37 articles · Updated every 3 hours · View all reads
All
Articles 95,923Blog Posts 112,576Tech Tutorials 24,165Research Papers 20,260News 15,375
⚡ AI Lessons

Dev.to · Papers Mache
📄 Paper
11h ago
AI reviewers fall for repackaging attacks
Minor presentation tweaks can inflate AI‑reviewer scores by more than a point on a ten‑point scale....

Dev.to · Papers Mache
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1d ago
Multilingual code gap exposed by Multi‑LCB
LLMs achieve high scores on Python coding tasks, yet their proficiency drops for the eleven other...

Dev.to · Papers Mache
📄 Paper
2d ago
AI/ML Research Digest — Jun 20, 2026
Persistent state and memory for embodied agents Linear‑temporal attention lets agents keep a running...

Dev.to · Papers Mache
📄 Paper
2d ago
Sparse KV Caches Cut Attention Scaling
Sparse key‑value caches collapse the quadratic blow‑up of softmax attention into a cost that grows...

Dev.to · Papers Mache
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
3d ago
Local Gradient Accumulation Speeds Training 1.7
PACI removes the bubbles that cripple asynchronous pipeline parallelism and shaves as much as 1.69×...

Dev.to · Papers Mache
📄 Paper
4d ago
Intra‑Model Routing Accelerates Speculative Decoding
Intra‑model routing trims token‑generation latency by roughly a third to almost a full 80 % compared...

Dev.to · Papers Mache
📄 Paper
1w ago
Aligning Hidden States Stabilizes LLM Distillation
Hidden‑representation alignment drives KL variance to exactly 0, turning on‑policy LLM distillation...

Dev.to · Papers Mache
📄 Paper
1w ago
8 FPS Real‑Time Video on Consumer GPU
MoVerse delivers 360° walkthrough video at roughly 8 FPS on a single RTX 4090, proving that...

Dev.to · Papers Mache
📄 Paper
1w ago
AI/ML Research Digest — Jun 13, 2026
Infrastructure and inference optimization for scale Sparse‑attention mechanisms cut the quadratic...

Dev.to · Papers Mache
📄 Paper
1w ago
Optimal Transport Converts Dense Layers to Sparse Experts
Differentiable optimal transport rewrites a dense feed‑forward layer into a balanced...

Dev.to · Papers Mache
📄 Paper
1w ago
90% Less Memory Enables Infinite Video Generation
A shared low‑rank cache slashes the memory footprint of autoregressive video diffusion by more than...

Dev.to · Papers Mache
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Linear Ensembles Can Erase LLM Watermarks
Watermarking schemes that embed distributional perturbations into LLM outputs are effectively broken...

Dev.to · Papers Mache
📄 Paper
1w ago
Benchmarks Evaluate Memory Quality and Adaptive Planning in LLM Agents
Newly released test suites expose two blind spots that have long lurked behind headline scores: how...

Dev.to · Papers Mache
📄 Paper
3w ago
AI/ML Research Digest — May 23, 2026
Extreme KV‑Cache Compression and Long‑Context Efficiency Static quantization is giving way...

Dev.to · Papers Mache
📄 Paper
3w ago
AI/ML Research Digest — May 30, 2026
Efficiency and Cost Reduction in LLM Agents Recent work tackles the high inference cost of LLM‑driven...

Dev.to · Papers Mache
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
1mo ago
KV cache eviction improves long‑context performance
A learned, globally‑calibrated KV‑cache eviction policy can shave memory usage and, paradoxically,...

Dev.to · Papers Mache
📄 Paper
1mo ago
Self-evolving retrieval lifts benchmark scores 25%
Agents that adapt their retrieval configurations while running deliver roughly a quarter more...

Dev.to · Papers Mache
📄 Paper
1mo ago
AI/ML Research Digest — May 16, 2026
Distillation + low‑rank tricks cut compute Combining knowledge distillation with low‑rank adapters...

Dev.to · Papers Mache
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Shared expert pool reduces parameters while maintaining performance
Conventional mixture‑of‑experts designs hand each transformer layer its own private expert set,...

Dev.to · Papers Mache
📄 Paper
1mo ago
HERMES++ answers language queries while predicting roads
The prevailing view has been that autonomous‑driving world models must choose between two extremes: a...

Dev.to · Papers Mache
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Entropy of first token predicts hallucinations
The entropy of the very first content‑bearing token already separates factual answers from...

Dev.to · Papers Mache
📄 Paper
1mo ago
AI/ML Research Digest — May 09, 2026
Diffusion as a unifying backbone for multimodal generation Latent diffusion now drives both image...

Dev.to · Papers Mache
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Distillation that keeps confidence honest
On‑policy distillation has become the go‑to recipe for squeezing a large language model’s...

Dev.to · Papers Mache
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Diffusion models approach AR quality and improve inference speed
Diffusion language models have long promised parallel generation, yet their serving speed has lagged...
DeepCamp AI