📰 Dev.to · Papers Mache

37 articles · Updated every 3 hours · View all reads

All Articles 95,923 Blog Posts 112,576 Tech Tutorials 24,165 Research Papers 20,260 News 15,375 ⚡ AI Lessons

AI reviewers fall for repackaging attacks

Dev.to · Papers Mache 📄 Paper 11h ago

AI reviewers fall for repackaging attacks

Minor presentation tweaks can inflate AI‑reviewer scores by more than a point on a ten‑point scale....

Multilingual code gap exposed by Multi‑LCB

Dev.to · Papers Mache 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Multilingual code gap exposed by Multi‑LCB

LLMs achieve high scores on Python coding tasks, yet their proficiency drops for the eleven other...

AI/ML Research Digest — Jun 20, 2026

Dev.to · Papers Mache 📄 Paper 2d ago

AI/ML Research Digest — Jun 20, 2026

Persistent state and memory for embodied agents Linear‑temporal attention lets agents keep a running...

Sparse KV Caches Cut Attention Scaling

Dev.to · Papers Mache 📄 Paper 2d ago

Sparse KV Caches Cut Attention Scaling

Sparse key‑value caches collapse the quadratic blow‑up of softmax attention into a cost that grows...

Local Gradient Accumulation Speeds Training 1.7

Dev.to · Papers Mache 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 3d ago

Local Gradient Accumulation Speeds Training 1.7

PACI removes the bubbles that cripple asynchronous pipeline parallelism and shaves as much as 1.69×...

Intra‑Model Routing Accelerates Speculative Decoding

Dev.to · Papers Mache 📄 Paper 4d ago

Intra‑Model Routing Accelerates Speculative Decoding

Intra‑model routing trims token‑generation latency by roughly a third to almost a full 80 % compared...

Aligning Hidden States Stabilizes LLM Distillation

Dev.to · Papers Mache 📄 Paper 1w ago

Aligning Hidden States Stabilizes LLM Distillation

Hidden‑representation alignment drives KL variance to exactly 0, turning on‑policy LLM distillation...

8 FPS Real‑Time Video on Consumer GPU

Dev.to · Papers Mache 📄 Paper 1w ago

8 FPS Real‑Time Video on Consumer GPU

MoVerse delivers 360° walkthrough video at roughly 8 FPS on a single RTX 4090, proving that...

AI/ML Research Digest — Jun 13, 2026

Dev.to · Papers Mache 📄 Paper 1w ago

AI/ML Research Digest — Jun 13, 2026

Infrastructure and inference optimization for scale Sparse‑attention mechanisms cut the quadratic...

Optimal Transport Converts Dense Layers to Sparse Experts

Dev.to · Papers Mache 📄 Paper 1w ago

Optimal Transport Converts Dense Layers to Sparse Experts

Differentiable optimal transport rewrites a dense feed‑forward layer into a balanced...

90% Less Memory Enables Infinite Video Generation

Dev.to · Papers Mache 📄 Paper 1w ago

90% Less Memory Enables Infinite Video Generation

A shared low‑rank cache slashes the memory footprint of autoregressive video diffusion by more than...

Linear Ensembles Can Erase LLM Watermarks

Dev.to · Papers Mache 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Linear Ensembles Can Erase LLM Watermarks

Watermarking schemes that embed distributional perturbations into LLM outputs are effectively broken...

Benchmarks Evaluate Memory Quality and Adaptive Planning in LLM Agents

Dev.to · Papers Mache 📄 Paper 1w ago

Benchmarks Evaluate Memory Quality and Adaptive Planning in LLM Agents

Newly released test suites expose two blind spots that have long lurked behind headline scores: how...

AI/ML Research Digest — May 23, 2026

Dev.to · Papers Mache 📄 Paper 3w ago

AI/ML Research Digest — May 23, 2026

Extreme KV‑Cache Compression and Long‑Context Efficiency Static quantization is giving way...

AI/ML Research Digest — May 30, 2026

Dev.to · Papers Mache 📄 Paper 3w ago

AI/ML Research Digest — May 30, 2026

Efficiency and Cost Reduction in LLM Agents Recent work tackles the high inference cost of LLM‑driven...

KV cache eviction improves long‑context performance

Dev.to · Papers Mache 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago

KV cache eviction improves long‑context performance

A learned, globally‑calibrated KV‑cache eviction policy can shave memory usage and, paradoxically,...

Self-evolving retrieval lifts benchmark scores 25%

Dev.to · Papers Mache 📄 Paper 1mo ago

Self-evolving retrieval lifts benchmark scores 25%

Agents that adapt their retrieval configurations while running deliver roughly a quarter more...

AI/ML Research Digest — May 16, 2026

Dev.to · Papers Mache 📄 Paper 1mo ago

AI/ML Research Digest — May 16, 2026

Distillation + low‑rank tricks cut compute Combining knowledge distillation with low‑rank adapters...

Shared expert pool reduces parameters while maintaining performance

Dev.to · Papers Mache 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Shared expert pool reduces parameters while maintaining performance

Conventional mixture‑of‑experts designs hand each transformer layer its own private expert set,...

HERMES++ answers language queries while predicting roads

Dev.to · Papers Mache 📄 Paper 1mo ago

HERMES++ answers language queries while predicting roads

The prevailing view has been that autonomous‑driving world models must choose between two extremes: a...

Entropy of first token predicts hallucinations

Dev.to · Papers Mache 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Entropy of first token predicts hallucinations

The entropy of the very first content‑bearing token already separates factual answers from...

AI/ML Research Digest — May 09, 2026

Dev.to · Papers Mache 📄 Paper 1mo ago

AI/ML Research Digest — May 09, 2026

Diffusion as a unifying backbone for multimodal generation Latent diffusion now drives both image...

Distillation that keeps confidence honest

Dev.to · Papers Mache 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Distillation that keeps confidence honest

On‑policy distillation has become the go‑to recipe for squeezing a large language model’s...

Diffusion models approach AR quality and improve inference speed

Dev.to · Papers Mache 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Diffusion models approach AR quality and improve inference speed

Diffusion language models have long promised parallel generation, yet their serving speed has lagged...