📰 Dev.to · Bonzai2Carn
6 articles · Updated every 3 hours · View all reads
All
Articles 95,380Blog Posts 111,880Tech Tutorials 24,041Research Papers 20,249News 15,294
⚡ AI Lessons

Dev.to · Bonzai2Carn
19h ago
Most PDF Extractors Use the Wrong API: Here’s What We Built Instead
TLDR: PDF.js exposes three data sources at three fidelity levels. The industry default is the one...

Dev.to · Bonzai2Carn
3d ago
Why Splitting a 2,500-Line File Broke Our Architecture
TLDR A single 2500-line index.html with all JS inline worked. Splitting it into...

Dev.to · Bonzai2Carn
1w ago
Stop Parsing PDFs at Render Time: A Better Architecture for Structured Extraction
TLDR: The reason most frontend PDF extraction is wrong is that developers try to infer document...

Dev.to · Bonzai2Carn
1mo ago
The Empty Quadrant: Mapping the Design Space of Frontend PDF Extraction
A user asked me a sharp question yesterday: Looking at your extraction pipeline, pdfjs +...

Dev.to · Bonzai2Carn
1mo ago
How to Stop PDF Parsers from Hallucinating Tables out of Thin Air
PDF extraction is usually blind. If you've ever tried to write a script to scrape a PDF, you know...

Dev.to · Bonzai2Carn
2mo ago
Cleaning Broken HTML Tables from PDFs, Scrapes, and Legacy Exports in Vanilla JS
HTML tables are liars. If you haven't worked deeply with HTML tables, you might think a table is...
DeepCamp AI