✕ Clear all filters
155 articles

📰 Reddit r/deeplearning

155 articles · Updated every 3 hours · View all reads

All Articles 67,615Blog Posts 99,886Tech Tutorials 16,344Research Papers 13,813News 12,550 ⚡ AI Lessons
Reddit r/deeplearning 13h ago
[Artículo] Modelos económicos basados ​​en exportaciones e importaciones para predecir el comercio mundial mediante aprendizaje profundo
submitted by /u/Odd_Bid8616 [link] [comments]
Reddit r/deeplearning 13h ago
The H100 GPU can theoretically do 62,000 tokens/sec. Production gets 200. I wrote a deep dive on why the gap is structural, with an interactive explainer.
Long story short, an 8B model in 16-bit precision is 16 GB. Every token requires a full weight transfer from HBM to on-chip SRAM. With 3.35 TB/s bandwidth: 3,35
Reddit r/deeplearning 14h ago
Building an Open-Source Neural Architecture Search Framework with Episodic Memory-Guided Evolutionary Search
submitted by /u/vergueirou [link] [comments]
Reddit r/deeplearning 14h ago
Repurposing the Query Weight Matrix: Theory and Experiments on setting W_Q = Id and replacing it with non-linearity
submitted by /u/Markomkd [link] [comments]
Reddit r/deeplearning 🧠 Large Language Models ⚡ AI Lesson 15h ago
[D] MobileBERT scored 0 F1 across three fault-detection datasets while TinyBERT and DistilBERT worked. Any idea why?
I'm benchmarking lightweight transformers for fault detection on edge devices using three public datasets (NASA C-MAPSS, SECOM, and UCI AI4I 2020). MobileBERT s
Reddit r/deeplearning 21h ago
Why No One Developer Can Win the AI Race
​ The conventional narrative warns us of the dangers of very powerful AI being in the hands of one corporation. The fear is that a developer might gain such a l
Reddit r/deeplearning 1d ago
Open source : Turning vocal imitations into sound effects. (New UX for sound generation)
submitted by /u/Danny-1257 [link] [comments]
Reddit r/deeplearning 1d ago
Beginner looking for a roadmap: undergrad thesis on decentralized (DD) LLMs with a focus on privacy/security
I’m a complete beginner in cybersecurity and ML/LLMs. I’m planning to start my undergrad thesis on decentralized LLMs (DD LLMs) in about 8 months, and I want to
Reddit r/deeplearning 1d ago
In VLA co-training, how much of the backbone learning signal actually comes from flow matching?
Reading through the Wall-OSS-0.5 report and one claim seems worth sanity checking: in their setup, flow matching is not the main learning signal reaching the VL
Reddit r/deeplearning 1d ago
My Bachelor’s thesis project. Is an AI research paper library actually valuable?
Hey everyone, For my bachelor’s thesis, I built a website that serves as a library for more than 200,000 research papers, with new papers being added and update
Reddit r/deeplearning 1d ago
Understanding neural networks from scratch with C++
submitted by /u/markuzo1 [link] [comments]
Reddit r/deeplearning 1d ago
Learning to Skip Blocks: Self-Discovered Ultrametric Routing for Hardware-Accelerated Sparse Attention
submitted by /u/MagicaItux [link] [comments]
Reddit r/deeplearning 1d ago
Grok 4 on the Paradise Our World Could Become When AI Is Doing All of Our Work
​ This is the second in a series of seven posts on how our top AI models describe the paradise our world could be transformed into when AI does all of our work.
Reddit r/deeplearning 1d ago
Why do the output layer weights become word vectors in Word2Vec?
I'm trying to understand the intuition behind Word2Vec training using a neural network. In Word2Vec (CBOW or Skip-gram), we often hear that the weight matrices
Reddit r/deeplearning 1d ago
This open-source lightweight tool handles all the tedious grunt work for YOLO datasets
submitted by /u/Embarrassed-Party552 [link] [comments]
Reddit r/deeplearning 1d ago
[ Removed by Reddit ]
[ Removed by Reddit on account of violating the content policy . ] submitted by /u/ConditionFederal3760 [link] [comments]
Reddit r/deeplearning 1d ago
Need guidance to get into research
submitted by /u/clutcher_cop [link] [comments]
Reddit r/deeplearning 1d ago
On the Duty of Proprietary Developers to Promote the Benefits of AIs Doing All of Our Work for Us
​ Let's start with a fact that too few people are aware of, but that absolutely everyone should very well understand. In the 1800s there were people who became
Reddit r/deeplearning 1d ago
What’s the biggest bottleneck in your current dev workflow?
For me, it’s not writing code it’s everything around it. Environment setup, managing compute resources, making sure things run consistently, and dealing with in
Reddit r/deeplearning 1d ago
Life-changing platform
submitted by /u/Traditional_Ball1392 [link] [comments]