Storytelling using LangChain and Hugging Face! | Convert an Image to an Audio Story!

Quick Tutorials · Intermediate ·🧠 Large Language Models ·2y ago
In this video, we demo a project by which you can make an audio story out of a single input image! We do this using LangChain and a number of FREE Hugging Face models. Specifically, we use the open-source Falcon model as our Large Language Model (LLM) to generate a story from a scenario. To get the scenario, we pass our input image to an image captioning model from Hugging Face and use the generated caption as our scenario. Finally, we use a text to speech model from Hugging Face to convert our story to speech to get an audio story! LangChain is an open-source framework that enables developer…
Watch on YouTube ↗ (saves to browser)
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Next Up
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)