Storytelling using LangChain and Hugging Face! | Convert an Image to an Audio Story!

Name: Storytelling using LangChain and Hugging Face! | Convert an Image to an Audio Story!
Uploaded: 2024-02-03T20:33:04+00:00
Channel: Quick Tutorials
Description: In this video, we demo a project by which you can make an audio story out of a single input image! We do this using LangChain and a number of FREE Huggi...

Quick Tutorials · Intermediate ·🧠 Large Language Models ·2y ago

In this video, we demo a project by which you can make an audio story out of a single input image! We do this using LangChain and a number of FREE Hugging Face models. Specifically, we use the open-source Falcon model as our Large Language Model (LLM) to generate a story from a scenario. To get the scenario, we pass our input image to an image captioning model from Hugging Face and use the generated caption as our scenario. Finally, we use a text to speech model from Hugging Face to convert our story to speech to get an audio story! LangChain is an open-source framework that enables developer…

Watch on YouTube ↗ (saves to browser)

Next Up

5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems

Dave Ebbelaar (LLM Eng)