Storytelling using LangChain and Hugging Face! | Convert an Image to an Audio Story!
In this video, we demo a project that turns a single input image into an audio story! We build it with LangChain and a number of FREE Hugging Face models. First, we pass the input image to an image captioning model from Hugging Face and use the generated caption as our scenario. Next, we use the open-source Falcon model as our Large Language Model (LLM) to generate a story from that scenario. Finally, we use a text-to-speech model from Hugging Face to convert the story to speech, giving us an audio story!
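The three stages described above can be sketched as a small pipeline. This is only a minimal illustration, not the code from the video: the prompt wording, the function names, and the `AudioStory` container are all hypothetical, and each stage is passed in as a plain callable so the flow can be followed without downloading any models. The optional `hf_stages` helper shows one way the first two stages might be backed by Hugging Face `transformers` pipelines run locally (rather than through LangChain); the checkpoint names there are assumptions, since the description only says that a captioning model and Falcon are used.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

# Hypothetical prompt wording; the video's exact template is not given here.
STORY_PROMPT = (
    "You are a storyteller. Write a short story based on the scenario below.\n"
    "Scenario: {scenario}\n"
    "Story:"
)


@dataclass
class AudioStory:
    scenario: str  # caption produced by the image captioning stage
    story: str     # text produced by the LLM stage
    audio: bytes   # encoded audio produced by the text-to-speech stage


def build_prompt(scenario: str) -> str:
    """Insert the image caption into the story prompt."""
    return STORY_PROMPT.format(scenario=scenario.strip())


def image_to_audio_story(
    image_path: str,
    caption_image: Callable[[str], str],
    generate_story: Callable[[str], str],
    synthesize_speech: Callable[[str], bytes],
) -> AudioStory:
    """Run the stages in order: image -> caption -> story -> audio."""
    scenario = caption_image(image_path).strip()
    story = generate_story(build_prompt(scenario)).strip()
    audio = synthesize_speech(story)
    return AudioStory(scenario=scenario, story=story, audio=audio)


def hf_stages() -> Tuple[Callable[[str], str], Callable[[str], str]]:
    """Optional wiring for the first two stages with `transformers`.

    Assumptions: the checkpoint names below are illustrative choices, not
    necessarily the ones used in the video. Calling this downloads the models.
    """
    from transformers import pipeline  # imported lazily; only needed here

    captioner = pipeline("image-to-text", model="Salesforce/blip-image-captioning-base")
    llm = pipeline("text-generation", model="tiiuae/falcon-7b-instruct")

    def caption_image(path: str) -> str:
        return captioner(path)[0]["generated_text"]

    def generate_story(prompt: str) -> str:
        out = llm(prompt, max_new_tokens=200, return_full_text=False)
        return out[0]["generated_text"]

    return caption_image, generate_story
```

With real models, `caption_image` and `generate_story` would come from something like `hf_stages()`, and `synthesize_speech` would wrap whichever text-to-speech model you choose; for a quick dry run, simple stand-in functions are enough to check that the stages are connected in the right order.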
LangChain is an open-source framework that enables developer…