Image to Audio Story Converter

Overview

This project is an innovative application that transforms images into audio stories. Utilizing cutting-edge AI models, it captions images, crafts stories based on those captions, and converts the text to speech, offering a unique auditory experience from visual inputs.

Features

Image Captioning: Leverages a pre-trained model to describe images.
Story Generation: Creates short, engaging narratives based on image descriptions.
Text-to-Speech Conversion: Transforms generated stories into audio format using Hugging Face's API.
Streamlit Web App: Provides an interactive interface for users to upload images and receive audio stories.

Technologies

Python
Streamlit
Hugging Face Transformers
Langchain
dotenv for environment management

Setup and Usage

Install dependencies: pip install -r requirements.txt
Rename .env.example to .env and update it with your API Keys.
Run the Streamlit app: streamlit run describe_image.py

Demo Video

Check out our project in action! Uploaded a demonstration video showcasing how the Image to Audio Story Converter works, from uploading an image to hearing the generated audio story. Watch the video to see the app's features and capabilities firsthand.

Kanye.mp4

Contribution

Contributions are welcome! Feel free to fork the repo and submit pull requests.

License

This project is open-sourced under the MIT license.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
describe_image.py		describe_image.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image to Audio Story Converter

Overview

Features

Technologies

Setup and Usage

Demo Video

Contribution

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Image to Audio Story Converter

Overview

Features

Technologies

Setup and Usage

Demo Video

Contribution

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages