Metadata-Version: 2.1
Name: podcast-llm
Version: 0.2.2
Summary: An intelligent system that automatically generates engaging podcast conversations using LLMs and text-to-speech technology.
Home-page: https://github.com/evandempsey/podcast-llm
License: Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)
Keywords: podcast,llm,ai
Author: Evan Dempsey
Author-email: evandempsey@gmail.com
Requires-Python: >=3.12,<4.0
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: License :: Other/Proprietary License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Requires-Dist: audioop-lts (>=0.2.0,<0.3.0) ; python_version >= "3.13" and python_version < "4.0"
Requires-Dist: elevenlabs (>=1.12.1,<2.0.0)
Requires-Dist: google-cloud-texttospeech (>=2.21.1,<3.0.0)
Requires-Dist: gradio (>=5.6.0,<6.0.0)
Requires-Dist: gradio-log (>=0.0.7,<0.0.8)
Requires-Dist: langchain (>=0.3.7,<0.4.0)
Requires-Dist: langchain-anthropic (>=0.3.0,<0.4.0)
Requires-Dist: langchain-community (>=0.3.7,<0.4.0)
Requires-Dist: langchain-google-genai (>=2.0.4,<3.0.0)
Requires-Dist: langchain-openai (>=0.2.8,<0.3.0)
Requires-Dist: lxml-html-clean (>=0.4.1,<0.5.0)
Requires-Dist: newspaper3k (>=0.2.8,<0.3.0)
Requires-Dist: openai (>=1.54.4,<2.0.0)
Requires-Dist: pydub (>=0.25.1,<0.26.0)
Requires-Dist: pypdf (>=5.1.0,<6.0.0)
Requires-Dist: python-docx (>=1.1.2,<2.0.0)
Requires-Dist: python-dotenv (>=1.0.1,<2.0.0)
Requires-Dist: tavily-python (>=0.5.0,<0.6.0)
Requires-Dist: wikipedia (>=1.4.0,<2.0.0)
Requires-Dist: youtube-transcript-api (>=0.6.2,<0.7.0)
Project-URL: Documentation, https://evandempsey.github.io/podcast-llm/
Project-URL: Repository, https://github.com/evandempsey/podcast-llm
Description-Content-Type: text/markdown

# Podcast-LLM: AI-Powered Podcast Generation

![Tests](https://github.com/evandempsey/podcast-llm/actions/workflows/pytest.yml/badge.svg)
[![codecov](https://codecov.io/gh/evandempsey/podcast-llm/branch/main/graph/badge.svg)](https://codecov.io/gh/evandempsey/podcast-llm)
[![GitHub stars](https://img.shields.io/github/stars/evandempsey/podcast-llm.svg?style=social&label=Star)](https://github.com/evandempsey/podcast-llm)
[![GitHub Downloads](https://img.shields.io/github/downloads/evandempsey/podcast-llm/total.svg)](https://github.com/evandempsey/podcast-llm/releases)
[![GitHub issues](https://img.shields.io/github/issues/evandempsey/podcast-llm.svg)](https://github.com/evandempsey/podcast-llm/issues)
[![License: CC BY-NC 4.0](https://img.shields.io/badge/License-CC%20BY--NC%204.0-lightgrey.svg)](https://creativecommons.org/licenses/by-nc/4.0/)


An intelligent system that automatically generates engaging podcast conversations using LLMs and text-to-speech technology.

[View Documentation](https://evandempsey.github.io/podcast-llm/)

## Features

- Two modes of operation:
  - Research mode: Automated research and content gathering using Tavily search
  - Context mode: Generate podcasts from provided source materials (URLs and files)
- Dynamic podcast outline generation
- Natural conversational script writing with multiple Q&A rounds
- High-quality text-to-speech synthesis using Google Cloud or ElevenLabs
- Checkpoint system to save progress and resume generation
- Configurable voices and audio settings
- Gradio UI

## Examples

Listen to sample podcasts generated using Podcast-LLM:

### Structured JSON Output from LLMs (Google multispeaker voices)

[![Play Podcast Sample](https://img.shields.io/badge/Play%20Podcast-brightgreen?style=for-the-badge&logo=soundcloud)](https://soundcloud.com/evan-dempsey-153309617/llm-structured-output)

### UFO Crash Retrieval (Elevenlabs voices)

[![Play Podcast Sample](https://img.shields.io/badge/Play%20Podcast-brightgreen?style=for-the-badge&logo=soundcloud)](https://soundcloud.com/evan-dempsey-153309617/ufo-crash-retrieval-elevenlabs-voices)

### The Behenian Fixed Stars (Google multispeaker voices)

[![Play Podcast Sample](https://img.shields.io/badge/Play%20Podcast-brightgreen?style=for-the-badge&logo=soundcloud)](https://soundcloud.com/evan-dempsey-153309617/behenian-fixed-stars)

### Podcast-LLM Overview (Google multispeaker voices)

[![Play Podcast Sample](https://img.shields.io/badge/Play%20Podcast-brightgreen?style=for-the-badge&logo=soundcloud)](https://soundcloud.com/evan-dempsey-153309617/podcast-llm-with-anthropic-and-google-multispeaker)

### Robotic Process Automation (Google voices)

[![Play Podcast Sample](https://img.shields.io/badge/Play%20Podcast-brightgreen?style=for-the-badge&logo=soundcloud)](https://soundcloud.com/evan-dempsey-153309617/robotic-process-automation-google-voices)


## Web Interface

![Gradio Web Interface](https://raw.githubusercontent.com/evandempsey/podcast-llm/main/assets/images/gradio_ui.png)

## Installation

1. Install using pip:
   ```bash
   pip install podcast-llm
   ```

2. Set up environment variables in `.env`:
   ```
   OPENAI_API_KEY=your_openai_key
   GOOGLE_API_KEY=your_google_key 
   ELEVENLABS_API_KEY=your_elevenlabs_key
   TAVILY_API_KEY=your_tavily_key
   ANTHROPIC_API_KEY=your_anthropic_api_key
   ```

## Usage

1. Generate a podcast about a topic:
   ```bash
   # Research mode (default) - automatically researches the topic
   podcast-llm "Artificial Intelligence"

   # Context mode - uses provided sources
   podcast-llm "Machine Learning" --mode context --sources paper.pdf https://example.com/article
   ```

2. Options:
   ```bash
   # Customize number of Q&A rounds per section
   podcast-llm "Linux" --qa-rounds 3

   # Disable checkpointing
   podcast-llm "Space Exploration" --checkpoint false

   # Generate audio output
   podcast-llm "Quantum Computing" --audio-output podcast.mp3

   # Generate Markdown output
   podcast-llm "Machine Learning" --text-output podcast.md
   ```

3. Customize voices and other settings in `config/config.yaml`

4. Launch the Gradio web interface:
   ```bash
   # Start the web UI
   podcast-llm-gui
   ```

   This launches a user-friendly web interface where you can:
   - Enter a podcast topic
   - Choose between research and context modes
   - Upload source files and URLs for context mode
   - Configure Q&A rounds and checkpointing
   - Specify output paths for text and audio
   - Monitor generation progress in real-time


## License

This project is licensed under Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)

This means you are free to:
- Share: Copy and redistribute the material in any medium or format
- Adapt: Remix, transform, and build upon the material

Under the following terms:
- Attribution: You must give appropriate credit, provide a link to the license, and indicate if changes were made
- NonCommercial: You may not use the material for commercial purposes
- No additional restrictions: You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits

For commercial use, please contact evandempsey@gmail.com to obtain a commercial license.

The full license text can be found at: https://creativecommons.org/licenses/by-nc/4.0/legalcode

## Acknowledgements

This project was inspired by [podcastfy](https://github.com/souzatharsis/podcastfy), which provides a framework for generating podcasts using LLMs. 

This implementation differs by automating the research and content gathering process, allowing for fully autonomous podcast generation about any topic without requiring manual research or content curation.

