Metadata-Version: 2.4
Name: playlist-summarizer
Version: 0.1.0
Summary: A monorepo for YouTube playlist transcript fetching and AI-powered summarization
Author-email: Anthony Kovach <aj@ajxd2.dev>
Requires-Python: >=3.14
Description-Content-Type: text/markdown
License-File: LICENSE
Dynamic: license-file

# Playlist Summarizer

A monorepo containing tools for fetching YouTube playlist transcripts and generating AI-powered summaries using local or cloud-based LLM models via Ollama.

## Packages

| Package | Description |
|---------|-------------|
| [`playlist-summarizer-core`](./packages/core) | Core library for transcript fetching and AI summarization |
| [`playlist-summarizer-cli`](./packages/cli) | Command-line interface for the summarizer |

## Features

- 📥 **Fetch Playlists**: Download transcripts from YouTube playlists
- 🤖 **AI Summarization**: Generate summaries using Ollama models (local or cloud)
- 📝 **Markdown Output**: Clean, structured summaries in Markdown format
- 🎯 **Selective Processing**: Choose which videos to summarize
- 📊 **Progress Tracking**: Real-time progress bars and status updates
- 🔒 **Secure Prompts**: Hardened system prompts resistant to injection attacks
- 🎨 **Rich CLI**: Beautiful terminal interface with colors and formatting

## Requirements

- Python 3.14+
- [Ollama](https://ollama.com/) installed and running (for local models)
- Or Ollama Cloud API key (for cloud models)

## Quick Start

1. Clone the repository:
```bash
git clone https://github.com/ajxd2/playlist-summarizer.git
cd playlist-summarizer
```

2. Install dependencies using `uv`:
```bash
uv sync
```

3. (Optional) Set up environment variables:
```bash
cp .env.example .env
# Edit .env and add:
# OLLAMA_HOST=http://localhost:11434  # Optional, defaults to localhost
# OLLAMA_API_KEY=your_api_key_here     # Required for cloud models
```

4. Run the CLI:
```bash
uv run playlist-summarizer
```

## Project Structure

```
playlist-summarizer/
├── packages/
│   ├── core/                    # Core library
│   │   ├── src/
│   │   │   └── playlist_summarizer_core/
│   │   │       ├── __init__.py
│   │   │       ├── fetcher.py   # YouTube transcript fetching
│   │   │       └── summarizer.py # AI summarization logic
│   │   ├── pyproject.toml
│   │   └── README.md
│   └── cli/                     # CLI application
│       ├── src/
│       │   └── playlist_summarizer_cli/
│       │       ├── __init__.py
│       │       ├── __main__.py
│       │       └── main.py      # CLI entry point
│       ├── pyproject.toml
│       └── README.md
├── transcripts/                 # Downloaded transcripts (gitignored)
├── summaries/                   # Generated summaries (gitignored)
├── pyproject.toml               # Workspace configuration
├── uv.lock                      # Dependency lock file
└── README.md
```

## Usage

### Using the CLI

Run the CLI application:
```bash
uv run playlist-summarizer
```

#### Fetch Playlist Transcripts

1. Select "Fetch playlist" from the main menu
2. Enter the YouTube playlist URL
3. Specify the output directory (defaults to `transcripts/{playlist_id}/`)
4. Transcripts will be saved as `.txt` files

#### Summarize Playlists

1. Select "Summarize playlist" from the main menu
2. Enter the directory path containing transcripts (defaults to `transcripts/`)
3. Choose the Ollama model to use (e.g., `gemma3:4b`, `llama3.1:8b`, `qwen2.5:7b`)
4. Specify the output directory for summaries (defaults to `summaries/`)
5. Select a playlist folder
6. Choose which videos to summarize (use space to select/deselect)
7. Optionally create a master summary of all video summaries

### Using the Core Library

```python
from playlist_summarizer_core import get_playlist_videos, get_transcript, Summarizer

# Fetch transcripts
videos = get_playlist_videos("https://www.youtube.com/playlist?list=...")
for video in videos:
    transcript = get_transcript(video)
    print(f"{video.title}: {len(transcript)} characters")

# Summarize content
summarizer = Summarizer(model="gemma3:4b")
summary = summarizer.summarize("Your transcript text...")
```

## Configuration

### Environment Variables

- `OLLAMA_HOST` - Ollama server URL (default: `http://localhost:11434`)
- `OLLAMA_API_KEY` - API key for Ollama Cloud (required for cloud models)

### Default Directories

- Transcripts: `transcripts/`
- Summaries: `summaries/`

## Development

### Setup Development Environment

```bash
uv sync
```

### Run from Source

```bash
# Run CLI
uv run playlist-summarizer

# Or run as module
uv run python -m playlist_summarizer_cli
```

### Working with Individual Packages

```bash
# Work on core package
cd packages/core
uv sync

# Work on CLI package
cd packages/cli
uv sync
```

## License

MIT License - see [LICENSE](LICENSE) file for details.

## Contributing

Contributions are welcome! Please feel free to submit a Pull Request. See [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines.

## Author

Anthony Kovach - [aj@ajxd2.dev](mailto:aj@ajxd2.dev)
