Metadata-Version: 2.1
Name: distyll-info
Version: 0.2.4
Summary: Information parsing assistant
License: MIT
Author: JP Hwang
Author-email: 61258750+databyjp@users.noreply.github.com
Requires-Python: >=3.10,<4.0
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Requires-Dist: beautifulsoup4 (>=4.12.3,<5.0.0)
Requires-Dist: moviepy (>=1.0.3,<2.0.0)
Requires-Dist: openai (>=1.13.3,<2.0.0)
Requires-Dist: pydub (>=0.25.1,<0.26.0)
Requires-Dist: pypdf (>=4.1.0,<5.0.0)
Requires-Dist: weaviate-client (>=4.5.0,<5.0.0)
Requires-Dist: yt-dlp (>=2023.12.30,<2024.0.0)
Description-Content-Type: text/markdown

## Installation

Install `ffmpeg` (`brew install ffmpeg` on macOS)
Install the package with `pip install distyll-info`

## Usage

- distyll.text.from_pdf(pdf_url) -> pdf_text
- distyll.text.from_arxiv_paper(arxiv_url) -> {"title": title, "url": arxiv_url, "text": pdf_text}
- distyll.transcripts.from_youtube(youtube_url) -> {"title": title, "date": date, "yt_url": youtube_url, "uploader": uploader, "channel": channel, "transcripts": List[transcript]}
- distyll.transcripts.from_local_video(video_url) -> List[transcript]

Please see the docstrings for more information.

### API keys

OpenAI: Audio -> text functionalities make use of OpenAI's Whisper. You will need an API key to use this functionality.
- Option 1: Provide it as an argument to the function.
- Option 2: Set it using `distyll.set_api_key(openai=<YOUR_API_KEY>)`.
- Option 3: Set it in the `OPENAI_APIKEY` environment variable.

## What happened to the old version?

Sorry! I'm working on making this more streamlined and better. For the old version, please see the `distyll_old` branch.

