Metadata-Version: 2.4
Name: rag-colls
Version: 0.2.0.6
Summary: rag-colls - Implement recent advanced RAG techniques
Project-URL: Homepage, https://github.com/hienhayho/rag-colls
Project-URL: Issues, https://github.com/hienhayho/rag-colls/issues
Requires-Python: >=3.10
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: bm25s>=0.2.10
Requires-Dist: chromadb<1.0.0,>=0.6.3
Requires-Dist: elasticsearch[async]>=8.17.2
Requires-Dist: gdown>=5.2.0
Requires-Dist: html2text>=2024.2.26
Requires-Dist: jax[cpu]>=0.5.3
Requires-Dist: litellm>=1.65.0
Requires-Dist: llama-index-embeddings-openai>=0.3.1
Requires-Dist: loguru>=0.7.3
Requires-Dist: openpyxl>=3.1.5
Requires-Dist: pandas<2.2.3,>=2.2.0
Requires-Dist: platformdirs>=4.3.7
Requires-Dist: pymupdf>=1.25.4
Requires-Dist: python-docx>=1.1.2
Requires-Dist: rich>=13.9.4
Requires-Dist: tenacity>=9.0.0
Provides-Extra: dev
Requires-Dist: pre-commit>=4.2.0; extra == "dev"
Requires-Dist: pytest>=8.3.5; extra == "dev"
Provides-Extra: docs
Requires-Dist: esbonio>=0.12.0; extra == "docs"
Requires-Dist: myst-parser>=4.0.1; extra == "docs"
Provides-Extra: huggingface-embedding
Requires-Dist: llama-index-embeddings-huggingface>=0.5.2; extra == "huggingface-embedding"
Dynamic: license-file

# rag-colls

<p align="center">
  <img src="assets/Final_logo.png" alt="Logo" width="350"/>
</p>

**rag-colls** a.k.a **RAG Coll**ection**s**.

Simple and easy to use, production-ready advanced RAG techniques.

<div align="center">

![Downloads](https://img.shields.io/pypi/dm/rag_colls) ![License](https://img.shields.io/badge/license-MIT-green)

![GitHub CI](https://github.com/hienhayho/rag-colls/actions/workflows/docker-build.yml/badge.svg) ![GitHub CI](https://github.com/hienhayho/rag-colls/actions/workflows/installation-testing.yml/badge.svg)

</div>

## 📑 Table of Contents

- [📖 Documentation](#-documentation)
- [🔧 Installation](#-installation)
- [📚 Notebooks](#-notebooks)
- [🚀 Upcoming](#-upcoming)
- [🎉 Quickstart](#-quickstart)
- [💻 Develop Guidance](#-develop-guidance)
- [✨ Contributors](#-contributors)
- [©️ License](#️-license)

## 📖 Documentation

Please visit [documentation](https://rag-colls.readthedocs.io/en/latest/) to get latest update.

## 🔧 Installation

- You can easily install it from **pypi**:

```bash
pip install -U rag-colls
```

- **Docker** - 🐳:

```bash
# Clone the repository
git clone https://github.com/hienhayho/rag-colls.git
cd rag-colls/

# Docker build
DOCKER_BUILDKIT=1 docker build -f docker/Dockerfile \
                    --build-arg OPENAI_API_KEY=<YOUR_OPENAI_KEY> \
                    --build-arg PYTHON_VERSION="3.10" \
                    -t rag-colls:3.10 .
```

## 📚 Notebooks

We have provided some notebooks for example usage.

|   RAG Tech    |                      Code                      |                                       Guide                                        |                                                            Tech Description                                                            |
| :-----------: | :--------------------------------------------: | :--------------------------------------------------------------------------------: | :------------------------------------------------------------------------------------------------------------------------------------: |
|   BasicRAG    |     [BasicRAG](./rag_colls/rags/basic_rag)     | [Colab](https://colab.research.google.com/drive/19hzGSQqx-LIsSbnNkV71ipRAIiFingvP) |                             Integrate with [`Chromadb`](rag_colls/databases/vector_databases/chromadb.py)                              |
| ContextualRAG | [ContextualRAG](rag_colls/rags/contextual_rag) | [Colab](https://colab.research.google.com/drive/1vT2Wl8FzYt25_4CMMg-2vcF4y17iTSjO) | Integrate with [`Chromadb`](rag_colls/databases/vector_databases/chromadb.py) and [`BM25s`](rag_colls/databases/bm25/bm25s.py) version |

## 🚀 Upcoming

We are currently working on these projects and will be updated soon.

| RAG Tech |                                                                                Link                                                                                 |
| :------: | :-----------------------------------------------------------------------------------------------------------------------------------------------------------------: |
| Graph-RAG | [Blog](https://microsoft.github.io/graphrag/), [Paper](https://arxiv.org/pdf/2404.16130) |
|   RAFT   | [Blog](https://techcommunity.microsoft.com/blog/aiplatformblog/raft-a-new-way-to-teach-llms-to-be-better-at-rag/4084674), [Paper](https://arxiv.org/pdf/2403.10131) |
|  RAG-RL  |                                                              [Paper](https://arxiv.org/pdf/2503.12759)                                                              |

## 🎉 Quickstart

Please refer to [example](./examples) for more information.

## 💻 Develop Guidance

Please refer to [DEVELOP.md](./DEVELOP.md) for more information.

## ✨ Contributors

<table>
<tr>
    <td align="center" style="word-wrap: break-word; width: 120.0; height: 120.0">
        <a href=https://github.com/hienhayho>
            <img src=https://avatars.githubusercontent.com/u/115549171?v=4 width="80;"  style="border-radius:50%;align-items:center;justify-content:center;overflow:hidden;padding-top:10px" alt=Ho Trong Hien/>
            <br />
            <sub style="font-size:12px"><b>Ho Trong Hien</b></sub>
        </a>
    </td>
    <td align="center" style="word-wrap: break-word; width: 120.0; height: 120.0">
        <a href=https://github.com/congtuong>
            <img src=https://avatars.githubusercontent.com/u/132115321?v=4 width="80;"  style="border-radius:50%;align-items:center;justify-content:center;overflow:hidden;padding-top:10px" alt=congtuong/>
            <br />
            <sub style="font-size:12px"><b>congtuong</b></sub>
        </a>
    </td>
    <td align="center" style="word-wrap: break-word; width: 120.0; height: 120.0">
        <a href=https://github.com/datheobc123>
            <img src=https://avatars.githubusercontent.com/u/142462660?v=4 width="80;"  style="border-radius:50%;align-items:center;justify-content:center;overflow:hidden;padding-top:10px" alt=Phan Thanh Dat/>
            <br />
            <sub style="font-size:12px"><b>Phan Thanh Dat</b></sub>
        </a>
    </td>
</tr>
</table>

## ©️ LICENSE

`rag-colls` is under [MIT LICENSE.](./LICENSE)
