llama-cpp-python
transformers
accelerate
guidance
minml

[develop]
build
pytest
