Metadata-Version: 2.1
Name: thematos
Version: 0.1.2
Summary: A Python Library for Topic Modeling
Home-page: https://thematos.entelecheia.ai
License: MIT
Author: Young Joon Lee
Author-email: entelecheia@hotmail.com
Requires-Python: >=3.8.1,<3.12
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Requires-Dist: hyfi (>=0.2.6,<0.3.0)
Requires-Dist: toml (>=0.10.2,<0.11.0)
Description-Content-Type: text/markdown

# ThematOS: A Python Library for Topic Modeling

[![pypi-image]][pypi-url]
[![license-image]][license-url]
[![version-image]][release-url]
[![release-date-image]][release-url]
[![jupyter-book-image]][docs-url]

<!-- Links: -->

[pypi-image]: https://img.shields.io/pypi/v/thematos
[license-image]: https://img.shields.io/github/license/entelecheia/thematos
[license-url]: https://github.com/entelecheia/thematos/blob/main/LICENSE
[version-image]: https://img.shields.io/github/v/release/entelecheia/thematos?sort=semver
[release-date-image]: https://img.shields.io/github/release-date/entelecheia/thematos
[release-url]: https://github.com/entelecheia/thematos/releases
[jupyter-book-image]: https://jupyterbook.org/en/stable/_images/badge.svg
[repo-url]: https://github.com/entelecheia/thematos
[pypi-url]: https://pypi.org/project/thematos
[docs-url]: https://thematos.entelecheia.ai
[changelog]: https://github.com/entelecheia/thematos/blob/main/CHANGELOG.md
[contributing guidelines]: https://github.com/entelecheia/thematos/blob/main/CONTRIBUTING.md

<!-- Links: -->

- Documentation: [https://thematos.entelecheia.ai][docs-url]
- GitHub: [https://github.com/entelecheia/thematos][repo-url]
- PyPI: [https://pypi.org/project/thematos][pypi-url]

ThematOS is a comprehensive and user-friendly Python library designed for topic modeling, a key technique in natural language processing (NLP) and machine learning (ML) used to extract hidden patterns and themes from large collections of text data. The name ThematOS is derived from the Greek word θέματος (topic), reflecting the library's primary focus on uncovering the underlying structure of textual data.

## Features

ThematOS offers a rich set of features that cater to both beginners and experienced practitioners in the field of NLP:

- **User-friendly API**: ThematOS provides an intuitive API, enabling users to easily create, train, and apply topic models with minimal lines of code.

- **Various topic modeling algorithms**: The library supports a wide range of state-of-the-art topic modeling techniques, including Latent Dirichlet Allocation (LDA), Non-negative Matrix Factorization (NMF), and Hierarchical Dirichlet Process (HDP), allowing users to explore different methods and choose the most suitable one for their projects.

- **Scalability**: ThematOS is designed to handle large-scale text corpora efficiently, utilizing advanced algorithms and parallel processing techniques to ensure rapid processing and analysis.

- **Customizability**: Users can create custom topic models with full control over model parameters, preprocessing steps, and evaluation metrics, tailoring the models to their specific needs.

- **Visualization tools**: ThematOS includes a variety of visualization tools to help users explore and interpret the results of topic models, such as word clouds, topic distribution charts, and interactive visualizations.

## Installation

You can install ThematOS using pip:

```bash
pip install thematos
```

## Getting Started

To get started with ThematOS, visit the [official documentation](https://thematos.entelecheia.io/) and the [GitHub repository](https://github.com/entelecheia/thematos) for examples, tutorials, and more information.

## Changelog

See the [CHANGELOG] for more information.

## Contributing

Contributions are welcome! Please see the [contributing guidelines] for more information.

## License

This project is released under the [MIT License][license-url].

