Metadata-Version: 2.1
Name: vrt-generator
Version: 0.0.6
Summary: creating vrt corpora
Home-page: https://github.com/miweru/vrt_generator
Author: Michael Ruppert
Author-email: michael.ruppert@fau.de
License: UNKNOWN
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: GNU General Public License v3 or later (GPLv3+)
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Topic :: Text Processing :: Linguistic
Requires-Python: >=3.5
Description-Content-Type: text/markdown
Requires-Dist: smart-open (>=1.9.0)

# vrt_generator
Python class for creating vrt-annotated corpora.
Still in very early testing stage.

Install by typing:
```bash
pip install vrt_generator
```

Usage Example:
```python
from vrt import Corpus,  S, Text
with Corpus("~","meinkorpus",4,"text_name") as c:
    with Text(c, text_name="Text2") as t:
        with S(c) as s:
            s.writep("Test","TAG","TAG","Lemma")  
```

Features:
-
- Represent Corpus, Text, P and S Attributes
- Integration of spacy for automatic generation of a vrt-representation of texts
- Using Context Manager for xml-hierarchy representation
- Reduces to utf8mb3 and checks formatting compatibility
- If you want to add texts that are automatically POS-Tagged with Spacy, you might look at vrt_spacy


