Metadata-Version: 2.1
Name: pyw2v2
Version: 0.0.1a3
Summary: Simple wav2vec2 wrapper
Home-page: https://github.com/vbrydik/pyw2v2
Author: vbrydik (Vitalii Brydinskyi)
Author-email: <vbrydinskyi@gmail.com>
License: UNKNOWN
Description: 
        # pyw2v2
        
        **Work in progress!**
        
        This module is a wrapper for Wav2Vec2 models, intended to accelerate ARS research.
        
        ## Install
        
        This module can be easily be installed with `pip`:
        
        ```sh
        pip install pyw2v2
        ```
        
        ## Examples
        
        Different examples can be found [here](https://github.com/vbrydik/pyw2v2/tree/master/examples/).
        
        ### Fine-tuning CTC model example
        
        This example will show how to load a pretrained model,
        load dataset, process dataset, and fine-tune CTC model.
        
        Example configuration files can be found [here](https://github.com/vbrydik/pyw2v2/tree/master/configs/).
        
        ```py
        from pyw2v2 import ModelCTC, DatasetPreprocessor
        from pyw2v2.utils import load_config, load_custom_dataset_commonvoice_format
        
        if __name__ == "__main__":
            # Load pretrained model
            model_config = load_config("../configs/ctc/default.yaml")
            model = ModelCTC(model_config)
            
            # Loading dataset in Common Voice format
            train_set = load_custom_dataset_commonvoice_format('../datasets/example', 'train')
            eval_set = load_custom_dataset_commonvoice_format('../datasets/example', 'test')
        
            # Set up dataset preprocessor
            dataproc_config = load_config("../configs/dataproc/default.yaml").data_proc
            data_processor = DatasetPreprocessor(dataproc_config)
            data_processor.processor = model.processor
        
            # Process data
            train_set = data_processor.process(train_set, dataproc_config.n_samples_train)
            eval_set = data_processor.process(eval_set, dataproc_config.n_samples_test)
            
            # Train/Fine-tune model
            model.train(train_set, eval_set)
        ```
        
        
        
        
        
Keywords: python
Platform: UNKNOWN
Classifier: Development Status :: 1 - Planning
Classifier: Intended Audience :: Developers
Classifier: Programming Language :: Python :: 3
Classifier: Operating System :: Unix
Classifier: Operating System :: MacOS :: MacOS X
Classifier: Operating System :: Microsoft :: Windows
Description-Content-Type: text/markdown
