Metadata-Version: 2.1
Name: datamaestro_text
Version: 2019.12.16
Summary: Text related datasets
Home-page: https://github.com/bpiwowar/datamaestro_texts
Author: Benjamin Piwowarski
Author-email: benjamin@piwowarski.fr
License: MIT
Description: [![CircleCI](https://circleci.com/gh/experimaestro/datamaestro_text.svg?style=svg)](https://circleci.com/gh/experimaestro/datamaestro) [![PyPI version](https://badge.fury.io/py/datamaestro-text.svg)](https://badge.fury.io/py/datamaestro-text)
        
        # Text-related datasets
        
        This [datamaestro](https://github.com/bpiwowar/datasets) plugin covers text-related datasets:
        
        - Information Retrieval
        - Natural Language Processing tasks
        
        The list of available datasets and usage instruction can be found in the [documentation](http://experimaestro.github.io/datamaestro_text/).
        
        ## List of available datasets
        
        Below is the list of available datasets along with ids. Some datasets have several versions; in this case, the dataset id is suffixed with this information.
        
        ### Documents
        
        - Aquaint `edu.upenn.ldc.aquaint`
        - TIPSTER `gov.nist.trec.tipster`
        - WikiText-2 and WikiText-103 `io.metamind.research.wikitext`
        
        ### Word embeddings
        
        - [Glove](http://nlp.stanford.edu/projects/glove/) `edu.stanford.glove`
        
        ### Sentiment analysis
        
        - [IMDB](http://ai.stanford.edu/~amaas/data/sentiment) `edu.stanford.aclimdb`
        
        ### Information Retrieval
        
        #### TREC
        
        - [TREC-1 to TREC-8, Robust 2004 and 2005](https://trec.nist.gov/) `gov.nist.trec.adhoc`
        
Platform: UNKNOWN
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Information Technology
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3 :: Only
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Requires-Python: >=3.5
Description-Content-Type: text/markdown
