Metadata-Version: 2.1
Name: mosyn
Version: 1.0.9
Summary: A morphological dictionary tool.
Home-page: https://github.com/uagdataanalysis/mosynapi
Author: uagdataanalysis
Author-email: uagdataanalysis@gmail.com
License: UNKNOWN
Download-URL: https://github.com/uagdataanalysis/mosynapi/tarball/1.0.9
Project-URL: Bug Reports, https://github.com/uagdataanalysis/mosynapi/issues
Project-URL: Source, https://github.com/uagdataanalysis/mosynapi/
Keywords: dictionary,morphological,analysis
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 2
Classifier: License :: Other/Proprietary License
Classifier: Operating System :: OS Independent
Requires-Dist: nltk (>=3.0)

# Morpho-Syntactic "MoSyn" API
## UAG - BIG DATA RESEARCH GROUP

----

### Description
This API has been created by the Universidad Autónoma de Guadalajara(UAG) Big Data Research Group as a Natural Language Processing tool. It provides a python library with functions that help to perform morphological anaysis on texts written in Spanish.

 - Quick overview on what morphological analysis is in the following video: [El análisis morfológico de una oración](https://www.youtube.com/watch?v=BgAHya5ejJ8)
 - Link to EAGLES standard:[INTRODUCCIÓN A LAS ETIQUETAS EAGLES](http://www.cs.upc.edu/~nlp/tools/parole-sp.html) 

Additional information about the creation of the morphological dictionary can be found in the following Article:
 * P.J. Castro Pérez, A.A. García Fuentes, M. E. Huerta Arreola, R. Dávila Pérez. (2016). Machine Readable Dictionary for Mexican Spanish. En Tecnologías Modernas para la Industria y la Educación. Cuernavaca, Morelos, Mexico: Institute Eng Electric Electronics Morelos Section, S.C. (607-95255).

----


## Installation
The following packages are required to install mosyn
 - Python: [https://www.python.org/](https://www.python.org/)
 - PyPI: [https://pip.pypa.io/en/stable/installing/](https://pip.pypa.io/en/stable/installing/)
 - NLTK: [http://www.nltk.org/](http://www.nltk.org/)

Once having installed the depenencies above then install mosyn:
```
# pip install mosyn
```

If at any point the following error appears:
```python
Resource u'tokenizers/punkt/english.pickle' not found.  Please
use the NLTK Downloader to obtain the resource:

    >>>nltk.download()

Searched in:
- '/home/ec2-user/nltk_data'
- '/usr/share/nltk_data'
- '/usr/local/share/nltk_data'
- '/usr/lib/nltk_data'
- '/usr/local/lib/nltk_data'
- u''
```
try the instructions in the following link:
[http://stackoverflow.com/questions/26570944/resource-utokenizers-punkt-english-pickle-not-found](http://stackoverflow.com/questions/26570944/resource-utokenizers-punkt-english-pickle-not-found)




## Running examples
Examples may need to be downloaded from GitHub and locally copied. Examples should be copied to the same level where mosyn has been installed for they to work out of the box. 

Use the command `pydoc mosyn´ to find out the directory where mosyn has been installed installed to:
```
$ pydoc mosyn
```

After hit enter a screen with a text similar to the following should appear:
```
Help on package mosyn:

NAME
    mosyn - # -*- coding: iso-8859-15 -*-

FILE
    /Library/Python/2.7/site-packages/mosyn-1.0.5-py2.7.egg/mosyn/__init__.py

PACKAGE CONTENTS
    __main__
    mosyn
    util (package)

(END)
```

In that example mosyn is installed in `/Library/Python/2.7/site-packages/mosyn-1.0.5-py2.7.egg/mosyn`. Download examples directory to that location:
```
-rw-r--r--   1 root  wheel    413 Sep 20 23:18 __init__.py
-rw-r--r--   1 root  wheel    439 Sep 20 23:18 __init__.pyc
-rw-r--r--   1 root  wheel    265 Sep 20 23:18 __main__.py
-rw-r--r--   1 root  wheel    256 Sep 20 23:18 __main__.pyc
drwxr-xr-x   3 root  wheel    102 Sep 20 23:18 dict
drwxr-xr-x   8 root  wheel    272 Sep 20 23:37 examples    <<===
-rw-r--r--   1 root  wheel  12023 Sep 20 23:18 mosyn.py
-rw-r--r--   1 root  wheel  12875 Sep 20 23:18 mosyn.pyc
drwxr-xr-x  12 root  wheel    408 Sep 20 23:18 util
```


Navigate to that directory and execute one of the examples; e.g.:
```
$ cd examples
python python2.x/parseFileSample.py
Processing: Poema20.txt.
.............................................................
" PUEDO " ( lema: poder )
	VMIP1S0 -> singular verb without gender 
----------------------------------------------------

" escribir " ( lema: escribir )
	V0N0000 -> undefined number verb without gender 
----------------------------------------------------

" los " ( lema: el )
	DA0MP0 -> plural male determinant 
	PP3MPA00 -> plural male pronoun 
	NCMS000 -> singular male name 
----------------------------------------------------

. . .

----------------------------------------------------

" escribo " ( lema: escribir )
	V0IP1S0 -> singular verb without gender 
	VMIP1S0 -> singular verb without gender 
----------------------------------------------------

" . " ( lema: . )
	FP -> undefined number punctuation without gender 
----------------------------------------------------
```

## Contact
Please address questions to uagdataanalysis@gmail.com
Report a bug by creating an issue in the following link:
[https://github.com/uagdataanalysis/mosynapi/issues](https://github.com/uagdataanalysis/mosynapi/issues)


<div style="text-align:center"><a rel="license" href="http://creativecommons.org/licenses/by-nc-sa/4.0/"><img alt="Creative Commons License" style="border-width:0" src="https://i.creativecommons.org/l/by-nc-sa/4.0/88x31.png" /></a><br />This work is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by-nc-sa/4.0/">Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License</a>.</CENTER></div>


