Metadata-Version: 2.1
Name: loanpy
Version: 0.1.9
Summary: Python framework for detecting old loanwords
Home-page: https://github.com/martino-vic/loanpy
Author: Viktor Martinović
Author-email: viktor.martinovic@hotmail.com
License: UNKNOWN
Project-URL: Bug Tracker, https://github.com/pypa/sampleproject/issues
Description: ============
        Installation
        ============
        
        ::
        
            $ python -m pip install loanpy
        	
        .. image:: PyPI_logo.svg
           :target: https://pypi.org/project/loanpy/
        	
        
        ================
        Documentation
        ================
        
        
        .. image:: white_logo_dark_background.jpg
           :target: https://martino-vic.github.io/loanpy/index.html
           
            
        ====================
        Citation
        ====================
        
        .. image:: zenodo.4716940.svg
           :target: https://zenodo.org/badge/latestdoi/259611202
        
        ============
        Description
        ============
        
        loanpy is a toolkit for historical linguists.
        It extracts sound changes from an etymological dictionary.
        It reconstructs hypothetical roots of modern L1 words.
        It creates hypothetical adaptions of L2 words into proto-L1.
        It searches for potential loanwords by first finding phonetic matches
        and then calculating their semantic similarity.
        
        
        Data Sources
        ~~~~~~~~~~~~~~~~~~~~~~~
        
        - **dfhun_zaicz_backup.csv**: data frame based on the `Hungarian etymological dictionary (Zaicz 2006) <https://regi.tankonyvtar.hu/hu/tartalom/tinta/TAMOP-4_2_5-09_Etimologiai_szotar/adatok.html>`__
        
        - **dfgot_wikiling_backup.csv**: data frame based on `Wikiling <https://koeblergerhard.de/wikiling/?f=got>`__
        
        - **dfgot_wiktionary_backup.csv**: data frame based on `Wiktionary <https://en.wiktionary.org/wiki/Category:Gothic_lemmas>`__
        
        - **dfuralonet.csv**: data frame based on `Uralonet <http://uralonet.nytud.hu>`__
        
        - **substi.csv**: Sound substitutions based on `Information-theoretic causal inference of lexical flow (Dellert 2017)  <https://langsci-press.org/catalog/book/233>`__
        
        - **wordvectornames.xlsx**: Names of pretrained word vector models from `gensim-data  <https://github.com/RaRe-Technologies/gensim-data>`__
        
        
        Dependencies
        ~~~~~~~~~~~~~~~~~~~
        
        - `gensim  <https://pypi.org/project/gensim/>`__
        
        - `ipatok  <https://pypi.org/project/ipatok/>`__
        
        - `python-Levenshtein <https://pypi.org/project/python-Levenshtein>`__
        
        - `pandas <https://pypi.org/project/pandas/>`__
        
        - see also requirements.txt
        
        
        License
        ~~~~~~~~~~~~~~~~
        
        Academic Free License (AFL)
        
        
        
        =======
        Git
        =======
        
        .. image:: Octocat.png
           :target: https://github.com/martino-vic/loanpy
           :scale: 30%
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.6
Description-Content-Type: text/x-rst
