Metadata-Version: 2.1
Name: zoegas
Version: 1.1.2
Summary: Old Norse Zoëga's dictionary
Home-page: https://github.com/cltk/old_norse_dictionary_zoega
Author: Clément Besnier
Author-email: clemsciences@aol.com
License: License :: OSI Approved :: MIT License
Keywords: nlp,old-norse,philology,dictionary,corpus
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: Operating System :: OS Independent
Classifier: License :: OSI Approved :: MIT License
Classifier: Topic :: Text Processing
Classifier: Topic :: Text Processing :: Markup :: XML
Requires-Python: >=3.6
Description-Content-Type: text/markdown
Requires-Dist: cltk (==0.1.121)
Requires-Dist: lxml

[![PyPI](https://img.shields.io/pypi/v/zoegas)](https://pypi.org/project/zoegas/) [![Build Status](https://travis-ci.org/cltk/old_norse_dictionary_zoega.svg?branch=master)](https://travis-ci.org/cltk/old_norse_dictionary_zoega)

# Old Norse Dictionary Zoega
Zoëga's A Concise Dictionary of Old Icelandic parser

With the **reader.py** module, you can:

* search a word with an edit distance below a given threshold,
* extract the POS tags in dictionary entries,
* search for exact entry and approximate entry.

However POS tag extractor is not very efficient. More special cases need to be handled.

TODO list:
* [x] look up a word in dictionary
* [x] search all words in dictionary which are at most at a given edit distance with the query word
* [x] for a given dictionary entry, give all its inflected forms (partially done),
* [ ] handle more dictionary entry,
* [ ] process all entries so that we would get virtually all the Old Norse words,
* [ ] for each form, we can associate lemmata with a proposed POS tag.

## Elaboration of data

Data come from https://github.com/GreekFellows/lesser-dannatt and http://norroen.info/dct/zoega by Tim Ermolaev.
Then `utils.first_step()` is launched. Files are modified in order to ensure 
XML syntax consistency, finally `utils.second_step()` is launched.


