Metadata-Version: 2.1
Name: cedict-utils
Version: 0.0.4
Summary: A Python parser for the CC-CEDICT Chinese-English dictionary
Home-page: https://github.com/marcanuy/cedict_utils
Author: Marcelo Canina
Author-email: me@marcanuy.com
License: MIT
Keywords: cedict dictionary chinese cc-cedict
Platform: UNKNOWN
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Topic :: Software Development :: Libraries
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Description-Content-Type: text/markdown
Provides-Extra: dev
Requires-Dist: check-manifest ; extra == 'dev'
Provides-Extra: test
Requires-Dist: coverage ; extra == 'test'

A Python parser for the CC-CEDICT Chinese-English dictionary
============================================================

.. image:: https://img.shields.io/travis/rust-lang/rust.svg
	   :target: https://travis-ci.org/marcanuy/cedict_utils

.. image:: https://img.shields.io/pypi/v/nine.svg
	   :target: https://pypi.python.org/pypi/cedict-utils/

----

A parser for the CC-CEDICT Chinese-English dictionary.

# Install

## PIP

https://pypi.org/project/cedict-utils/

~~~
pip install cedict-utils
~~~

## REPO

1. Create virtualenv.

~~~
python3 -m venv ~/.virtualenvs/cedict-utils
~~~

2. Activate venv.

~~~
source ~/.virtualenvs/cedict-utils/bin/activate
~~~

3. Install requirements.

~~~
pip install -r requirements.txt
~~~

# Dictionary

Download dictionary

~~~
wget -O - https://www.mdbg.net/chinese/export/cedict/cedict_1_0_ts_utf-8_mdbg.txt.gz | gunzip > data/cedict_ts.u8
~~~

# Usage

~~~ python
$ python
>>> from cedict_utils.cedict import CedictParser
>>> parser = CedictParser()
>>> parser.read_file()
>>> entries = parser.parse()
>>> for e in entries:
...     print(e)
... 
..
龟缩 (龜縮) - gui1 suo1
龟背竹 (龜背竹) - gui1 bei4 zhu2
龟船 (龜船) - gui1 chuan2
..
>>> entries[200].simplified
'敦'
>>> entries[200].traditional
'㪟'
>>> entries[200].pinyin
'dun1'
>>> entries[200].raw_line
'㪟 敦 [dun1] /variant of 敦[dun1]/'
>>> entries[200].meanings
['variant of 敦[dun1]']
>>> 
~~~


# Tests

Running tests

~~~
$ make test
pytest
================================================================ test session starts ================================================================
platform linux -- Python 3.6.8, pytest-3.2.3, py-1.4.34, pluggy-0.4.0
rootdir: /Development/cedict-utils, inifile:
collected 8 items                                                                                                                                    

tests/test_cedict.py ........

============================================================= 8 passed in 0.09 seconds ==============================================================
~~~

# Resources

- Cedict project https://www.mdbg.net/chinese/dictionary?page=cc-cedict


