Metadata-Version: 2.1
Name: icdcodex
Version: 0.2.0
Summary: icd embedding for machine learning
Home-page: https://github.com/jeremyadamsfisher/icdcodex
Author: Jeremy Fisher
Author-email: jeremyf@cmu.edu
License: MIT license
Keywords: icdcodex
Platform: UNKNOWN
Classifier: Development Status :: 2 - Pre-Alpha
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Natural Language :: English
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Requires-Python: >=3.5
Requires-Dist: networkx
Requires-Dist: node2vec
Requires-Dist: xmltodict

========
icdcodex
========

.. image:: https://img.shields.io/pypi/v/icdcodex.svg
        :target: https://pypi.python.org/pypi/icdcodex


.. image:: https://readthedocs.org/projects/icd-codex/badge/?version=latest
        :target: https://icd-codex.readthedocs.io/en/latest/?badge=latest
        :alt: Documentation Status

ICD embedding for machine learning, created for `MedHacks2020 ❤️
<http://medhacks.org/?fbclid=IwAR0L-JQotA_wdVe5PTySOrPMCCknlZBb8xlHmwCkcyWPEFwTeVlk3jkyuJg/>`_.

* Free software: MIT license
* Documentation: https://icdcodex.readthedocs.io.

What is Medhacks?
-----------------
MedHacks hosted by Johns Hopkins University aims to unite talented and diverse minds from all backgrounds in order to foster a collaborative environment that aims to solve the world’s medical obstacles and issues. 

The Problem
-----------
ICD coding is a laborous, but difficult to automate by machine learning because the output space if intractably large. (ICD-10CM has over 70,000 codes.) icdcodex creates a vector embedding for this input space, making it simpler for machine learning practioners to efficiently adapt algorithms for ICD coding.

Our Solution
------------
We rely on the word2vec model to generate this embedding. In this set up, each ICD code represents a "word," whereas a path sampled from breadth-first or depth-first search represents the "sentence."


The Team
--------
* Jeremy Adams Fisher
* Alhusain Abdalla
* Natasha Nehra
* Tejas Patel
* Hamrish Saravanakumar


Features
--------

* Curated networkX graphs representing ICD9 and ICD10 hierarchies
* A simple API to generate continuous embeddings for these hierarchies

Credits
-------

This package was created with Cookiecutter_ and the `audreyr/cookiecutter-pypackage`_ project template.

.. _Cookiecutter: https://github.com/audreyr/cookiecutter
.. _`audreyr/cookiecutter-pypackage`: https://github.com/audreyr/cookiecutter-pypackage


=======
History
=======

0.1.0 (2020-09-04)
------------------

* First release on PyPI.


