Metadata-Version: 2.1
Name: pyresparser
Version: 1.0.1
Summary: A simple resume parser used for extracting information from resumes
Home-page: https://github.com/OmkarPathak/pyresparser
Author: Omkar Pathak
Author-email: omkarpathak27@gmail.com
License: GPL-3.0
Platform: UNKNOWN
Classifier: Intended Audience :: Developers
Classifier: Topic :: Software Development :: Libraries
Classifier: License :: OSI Approved :: GNU General Public License v3 (GPLv3)
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.3
Classifier: Programming Language :: Python :: 3.4
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Requires-Dist: attrs (>=19.1.0)
Requires-Dist: blis (>=0.2.4)
Requires-Dist: certifi (>=2019.6.16)
Requires-Dist: chardet (>=3.0.4)
Requires-Dist: cymem (>=2.0.2)
Requires-Dist: docx2txt (>=0.7)
Requires-Dist: idna (>=2.8)
Requires-Dist: jsonschema (>=3.0.1)
Requires-Dist: nltk (>=3.4.3)
Requires-Dist: numpy (>=1.16.4)
Requires-Dist: pandas (>=0.24.2)
Requires-Dist: pdfminer.six (>=20181108)
Requires-Dist: preshed (>=2.0.1)
Requires-Dist: pycryptodome (>=3.8.2)
Requires-Dist: pyrsistent (>=0.15.2)
Requires-Dist: python-dateutil (>=2.8.0)
Requires-Dist: pytz (>=2019.1)
Requires-Dist: requests (>=2.22.0)
Requires-Dist: six (>=1.12.0)
Requires-Dist: sortedcontainers (>=2.1.0)
Requires-Dist: spacy (>=2.1.4)
Requires-Dist: srsly (>=0.0.7)
Requires-Dist: thinc (>=7.0.4)
Requires-Dist: tqdm (>=4.32.2)
Requires-Dist: urllib3 (>=1.25.3)
Requires-Dist: wasabi (>=0.2.2)

pyresparser
===========

A simple resume parser used for extracting information from resumes

Features
========

-  Extract name
-  Extract email
-  Extract mobile numbers
-  Extract skills
-  Extract total experience
-  Extract education (not very accurate)
-  Extract experience (not very accurate)

Installation
============

-  You can install this package using

.. code:: bash

    pip install pyresparser

-  For NLP operations we use spacy and nltk. Install them using below
   commands:

.. code:: bash

    # spaCy
    python -m spacy download en_core_web_sm

    # nltk
    python -m nltk.downloader words

-  For extracting other supporting dependencies, execute:

.. code:: bash

    # If you want to parse .docx and .pdf files (all OS supported)
    pip install -r resume_parser/requirements.txt

    # If you want to parse .docx, .doc and .pdf files (Ubuntu and OSX supported)
    pip install -r resume_parser/requirements_with_textract.txt

-  Modify ``resume_parser/resume_parser/skills.csv`` as per your
   requirements

-  Modify ``Education Degrees`` as per you requirements in
   `resume\_parser/resume\_parser/constants.py <https://github.com/OmkarPathak/ResumeParser/blob/master/constants.py>`__

-  Place all the resumes that you want to parse in
   ``resume_parser/resumes/`` directory

-  Run ``python resume_parser/cli.py -f <resume_file_path>``

CLI
===

For running the resume extractor you can also use the ``cli`` provided

.. code:: bash

    usage: cli.py [-h] [-f FILE] [-d DIRECTORY]

    optional arguments:
      -h, --help                              show this help message and exit
      -f FILE, --file FILE                    resume file to be extracted
      -d DIRECTORY, --directory DIRECTORY     directory containing all the resumes to be extracted
      -r REMOTEFILE, --remotefile REMOTEFILE  remote path for resume file to be extracted

For extracting data from a single resume file, use

.. code:: bash

    python resume_parser/cli.py -f <resume_file_path>

For extracting data from several resumes, place them in a directory and
then execute

.. code:: bash

    python resume_parser/cli.py -d <resume_directory_path>

For extracting data from remote resumes, execute

.. code:: bash

    python resume_parser/cli.py -r <path_to_remote_resume_file>

Notes:
======

-  If you are running the app on windows, then you can only extract
   .docs and .pdf files

Result
======

The module would return a list of dictionary objects with result as
follows:

::

    [{'education': [('BE', '2014')],
      'email': 'omkarpathak27@gmail.com',
      'experience': [' Schlumberger DATA ENGINEER Pune'],
      'mobile_number': '8087996634',
      'name': 'Omkar Pathak',
      'no_of_pages': 3,
      'skills': ['Python',
                 'C',
                 'Technical',
                 'Linux',
                 'Machine learning',
                 'System',
                 'Html',
                 'C++',
                 'Security',
                 'Testing',
                 'Content',
                 'Apis',
                 'Engineering',
                 'Payments',
                 'Django',
                 'Excel',
                 'Admissions',
                 'Mysql',
                 'Windows',
                 'Automation',
                 'Opencv',
                 'Website',
                 'Css',
                 'Js',
                 'Algorithms',
                 'Flask',
                 'Programming',
                 'Writing',
                 'Training',
                 'Php',
                 'Reports',
                 'Photography',
                 'Open source',
                 'Github',
                 'Analytics',
                 'Api'],
      'total_experience': 0.58}]

References that helped me get here
==================================

-  https://www.kaggle.com/nirant/hitchhiker-s-guide-to-nlp-in-spacy

-  https://www.analyticsvidhya.com/blog/2017/04/natural-language-processing-made-easy-using-spacy-%E2%80%8Bin-python/

-  [https://medium.com/@divalicious.priya/information-extraction-from-cv-acec216c3f48](https://medium.com/@divalicious.priya/information-extraction-from-cv-acec216c3f48)

Built with ♥ and :coffee: by ```Omkar Pathak`` <http://www.omkarpathak.in/>`__
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Donation
========

If you have found my softwares to be of any use to you, do consider
helping me pay my internet bills. This would encourage me to create many
such softwares :)

+-----------+----+
| PayPal    |    |
+===========+====+
| ₹ (INR)   |    |
+-----------+----+



