Metadata-Version: 2.1
Name: xmpdf
Version: 0.5.2
Summary: Extracts email metadata and text from a PDF file
Home-page: https://github.com/history-lab/xmpdf
Author: Ben Lis, History Lab @ Columbia University
License: MIT License
Platform: UNKNOWN
Requires-Python: >=3.8
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: pdftotext
Requires-Dist: jsonpickle

# xmpdf
Extracts email metadata and text body from a PDF containing emails.

## Installation

    pip install xmpdf

## Usage

    from xmpdf import Xmpdf

    ems = Xmpdf(pdf_file)
    # print summary info about emails in PDF file
    print(ems.info())
    # process emails
    for m in ems.emails:
        process(m)

## Notes
* The initial development of this package was funded in part by The Mellon Foundation’s “Email Archives: Building Capacity and Community” program.


