pdf·mux /piːdiːɛf'mʌks/ transitive verb
to route a PDF to the best extractor for each page, and verify the result automatically.
No single PDF tool wins at everything. PyMuPDF is fast but chokes on tables. Docling nails tables but crawls on simple text. Marker handles scans but is overkill for digital docs.
pdfmux picks the right tool per page, audits the output, and re-extracts if confidence is low.
pip install pdfmux