nltk
scikit-learn
PyPDF2
