Metadata-Version: 2.1
Name: lxml2pandas
Version: 0.10
Summary: lxml to pandas for fast web scraping
Home-page: https://github.com/hansalemaos/lxml2pandas
Author: Johannes Fischer
Author-email: aulasparticularesdealemaosp@gmail.com
License: MIT
Keywords: lxml,pandas,web scraping
Classifier: Development Status :: 4 - Beta
Classifier: Programming Language :: Python :: 3 :: Only
Classifier: Programming Language :: Python :: 3.10
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Topic :: Utilities
Description-Content-Type: text/markdown
License-File: LICENSE.rst
Requires-Dist: a_pandas_ex_apply_ignore_exceptions
Requires-Dist: fake_headers
Requires-Dist: lxml
Requires-Dist: numpy
Requires-Dist: pandas
Requires-Dist: requests


# lxml to pandas for fast web scraping

## Tested against Windows / Python 3.11 / Anaconda

## pip install lxml2pandas

```python
from lxml2pandas import subprocess_parsing
htmldata = [
    r"C:\Users\hansc\Downloads\bet365 - Apostas Desportivas Online2.mhtml",
    r"C:\Users\hansc\Downloads\bet365 - Apostas Desportivas Online.mhtml",
    r"C:\Users\hansc\Downloads\Your Repositories.mhtml",
    'https://pandas.pydata.org/docs/reference/api/pandas.concat.html'
]

df=subprocess_parsing(
    htmldata, chunks=1, processes=5, print_stdout=True, print_stderr=True
)

```
