Metadata-Version: 2.1
Name: mwtextextractor
Version: 0.1.2
Summary: Extracts body text from MediaWiki wikitext by stripping off templates, html tags, tables, headers, etc.
Home-page: https://github.com/danmichaelo/mwtextextractor
Author: Dan Michael O. Heggø
Author-email: danmichaelo@gmail.com
License: MIT
Description-Content-Type: UNKNOWN
Keywords: mediawiki
Platform: UNKNOWN
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 2.6
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3.2
Classifier: Programming Language :: Python :: 3.3
Requires-Dist: lxml
Requires-Dist: mwtemplates

mwtextextractor
===================

.. image:: https://travis-ci.org/danmichaelo/mwtextextractor.png?branch=master
   :target: https://travis-ci.org/danmichaelo/mwtextextractor
.. image:: https://coveralls.io/repos/danmichaelo/mwtextextractor/badge.png
   :target: https://coveralls.io/r/danmichaelo/mwtextextractor

mwtextextractor extracts plain body text from MediaWiki wikitext by stripping templates, HTML tags, tables, headers, and other markup.
The extracted text can then be used for tasks such as word counting.


Example:

.. code-block:: python

    from mwtextextractor import get_body_text
    # Parenthesized print works on both Python 2 and Python 3
    print(get_body_text('Lorem {{ipsum}} dolor'))
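
Since the extracted text is intended for word counting, a minimal sketch of that step might look like the following. The ``count_words`` helper is hypothetical and not part of mwtextextractor, and the sample string is only a stand-in for whatever ``get_body_text`` returns; it is not the library's actual output.

.. code-block:: python

    def count_words(text):
        """Count whitespace-separated words in already-extracted body text.

        Hypothetical helper for illustration; not part of mwtextextractor.
        """
        # str.split() with no arguments splits on any run of whitespace,
        # so leftover double spaces or newlines do not produce empty words.
        return len(text.split())

    # Stand-in for the string returned by get_body_text(wikitext)
    body = 'Lorem  dolor sit amet'
    print(count_words(body))

Splitting on whitespace is the simplest possible definition of a word; punctuation handling or language-specific tokenization would need a more careful approach.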


