Metadata-Version: 1.1
Name: urlextract
Version: 0.2.7
Summary: Collects and extracts URLs from given text.
Home-page: https://github.com/lipoja/URLExtract
Author: Jan Lipovský
Author-email: janlipovsky@gmail.com
License: MIT
Description: URLExtract
        ----------
        
        URLExtract is a Python class for collecting (extracting) URLs from a
        given text.
        
        How does it work
        ~~~~~~~~~~~~~~~~
        
        It searches the given text for any occurrence of a TLD (top-level
        domain). When a TLD is found, it expands the boundaries from that
        position in both directions until it reaches a "stop character"
        (usually whitespace, a comma, or a single or double quote).
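        
        The idea can be illustrated with a minimal sketch. This is not the
        library's actual implementation: the function name
        ``find_urls_sketch``, the short ``TLDS`` tuple and the ``STOP_CHARS``
        set are assumptions made up for this example, and the real class
        works with a full TLD list.
        
        .. code:: python
        
            # Hypothetical stop characters: whitespace, comma, single/double quote.
            STOP_CHARS = set(" \t\n\r,'\"")
            # Hypothetical sample of TLDs; the real list is much longer.
            TLDS = (".com", ".cz", ".org")
        
        
            def find_urls_sketch(text, tlds=TLDS, stop_chars=STOP_CHARS):
                """Return URL-like substrings found by expanding around TLD matches."""
                found = []
                for tld in tlds:
                    start = text.find(tld)
                    while start != -1:
                        # Expand left until a stop character or the start of the text.
                        left = start
                        while left > 0 and text[left - 1] not in stop_chars:
                            left -= 1
                        # Expand right until a stop character or the end of the text.
                        right = start + len(tld)
                        while right < len(text) and text[right] not in stop_chars:
                            right += 1
                        candidate = text[left:right]
                        # Skip a bare TLD with nothing in front of it, and duplicates.
                        if left < start and candidate not in found:
                            found.append(candidate)
                        start = text.find(tld, start + 1)
                return found
        
        
            print(find_urls_sketch("Let's have URL janlipovsky.cz as an example."))
            # prints: ['janlipovsky.cz']
        
        Because the sketch stops only at the characters listed above, a URL
        directly followed by a period would keep that period; handling such
        cases is left out here for brevity.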
        
        Requirements
        ~~~~~~~~~~~~
        
        -  The ``idna`` package, used to convert links to IDNA format
        
           ::
        
               pip install idna
        
        Example
        ~~~~~~~
        
        You can look at the command-line program *bin/urlextract*,
        but everything you need to know is this:
        
        .. code:: python
        
            from URLExtract import URLExtract
        
            extractor = URLExtract()
            urls = extractor.find_urls("Text with URLs. Let's have URL janlipovsky.cz as an example.")
            print(urls)  # prints: ['janlipovsky.cz']
        
        License
        ~~~~~~~
        
        This piece of code is licensed under The MIT License.
        
Keywords: url,extract,find,finder,collect,link,tld,list
Platform: UNKNOWN
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3.4
Classifier: Programming Language :: Python :: 3.5
Classifier: Topic :: Text Processing
Classifier: Topic :: Text Processing :: Markup :: HTML
Classifier: Topic :: Software Development :: Libraries :: Python Modules
