Metadata-Version: 1.1
Name: lurk
Version: 0.1.2
Summary: Extract html from one or multiple url's
Home-page: https://github.com/mateogianolio/lurk
Author: Mateo Gianolio
Author-email: gianoliomateo@gmail.com
License: MIT
Description: lurk
        ====
        
        A script which extracts HTML from web pages that match a certain CSS pattern.
        ::
        
            $ pip install lurk
        
        =====
        usage
        =====
        
        **in python**
        
        In python, lurk returns a dictionary:
        
        ::
        
            from lurk import lurk
        
            for link in lurk('http://en.wikipedia.org/wiki/en', 'a'):
                if 'href' in link:
                    print link['href']
        
        **in bash**
        
        In bash, lurk returns JSON.
        
        Familiarize yourself with `CSS attribute selectors <https://developer.mozilla.org/en-US/docs/Web/CSS/Attribute_selectors>`_.
        
        ::
        
            $ lurk \
            http://www.gnu.org/software/libc/manual/html_node/Function-Index.html \
            'a[href*="#index-"]' \
            > links.json
        
        This command saves a JSON object containing an array of links to all GNU C functions into **links.json**:
        
        ::
        
            [
              {
                "code": "*pthread_getspecific",
                "href": "Thread_002dspecific-Data.html#index-_002apthread_005fgetspecific"
              },
        
              {
                "code": "*sbrk",
                "href": "Resizing-the-Data-Segment.html#index-_002asbrk"
              },
        
              // ...
            ]
        
Keywords: lurk lurker scrape scraper scraping webscrape crawl crawler crawling
Platform: UNKNOWN
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: Topic :: Software Development :: Libraries
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 2
Classifier: Programming Language :: Python :: 2.6
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.2
Classifier: Programming Language :: Python :: 3.3
Classifier: Programming Language :: Python :: 3.4
