Metadata-Version: 2.1
Name: wikipedia-ql
Version: 0.0.5
Summary: Query Language for Wikipedia
Home-page: https://github.com/zverok/wikipedia_ql
Author: Victor Shepelev
Author-email: zverok.offline@gmail.com
License: UNKNOWN
Platform: UNKNOWN
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Education
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Operating System :: OS Independent
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Requires-Python: >=3.7
Description-Content-Type: text/markdown
Requires-Dist: requests
Requires-Dist: lark
Requires-Dist: soupsieve (>=2.3.1)
Requires-Dist: bs4
Requires-Dist: nltk
Requires-Dist: pyaml


# WikipediaQL: querying structured data from Wikipedia

**WikipediaQL** is an _experimental query language_ and, executable script, and Python library for querying structured data from Wikipedia. It looks like this:

```
$ wikipedia_ql --page "Guardians of the Galaxy (film)" \
    '{
      page@title as "title";
      section[heading="Cast"] as "cast" >> {
          li >> text:matches("^(.+?) as (.+?):") >> {
              text-group[group=1] as "actor";
              text-group[group=2] as "character"
          }
      };
      section[heading="Critical response"] >> {
          sentence:contains("Rotten Tomatoes") as "RT ratings" >> {
              text:matches("\d+%") as "percent";
              text:matches("(\d+) (critic|review)") >> text-group[group=1] as "reviews";
              text:matches("[\d.]+/10") as "overall"
          }
      }
    }'

title: Guardians of the Galaxy (film)
RT ratings:
  overall: 7.8/10
  percent: 92%
  reviews: '334'
cast:
- actor: Chris Pratt
  character: Peter Quill / Star-Lord
- actor: Zoe Saldaña
  character: Gamora
...
```

[Read full README.md on GitHub](https://github.com/zverok/wikipedia_ql)


