Metadata-Version: 2.1
Name: twarc-hashtags
Version: 0.0.5
Summary: A twarc plugin to extract hashtags from Twitter data
Home-page: https://github.com/docnow/twarc-hashtags
Author: Ed Summers
Author-email: ehs@pobox.com
License: UNKNOWN
Platform: UNKNOWN
Requires-Python: >=3.3
Description-Content-Type: text/markdown
License-File: LICENSE

# twarc-hashtags

This module extends [twarc] with a `hashtags` command that will extract and
count the hashtags in a tweet dataset.

## Install

    pip install twarc-hashtags

Collect some Twitter data, for example:

    twarc2 search blacklivesmatter tweets.jsonl

Because you installed the plugin, `twarc2` now has a new `hashtags` subcommand:

    twarc2 hashtags tweets.jsonl hashtags.csv

Then open `hashtags.csv` in your favourite spreadsheet program or
DataFrame library.
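
If you would rather load the CSV in code, the following is a minimal sketch that uses only Python's standard library. The helper name `read_hashtag_csv` is hypothetical, and the sketch makes no assumption about the column names: it simply returns whatever header row the file contains.

```python
import csv
import os


def read_hashtag_csv(path):
    """Return (header, rows) from a CSV file such as hashtags.csv.

    Hypothetical helper: the column layout is not assumed, so the
    header is returned exactly as it appears in the file.
    """
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        header = next(reader, [])  # first row holds the column names
        rows = list(reader)        # remaining rows, each a list of strings
    return header, rows


if __name__ == "__main__":
    # Only read the file if the plugin has already produced it.
    if os.path.exists("hashtags.csv"):
        header, rows = read_hashtag_csv("hashtags.csv")
        print(header)
        for row in rows[:10]:
            print(row)
```

Printing the header first is a quick way to check which columns the file actually has before doing any further analysis.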

Behind the scenes, twarc-hashtags uses Python's native support for SQLite to
create a database, insert the data into it, and then query it. You can see this
database after the program finishes as `hashtags.db` in your current working
directory.
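
To look inside that database, one option is to list its tables with Python's `sqlite3` module. This is a minimal sketch: the helper name `list_tables` is hypothetical, and because the plugin's table layout is not documented here, the code only reads SQLite's own `sqlite_master` catalog instead of assuming any table or column names.

```python
import os
import sqlite3


def list_tables(db_path):
    """Return {table_name: CREATE TABLE statement} for a SQLite file.

    Hypothetical helper: it queries SQLite's built-in catalog, so it
    works without knowing how the plugin names its tables.
    """
    con = sqlite3.connect(db_path)
    try:
        cur = con.execute(
            "SELECT name, sql FROM sqlite_master "
            "WHERE type = 'table' ORDER BY name"
        )
        return dict(cur.fetchall())
    finally:
        con.close()


if __name__ == "__main__":
    # sqlite3.connect would create a missing file, so check first.
    if os.path.exists("hashtags.db"):
        for name, sql in list_tables("hashtags.db").items():
            print(name)
            print(sql)
```

Once you know the table and column names, you can run your own `SELECT` queries against the same file.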

## Options

**--group**: group results by day, week, month, or year

**--limit**: limit to this number of hashtags (per group if --group is used)

**--db**: name the database file something other than the default
`hashtags.db`

**--no-insert**: reuse an existing database and skip the insert step (useful
for large numbers of tweets)

[twarc]: https://github.com/docnow/twarc


