Metadata-Version: 2.4
Name: nielsen
Version: 3.1.6
Summary: Organize your digital media collections.
Author-email: "Michael \"Irish\" O'Neill" <irish.dot@gmail.com>
License-Expression: GPL-3.0-or-later
License-File: LICENSE
Keywords: metadata,renamer,tv
Classifier: License :: OSI Approved :: GNU General Public License v3 or later (GPLv3+)
Classifier: Operating System :: POSIX :: Linux
Classifier: Operating System :: Unix
Classifier: Programming Language :: Python :: 3 :: Only
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Communications :: File Sharing
Classifier: Topic :: Multimedia
Classifier: Topic :: Multimedia :: Video
Classifier: Topic :: System :: Archiving
Classifier: Topic :: Utilities
Classifier: Typing :: Typed
Requires-Python: >=3.12
Requires-Dist: requests>=2.32.3
Requires-Dist: typer>=0.15.1
Description-Content-Type: text/markdown

# Nielsen

Nielsen helps rename and organize digital media collections.

Consider a filename such as `The.Wheel.of.Time.S01E06.720p.WEB.x265-MiNX.mkv`.
If you'd prefer something more along the lines of `The Wheel of Time -01.06-
The Flame of Tar Valon.mkv`, Nielsen can help.

Nielsen endeavors to automatically handle these types of renaming operations,
as well as move the files into a more hierarchical folder structure that keeps
your media directories tidy.

## Requirements

- Python 3.12 (or newer)
- [uv][uv] (for source installations)

## Install

### PyPI

This package is published on [PyPI][pypi-nielsen] and can be installed from
there with the following command.

```bash
pip install nielsen
```

### Source

This package can also be installed from source by cloning this repository and
using [uv][uv].

```bash
uv sync
# Optionally, install the dev dependencies if you wish modify the package
uv sync --all-extras --dev
```

## Concepts

- [`Media`](./ARCHITECTURE.md#nielsenmediamedia) - Represents a file on disk
  and its metadata. The specific subclass determines how metadata is inferred
  and which `Fetcher` should be used.
- [`Fetcher`](./ARCHITECTURE.md#nielsenfetcherfetcher) - Queries external
  systems/services for metadata about a given `Media` instance.
- [`Processor`](./ARCHITECTURE.md#nielsenprocessorprocessor) - A grouping of
  a `MediaType` and `Fetcher`. Given a file, the `Processor` creates the
  appropriate `Media` (based on the `MediaType`) and `Fetcher` to query for
  metadata.

Under the hood, "processing" a file will create a `Media` object from the given
path and `MediaType`, infer metadata based on the filename (e.g. TV series
name, season number, episode number, and episode title).

If the `fetch` option is enabled, then a `Fetcher` will also be created to
query more information (e.g. `Ted.Lasso.S01.E01.mp4` doesn't give us an episode
title, but the `Fetcher` can ask [TVMaze][tvmaze] for it).

`nielsen` can then rename the file (e.g. `Ted Lasso -01.04- For the
Children.mp4`) and move it to your media library (e.g. `~/Videos/TV/Ted
Lasso/Season 01/`).

See the [ARCHITECTURE][architecture] file for more detailed information.

## Usage

The primary use case `nielsen` is intended to handle is that of renaming a file
and moving it into the correct location on disk. This is done with the
`process` subcommand.

All commands (and subcommands) will accept a `--help` flag to provide more
information about usage.

```bash
# Process a given file
nielsen process PATH_TO_FILE

# See a list of all subcommands
nielsen --help
```

### `tv` subcommand

The `tv` subcommand can be used to make some simple queries to TVMaze and to
apply the results of those queries to files on-demand.

#### `nielsen tv fetch`

The `tv fetch` subcommand queries the TVMaze API. The more information you
provide, the more specific the results. You can fetch information about a
series, a season, or an episode.

```bash
# Get information about a series
nielsen tv fetch --series "Ted Lasso"
# Get information about a specific season of a series
nielsen tv fetch --series "Ted Lasso" --season 2
# Get information about a specific episode of a season of a series
nielsen tv fetch --series "Ted Lasso" --season 2 --episode 5
```

Additionally, the `--raw` flag can be used with any of the commands above to
pretty print the actual JSON response.

#### `nielsen tv apply`

The `tv apply` subcommand can be used to apply specific metadata to a file and
rename it rather than attempting to infer all the metadata from the filename.

For example, you may have a collection of files that are already organized by
series name and season, but the file names are simply: `1.mkv`, `2.mkv`,
`3.mkv`, etc. While you may know exactly what they are, `nielsen` only
considers the filename when attempting to infer metadata (not parent directory
names).

Specifying the series name, season number, and episode number applies the
information to a single file. Specifying only the series name and season number
can apply information about each episode in the season to a collection of
files.

```bash
# Rename a single file
nielsen tv apply --series "Ted Lasso" --season 2 --episode 5 "5.mkv"
# Map each file in a directory to an episode in a season
nielsen tv apply --series "Ted Lasso" --season 2 $(ls | sort -n)
```

It's worth checking on how your shell will expand the `*` before attempting to
use `apply` on all files in a directory, because the files are processed in the
order they're given. Without leading zeroes, they're likely to expand in
lexicographical order rather than numerical order, e.g. `1.mkv`, `10.mkv`,
`11.mkv`, `2.mkv`, `3.mkv`, `4.mkv`, etc. `ls | sort -n` can be used to ensure
the files are sorted numerically.

## Configuration

`nielsen` uses an [INI file][wikipedia-ini] for configuration. The module will
look for files in the following locations, and load them in order (overwriting
old values with newer values as they're loaded):

- `/etc/nielsen/config.ini`
- `~/.config/nielsen/config.ini`

The CLI uses these configuration files to determine its default behavior, but
also accepts a few command line flags (e.g. `--[no-]organize`) to modify its
behavior at runtime. Command line arguments will take precedence over the
configuration files.

### Sections and Options

Each section (denoted with square-brackets around the name) may contain
multiple options.

#### `[nielsen]`

Contains options about the behavior of the application itself.

##### `fetch`

Whether or not to create and use a `Fetcher` to obtain additional metadata
about a file. If there is sufficient information in the filename or there is no
network connection, the data fetching process can be skipped altogether.

*Values*: `True` or `False`

*Default*: `True`

##### `interactive`

Whether or not to prompt the user to select their desired result when a
`Fetcher` yields multiple results (e.g. "The Office"). In non-interactive mode,
the first result is used.

*Values*: `True` or `False`

*Default*: `True`

##### `logfile`

The location on disk to which `nielsen` log output should be written.

*Value*: *A file the user can write to*

*Default*: `~/.local/log/nielsen/nielsen.log`

##### `loglevel`

The [Python logging level][python-logging] at which to filter log messages.

*Values*: `NOTSET`, `DEBUG`, `INFO`, `WARNING`, `ERROR`, `CRITICAL`

*Default*: `WARNING`

##### `mode`

The [numeric notation][wikipedia-fs-mode] for the file mode (or permissions) a
file should be given during processing.

*Default*: 644

##### `organize`

Whether or not to move files to the media library during processing.

*Values*: `True` or `False`

*Default*: `True`

##### `rename`

Whether or not to rename a file with its new metadata during processing.

*Values*: `True` or `False`

*Default*: `True`

##### `simulate`

Whether or not to actually modify files (e.g. rename and move them), or just
print/log the actions which would otherwise be taken during processing.

*Values*: `True` or `False`

*Default*: `False`

##### `transform`

Whether or not to transform (or modify) the inferred or fetched data during
processing. The exact nature of these transformations are defined by the
`MediaType`.

*Values*: `True` or `False`

*Default*: `True`

#### `[media]`

Contains options to apply to generic `Media` objects. Currently, there's no
real use for this. However, options described here can be used in any other
`Media` type section (e.g. `tv`).

##### `library`

The root directory into which a file should be moved as part of its processing.
Each `Media` type may define its own `library` path.

*Value*: *A directory the user can write to*

*Default*: `$HOME`

##### `owner`

The user name or system UID which should own the file after organizing. This
value is optional, and if left undefined, the file ownership will not be
changed when organizing.

*Value*: A system username (e.g. `irish`) or UID (e.g. `1000`).

*Default*: `None`

##### `group`

The group name or system GID which should own the file after organizing. This
value is optional, and if left undefined, the file ownership will not be
changed when organizing.

*Value*: A system group name (e.g. `user`) or GID (e.g. `1001`).

*Default*: `None`

#### `[tv]`

Contains options to apply to `TV` objects. It is recommended that you set a
`library` option to keep each media type separated. This value will overwrite
the default `[nielsen]` or `[media]` section options for files processed as
`TV`.

#### `[tv/transform/series]`

This section defines transformations for `TV.series` values. This can be used
to normalize series names so that all files get organized into the same
directory and are named consistently. Consider the series [Marvel's Agents of
S.H.I.E.L.D.][tvmaze-agents-of-shield]. You may have files with various naming
conventions:

1. Agents Of S.H.I.E.L.D.
1. Agents Of SHIELD
1. Agents Of Shield
1. Agents of S.H.I.E.L.D.
1. Agents of SHIELD
1. Agents of Shield
1. Marvel's Agents Of S.H.I.E.L.D.
1. Marvel's Agents Of SHIELD
1. Marvel's Agents Of Shield
1. Marvel's Agents of S.H.I.E.L.D.
1. Marvel's Agents of SHIELD
1. Marvel's Agents of Shield

By default, `nielsen` will see each of these as a different series. Defining a
variant and a preferred name allows `nielsen` to treat them all as a single
series. The variants used to choose a transformation are case insensitive, so
you can fix capital letters/title casing with minimal repetition.

*Values*: `Variant Series Name = Preferred Series Name`

```ini
[tv/transform/series]
Agents of S.H.I.E.L.D. = Marvel's Agents of S.H.I.E.L.D.
Agents of SHIELD = Marvel's Agents of S.H.I.E.L.D.
Marvel's Agents of S.H.I.E.L.D. = Marvel's Agents of S.H.I.E.L.D.
Marvel's Agents of SHIELD = Marvel's Agents of S.H.I.E.L.D.

Its Always Sunny In Philadelphia = It's Always Sunny in Philadelphia
It's Always Sunny In Philadelphia = It's Always Sunny in Philadelphia
```

#### `[tvmaze/ids]`

Fetching an episode title from [TVMaze][tvmaze] requires a series ID, season
number, and episode number. Once a series name to series ID mapping has been
determined, `nielsen` will attempt to save it in this section to avoid having
to do the same lookup again in the future.

*Values*: `Series Name = Series ID`

[architecture]: ARCHITECTURE.md
[pypi-nielsen]: https://pypi.org/project/Nielsen/
[python-logging]: https://docs.python.org/3/library/logging.html#logging-levels
[tvmaze-agents-of-shield]: https://www.tvmaze.com/shows/31/marvels-agents-of-shield
[tvmaze]: https://tvmaze.com/
[uv]: https://docs.astral.sh/uv/
[wikipedia-fs-mode]: https://en.wikipedia.org/wiki/File-system_permissions#Numeric_notation
[wikipedia-ini]: https://en.wikipedia.org/wiki/INI_file
