Metadata-Version: 2.4
Name: whatenc
Version: 0.1.0
Summary: Simple text encoding type classifier
Requires-Python: >=3.13
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: numpy>=2.3.4
Requires-Dist: onnxruntime>=1.23.2
Provides-Extra: dev
Requires-Dist: build>=1.3.0; extra == "dev"
Requires-Dist: twine>=6.2.0; extra == "dev"
Provides-Extra: train
Requires-Dist: datasets>=4.3.0; extra == "train"
Requires-Dist: scikit-learn>=1.7.2; extra == "train"
Requires-Dist: skl2onnx>=1.19.1; extra == "train"
Requires-Dist: joblib>=1.5.2; extra == "train"
Dynamic: license-file

# whatenc

Simple text encoding type classifier.

`whatenc` uses statistical and linguistic features to detect how a given string is encoded.

# Usage

```
pipx install whatenc
```

```
whatenc aGVsbG8gd29ybGQ=
whatenc samples.txt
```

# Examples

```
[+] input: aGVsbG8gd29ybGQ=
   [=] top guess   = base64
      [~] base64   = 0.455
      [~] plain    = 0.312
      [~] url      = 0.126

[+] input: hello
   [=] top guess   = plain
      [~] plain    = 0.552
      [~] url      = 0.246
      [~] rot13    = 0.192

[+] input: uryyb jbeyq
   [=] top guess   = rot13
      [~] rot13    = 0.555
      [~] plain    = 0.440
      [~] url      = 0.004
```
