Metadata-Version: 2.1
Name: dvg
Version: 1.0.0b11
Summary: document-vector grep
Home-page: https://github.com/tos-kamiya/dvg
Author: Toshihiro Kamiya
Author-email: kamiya@mbj.nifty.com
License: BSD 2-Clause License
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: License :: OSI Approved :: BSD License
Classifier: Operating System :: MacOS :: MacOS X
Classifier: Operating System :: POSIX :: Linux
Classifier: Operating System :: Microsoft :: Windows :: Windows 10
Requires-Python: >=3.8
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: appdirs
Requires-Dist: docx2txt
Requires-Dist: html2text
Requires-Dist: init-attrs-with-kwargs
Requires-Dist: nltk
Requires-Dist: numpy
Requires-Dist: psutil
Requires-Dist: toml
Requires-Dist: transformers
Requires-Dist: win-wildcard
Requires-Dist: colorama ; platform_system == "Windows"
Provides-Extra: docopt
Requires-Dist: docopt ; extra == 'docopt'
Provides-Extra: docopt-ng
Requires-Dist: docopt-ng ; extra == 'docopt-ng'
Provides-Extra: ja
Requires-Dist: fugashi ; extra == 'ja'
Requires-Dist: ipadic ; extra == 'ja'

# dvg

dvg, a document-vector grep.

`dvg` is an off-the-shelf grep tool that performs semantic similarity search.
Using SCDV models, it searches document files for parts that are similar to the query.
It supports text files (.txt), PDF files (.pdf), and MS Word files (.docx).
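
To give a rough idea of what a document-vector search does, here is a minimal sketch in plain Python. It is not `dvg`'s implementation: the word vectors are made-up toy values, the names `WORD_VECTORS`, `embed`, `cosine`, and `search` are hypothetical, and the passage vector is a simple average of word vectors rather than an SCDV vector.

```python
import math

# Hypothetical toy word vectors; a real model learns these from a corpus.
WORD_VECTORS = {
    "cat": [1.0, 0.0, 0.0],
    "kitten": [0.9, 0.1, 0.0],
    "dog": [0.7, 0.3, 0.0],
    "stock": [0.0, 0.0, 1.0],
    "market": [0.0, 0.1, 0.9],
}


def embed(text):
    """Average the vectors of known words (a stand-in for SCDV, not SCDV itself)."""
    vectors = [WORD_VECTORS[w] for w in text.lower().split() if w in WORD_VECTORS]
    if not vectors:
        return [0.0, 0.0, 0.0]
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def cosine(a, b):
    """Cosine similarity of two vectors; 0.0 if either vector is all zeros."""
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b))
    return sum(x * y for x, y in zip(a, b)) / norm if norm else 0.0


def search(query, passages):
    """Return the passages ordered from most to least similar to the query."""
    query_vector = embed(query)
    return sorted(passages, key=lambda p: cosine(query_vector, embed(p)), reverse=True)


passages = ["the stock market fell", "a kitten and a dog"]
ranked = search("cat", passages)
print(ranked[0])
```

In this toy setup the query `cat` ranks the passage about a kitten and a dog first even though the word "cat" does not appear in it, which is the kind of match a plain keyword grep would miss.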

Screenshot:

![](https://github.com/tos-kamiya/dvg/blob/main/docs/images/run1.png?raw=True)

Please refer to the [GitHub home page](https://github.com/tos-kamiya/dvg) for installation and usage.


