Metadata-Version: 2.4
Name: narremgen
Version: 0.9.6
Summary: A library for generating artificial structured texts of advice from narrative, emotional, and world-context templates.
Maintainer-email: "R. Priam" <rpriam@gmail.com>
License-Expression: MIT
Project-URL: Homepage, https://github.com/narremgen
Project-URL: Repository, https://github.com/narremgen/narremgen
Project-URL: Issues, https://github.com/narremgen/narremgen/issues
Requires-Python: >=3.10
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: requests
Requires-Dist: pandas
Requires-Dist: numpy
Requires-Dist: openai>=1.0.0
Requires-Dist: google-genai>=1.0.0
Provides-Extra: emotions
Requires-Dist: transformers>=4.40; extra == "emotions"
Requires-Dist: torch; extra == "emotions"
Requires-Dist: vaderSentiment; extra == "emotions"
Provides-Extra: textstats
Requires-Dist: lexicalrichness; extra == "textstats"
Provides-Extra: plots
Requires-Dist: matplotlib; extra == "plots"
Provides-Extra: gui
Provides-Extra: all
Requires-Dist: transformers>=4.40; extra == "all"
Requires-Dist: torch; extra == "all"
Requires-Dist: google-genai>=1.0.0; extra == "all"
Requires-Dist: openai>=1.0.0; extra == "all"
Requires-Dist: lexicalrichness; extra == "all"
Requires-Dist: matplotlib; extra == "all"
Requires-Dist: vaderSentiment; extra == "all"
Requires-Dist: python-docx; extra == "all"
Requires-Dist: reportlab; extra == "all"
Dynamic: license-file

# NarrEmGen: Narrative Generation Pipeline (CLI & GUI)

Narremgen is a Python package for structured narrative text generation, combining narrative schemas (SN) and emotional dynamics (DE) to produce coherent short texts assembled into full booklets of advice or answers from a topic or a question with optional chapters. It provides a reproducible multi-batch pipeline for controlled text generation using LLM models with narrative+emotional structures with:

- Implementing Partially The SN/DE/K Method For a Controlled Generation
- Artificial structured advice micro-texts from narrative, emotional, context
- Writing in file Full Booklets of Advice or Answers from a Topic or Question
- Suitable for education & learning, or comparing and training the llm output
- Process data with five different llm calls from key in file or in environment
- Available: command lines, graphical user interface, or python programming


## Main modules of narremgen

- `pipeline`: Entry point for batch generation, variants, stats, and exports per topic run.
- `llmcore`: Unified LLM router (role→model mapping, retries, multi-provider support).
- `data`: Input preparation and CSV handling for topic–advice–prompt-based generation.
- `narratives`: Text post-processing, style control, and SN/DE-aware narrative realization.
- `variants`: Planning and batch rewriting into alternative styles (direct, formal, etc.) with stats.
- `themes`: LLM-based theme discovery and assignment for advice corpora, producing themes+assignments.
- `chapters`: Build chaptered corpora (CSV/JSON) from themes or manual grouping, for book-like exports.
- `export`: Plain-text and LaTeX exporters (merged `.txt` and `book_*.tex` from neutral and variants).
- `analyzestats`: Length, lexical, emotion and SN/DE distribution analysis, with CSV summaries and plots.
- `utils`: Shared helpers for workdirs, filenames, CSV repair, backups, and neutral corpus construction.
- `gui`: Optional Tkinter GUI for generation, or readings aligned/selected texts, or segmentation.
- `main`: Optional command-line terminal module for the generation with input arguments.

## Key features

- Generation of a Corpus of Stories (of varying and controlled structures) and Formal Texts for advice from a topic (full sentence).
- Multi-batch narrative pipeline using a configurable LLM router (`llmcore`) across several providers with a command-line interface.
- Automatic topic and advice mapping, SN/DE-structured neutral generation, and aligned variant rewriting (direct, formal, other styles).
- Robust CSV workflow: filtering, renumbering, safe merging of advice/sentence/context/mapping, consistent filenames, variant workdirs.
- LLM-driven theme extraction and assignment, plus chapter construction for organizing texts into coherent sections (classes of texts).
- Plain-text and TeX export of neutral and variant corpora (merged narrative files and full chaptered books for text reading/selection).
- Integrated corpus analysis: lexical richness, length, emotion profiles, and SN/DE distributions, including neutral vs. variant comparison.
- Textual statistics and emotion statistics of specialized language models from the literature for evaluation of generated texts or corpus.
- Ready-to-use structure for reproducible experiments in text generation with emotions for character and educational content synthesis.
- Graphical user interface for generation with api key checkings, creation of variants, and reading/selection of aligned textes for a topic.
- Available connection to OpenAI, OpenRouter, Google-GenAI, Mistral, etc for text generation (see python code and interface for dry-run).
- No limited length for topic str, available command for adding file/str long text as context for advice or generation stages in pipeline.
- Input in the pipeline a list of pre-written advice with a csv table path with col name Advice, or prefer a dedup automatically generated.

Output: Each generated corpus is stored under `outputs/` in CSV and TXT format. <br> 
The naming convention is: `outputs/<corpus_name>_1/` for its directory.<br>
Each directory contains:

```text
topic, advice, and mapping tables in csv format and generated texts
and two subdirectories containing generated batched texts + csv files
plus directories for variants with statistics + chaptered tex files
```

Note: This package is provided *“as is”* for the research and educational purposes.  <br>
      The code was written/debogged in iterative way with help of gpt5 openai + vs code. <br>
      All texts generated are synthetic and intended for future experimentations only. <br>
      Last version in directory package for pypi. 

To do: improve genericity, generality and robutness, add parallelism, classes re-factor.

## Installation

```text
pip install narremgen
```

## Ask for help and the first examples in the cli

```text
narremgen --help
```

## Usage from cli, examples of command lines

OpenAI gpt4o as the default model (use also sys env key OPENAI_API_KEY instead of txt file) + export TeX booklet

```text
narremgen --topic "Small walks, big effects" --default-model "openai\gpt-4o-mini" --export-book-tex
```

OpenRouter for  DeepSeek for mapping, Llama for narrative, GPT-4o-mini for the rest + multiple variants  

```text
narremgen --topic "Walk habits in the city" --model-advice "openrouter\openai/gpt-4o-mini" --model-mapping "openrouter\deepseek/deepseek-reasoner" --model-context "openrouter\openai/gpt-4o-mini" --model-narrative "openrouter\meta-llama/llama-3.1-70b-instruct" --model-variants-generation "openrouter\openai/gpt-4o-mini"
```

Mistral direct (OpenAI-compatible api, use sys env key) + themes enabled with custom range and batch size  

```text
narremgen --topic "Healthy routines for a walk everyday" --default-model "mistral\mistral-large-latest" --themes-min 1 --themes-max 15 --themes-batch-size 30
```

Grok default (use sys env key) + bypass variants generation to local Phi-4 (Ollama) with larger token budget  

```text
narremgen --topic "Walking around in a small town" --default-model "xai\grok-2-latest" --model-variants-generation "ollama\phi4:14b" --variant-batch-size 40 --variant-max-tokens 2500
```

Quick connectivity check (no files generated): diagnostic dry-run with longer timeout to check which models are available

```text
narremgen --diagnostic-dry-run --request-timeout 90 --model-theme-analysis "ollama\\phi3-chat:latest" --model-advice "ollama\\phi3-chat:latest" --model-mapping "ollama\\phi3-chat:latest" --model-context "ollama\\phi3-chat:latest" --model-narrative "google\\gemini-2.0-flash"
```

## Launch call for GUI

```python
# Interface generation+reading+saving
python -m narremgen.gui
```

## Custom calls with python programming for pipeline

```python
from importlib.resources import files
from narremgen import LLMConnect, run_pipeline
LLMConnect.init_global(default_model="openai/gpt-4o-mini")
assets_dir = str(files("narremgen").joinpath("settings"))
run_pipeline(
    topic="Walking in the city",
    workdir="./outputs",
    assets_dir=assets_dir,
    n_batches=2,
    n_per_batch=20,
    output_format="txt",
    verbose=False,
)
```

## Custom calls with python programming for llmcore

| Provider | Required env variables or key file | Model example (`provider\\model`) |
|---|---|---|
| OpenAI | `OPENAI_API_KEY` | `openai\\gpt-4o-mini` |
| Mistral | `MISTRAL_API_KEY` | `mistral\\mistral-small-latest` | 
| xAI / Grok | `XAI_API_KEY` or `GROK_API_KEY` | `xai\\grok-2-mini` | 
| Gemini/Google | `GEMINI_API_KEY` or `GOOGLE_API_KEY` | `gemini\\gemini-2.0-flash` | 
| Ollama (local) | `OLLAMA_HOST` (optional) | `ollama\\llama3.2:3b` | 
| OpenRouter | `OPENROUTER_API_KEY` | `openrouter\\anthropic/claude-3.5-sonnet` |

```python
from narremgen.llmcore import LLMConnect

llm = LLMConnect(
    default_model="ollama\\phi3-chat:latest",
    max_tokens=400,
    request_timeout=60,
)

messages = [
    {"role": "system", "content": "Answer concisely."},
    {"role": "user", "content": "Write 3 advice for walking in a city, 1 sentence each."},
]

reply = llm.safe_chat_completion(model="gemini\\gemini-2.0-flash", messages=messages)
print(reply)
```

## Warning

- Only informed users or trainers should use this system in practice. 
- Some advice may be missing or mistaken du to ia/programming. 
- In future automatic checkings may be implemented for end user. 
- Always do a dry-run before launching to check models in use. 

## References

- Rodolphe Priam (2025). *Narrative and Emotional Structures For Generation Of Short Texts For Advice*, [hal-05135171](https://inria.hal.science/hal-05135171), 2025.

---

© NarrEmGen Project, 2025-2026.
