Reference Pipeline¶
This pipeline is for private literary analysis using publicly readable reference text.
Run crawl + analysis¶
uv run story-reference --max-episodes 20
Outputs are written under:
work/reference_data/n2267be/meta/index.jsonwork/reference_data/n2267be/raw/*.jsonwork/reference_data/n2267be/samples/story_sample.mdwork/reference_data/n2267be/analysis/analysis.jsonwork/reference_data/n2267be/analysis/analysis.md
Translate with LibreTranslate¶
Run your own local LibreTranslate service first, then:
uv run story-reference \
--translate-provider libretranslate \
--libretranslate-url http://localhost:5000 \
--max-episodes 3
Offline translation¶
Install the optional group:
uv sync --group translation
Then run with:
uv run story-reference --translate-provider argos --max-episodes 3
Use --translate-provider chain to try Argos first and fall back to LibreTranslate.
Batch pipeline runner¶
For chapter text files on disk, use the batch pipeline:
uv run story-pipeline-batch \
--source-dir work/resources/re_zero/n2267be/chapters \
--run-id re-zero \
--translate-provider none \
--mode analyze
Outputs land in work/pipeline_runs/re-zero/ with per-chapter summaries and a
summary.json timing report.
Focus names for character tracking¶
Edit:
work/resources/focus_names.txt
Then re-run the pipeline to refresh mention counts in analysis output.
Politeness and usage¶
- Default crawl delay is
1.1s. - Keep this for local reference and study.
- Avoid redistributing full third-party text.