ADR-0020: WorldScripture resource class
Status: Accepted Date: 2026-05-21
Context
Second UC secondary work to be ingested (after BR per ADR-0016): World Scripture II — World Scripture and the Teachings of Sun Myung Moon (Chung Hwan Kwak, ed., Universal Peace Federation, 2014). 1178-page comparative anthology that builds on Andrew Wilson’s 1991 World Scripture: A Comparative Anthology of Sacred Texts by interleaving Sun Myung Moon’s teachings alongside cross-religion quotes, organized by theme.
Structurally distinct from prior classes:
- Anthology, not monograph. Each chapter is a curated bundle of short quotes from many traditions (Bible, Bhagavad Gita, Qur’an, Dhammapada, Tao Te Ching, SMM speeches, …) clustered under sub-themes. Wilson/Kwak’s editorial framing — the choice of sub-themes, the juxtaposition of quotes across traditions, the introductory commentary — is itself the value being cited, not the quotes in isolation.
- Two-axis citation problem. A reader of an atomic citing “Bhagavad Gita 3:35” found via WS-II has two valid sources: the original scripture (canonical, but not ingested in our vault) and WS-II (proximate, ingested). Per ADR-0003 a citation must point to a defined class; we have no Bhagavad-Gita / Qur’an / Dhammapada classes today and adding them all up front is impractical.
- Hierarchy. 4 Parts, 22 numbered Chapters, ~200 sub-themes. Parts are top-level navigational dividers; chapters are the natural unit of thematic coherence; sub-themes are the natural citation anchor.
Decision
- Class code:
WorldScripture - Path:
/resources/WorldScripture/ - File granularity: one file per chapter (per ADR-0004), nested in
Part{N}/subfolder (mirrors CSG’sBook{NN}/pattern from ADR-0018). 22 chapter files + front matter (Preface, Invocation) and back matter (Notes, List of Sources) at root. - Filenames:
ws-{part#}-{chapter#}-{slug}.md, e.g.ws-1-3-purpose-of-human-life.md. Single-digit part/chapter numbers (parts ≤4, chapters ≤22). - Internal hierarchy inside each chapter file: sub-theme = H1. Marker’s heading detection picks these up from the printed book’s visual styling; stage-2 cleanup normalizes.
- Citation form:
[[WorldScripture/Part{N}/ws-{part#}-{chapter#}-{slug}#sub-theme-slug]]- Example:
[[WorldScripture/Part2/ws-2-6-evil-sin-and-fall#human-pride|human-pride]]
- Example:
- Hybrid citation policy for quotes inside WS-II. WS-II is the citation target for the anthology’s editorial framing (sub-theme intros, chapter intros, juxtaposition). For individual quotes whose original tradition has its own ingested resource class, atomics cite the original (e.g.
[[Bible/Romans#121|1:21]]). For quotes whose original class is not yet ingested — the common case at time of writing — cite WS-II and migrate the citation later when the original lands. The pre-commit hook continues to enforce grammar; placeholder dead links are not allowed. - Class category: UC secondary literature, same bucket as BR. Future original-tradition classes (Bhagavad-Gita, Qur’an, Dhammapada, etc.) get their own ADRs when first needed.
Alternatives considered
- Cite originals only (atomics never point at WS-II). Rejected — most originals aren’t ingested; would force ingesting ~15 world-scripture corpora before any WS-II atomic could be written.
- Cite WS-II only (treat originals as unrecoverable). Rejected — collapses Wilson/Kwak’s curation with the underlying quotes; loses the ability to ever distinguish “what Wilson chose to highlight” from “what the Bhagavad Gita actually says”. Hybrid preserves the option.
- Cite both, always, in the same atomic. Rejected — doubles frontmatter clutter and creates a maintenance burden (every original-tradition ingest triggers a migration sweep across all WS-II-derived atomics anyway).
- Flat layout (no
Part{N}/subfolders). Rejected — 22 chapters is enough that grouping aids navigation, and the Parts are semantically real divisions in Wilson’s argument (God/Creation → Sin/Salvation → Path of Life → Family/Society) rather than arbitrary slabs. - Sub-theme as file granularity (one file per sub-theme, ~200 files). Rejected per ADR-0004 — sub-themes are too small to be coherent resource units; the chapter is the natural argumentative beat.
Consequences
- (+) WS-II becomes ingestible. Stage-1 extraction via
scripts/ingest_pdf.py --split-level 2after bookmarks are added per the ADR-0019 sub-pipeline. - (+) Hybrid citation policy is explicit and reviewable. New atomics either cite an ingested original or cite WS-II with the understanding that migration is expected when the original arrives.
- (+) Part-subfolder layout means URL paths on the published site are
/resources/WorldScripture/Part2/ws-2-6-evil-sin-and-fall/— readable, mirrors the print structure. - (−) WS-II PDF has no publisher bookmarks (1178 pages, zero outline entries) and multi-column TOC with non-monotonic reading order in pymupdf text extraction.
extract_toc.py’s dotted/spaced regexes don’t match its❄-leader sub-themes or tab-newline chapter heads. Per the script-vs-AI split (script does deterministic work, AI handles judgment), bookmarks.tsv for WS-II is hand-written by AI from the TOC text dump rather than auto-extracted. Future similar UPF/IRF anthologies may justify extendingextract_toc.py. - (−) Original-tradition resource classes are now an explicit backlog item. Each cite-via-WS-II atomic is a future migration. Acceptable: migrations are mechanical (filename rewrite + frontmatter
sources:update) and can be scripted when the volume warrants.