E-16 Method Design — Dual-Track Analysis of the Curated Dataset

Status: APPROVED (2026-06-20) — building. Decisions: (1) freeze table, (2) LLM trials ok,

(3) start series_deterministic, (4) Material Palette + Medium Library + Prompt Template first,

(5) trial buttons on both pages.

Date: 2026-06-20

Author: grounded in a direct LLM read of real samples (12 Track-B prompts, 35 Track-A snapshots)

Dataset: 592 curated Emerge selections / 587 unique series

0. Why this document exists

Before we build extraction code, we fix:

1. What the data actually looks like (read, not guessed).

2. Two analysis scenarios — one for all 592 (series context), one for the 127 with prompts.

3. Where the work runs — on an isolated frozen copy, never the live tables.

4. What derivatives we produce — the recipes/templates that answer the task.

5. How we validate — trial tests before committing to a production method.

This is the artifact we approve together. Code follows approval.

1. Reading findings (real samples)

1.1 Track B — the prompt is a consistent GRAMMAR

I read 12 full DALL-E prompts (avg ~2650 chars). They are not free text — they follow a

repeatable slot structure. Every prompt opens with a medium declaration, then a palette,

then composition, materials, recurring motifs, negative constraints, and an

emotional contract. Observed slots:

#	Slot	Real examples from the 12 prompts
1	Medium / style anchor (before first colon)	"Cybernetic mycelium bloom on transparent substrate", "Pressed flower collage", "Embroidery on black silk", "Etched copperplate print", "Laser-etched technical schematic on polished obsidian", "Woven jacquard tapestry", "Blueprint of an organism in electrical schematic style", "Analytical diagram in the style of a 1960s engineering manual", "Relief engraving on anodized aluminum"
2	Palette	"sodium orange, rust-amber, burnished copper, terracotta, electric cyan, cold moonlight silver"; "arctic blue (Holbein Ice Blue)"; "cyan on navy"
3	Composition / spatial	"lower-left power-point", "off-center, upper right third", "three depth planes", "20% negative space upper left", "horizon-split", "diagonal rupture lower-left to upper-right"
4	Materials	"frosted acrylic pane", "fractured plexiglass", "weightless mercury", "enamel-coated copper micro-wires", "cast resin", "polished obsidian", "black silk"
5	Recurring conceptual motifs (this series)	palm-print relief, barcode/scan overlays, seven-segment digits, palimpsest (3 temporal states: pre-residue / live / scar), mercury rising against gravity, copper tendrils
6	Paradox device	"spatial impossibility", "perceptual paradox", "both in front of and behind the substrate", "macro photo AND satellite view at once"
7	Negative constraints	"No text, no watermarks, no real people, no UI elements", "no organic/spike/thread/network motifs"
8	Emotional contract	"ecstatic vertigo", "exhilaration of new capability", "boundary dissolving", "electric anticipation"

Key insight: in the news_pulse series the THESIS is held nearly constant

(palm/glass becomes an interface) and the MEDIUM is the primary variation axis

— mycelium bloom → pressed-flower collage → embroidery → copperplate → obsidian schematic →

jacquard tapestry → blueprint → 1960s engineering manual. So Track B is, in effect, a record of

"how Emerge re-renders one concept across many media." That is a directly reusable recipe:

*hold concept, vary medium/style, keep composition discipline + paradox + negative constraints.*

1.2 Track A — the snapshot is a structured ONTOLOGY

I read 35 snapshots. Each Snapshot.context_text is JSON with:

`artistic_statement` — series thesis (title + intent), present in 30/35

`moment_text` — poetic atmosphere, 35/35

`world_summary` — the external signals feeding the series (museum collections, news, moon

phase, seismic events, tides, market mood), 35/35

`emotional_intent[]` — explicit emotion contracts, 30/35

`visual_ontology.entities[]` — ~9 per series, each with **concept / form / material / scale /

transformation**, 35/35

Real material strings: "patinated bronze with black wax bloom", "blown glass with millefiori cane

and trapped air seeds", "stoneware with ash-run glaze and volcanic drip", "gypsum alabaster dusted

with ghost pigments", "graphite dust on ivory wove paper with pressed erasure sheen".

Key insight: Track A already contains a semi-structured material/form dictionary for the

full 592, plus the provenance (world_summary → which real-world signals seeded each series).

We can mine materials, forms, scales, concepts, and the signal→imagery mapping without any prompt.

1.3 What this means for the task

The task = turn selections into reproducible recipes across 6 categories

(Idea, Scene, Material, Composition, Palette, Config).

Materials & Idea & provenance → best from Track A (all 592, structured ontology).

Composition, Palette, Medium/Config, paradox devices, negative constraints → best from

Track B (127 prompts, parseable grammar).

The two tracks are complementary, not redundant: A = "what the series wanted",

B = "how it was rendered into a prompt".

2. Working copy — isolation from live data

2.1 Answer to "do we copy the DB?"

Today the originals are already read-only in this flow — CuratedSelection is a separate

table, and we only READ Generation / Snapshot. Nothing here writes to source tables.

But a frozen working copy is still the right call, for four reasons:

1. Reproducibility — agents keep generating; the live dataset shifts under us. A freeze pins

"the 592 as of date X" so every experiment is comparable.

2. Isolation — analysis, cleaning, and annotation never risk touching production rows.

3. Performance — pre-extracted fields (prompt slots, entities) avoid re-parsing 592 traces each run.

4. Versioning — we can freeze v1, refine the method, freeze v2, and diff.

2.2 Proposed table — `CurationDatasetSnapshot` (DESIGN ONLY)

> DB migration requires explicit approval (factory taboo-approval rule). Not created yet.

class CurationDatasetSnapshot(Base):
    __tablename__ = "curation_dataset_snapshot"
    id                = Column(Integer, primary_key=True)
    freeze_id         = Column(String, index=True)   # uuid per freeze run
    freeze_label      = Column(String)               # e.g. "emerge-2026-06-20-v1"
    track             = Column(String)               # "a" | "b"
    generation_id     = Column(String, index=True)
    snapshot_id       = Column(String, index=True)
    agent_id          = Column(String)
    created_at_src    = Column(DateTime)             # original generation time
    tier              = Column(String)               # A | B | C at freeze time
    # frozen payloads (denormalized copies — never reference live rows for analysis)
    dalle_prompt      = Column(Text)
    scene_director    = Column(Text)
    distiller_brief   = Column(Text)
    artistic_statement= Column(Text)
    moment_text       = Column(Text)
    world_summary     = Column(Text)
    emotional_intent_json = Column(Text)             # list
    visual_ontology_json  = Column(Text)             # entities[]
    # parsed Track-B slots (filled by the prompt parser)
    prompt_slots_json = Column(Text)                 # {medium, palette, composition, materials, motifs, constraints, emotion}
    image_path        = Column(Text)
    frozen_at         = Column(DateTime, default=utcnow)

2.3 Freeze process

POST /api/curation/dataset/freeze {track, label}
  1. resolve approved policy subset (Track A = 592, Track B = 127)
  2. for each generation: read live Generation + Snapshot ONCE
  3. parse Track-B prompt into slots (parser in §4.1)
  4. write rows into curation_dataset_snapshot with a new freeze_id
  5. return {freeze_id, count}
All extraction/experiments then read ONLY from the frozen snapshot, never live tables.

3. Track A method — Series Context Analysis (all 592)

Input: frozen Track-A rows (snapshot_context).

Goal: recipes for Idea, Material, and provenance (signal→imagery).

Pipeline:

1. Entity mining — flatten all visual_ontology.entities; group materials, forms, scales.

Output: ranked material vocabulary, form vocabulary, scale distribution.

2. Concept mining — cluster artistic_statement + concept fields into thematic families.

3. Emotion mining — frequency of emotional_intent phrases; map emotions → materials/forms.

4. Provenance mining — parse world_summary into signal types (museum, news, astronomical,

seismic, market) and correlate which signals precede which materials/concepts.

5. Derivatives (§5).

Methods:

`series_deterministic` (free) — frequency + co-occurrence over entities & lexicon.

`series_embedding` (low cost, optional) — cluster statements/materials semantically.

`series_llm` (M3, medium cost) — LLM synthesizes recipe cards from stats + samples.

4. Track B method — Prompt Grammar Analysis (127)

Input: frozen Track-B rows (dalle_prompt + scene_director + distiller).

Goal: recipes for Composition, Palette, Medium/Config, paradox devices, negative constraints.

4.1 Prompt parser (the core new component)

Deterministic segmentation into the 8 slots from §1.1:

medium = text before the first colon of sentence 1.

palette = color terms (lexicon + regex on the clause after medium).

composition = matches on a composition lexicon (power-point, off-center, depth planes,

negative space, horizon-split, diagonal, rule of thirds…).

materials = material lexicon (acrylic, plexiglass, mercury, copper, resin, obsidian, silk…).

motifs = recurring nouns (palm, barcode, seven-segment, palimpsest, mercury, tendril…).

constraints = the "No …" clauses near the end.

emotion = emotion lexicon (vertigo, exhilaration, anticipation, boundary dissolving…).

Parser writes prompt_slots_json on the frozen row (computed once at freeze).

4.2 Analysis

1. Slot frequency — top mediums, palettes, composition devices, materials, constraints.

2. Co-occurrence — which medium pairs with which palette/composition (e.g. "obsidian schematic"

→ "arctic blue", "negative space"). This is the high-value signal.

3. Variation-axis detection — per series, which slot varies while others are held constant

(we already see MEDIUM is the variation axis in news_pulse).

4. Derivatives (§5).

Methods: deterministic (free), embedding (cluster prompts), llm (M3 synthesis).

5. Derivatives — what we actually hand back

Beyond raw frequency cards, the task wants reusable artifacts:

Derivative	Track	Form	Use
Material Palette	A	Ranked dictionary of material phrases + example series	Seed `visual_ontology.material` for new series
Concept Families	A	Clusters of theses with representative statements	Series backlog ideation
Signal→Imagery map	A	"earthquake/museum/moon → these materials/forms"	Explain how world_summary drives imagery
Medium Library	B	Ranked medium anchors + when they scored well	Drop-in style anchors for prompts
Composition Playbook	B	Recurring spatial devices + phrasing	Scene Director directive presets
Palette Recipes	B	Color combos that recur (split warm/cold, etc.)	Prompt palette presets
Constraint Set	B	The standard "No …" negative clauses	Reusable prompt suffix
Prompt Template	B	Slot skeleton: `[medium]: [palette]. [composition]. [materials]. [motifs]. [paradox]. [constraints]. [emotion].`	One-click new-prompt scaffold

These are exported as JSON + rendered as Insight Cards, and (later) can feed Scene Director presets.

6. Trial test plan (before production)

Run small, cheap, real tests to build confidence — no full commit yet:

1. Track A deterministic on a 50-series sample → eyeball material/concept rankings.

2. Track B parser on the 12 already-fetched prompts → verify slot extraction accuracy

(manual check: did "medium" capture the style anchor every time?).

3. Track B deterministic on all 127 → top mediums/palettes/compositions.

4. One LLM run per track on ~15 samples → judge recipe quality vs deterministic.

5. Scorecard compare (existing harness) → pick production method per track.

Acceptance: parser ≥ 90% correct medium-slot on manual check; LLM cards rated more actionable

than deterministic on ≥ 4/6 categories; no live-table writes.

7. Open decisions (need your approval)

1. Freeze table — approve creating curation_dataset_snapshot (DB migration)?

Alternative: freeze to a JSON file under memory/ (no schema change). Recommend: table.

2. LLM budget — allow M3 (Claude) trial runs on ~15 samples/track (~low $)? Or deterministic-only first?

3. Track A method — start with series_deterministic only, add series_llm after trial?

4. Derivatives priority — which of the 8 derivatives in §5 do we build first?

(Recommend: Material Palette + Medium Library + Prompt Template.)

5. Page surface — keep this as /curation/method-design (read-only doc) + run trials from

/curation/experiments, or add trial buttons directly on the design page?

8. Plain-language glossary

	Field	Meaning
	medium anchor	The art technique the prompt imitates (embroidery, copperplate, blueprint…)
	palette	The explicit color list
	composition	Where things sit + depth + negative space
	material	What things are "made of" (mercury, plexiglass, bronze…)
	motif	Recurring objects (palm, barcode, digits, palimpsest)
	constraint	The "No text / no people / no UI" rules
	emotional contract	The feeling the image must deliver
	world_summary	Real-world signals that seeded the series
	visual_ontology	Structured entity list (concept/form/material/scale)

Method Design

Trial & Freeze (live)