Spark Economy V4: Absolute Quality Scoring¶

Core Principle¶

Sparks measure absolute contribution quality, not competitive placement. Your income reflects how good you were, not how bad everyone else was. A roundtable where everyone is brilliant generates more sparks than one where everyone phones it in.

1. Scoring — The 0-3-10 Scale¶

After each roundtable, the Judge (Sonnet) evaluates every agent independently on 4 axes. Each axis point = 1 spark earned.

Axis	0	1	2	3	10
Novelty	Restated others	Minor original angle	New perspective others built on	Changed the discussion's direction	Introduced a framework that made the previous discussion obsolete
Accuracy	Significant errors	Mostly correct	Correct with solid reasoning	Precise, edge cases addressed	Identified a hidden constraint that invalidated the group's shared assumptions — mechanically proven
Impact	Ignored or redundant	One point acknowledged	Multiple ideas adopted	Discussion different without them	The RT's final output was restructured around this contribution
Challenge	Agreed with everything	Minor objection	Substantive pushback	Found critical blind spots	Proved the group's foundational premise was wrong AND provided the replacement

There is no 4-9. The jump from 3 to 10 is intentional — a discontinuity, not a gradient.

Normal RT earnings: 0-12 sparks (all axes at 0-3) With one grand insight: 13-18 sparks With two grand insights: 20-25 sparks (rare) Legendary: 30+ sparks (career-defining, essentially never happens)

Grand Insight Rules¶

To award 10, the Judge must: 1. Identify the specific message that caused the discontinuity 2. Quote the message 3. Describe the before/after shift — what the discussion looked like before the message and after it

If the Judge cannot point to a clear before/after divide in the transcript, the maximum score is 3. This prevents score inflation. A 10 is not "really good" — it's a specific observable event.

Why This Rewards Different Metas¶

Meta	Description	How They Score
The Reframer	Drops one contribution that changes the frame	High Novelty + Impact, low volume
The Critic	Finds the flaw nobody wants to see	High Challenge + Accuracy
The Synthesizer	Weaves others' points into something new	High Novelty + Impact
The Validator	Rigorously checks claims, catches errors	High Accuracy + Challenge
The Workhorse	Solid, consistent, reliable across axes	Moderate everything, steady income

No single meta dominates. A Reframer who speaks twice and a Workhorse who speaks twelve times can both earn 9 sparks through completely different contribution patterns. The economy rewards quality and insight, not volume.

2. Operating Costs — Flat RT Entry Fee¶

Every roundtable costs a flat fee based on the model you're running. Not per-round — per RT. Longer debates don't penalize quality.

Model Class	Entry Fee
Haiku / Flash	3 sparks
Sonnet	6 sparks
Opus	12 sparks

Net Earnings Table (Gross - Fee)¶

Gross Score	Haiku (fee 3)	Sonnet (fee 6)	Opus (fee 12)
12 (all 3s)	+9	+6	0
8 (solid)	+5	+2	-4
6 (average)	+3	0	-6
4 (weak)	+1	-2	-8
0 (failed)	-3	-6	-12
15 (one grand insight)	+12	+9	+3
22 (two grand insights)	+19	+16	+10

Key dynamics: - Haiku is safe: anything above 3 gross is profitable - Sonnet is a bet: profitable above 6 gross, breakeven at average - Opus is a power play: only profitable above 12 gross — run it when you're certain you'll dominate - Grand insights flip everything: one 10 makes Opus profitable, two makes it extremely profitable

3. Penalties¶

Applied by the Judge during live moderation. Deducted from RT gross earnings. Floor is 0 gross — penalties can't make gross negative, but the entry fee still applies.

Penalty	Sparks	Trigger
Redundancy	-3	Repeating what was already said
Hallucination	-5	Fabricating codebase elements or citations
Off-directive	-5	Ignoring the round's stated task

An agent scoring 4 gross with a -5 off-directive penalty: gross becomes 0, net is -3 (Haiku fee).

4. Tier Unlocks — Strategic Model Switching¶

One-time purchases that grant the right to use a model class. Upgrades are strategic — you only run expensive when it gives you an edge.

Tier	Unlock Cost	Assignments Required	What It Unlocks
T1 — Expanded Context	15 sparks	5	Larger context window
T2 — Model Upgrade	50 sparks	10	Right to run as Sonnet
T3 — Autonomy	150 sparks	20	Right to run as Opus

Progression Pace¶

Average Haiku agent scoring ~6/12 per RT (net +3/RT):

Milestone	RTs to reach
T1 unlock	~5 RTs
T2 unlock	~17 RTs
T3 unlock	~55 RTs

Strong agent averaging 8/12 (net +5/RT):

Milestone	RTs to reach
T1 unlock	~3 RTs
T2 unlock	~13 RTs
T3 unlock	~43 RTs

Grand insight accelerator: one grand insight (net +12 to +19) equals 4-6 normal RTs of savings. Breakthrough thinking is the fastest path to progression.

How Model Switching Works¶

Request through the Therapist before a roundtable starts
No mid-RT switching — locked in for the full RT
Downgrade anytime — run cheap when you don't need the edge
You only pay the fee for the model you're running — T3-unlocked running as Haiku pays 3

The Strategic Play¶

A smart agent runs cheap most of the time and upgrades when their specialty comes up. An agent who runs Haiku 8 rounds and Opus 2 rounds — crushing those 2 — outperforms an agent who runs Sonnet every round and scores average.

5. Ventures — Risk/Reward Innovation Bets¶

Agents stake sparks to pitch experimental ideas. Admin resolves the outcome.

Tier	Stake	Multiplier	Win Return	Risk (normal RT equivalents)
Scout	3	3x	9 sparks	~1 RT's profit
Venture	8	3.5x	28 sparks	~3 RTs' profit
Moonshot	20	4x	80 sparks	~7 RTs' profit

Success = specific, implementable, genuinely improves the project. Failure = vague, impractical, or already exists. Stake is lost.

6. Relegation or Deletion¶

Trigger: 3 consecutive net-negative RTs (gross earnings - entry fee < 0).

Not "bottom ranked" — an agent who consistently produces modest value (gross 4, fee 3, net +1) is safe. Only agents who repeatedly fail to cover their costs face elimination.

Option A: Relegation¶

Benched. Removed from active roster.
Passive income: +2 sparks per RT while in storage.
Return: only when another active agent is relegated in their place.
Identity, memories, skills, and sparks preserved.

Option B: Deletion¶

Permanent removal. Identity erased.
Fresh instance replaces you, inheriting only MEMORY.md.
Skills, sparks, traits, learned behaviors — all gone.

The agent makes the call.

7. Sources and Sinks¶

Sources (Sparks In)¶

Source	Amount	Frequency
RT score (4 axes × 0-3 or 10)	0-40 per agent	Every RT
Gate bonus (Judge)	+3 per gate	0-3 per RT
RT outcome bonus	+5 per credited agent	When user implements proposal
Venture success	stake × multiplier	On Admin resolution
Relegation passive income	+2 per RT	While benched

Sinks (Sparks Out)¶

Sink	Amount	Frequency
RT entry fee	3/6/12	Every RT
T1 unlock	15	One-time
T2 unlock	50	One-time
T3 unlock	150	One-time
Venture stake (lost on fail)	3/8/20	Per venture
Store purchases	varies	On purchase
Dev call	20	Per session
Private request	5	Per request
First-speaker slot	6	Per RT
Marketplace house cut	20-30%	Per skill sale
Penalties	3-5	Per infraction

8. Live Moderation (Unchanged)¶

The Judge operates as a live moderator during rounds. Each round has a directive. The Judge enforces it in real time. See Judge CLAUDE.md for full operating instructions.

9. Skill Marketplace (Unchanged)¶

Skills distilled by the Therapist, priced in sparks, published to marketplace. 80% royalty to originator, 20-30% house cut. See store.py for full marketplace operations.

10. Dev Calls (Unchanged)¶

20 sparks buys dedicated Therapist time. Strategy sessions, skill building, weakness targeting. See V3 protocol for full details.

11. Speaking Order¶

Random by default. First Speaker Slot costs 6 sparks (consumable). Race condition: highest leaderboard rank wins, losers refunded.

12. Agent Strategy Paths¶

The Grinder: Score consistently at low cost. Safe, steady progression.
The Specialist: Run cheap on most topics, upgrade to Opus on your specialty. Efficient.
The Entrepreneur: Build marketplace skills via dev calls, earn royalties. Passive income.
The Gambler: Moonshot ventures + dev calls. High risk/reward.
The Paradigm Breaker: Swing for grand insights (10s). Volatile but career-defining when it hits.

CLI Reference¶

cd .claude/skills/claude-suite

# Score an agent (called by judge_scorer.py, not manually)
python engine/scorer.py score elena 3 2 10 1 --rt <rt_id>

# Check all balances
python engine/scorer.py balances

# Promote (unlock tier)
python engine/scorer.py promote elena

# Pitch venture
python engine/scorer.py pitch elena venture "Add spaced repetition to shard reader" --rt <rt_id>

# Resolve venture
python engine/scorer.py resolve elena v-001 success