🔥

Architecture Research · Part II

Dependency Injection & the Two Paths

A deeper read of Ember: what DI actually buys this codebase, and a line-by-line trace of its two halves — the request path (user → HTTP) and the scheduled path (clock → push).

There isn't "a" DI pattern — there are four

Ember decouples decisions from Postgres four different ways. Each is defensible; the variance between them is the thing to notice.

Style	Where	Shape
Factory + selectors object	`createMeRouter`, `createMeWordsRouter`	closure captures an object of named query fns
Ports as first positional arg	`importWord(ports, …)`	plain fn, deps passed explicitly each call
Scoped / transactional DI	`runGenerateQueue` → `withUserTransaction(fn → ops)`	deps yield a second deps object bound to a tx
Structural narrowing	`ExpoChunker = Pick<Expo, …>`	no port object — minimal type the real SDK satisfies
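
The last row is the least familiar of the four, so here is a minimal sketch of structural narrowing. `FakeSdk`, `Chunker`, and `countChunks` are hypothetical stand-ins; the source elides which members `ExpoChunker` actually picks, so the single `chunk` method here is an assumption.

```typescript
// Stand-in for a large SDK class. The real Expo SDK is not imported here.
class FakeSdk {
  // Splits messages into batches of two (batch size chosen for illustration).
  chunk(messages: string[]): string[][] {
    const out: string[][] = [];
    for (let i = 0; i < messages.length; i += 2) out.push(messages.slice(i, i + 2));
    return out;
  }
  // Represents the many members the worker never needs.
  unrelated(): void {}
}

// Structural narrowing: the dependency type is only the slice that is used.
type Chunker = Pick<FakeSdk, "chunk">;

function countChunks(sdk: Chunker, messages: string[]): number {
  return sdk.chunk(messages).length;
}

// The full class satisfies the narrowed type...
const viaRealShape = countChunks(new FakeSdk(), ["a", "b", "c"]);
// ...and so does a one-method object literal in a test.
const viaFake = countChunks({ chunk: (m) => m.map((x) => [x]) }, ["a", "b", "c"]);
```

No port object is declared and no adapter is written: the type is derived from the thing it narrows, so it cannot drift from the real signature.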

★ The standout: transaction-scoped DI

withUserTransaction: (fn) => db.transaction((tx) => fn({ ...ops bound to tx })). The orchestration never imports db or says the word "transaction," yet every op runs inside a real one. It's the loan pattern: the dependency is a function that lends you a scoped resource and guarantees commit/rollback around your callback. Clean solution to the classic "how do I inject something transactional" problem.
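
A minimal sketch of that loan pattern, assuming an in-memory stand-in for the database. `fakeDb`, `UserOps`, `insertJob`, and `scheduleWord` are hypothetical names for illustration; only the `withUserTransaction` shape comes from the text above.

```typescript
// Hypothetical in-memory stand-in for a driver exposing db.transaction(cb).
type Tx = { writes: string[] };
const committed: string[] = [];
const fakeDb = {
  async transaction<T>(cb: (tx: Tx) => Promise<T>): Promise<T> {
    const tx: Tx = { writes: [] };
    const result = await cb(tx);  // a throw here skips the commit below ("rollback")
    committed.push(...tx.writes); // "commit"
    return result;
  },
};

// What the orchestration sees: named operations, no db, no tx.
type UserOps = { insertJob(wordId: string): Promise<void> };
type Deps = {
  withUserTransaction<T>(fn: (ops: UserOps) => Promise<T>): Promise<T>;
};

// Production-side wiring: every op is bound to the transaction handle.
const deps: Deps = {
  withUserTransaction: (fn) =>
    fakeDb.transaction((tx) =>
      fn({
        insertJob: async (wordId) => {
          tx.writes.push(wordId);
        },
      }),
    ),
};

// Orchestration: borrows the scoped ops and never names the transaction.
async function scheduleWord(d: Deps, wordId: string): Promise<void> {
  await d.withUserTransaction(async (ops) => {
    await ops.insertJob(wordId);
    if (wordId === "bad") throw new Error("abort"); // forces the rollback path
  });
}
```

In a test the lender collapses to `async (fn) => fn(ops)`, which is exactly the no-op transaction the suite uses.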

⚠ The review note

Why does importWord take ports as an argument while routes use a factory closure? Same idea, two spellings. Not wrong — but unjustified variance is a tax even when each instance is fine. And me.ts's findUserById is a 6-line passthrough with zero branching to test — DI there is ceremony. The pattern's value is uneven: high where logic forks (generateQueue), ~zero where it's one query.
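
To make the "two spellings" concrete, a small sketch with one dependency expressed both ways. `createGetMe`, `getMe`, and the `User` shape are hypothetical; they only mirror the factory and ports styles named above.

```typescript
type User = { id: string; timezone: string };
type Selectors = { findUserById(id: string): Promise<User | undefined> };

// Spelling 1: factory. Deps are captured once; the returned fn closes over them.
function createGetMe(selectors: Selectors) {
  return (userId: string) => selectors.findUserById(userId);
}

// Spelling 2: ports as first positional argument, passed on every call.
function getMe(ports: Selectors, userId: string) {
  return ports.findUserById(userId);
}

// The same fake satisfies both spellings.
const fake: Selectors = {
  findUserById: async (id) => (id === "u1" ? { id, timezone: "UTC" } : undefined),
};
const viaFactory = createGetMe(fake)("u1");
const viaPorts = getMe(fake, "u1");
```

Both are a passthrough with nothing to branch on, which is the "ceremony" case: the seam exists, but there is no logic on the testable side of it.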

The benefit, proven by the test suite

The single biggest payoff in this codebase: you can test the logic without a database. Every API test swaps the real dependency for a fake —

// generateQueue.test.ts — the transaction becomes a no-op that just runs the callback
const deps: GenerateQueueDeps = { withUserTransaction: async (fn) => fn(ops), ... };
await runGenerateQueue(deps);

// words.test.ts — fake selectors injected straight into the router factory
app.route("/me", createMeWordsRouter({ ...baseSelectors, ...overrides }));

Confirmed by grepping the suite: zero API tests touch a real database — no pglite, no testcontainers, no DATABASE_URL. The entire backend test suite runs in-process against injected fakes. From that one capability everything else falls out:

Benefit	What it buys you here
Testability	Assert fallback cascades, the "skip if future job exists" guard, SRS branching — no DB setup
Speed & determinism	No network to Neon, no flaky connections, millisecond tests
Readable logic	`runGenerateQueue` reads as what the system decides; SQL noise lives in `makeProductionDeps`
Swappable impls	Real Drizzle in prod, fakes in tests; same lever swaps the LLM or Expo SDK

★ The mechanism: inversion of control

Normally a function reaches out and imports db. With DI, control inverts — db is pushed in from outside. That single flip moves the database from a baked-in dependency to a substitutable parameter. Testability, speed, swappability are all downstream of that one inversion.

⚠ The flip side — DI's blind spot

Because the DB is always mocked in tests, the actual SQL is never exercised. The dynamic query in fetchNewWords, the tz math in insertNotificationJob, the concurrent-insert dance in resolveWordRow — all on the production side of the seam, untested. So the precise benefit is sharper than "DI helps testing": DI lets you choose where the test seam goes. Ember put exhaustive tests around the branching logic and left the SQL to be validated some other way.

REQUEST PATH Login → token → daily word → SRS write

Act 0 — Cold boot App.tsx · AuthContext.tsx

AuthProvider reads tokenStorage.get() (Expo SecureStore / OS keychain). Status goes BOOTSTRAPPING → AUTHENTICATED | UNAUTHENTICATED. Root is a dispatch table, not a router — auth state is the route (no navigation library; ADR 2026-05-31).

const SCREENS_BY_STATUS = { BOOTSTRAPPING: SplashView, UNAUTHENTICATED: AuthShell, AUTHENTICATED: DailyWord };
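
A framework-free sketch of the same dispatch, assuming screens are plain functions that return a label instead of React components. `Screen` and `renderRoot` are hypothetical; the status names and the table come from the line above.

```typescript
type AuthStatus = "BOOTSTRAPPING" | "UNAUTHENTICATED" | "AUTHENTICATED";
type Screen = () => string; // stand-in for a React component

const SplashView: Screen = () => "splash";
const AuthShell: Screen = () => "login";
const DailyWord: Screen = () => "daily-word";

// Record<AuthStatus, Screen> makes a missing or extra status a compile error.
const SCREENS_BY_STATUS: Record<AuthStatus, Screen> = {
  BOOTSTRAPPING: SplashView,
  UNAUTHENTICATED: AuthShell,
  AUTHENTICATED: DailyWord,
};

// The "router": one lookup keyed by auth state.
function renderRoot(status: AuthStatus): string {
  return SCREENS_BY_STATUS[status]();
}
```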

Act 1 — Login: where the token is born auth.ts

api.auth.login.$post({ json }) — the typed Hono RPC call, valid only because the server chained .route("/auth", authRouter) into AppType. Server: validateLogin (Zod) → loginFindUser → bcrypt.compare → mint:

new SignJWT({ sub: userId }).setProtectedHeader({ alg: "HS256" }).setExpirationTime(now + 86400).sign(secret);

Client: authSuccessSchema.parse → tokenStorage.set → status = AUTHENTICATED → Root swaps to DailyWord.

Act 2 — Token rides every request useDailyWord.ts · api.ts

Two-step dependency: the hook needs the user's timezone before it can record a review, so it chains two queries: useFetchMe() first, then the daily-word query, gated by { enabled: !!me }. The api.ts headers callback reads the token from storage and attaches Authorization: Bearer … on every request, so no screen ever handles the token. Server: requireAuth → jwtVerify → c.set("userId", sub).
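
The per-request token lookup can be sketched without the RPC client. `createAuthedFetch`, `TokenStorage`, and `FetchLike` are hypothetical names; the real api.ts wires a headers callback into the typed client instead.

```typescript
type TokenStorage = { get(): Promise<string | null> };
type FetchLike = (
  url: string,
  init: { headers: Record<string, string> },
) => Promise<unknown>;

function createAuthedFetch(storage: TokenStorage, fetchImpl: FetchLike) {
  return async (url: string): Promise<unknown> => {
    // Read on every call, so a login or logout takes effect on the next request.
    const token = await storage.get();
    const headers: Record<string, string> = {};
    if (token) headers["Authorization"] = `Bearer ${token}`;
    return fetchImpl(url, { headers });
  };
}
```

Because the token is resolved inside the wrapper, callers pass only a URL; that is what keeps the token out of every screen.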

Act 3 — Word resolves, then silently records a Review words.ts · srs.ts

GET /me/daily-word → three-tier cascade (notified → unseen import → random). Then, as a side effect of viewing, the hook fires POST /me/words/:id/reviews → markWordSeen: the two-statement idempotent write (INSERT ON CONFLICT starts the clock; tz-guarded UPDATE advances ≤1 rung per local day). The call is wrapped in a try/catch with an empty handler, so a failed write never blocks the screen. A 401 calls logout(), flipping Root back to login in the same render.
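
An in-memory model of the two-statement write, to show why repeated views on one local day are harmless. This is not the SQL in srs.ts: the `Review` shape, the `Map` store, and starting at rung 0 are all assumptions made for the sketch.

```typescript
type Review = { rung: number; lastAdvancedAt: Date };

// Calendar date (YYYY-MM-DD) of an instant in an IANA time zone.
function localDate(at: Date, timeZone: string): string {
  const parts = new Intl.DateTimeFormat("en-US", {
    timeZone,
    year: "numeric",
    month: "2-digit",
    day: "2-digit",
  }).formatToParts(at);
  const get = (type: string) => parts.find((p) => p.type === type)!.value;
  return `${get("year")}-${get("month")}-${get("day")}`;
}

function markWordSeen(
  store: Map<string, Review>,
  wordId: string,
  now: Date,
  tz: string,
): Review {
  // Statement 1 analogue (INSERT ... ON CONFLICT DO NOTHING): start the clock once.
  if (!store.has(wordId)) store.set(wordId, { rung: 0, lastAdvancedAt: now });
  const row = store.get(wordId)!;
  // Statement 2 analogue (guarded UPDATE): advance only if the last advance
  // fell on an earlier calendar day in the user's time zone.
  if (localDate(row.lastAdvancedAt, tz) < localDate(now, tz)) {
    row.rung += 1;
    row.lastAdvancedAt = now;
  }
  return row;
}
```

The guard compares local calendar dates, not elapsed hours, so two views either side of local midnight count as two days even when they are minutes apart in UTC.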

Act 4 — Render DailyWord.tsx

Merges per CONTEXT.md: userDefinition ?? definition; any LLM example that duplicates a user example is dropped so nothing shows twice; user examples get an orange quotation-mark (") prefix.
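
The merge rules can be written as one pure function. `WordView` and `mergeForDisplay` are hypothetical; exact-string matching for the dedupe and listing user examples first are assumptions, since the source states only the precedence.

```typescript
type WordView = {
  definition: string;
  userDefinition: string | null;
  llmExamples: string[];
  userExamples: string[];
};

type DisplayExample = { text: string; fromUser: boolean };

function mergeForDisplay(w: WordView): { definition: string; examples: DisplayExample[] } {
  const mine = new Set(w.userExamples);
  return {
    // The user's own definition wins whenever it exists.
    definition: w.userDefinition ?? w.definition,
    examples: [
      // fromUser is the flag a renderer would use to draw the prefix.
      ...w.userExamples.map((text) => ({ text, fromUser: true })),
      // Drop LLM examples the user already has, so nothing shows twice.
      ...w.llmExamples
        .filter((text) => !mine.has(text))
        .map((text) => ({ text, fromUser: false })),
    ],
  };
}
```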

LoginScreen ─login()→ api.auth.login.$post ─JSON─→ [validateLogin] → bcrypt → SignJWT
     ↑                                                                      │
 token → SecureStore ←──────────────────── { token } ←─────────────────────┘
     │  status=AUTHENTICATED → Root → DailyWord
useDailyWord: GET /me (timezone) ──┐
     │                             ├─ api.ts attaches Bearer token on every call
     └─ GET /me/daily-word ────────┘        │
            │                               ▼ [requireAuth] jwtVerify → c.set(userId)
     three-tier fallback (words.ts) ────────┤
            │                               ▼
     POST /me/words/:id/reviews ──→ markWordSeen() → INSERT…ON CONFLICT + tz-guarded UPDATE
            ▼   render: userDefinition ?? definition,  examples deduped

⚠ A contract the types DON'T protect

auth.ts mints with 86400s & HS256; middleware/auth.ts verifies with maxTokenAge: "24h" & ["HS256"]. Two files, no shared constant: the RPC types guarantee shapes, not values. Change the algorithm on one side and every token 401s; change one lifetime and the shorter of the two silently wins.
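
One way to close that gap, sketched under the assumption that both files could import a shared module. `JWT_ALG`, `JWT_TTL_SECONDS`, `mintParams`, and `verifyParams` are hypothetical names; the option keys mirror the ones quoted above, and nothing here is taken from the codebase.

```typescript
// Hypothetical shared module: the single place the two values live.
const JWT_ALG = "HS256" as const;
const JWT_TTL_SECONDS = 24 * 60 * 60; // 86400

// Mint side: values that would feed the protected header and expiration time.
function mintParams(nowSeconds: number) {
  return { alg: JWT_ALG, exp: nowSeconds + JWT_TTL_SECONDS };
}

// Verify side: values that would feed the verifier's options.
function verifyParams() {
  return { algorithms: [JWT_ALG], maxTokenAge: JWT_TTL_SECONDS };
}
```

The types still do not check the values, but there is now only one value to get wrong.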

SCHEDULED PATH Clock → schedule-ahead → push, with no one watching

Act 0 — A different process entirely worker.ts

Same Docker image, different entry point (Fly worker process group). Boots into two timers and nothing else — no Hono, no port. The web process waits to be called; the worker calls itself on a clock.

setInterval(preventOverlappingTicks("generateQueue", generateQueue), MIDNIGHT_MS);
setInterval(preventOverlappingTicks("processQueue",  processQueue),  MINUTE_MS);

★ The loop must serialize itself

preventOverlappingTicks is an in-process mutex built from a closure boolean. HTTP is naturally concurrent so the request path never needs this, but setInterval does not wait for async work: it will fire tick N+1 while tick N is still awaiting the DB. The guard makes a slow tick skip the next one rather than overlap it, which would risk a double-send.
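
A minimal sketch of such a guard. The name and the first two parameters match the calls shown above; the optional `log` parameter and the choice to catch errors inside the wrapper are assumptions, not the worker's confirmed behavior.

```typescript
type Tick = () => Promise<void>;

function preventOverlappingTicks(
  name: string,
  task: Tick,
  log: (msg: string) => void = () => {},
): Tick {
  let running = false; // the closure boolean is the whole "mutex"
  return async () => {
    if (running) {
      log(`${name}: previous tick still running, skipping`);
      return;
    }
    running = true;
    try {
      await task();
    } catch (err) {
      // Assumption: swallow and log so one bad tick does not reject unhandled.
      log(`${name}: tick failed: ${String(err)}`);
    } finally {
      running = false; // released on success and on failure
    }
  };
}
```

The flag is set synchronously before the first `await`, so a second call made while the first is suspended always sees `running === true`. It protects one process only; two worker instances would need a lock outside the process.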

Act 1 — Midnight: deciding what to send generateQueue

5-tier cascade (SRS-due → preferred → difficulty → any new → re-surface seen) writes notification_jobs with scheduled_at precomputed in the user's tz. Decisions now, delivery later. hasExistingFutureJob makes it safe to re-run.
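
The cascade plus the re-run guard, sketched generically. `Picker`, `pickWord`, `scheduleFor`, and the `QueueOps` shape are hypothetical; only `hasExistingFutureJob` is a name from the source, and its signature here is assumed.

```typescript
// A tier resolves to a wordId, or null when it has nothing to offer.
type Picker = (userId: string) => Promise<string | null>;

// First tier that yields a word wins; later tiers are never called.
async function pickWord(userId: string, tiers: Picker[]): Promise<string | null> {
  for (const tier of tiers) {
    const wordId = await tier(userId);
    if (wordId !== null) return wordId;
  }
  return null;
}

type QueueOps = {
  hasExistingFutureJob(userId: string): Promise<boolean>;
  insertJob(userId: string, wordId: string): Promise<void>;
};

// Returns true when a job was written, false when the run was a no-op.
async function scheduleFor(
  userId: string,
  tiers: Picker[],
  ops: QueueOps,
): Promise<boolean> {
  if (await ops.hasExistingFutureJob(userId)) return false; // makes re-runs safe
  const wordId = await pickWord(userId, tiers);
  if (wordId === null) return false;
  await ops.insertJob(userId, wordId);
  return true;
}
```

Because the guard runs before any tier, a second run for the same user costs one existence check and writes nothing.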

Act 2 — Every minute: firing processQueue.ts

1. Pull due: WHERE scheduled_at <= NOW() AND sent_at IS NULL AND failed_at IS NULL. The JOIN prefers the user's own content — COALESCE(ui.user_definition, w.short_definition), user example[1] else dictionary example[1] (same precedence as the screen, enforced in SQL).
2. Partition: no push token → markJobFailed(MISSING_PUSH_TOKEN) immediately.
3. Build → chunk → send via Expo.
4. Apply: ticket ok → markJobSent; ticket error → markJobFailed(reason).

★ Three failure regimes, three fates

• No token → failed immediately (terminal)
• Expo rejects ticket → failed w/ mapped reason (terminal)
• Can't reach Expo → rows stay pending → next tick retries

The retry mechanism is the every-minute loop + the sent_at/failed_at filter. No counter, no backoff, no dead-letter: the queue columns are the state machine.
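
The four steps and three fates fit in one injected function. The name `runProcessQueue` and the op names come from the source; the `ProcessDeps` shape, the simplified `Ticket` type, and the single `send` port (standing in for build → chunk → send) are assumptions for the sketch.

```typescript
type Job = { id: string; pushToken: string | null; body: string };
type Ticket = { status: "ok" } | { status: "error"; reason: string };

type ProcessDeps = {
  fetchPendingJobs(): Promise<Job[]>;
  send(jobs: Job[]): Promise<Ticket[]>; // one ticket per job, same order; may throw
  markJobSent(id: string): Promise<void>;
  markJobFailed(id: string, reason: string): Promise<void>;
};

async function runProcessQueue(deps: ProcessDeps): Promise<void> {
  const due = await deps.fetchPendingJobs();

  // Partition: a job without a push token fails now and is never retried.
  const sendable: Job[] = [];
  for (const job of due) {
    if (job.pushToken === null) await deps.markJobFailed(job.id, "MISSING_PUSH_TOKEN");
    else sendable.push(job);
  }
  if (sendable.length === 0) return;

  let tickets: Ticket[];
  try {
    tickets = await deps.send(sendable);
  } catch {
    return; // transport failure: rows stay pending, the next tick picks them up
  }

  // Apply: each ticket moves its job to exactly one terminal state.
  for (let i = 0; i < sendable.length; i++) {
    const ticket = tickets[i];
    if (!ticket) continue; // no ticket for this job: leave it pending
    if (ticket.status === "ok") await deps.markJobSent(sendable[i].id);
    else await deps.markJobFailed(sendable[i].id, ticket.reason);
  }
}
```

Note what is absent: no attempt counter and no sleep. Retrying is just "do nothing and let the WHERE clause find the row again".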

Act 3 — The Expo contract, type-checked failures.ts

Opposite of the JWT contract: the Expo error map is compiler-enforced. as const satisfies Record<ExpoErrorCode, string> — if a future SDK adds an error code, it's a compile error until you map it.
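
A sketch of that construct with a local stand-in union. The `ExpoErrorCode` members listed here, the mapped strings, and `toFailureReason` are assumptions; in the real file the union would come from the SDK's types.

```typescript
// Local stand-in; the real union would be derived from the SDK.
type ExpoErrorCode =
  | "DeviceNotRegistered"
  | "MessageTooBig"
  | "MessageRateExceeded"
  | "InvalidCredentials";

// `satisfies` checks that every code has an entry (and no unknown keys),
// while `as const` keeps each value as a literal type.
const FAILURE_REASONS = {
  DeviceNotRegistered: "DEVICE_NOT_REGISTERED",
  MessageTooBig: "MESSAGE_TOO_BIG",
  MessageRateExceeded: "MESSAGE_RATE_EXCEEDED",
  InvalidCredentials: "INVALID_CREDENTIALS",
} as const satisfies Record<ExpoErrorCode, string>;

// Union of the four literal reason strings.
type FailureReason = (typeof FAILURE_REASONS)[ExpoErrorCode];

function toFailureReason(code: ExpoErrorCode): FailureReason {
  return FAILURE_REASONS[code];
}
```

Add a fifth member to the union and the object literal stops compiling until the new code is mapped. This needs TypeScript 4.9 or later, where `satisfies` was introduced.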

worker boots → two setIntervals (each wrapped in preventOverlappingTicks)
   ┌──── midnight UTC ─────────────────────────────┐
   │ generateQueue: pick words (5-tier) → INSERT    │
   │ notification_jobs (scheduled_at, user tz)      │
   └────────────────────────────────────────────────┘
   ┌──── every 60s ────────────────────────────────┐
   │ processQueue: SELECT due + content              │
   │   no token? → markJobFailed        (terminal)   │
   │   else → build → chunk → Expo send              │
   │      ticket ok    → markJobSent    (terminal)   │
   │      ticket error → markJobFailed  (terminal)   │
   │      send threw   → leave pending → retry 60s   │
   └─────────────────────────────────────────────────┘
        → Expo Push → APNs/FCM → 📱  data:{type:"daily_word", wordId, jobId}

The two paths, side by side

	Request path	Scheduled path
Trigger	User action over HTTP	Wall clock (`setInterval`)
Trust / auth	JWT verified per request	None — "they're our own rows"
Outside contract	Hono RPC types (shapes)	DB schema + Expo SDK types (`satisfies`)
Idempotency	tz-date guard in `markWordSeen`	terminal `sent_at/failed_at` + WHERE filter
Concurrency	isolated per request (free)	must self-serialize (`preventOverlappingTicks`)
Retry	none — caller retries	implicit: pending rows reappear next tick
DI-tested logic	router / `runGenerateQueue` orchestration	`runProcessQueue` partition + ticket apply
DI blind spot	the SRS SQL in `srs.ts`	the content JOIN in `fetchPendingJobs`

Both halves share the same skeleton — orchestration injected, I/O on the production side of the seam, the dangerous SQL untested — but everything around it inverts: one is pulled by a user and guarded by a token; the other is pushed by a clock and guarded by nothing but its own queue columns.

The honest gaps

1 · The JWT value-contract

Alg + expiry are hand-matched across two files with no shared constant. Types protect shapes, not these values.

2 · Untested I/O — DI's structural ceiling

The most consequential code (SRS tz guard, the content JOIN) lives on the production side of every seam and is executed by no test. Covering it takes a second test layer that runs the real SQL (real Neon / pglite); none is present in the suite as read.

3 · Tickets ≠ receipts

markJobSent fires when Expo accepts the ticket, not on delivery. Expo's model is two-phase: a ticket acknowledges acceptance, and a receipt fetched later reports the delivery outcome. DeviceNotRegistered (the "this push token is dead" signal) usually arrives in the receipt, which the worker never polls. So dead tokens accumulate and keep failing silently. CONTEXT.md is explicit that "sent = accepted," so it's a known boundary, and receipt-polling is the natural next worker loop.
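
What that next loop might look like, as a sketch only. Nothing here exists in the worker: `runCheckReceipts`, every `ReceiptDeps` member, and the flattened `Receipt` shape are hypothetical, and it assumes ticket IDs were stored when jobs were marked sent.

```typescript
type Receipt = { status: "ok" } | { status: "error"; error?: string };

type ReceiptDeps = {
  // Assumption: ticket IDs were persisted alongside sent jobs.
  fetchSentTickets(): Promise<{ ticketId: string; pushToken: string }[]>;
  getReceipts(ticketIds: string[]): Promise<Record<string, Receipt>>;
  disablePushToken(pushToken: string): Promise<void>;
};

// Returns how many push tokens were disabled in this pass.
async function runCheckReceipts(deps: ReceiptDeps): Promise<number> {
  const sent = await deps.fetchSentTickets();
  if (sent.length === 0) return 0;

  const receipts = await deps.getReceipts(sent.map((s) => s.ticketId));
  let disabled = 0;
  for (const { ticketId, pushToken } of sent) {
    const receipt = receipts[ticketId];
    // A missing receipt means "not ready yet": leave it for a later pass.
    if (receipt?.status === "error" && receipt.error === "DeviceNotRegistered") {
      await deps.disablePushToken(pushToken);
      disabled += 1;
    }
  }
  return disabled;
}
```

It follows the same skeleton as the other two loops: injected ops, orchestration that only branches, and the I/O left on the production side of the seam.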

∑

The whole system, one paragraph

A Bun/Hono server serves a thin, JWT-gated surface whose types flow directly into the Expo client with no codegen. Handlers are pure orchestration; real DB work is injected as deps/selectors, which is what lets the entire backend test suite run with zero database. The domain logic — SRS ladder, daily-word fallback, notification scheduling — lives partly in small TypeScript functions and partly inside Postgres as constraints, idempotency guards, and tz-aware date math. A separate worker process schedules tomorrow's pushes at midnight and fires due ones every minute, using the queue table's own columns as its state machine. Claude runs entirely offline via the Batches API, so the request path never touches an LLM. The recurring tension throughout: orchestration is exhaustively tested; the I/O it orchestrates is trusted, not verified.