Public Execution Evidence

Execution Shifted From Steady Delivery To High-Volume Productization

4,351 total contributions
3,755 commit contributions
74 pull requests
21 PR reviews
1,149 PR changed files
72 Huly MCP npm versions

Growth Arc

From Reflector Work To MCP Productization And Formal Methods

2025-04 partial
13

Baseline activity before the main report window ramps.

2025-05
142

Client/project work and article/repo activity; resume data points to Cubbi and early AI-agent automation work.

2025-06
167

Monadical site/blog work, Reflector consent/database work, and public writing around keeping control of AI-assisted code.

2025-07
69

Reflector work around diarization, room concurrency, migrations, and LLM backend experiments.

2025-08
50

Reflector peak PR month: search backend/frontend, transcription batching, Parakeet, PostgreSQL test migration, and evaluation fixes.

2025-09
23

Private/client-heavy period; public evidence is lighter but local history shows continued TypeScript/project work.

2025-10
12

Private/client-heavy period; resume repository list shows Effect presentation and LLM evaluation repositories around this period.

2025-11
17

Reflector/transcription-eval timestamps suggest continued AI/transcription evaluation work.

2025-12
22

Public evidence limited; a fuller private report should add calendar, Huly, or client work logs.

2026-01
55

Re-ramp before the Huly/D&D surge.

2026-02
649

@firfi/huly-mcp created on npm, rapid package publishing begins, and the awesome-mcp-servers PR opens.

2026-03
1,437

Open-source bug-hunting burst across Actual, Outline, RxDB, Crawlee, Tiptap, PeerJS, n8n, TON, Activepieces, and Quint.

2026-04 partial
1,695

Dominant activity in 5e-quint, huly-mcp, D&D formalization, Codex-heavy workflows, PRD planning, and test-review loops.

Engineering Operating System

Explicit Models, Written Plans, Mechanical Checks

Turns ambiguous domains into explicit, inspectable, mechanically checked systems.

Uses writing, PRDs, plans, audits, and talks as part of the engineering process.

Builds quality harnesses before scaling AI-assisted work: strict checks, coverage thresholds, integration tests, and repeatable review rubrics.

Past-year growth concentrates around MCP productization, tabletop-rule formalization, open-source reliability fixes, and agent orchestration.

Requirements before implementation

Major work starts as scoped product requirements: initial product surface, lazy loading, integrations, and rule-system expansion.

Mechanical quality gates

Every change is expected to pass compilation, type checks, linting, duplicate-code checks, tests, and release verification.

Coverage and integration discipline

The main MCP server uses 99% coverage thresholds and real end-to-end lifecycle tests against Huly.

Explicit review rubric

Rules for dead code, casts, boundary typing, state-space minimality, connascence, immutability.

File-backed agent execution

Long agent runs are managed with fresh sessions, persisted task state, learned guardrails, quality gates, and git commits.

Agent Harness

Agent Usage Moved From Chat Assistance To Managed Execution

349 parallel agent executions logged in April
106 Huly MCP tool calls covered by the full integration suite
99% coverage thresholds in the Huly MCP Vitest harness
2,183 Codex sessions captured in April local logs

Reference TypeScript/Effect harness

Huly MCP explicitly names its quality harness as the template to copy into new TypeScript/Effect projects: Effect services, Context, Layers, schema decoding, and tagged errors.

Source: Huly MCP project instructions and core service modules

Build/type/lint/test gate

Completion runs through build, TypeScript, ESLint, duplication checks, and tests; prepublish also runs the same gate.

Source: package scripts and release gate

Quality tools beyond unit tests

The harness includes 99% v8 coverage thresholds, jscpd duplicate detection, madge circular dependency checks, gitleaks, lint-staged, and dead-export checks.

Source: test, lint, duplication, dependency, and pre-commit configuration

Effect-first testability

Module mocks and spies are banned; tests must use Effect services, Layers, ports, deterministic clocks, and @effect/vitest instead of monkey-patching.

Source: project rules, lint restrictions, and test suite

Review rubric as executable taste

Review agents enforce casts, dead code, boundary typing, state-space minimality, comments-as-fixes, connascence, and immutability.

Source: codified review rules and architecture audit

Container-aware integration suite

The Huly suite documents container networking, runtime URL patching, 106 MCP tool calls across 18 domains, and full-suite cleanup rules.

Source: integration testing guide and container runtime patch

Permissionless agent lane

Agents run in an isolated container-oriented lane with permission-skipping/yolo-style execution; quality gates catch failures after the model acts.

Source: autonomous agent loop and quality gate scripts

Fresh sessions with file memory

Ralph rotates fresh sessions per task; memory lives in git commits, progress files, guardrails, errors, and the last generated prompt.

Source: Ralph guide, progress state, and learned guardrails

Formal verification track

Tabletop-rule work moves beyond code harnesses into model-based testing, state-space exploration, and executable rule specifications.

Source: 5e Quint project, D&D rules engine, and public model-based testing article

Secondary test audit

Test-Revizorro is one supporting layer: useful for auditing generated tests, but subordinate to the main harness.

Source: automated test-review loop

MCP Productization

Huly MCP Became A Public Package And Directory Entry

461,138 PostHog MCP tool calls from February through April 2026
13,625 sessions/restarts over the same period
2,816 npm downloads in the last month
5 human contributors in the GitHub contributor list

@firfi/huly-mcp

Latest
0.9.1
Created
2026-02-02
Latest publish
2026-04-25
Published versions
72

PostHog MCP Usage

Feb 2026 15,591 922 sessions
Mar 2026 312,427 5,194 sessions
Apr 2026 133,120 7,509 sessions

Contributors

Total
6
Human
5

Formal Tools And Rule Normalization

Research Outputs Became Public Tools, Plugins, And Runnable Rule Systems

Quint JetBrains Plugin

Marketplace
Quint
Plugin ID
com.dearlordylord.quint.idea
Downloads
187
Latest version
0.5.7

Model-Based Testing Article

Hacker News
109 points
Discussion
76 comments
View HN thread

Savage Worlds Normalization

Savage Worlds combat and status rules became a runnable state machine with formal specs, model-based tests, scenario pages, English/Russian routes, and a deployed UI.

Open site

Executable Rules

๐Ÿšจ๐Ÿšจ๐Ÿšจ The important part ๐Ÿšจ๐Ÿšจ๐Ÿšจ is not the files; ๐Ÿง ๐Ÿง ๐Ÿง  it is the method ๐Ÿง ๐Ÿง ๐Ÿง : normalize tabletop rules into state machines, Quint specs, model-based tests, and scenario pages that can be run and checked.

state machines Quint specs MBT tests scenario cookbook EN/RU rules

Open Source Contributions

Merged Upstream PRs Across Real Projects

This section excludes professional Reflector work, Monadical repositories, and personal fork staging PRs. It includes merged upstream fixes, integration work, and ecosystem contributions accepted by maintainers.