Phase 1 Week 8: Wide to Learn, Narrow to Ship

Four weeks ago we promised two things: local development and CI improvements, and a backend tech stack evaluation. Both delivered. The CI improvements shipped. The backend evaluation—well, we evaluated eleven of them. That is not a typo. We built the same expense tracker REST API in eleven different backend frameworks spanning ten programming languages, all sharing a single OpenAPI 3.1 contract and a single Gherkin BDD specification suite. AI-assisted development made this practical. Discipline made it useful.

Beyond the backends, both content platforms migrated from Hugo to fullstack Next.js 16, OrganicLever's backend pivoted from Java/Spring Boot to F#/Giraffe with OpenAPI contract enforcement, three demo frontends shipped across different frameworks, the project adopted FSL-1.1-MIT licensing, and CI/CD grew from 7 workflows to 29.

A note on scale: every line of code pushed into a production codebase is a liability as much as an asset. More code means more surface area for things to break, more entropy to manage, more maintenance burden. The demo backends are reference implementations—learning artifacts. But for OrganicLever, the production app, the lesson is the opposite of "build more." The tech stack decision that followed is about keeping the production surface area small and manageable. We went wide to learn. We go narrow to ship.

The Polyglot Backend Experiment

Why Eleven Backends

The Week 4 update mentioned we were developing OrganicLever's backend in multiple tech stacks to evaluate which serves potential users best. The original plan was perhaps three or four stacks—enough to make an informed choice. What actually happened was more ambitious.

AI-assisted development compressed what would have been months of work into weeks. Each backend took roughly two to three days from scaffold to passing E2E tests, because the patterns were established by the first few implementations. The OpenAPI contract defined what to build. The Gherkin specifications defined how to verify it. Each new backend was an exercise in translating established patterns into a new language's idioms.

The goal was never benchmarking. We deliberately avoided putting excessive weight on performance numbers—request latency, throughput, memory footprint—because performance problems have architectural and system design solutions. You can cache, scale horizontally, add a CDN, or move hot paths to a faster runtime. Performance is important, but it is not the whole story and not the most important one for our usage.

What we focused on instead was the experience of building a real application with each stack: authentication, CRUD with business rules, reporting endpoints, admin operations, database migrations, repository abstractions, test coverage enforcement, and CI/CD integration. You learn different things writing a repository pattern in Clojure's defprotocol than you do writing one with Go's interfaces or Rust's traits.

We especially evaluated each stack through the lens of AI-assisted development—because that is how we build. Three criteria matter when coding with AI:

How expressive the type system's guardrails are. A type system that catches more errors at compile time means an AI coding assistant can work more autonomously, produce more correct code on the first pass, and waste fewer tokens on round-trips fixing type errors. Languages with strong static typing and expressive type systems—F#, Rust, TypeScript, Kotlin—outperform dynamically typed or weakly typed alternatives in this regard.
Ecosystem coverage for the chosen architecture. Community packages and libraries need to support the patterns we use: OpenAPI codegen, Gherkin BDD testing, database migrations, repository abstractions. Some ecosystems have mature tooling for all of these. Others required us to build custom libraries (Clojure and Elixir OpenAPI codegen, Elixir Gherkin testing). The ecosystem gap directly affects how much infrastructure work competes with product work.
Conciseness and expressiveness of the language. AI-assisted software development means reviewing large volumes of generated code. A language that expresses the same logic in fewer, clearer lines reduces review burden and makes it easier to spot errors. F#'s computation expressions, Kotlin's coroutines, and Clojure's data-oriented style all score well here. Java's verbosity works against it.

These criteria—along with ecosystem stability and architectural possibilities—weighted heavily in the final tech stack decision. A language can have excellent runtime performance, but if its type system cannot guide an AI assistant, if its ecosystem lacks the libraries we need, or if reviewing its output requires reading three times as much code, the practical development experience suffers. Performance can be solved with architecture. A poor development experience compounds with every line of code written.

Every backend implements the same domain: an expense tracker with user registration, password-based authentication, JWT token management, expense CRUD with currency and unit handling, attachment management, reporting endpoints, admin operations, and a health check. Realistic enough to exercise real patterns. Consistent enough to compare meaningfully.

One Contract, Many Languages

The shared infrastructure that made this possible is the contract layer. Two artifacts define what every backend must implement:

OpenAPI 3.1 contract at specs/apps/a-demo/contracts/openapi.yaml—a single source of truth for every endpoint, request body, response schema, error format, and authentication requirement. Each backend has a codegen Nx target that generates language-specific types, models, and encoders/decoders from this specification. The generated code lives in generated-contracts/ (gitignored, regenerated on build). Contract violations are caught automatically—codegen runs before typecheck and build, so if the implementation drifts from the contract, the build fails.

Gherkin BDD specifications at specs/apps/a-demo/be/gherkin/—14 feature files across 8 domains defining 78 scenarios that every backend must satisfy:

admin (6 scenarios) — user management, role-based access
authentication (12 scenarios) — password login, token lifecycle
expenses (33 scenarios) — CRUD, currency handling, unit handling, attachments, reporting
health (2 scenarios) — service health checks
security (5 scenarios) — authorization, input validation
test-support (2 scenarios) — test data management
token-management (6 scenarios) — JWT refresh, revocation
user-lifecycle (12 scenarios) — registration, account management

These specifications are consumed at all three testing levels. Unit tests mock dependencies and call service functions directly. Integration tests use a real PostgreSQL database via Docker Compose but still call service functions directly—no HTTP. E2E tests hit real HTTP endpoints via Playwright. Same Gherkin scenarios, different step implementations. Specification drift between testing tiers is unlikely to happen unnoticed.

%% Color Palette: Purple #CC78BC (contracts), Blue #0173B2 (scripting), Orange #DE8F05 (JVM), Teal #029E73 (functional)
graph LR
    OA["OpenAPI 3.1 Contract<br/>specs/apps/a-demo/contracts/"]:::contract
    GK["Gherkin BDD Specs<br/>14 features, 78 scenarios"]:::contract
 
    GO["Go / Gin"]:::scripting
    PY["Python / FastAPI"]:::scripting
    TS["TypeScript / Effect"]:::scripting
    JS["Java / Spring Boot"]:::jvm
    JV["Java / Vert.x"]:::jvm
    KT["Kotlin / Ktor"]:::jvm
    FS["F# / Giraffe"]:::functional
    CS["C# / ASP.NET Core"]:::functional
    EX["Elixir / Phoenix"]:::functional
    RS["Rust / Axum"]:::functional
    CJ["Clojure / Pedestal"]:::functional
 
    PG["PostgreSQL<br/>shared schema pattern"]:::contract
 
    OA -->|"codegen"| GO
    OA -->|"codegen"| PY
    OA -->|"codegen"| TS
    OA -->|"codegen"| JS
    OA -->|"codegen"| JV
    OA -->|"codegen"| KT
    OA -->|"codegen"| FS
    OA -->|"codegen"| CS
    OA -->|"codegen"| EX
    OA -->|"codegen"| RS
    OA -->|"codegen"| CJ
 
    GK -->|"drives tests"| GO
    GK -->|"drives tests"| PY
    GK -->|"drives tests"| TS
    GK -->|"drives tests"| JS
    GK -->|"drives tests"| JV
    GK -->|"drives tests"| KT
    GK -->|"drives tests"| FS
    GK -->|"drives tests"| CS
    GK -->|"drives tests"| EX
    GK -->|"drives tests"| RS
    GK -->|"drives tests"| CJ
 
    GO --> PG
    PY --> PG
    TS --> PG
    JS --> PG
    JV --> PG
    KT --> PG
    FS --> PG
    CS --> PG
    EX --> PG
    RS --> PG
    CJ --> PG
 
    classDef contract fill:#CC78BC,stroke:#000000,color:#FFFFFF,stroke-width:2px
    classDef scripting fill:#0173B2,stroke:#000000,color:#FFFFFF,stroke-width:2px
    classDef jvm fill:#DE8F05,stroke:#000000,color:#FFFFFF,stroke-width:2px
    classDef functional fill:#029E73,stroke:#000000,color:#FFFFFF,stroke-width:2px

What Every Backend Shares

Despite the language diversity, every backend implements the same patterns:

Contract codegen — Language-specific code generation from the OpenAPI 3.1 specification. Go uses oapi-codegen, Java and Kotlin use the OpenAPI Generator Gradle plugin, Python uses datamodel-code-generator, Rust uses a custom build script with utoipa, TypeScript uses openapi-typescript, F# and C# use NSwag, Elixir uses a custom Mix task, Clojure uses a custom library (libs/clojure-openapi-codegen).
Gherkin BDD specs — All 78 scenarios consumed at unit and integration levels. Go uses Godog, Java and Kotlin use Cucumber JVM, Python uses pytest-bdd, Rust uses Cucumber-rs, TypeScript uses Vitest-Cucumber, F# uses TickSpec, C# uses Reqnroll, Elixir uses a custom Cabbage fork (libs/elixir-cabbage), Clojure uses clj-cucumber.
Database migrations — Each backend manages its own PostgreSQL schema using language-idiomatic tooling. No shared migration files—each stack owns its schema lifecycle independently.
Repository pattern — Interface-based abstraction separating domain logic from database access. The implementation varies by language paradigm: interfaces in Go, Java, Kotlin, C#, and TypeScript; traits in Rust; protocols in Elixir and Clojure; function-record abstractions in F#.
90%+ test coverage — Enforced via rhino-cli test-coverage validate running as part of test:quick. Coverage tools vary by language: Go's built-in cover, JaCoCo for Java, Kover for Kotlin, coverage.py for Python, cargo-llvm-cov for Rust, Vitest v8 for TypeScript, AltCover for F#, Coverlet for C#, ExCoveralls for Elixir, Cloverage for Clojure.
Docker integration testing — Each backend has a docker-compose.yml spinning up PostgreSQL for integration tests. Tests call service functions directly against the real database—no HTTP layer involved.
Three-level testing — Unit (mocked, cacheable), integration (real PostgreSQL, not cacheable), E2E (real HTTP via Playwright, not cacheable). All three levels consume the same Gherkin specifications.
Dedicated CI workflow — Each backend has its own GitHub Actions workflow running lint, typecheck, test:quick, and test:integration on every push to main.

Database Migration Tooling

Each backend uses the migration tool native to its ecosystem:

Language	Framework	Migration Tool	Notes
Go	Gin	goose	SQL-based migrations
Python	FastAPI	Alembic	Auto-generated from SQLAlchemy models
Rust	Axum	sqlx CLI	Compile-time SQL verification
Java	Spring Boot	Liquibase	XML changelogs
Java	Vert.x	Liquibase	Same tooling, different framework
Kotlin	Ktor	Flyway	Kotlin DSL configuration
F#	Giraffe	DbUp	Embedded SQL scripts
C#	ASP.NET Core	EF Core	Code-first migrations
TypeScript	Effect	@effect/sql Migrator	Effect-native migration system
Elixir	Phoenix	Ecto	Elixir migration modules
Clojure	Pedestal	Migratus	EDN-configured SQL migrations

The migration tooling standardization was its own completed plan (database-migration-tooling). Each backend manages schema independently—no shared migration files across languages. The schema is equivalent but owned by each stack's native tooling.

Lines of Code: Same Domain, Different Languages

Before the qualitative observations, a quantitative one. Every backend implements the same domain—same endpoints, same business rules, same test coverage standard. The difference in lines of code is purely a function of the language and framework:

Language	Framework	Source LOC	Test LOC	Total
Elixir	Phoenix	2,290	5,348	7,638
Java	Spring Boot	2,382	6,835	9,217
TypeScript	Effect	2,435	6,087	8,522
Kotlin	Ktor	2,649	7,235	9,884
Clojure	Pedestal	2,717	2,190	4,907
F#	Giraffe	2,857	7,347	10,204
Go	Gin	2,902	6,861	9,763
Java	Vert.x	3,442	5,520	8,962
Python	FastAPI	3,809	3,266	7,075
C#	ASP.NET Core	3,817	6,313	10,130
Rust	Axum	3,965	7,061	11,026

Source LOC counts committed application code only—no tests, no generated files, no configs. Test LOC includes unit tests, integration step definitions, and BDD step implementations. Sorted by source LOC ascending.

Elixir and TypeScript/Effect are the most concise in source—under 2,500 lines each. Rust and C# are the most verbose—approaching 4,000 lines for the same functionality. Spring Boot's low line count (2,382) is deceptive: Java's annotation-driven style offloads logic to framework magic, so the source is short but the individual lines are not more readable or expressive than other stacks. Compare with Vert.x (3,442)—same language, less framework magic, more honest line count. Clojure stands out with the lowest total (4,907) because its conciseness extends to test code too—the most concise end-to-end. F# sits in the middle at 2,857 source lines—not the fewest, but its expressiveness-per-line is high given the ML-family readability discussed below.

These numbers are not the whole story. Conciseness without guardrails is not necessarily better, and verbose code is not necessarily worse if the type system catches more errors. But they give a concrete sense of how much code you are committing to maintain for the same domain.

Language-by-Language Observations

These are observations from building the same application, not rankings. Four weeks of part-time work is too short for a complete comparison of eleven stacks—what follows is a general feeling from hands-on experience, not a definitive verdict. Every language has trade-offs. The point was to experience those trade-offs firsthand rather than reading about them.

JVM Family — Java/Spring Boot, Java/Vert.x, Kotlin/Ktor

Spring Boot remains the most conventional path. JSpecify with NullAway provides compile-time null safety that catches real bugs. JaCoCo coverage reporting is straightforward. The ecosystem is mature—every problem has a well-documented solution. The verbosity is real but predictable.

Vert.x offers a reactive alternative on the same JVM. The programming model differs significantly—event-loop-based, non-blocking by default. Liquibase migrations shared the same tooling as Spring Boot, but the application structure diverged. Interesting for high-concurrency scenarios, but the ecosystem is smaller.

Kotlin/Ktor brings JVM reliability with modern language features. Coroutines make async code readable. Null safety is baked into the type system rather than bolted on. Flyway migrations integrate cleanly. The codebase reads more concisely than Java line-for-line—each line carries more intent—though the total line count lands between Spring Boot and Vert.x. One practical friction point: editor support. Kotlin tooling in VS Code and its forks is improving but still falls short of the JetBrains/IntelliJ experience. We prefer VS Code-compatible editors—they work seamlessly on local machines and remote servers alike, which matters for our development workflow. Not wanting to depend on a paid IDE subscription for a core development language is a real consideration, and it weighs against Kotlin for primary production use.

Functional Family — F#/Giraffe, Elixir/Phoenix, Clojure/Pedestal

F# surprised us. It is essentially OCaml on top of the .NET ecosystem—a mature functional language with access to the entire .NET library ecosystem, NuGet packages, and native Windows platform support when needed. As an ML-family language, F# code reads like you are writing on a whiteboard: descriptive, minimal ceremony, the intent visible at a glance.

The function-record pattern for repository abstraction—defining a record type whose fields are functions—is elegant and testable. Computation expressions handle async and error flows cleanly. AltCover with --linecover avoids the BRDA inflation that task{} expressions cause in branch coverage. DbUp migrations are simple and reliable.

The .NET foundation also means that if the product ever needs to target Windows-native enterprise environments—a real possibility for Sharia-compliant business systems—the path is already there. Compile times are reasonable at our current codebase size. Of all eleven backends, F# delivered the best combination of expressiveness, guardrails, ecosystem maturity, and development experience.

Clojure/Pedestal represents a fundamentally different approach. defprotocol for the repository pattern, immutable data structures everywhere, REPL-driven development. Migratus for migrations required locale-aware configuration to handle currency formatting correctly. The codebase has the lowest total line count of all eleven when source and tests are combined—as beautiful and descriptive as an ML-family language in its own way. The learning curve is the steepest. Where Clojure falls short for our purposes is the guardrails needed for AI-assisted coding: dynamic typing means fewer compile-time guarantees, and an AI assistant working without static type feedback produces more errors that only surface at runtime. That said, Clojure runs on both the JVM and CLR, giving it a broader base ecosystem than most dynamic languages.

Elixir/Phoenix brings the BEAM's concurrency model. Pattern matching, immutability by default, and the supervision tree are genuinely different from everything else in this list. Elixir shares Clojure's dynamic typing limitation for AI-assisted guardrails, and has a much smaller base ecosystem—the BEAM runtime is powerful but niche compared to the JVM or .NET platforms that Clojure and F# can draw from. Multiple custom libraries were needed to fill ecosystem gaps (detailed in the Gherkin and shared libraries sections below).

C#/ASP.NET Core

C# shares the .NET foundation with F# but takes the OOP path. EF Core migrations are well-integrated, Coverlet provides LCOV coverage, and Reqnroll handles Gherkin cleanly. A solid language with a solid ecosystem. For our purposes, F# on the same platform gives better expressiveness and conciseness—but knowing C# is there as an escape hatch for F# is reassuring. If we ever hit a wall with F# tooling or need a library that only has C# bindings, we can drop into C# without leaving .NET. That safety net made the F# decision easier.

Python/FastAPI

FastAPI's type hints give Python the closest thing it has to static typing, and Alembic migrations auto-generate from SQLAlchemy models. The developer experience is fast—rapid prototyping with minimal boilerplate. pytest-bdd integrates well for Gherkin. The guardrails and typecheck experience is not as refined as TypeScript though—type hints are advisory rather than enforced—and Python's concurrency story still lags behind languages with built-in async runtimes. Python remains the right choice for data analysis and ML workloads, but not for a primary backend where we want strong typing and concurrent request handling.

Systems Languages — Go/Gin, Rust/Axum

Go is straightforward in the way Go always is. Interfaces for the repository pattern, GORM for database access, goose for migrations. The testing story with Godog is clean. go test with coverage just works. Compile times are fast and runtime performance is strong. Where Go falls short is guardrails and domain modeling expressiveness—the type system is intentionally simple, which means fewer compile-time guarantees for complex domain logic. For an enterprise product where we want the type system to encode business rules and guide AI assistants, that simplicity becomes a limitation.

Rust/Axum demands more upfront thought. The ownership model catches real bugs—use-after-free, data races—at compile time. Traits for the repository pattern work well. cargo-llvm-cov for coverage and sqlx for compile-time SQL verification add safety that other languages cannot match. The trade-off is verbosity and the longest compile times of any language in this evaluation—too much friction for a general-purpose backend where developer velocity matters. That said, Rust has a future in OSE for a different role: client-side safety-critical components where resource efficiency and correctness are non-negotiable. Think measurement, validation, or computation that needs to run lean and correct on user devices.

TypeScript/Effect

Effect brings algebraic effects and structured concurrency to TypeScript. The @effect/sql Migrator handles database migrations within the Effect ecosystem. Error handling is type-safe and composable. The codebase reads differently from conventional TypeScript—Effect's pipe-based composition is closer to functional languages than to typical Node.js code. Vitest integration is seamless.

The TypeScript ecosystem produces genuinely interesting work—Effect-TS for typed error handling, XState for state machines, and the possibility of using the same language across web frontend, backend, mobile, and CLI. That cross-platform story is compelling.

That said, TypeScript on the backend carries the npm ecosystem's never-ending security problem—dependency chains are deep, supply chain vulnerabilities are frequent, and the constant churn of packages creates maintenance burden that has nothing to do with your own code. For the frontend, where TypeScript is the natural choice and these libraries shine, we lean in fully. For the backend, where we have the choice and the security surface area matters more, we chose F# on .NET—a platform with a more stable and security-conscious package ecosystem.

A note on Gherkin library maturity across languages.

Every application can be built without Gherkin—it is not a technical requirement. We chose Gherkin as an extra layer for domain knowledge and guardrails, readable by both humans and LLMs alike. When an AI assistant generates an implementation, the Gherkin scenarios serve as a specification it can verify against. When a human reviews the code, the same scenarios serve as documentation of what the system should do. That dual value justified the investment.

But the polyglot experiment exposed a wide spectrum of BDD tooling quality. Cucumber JVM (Java, Kotlin) and Godog (Go) are mature, well-maintained, and straightforward to integrate. pytest-bdd (Python) and Cucumber-rs (Rust) work well with minor quirks. Vitest-Cucumber (TypeScript) is newer but functional. TickSpec (F#) and Reqnroll (C#) integrate cleanly with their respective .NET test runners.

The pain points were Elixir and Clojure. Elixir's Cabbage library required a custom fork (libs/elixir-cabbage) and a custom Gherkin parser (libs/elixir-gherkin) because the existing tooling did not support our test patterns. Clojure's clj-cucumber needed careful configuration for locale-aware scenarios. The Gherkin ecosystem maturity of a language directly affects how much infrastructure work you absorb before you can start testing business logic—and that cost compounds across every backend that uses it.

Demo Frontends: Three Frameworks, One API

The backend experiment has a frontend counterpart. Three frontend frameworks consume the same backend API, validated by the same OpenAPI contract:

a-demo-fe-ts-nextjs — Next.js 16 with React Server Components, TypeScript, and shadcn-ui. The default frontend, exercising the same patterns used in OrganicLever and the content platforms. App Router with server and client components, route-based code splitting, and Vitest for unit tests. Next.js is more complicated than we would like—the mental model around server components, client boundaries, and caching layers adds real cognitive overhead. But it is the de facto web frontend framework right now, with the broadest ecosystem support and the most mature deployment story. For a production product, pragmatism wins over taste.

a-demo-fe-ts-tanstack-start — TanStack Start, a newer full-stack React framework. Type-safe routing, built-in data loading patterns, and a different mental model from App Router. TanStack is gaining momentum and we included it as a hedge—if the ecosystem shifts, we want hands-on experience with a likely successor rather than scrambling to catch up later. When we ran this experiment, TanStack Start had not yet reached version 1, so the evaluation reflects pre-stable APIs and tooling.

a-demo-fe-dart-flutterweb — Flutter Web in Dart. Cross-platform potential is the draw—the same codebase could target mobile and desktop. The web development experience was startling though—the rendering pipeline differs fundamentally from DOM-based frameworks, and the tooling feels foreign coming from web development. More importantly, Dart currently lacks a mature library equivalent to Effect-TS for typed error handling and composability. Flutter remains a candidate for mobile development, where its cross-platform story is strongest, but the web story is not there yet.

a-demo-fs-ts-nextjs — A fullstack Next.js 16 demo combining frontend and backend in one application. Route Handlers serve the API, React Server Components render the UI, and the OpenAPI contract governs both sides. Useful for understanding the trade-offs between separate frontend/backend deployments versus a unified fullstack application.

All frontends have contract codegen from the shared OpenAPI specification and are validated by Playwright E2E tests in a-demo-fe-e2e.

%% Color Palette: Blue #0173B2 (frontends), Purple #CC78BC (contract), Orange #DE8F05 (backends), Teal #029E73 (testing)
graph LR
    subgraph Frontends
        NX["Next.js 16<br/>React + shadcn-ui"]:::frontend
        TS["TanStack Start<br/>Type-safe routing"]:::frontend
        FL["Flutter Web<br/>Dart"]:::frontend
        FS["Next.js 16<br/>Fullstack"]:::frontend
    end
 
    OA["OpenAPI 3.1<br/>Contract"]:::contract
 
    subgraph Backends
        B1["11 Backend<br/>Implementations"]:::backend
    end
 
    FE["a-demo-fe-e2e<br/>Playwright"]:::testing
    BE["a-demo-be-e2e<br/>Playwright"]:::testing
 
    NX --> OA
    TS --> OA
    FL --> OA
    FS --> OA
    OA --> B1
 
    FE -->|"tests"| NX
    FE -->|"tests"| TS
    FE -->|"tests"| FL
    BE -->|"tests"| B1
 
    classDef frontend fill:#0173B2,stroke:#000000,color:#FFFFFF,stroke-width:2px
    classDef contract fill:#CC78BC,stroke:#000000,color:#FFFFFF,stroke-width:2px
    classDef backend fill:#DE8F05,stroke:#000000,color:#FFFFFF,stroke-width:2px
    classDef testing fill:#029E73,stroke:#000000,color:#FFFFFF,stroke-width:2px

Hugo to Next.js: Platform Migrations

Both content platforms—ayokoding.com and oseplatform.com—migrated from Hugo static sites to fullstack Next.js 16 applications during this period. The Hugo sites served well through Phase 0 and early Phase 1, but the limitations became apparent as the project grew: no API layer, no server components, no shared TypeScript component libraries, limited search capabilities, and a separate build toolchain from the rest of the monorepo.

ayokoding-web

The ayokoding-web migration was the larger effort. The Hugo site used the Hextra theme and contained over 1,039 markdown files—915 in English and 124 in Indonesian. All content migrated to the new Next.js 16 application, preserving URLs, bilingual routing, and content structure.

The new platform gained capabilities the Hugo site never had:

Full-text search via FlexSearch, indexing all content client-side for instant results without a search service
Mermaid diagram rendering — diagrams defined in markdown render as interactive SVGs
KaTeX math rendering — mathematical notation renders correctly in technical content
tRPC API layer — type-safe API routes for content querying, search, and navigation
React Server Components — content pages render on the server, shipping minimal JavaScript to the client
Shared UI libraries — components from libs/web-ui and libs/web-ui-token used across ayokoding-web, ose-web, and OrganicLever

Three completed plans tracked this migration: ayokoding-web-v2 for the initial rewrite, ayokoding-web-v1-to-v2-migration for content migration and URL preservation, and ayokoding-web-ci-quality-standardization for test infrastructure.

Both backend and frontend E2E test suites were created: ayokoding-web-be-e2e validates the tRPC API, and ayokoding-web-fe-e2e validates the rendered UI via Playwright.

ose-web

The ose-web migration followed the same pattern. The Hugo site used the PaperMod theme—a simpler site with fewer pages but the same architectural limitations. The new Next.js 16 application shares the component library, deployment patterns, and testing infrastructure established by the ayokoding-web migration.

The plan ose-web-nextjs-rewrite tracked this work, and ose-web-e2e-apps added the E2E test suites.

You are reading this update on the migrated ose-web. The Hugo site that published the Week 4 update no longer exists—this is its Next.js successor.

%% Color Palette: Orange #DE8F05 (before), Blue #0173B2 (after), Teal #029E73 (shared)
graph LR
    subgraph Before
        AH["ayokoding-web<br/>Hugo + Hextra"]:::before
        OH["ose-web<br/>Hugo + PaperMod"]:::before
    end
 
    M["Migration"]:::shared
 
    subgraph After
        AN["ayokoding-web<br/>Next.js 16 + tRPC<br/>FlexSearch, Mermaid, KaTeX"]:::after
        ON["ose-web<br/>Next.js 16 + tRPC"]:::after
        UI["libs/web-ui<br/>libs/web-ui-token<br/>Shared Components"]:::shared
    end
 
    AH --> M
    OH --> M
    M --> AN
    M --> ON
    AN --> UI
    ON --> UI
 
    classDef before fill:#DE8F05,stroke:#000000,color:#FFFFFF,stroke-width:2px
    classDef after fill:#0173B2,stroke:#000000,color:#FFFFFF,stroke-width:2px
    classDef shared fill:#029E73,stroke:#000000,color:#FFFFFF,stroke-width:2px

OrganicLever Fullstack Evolution

OrganicLever—the Phase 1 product that exercises the platform—underwent a significant architectural shift during this period.

The Chosen Stack

The Week 4 update described OrganicLever's backend as Spring Boot 4.0.3 on Java 25. That backend served its purpose: it validated the CI/CD pipeline, E2E testing patterns, and Docker Compose workflows. The polyglot experiment settled the question.

F#/Giraffe is the chosen backend. The language observations above detail why—expressiveness, guardrails, .NET ecosystem maturity, Windows platform access, and reasonable compile times. The decision was pragmatic: F#'s strengths matched our domain, and the demo implementation proved those strengths were not theoretical.

Next.js with Effect-TS and TypeScript is the chosen web frontend. Next.js is the de facto standard despite its complexity. Effect-TS brings type-safe error handling that matches the functional discipline of the F# backend. The demo experiments with TanStack Start and Flutter Web informed this decision, but Next.js + Effect-TS gives the broadest ecosystem support and the strongest alignment with our principles.

Mobile stack remains undecided. Flutter is a candidate given the demo frontend evaluation, but the decision will come later when OrganicLever's domain features are mature enough to warrant a mobile client.

These decisions might seem final, but they are not permanent. As the application grows in production, any tech stack can fall short of expectations in ways we cannot predict today. That is why Gherkin specs, OpenAPI contracts, E2E tests, and C4 architecture diagrams are among the most important investments in this project—not because they make the current stack better, but because they make rewriting or porting to a different stack a manageable experience rather than a painful one. The behavioral specifications define what the system does independently of how it is implemented. If F# needs to become something else in two years, the specs, contracts, and tests carry over. The implementation is replaceable. The specification is the asset—which is also why the FSL-1.1-MIT license protects the specifications specifically. More on that below.

The backend now runs F#/Giraffe with PostgreSQL, DbUp migrations, AltCover for test coverage, and the same three-level testing and contract-driven patterns applied to the demo backends. OrganicLever adopted the same OpenAPI contract enforcement—an OpenAPI 3.1 specification at specs/apps/organiclever/containers/contracts/ with codegen for both organiclever-be and organiclever-web.

Authentication and OAuth

JWT-based authentication with refresh tokens was implemented initially in Spring Boot, then migrated to F#/Giraffe as part of the pivot. Google OAuth login was integrated for user authentication. The auth flow is end-to-end tested via Playwright in organiclever-be-e2e and organiclever-web-e2e.

%% Color Palette: Blue #0173B2 (frontend), Orange #DE8F05 (backend), Purple #CC78BC (contract), Teal #029E73 (external)
graph LR
    FE["organiclever-web<br/>Next.js 16 + React 19"]:::frontend
    BE["organiclever-be<br/>F# / Giraffe"]:::backend
    CT["organiclever-contracts<br/>OpenAPI 3.1"]:::contract
    PG["PostgreSQL"]:::external
    GA["Google OAuth"]:::external
    FEE["organiclever-web-e2e<br/>Playwright"]:::frontend
    BEE["organiclever-be-e2e<br/>Playwright"]:::backend
 
    CT -->|"codegen"| FE
    CT -->|"codegen"| BE
    FE -->|"REST API"| BE
    BE --> PG
    BE -->|"OAuth"| GA
 
    FEE -->|"tests"| FE
    BEE -->|"tests"| BE
 
    classDef frontend fill:#0173B2,stroke:#000000,color:#FFFFFF,stroke-width:2px
    classDef backend fill:#DE8F05,stroke:#000000,color:#FFFFFF,stroke-width:2px
    classDef contract fill:#CC78BC,stroke:#000000,color:#FFFFFF,stroke-width:2px
    classDef external fill:#029E73,stroke:#000000,color:#FFFFFF,stroke-width:2px

Infrastructure Maturation

The polyglot explosion and platform migrations were built on infrastructure that matured significantly during this period. None of it appeared overnight—each improvement was a completed plan responding to real friction discovered during development.

rhino-cli Improvements

rhino-cli evolved from v0.10.0 to handle the demands of a monorepo with 30+ projects across 10+ languages.

doctor --fix — The doctor command previously diagnosed missing tools but required manual installation. The --fix flag now auto-installs missing dependencies, with --dry-run to preview changes. The --scope minimal flag checks only core tools (git, Volta, Node.js, npm, Go, Docker, jq) for faster CI runs.

env init — Bootstraps .env files from .env.example templates. No more manually copying and editing environment files when setting up a new development environment.

env backup and env restore — Environment variable management across the monorepo. Backup captures all .env files, restore replays them. Two plans (env-backup-restore and env-enhanced-backup-restore) refined this workflow.

Expanded tool verification — The doctor command now checks Playwright browser versions, Rust toolchain versions, Flutter SDK versions, and Brewfile dependencies. As the polyglot monorepo grew, so did the list of tools that needed to be present and correctly versioned.

The native-dev-setup-improvements and cli-testing-alignment plans tracked these changes. All rhino-cli commands are backed by Godog BDD scenarios with mock-based unit tests and real-filesystem integration tests.

Spec Coverage Enforcement

The spec-coverage Nx target—which validates that Gherkin specifications are consumed by test implementations—was extended to cover all projects in the monorepo. The spec-coverage-full-enforcement plan added multi-language step extraction supporting Go, TypeScript, Java, Kotlin, Python, Rust, F#, C#, Elixir, Clojure, and Dart. The specs-structure-consistency plan ensured specification directories follow a consistent structure across all applications.

rhino-cli spec-coverage validate now runs as a separate Nx target in the pre-push hook, enforced alongside typecheck, lint, and test:quick. If a Gherkin scenario exists without a corresponding step implementation, the push is blocked.

CI/CD: From 7 to 29 Workflows

The CI/CD infrastructure grew from 7 workflows to 29, organized around 8 reusable workflow templates:

Reusable templates:

_reusable-backend-coverage.yml — Coverage upload for any backend
_reusable-backend-e2e.yml — E2E test execution with Docker Compose
_reusable-backend-integration.yml — Integration test execution with real PostgreSQL
_reusable-backend-lint.yml — Lint and typecheck for any backend
_reusable-backend-spec-coverage.yml — Spec coverage validation
_reusable-backend-typecheck.yml — Type checking
_reusable-frontend-e2e.yml — Frontend E2E test execution
_reusable-test-and-deploy.yml — Test and deploy content sites

Each of the 11 demo backends has a dedicated workflow (test-a-demo-be-*.yml) that composes these reusable templates. The 3 demo frontends and 1 fullstack demo each have their own workflows too. OrganicLever, ayokoding-web, and ose-web round out the total.

%% Color Palette: Purple #CC78BC (templates), Blue #0173B2 (backend), Orange #DE8F05 (frontend), Teal #029E73 (platform)
graph LR
    subgraph Reusable Templates
        RC["backend-coverage"]:::template
        RE["backend-e2e"]:::template
        RI["backend-integration"]:::template
        RL["backend-lint"]:::template
        RS["backend-spec-coverage"]:::template
        RT["backend-typecheck"]:::template
        RF["frontend-e2e"]:::template
        RD["test-and-deploy"]:::template
    end
 
    subgraph Backend Workflows
        B1["Go/Gin"]:::backend
        B2["Python/FastAPI"]:::backend
        B3["...9 more"]:::backend
    end
 
    subgraph Frontend Workflows
        F1["Next.js FE"]:::frontend
        F2["TanStack Start"]:::frontend
        F3["Flutter Web"]:::frontend
    end
 
    subgraph Platform Workflows
        P1["ayokoding-web"]:::platform
        P2["ose-web"]:::platform
        P3["OrganicLever"]:::platform
    end
 
    RC --> B1
    RL --> B1
    RI --> B1
    RC --> B2
    RL --> B2
    RI --> B2
    RC --> B3
    RL --> B3
    RI --> B3
    RF --> F1
    RF --> F2
    RF --> F3
    RD --> P1
    RD --> P2
 
    classDef template fill:#CC78BC,stroke:#000000,color:#FFFFFF,stroke-width:2px
    classDef backend fill:#0173B2,stroke:#000000,color:#FFFFFF,stroke-width:2px
    classDef frontend fill:#DE8F05,stroke:#000000,color:#FFFFFF,stroke-width:2px
    classDef platform fill:#029E73,stroke:#000000,color:#FFFFFF,stroke-width:2px

The demo-ci-test-standardization and ci-standardization plans tracked the workflow buildout. Reusable templates eliminated duplication—each backend workflow is roughly 30 lines composing shared templates, rather than 200+ lines of duplicated YAML.

Developer Experience

Brewfile — A declarative Homebrew manifest listing every tool the monorepo needs. brew bundle installs everything. Combined with rhino doctor --fix, a new developer can go from a fresh machine to a working environment with two commands.

Docker Compose standardization — Every backend that needs PostgreSQL has a docker-compose.yml for local development and integration testing. The patterns are consistent across languages: same PostgreSQL version, same port mapping conventions, same health check configurations.

Shared libraries — The libs/ directory grew from 2 to 8 libraries during this period:

golang-commons — Shared Go utilities (existing)
hugo-commons — Hugo-specific utilities (existing, will be archived when Hugo sites are fully removed)
web-ui — Shared React UI components built with shadcn-ui and Radix
web-ui-token — Design tokens for consistent theming across TypeScript applications
clojure-openapi-codegen — Custom OpenAPI code generator for Clojure (the ecosystem lacked a suitable one)
elixir-cabbage — Custom fork of the Cabbage BDD library for Elixir Gherkin testing
elixir-gherkin — Gherkin parser for Elixir
elixir-openapi-codegen — Custom OpenAPI code generator for Elixir

Four of the six new libraries were created to fill gaps in the Clojure and Elixir ecosystems where existing OpenAPI codegen or BDD tooling did not meet our needs. Building custom libraries was not the plan—it was the pragmatic response to real gaps discovered during the polyglot experiment. It also illustrates the entropy cost: each custom library is code we now own and maintain. The Clojure and Elixir libraries serve the demo backends—they are not on the critical path for the production app. The two TypeScript libraries (web-ui, web-ui-token) are, and those have the ecosystem support to justify their maintenance cost.

FSL-1.1-MIT: Protecting the Mission (Historical Record)

Update (2026-04-22): The project reverted from FSL-1.1-MIT back to MIT on 2026-04-22, after reassessing the strategic tradeoff. The section below is preserved as a historical record of the reasoning at the time of the FSL migration. See the MIT license rationale for the reasoning behind the reversion.

On April 4—one day before this update—the project completed its migration from MIT to FSL-1.1-MIT (Functional Source License).

What FSL-1.1-MIT Means

The Functional Source License is a source-available license created by Sentry. The code is publicly available—anyone can read it, fork it, modify it, and run it. The single restriction: you cannot offer a competing product in the same functional domain during the initial license period. After two years from each version's release, the code automatically converts to the MIT license with no restrictions.

This is not proprietary. This is not closed-source. This is time-delayed open source with a single commercial protection: don't use our code to build a competing Sharia-compliant enterprise platform and sell it against us during the initial years.

Per-App Domain Scoping

The restriction is scoped per application domain, not blanket across the repository:

organiclever-web, organiclever-be — Restricted domain: productivity tracking for Sharia-compliant businesses
ose-web — Restricted domain: Sharia-compliant enterprise platform marketing and content
Behavioral specs (specs/) — FSL-1.1-MIT protecting the WHAT (behavioral contracts that define our product)
Demo apps (a-demo-*) — MIT license. Reference implementations meant for learning. No restrictions.
Libraries (libs/) — MIT license. Reusable infrastructure should be freely available.
ayokoding-web — Educational content. Knowledge sharing is part of the mission, not something to restrict.

The distinction follows a principle: the HOW (implementation patterns, libraries, educational content) is freely available under MIT. The WHAT (behavioral specifications that define our specific products) and the products themselves are protected under FSL-1.1-MIT during the initial period.

Why Now

The project reached the point where the specifications and product code represent meaningful intellectual property. During Phase 0, everything was scaffolding—protecting it would have been premature. Now, with 78 Gherkin scenarios defining the demo domain, OrganicLever's auth flows implemented, and content platforms live, the specifications encode real product decisions worth protecting.

The license change also prepares for Phase 2 and beyond. When external contributors join, the licensing terms need to be clear from the start. Changing licenses after community contributions creates legal complexity. Setting the terms now—while the contributor base is small—is cleaner.

Educational content on ayokoding.com remains freely available. The governance documentation, conventions, and development practices are MIT-licensed. The mission is to democratize access to Sharia-compliant enterprise systems. FSL-1.1-MIT protects the commercial viability needed to reach that goal while guaranteeing eventual full open-source freedom.

Testing and Quality Standards

The testing infrastructure that served 7 projects in Week 4 now serves 30+ projects across 10+ languages. The three-level testing standard and contract-driven pipeline described in the polyglot section above now apply uniformly across the monorepo. The main evolution during this period was coverage recalibration.

Coverage Recalibration

Week 4 enforced 95%+ coverage uniformly. That worked for 7 projects in 2 languages. With 11 backends, 3 frontends, 2 content platforms, and multiple CLI tools, the thresholds were recalibrated to reflect the different testing economics of each project type:

Backend apps — 90%+ (service logic, repository implementations, and route handlers)
Frontend apps — 70%+ (UI components with mocked API layers by design)
Content platforms — 80%+ (content rendering, search, API routes)
CLI tools — 90%+ (core logic, command parsing, output formatting)

The reduction from 95% to 90% for backends was deliberate. Across 11 languages, some coverage tools measure differently—AltCover's line coverage for F# task{} expressions, Cloverage's Clojure macro handling, cargo-llvm-cov's treatment of Rust's pattern matching. A uniform 95% threshold created false pressure to write tests that tested coverage tools rather than business logic. 90% is the right floor—it catches gaps without incentivizing coverage gaming.

What Changed: Week 16 to Week 20

These are structural changes—capabilities the project has now that it did not have four weeks ago:

Demo backend apps: 0 to 11 (Go, Python, Rust, Java x2, Kotlin, F#, C#, TypeScript, Elixir, Clojure)
Demo frontend apps: 0 to 3 + 1 fullstack (Next.js, TanStack Start, Flutter Web, Next.js fullstack)
Gherkin specifications (demo): 0 to 14 features, 78 scenarios across 8 domains
GitHub Actions workflows: 7 to 29 (8 reusable templates)
Shared libraries: 2 to 8 (6 new, including 4 custom for Clojure/Elixir ecosystem gaps)
OrganicLever backend: Spring Boot (Java) to F#/Giraffe
Content platforms: Hugo to Next.js 16 (both sites migrated)
License: MIT to FSL-1.1-MIT
Coverage threshold: 95% uniform to tiered (90% BE, 80% content, 70% FE)
Languages with production backends: 1 (Java) to 10 (Go, Python, Rust, Java, Kotlin, F#, C#, TypeScript, Elixir, Clojure)
Content files migrated: 1,039 markdown files (915 EN, 124 ID) in ayokoding-web alone
Spec coverage: enforced across all projects with multi-language step extraction

Every item on this list adds to the codebase's surface area. The demo backends and their supporting libraries are learning infrastructure—valuable for evaluation, but each one is code that can break, drift, or rot. The production app (OrganicLever) benefits from the learning without carrying all of the entropy. Going forward, the focus shifts from expanding the surface area to maintaining what matters and letting the rest serve as reference.

What's Actually Next

The polyglot foundation is laid. The platforms are migrated. The licensing is set. CI is finally taking shape. The next four weeks focus on CD and infrastructure—that is the priority. OrganicLever's core domain features will start building on the sideline, but the deployment and infrastructure story comes first, Insha Allah.

Continuous Deployment — CI validates code. CD delivers it. The 29 workflows verify quality, but nothing automatically promotes a validated build to production. The next phase builds the deployment pipeline: automated promotion from main to production branches, deployment verification, rollback capabilities, and environment management. The goal is confidence that a green CI run can flow to production without manual intervention.

Infrastructure exploration — The monorepo now has 30+ projects across 10+ languages. That scale creates real infrastructure questions. Nx remote caching for faster builds across CI and local development. Container orchestration patterns for the multi-backend landscape. Database provisioning and migration automation in deployment contexts. Observability—knowing what is running, how it is performing, and when something breaks—before users tell us.

Fundamental building (sideline) — OrganicLever has authentication and an API contract. It does not yet have the features that make it a productivity tracker. While CD and infrastructure take priority, domain fundamentals start building on the sideline: the core data models, business rules, and user workflows that define what OrganicLever actually does. The tech stack decisions are made—F# backend, Next.js + Effect-TS + TypeScript web frontend. The contract-driven pipeline is ready to carry real business logic while the deployment story matures around it.

Building in the Open

Eight weeks of Phase 1 complete. Eleven backends across ten languages, three frontends, two platform migrations, one license change. The tech stack is chosen. The infrastructure designed in Phase 0 and stress-tested in early Phase 1 carried the load. Now the focus shifts from proving the platform to building on it.

Every commit visible on GitHub. Platform updates published here on oseplatform.com. Educational content shared on ayokoding.com.

We publish platform updates every first weekend of each month. Subscribe to our RSS feed or check back regularly to follow along as Phase 1 continues, Insha Allah.

Phase 1 Week 8: Wide to Learn, Narrow to Ship

Command Palette