Cut deploy time from 3 hours to 4 minutes for a 200-engineer trading platform.

A regulated equities platform replaced a three-hour Jenkins deploy train with GitOps on EKS — shipping safely during market hours for the first time.

98%
deploy-time cut
$2.1M
cloud spend / yr saved
0
outage minutes post-launch

Onyx runs an equities and derivatives trading platform used by institutional desks. Two hundred engineers across 14 teams shipped through a single Jenkins deploy train: every release was a three-hour, all-hands affair scheduled outside market hours.

Release freezes around trading windows meant features queued for days. Rollbacks took 45 minutes — an eternity when a regression touches order flow.

Regulatory change control required a full audit trail for every production change, which the team had bolted onto Jenkins through manual ticketing. Configuration drift between environments across the 40-service fleet made every deploy a small gamble.

The hard constraint: migrate without a single minute of downtime during market hours, and make the audit trail better, not worse.

  1. Discovery

    Weeks 1–2

    Dependency-mapped all 40 services, audited the deploy train end-to-end, and scored each service for migration risk. Output: a sequenced cutover plan the compliance team signed off on.

  2. Platform build

    Weeks 3–10

    Stood up multi-AZ EKS with ArgoCD GitOps. Every change became a pull request — the git history is the audit trail. Ephemeral preview environments replaced the shared staging bottleneck.

  3. Mesh & canary cutover

    Weeks 11–14

    Istio service mesh with mTLS everywhere. Services migrated one at a time behind canary releases — 1% of order flow first, automated rollback on SLO breach.

  4. Hardening & handover

    Weeks 15–16

    Added OpenTelemetry tracing across the order path, error budgets per service, and on-call runbooks. Onyx's platform team ran the final five cutovers on its own.
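The canary logic from step 3 (1% of order flow first, automated rollback on SLO breach) can be sketched roughly as below. This is an illustrative model only, not the Onyx implementation and not Istio or ArgoCD configuration: the later traffic steps (10, 50, 100), the 0.1% error-rate threshold, and the names `run_canary` and `error_rate_at` are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class CanaryResult:
    promoted: bool          # True if every step stayed within the SLO
    final_weight: int       # percent of traffic left on the new version
    steps_taken: List[int]  # traffic weights that were actually tried


def run_canary(
    weights: List[int],
    error_rate_at: Callable[[int], float],
    slo_max_error_rate: float,
) -> CanaryResult:
    """Walk through traffic weights; drop back to 0% on the first SLO breach.

    `error_rate_at` is a hypothetical stand-in for a metrics query that
    returns the observed error rate of the new version at a given weight.
    """
    if not weights:
        raise ValueError("at least one traffic weight is required")
    taken: List[int] = []
    for weight in weights:
        taken.append(weight)
        if error_rate_at(weight) > slo_max_error_rate:
            # Automated rollback: all traffic returns to the old version.
            return CanaryResult(promoted=False, final_weight=0, steps_taken=taken)
    return CanaryResult(promoted=True, final_weight=weights[-1], steps_taken=taken)


# Assumed rollout plan: only the 1% first step comes from the case study.
STEPS = [1, 10, 50, 100]
SLO_MAX_ERROR_RATE = 0.001  # assumed threshold

healthy = run_canary(STEPS, lambda w: 0.0002, SLO_MAX_ERROR_RATE)
regressed = run_canary(
    STEPS, lambda w: 0.0002 if w < 10 else 0.02, SLO_MAX_ERROR_RATE
)
print(healthy.promoted, healthy.final_weight)      # True 100
print(regressed.promoted, regressed.steps_taken)   # False [1, 10]
```

In this model a regression that only shows up at 10% of traffic is caught at that step, so at most 10% of order flow ever reaches the new version before traffic is pulled back.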

Edge: NLB, Istio ingress, mTLS
Delivery: GitHub PRs, ArgoCD, Canary + auto-rollback
Platform: EKS multi-AZ, Cluster Autoscaler, 40 services
Observability: OpenTelemetry, Prometheus, Grafana SLOs
GitOps delivery path: every production change is a signed commit, rolled out by ArgoCD behind Istio canaries.
4 min
deploy time

Down from 3 hours. Teams deploy independently, during market hours.
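For readers who want to check the headline percentage, the arithmetic is small enough to sketch. The helper name `percent_reduction` is made up for this example. The inputs are the figures reported in this case study: 3 hours to 4 minutes for deploys, and 45 minutes to 90 seconds for rollbacks.

```python
def percent_reduction(before: float, after: float) -> float:
    """Return how much smaller `after` is than `before`, as a percentage."""
    if before <= 0:
        raise ValueError("before must be positive")
    return (before - after) / before * 100.0


# Deploy time, in minutes: 3 hours -> 4 minutes.
deploy_cut = percent_reduction(before=3 * 60, after=4)
print(f"deploy time cut: {deploy_cut:.1f}%")      # deploy time cut: 97.8%

# Rollback time, in seconds: 45 minutes -> 90 seconds.
rollback_cut = percent_reduction(before=45 * 60, after=90)
print(f"rollback time cut: {rollback_cut:.1f}%")  # rollback time cut: 96.7%
```

The deploy figure rounds to the 98% quoted at the top of the page.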

$2.1M
annual cloud savings

Right-sized requests, spot for non-prod, and retired duplicate staging fleets.

0
outage minutes

Zero unplanned downtime in the first 180 days post-cutover.

90 s
rollback time

Down from 45 minutes — ArgoCD reverts to the last healthy commit.
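The idea of reverting to the last healthy commit can be illustrated with a short sketch. It does not call ArgoCD. It only models the selection step, assuming a deploy history in which each entry records a commit and whether it passed health checks. The `Deployment` type, the `last_healthy_commit` helper, and the commit hashes are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Deployment:
    commit: str    # git commit that was rolled out
    healthy: bool  # whether the rollout passed its health checks


def last_healthy_commit(history: Sequence[Deployment]) -> Optional[str]:
    """Return the newest healthy commit, or None if there is none.

    `history` is ordered oldest first, so the search runs from the end.
    """
    for deployment in reversed(history):
        if deployment.healthy:
            return deployment.commit
    return None


# Made-up history: the two most recent rollouts failed their checks.
history = [
    Deployment("a1f3c9e", healthy=True),
    Deployment("7be20d4", healthy=True),
    Deployment("c94e1aa", healthy=False),
    Deployment("03dd5f2", healthy=False),
]
print(last_healthy_commit(history))  # 7be20d4
```

Because the desired state lives in git, a rollback in this model amounts to pointing production back at that commit rather than rebuilding or redeploying by hand.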

We went from dreading release nights to shipping during market hours. The audit team likes the git trail more than the old ticketing system, too.

VP Platform Engineering, Onyx Trading
EKS, ArgoCD, Istio, OpenTelemetry