Skip to main content
Real Profitable HFT Market Making Lives or Dies by Infrastructure – The Brutal Truth
Trading

Real Profitable HFT Market Making Lives or Dies by Infrastructure – The Brutal Truth

//TIME: 9 min read//AUTH: Richard Soutar
HFTmarket makinglow latencycolocationFPGAdevopstrading

Real Profitable HFT Market Making Lives or Dies by Infrastructure

You backtested your shiny Avellaneda-Stoikov (or Guéant-Lehalle-Fernandez, or whatever flavour) model and got a Sharpe of 15. Congrats!

Now try running it live with 500 μs latency while Jane Street is operating at 87 nanoseconds in the same book.

You’re not competing. You’re free alpha for everyone faster than you.

In high-frequency market making, the model is maybe 10–20 % of the battle. The other 80–90 % is pure infrastructure and devops on steroids.

Here’s the no-BS breakdown of what actually matters in 2025.

Market Making: Avellaneda–Stoikov model> FIG: Market Making: Avellaneda–Stoikov model

The Latency Hierarchy of Pain

Latency RangeWho lives hereWhat it feels likeProfitable MM possible?
< 150 ns (tick-to-trade)Top-tier HFT shops (Jane Street, Jump, Citadel, HRT, Flow Traders)You are the predatorYes
150 ns – 1 μsSerious independent firms & prop teamsYou can still eat, but you have to be smartYes, selectively
1 μs – 50 μsRetail “HFT” bots, most crypto snipersYou’re mostly eating crumbs and getting pickedOnly on illiquid venues
50 μs – 1 msTraditional algo desksAdverse selection hellNo (you lose money)
> 1 msYour laptop + Python + Binance APIFree money for everyone elseLOL no

If you’re not in the first two rows on liquid instruments… just don’t.

The Real Stack – What Winners Actually Use

ComponentWhy it existsCost (rough)Latency savedMandatory for profit?
Colocation / ProximityBe physically next to the matching engine$10k–$100k+/month1–10 μs round-tripYes
FPGA everythingParse feeds, calculate quotes, risk-check in hardware$100k–$2M+ dev cost50–300 ns tick-to-tradeYes for top tier
Kernel bypass (Solarflare Onload, EFVI, DPDK)Skip Linux kernel networking stackFree–$20k/year500–1500 ns per packetYes
Microwave / Laser linksLight travels faster in air than glass (CHI↔NY route)$300k–$1M+/year2–3 ms saved cross-countryFor cross-venue arb
Custom NICs / SmartNICsInline pre-trade risk checks in silicon$15k–$50k per cardAvoids CPU bounceYes for safety
Raw UDP / Exchange binary protocolsNo FIX overhead, direct binary parsingTens of μs savedYes
Deterministic OS (Linux + tuned realtime kernel)No random GC pauses or scheduler hiccupsPredictabilityYes
Queue position trackingKnow exactly where you are in the LOB queueCustom code + exchange depth feedChanges quoting logic completelyYES

Real 2025 example stack for a profitable independent shop on Nasdaq/ CME:

Tick-to-trade> FIG: Tick-to-trade

Total tick-to-trade: 80–150 nanoseconds on a good day.

Language Choice – Where Python Dies

  • Python → research, backtesting, crypto toys only
  • C++ / Rust → production quoting engine (if not on FPGA)
  • Verilog / SystemVerilog → the real winners write the entire strategy in hardware

A single garbage collection pause of 200 μs just wiped out your entire day’s P&L.

The Hidden Killer: Queue Position & Adverse Selection

Even with perfect latency, if you don’t track your exact position in the price-time priority queue, your model is lying to you.

Example:

  • You think you’re first in line at the bid → quote aggressively
  • Actually you’re 50th → every informed seller hits you first → you get run over

Top shops track every add/cancel on the wire and maintain their own shadow book with sub-microsecond accuracy.

Without this, AS (or any model) overestimates profits by 5–20× in real markets.

Crypto vs Traditional – Slightly Different Rules

Crypto is more forgiving because:

  • 24/7 markets (no end-of-day inventory panic)
  • Higher volatility → wider spreads → more room for latency slop
  • Many venues are still slow (Binance spot can be profitable with 5–20 ms)

But the big boys (Wintermute, Jump Crypto, Cumberland) are already running the exact same FPGA/microwave/colocation game on centralized exchanges and on-chain (MEV, Solana Jito bundles, etc.).

Bottom Line – Can a Solo Dev Win?

Yes… but only in niches:

  • Illiquid altcoins
  • Emerging L2s / new perp DEXs
  • Geographic arbitrage (e.g., Korea premium)
  • On-chain market making where latency is measured in block times

On BTC/USD or ES futures? Forget it unless you have $5M–$50M war chest for infrastructure and a team of ex-Citadel FPGA wizards.

Final Reality Check

Beautiful model + mediocre infrastructure = consistent losses
Decent model + god-tier infrastructure = printing money

Choose wisely where you spend your next 12 months.

Next post: “How I built a sub-millisecond market maker in Rust for under $10k” (spoiler: it makes $3/day on some random altcoin… but it’s fun).

Stay fast, or stay home 🚀

// RELATED_ARCHIVES