Blog
Engineering notes from the people building ApiLink. Billing edge cases, integration guides, post-mortems — written so the next person doesn’t have to re-derive them.
ApiLink vs OpenRouter vs ZenMux: an honest gateway comparison
Three AI gateways, side by side. Where each one wins, where each one loses, and the honest answer about using more than one.
Pointing OpenAI Codex CLI at a third-party gateway
Two environment variables and Codex talks to Claude, Gemini, or DeepSeek instead of GPT. Plus the same trick for Cursor, Aider, Cline.
Using Claude/GPT/Gemini from China: a compliance checklist
Payment, invoicing, forex, data residency — every wall Chinese teams hit a quarter into using OpenAI or Anthropic, with a concrete checklist.
Anthropic prompt cache: when the 25% write premium actually pays back
Anthropic's prompt cache is 90% cheaper — on reads. The catch is the 25% write premium. Here's the math on when that pays off.
Where streaming AI gateways quietly leak money on every disconnect
Most gateways refund users when streams drop mid-flight. A specific disconnect pattern turns that refund into free inference. Here is the attack and the fix.