Building an AI Gateway on Fastly Compute

aspires · March 10, 2026, 6:05pm

Check out this blog from @JonathanSpeek on his process for building an AI routing gateway with Mercury2 on Fastly compute (there’s even a video demo ).

TL;DR – Based on the nature of the end user’s query, the gateway routes to the most efficient and cost-effective model for the job. At scale, these token utilization efficiencies add up to major cost savings!

As always, we’d love to hear more about what you’re building on Fastly, especially with AI. Let us know in the comments <3

Dandelionss · March 11, 2026, 9:02am

Are there plans to offer managed models in the future? This would enable a one-stop service with Fastly. I would only need to pay Fastly for AI model usage fees to access multiple top-tier models.

aspires · March 11, 2026, 3:19pm

Hey @Dandelionss I’ll pass this on to the team. It’s an interesting idea, but I can’t comment on whether this is something we’ll pursue or not.

Topic		Replies	Views
About the AI category AI	0	251	June 3, 2024
Announcing the AI Accelerator Beta! AI	0	741	June 13, 2024
Using AI Accelerator without Fastly-Key AI	1	154	November 18, 2024
Monetize AI Traffic - an implementation for Fastly Compute General edge	3	96	January 6, 2026
Announcing AI Accelerator to General Availability! AI	4	316	September 9, 2025

Building an AI Gateway on Fastly Compute

Related topics