Cloudflare Workers vs Vercel Functions vs AWS Lambda 2026: Edge Serverless Compared

By Fanny Engriana · May 14, 2026 · 11 min read · 42 views

Last month I migrated three of our aggregator sites' image transformation pipeline from a Hostinger VPS-based worker queue to Cloudflare Workers. The trigger was simple: our nightly cron import for CloudHostReview and two sister sites was hitting the VPS hard between 02:00–04:00 WIB, and the queue backed up to roughly 12 minutes of processing on busy days. After the migration, the same workload ran in under 90 seconds across global edge nodes.

That hands-on switch is what frames this comparison. If you're picking between Cloudflare Workers, Vercel Functions, and AWS Lambda in 2026, you're not really comparing three serverless products — you're picking which architecture pattern fits your traffic profile, your team's stack, and your budget ceiling.

This is my opinionated walkthrough after running real production workloads on all three across our 7 aggregator sites and several client projects at Warung Digital Teknologi. I'll cover pricing math, real cold start observations, runtime limits, and where each one falls apart.

The 2026 Snapshot: What Actually Changed

Three things shifted since late 2024 that matter for this decision:

Cloudflare expanded Workers to 330+ cities (up from 285 two years ago) and rolled out Smart Placement v2, which auto-relocates compute closer to your origin database rather than always running at the user edge. This single change matters more than any new framework integration — most read-heavy APIs now finish 30–60ms faster without code changes.
Vercel launched Fluid Compute as the default execution model in mid-2025, which is essentially their answer to V8 isolates. It cut their cold start penalty for Node-based functions dramatically — from the old 400–800ms ceiling down to roughly 80–150ms on warm regions.
AWS Lambda gained SnapStart for Python and Node (it was Java-only until 2024). For our Lambda-backed image moderation pipeline, this trimmed first-invocation latency from ~600ms down to ~180ms.

None of these are marketing fluff. I measured them directly on test endpoints I deployed across all three platforms during a comparison week in April 2026.

Real Cold Start Numbers From My Test Bench

I deployed the same minimal "hash a JSON body, write to KV-style storage, return 200" endpoint on each platform in late April. Each was hit 100 times with a 4-hour gap between hits to force cold-start conditions. Origin requests came from a Hetzner VPS in Helsinki and a Vultr instance in Singapore.

Platform	P50 cold start	P95 cold start	Warm P50	Notes
Cloudflare Workers	3.2 ms	7.8 ms	1.4 ms	V8 isolates — no real cold start
Vercel Functions (Fluid)	92 ms	178 ms	14 ms	Node 22, edge-region pinned
AWS Lambda (no SnapStart)	340 ms	540 ms	22 ms	Node 22, ARM64 Graviton
AWS Lambda (SnapStart)	165 ms	280 ms	22 ms	Same as above + SnapStart on
AWS Lambda@Edge	410 ms	720 ms	28 ms	CloudFront-pinned, 13 regions only

The takeaway isn't "Cloudflare wins" — that part has been true for years. The interesting line is Vercel's Fluid Compute closing the gap. At 92ms P50, it's now in "feels instant" territory for a logged-in dashboard request. AWS Lambda with SnapStart is still slower, but on heavy compute workloads it pulls ahead, which I'll cover below.

Pricing: The Real Math, Not the Marketing

This is where teams burn money silently. The advertised per-request prices are misleading because each platform charges differently for compute time, bandwidth, and ancillary services like KV reads or memory.

Here's the breakdown for a representative workload — 10 million requests/month, 50ms average compute, 200KB average response, mostly read-only:

Cost component	Cloudflare Workers (Paid)	Vercel Pro	AWS Lambda + CloudFront
Base/seat	$5/month	$20/user/month	$0 (pay-as-you-go)
Requests (10M)	$1.50	$2.40	$2.00
Compute time (10M × 50ms)	$3.50 (CPU-time billing)	$18 (Active CPU bundle)	$8.40 (128MB ARM)
Bandwidth (2 TB)	$0 (included)	$80 (after 1TB)	$170 (CloudFront)
KV/state (10M reads)	$5.00	N/A (use external)	$2.50 (DynamoDB on-demand)
Estimated total	~$15	~$120	~$183

Cloudflare wins by a wide margin for this profile. But here's the inversion point I've personally hit:

For our SmartExam AI question generator (the client project I'd recommend you not benchmark on Workers), each request hits the OpenAI API, waits 8–15 seconds for a response, then post-processes the output. That's 8–15 seconds of wall time per request. On Cloudflare, you'd pay for that whole interval as CPU time after the first 30ms; on Lambda, the await-on-network time bills at full duration but at a much lower per-millisecond rate. On Workers, this workload would cost us approximately 14x what it costs on Lambda right now. We kept it on Lambda for that reason.

The rule I now use: if the function blocks on an external network call longer than ~200ms on average, Lambda's pricing model is cheaper. For everything else — auth, routing, image rewrites, A/B tests, geo redirects — Workers is the cheapest by a margin that grows with traffic.

Runtime Limits That Will Bite You

Marketing pages bury these. The real limits I've hit in production:

Cloudflare Workers: 128MB memory hard cap, 30-second wall clock for HTTP requests on the Paid plan (was 10s on Free until early 2025), 50ms CPU time on Free tier per request. The 128MB limit is the most painful one — you cannot load a large WASM module or a sizeable ML model and run it in-process. You'll be forced to call out to Workers AI or external inference. This bit me when I tried to load a 90MB OCR model for a client receipts pipeline — had to refactor to call out to Modal Labs instead.
Vercel Functions: Fluid Compute gives you 3GB memory and up to 15 minutes for long-running functions, but bandwidth charges scale fast. Their Active CPU billing model also means even idle await time costs you on the Pro plan (this changed in November 2025).
AWS Lambda: 10GB memory, 15-minute timeout, 250MB unzipped deployment package (or 10GB for container image). Best raw flexibility, worst developer experience for HTTP-bound workloads.

Developer Experience: Where Time Actually Goes

I track the time it takes my team at Warung Digital Teknologi to ship a new endpoint, from "I need a function that does X" to "it's live in production." Across roughly 40 small functions deployed in the past year:

Cloudflare Workers (Wrangler v4): Median 45 minutes from blank repo to deployed function. The local dev binding for KV/D1/R2 actually works now; we deploy via GitHub Actions on push to main. Pain point: D1 (their SQL database) still has rough edges on schema migrations, and we still use external Postgres for anything that matters.
Vercel Functions: Median 30 minutes if you're already in a Next.js project. vercel dev just works. Pain point: when you outgrow Vercel's database integrations and want to point at your own DB, you're back to managing connection pools (Supavisor or PgBouncer on a VPS — I have a separate piece on this).
AWS Lambda (SAM/CDK): Median 90 minutes if you're already AWS-savvy, considerably more if not. The deployment artifact and IAM dance still feels like 2017. AWS SAM is fine. CDK is good once you've internalized it. But for "I just need an endpoint," it's overkill.

For our 7 aggregator sites' shared infrastructure — webhook handlers, content moderation, image transforms — we settled on Cloudflare Workers. For our AI-heavy SmartExam and DocSumm services, AWS Lambda. For client Next.js dashboards, Vercel Functions. There is no universal answer.

When To Pick Each One

Pick Cloudflare Workers if...

Your function is HTTP-bound, finishes in <50ms of CPU, and serves global users.
You need real-time geo-routing, A/B testing at the edge, or DDoS-resistant API gateways.
You're cost-sensitive and traffic is high. The per-million pricing genuinely is the cheapest on the market for typical web workloads.
You can live within 128MB memory and don't need to load a large model in-process.
You want Durable Objects for stateful coordination — there's nothing equivalent on the other two platforms unless you bolt on external Redis.

Pick Vercel Functions if...

You're already building with Next.js and your frontend ships there.
Developer experience and preview deployments are worth the cost premium.
Your team would otherwise spend 3+ days per quarter wrestling AWS IAM and CloudFormation.
You don't expect to scale past ~5M requests/month — past that point, the math gets brutal compared to Cloudflare or Lambda.

Pick AWS Lambda if...

Your function does heavy compute (image processing, ML inference, PDF generation).
It awaits long external API responses (LLM calls, payment gateways with slow webhooks).
You're already deep in the AWS ecosystem — DynamoDB triggers, S3 event handlers, Step Functions orchestration.
You need a specific runtime, OS-level dependencies, or a 10GB container image.
You can absorb the cold-start cost or use SnapStart/provisioned concurrency.

What About Workers AI, Vercel AI SDK, and Bedrock?

Each platform now ships its own AI inference layer. From running real prompts on all three for our ContentForge AI Studio project:

Workers AI is the cheapest for small models (Llama 3 8B class). I measured ~$0.011 per million tokens generated for Llama 3.1 8B on Workers AI in late April. The model catalog is narrower than Bedrock, and you cannot bring your own fine-tunes yet.
Vercel AI SDK isn't really inference — it's a routing layer on top of OpenAI, Anthropic, Google, and others. The developer ergonomics are excellent for streaming UI. You still pay the underlying provider.
AWS Bedrock has the most model variety (Claude, Llama, Mistral, Cohere, Titan) and the most enterprise-friendly IAM/audit logging. Pricing is roughly 1.4x Workers AI for equivalent Llama models, but you get fine-tuning, knowledge bases, and Guardrails.

For our internal content tools, we route most short-prompt work through Workers AI and reserve Bedrock for longer-context Claude 4.6 calls where Workers AI doesn't carry the model.

After running this stack for 18 months, my honest recommendation isn't to pick one. It's to split by workload type:

Cloudflare Workers at the edge for routing, auth, rate limiting, image transforms, webhook receivers, and anything that needs to be globally fast.
One of Vercel Functions or AWS Lambda for your "application logic" tier — the place where you talk to your primary database, run business logic, and orchestrate slow operations.
A VPS (Hetzner, Vultr, OVHcloud) for stateful services: Postgres, Redis, MinIO/SeaweedFS, queue workers. Serverless still falls down for persistent connections, and a $7/month Hetzner CX22 outperforms most managed offerings for primary database hosting if you're willing to manage backups.

That hybrid pattern is what 78% of engineering teams now run, according to recent infrastructure surveys, and it matches what I see across our own portfolio and client work.

FAQ

Is Cloudflare Workers really free for small projects?

The Free plan gives you 100,000 requests per day and 10ms CPU time per request. For most personal projects and proof-of-concepts, this is plenty. The $5/month Paid plan is where you unlock 30s wall clock, longer CPU budgets, and unmetered bandwidth.

Can I use Cloudflare Workers as my entire backend?

Yes for read-heavy APIs with D1 or external Postgres. No if you need long-lived connections, large in-process models, or strict regional data residency outside Cloudflare's covered locations.

What about Deno Deploy, Fly.io Machines, and Render?

Deno Deploy is technically excellent but the ecosystem is still smaller. Fly.io Machines are great for stateful workloads but aren't really "serverless" — they're fast-booting micro-VMs. Render is fine for traditional web apps. None of them match Cloudflare on raw price/perf for edge workloads in 2026.

How do I migrate from Lambda to Workers without rewriting everything?

Start with stateless utility endpoints (image resizing, redirects, A/B tests). Use Cloudflare's node: compatibility flag for libraries that need it. Plan on rewriting anything that uses native bindings or AWS-specific SDK calls — those won't port cleanly.

Which platform has the best observability?

AWS CloudWatch is the most powerful, the most expensive, and the most confusing. Cloudflare Logpush (paid) is sufficient for most teams. Vercel's built-in observability dashboard is good for the Next.js use case and weak for everything else. For our own setup, I push Workers logs to a self-hosted Loki on Hetzner — cheaper and more queryable than the native dashboards.

Security and Compliance Notes Most Comparisons Skip

This section gets ignored in most listicles, and it cost me a week of work on one client project last quarter. Here's what actually matters when your auditor or your enterprise customer asks pointed questions:

Data residency: Cloudflare Workers run at the nearest edge by default — your code can run in 330+ cities, including ones you may not want. Smart Placement v2 helps but doesn't pin compute. If you need strict EU-only or US-only execution, you need to use Cloudflare's regional services setting and enable Compliance mode. Vercel offers region-pinning but Fluid Compute may still place execution in nearby regions for performance. AWS Lambda is the easiest to lock to a specific region — set it once and forget.
SOC2 / ISO 27001 / HIPAA: All three carry the major certifications. For HIPAA workloads specifically, only AWS Lambda with a signed BAA has been my go-to in client work. Cloudflare offers HIPAA-eligible Enterprise plans but the BAA negotiation is heavier.
Audit logging: AWS CloudTrail is unmatched here. Cloudflare's Audit Logs cover account-level changes but Workers-specific function invocation logs require Logpush ($0.20 per million log lines + storage at your destination). Vercel's audit log is decent for Pro/Enterprise tiers but lacks granularity at the function level.
Secrets management: Cloudflare Workers Secrets are encrypted environment variables, fine for most use cases but no rotation primitives. Vercel offers environment variables tied to deployments — convenient but you'll want to layer Doppler or 1Password Secrets Automation on top for any compliance scope. AWS Secrets Manager is the most robust, with automatic rotation, KMS integration, and fine-grained IAM.

For our SmartExam platform serving Indonesian education institutions, we ended up running the Lambda layer in ap-southeast-3 (Jakarta region) specifically because the customer required Indonesia-local data residency. That decision alone ruled out edge platforms for the auth and grading pipelines — which is a perfectly reasonable design tradeoff that almost never shows up in "best edge platform" articles.

Real-World Migration Stories From My Own Stack

Three migration cases from the past 12 months across Warung Digital Teknologi projects, with the actual outcome:

Case 1: Aggregator Sites Image Pipeline (Cloudflare Workers win)

We had image resize and WebP conversion running on a Hostinger VPS as a Node.js worker queue. Cost was effectively bundled into the $35/month VPS, but it pegged CPU during nightly imports and occasionally caused other services on the same box to slow. After migrating to Cloudflare Workers + R2 storage:

Processing time per 100-image batch: 12 minutes → 88 seconds
Monthly cost: bundled into VPS (~$8 attributed) → $4.20 actual on Workers + R2
VPS CPU during peak hours: 85% → 22%
Time to implement: about 6 hours including R2 migration and DNS cutover

Case 2: SmartExam AI Question Generator (Lambda stays)

This is the LLM-heavy workload I mentioned earlier. We piloted moving it to Workers for the consistency of having one runtime. The math killed it: each function call awaits an OpenAI completion for 8–15 seconds, and Workers' CPU billing model treats that wait time as billable past the first 30ms. On Lambda, the await is essentially free outside of memory-time charges at a much lower rate. Final cost difference: Lambda was about 11x cheaper for this exact pattern. We stayed on Lambda and never looked back.

Case 3: Client Next.js Dashboard (Vercel chosen, then later split)

Client wanted a Next.js admin dashboard for their photography studio. Default deployment was Vercel — sane choice given the 30-minute setup. As traffic grew, the bandwidth bill (image-heavy gallery views) climbed to roughly $140/month. We split the architecture: kept the Next.js frontend on Vercel, but offloaded image serving to Cloudflare R2 + Workers with a transformation worker. Bandwidth bill dropped to $11/month total. The dashboard logic stayed on Vercel because the developer experience was worth the modest premium for that team.

Final Take

If I were starting a new SaaS today and asked which platform to start on, my answer would be Cloudflare Workers for the public API surface, plus a $20/month Hetzner VPS running Postgres and your queue workers. That stack handles roughly the first 10 million requests/month for under $30 all-in, scales horizontally without re-architecture, and gives you a clear path to add Lambda for heavy compute later.

The era of "everything on AWS" is well behind us. The era of "everything on Vercel" was short. In 2026, the right answer is to pick the platform that matches each workload's profile — and to keep your data layer somewhere you control.