That number came from a Grafana panel I'd stared at for months. Same load. Same traffic shape. Same infra. The only thing that changed was our understanding of what Rust was actually doing for us. I remember sitting in the on-call room thinking, "Wait… if Rust was the fix, why did nothing change until we changed our thinking?" That's when it hit me — the win wasn't the language. It was the lie we stopped believing.

The promise that Rust would save us

We moved a hot service to Rust for all the right reasons.

Memory safety. Predictable latency. No GC pauses.

It was the kind of migration that looks good in a postmortem slide deck. We rewrote a request-heavy service that sat in the middle of our ingestion pipeline. The Java version had grown slow under burst traffic, and everyone agreed Rust would "solve it properly."

And it did. Kind of.

The first load test looked better. Lower CPU variance. Fewer tail spikes. Cleaner flame graphs. But something felt wrong. Production still felt heavy. On-call noise didn't drop. P99s still crept during regional bursts.

We'd replaced the engine, but the car still struggled uphill.

That's when I realized the real problem wasn't Java. It was what we believed performance problems looked like.

What Rust actually changed (and what it didn't)

We expected Rust to remove overhead. What it actually removed was ambiguity.

The Java version had hidden a lot behind GC behavior and thread scheduling. Rust made everything explicit. Ownership boundaries. Allocation patterns. Backpressure. The code didn't get magically faster — it got louder about what it was doing.

Here's a simplified version of the hot path we migrated:

public Response handle(Request req) {
    Payload payload = parser.parse(req.body());
    Result result = processor.process(payload);
    return encoder.encode(result);
}

Clean. Predictable. And under load, quietly expensive.

The Rust version looked almost identical at a glance:

pub fn handle(req: Request) -> Response {
    let payload = parse(req.body());
    let result = process(payload);
    encode(result)
}

But the difference wasn't syntax. It was that Rust forced us to confront ownership boundaries that Java had been smoothing over with allocation and GC heuristics.

Suddenly, we could see where memory lived, how long it lived, and when it should have died.
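The difference is easiest to see in miniature. In a sketch like this (the types and names are mine, not our production code), Rust will not let a stage quietly retain data it does not own; keeping a payload past the request requires a visible copy:

```rust
// A stage that only inspects the payload can borrow it: no copy at all.
fn checksum(payload: &[u8]) -> u32 {
    payload.iter().map(|&b| b as u32).sum()
}

// A stage that wants to keep the payload must say so.
// The allocation is no longer implicit; it is a visible `to_vec()`.
fn retain_for_audit(payload: &[u8], audit_log: &mut Vec<Vec<u8>>) {
    audit_log.push(payload.to_vec()); // explicit copy, explicit cost
}

fn main() {
    let body = b"hello".to_vec();
    let sum = checksum(&body); // borrow only
    let mut audit = Vec::new();
    retain_for_audit(&body, &mut audit); // one visible allocation
    println!("{sum} {}", audit.len());
}
```

In Java, both stages would look identical at the call site; here, the one that costs memory reads differently from the one that doesn't.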

The performance gain didn't come from Rust being faster. It came from us finally seeing what we were doing wrong.

The invisible cost we kept paying

Here's what the profiler showed after the rewrite:

Request -> Parse -> Allocate 6 objects
         -> Transform -> Clone buffers
         -> Serialize -> Copy again

Nothing outrageous. Nothing "slow." Just a thousand tiny decisions multiplied by 40k RPS.
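Back-of-envelope, with an average allocation size that is my illustrative assumption rather than a measured figure, the multiplication looks like this:

```rust
fn main() {
    let allocs_per_req: u64 = 6;    // from the profiler trace above
    let rps: u64 = 40_000;          // steady-state load
    let avg_alloc_bytes: u64 = 256; // assumption, for illustration only

    let allocs_per_sec = allocs_per_req * rps;            // 240,000 allocs/s
    let bytes_per_sec = allocs_per_sec * avg_alloc_bytes; // ~58 MiB/s of churn

    println!("{allocs_per_sec} allocs/s, {} MiB/s", bytes_per_sec / (1024 * 1024));
}
```

None of those individual numbers would page anyone. The product is a steady stream of allocator traffic on every core, all day.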

In Java, this was amortized by a very forgiving allocator and a generational GC tuned to make bad habits survivable.

In Rust, those habits were still there — just louder.

The mistake we made was assuming the language change was the optimization.

It wasn't.

The real cost was architectural: we were treating request processing as a pure function when it was really a pipeline with memory ownership leaking between stages.

Rust didn't fix that. Rust just stopped hiding it.

The moment it finally clicked

The turning point came during a late-night load test. No code changes. Just better metrics.

We added one histogram: bytes allocated per request.
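The histogram itself came from our metrics stack, but the core measurement can be sketched with a wrapping global allocator. Everything below, names included, is a minimal stand-in, not our production instrumentation:

```rust
use std::alloc::{GlobalAlloc, Layout, System};
use std::sync::atomic::{AtomicUsize, Ordering};

// Wraps the system allocator and counts every byte handed out.
struct CountingAlloc;

static ALLOCATED: AtomicUsize = AtomicUsize::new(0);

unsafe impl GlobalAlloc for CountingAlloc {
    unsafe fn alloc(&self, layout: Layout) -> *mut u8 {
        ALLOCATED.fetch_add(layout.size(), Ordering::Relaxed);
        System.alloc(layout)
    }
    unsafe fn dealloc(&self, ptr: *mut u8, layout: Layout) {
        System.dealloc(ptr, layout)
    }
}

#[global_allocator]
static A: CountingAlloc = CountingAlloc;

// Snapshot the counter around a request to get "bytes allocated per request".
fn bytes_allocated_by<F: FnOnce()>(f: F) -> usize {
    let before = ALLOCATED.load(Ordering::Relaxed);
    f();
    ALLOCATED.load(Ordering::Relaxed) - before
}

fn main() {
    let bytes = bytes_allocated_by(|| {
        let _buf = vec![0u8; 4096]; // stand-in for one request's work
    });
    println!("allocated ~{bytes} bytes for this request");
}
```

Feed that per-request number into a histogram and the waste stops being an argument and becomes a chart.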

The numbers were uncomfortable.

Not catastrophic. Not buggy. Just… wasteful.

That's when we stopped arguing about languages and started asking a different question: "Why are we allocating at all here?"

That led to a refactor that would've helped in any language.

We stopped passing owned data through the pipeline and started streaming references where possible. Fewer buffers. Fewer copies. Smaller lifetimes.

In Rust, it looked like this:

pub fn handle<'a>(req: &'a Request) -> Response<'a> {
    let view = parse_view(req.body());
    let result = process_view(view);
    encode_view(result)
}

No magic. Just honesty.
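For a self-contained picture of the same idea, here is a toy version with invented types and a trivially simple "parse": the parse stage returns a view that borrows from the request buffer, so nothing copies the payload until the final encode.

```rust
// A zero-copy "view" into the request body: just borrowed slices.
struct PayloadView<'a> {
    header: &'a str,
    body: &'a str,
}

// Parse without allocating: split the input in place.
fn parse_view(raw: &str) -> PayloadView<'_> {
    let (header, body) = raw.split_once('\n').unwrap_or((raw, ""));
    PayloadView { header, body }
}

// Process without taking ownership: the view keeps borrowing.
fn process_view<'a>(view: PayloadView<'a>) -> PayloadView<'a> {
    PayloadView {
        header: view.header.trim(),
        body: view.body.trim(),
    }
}

// Only the final encode allocates, once, for the response.
fn encode_view(view: PayloadView<'_>) -> String {
    format!("{}:{}", view.header, view.body)
}

fn handle(raw: &str) -> String {
    encode_view(process_view(parse_view(raw)))
}

fn main() {
    println!("{}", handle("id=42 \n payload-bytes "));
}
```

One allocation per request instead of one per stage, and the lifetimes document exactly who borrows from whom.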

The result?

Lower allocation rate. Flatter latency curves. And a system that finally behaved the way we'd expected it to weeks earlier.

The uncomfortable truth about "performance rewrites"

We didn't gain performance because Rust is faster than Java.

We gained performance because Rust refused to lie to us.

Java let us pretend allocation was free. Rust forced us to notice it.

Java let us defer thinking about ownership. Rust made ownership unavoidable.

The rewrite didn't fix the system. It exposed it.

And that's the part most postmortems skip.

What this means for your system

If you're considering a rewrite — Rust, Go, C++, whatever — ask yourself this first:

Are you trying to go faster? Or are you trying to see clearly?

Because if your architecture leaks work, no language will save you. And if your mental model is wrong, faster code just makes the mistake arrive sooner.

The uncomfortable truth is this:

Most performance gains don't come from faster languages.

They come from removing the assumptions those languages quietly protected you from.

The system still remembers

Every system carries the history of its assumptions. Some hide them behind abstractions. Some force you to face them head-on.

Rust didn't make our system fast. It made our misunderstandings expensive enough to notice.

If your metrics improved after a rewrite, ask yourself why. If they didn't, ask what the old system was protecting you from.

Disagree with a claim here? Post your numbers and the exact reproducer.