I accidentally found a security issue while benchmarking postgres changes.

If you run debian testing, unstable or some other more "bleeding edge" distribution, I strongly recommend upgrading ASAP.

https://www.openwall.com/lists/oss-security/2024/03/29/4

I was doing some micro-benchmarking at the time, and needed to quiesce the system to reduce noise. Saw that sshd processes were using a surprising amount of CPU, despite immediately failing because of wrong usernames etc. Profiled sshd, which showed lots of CPU time in liblzma, with perf unable to attribute it to a symbol. Got suspicious. Recalled that I had seen an odd valgrind complaint in automated testing of postgres a few weeks earlier, after package updates.

Really required a lot of coincidences.

Sure glad you did. This would have been unimaginably worse if it'd gone undetected for another six months.
I feel both confident and also kind of queasy when saying this: it seems extremely likely that this is not the first time something like this has happened, it's just the first time we have been lucky enough to notice.

That is true.

Binary artifacts have no business existing in Free Software (nor near-binary artifacts, considering how unauditable pre-generated configure scripts end up being). The way it was compromised in this case has almost certainly happened before, and reminds me of the SourceForge malware debacle (so arguably that's another famous example of it happening before).

I'm not sure if many other projects do as Guix does and record the checksum of the whole repository so as to ensure reproducibility purely from source.
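For anyone unfamiliar with the approach, here is a minimal sketch of the idea in Python. It is not Guix's actual algorithm (Guix has its own serialization and hash scheme); it only shows how pinning a whole source tree to a single digest rejects doctored release artifacts.

```python
# Minimal sketch of pinning a source tree by content hash. This is NOT
# Guix's actual algorithm; it only illustrates the principle.
import hashlib
import os

def tree_hash(root: str) -> str:
    """Hash every file's relative path and contents in a stable order."""
    h = hashlib.sha256()
    for dirpath, dirnames, filenames in os.walk(root):
        dirnames.sort()                      # make traversal order deterministic
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            rel = os.path.relpath(path, root)
            h.update(rel.encode())           # bind each file's bytes to its path
            with open(path, "rb") as f:
                h.update(f.read())
    return h.hexdigest()

# A build recipe records this digest once; any later checkout or tarball
# that hashes differently (e.g. a doctored release artifact) is rejected.
print(tree_hash("."))
```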

In general this is reasonable, but there are some clear exceptions for test vectors in cryptographic and compression libraries (which this was).

In this case the actual malicious vector was near-binary code injected via the practically-binary, unaudited autotools vomit (always run autoreconf yourself), which was then bundled into the actual binary artifact: the compromised tarballs.

None should have ever been part of the project.

As for the test files, I still think that having a hex dump with comments explaining what flaws particular parts test would be desirable in a lot of cases.
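As a sketch of what that could look like (the annotated format and decoder here are hypothetical; the first twelve bytes are the real xz stream header, while the final "flawed" byte is made up):

```python
# Hypothetical format: test vectors stored as commented hex, rebuilt into
# bytes at test time so reviewers can audit every byte.
ANNOTATED_VECTOR = """
fd 37 7a 58 5a 00        # .xz magic bytes
00 04                    # stream flags: CRC64 check type
e6 d6 b4 46              # CRC32 of the stream flags
ff                       # made-up malformed byte: the flaw under test
"""

def decode_annotated_hex(text: str) -> bytes:
    out = bytearray()
    for line in text.splitlines():
        data = line.split("#", 1)[0]         # drop the human-readable comment
        out.extend(int(tok, 16) for tok in data.split())
    return bytes(out)

vector = decode_annotated_hex(ANNOTATED_VECTOR)
assert vector[:6] == b"\xfd7zXZ\x00"         # nothing opaque ever hits the repo
```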

@lispi314 @glyph The actual backdoor code was part of binary test vectors for the LZMA algorithm (and is an actual amd64 binary object file). Autotools goop was just how it was triggered/injected into the build system.

In this case the build system side was rather noisy, but you could make it a lot more subtle. Something like a subtly incorrect glob could be engineered to "accidentally" match a test data file that then gets linked into the build process in some way.
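To make that concrete, here is an illustrative sketch in Python; the file names are invented and the real attack did not work this way, but it shows how one loosened pattern quietly widens what reaches the link step.

```python
# Hypothetical illustration of an over-broad glob; file names are invented.
import glob
import pathlib

pathlib.Path("src").mkdir(exist_ok=True)
pathlib.Path("tests/files").mkdir(parents=True, exist_ok=True)
for p in ("src/crc32_fast.o", "src/crc64_fast.o",
          "tests/files/fixture_amd64.o"):    # the planted "test data" object
    pathlib.Path(p).touch()

# Intended pattern: only objects built from src/ get linked.
print(sorted(glob.glob("src/*.o")))
# ['src/crc32_fast.o', 'src/crc64_fast.o']

# "Accidentally" recursive pattern after an innocent-looking refactor:
# the planted test object now also lands on the link line.
print(sorted(glob.glob("**/*.o", recursive=True)))
# ['src/crc32_fast.o', 'src/crc64_fast.o', 'tests/files/fixture_amd64.o']
```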


Right, I suppose I did omit to mention that one.

Though that suggests my initial statement of "no binaries, full stop" is the right idea.

It's my understanding that at that point, though, you're trusting whoever wrote the assembly/byte/machine code for the CPU.

The problem isn't necessarily the binaries - it may have been harder to do, but they could've hidden it in a commonly distributed compiler à la Thompson's "Reflections on Trusting Trust" ( https://www.cs.cmu.edu/~rdriley/487/papers/Thompson_1984_ReflectionsonTrustingTrust.pdf ) - it's a matter of trust in the person doing the development.

Part of the problem is that the developer's submissions were pushed with less review than necessary.

Indeed, unfortunately until we have Libre FPGAs and the ability to verify their design and configuration, it is necessary to trust the hardware.

In my post here: https://udongein.xyz/notice/AgMU728f3awnLq7eqW

One of the articles (I'm not sure which again) explicitly links to the Trusting Trust problem & documentation.

It is possible, if tedious, to bootstrap a Forth implementation from a minimal amount of machine code entered in a hex editor or other basic input method, retaining the ability to audit the initial binary bootstrap seed, and then bootstrap everything else from source.

Of course in this case the Guix project contributors decided that a small Scheme interpreter (which iirc isn't standards-compliant) would do.
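As a toy illustration of the seed idea (this is not Guix's actual stage0/Mes bootstrap chain, just the shape of it, sketched in Python): the entire trusted base is one small, readable interpreter, and everything it executes is source you can inspect.

```python
# Toy bootstrap seed: the whole trusted base is this one small function.
def run(program, stack=None):
    """A tiny stack machine, small enough to audit in full."""
    stack = stack if stack is not None else []
    for op in program:
        if isinstance(op, int):
            stack.append(op)                 # literal: push onto the stack
        elif op == "+":
            b, a = stack.pop(), stack.pop()
            stack.append(a + b)
        elif op == "*":
            b, a = stack.pop(), stack.pop()
            stack.append(a * b)
        elif op == "dup":
            stack.append(stack[-1])
        elif op == ".":
            print(stack.pop())               # pop and print
    return stack

# Everything above is the auditable seed; everything below is "source"
# the seed executes, so nothing opaque ever enters the chain.
run([6, "dup", "*", 7, "+", "."])            # prints 43
```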

The commits weren't adequately reviewed, yes.

That said, if binary artifacts like tarballs weren't accepted, and autotools vomit (practically binary, given how difficult it is to audit) were deemed equally unacceptable (better to run autoreconf on whatever dev machine is used to build the project, as necessary), it would be far more difficult to replicate this incident.

@lispi314 @glyph @AT1ST You should look up Precursor:

https://www.bunniestudios.com/blog/?p=5921

TL;DR there is a very legitimate argument that backdooring an FPGA generically is impractical, so a libre FPGA configuration (and ideally a libre FPGA toolchain) plus inspectable hardware generally suffices for a high level of trust.

However, this will never be practical for high performance general purpose compute, only for very limited applications like this. I don't think anyone has any good ideas on how to solve the trustability problem for things like desktop PC or smartphone class hardware. It might never be possible.

@lispi314 Do I have to bootstrap the whole chain of programs, starting from GCC?
@AndresFreundTec @glyph @marcan
And a lot of persistence! Reminds me of one of the classics of the industry, Cliff Stoll's The Cuckoo's Egg - "Stoll traced the error to an unauthorized user who had apparently used nine seconds of computer time and not paid for it" - leading to a German hacker selling content to the KGB, 38 years ago. It is impressive (but uncommon) to see someone paying that level of attention to anomalies these days, with how thick tech stacks have gotten...
we were very, very lucky to have you on the case!
Thanks for your work on this. Everyone should follow your example.

One more aspect that I think emphasizes the number of coincidences that had to come together to find this:

I run a number of "buildfarm" instances for automatic testing of postgres, among them one using valgrind. For some other test instance I had used -fno-omit-frame-pointer, for some reason I do not remember. A year or so ago I moved all the test instances to a common base configuration, instead of duplicated configurations, and chose to make all of them use -fno-omit-frame-pointer.

Afaict valgrind would not have complained about the payload without -fno-omit-frame-pointer. It was because _get_cpuid() expected the stack frame to look a certain way.

Additionally, I chose to use debian unstable to find possible portability problems earlier. Without that, valgrind would have had nothing to complain about.

Without having seen the odd complaints in valgrind, I don't think I would have looked deeply enough when seeing the high CPU usage in sshd below _get_cpuid().

There are more coincidences that are even less interesting. But even the above should make it clear how unlikely it was that I found this thing.
Just to be clear: I didn't mean that I didn't do good - I did. I mean that we got unreasonably lucky here, and that we can't just bank on that going forward.
This is incredible work, thank you so much.
@CyrilBrulebois I concur; we are all in your debt for spotting it so early.
Major Cliff Stoll / Cuckoo's Egg vibes here. A minor system-usage matter ends up uncovering much more!

@bikewazowski omg yes 100% very much this.

Looking forward to the book and the six-part Netflix miniseries, Andres.

Get a literary agent!

@BD
thank you for your diligence.
congrats and thank you for the investigation - IMO this is going to go down as the vuln of the decade. What a find.
@dgilman Unfortunately I suspect we'll see a lot more such attacks going forward, in all likelihood with more success in some cases.

This is insane. I expect full-fledged articles out soon, but another interesting bit in https://news.ycombinator.com/item?id=39866275 :

"the apparent author of the backdoor was in communication with me over several weeks trying to get xz 5.6.x added to Fedora 40 & 41 because of it's "great new features""

This is CVE-2024-3094 for easier tracking.

#JiaT75 #CVE20243094


@dgilman
From the same thread:

"Fascinating. Just yesterday the author added a `SECURITY.md` file to the `xz-java` project.

> If you discover a security vulnerability in this project please report it privately. *Do not disclose it as a public issue.* This gives us time to work with you to fix the issue before public exposure, reducing the chance that the exploit will be used before a patch is released."

@richlv @dgilman It does not affect FreeBSD at all. Simply because this person was targeting a specific OS.
Congrats on going viral (ha!) and breaking open the biggest security deal in a *long* time
Big props, you benchmarked your way to averting a major cybersecurity catastrophe

@malwaretech saw a reddit comment to the effect of: don't cause perf issues for database people, they will wreck you.

Nice job Andres!

thank you, I see the paragraph that begins:

"To reproduce outside of systemd, …"

I have someone claiming:

"The exploit requires systemd. …"

Can both be true?

Postscript: thanks to @vi for helping me to realise my misunderstanding.

Apologies for the noise.


@grahamperrin the exploit only works if sshd is patched to depend on libsystemd as that's what pulls in the compromised liblzma.

You don't have to *run* sshd via systemd to be compromised, it can be started manually because the binary is still linked with the malicious library.
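A quick way to check a binary on your own system (Linux, assuming ldd is available; the sshd path below is just an example):

```python
# Does a binary pull in liblzma, directly or via libsystemd? Assumes a
# Linux system with ldd; /usr/sbin/sshd is an example path.
import subprocess

def links_liblzma(binary: str) -> bool:
    out = subprocess.run(["ldd", binary], capture_output=True, text=True)
    return "liblzma" in out.stdout

print(links_liblzma("/usr/sbin/sshd"))       # True => liblzma is on the link path
```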

@AndresFreundTec

@vi @grahamperrin and OmniOS links with liblzma (via libxml2)