MySQL at Uber

99 points by 0xFA11 5 months ago

hamish-b 5 months ago

Immediately something felt off when I saw the AI generated cover photo. Hard to describe, but I immediately profile a page online when I see clear AI slop. I wouldn't want to taint something that I had taken the time to write with something so... low effort.

noname120 5 months ago

To be fair the quality of this article is abysmal. Not sure that a lot of time and effort went into this
gregors 5 months ago

AI imagery is pure disgusting garbage. It's 100% worse than no image at all. Anyone thinking of using AI imagery .....don't.

vvladymyrov 5 months ago

More posts about mysql from Uber - it is interesting to compare state of Mysql in Uber with 2016.

* Why Uber Engineering Switched from Postgres to MySQL (2016) https://news.ycombinator.com/item?id=26283348

* Upgrading Uber's MySQL Fleet https://news.ycombinator.com/item?id=41836748

jppope 5 months ago

The comments are pretty interesting. There was a discussion about how different they were trying to make the public image about their tech look compared to their actual usage - supposedly to entice more engineers to join.
I think its interesting to attach any company prominently to a database technology since theoretically there would be varied use cases across an org like uber which would likely want different technologies depending on those use cases. Of course they might just have 50 other articles like this for all the other tech they use.
- ilrwbwrkhv 5 months ago
  
  Yup Netflix is similar. Your tech is not that hard and the good engineers can see through the BS.
  - ffsm8 5 months ago
    
    Streaming video at Netflix's scale in the year 2010 was hard though?
    I mean sure, it's a lot easier nowadays - but that's mostly down to cloud providers replacing half of the server-side challenge with a big bill and fiber making the last mile easier too.
    You're massively underestimating the challenge of transferring these vast amounts of data without interrupting service for buffing etc that they had to solve back then
    
    ilrwbwrkhv 5 months ago
    
    Pornhub solved it much earlier with PHP
    
    merb 5 months ago
    
    tbf it’s easier if you only need to serve the first 5 minutes of a video
    
    canucker2016 5 months ago
    
    According to Pornhub's 2024 Insights blog post, (nope, not gonna provide an URL, use a search engine :))"
    "...making the average engagement time per visit 9 minutes and 40 seconds"
    Reading about the variance and what demographic groups visited the site for shorter or longer amounts of time will keep you amused.
    
    merb 5 months ago
    
    ah I know the blog post ;) I just thought it was ~5 minutes, but its ~10 minutes ;) still way easier to do that than netflix ;)
ggregoire 5 months ago

Another interesting one about their Storage Platform (which includes MySQL): https://www.uber.com/blog/odin-stateful-platform
> The [Odin] platform supports 23 technologies, ranging from traditional online databases such as MySQL® and Cassandra® to advanced data platform technologies, including HDFS™, Presto™, and Kafka®.

bratao 5 months ago

Every six months, I explore switching from MySQL to something new for a more modern tech stack. However, MyRocks (https://docs.percona.com/percona-server/8.4/myrocks-index.ht...) is truly impressive. It allows me to efficiently compress my text-rich rows.

chasd00 5 months ago

Read the parent post to this one, AI slop has an uncanny valley associated with it. Somehow it sticks out like a sore thumb but i can't put my finger on why.
- willvarfar 5 months ago
  
  I don't know, I didn't spot that this was AI generated. Perhaps because there's some truth that MyRocks is actually really good at compression.
  Back when I was exploring migrating from TokuDB to MyRocks the only problem with it was that it didn't have a file per partition, meaning if you were doing retention you couldn't just drop old daily partitions cheaply.
- scarface_74 5 months ago
  
  @bratao may not be a native English speaker.
  - dllthomas 5 months ago
    
    Indeed, per their profile they're in Brazil, which is not a guarantee of that but does make it likely.
- roncesvalles 5 months ago
  
  Didn't ring any alarm bells for me. I suspect you're not the target audience of this post.

andrewmcwatters 5 months ago

I’m amazed how I’ve worked with all of the critical technologies in this article, I’ve dealt with the same or similar concerns, and yet the author or authors have written this engineering blog post in such a way as to not convey anything meaningful at all.

One part of it is the constant talk of high level abstract infrastructural pieces, and the other is bad product or concept naming.

Odin, “the controller,” the constant obsession with certain engineering orgs to use words like “plane,” and likewise, “fabric” was used at a previous org I worked for.

I’m sure Uber is doing Real™ Work, but this kind of crap sets off all my wank and bullshit alarms.

It’s just clients talking to servers talking to servers talking to proxies talking to servers talking to databases talking to replicas. Can you please stop with the false high engineering bullshit?

arccy 5 months ago

the plane separation is important though, you want to ensure your control messages always go through, like how servers have a dedicated line for management.

noname120 5 months ago

Mirror with working images: https://web.archive.org/web/20250130201835/https://www.uber....

tofukant 5 months ago

This is ai written garbage

scarface_74 5 months ago

I ran this through ChatGPT and it found a few grammatical errors - that I agree with - and awkward wording. Usually AI generated text doesn’t have these types of errors.
It reads like a standard corporate blog post. You can find plenty of those on AWS blogs that were written before LLMs were publicly available.
layer8 5 months ago

Your AI detector seems to be in need of recalibration.
roncesvalles 5 months ago

I think you feel that way because it's written more in a whitepaper tone instead of a blog article tone. If you're not deeply mired in distributed relational database systems, you likely won't get anything out of it.
It's not written like a "fable" as most technical blog articles that do well on HN tend to be. There is no great philosophical life-insight that the authors stumbled upon while doing this mundane technical thing.
noname120 5 months ago

I ran the text through several AI detectors and they all returned 0% AI 100% human.
The cover image is however:
> Cover Photo Attribution: The cover photo was generated using OpenAI ChatGPT Enterprise.
arccy 5 months ago

this comment is ai written garbage

kazinator 5 months ago

When people say MySQL nowadays, do they really mean MySQL, or MariaDB?

paulryanrogers 5 months ago

IME MySQL. Folks using MariaDB aren't shy about saying so.
- kazinator 5 months ago
  
  In shops that have been on MySQL for thirty years, people probably continue to say "MySQL" even though it's MariaDB. They have things named after MySQL, like host names, configuration files, lines in configuration files, discussion channels and mailing lists, and whatever else.
- krick 5 months ago
  
  I wouldn't be so sure. I always was just saying "MySQL" almost exclusively, even though it actually wasn't MySQL for like 10 years, I guess.
  - evanelias 5 months ago
    
    Uber uses MySQL, not MariaDB. They literally have a recent blog post about their upgrade from MySQL 5.7 to MySQL 8.0.

4ndrewl 5 months ago

Page is 404-ing now.

aeyes 5 months ago

So if these clusters are using binlog replication, do they just ignore the possibility of lost writes and inconsistent data after a failover?

bigmutant 5 months ago

That all depends on the setup. The "standard" setup (not specific to MySQL) is:
- Single Write Leader per partition
- Backup Write Leader that is setup with synchronous replication (so WL -> WLB and waits for commit)
- Read Followers all connected asynchronously using either binlog replication (not recommended anymore) or GTID-based row replication (recommended)
In the above scenario, the odds of loss are pretty small since the Write Leader has a direct backup, and any of the Read Followers can be promoted to a Write Leader/Backup. DDIA calls the above semi-synchronous replication, although MySQL now supports a similar-but-slightly different version out of the box: https://dev.mysql.com/doc/refman/8.4/en/replication-semisync...
- evanelias 5 months ago
  
  Terminology-wise, that's not quite right. "Binlog replication" is the general term for all built-in MySQL logical replication, including all formats (statement, mixed, row); positioning via coordinates or via GTID; async or semi-sync or group replication.
  Among AWS users, "binlog replication" is often contrasted against Aurora's system, which uses physical replication instead of logical.
  When using binlog replication, you are correct that GTID positioning and row-based replication are strongly recommended and widely used.
  Regarding semi-sync replication: that's been a MySQL feature for over a decade now, and there are some indications from Oracle that it may become deprecated in the future. (which is surprising, since many large MySQL users do leverage it to ensure writes cannot be lost. But it seems Group Replication is promoted more by Oracle.)
  MySQL semi-sync doesn't necessarily involve the setup you've described as "standard" above. In my experience it's more common to see 2 replicas doing semi-sync ack'ing, not 1. And sometimes these are just binlog servers rather than full MySQL replicas; that's the setup Facebook adopted in 2015, although more recently they've moved to a home-grown Raft-based replication system.
roncesvalles 5 months ago

Yes. This is in fact a CAP theorem tradeoff. Binlog asynchronous replication means they've elected to be AP. If you want zero risk of data loss for committed transactions (for single node failure), you'd replicate synchronously to one other replica. But then, both the primary and replica would need to be available and connected for the leader to accept write transactions, making it CP.
matthewaveryusa 5 months ago

Correct, you would need to use something like galera clustering or synchronous replication to not lose writes.

ksec 5 months ago

I wonder if Uber has ever explored using Vitess. And they seems to be very slow in updating MySQL if they only came to 8.0 when latest is on 9.2 already.

noname120 5 months ago

Is it just me or all the diagram images are broken? It looks like the lazy-loading is not working. I tried on archive.org and archive.is and same issue: the images didn't load.

Edit: yeah those images have their src attribute in the form of “lh7-rt.googleusercontent.com/docsz/[VERY LONG SLUGS]” and a quick look at the dev console shows that Google returns “429 Too Many Requests” for all of them.

I guess they just copy-pasted the URLs from some Google Doc and tried to hotlink them here? Interestingly if you access these directly (instead of embedded into Uber's blog) then they display fine and Google doesn't complain with a 429.

jakub_g 5 months ago

Looks like images are broken in a lot of posts on this blog
zlagen 5 months ago

it may be that google is checking the referer header.
mousetree 5 months ago

Broken for me too.

kingnothing 5 months ago

No mention of Schemaless? Was that retired?

rabbbit 5 months ago

Nah, they're both going on strong.
The new version is called Docstore though - https://www.uber.com/en-PE/blog/how-uber-serves-over-40-mill...

hipadev23 5 months ago

Uber you guys need to delete this, the quality of this post is absolutely embarrassing.

noname120 5 months ago

Too late I already archive.org'd it lol: https://web.archive.org/web/20250130201835/https://www.uber....

lelandfe 5 months ago

Looks like Uber's ditched their expensive designers for AI on their corporate blog's lead images.

PaulHoule 5 months ago

It reads like AI slop too. Not so sure why it is getting voted up.
- heyoni 5 months ago
  
  People see uber tech blog and jump straight to upvoting.
  - PaulHoule 5 months ago
    
    I remember when that blog used to be top notch.
    Today it just seems odd that anybody is still using MySQL. Postgres? Sure. SQLlite? Hell yeah! DuckDB? Of course. MySQL? Not so much.
    
    bigmutant 5 months ago
    
    Absolutely not true in my experience. MySQL has its share of issues (all DBs do) but it is rock-solid when using the correct engine (InnoDB for most cases, RocksDB for high-throughput writes, Memory for caching). MySQL is very hard to beat for very high-volume OLTP workloads, both reads and writes. Its replication systems were years ahead of other systems (SQL Server, Postgres, SQLite doesn't have replication). DuckDB AFAIK is OLAP and they don't compete in the same space. Every DB system has "the things its good at" and MySQL really shines at very high-volume OLTP spread across partitions.
    
    kyawzazaw 5 months ago
    
    we are almost all in MySQL. we are also old and 500b

calmbonsai 5 months ago

This is an embarrassment and just one more example of how far this enterprise has fallen.

Even as a consumer, I've stopped using Uber due to, of all things "biz 101", incompetent billing.

SaintSeiya 5 months ago

Disgusting company: Uber takes 70% cut from rider's fare, drivers made only 30% on top of that Uber place those stolen earnings in the driver's tax report, not on them.

kazinator 5 months ago

That sounds extremely improbable.
You simply cannot take 70% from the revenue coming from an employee-like agent, yet report that to the government as being that person's income. Not only would that be blatant fraud, but depending on the exact percentages and absolute amount, the employee might have to give all their pay to the government to covert the tax, and even owe some more after that.
In what country are they doing this?
- rileymat2 5 months ago
  
  It has been years ago now, but when I did it this was partly true. Yes it showed up as income, but it also showed up as deductible expense.
  - kazinator 5 months ago
    
    Ah okay. What did deductible mean in that jurisdiction? Was it a straight write off, or a tax credit?
    Where I am in Canada, tax deductions are shitty because they come off the bottom of your income, so to speak. That is to say, the eligible expenses are added up and the lowest tax bracket rate is applied to them. You then get that as a tax credit. That's shitty because you paid for those expenses with your post-tax dollars at your full tax rate, but the credit is for a low tax rate.
    Whereas a write-off comes off your income pre-tax, and so the benefit effectively operates at the marginal tax rate, and can knock you into a lower bracket.
    
    rileymat2 5 months ago
    
    USA, in the USA deductions come off the top like your writeoffs.
throwawaytodey 5 months ago

[dead]

feverzsj 5 months ago

Even the authors listed at end looks like AI generated.

wordofx 5 months ago

Uber is not a company worth following for technology. It’s like a bunch of juniors using tech wrong and changing tech as a solution to a non existent problem.

astrange 5 months ago

I always got the impression their tech stack was designed to be as complicated as possible so they could hire juniors who thought it looked cool.
davidczech 5 months ago

They used to produce some neat stuff. Jaeger and Cadence were quite cool.