144 points | by bddicken5 days ago
Backblaze writes about these data sets, and includes some more conventional graphs. For example:
https://www.backblaze.com/blog/backblaze-drive-stats-for-202...
I wanted to do something more fun!
And I'd want to see failed drives somehow organized by TimeInService and maybe origin...
We of course expect their drive usage to grow, but what would be surprising (& provide more info) is how the drives fail or age-out. None of us without huge data centers can get that kind of info
Still fun to watch as it is, though.
There's also a few places where there's duplicate labels (e.g. Hitachi 3TB)
Would be great to group by manufacturer somehow (e.g. color) and make the size more prominent.
Very cool visualization regardless.
(It loaded about 0.3MB/s for me)
I hope you had a better time with ingesting the data than I did :)
One of the things LLMS are really good at is writing scripts for processing and pairing down data. I wanna do a blog post talking about how did some of this, maybe coming up!
Interesting visualization though.
But it sure is fun to look at. I enjoyed it ;-)
Not sure why it would intermittently redraw the whole scene though.. could be a Chrome thing.
And I didn't understand what all those dots falling down from above mean, but that's another story. If they wanted to show how many drives there are if each model they could have made the circles larger.
I hope we reach parity. Right now prices have gone up since 2023, and flash is about 3x as expensive as hard drives.