Mike Doherty (u/Kaitaan)Senior Engineer, Data Engineering
This post is the first installment of a three-part mini-series on data architecture at Reddit, starting with Reddit’s data “origin story” and finishing with our present-day practices.
Last year, we shared a few blog posts explaining some of the interesting technical challenges we face here at Reddit. From fixing search (again), to how we route requests to different stacks, to (one of my favourites) counting the number of people who’ve viewed a piece of content, among several others, we’ve covered a very small part of what it is our technical teams do here. One area we haven’t covered much of yet is our data systems.
There’s always room for improvement in our systems, whether it’s adding more functionality, scaling to handle more throughput, or throwing everything out the window and starting from scratch. I’ve been here for all of it. And as Reddit’s very first dedicated data engineering hire,
Original URL: http://feedproxy.google.com/~r/feedsapi/BwPx/~3/oWgxmqEJiUE/