MapReduce: Simplified data processing on large clusters
Read Here
TAO: Facebook’s distributed data store for the social graph
Read Here
Google File System: A scalable distributed file system
Read Here
BigTable: A distributed storage system for structured data
Read Here
Google Spanner: Google’s globally distributed database
Read Here
Amazon Dynamo: Amazon’s highly available key-value store
Read Here
Kafka: A distributed messaging system for log processing
Read Here
Chubby: The lock service for loosely coupled distributed systems
Read Here
CAP Theorem: Perspectives on the CAP theorem
Read Here
Raft: An understandable consensus algorithm
Read Here
The Part-Time Parliament: Paxos consensus algorithm
Read Here
Scaling Memcache at Facebook: Distributed cache architecture
Read Here
Out of the Tar Pit: Functional programming to keep it simple
Read Here
The Next 700 Programming Languages: A universal framework for language design
Read Here
TLA+: Modeling concurrent and distributed systems
Read Here
Log Structured Merge Tree: Data structure for efficiently storing key-value pairs
Read Here
Consistent Hashing: Distributed caching protocols for relieving hot spots
Read Here
ZooKeeper: Distributed coordination service
Read Here
Reactive Manifesto: Principles for reactive systems
Read Here
F1: Google’s scalable RDBMS for AdWords
Read Here
Apache Cassandra: A decentralized structured storage system
Read Here
If you’re interested in exploring or contributing to open-source implementations inspired by these papers, check out:
GitHub Repository
This repository hosts collections of research papers and implementations across various topics.