pic

转自:dancers

A Distributed System Reading List

Introduction

I often argue that the toughest thing about distributed systems is changing the way you think. The below is a collection of material I've found useful for motivating these changes.

Thought Provokers

Ramblings that make you think about the way you design. Not everything can be solved with big servers, databases and transactions.

Amazon

Somewhat about the technology but more interesting is the culture and organization they've created to work with it.

Google

Current "rocket science" in distributed systems.

eBay

Interesting they dumped most of J2EE and use a lot of db partitioning. Check out their site upgrade tool as well.

Consistency Models

Key to building systems that suit their environments is finding the right tradeoff between consistency and availability.

Theory

Papers that describe various important elements of distributed systems design.

Languages and Tools

Issues of distributed systems construction with specific technologies.

Infrastructure

Storage

Paxos Consensus

Understanding this algorithm is the challenge. I would suggest reading "Paxos Made Simple" before the other papers and again afterward.

Other Consensus Papers

Gossip Protocols (Epidemic Behaviours)

P2P

  • Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications
  • Kademlia: A Peer-to-peer Information System Based on the XOR Metric
  • Pastry: Scalable, decentralized object location and routing for large-scale peer-to-peer systems
  • PAST: A large-scale, persistent peer-to-peer storage utility - storage system atop Pastry
  • SCRIBE: A large-scale and decentralised application-level multicast infrastructure - wide area messaging atop Pastry