Big Data Currents: Scalable Distributed Systems

In recent years couple factors have increasingly become important in design of distributed systems i.e., Scalability & Reliability of the system. Over time I picked few things related to these factors. This post series is an attempt to share my modest knowledge on scalability aspects.

What is Scalability?
Simply put it is ability of the system to handle increasing load whether it is addition of users or resources or both. Now typically the scale of a system has 3 dimensions -

The quantity dimension i.e., number of users, resources, objects etc that are part of the system
The distribution dimension i.e., geographical distribution of servers, services, data etc.
The administrative dimension i.e., the number of organizations, multi-tenancy etc

These dimensions in turn affect a whole host of components that are needed for a distributed system.

Building a scalable system does not happen by accident. Similarly a distributed system is not automatically a scalable system. So it is important to consider the effects of scale in these dimensions early on.

Effects of Scale

Now the components that typically get affected by scale in the above 3 dimensions are -

Naming
Service Registries and Service Discovery
Data at Rest - Management, Storage & Distribution
Data in Motion - Caching & Cache management
Security - Authentication & Authorization
Administration - Deployment & Configuration Management @ Scale
Communication - Group communication
Heterogeneity - Interfaces, Languages & Protocols
User View - Data Organization, Summaries, Visualization @ Scale
Reliability - Availability, Performance, Faults & Fault Tolerance

Techniques

Now the solutions to above typically involve a common set of techniques like -

Replication (i.e., Services Replication, Data Replication)
Partitioning (i.e., Services Partitioning, Data Partitioning)
Distribution (i.e., Geo Distribution of Services and Data)
Caching (i.e., Cache placement & consistency etc)
Messaging i.e., decouple in space, time & synchronization
Data Organization, Summarization & Visualization
Automation for Deployment & Config Management
Clocks, Consensus, Coordination and Concurrency Control etc primitives
CAP theorem & Various tradeoffs

Wrap up...

In subsequent posts I plan to cover in more detail each of the above components and techniques. I think a good understanding of these key concepts will be helpful for anyone working on distributed and scalable systems. Feel free to let me know your thoughts in the comments section.

Big Data Currents

Pages

Jan 6, 2016

Scalable Distributed Systems - Introduction

About

Blog Archive

Popular Posts