Category System Design

System Design

Design a “Likes” Counter for Social Media: Discusses designing a scalable likes counting system

In this case study, we examine the design of a scalable system for managing and counting “likes” on a social media platform. Such a system must handle high volumes of user interactions efficiently, ensuring reliability, low latency, and scalability to…

Uma Mahesh
12/11/2025

System Design

System Design Case Study: Designing a Scalable Notification Service

1. Overview A scalable notification service is a critical backend component for modern applications, responsible for delivering timely, personalized messages to users across multiple channels (push notifications, email, SMS, in-app, WhatsApp, etc.). This system powers user engagement in products like…

Uma Mahesh
12/07/2025

System Design

System Design Case Study: Designing a Distributed Job Scheduler

1. Overview A distributed job scheduler is a system that reliably executes tasks (jobs) at specified times or intervals across a cluster of machines. It powers background processing in virtually every large-scale application: cron jobs, ETL pipelines, report generation, cache…

Uma Mahesh
12/04/2025

System Design

System Design Case Study: Designing a Distributed Rate Limiter

1. Overview A distributed rate limiter is a critical infrastructure component that enforces per-entity request quotas across a fleet of stateless application servers (e.g., microservices behind a load balancer). In a single-node system, rate limiting is trivial (in-memory counter), but…

Uma Mahesh
11/30/2025

System Design

System Design Case Study: Designing a Distributed Key-Value Store (Inspired by Amazon DynamoDB)

1. Overview This case study presents a comprehensive design for a fully distributed, highly available, and linearly scalable key-value store, directly modeled after Amazon’s Dynamo technology (the foundational system behind DynamoDB, Riak, Cassandra, and Voldemort). The core philosophy is radical…

Uma Mahesh
11/27/2025

System Design

System Design Case Study: Designing a Distributed Web Crawler

1. Overview This case study outlines a scalable, distributed web crawler architecture capable of processing billions of pages daily. It incorporates fault tolerance, politeness policies, and deduplication. The implementation leverages .NET 8 for high performance, with components for orchestration, fetching,…

Uma Mahesh
11/23/2025

System Design

System Design Case Study: Designing a Scalable URL Shortener (like bit.ly)

1. Functional Requirements 2. Non-Functional Requirements 3. Estimation (Back-of-the-envelope) 4. High-Level Architecture Key Insight: 99%+ of traffic is GET /:shortCode → 301 redirect → Optimize the redirect path aggressively. 5. Core Design Decisions 5.1 Short Code Generation Strategy Option Chosen:…

Uma Mahesh
11/21/2025

Architecture, System Design

Scaling MySQL to Serve Billions: The Vitess Architecture at YouTube

Abstract In the realm of distributed systems, achieving horizontal scalability while maintaining the reliability of a battle-tested relational database like MySQL presents formidable challenges. YouTube, the world’s premier video-sharing platform, exemplifies this triumph through its development of Vitess—an open-source database…

Uma Mahesh
11/16/2025

System Design

Enhancing Reliability in E-Commerce Transaction Processing: A Comprehensive Overview

In the domain of e-commerce transaction processing, a reliable workload is characterized by its ability to consistently fulfill predefined reliability objectives, such as ensuring seamless order placement, payment authorization, and inventory updates without interruption. This reliability is achieved through proactive…

Uma Mahesh
11/13/2025