Low Level Design GitHub Materials
Sanjeevi Subramani
Posted on July 5, 2022
๐) ๐๐๐ ๐๐๐
Low level Design / Machine Coding Question Collections
What is Machine Coding Round ?
Machine Coding Round has become very popular interview round in most Product and startup companies like Flipkart, Cleartrip, navi and Udaan,the way the round is structured is the candidate will be provided with a Problem statement and the minimum set of features which needs to implemented within the 90 min window period, we as a candidate need to identity the data models, the service interfaces which will be consumed by the clients and other set of functions and classes, we should be able to present the same to the panel with are design choices / design patterns being made / used and the complete working code.
What is the purpose behind this LLD repo ?
During my recent Preparation, found out that there was no single resources where we get a collections of all the questions and the corresponding Answers, so currently updating the repo with all the materials I found out from the web and the Questions which has been asked during my interview. feel free to contribute any questions you have got during your recent interview.There other useful YT channel and Git repo which are attached in the resources section.
What is target Audience, do all companies ask ?
If you are currently targeting/looking out for any SDE - I,II opportunities. It totally depends on the company you interviewing with, you can check with the HR if they evaluate via machine coding round, but certain companies do ask a object oriented design round where they focus only on one particular feature and DB design as well, so its good get practiced with few problems beforehand.
Below are the list of LLD problem which are asked during my machine coding rounds from tech companies and the resources I collected.
Note : page is under construction, you can click on the individual packages to view all the questions and solutions.
| Low Level Design Question
| Question Link to PDF / Drive
| Company Asked
| Credits to author
|
| Cricket Match Dashboard | Leetcode | Udaan Assignment | |
| Event Calendar | PDF link | Flipkart LLD Question | |
| FoodKart or food ordering System | Doc link | Flipkart LLD Question | |
| Stock Exchange | Problem Link | Navi LLD Question | |
| PropertyHunt or property Listing site | Question image | ClearTrip LLD Question | |
| ledger company | Question Link | Navi LLD Question | Github code |
| Leetcode Like platform LLD | PDF link | Flipkart LLD Question | Github code |
Frequently asked Problems
| Low Level Design Question | Question Link to PDF / Drive | Company Asked | Credits to author |
| Ride Sharing like App | Problem Link | Frequently asked in all companies |
Contribute
Contributions welcome!
If you found this resource helpful, give it a ๐ otherwise contribute to it and give it a .
https://github.com/kumaransg/LLD
๐) ๐๐๐-๐๐๐๐๐ ๐๐๐๐๐๐ ๐๐๐๐๐๐
low-level-design-primer
Motivation
Learn low level design of system at scale. prepare for the low level design (LLD) / Machine Coding round interviews.
Learn to design low level system
Learning low level design of scalable systems will help you become better engineer.
This repo is an organized collection of resources to help you learn low level design of systesm's.
Resources
- Low level Design Questions
- Low level Design Solutions
Low Level Design Resources #TODO
Add new questions
Improve new questions
add new solutions
Fix errors
Improve sections
Add new sections
Under development
Interested in adding a section or helping complete one in-progress? Contribute!
- how to guide and study material along with various resources.
- add more questions and improve exisiting questions.
- add solutions for the problems along with their detailed explaination (maybe video)
Credits
Credits and sources are provided throughout this repo.
Special thanks to:
- Donne Martin
- System Design Primer
- Cracking the Coding Interview
- Leetcode
- GeeksforGeeks
- G Balasubramanian
- Udit Agarwal ## Contact Info Feel free to contact me to discuss any issues, questions, or comments.
My contact info can be found on my Github Page
License
- TODO
https://github.com/prasadgujar/low-level-design-primer
๐)๐๐ฐ๐๐ฌ๐จ๐ฆ๐ ๐๐จ๐๐ญ๐ฐ๐๐ซ๐ ๐๐ง๐ ๐ข๐ง๐๐๐ซ๐ข๐ง๐ ๐๐ง๐ญ๐๐ซ๐ฏ๐ข๐๐ฐ
Awesome Software Engineering Interview
Motivation
This repository aims at providing collated content at a single place to prepare for Software Engineering interviews.
Content
- Coding/DS, Algo
- Top DSA interview questions - Companywise
- Object Oriented Design/Low Level Design
- System Design/High Level Design
- CS Fundamentals
- Behavioral
Under development
- Low level design questions - Company Wise
- Top interview questions - Company Wise
- How to ace Behavioural interview rounds
- Mock interview platforms
- Resume building
Contributing
Feel free to submit pull requests to help with:
- Add new questions/topics
- Improve new questions/topics
- Fix errors
- Improve sections
- Add new sections
Contact info
Feel free to contact me to discuss any issues, questions, or comments.
License
I am providing code and resources in this repository to you under an open source license. Because this is my personal repository, the license you receive to my code and resources is from me and not my employer (Microsoft).
Copyright 2017 Karan Garg
Creative Commons Attribution 4.0 International License (CC BY 4.0)
http://creativecommons.org/licenses/by/4.0/
https://github.com/imkgarg/Awesome-Software-Engineering-Interview
๐)๐๐๐๐ก๐ ๐๐จ๐ฐ ๐๐๐ฏ๐๐ฅ ๐๐ฒ๐ฌ๐ญ๐๐ฆ ๐๐๐ฌ๐ข๐ ๐ง
Cache - Low level system design
Repository for low level system design of a cache
Video explanation
Problem statement
https://github.com/anomaly2104/cache-low-level-system-design
๐)๐๐ก๐๐ฌ๐ฌ ๐๐จ๐ฐ-๐๐๐ฏ๐๐ฅ ๐๐ฒ๐ฌ๐ญ๐๐ฆ ๐๐๐ฌ๐ข๐ ๐ง
Chess - Low level system design
Video Explanation
Problem Statements
Further enhancements
- Implement checkmate feature.
- Write more unit tests.
- Support special move of pawn where it can go diagonal when it kills.
- At many places, we are evaluating conditions like:
- OR Operation : We are allowed to do something if any condition out of given conditions fulfill.
- And Operation : We are allowed to do something if all conditions fulfill. Try to improve the design for this.
- Add history of moves for each player.
- Add support for casteling move.
- Can we remove putting currentCell in Piece? How about introducing something like position?
- A piece will have a position and you can always get the cell back from board using this position.
https://github.com/anomaly2104/chess-low-level-system-design
๐)๐๐ก๐ ๐๐ฒ๐ฌ๐ญ๐๐ฆ ๐๐๐ฌ๐ข๐ ๐ง
System Design:
Introduction:
What is system?
- A system is a loosely used term for an architecture or collection of software or technology that communicate with each other or interact with each other in order to serve a certain set of users with a certain set of requirements.
- A system can be defined and built keeping these three factors in mind:
- The user of the systems
- The requirements of those users, and
- The components that are chosen in order to build that system to serve those users and their requirements.
What is design?
- Design is a process of understanding the user requirements and selecting the components, modules, and software technologies how they are going to be intertwined and communicating with each other to actually serve the need of the system.
Components of System Design:
- Components are the basic building blocks of system components.
- It could be divided into two parts:
- Logical entities:
- Data
- Databases
- Applications
- Cache
- Message Queues
- Infra
- Communication Protocol
- Requests (API, RPC, etc.)
- Tangible entities:
- Text, images, videos..
- MongoDB, MySQL, Cassandra..
- Java, Golang, Python, React..
- Redis, MemeCache..
- Kafka, RabbitMQ..
- AWS, GCP, Azure..
- APIs, RPCs, Messages..
Client Server Architecture:
- Client-server architecture is a computing model in which the server hosts, delivers and manages most ofthe resources and services to be consumed by the client.
- Client: a piece ofsoftware or application that takes the input and sends request to the servers.
- Server: a piece ofsoftware that receives and processes requestsfrom clients.
- Load balancer: responsible for distributing incoming network traffic across a group of backend servers to optimize resource usage.
A typical topological data flow goes as follows:
- Client requests data from server
- Load balancer routes the request to the appropriate server
- Server processes the request client
- Server queries appropriate database for some data
- Database returns the queried data back to the server
- The server processes the data and sends the data back to the client
- This process repeats
- #### Types of Architecture:
- Thin client:
- A thin client is designed to be especially small so that the bulk of the data processing occurs on the server.
- Example: Ecommerce, streaming applications.
- Thick client:
- A thick client (fat client) is one that will perform the bulk of the processing in client/server applications. With thick clients, there is no need for continuous server communications as it is mainly communicating archival storage information to the server.
- Example: Gaming apps, video editing software.
Tier based architecture:
- 1-Tier:
- It is the simplest one as it is equivalent to running the application on the personal computer.
- 2-Tier:
- It is like Client-Server architecture, where communication takes place between client and server.
- 3-Tier:
- The 3-tier architecture has three different layers.
- Presentation layer
- Business Logic layer
- Database layer
- The 3-tier architecture has three different layers.
- N-Tier:
- An N-Tier Application program is one that is distributed among three or more separate computers in a distributed network.
Proxies:
- Proxy is an intermediary server between client and the internet.
- Proxy servers allow to hide, conceal and make your network id anonymous by hiding your IP address.
- Proxy servers offers the following basic functionalities:
- Firewall and network data filtering.
- Network connection sharing
- Data caching
-
Types of proxy:
- Forward Proxy
- In this the client requests its internal network server to forward to the internet.
- Reverse Proxy
- In this the requests are forwarded to one or more proxy servers and the response from the proxy server is retrieved as if it came directly from the original server.
Data and Data Flow:
-
Different formats of data (representation):
- In Business Layer:
- Texts, Videos, Images, etc.
- In Application Layer:
- JSON/XML
- In Data Stores:
- Databases, Tables, Indexes, Cache, Queues, etc.
- Network Layer:
- Packets.
- Hardware Layer:
- 0s and 1s.
Data Generation:
- Users
- Internal Data (System populates on their own).
- Insights
Data Flow Methods:
- APIs.
- Messages.
- Events.
Factors to be considered:
- Type of data
- Volume
- Consumption/Retrieval
- Security
Types of System (Examples):
- Authorization System
- Streaming System
- Transactional System
- Heavy Compute System
Databases:
Types of databases:
- If we consider data as people, in terms of buildings, then the way those buildings house people can be said as databases.
- Some common types of databases are:
- Relational
- Non-relational
- File type
- Network, etc.
Anatomy of applications and services:
- Applications or services performs certain tasks, and at different layers they have different responsibility.
- Tech stack:
- All the codes in applications are written in some languages using some frameworks.
- Any application can be written solely with the use of language. But frameworks do most of the bootstrapping so we can use this feature to make an application on the top of the framework.
- Responsibilities:
- Client app:
- Render UI.
- Handle interactions.
- Collect data.
- Communicate with backend (API) to fetch and store data.
- Render static data/informations.
- Backend app:
- Expose API endpoints.
- House business logics.
- Handle data modelling/transformation.
- Interact with data stores.
- Interact with other services.
- Elements/factors of application development:
- Feature requirements.
- Layer.
- Tech Stack.
- Code structure/design pattern.
- Data store interactions.
- Performance/cost.
- Deployment.
- Monitoring.
- Operational excellence/reliability.
Application Programming Interface (API):
- An API is a set of defined rules that explain how computers or applications communicate with one another.
- Advantages:
- Communication.
- Abstraction.
- Platform agnostic.
- Examples:
- Private APIs: The hidden APIs. Not accessible to everyone.
- Public APIs: Available to public. (Ex: Google maps api)
- Web APIs: Superset of public and private APIs.
- SDK/Library APIs
- Factors to consider:
- API contracts
- Documentation
- Data format
- Security
- Standards:
- REST:
- Stands for REpresentational State Transfer.
- Guidelines:
- Client Server.
- Cacheable.
- Layered.
- Stateless.
- Uniform Interface.
- Code on demand.
- RPC
- SOAP
Caching:
- A hardware or software component which helps in serving the data which is either frequently requested or it is expensive to compute on, so cache stores the computed response and saves the cost of computing.
- Cache hit: If a response for a request is available in cache memory, it is called a cache hit.
- Cache miss: If a response for a request is not available in cache memory, it is called a cache miss.
- Invalidation and eviction:
- Invalidation:
- The data that is kept in cache is not there for forever, it is volatile.
- The data is going to change at some point of time, hence we need to update the cache as well.
- The process of updating the data in cache by replacing the old value with new value is called data invalidation.
- Methods to invalidate:
- Cache expiry (TTL: Time to live).
- Remove the cache. (When a new request come, cache miss will happen, and data will be fetched.)
- Update the cache.
- Eviction:
- A cache eviction algorithm is a way of deciding which element to evict when the cache is full.
- Catch eviction methods:
- FIFO.
- LRU.
- LFU.
Cache Patterns:
Where can be keep cache?
MESSAGE QUEUE:
Performance Metrics (used to evaluate how good the system is performing):
-
Throughput
- Throughput is the number of actions executed or results produced in a certain amount of time.
- In system design, throughput comes into picutre when we need to understand how many API calls are being served in a particular amount of time.
-
Bandwidth
- Bandwidth is the maximum amount of data that can travel through a 'channel'.
-
Latency
- Latency is the time required to perform some action or to produce some result.
-
Response Time
- Response time is the time between a client sending a request and receiving a response. It is the sum of round trip latency and service time.
-
EXAMPLES (analogies):
- Water Analogy:
- Latency is the amount of time it takes to travel through the tube.
- Bandwidth is how wide the tube is.
- The amount of water flow will be your throughput
- Vehicle Analogy:
- Vehicle travel time from source to destination is latency.
- Types of Roadways are bandwidth.
- Number of Vehicles traveling is throughput.
Performance metrics of different components:
-
Applications:
- API Response Time
- Throughput of APIs
- Error occurences
- Bug/defect in the code
-
Databases:
- Time taken by various database queries
- Number of queries executed per unit time(or throughput)
- Memory
-
Caches:
- Latency of writing to cache
- Number of cache eviction and invalidation
- Memory of cache instance
-
Message Queues:
- Rate of production and consumption
- Fraction of stale or unprocessed messages
- Number of consumers affects bandwidth and throughput
-
Workers:
- Time taken for job completion
- Resources used in processing
-
Server instances:
- Memory/RAM
- CPU
Fault v/s Failure:
- Fault is the cause, failure is the effect.
Scaling:
- The ability to handle more request by buying more machines/bigger machines.
- Key features:
- Able to handle the increased load.
- Not complex to implement and maintain.
- Performance shouldn' takr a hit or rather performance should increase.
-
Vertical Scaling:
- When we increase the capacity of existing resource it is vertical scaling.
-
Horizontal Scaling:
- When we increase the number of resources it is horizontal scaling.
-
Horizontal v/s Vertical Scaling:
| Horizontal | Vertical | |------------|----------| |Need load balancers.|Load balancers not needed.| |Resilient.|Single point of failure.| |Slow remote procedure calls.|Fast inter process communication.| |Data inconsistency.|Data consistent.| |Scales well as uses increases.| Hardware limit.|
Database Replication:
- Replication: To have a copy.
- Having exact copy of data present in other databases in other machines.
- The database that has main source of writes/updates becomes the primary db. (Master)
- The database which has the copies from the primary db's is called the secondary database. (Slave)
- Why do we need replication?
- Having replicas helps in tolerating faults.
- Having replicas helps in reducing latency.
- Replica databases can be used for read queries, whereas the primary one can be used for write queries. (Gain application performance)
- Replication lag:
- The time it takes for the value to be copied from the primary to secondary database.
- If replication lag is huge, then it becomes a problem.
- Replicas will give inconsistent data.
- To overcome this there are several consistenct models:
- Synchronous replication:
- All replicas have to be updated before host is acknowledged.
- Advantages:
- No lag.
- Data is always consistent.
- Disadvantage:
- Performance might take a hit, because every write will have to wait for all replicas to get updated as well as acknowledge. (High latency)
- If any replica goes down, and couldn't give any acknowledgement, write will fail.
- Asynchronous replication:
- Host is acknowledged after primary database is updated. Replicas update asynchronously.
- Advantage:
- Write opeartion becomes faster.
- Disadvantage:
- If any replica fails, system will be in a inconsistent state.
- Semi-synchronous replication:
- Whenever a new write is issued, the primary database will update the value to all the replicas, and will wait for one of the replicas to acknowledge.
CAP (Consistency, Availability, and Partitioning)
-
Consistency:
- In a consistent system, once a client writes a value to any server and gets a response, it expects to get that value (or a fresher value) back from any server it reads from.
-
Availability:
- In an available system, if the client sends a request to a server and the server has not crashed, then the server must eventually respond to the client. The server is not allowed to ignore the client's requests.
-
Partitioning:
- The system continues to function and upholds its consistency guarantees in spite of network partitions.
- If we can tolerate the partition, and even though if partition happens and system can still be available and consistent is called partition tolerance.
-
CAP theorem (Brewer Theorem):
- Any network shared system wants to have these three properties.
- In such a system, having all three properties is nearly impossible.
- We need to sacrifice one of them.
- Partition tolerance happens due to network failures, and we do not have complete control over network failures.
- Hence, partition tolerance becomes a mandatory property to support.
Database Sharding
https://github.com/sanyathisside/System-Design
๐)๐๐๐ฆ๐ฉ๐ฅ๐ ๐ฉ๐ซ๐จ๐๐ฅ๐๐ฆ๐ฌ ๐๐จ๐ซ ๐๐๐
LLD implemented for following sample problems
https://github.com/imk13/low-level-design
๐) ๐๐๐๐ฅ๐๐ซ ๐๐๐๐๐๐ฆ๐ฒ ๐๐จ๐ญ๐๐ฌ ๐๐จ๐ซ ๐๐ฒ๐ฌ๐ญ๐๐ฆ ๐๐๐ฌ๐ข๐ ๐ง
https://github.com/naqushab/ScalerAcademy/tree/develop/Useful
๐) ๐๐๐ ๐๐๐ฌ๐จ๐ฎ๐ซ๐๐๐ฌ
Low-level Design
Resources
work@tech Machine Coding section
[Youtube] Christopher - Design patterns in OOPS
[Youtube] The Code Mate - LLD Series
[Youtube] Derek Banas - OOPS Playlist
Head first Design patterns - Book summary (Principles and patterns)
[Youtube] Udit Agarwal LLD playlist
[Youtube] Design patterns and SOLID
[Youtube] Google's Clean code talks
Educative's Grokking Object-oriented Design Interview
[Youtube] Low level System design
[Youtube] Low Level Design by Amazonian
[Youtube] SDE Skills - Object oriented design
Design Patterns for Humans [PHP] [Github]
Design patterns implemented in Java [Github]
[Medium] How to approach Object Oriented Design Questions step by step
[Rooftop Slushie] How to approach an LLD question for SDE1?
[Hashnode] Blog series on Software Design - by Maxi Contieri
[Hashnode] Blog series on Code smells - by Maxi Contieri
[Github] Design patterns - Code examples
Credits
- This repo is just a collection of Low-level Design resources available on the Internet and these resources are created by their respective owners.
License
https://github.com/sbvkrishna/low-level-design
Reference
Contributed by
Posted on July 5, 2022
Join Our Newsletter. No Spam, Only the good stuff.
Sign up to receive the latest update from our blog.