Read path

After a read request is directed to a specific node, it first checks if data is in the memory cache. If so, the data is returned to the client

If the data is not in memory, it will be retrieved from the disk instead. We need an efficient way to find out which SSTable contains the key. Bloom filter is commonly used to solve this problem.

The read path is shown in the picture below when data is not in memory.

The system first checks if the data is in memory. If not, go to step 2.
If data is not in memory, the system checks the bloom filter.
The bloom filter is used to figure out which SSTables might contain the key.
SSTables return the result of the data set.
The result of the data set is returned to the client.

PreviousWrite path NextDesign a unique id generator in distributed systems

Last updated 1 year ago