more questions

Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
2024-01-30 16:58:50 +08:00
parent 8184505b27
commit 24b5f9ea3b
3 changed files with 3 additions and 0 deletions
--- a/mini-lsm-book/src/week2-03-tiered.md
+++ b/mini-lsm-book/src/week2-03-tiered.md
@@ -130,6 +130,7 @@ As tiered compaction does not use the L0 level of the LSM state, you should dire
 * What happens if compaction speed cannot keep up with the SST flushes?
 * What might needs to be considered if the system schedules multiple compaction tasks in parallel?
 * SSDs also write its own logs (basically it is a log-structured storage). If the SSD has a write amplification of 2x, what is the end-to-end write amplification of the whole system? Related: [ZNS: Avoiding the Block Interface Tax for Flash-based SSDs](https://www.usenix.org/conference/atc21/presentation/bjorling).
+* Consider the case that the user chooses to keep a large number of sorted runs (i.e., 300) for tiered compaction. To make the read path faster, is it a good idea to keep some data structure that helps reduce the time complexity (i.e., to `O(log n)`) of finding SSTs to read in each layer for some key ranges? Note that normally, you will need to do a binary search in each sorted run to find the key ranges that you will need to read. (Check out Neon's [layer map](https://neon.tech/blog/persistent-structures-in-neons-wal-indexing) implementation!)

 We do not provide reference answers to the questions, and feel free to discuss about them in the Discord community.