finish 2.2 write-up

Signed-off-by: Alex Chi Z <iskyzh@gmail.com>
2024-01-23 14:43:44 +08:00
parent 274a2cb7a2
commit 016c6ccadc
9 changed files with 195 additions and 7 deletions
--- a/README.md
+++ b/README.md
@@ -56,7 +56,7 @@ We are working on a new version of the mini-lsm tutorial that is split into 3 we
 | 1.6            | Storage Engine - Write Path                     | ✅        | ✅            | ✅       |
 | 1.7            | Bloom Filter and Key Compression                | ✅        | ✅            | ✅       |
 | 2.1            | Compaction Implementation                       | ✅        | ✅            | ✅       |
-| 2.2            | Compaction Strategy - Simple                    | ✅        | 🚧            | 🚧       |
+| 2.2            | Compaction Strategy - Simple                    | ✅        | ✅            | ✅       |
 | 2.3            | Compaction Strategy - Tiered                    | ✅        | 🚧            | 🚧       |
 | 2.4            | Compaction Strategy - Leveled                   | ✅        | 🚧            | 🚧       |
 | 2.5            | Manifest                                        | ✅        | 🚧            | 🚧       |
--- a/mini-lsm-book/src/week1-02-merge-iterator.md
+++ b/mini-lsm-book/src/week1-02-merge-iterator.md
@@ -138,6 +138,7 @@ We are finally there -- with all iterators you have implemented, you can finally
 * Why do we need to ensure the merge iterator returns data in the iterator construction order?
 * Is it possible to implement a Rust-style iterator (i.e., `next(&self) -> (Key, Value)`) for LSM iterators? What are the pros/cons?
 * The scan interface is like `fn scan(&self, lower: Bound<&[u8]>, upper: Bound<&[u8]>)`. How to make this API compatible with Rust-style range (i.e., `key_a..key_b`)? If you implement this, try to pass a full range `..` to the interface and see what will happen.
+* The starter code provides the merge iterator interface to store `Box<I>` instead of `I`. What might be the reason behind that?

 We do not provide reference answers to the questions, and feel free to discuss about them in the Discord community.

--- a/mini-lsm-book/src/week2-01-compaction.md
+++ b/mini-lsm-book/src/week2-01-compaction.md
@@ -84,6 +84,28 @@ You can also change your compaction implementation to leverage the concat iterat

 You will need to implement `num_active_iterators` for concat iterator so that the test case can test if concat iterators are being used by your implementation, and it should always be 1.

+To test your implementation interactively,
+
+```shell
+cargo run --bin mini-lsm-cli-ref -- --compaction none # reference solution
+cargo run --bin mini-lsm-cli -- --compaction none # your solution
+```
+
+And then,
+
+```
+fill 1000 3000
+flush
+fill 1000 3000
+flush
+full_compaction
+fill 1000 3000
+flush
+full_compaction
+get 2333
+scan 2000 2333
+```
+
 ## Test Your Understanding

 * What are the definitions of read/write/space amplifications? (This is covered in the overview chapter)
--- a/mini-lsm-book/src/week2-02-simple.md
+++ b/mini-lsm-book/src/week2-02-simple.md
@@ -7,12 +7,147 @@ In this chapter, you will:
 * Implement a simple leveled compaction strategy and simulate it on the compaction simulator.
 * Start compaction as a background task and implement a compaction trigger in the system.

-## Task 1: Simple Level Compaction
+## Task 1: Simple Level Compaction + Compaction Simulation

-## Task 2: Compaction Simulation
+In this chapter, we are going to implement our first compaction strategy -- simple leveled compaction. In this task, you will need to modify:
+
+```
+src/compact/simple_leveled.rs
+```
+
+Simple leveled compaction is similar the original LSM paper's compaction strategy. It maintains a number of levels for the LSM tree. When a level (>= L1) is too large, it will merge all of this level's SSTs with next level. The compaction strategy is controlled by 3 parameters as defined in `SimpleLeveledCompactionOptions`.
+
+* `size_ratio_percent`: lower level number of files / upper level number of files. In reality, we should compute the actual size of the files. However, we simplified the equation to use number of files to make it easier to do the simulation. When the ratio is too high (upper level has too many files), we should trigger a compaction.
+* `level0_file_num_compaction_trigger`: when the number of SSTs in L0 is larger than or equal to this number, trigger a compaction of L0 and L1.
+* `max_levels`: the number of levels (excluding L0) in the LSM tree.
+
+Assume size_ratio_percent=200, max_levels=3, level0_file_num_compaction_trigger=2, let us take a look at the below example.
+
+Assume the engine flushes two L0 SSTs. This reaches the `level0_file_num_compaction_trigger`, and your controller should trigger an L0->L1 compaction.
+
+```
+--- After Flush ---
+L0 (2): [1, 2]
+L1 (0): []
+L2 (0): []
+L3 (0): []
+--- After Compaction ---
+L0 (0): []
+L1 (2): [3, 4]
+L2 (0): []
+L3 (0): []
+```
+
+Now, L2 is empty while L1 has two files. The size ratio for L1 and L2 is `L2/L1=0/2=0 < size_ratio`. Therefore, we will trigger a L1+L2 compaction to push the data lower to L2. The same applies to L2 and these two SSTs will be placed at the bottom-most level after 2 compactions.
+
+```
+--- After Compaction ---
+L0 (0): []
+L1 (0): []
+L2 (2): [5, 6]
+L3 (0): []
+--- After Compaction ---
+L0 (0): []
+L1 (0): []
+L2 (0): []
+L3 (2): [7, 8]
+```
+
+Continue flushing SSTs, we will find:
+
+```
+L0 (0): []
+L1 (0): []
+L2 (2): [13, 14]
+L3 (2): [7, 8]
+```
+
+At this point, `L3/L2=1 < size_ratio`. Therefore, we need to trigger a compaction between L2 and L3.
+
+```
+--- After Compaction ---
+L0 (0): []
+L1 (0): []
+L2 (0): []
+L3 (4): [15, 16, 17, 18]
+```
+
+As we flush more SSTs, we will possibly end up at a state as follows:
+
+```
+--- After Flush ---
+L0 (2): [19, 20]
+L1 (0): []
+L2 (0): []
+L3 (4): [15, 16, 17, 18]
+--- After Compaction ---
+L0 (0): []
+L1 (0): []
+L2 (2): [23, 24]
+L3 (4): [15, 16, 17, 18]
+```
+
+Because `L3/L2 = 2 >= size_ratio`, we do not need to merge L2 and L3 and will end up with the above state. Simple leveled compaction strategy always compact a full level, and keep a fanout size between levels, so that the lower level is always some multiplier times larger than the upper level.
+
+We have already initialized the LSM state to have `max_level` levels. You should first implement `generate_compaction_task` that generates a compaction task based on the above 3 criteria. After that, implement `apply_compaction_result`. We recommend you implement L0 trigger first, run a compaction simulation, and then implement the size ratio trigger, and then run a compaction simulation. To run the compaction simulation,
+
+```shell
+cargo run --bin compaction-simulator-ref simple # Reference solution
+cargo run --bin compaction-simulator simple # Your solution
+```
+
+The simulator will flush an L0 SST into the LSM state, run your compaction controller to generate a compaction task, and then apply the compaction result. Each time a new SST gets flushed, it will repetitively call the controller until no compaction needs to be scheduled, and therefore you should ensure your compaction task generator will converge.
+
+In your compaction implementation, you should reduce the number of active iterators (i.e., use concat iterator) as much as possible. Also, remember that merge order matters, and you will need to ensure that the iterators you create produces key-value pairs in the correct order, when multiple versions of a key appear.
+
+**Note: we do not provide fine-grained unit tests for this part. You can run the compaction simulator and compare with the output of the reference solution to see if your implementation is correct.**
+
+## Task 2: Compaction Thread
+
+In this task, you will need to modify:
+
+```
+src/compact.rs
+```
+
+Now that you have implemented your compaction strategy, you will need to run it in a background thread, so as to compact the files in the background. In `compact.rs`, `trigger_compaction` will be called every 50ms, and you will need to:
+
+1. generate a compaction task, if no task needs to be scheduled, return ok.
+2. run the compaction and get a list of new SSTs.
+3. Similar to `force_full_compaction` you have implemented in the previous chapter, update the LSM state.

 ## Task 3: Integrate with the Read Path

+In this task, you will need to modify:
+
+```
+src/lsm_storage.rs
+```
+
+Now that you have multiple levels of SSTs, you can modify your read path to include the SSTs from the new levels. You will need to update the scan/get function to include all levels below L1. Also, you might need to change the `LsmStorageIterator` inner type again.
+
+To test your implementation interactively,
+
+```shell
+cargo run --bin mini-lsm-cli-ref -- --compaction simple # reference solution
+cargo run --bin mini-lsm-cli -- --compaction simple # your solution
+```
+
+And then,
+
+```
+fill 1000 3000
+flush
+fill 1000 3000
+flush
+fill 1000 3000
+flush
+get 2333
+scan 2000 2333
+```
+
+You may print something, for example, the compaction task information, when the compactor triggers a compaction.
+
 ## Test Your Understanding

 * Is it correct that a key will only be purged from the LSM tree if the user requests to delete it and it has been compacted in the bottom-most level?
--- a/mini-lsm-starter/src/bin/compaction-simulator.rs
+++ b/mini-lsm-starter/src/bin/compaction-simulator.rs
@@ -1,15 +1,18 @@
+mod wrapper;
+use wrapper::mini_lsm_wrapper;
+
 use std::collections::HashMap;
 use std::sync::Arc;

 use bytes::{Buf, BufMut, Bytes, BytesMut};
 use clap::Parser;
-use mini_lsm::compact::{
+use mini_lsm_wrapper::compact::{
    LeveledCompactionController, LeveledCompactionOptions, SimpleLeveledCompactionController,
    SimpleLeveledCompactionOptions, TieredCompactionController, TieredCompactionOptions,
 };
-use mini_lsm::lsm_storage::LsmStorageState;
-use mini_lsm::mem_table::MemTable;
-use mini_lsm::table::SsTable;
+use mini_lsm_wrapper::lsm_storage::LsmStorageState;
+use mini_lsm_wrapper::mem_table::MemTable;
+use mini_lsm_wrapper::table::SsTable;

 #[derive(Parser, Debug)]
 #[command(author, version, about, long_about = None)]
@@ -220,6 +223,7 @@ fn main() {
            level0_file_num_compaction_trigger,
            max_levels,
        } => {
+            // TODO(chi): use unified logic for all 3 compactions...
            let controller =
                SimpleLeveledCompactionController::new(SimpleLeveledCompactionOptions {
                    size_ratio_percent,
--- a/mini-lsm-starter/src/bin/mini-lsm-cli.rs
+++ b/mini-lsm-starter/src/bin/mini-lsm-cli.rs
@@ -91,6 +91,12 @@ fn main() -> Result<()> {
            }

            println!("{} values filled with epoch {}", end - begin + 1, epoch);
+        } else if line.starts_with("del ") {
+            let Some((_, key)) = line.split_once(' ') else {
+                println!("invalid command");
+                continue;
+            };
+            lsm.delete(key.as_bytes())?;
        } else if line.starts_with("get ") {
            let Some((_, key)) = line.split_once(' ') else {
                println!("invalid command");
@@ -114,6 +120,7 @@ fn main() -> Result<()> {
                std::ops::Bound::Included(begin_key.as_bytes()),
                std::ops::Bound::Included(end_key.as_bytes()),
            )?;
+            let mut cnt = 0;
            while iter.is_valid() {
                println!(
                    "{:?}={:?}",
@@ -121,11 +128,15 @@ fn main() -> Result<()> {
                    Bytes::copy_from_slice(iter.value()),
                );
                iter.next()?;
+                cnt += 1;
            }
+            println!("{} keys scanned", cnt);
        } else if line == "dump" {
            lsm.dump_structure();
        } else if line == "flush" {
            lsm.force_flush()?;
+        } else if line == "full_compaction" {
+            lsm.force_full_compaction()?;
        } else if line == "quit" {
            lsm.close()?;
            break;
--- a/mini-lsm-starter/src/compact/simple_leveled.rs
+++ b/mini-lsm-starter/src/compact/simple_leveled.rs
@@ -28,6 +28,9 @@ impl SimpleLeveledCompactionController {
        Self { options }
    }

+    /// Generates a compaction task.
+    ///
+    /// Returns `None` if no compaction needs to be scheduled. The order of SSTs in the compaction task id vector matters.
    pub fn generate_compaction_task(
        &self,
        _snapshot: &LsmStorageState,
@@ -35,6 +38,13 @@ impl SimpleLeveledCompactionController {
        unimplemented!()
    }

+    /// Apply the compaction result.
+    ///
+    /// The compactor will call this function with the compaction task and the list of SST ids generated. This function applies the
+    /// result and generates a new LSM state. The functions should only change `l0_sstables` and `levels` without changing memtables
+    /// and `sstables` hash map. Though there should only be one thread running compaction jobs, you should think about the case
+    /// where an L0 SST gets flushed while the compactor generates new SSTs, and with that in mind, you should do some sanity checks
+    /// in your implementation.
    pub fn apply_compaction_result(
        &self,
        _snapshot: &LsmStorageState,
--- a/mini-lsm/Cargo.toml
+++ b/mini-lsm/Cargo.toml
@@ -34,3 +34,7 @@ path = "src/bin/mini-lsm-cli.rs"
 [[bin]]
 name = "mini-lsm-wrapper-ref"
 path = "src/bin/wrapper.rs"
+
+[[bin]]
+name = "compaction-simulator-ref"
+path = "src/bin/compaction-simulator.rs"
--- a/mini-lsm/src/bin/compaction-simulator.rs
+++ b/mini-lsm/src/bin/compaction-simulator.rs
@@ -0,0 +1 @@
+../../../mini-lsm-starter/src/bin/compaction-simulator.rs
				`@@ -0,0 +1 @@`
				`../../../mini-lsm-starter/src/bin/compaction-simulator.rs`