feat: [MR-617] Enforce subnet-wide best-effort message memory limit #1835

Open
wants to merge 13 commits into base: master

Conversation

@alin-at-dfinity (Contributor) commented Oct 3, 2024:

At the end of each round, if the total best-effort message memory usage is above a configured limit, build a priority queue and keep shedding the largest message of the canister with the highest best-effort message memory usage until we're back under the limit.

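
For illustration, here is a minimal, self-contained sketch of that scheme against a toy model (a HashMap of best-effort message sizes per canister) rather than the real ReplicatedState; every name below (State, shed_largest_message, enforce_best_effort_message_memory_limit, the u64 aliases) is a placeholder, not the PR's actual API:

use std::collections::{BinaryHeap, HashMap};

type CanisterId = u64;
type NumBytes = u64;

/// Toy model: each canister's best-effort messages, by size in bytes.
struct State {
    messages: HashMap<CanisterId, Vec<NumBytes>>,
    limit: NumBytes,
}

impl State {
    /// Total best-effort message memory usage across all canisters.
    fn best_effort_message_memory_taken(&self) -> NumBytes {
        self.messages.values().flatten().sum()
    }

    /// Best-effort message memory usage of a single canister.
    fn canister_memory_usage(&self, id: CanisterId) -> NumBytes {
        self.messages.get(&id).into_iter().flatten().sum()
    }

    /// Sheds (drops) the largest best-effort message of `id`, returning the
    /// bytes freed, or 0 if the canister has no best-effort messages left.
    fn shed_largest_message(&mut self, id: CanisterId) -> NumBytes {
        let Some(msgs) = self.messages.get_mut(&id) else { return 0 };
        let Some(pos) = (0..msgs.len()).max_by_key(|&i| msgs[i]) else { return 0 };
        msgs.swap_remove(pos)
    }

    /// End-of-round enforcement: while the total usage is above the limit,
    /// shed the largest message of the canister with the highest usage.
    fn enforce_best_effort_message_memory_limit(&mut self) {
        let mut usage = self.best_effort_message_memory_taken();
        if usage <= self.limit {
            return;
        }
        // Max-heap of canisters, ordered by best-effort memory usage.
        let mut heap: BinaryHeap<(NumBytes, CanisterId)> = self
            .messages
            .keys()
            .map(|&id| (self.canister_memory_usage(id), id))
            .collect();
        while usage > self.limit {
            let Some((_, id)) = heap.pop() else { break };
            let freed = self.shed_largest_message(id);
            usage = usage.saturating_sub(freed);
            let remaining = self.canister_memory_usage(id);
            if remaining > 0 {
                // Re-insert with its updated usage so the next iteration
                // again picks whichever canister is currently heaviest.
                heap.push((remaining, id));
            }
        }
    }
}

The BinaryHeap keeps the canister with the highest best-effort message memory usage on top; after each shed, the canister is re-inserted with its updated usage rather than being drained in one go.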
rs/messaging/src/state_machine.rs (resolved thread, outdated)
rs/replicated_state/src/replicated_state.rs (3 resolved threads, outdated)
@@ -668,6 +668,18 @@ impl ReplicatedState {
canisters_memory_usage + subnet_memory_usage
}

/// Computes the memory taken by best-effort response messages.
pub fn best_effort_message_memory_taken(&self) -> NumBytes {
Contributor:

It is somewhat confusing that we include the subnet queue usage here, but then don't shed from the subnet queues below?

Should we maybe exclude the usage related to the subnet queues here and re-introduce it in the PR that will start shedding messages from the subnet queues?

Contributor Author (alin-at-dfinity):

The reason subnet queues were included here is that this is a copy-pasta of guaranteed_response_message_memory_taken() just above (which had it).

The reason why the shedding logic doesn't have it is that I forgot about it. I wrote that code before this, so it wasn't front of mind; and I only wrote this method while writing the test.

I'll try extending the shedding logic to subnet queues in this PR. If it turns out to be a sizable change, I'll break it out into its own PR.

Contributor Author (alin-at-dfinity):

Done. Also extended the test to shed from the subnet queues.

rs/replicated_state/src/replicated_state.rs (resolved thread, outdated)
rs/replicated_state/src/replicated_state.rs (resolved thread)
@oggy-dfin (Member) left a comment:

Left a bunch of nitpicks. What's the plan for subnet queues? Is that a separate ticket?

rs/messaging/src/state_machine.rs (resolved thread)
@@ -668,6 +668,18 @@ impl ReplicatedState {
canisters_memory_usage + subnet_memory_usage
}

/// Computes the memory taken by best-effort response messages.
pub fn best_effort_message_memory_taken(&self) -> NumBytes {
Member:

Nitpick: why is this called "taken" but the methods on the state/queues are called "usage"?

Member:

Also, this seems to be used only in tests; I forget whether we mark such methods somehow?

Contributor Author (alin-at-dfinity):

It's to make it consistent with guaranteed_response_message_memory_taken() above (most (good) programming style guides will tell you to prioritize consistency over other rules). I believe Execution came up with that one, so you can bring it up with them. (o:

As for it only being used in tests, that's true. If it were to remain so, we could add a #[cfg(test)] attribute. But I can easily imagine that we'd e.g. want to expose this as a metric, in this PR or a follow-up.
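
For reference, the #[cfg(test)] marking alluded to above looks roughly like this (a generic, hypothetical example, not the PR's code):

pub struct Accounting {
    bytes: u64,
}

impl Accounting {
    /// Compiled only when this crate's tests are built; regular and release
    /// builds don't carry the symbol at all.
    #[cfg(test)]
    pub fn bytes_for_tests(&self) -> u64 {
        self.bytes
    }
}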

Member:

Well, I'm asking exactly because it's inconsistent :) just between the different layers. I'd prefer it if you could change it to something uniform everywhere, but it's not a blocker for me.

rs/replicated_state/src/replicated_state.rs (2 resolved threads, outdated)
 * Switch to using `best_effort_message_memory_taken()` instead of a hand-rolled calculation.
 * Add a `debug_assert!()` to ensure that our running `memory_usage` amount matches the actual memory usage.
@stiegerc (Contributor) left a comment:

LGTM, except that the PR description could be more precise too.

Comment on lines 1049 to 1051
let memory_usage_delta = memory_usage_before - memory_usage_after;
debug_assert!(memory_usage_delta > ZERO_BYTES);
memory_usage -= memory_usage_delta;
@stiegerc (Contributor) commented Nov 3, 2024:

What happens here if memory_usage_after > memory_usage_before? I know it shouldn't be possible, and if it were, the accounting would be screwed up anyway. But I'd replace this debug_assert!() with debug_assert!(memory_usage_before > memory_usage_after) and put it in front.

That is, unless ZERO_BYTES is intended to possibly not actually be zero bytes at some point in the future.

Contributor Author (alin-at-dfinity):

Good point. Unless memory_usage_after were equal to memory_usage_before, the debug_assert!() wouldn't do anything useful as is, because the two subtractions would underflow first. (OTOH, I believe that in a debug build an integer overflow will panic anyway.)

Still, there is one case where shedding a message may result in (marginally) higher memory usage: when we shed an outbound request, we enqueue an actual reject response; if the request was really tiny, the reject response may be larger than it (even though the payload consists only of the error message "Request timed out."). It is not clear to me what the right thing to do is at that point (apart from not enqueuing an actual response). I suppose we should keep going regardless, because the alternative is to end up with arbitrarily many tiny messages that are never shed.

So I've changed the debug_assert!() as suggested, but left everything else as it was.
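
A minimal sketch of the assertion ordering that was settled on, with plain u64 byte counts standing in for NumBytes and a hypothetical helper name:

/// Updates a running total after shedding one message, asserting (in debug
/// builds) that shedding actually freed memory before any subtraction runs.
fn apply_shed(memory_usage: &mut u64, usage_before: u64, usage_after: u64) {
    // Asserting this first reports the broken invariant directly; asserting
    // on the delta after computing `usage_before - usage_after` would have
    // underflowed already (which also panics in debug builds, but with a far
    // less informative message).
    debug_assert!(usage_before > usage_after);
    let delta = usage_before.saturating_sub(usage_after);
    *memory_usage = memory_usage.saturating_sub(delta);
}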

Comment on lines 972 to 974
/// Enforces the best-effort message limit by repeatedly shedding the largest
/// best-effort messages of the canisters with the largest best-effort memory
/// usage until the memory usage drops below the limit.
Contributor:

This seems imprecise. I had to really look at the code to understand what exactly this means.

IIUC the priority queue holds the canisters by memory usage, i.e. the one with the largest usage in front. It then sheds the largest message from this canister until we either go below the limit or a different canister becomes the one with the largest memory usage (or we run out of messages altogether). So it's not some round-robin scheme over the canisters with the largest best-effort memory usage, where each has its largest message shed, or something like that (which is what I was sort of guessing from reading the comment).
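
For concreteness, a toy walk-through of that order, reusing the illustrative State sketch from earlier on this page (made-up numbers; not the PR's code):

// Canister 1 holds messages of 60 and 50 bytes, canister 2 holds one
// 70-byte message, and the limit is 100 bytes.
fn main() {
    let mut state = State {
        messages: [(1, vec![60, 50]), (2, vec![70])].into_iter().collect(),
        limit: 100,
    };
    state.enforce_best_effort_message_memory_limit();
    // Canister 1 (110 bytes) is heaviest, so its 60-byte message is shed
    // first (usage 180 -> 120). Canister 2 (70 bytes) is now heaviest, so
    // its message is shed next (usage 120 -> 50); no round-robin involved.
    assert_eq!(state.best_effort_message_memory_taken(), 50);
}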

Contributor Author (alin-at-dfinity):

Changed it to say "repeatedly shedding the largest best-effort message of the canister with the highest best-effort message memory usage" (i.e. dropped the plurals).

If you have a better suggestion, I'm happy to apply it.

rs/replicated_state/tests/replicated_state.rs (resolved thread, outdated)
@derlerd-dfinity (Contributor) left a comment:

Thank you. LGTM.

rs/replicated_state/src/replicated_state.rs (resolved thread, outdated)
Comment on lines 1021 to 1023
let message_shed;
let memory_usage_after;
if canister_id.get() == self.metadata.own_subnet_id.get() {
Contributor:

I would find it easier to read if we changed this as follows:

Suggested change:
-let message_shed;
-let memory_usage_after;
-if canister_id.get() == self.metadata.own_subnet_id.get() {
+let (message_shed, memory_usage_after) = if canister_id.get() == self.metadata.own_subnet_id.get() {
...

Contributor Author (alin-at-dfinity):

I only found out about this now and I find it a lot cleaner than the usual

let (message_shed, memory_usage_after) = if foo() {
    let message_shed = bar();
    let memory_usage_after = baz();
    // More code here.
    (message_shed, memory_usage_after)
} else {
    let message_shed = qux();
    let memory_usage_after = quux();
    // Other code here.
    (message_shed, memory_usage_after)
};

Changed, nonetheless, for the sake of consistency.

Contributor:

I don't have a strong opinion, so if you feel strongly you can also leave it as it was. What prompted me to comment was that I wasn't aware of the possibility of uninitialized variables in Rust either, and I had to Google it to make sure I understood the semantics.

Contributor Author (alin-at-dfinity):

They're not uninitialized, they just look like it. If you were to not initialize them on every code path (i.e. in both if branches), the compiler would complain. That is also why they don't need to be mut: they're simply declared at the top and initialized exactly once further down.
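
For readers unfamiliar with the pattern, a small self-contained example of such deferred initialization (generic names, unrelated to the canister code):

fn describe(flag: bool) -> (&'static str, u32) {
    // Declared without initializers and without `mut`: the compiler accepts
    // this only because every path below assigns each variable exactly once
    // before it is read.
    let label;
    let code;
    if flag {
        label = "enabled";
        code = 1;
    } else {
        label = "disabled";
        code = 0;
    }
    (label, code)
}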

rs/replicated_state/tests/replicated_state.rs (resolved thread, outdated)