
Conversation

@wence- (Contributor) commented Nov 24, 2025

To avoid lost wakeups, use an internal mutex to protect modification of the awaiter list.

Now, while we are notifying, if another notification task arrives it must wait for our modification of the awaiter list to complete. Hence waiters that were not ready, and were pushed back onto the list, have a chance to be woken up by the next notification.

I came back to the issue raised in #398 because we have a use case now where condvars make the most sense and we legitimately might have multiple simultaneous notification calls.

Since modification of the awaiter list (which holds the suspended awaiters waiting for a notification) crosses coroutine boundaries, I don't think we can protect it with a std::mutex. Hence I use a coro::mutex (internally managed by the condvar).
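One consequence of using a coro::mutex internally is that notify_all() itself becomes awaitable (see the signature change in the diff hunk below). A minimal caller-side sketch, assuming the executor-taking overload shown in this PR's diff and using hypothetical names throughout:

// Hypothetical caller only: `producer`, `cv`, and `tp` are made-up names; the
// point is that notify_all() now returns coro::task<void> and must be awaited,
// because it first acquires the condvar's internal coro::mutex.
auto producer(coro::condition_variable& cv,
              std::unique_ptr<coro::thread_pool>& tp) -> coro::task<void>
{
    // ... make the guarded condition true ...
    co_await cv.notify_all(tp);
    co_return;
}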

- Closes jbaldwin#398
The review comments below refer to this hunk of the PR diff:

-auto notify_all(std::unique_ptr<executor_type>& executor) -> void
+auto notify_all(std::unique_ptr<executor_type>& executor) -> coro::task<void>
 {
     co_await m_notify_mutex.scoped_lock();
@jbaldwin (Owner):

This will spawn all the tasks, but it doesn't guarantee that the notify tasks have completed and put non-ready waiters back onto the list. I think this is still racy.

Perhaps we need to run the notify / predicate check synchronously here and then spawn the user task only if the predicate is ready?

This is pseudocode and might have some restrictions I haven't thought of, but I think splitting the predicate/notify check from the user resume (which gets spawned) would allow the new lock to be held for the right amount of time, guaranteeing that any waiters that are not ready during notify_all() are properly placed back on the waiters list.

auto lock = co_await m_notify_mutex.scoped_lock();
auto* waiter = detail::awaiter_list_pop_all(m_awaiters);

while (waiter != nullptr)
{
    auto* next = waiter->m_next;
    // the notify calls need to be inline with the new lock to guarantee non-ready waiters are placed
    // back onto the waiters list to not miss notifications
    co_await make_notify_all_executor_individual_task(waiter);
    waiter = next;
}

co_return;

.... within make_notify_all_executor_individual_task....

auto make_notify_all_executor_individual_task(awaiter_base* waiter) -> coro::task<void>
{
   // this on_notify() is a problem since it resumes internally and we cannot spawn, so we might
   // need to rethink how this works; perhaps split it as on_notify() -> resume(), where this switch
   // statement can call executor->spawn(waiter->resume())?

   // or we pass the executor to on_notify() by pointer: if it isn't nullptr it can spawn,
   // otherwise it resumes inline -- that will hold the internal lock for the duration of the condvar lock
   // as well, which is maybe ok?
    switch (co_await waiter->on_notify())
    {
        case notify_status_t::not_ready:
            // Re-enqueue since the predicate isn't ready and return since the notify has been satisfied.
            detail::awaiter_list_push(m_awaiters, waiter);
            break;
        case notify_status_t::ready:
        case notify_status_t::awaiter_dead:
            // Don't re-enqueue any awaiters that are ready or dead.
            break;
    }
}

@wence- (Contributor, Author):

Ah, good point. I suppose another option would be to use a task_container and then yield until the tasks have all completed in this function. Then the lock is held for the notification and the wakeups.
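Roughly (a pseudocode-level sketch of that idea; m_notify_tasks is a hypothetical coro::task_container member, and the other names are taken from the snippets in this thread):

auto notify_all(std::unique_ptr<executor_type>& executor) -> coro::task<void>
{
    // Serialize notifiers and keep the lock until every notify task has run.
    auto lock = co_await m_notify_mutex.scoped_lock();

    auto* waiter = detail::awaiter_list_pop_all(m_awaiters);
    while (waiter != nullptr)
    {
        auto* next = waiter->m_next;
        // Each task checks the waiter's predicate and either resumes it or
        // pushes it back onto m_awaiters.
        m_notify_tasks.start(make_notify_all_executor_individual_task(waiter));
        waiter = next;
    }

    // Only release m_notify_mutex once all notify tasks have completed, so
    // any re-enqueued waiters are visible to the next notifier.
    co_await m_notify_tasks.yield_until_empty();
    co_return;
}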

@jbaldwin (Owner):

👍 I think that would work.

I also think task_container needs a latch-style wait though; it currently spins, which is really inefficient. coro::when_all is a little heavier on managing the tasks, but it doesn't spin-wait.

@jbaldwin (Owner):

@wence- I've played around with renaming to task_group and making it more ergonomic, in that you have to give it the full set of tasks upfront. There were definitely some race conditions in yield_until_empty() if you keep adding tasks into it. I think giving it the full set of tasks upfront makes sense? This way it's basically a dynamic-lifetime task tracker backed by a coro::latch, so the co_await task_group.wait() method is super efficient now.
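For illustration only (this is not the #418 code, just a minimal sketch of the latch-backed idea using coro::latch's count_down()/co_await interface; tiny_task_group and wrap() are made-up names):

#include <coro/coro.hpp>
#include <cstddef>

class tiny_task_group
{
public:
    // The full set of tasks is known upfront, so the latch can be sized once.
    explicit tiny_task_group(std::ptrdiff_t task_count) : m_remaining(task_count) {}

    // Wrap a user task so it counts the latch down when it finishes.
    auto wrap(coro::task<void> user_task) -> coro::task<void>
    {
        co_await user_task;
        m_remaining.count_down();
        co_return;
    }

    // Suspends until every wrapped task has completed; no spin/poll loop.
    auto wait() -> coro::task<void>
    {
        co_await m_remaining;
        co_return;
    }

private:
    coro::latch m_remaining;
};

The point being that wait() parks the awaiting coroutine on the latch instead of repeatedly polling an "is it empty yet?" check.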

Let me know your thoughts and if this works for the use case you are considering (I think it should for the cond var change).

task_group PR: #418

@jbaldwin (Owner):

#419

Here's a stab at getting it to work with the upgraded task_group; I think this should solve the wait()/notify() race conditions.

@wence- (Contributor, Author) commented Nov 26, 2025

> #419
>
> Here's a stab at getting it to work with the upgraded task_group; I think this should solve the wait()/notify() race conditions.

Thanks. It will take me a few days to find the time to do some proper testing, but from a quick glance that looks like it solves the underlying issues to me.

@wence- (Contributor, Author) commented Nov 26, 2025

One thing I would note with the task_group change: I was using a task_container morally like this:

struct ThingThatReceivesCallbacks {
  coro::task<void> async_notify(coro::latch& latch, args) {
      do_stuff_with(args);
      latch.count_down();
  }

  // This is called from a background thread
  void bridge_from_sync_code(...) {
     task_container_.start(async_notify(latch_, ...));
  }

  coro::task<void> wait_for_all_notifications(...) {
     co_await latch_;
     // This is so the coroutine unwinding doesn't jump into our dtor
     // before the final `async_notify` task completes.
     co_await task_container_.yield_until_empty();
  }
};

I think with the new task_group I can't do this, because I don't have the tasks up front; I only know how many there are.

I think I can instead do:

struct ThingThatReceivesCallbacks {
  coro::task<void> async_notify(coro::latch& latch, args) {
      do_stuff_with(args);
      latch.count_down();
  }

  // This is called from a background thread
  void bridge_from_sync_code(std::unique_ptr<coro::thread_pool>& executor) {
     executor->spawn(async_notify(latch_, ...));
  }

  coro::task<void> wait_for_all_notifications(std::unique_ptr<coro::thread_pool>& executor, ...) {
     co_await latch_;
     // This is so the coroutine unwinding doesn't jump into our dtor
     // before the final `async_notify` task completes.
     co_await executor->yield();
  }
};

I think that's equivalent, because yielding once is guaranteed to put me at the back of the task queue, which ensures the final async_notify completes before I go out of scope.

@jbaldwin (Owner) commented Nov 26, 2025

Yeah, I think that's correct for your change. You're basically already using the latch as a group, so that should work, just scheduling into the executor.

I found that the task group has a real race condition if it allows tasks to be added incrementally while you wait for it to be empty, which is why I changed it to take everything upfront. Technically it was a problem before with yield_until_empty() too, but that must have been slow enough that the tests never triggered it.

@jbaldwin (Owner) commented Dec 7, 2025

I re-added the task_group::start(task) method and managed to figure out how to do the tests without the race condition, so if you want to re-update this PR with the updated task_group we can move it forward again.
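For readers following along, a usage-level sketch of how start() and wait() compose; the group's construction is omitted because it isn't shown in this thread, and drain_notifications/notify_tasks are hypothetical names:

#include <coro/coro.hpp>
#include <utility>
#include <vector>

// Sketch only: `task_group_type` stands in for the task_group from #418/#419;
// only start() and wait(), the methods discussed in this thread, are used.
template<typename task_group_type>
auto drain_notifications(task_group_type& group,
                         std::vector<coro::task<void>> notify_tasks) -> coro::task<void>
{
    for (auto& t : notify_tasks)
    {
        // Hand each task to the group; its lifetime is now tracked by the group.
        group.start(std::move(t));
    }

    // Latch-backed wait: suspends until every started task has completed.
    co_await group.wait();
    co_return;
}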

Sorry for the turbulence in the API; I think we've landed on something good now, though.

edit: I had forgotten that I made a modification on top of this PR; if that is the preferred approach I can rebase and get that version merged.


Development

Successfully merging this pull request may close these issues:

deadlock/missing wake in condition_variable notify_all?