
Data race detected in libfabric shm provider with multi-threaded client-server setup #10528

Open
piotrchmiel opened this issue Nov 8, 2024 · 6 comments

@piotrchmiel (Contributor) commented Nov 8, 2024:

Describe the bug
When using the shm provider in a simple setup with one client thread and one server thread within a single process, a data race is detected when the program is compiled with clang-19 and run with ThreadSanitizer. The client performs one fi_send and the server performs one fi_recv, with a message size of 1000 bytes. The data race appears during the fi_cq_read operation on the server side and the fi_send operation on the client side.

To Reproduce

  1. Set up a process with two threads: one acting as a client and the other as a server.
  2. Use the shm provider with libfabric version 1.22.0.
  3. Compile the program with clang-19 and enable ThreadSanitizer (-fsanitize=thread).
  4. In the client thread, perform an fi_send operation with a message of 1000 bytes.
  5. In the server thread, perform an fi_recv operation to receive the message.
  6. Observe that ThreadSanitizer reports a data race during execution (a minimal sketch of this setup is shown below).
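
For reference, a minimal sketch of this kind of setup follows. It is not the reproducer linked later in this thread; the structure (the open_side helper, the API version passed to fi_getinfo, the buffer sizes, and the lack of error handling) is an assumption made purely for illustration.

```c
/* Hedged sketch of the setup above: one process, a client thread doing fi_send
 * and a server thread doing fi_recv + fi_cq_read over the shm provider. This
 * is not the linked reproducer; error handling is omitted for brevity.
 * Build (roughly): cc -fsanitize=thread sketch.c -lfabric -lpthread */
#include <pthread.h>
#include <string.h>
#include <rdma/fabric.h>
#include <rdma/fi_errno.h>
#include <rdma/fi_domain.h>
#include <rdma/fi_endpoint.h>
#include <rdma/fi_eq.h>
#include <rdma/fi_cm.h>

#define MSG_SIZE 1000

struct side {
    struct fi_info *info;
    struct fid_fabric *fabric;
    struct fid_domain *domain;
    struct fid_av *av;
    struct fid_cq *cq;
    struct fid_ep *ep;
    char name[256];
    size_t namelen;
};

static struct side client, server;
static fi_addr_t server_addr;
static struct fi_context tx_ctx, rx_ctx;

/* Open fabric/domain/AV/CQ/endpoint for one side and record its address. */
static void open_side(struct side *s)
{
    struct fi_info *hints = fi_allocinfo();
    struct fi_cq_attr cq_attr = { .format = FI_CQ_FORMAT_CONTEXT };
    struct fi_av_attr av_attr = { .type = FI_AV_TABLE };

    hints->ep_attr->type = FI_EP_RDM;
    hints->caps = FI_MSG;
    hints->mode = FI_CONTEXT;              /* pass fi_context as op context */
    hints->fabric_attr->prov_name = strdup("shm");

    fi_getinfo(FI_VERSION(1, 20), NULL, NULL, 0, hints, &s->info); /* API version: assumption */
    fi_fabric(s->info->fabric_attr, &s->fabric, NULL);
    fi_domain(s->fabric, s->info, &s->domain, NULL);
    fi_av_open(s->domain, &av_attr, &s->av, NULL);
    fi_cq_open(s->domain, &cq_attr, &s->cq, NULL);
    fi_endpoint(s->domain, s->info, &s->ep, NULL);
    fi_ep_bind(s->ep, &s->av->fid, 0);
    fi_ep_bind(s->ep, &s->cq->fid, FI_TRANSMIT | FI_RECV);
    fi_enable(s->ep);

    s->namelen = sizeof(s->name);
    fi_getname(&s->ep->fid, s->name, &s->namelen);
    fi_freeinfo(hints);
}

/* Client thread: one fi_send of 1000 bytes, then wait for its completion. */
static void *client_thread(void *arg)
{
    static char buf[MSG_SIZE] = "hello";
    struct fi_cq_entry entry;

    while (fi_send(client.ep, buf, MSG_SIZE, NULL, server_addr, &tx_ctx) == -FI_EAGAIN)
        fi_cq_read(client.cq, &entry, 1);
    while (fi_cq_read(client.cq, &entry, 1) == -FI_EAGAIN)
        ;
    return NULL;
}

/* Server thread: one fi_recv, then poll the CQ (where the race was reported). */
static void *server_thread(void *arg)
{
    static char buf[MSG_SIZE];
    struct fi_cq_entry entry;

    fi_recv(server.ep, buf, MSG_SIZE, NULL, FI_ADDR_UNSPEC, &rx_ctx);
    while (fi_cq_read(server.cq, &entry, 1) == -FI_EAGAIN)
        ;
    return NULL;
}

int main(void)
{
    pthread_t ct, st;

    open_side(&client);
    open_side(&server);
    /* The client only needs the server's address to target fi_send. */
    fi_av_insert(client.av, server.name, 1, &server_addr, 0, NULL);

    pthread_create(&st, NULL, server_thread, NULL);
    pthread_create(&ct, NULL, client_thread, NULL);
    pthread_join(ct, NULL);
    pthread_join(st, NULL);
    return 0;
}
```

Running a program along these lines under ThreadSanitizer (clang -fsanitize=thread) corresponds to the scenario described in the steps above.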

Expected behavior
No data race should be detected when performing simple send and receive operations between a client and server thread within the same process using the shm provider.

Output
WARNING: ThreadSanitizer: data race (pid=543759)

Server thread:
Read of size 8 at 0x7febb28f3000 by thread T4 (mutexes: write M0, write M1, write M2):
#0 strncmp /home/piotrchmiel/llvm-project/compiler-rt/lib/tsan/rtl/../../sanitizer_common/sanitizer_common_interceptors.inc:487:3 (test+0x9d66d)
#1 smr_name_compare /home/piotrchmiel/test/third_party/libfabric/prov/shm/src/smr_util.c:351:9 (libfabric.so.1+0xe5a33)

  • during cq_read

Client thread:
Previous write of size 8 at 0x7febb28f3000 by main thread:
#0 memcpy /home/piotrchmiel/llvm-project/compiler-rt/lib/tsan/rtl/../../sanitizer_common/sanitizer_common_interceptors_memintrinsics.inc:115:5 (test+0x8e9de)
#1 smr_send_name /home/piotrchmiel/test/third_party/libfabric/prov/shm/src/smr_ep.c:206:2 (libfabric.so.1+0xdcfcd)

  • during fi_send

Environment:

  • Operating System: Ubuntu 22.04
  • Compiler: clang-19 with ThreadSanitizer
  • libfabric Version: 1.22.0
  • Provider: shm
  • Endpoint Type: FI_EP_RDM

Additional context
The data race occurs specifically on memory accessed in smr_name_compare (during fi_cq_read on the server side) and in smr_send_name (during fi_send on the client side).

piotrchmiel added the bug label Nov 8, 2024
aingerson self-assigned this Nov 8, 2024

@aingerson (Contributor) commented:

@piotrchmiel Thanks for reporting! Any chance you have an existing reproducer you could send so I don't have to try to implement it?

@piotrchmiel (Contributor, Author) commented:

@aingerson I prepared a reproducer:
https://github.com/piotrchmiel/shm_fabric_reproducer
Steps to reproduce the problem:

  1. ./build.sh
  2. FI_LOG_LEVEL=Debug ./build/reproducer (the issue reproduces more often with debug logging enabled)

I'm using clang 19.1.2 (https://github.com/llvm/llvm-project/tree/llvmorg-19.1.2).

ThreadSanitizer logs from the reproducer:

WARNING: ThreadSanitizer: data race (pid=607864)
  Read of size 8 at 0x7f16ccd75000 by thread T1 (mutexes: write M43, write M30, write M28, write M41):
    #0 strncmp ../../../../src/libsanitizer/sanitizer_common/sanitizer_common_interceptors.inc:483 (libtsan.so.0+0x67576)
    #1 strncmp ../../../../src/libsanitizer/sanitizer_common/sanitizer_common_interceptors.inc:479 (libtsan.so.0+0x67576)
    #2 smr_name_compare prov/shm/src/smr_util.c:351 (libfabric.so.1+0x155b6d)
    #3 operator() /home/piotrc/reproducer/main.cpp:176 (reproducer+0xa1e7)
    #4 __invoke_impl<void, main()::<lambda()> > /usr/include/c++/11/bits/invoke.h:61 (reproducer+0xc0d8)
    #5 __invoke<main()::<lambda()> > /usr/include/c++/11/bits/invoke.h:96 (reproducer+0xbfc6)
    #6 _M_invoke<0> /usr/include/c++/11/bits/std_thread.h:259 (reproducer+0xbe88)
    #7 operator() /usr/include/c++/11/bits/std_thread.h:266 (reproducer+0xbdd4)
    #8 _M_run /usr/include/c++/11/bits/std_thread.h:211 (reproducer+0xbd40)
    #9 <null> <null> (libstdc++.so.6+0xdc252)

  Previous write of size 8 at 0x7f16ccd75000 by thread T2 (mutexes: write M14):
    #0 memcpy ../../../../src/libsanitizer/sanitizer_common/sanitizer_common_interceptors.inc:827 (libtsan.so.0+0x6243e)
    #1 memcpy ../../../../src/libsanitizer/sanitizer_common/sanitizer_common_interceptors.inc:819 (libtsan.so.0+0x6243e)
    #2 smr_send_name prov/shm/src/smr_ep.c:206 (libfabric.so.1+0x14d58e)
    #3 operator() /home/piotrc/reproducer/main.cpp:250 (reproducer+0xaceb)
    #4 __invoke_impl<void, main()::<lambda()> > /usr/include/c++/11/bits/invoke.h:61 (reproducer+0xc04f)
    #5 __invoke<main()::<lambda()> > /usr/include/c++/11/bits/invoke.h:96 (reproducer+0xbf27)
    #6 _M_invoke<0> /usr/include/c++/11/bits/std_thread.h:259 (reproducer+0xbe2e)
    #7 operator() /usr/include/c++/11/bits/std_thread.h:266 (reproducer+0xbd8a)
    #8 _M_run /usr/include/c++/11/bits/std_thread.h:211 (reproducer+0xbcee)
    #9 <null> <null> (libstdc++.so.6+0xdc252)

The reproducer also shows another warning that appears more often:

WARNING: ThreadSanitizer: data race (pid=607864)
  Read of size 8 at 0x7f16ccd75018 by thread T1 (mutexes: write M43, write M30, write M28):
    #0 memcpy ../../../../src/libsanitizer/sanitizer_common/sanitizer_common_interceptors.inc:827 (libtsan.so.0+0x6243e)
    #1 memcpy ../../../../src/libsanitizer/sanitizer_common/sanitizer_common_interceptors.inc:819 (libtsan.so.0+0x6243e)
    #2 ofi_memcpy include/ofi_hmem.h:263 (libfabric.so.1+0x2dc82)
    #3 operator() /home/piotrc/reproducer/main.cpp:176 (reproducer+0xa1e7)
    #4 __invoke_impl<void, main()::<lambda()> > /usr/include/c++/11/bits/invoke.h:61 (reproducer+0xc0d8)
    #5 __invoke<main()::<lambda()> > /usr/include/c++/11/bits/invoke.h:96 (reproducer+0xbfc6)
    #6 _M_invoke<0> /usr/include/c++/11/bits/std_thread.h:259 (reproducer+0xbe88)
    #7 operator() /usr/include/c++/11/bits/std_thread.h:266 (reproducer+0xbdd4)
    #8 _M_run /usr/include/c++/11/bits/std_thread.h:211 (reproducer+0xbd40)
    #9 <null> <null> (libstdc++.so.6+0xdc252)

  Previous write of size 8 at 0x7f16ccd75018 by thread T2 (mutexes: write M14, write M13083449636225080):
    #0 memcpy ../../../../src/libsanitizer/sanitizer_common/sanitizer_common_interceptors.inc:827 (libtsan.so.0+0x6243e)
    #1 memcpy ../../../../src/libsanitizer/sanitizer_common/sanitizer_common_interceptors.inc:819 (libtsan.so.0+0x6243e)
    #2 ofi_memcpy include/ofi_hmem.h:263 (libfabric.so.1+0x2dc82)
    #3 operator() /home/piotrc/reproducer/main.cpp:250 (reproducer+0xaceb)
    #4 __invoke_impl<void, main()::<lambda()> > /usr/include/c++/11/bits/invoke.h:61 (reproducer+0xc04f)
    #5 __invoke<main()::<lambda()> > /usr/include/c++/11/bits/invoke.h:96 (reproducer+0xbf27)
    #6 _M_invoke<0> /usr/include/c++/11/bits/std_thread.h:259 (reproducer+0xbe2e)
    #7 operator() /usr/include/c++/11/bits/std_thread.h:266 (reproducer+0xbd8a)
    #8 _M_run /usr/include/c++/11/bits/std_thread.h:211 (reproducer+0xbcee)
    #9 <null> <null> (libstdc++.so.6+0xdc252)

@piotrchmiel (Contributor, Author) commented:

@aingerson I just wanted to follow up and ask whether you've had a chance to look at the reproducer I shared for this issue.

@aingerson (Contributor) commented:

@piotrchmiel Hello again! I was having issues building libfabric with ThreadSanitizer (I've never used it before). After doing some research on ThreadSanitizer, I learned that it doesn't do well with code that uses atomics for serialization (which is what shm uses) and is likely to report a lot of false positives because of this. I've looked at the backtraces and code you've provided and don't see how there could be a race there, since those accesses are managed by atomics. So I think it is just a false positive, and it moved down my priority list because of the release. I can revisit, but I suspect ThreadSanitizer will really struggle to analyze shm effectively.
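
If it does turn out to be a false positive, one interim option for keeping the reproducer's output clean is a ThreadSanitizer suppressions file keyed on the functions in your backtraces. This is just a sketch of the standard TSan suppression mechanism (the file name is arbitrary), and it would only be appropriate once we're confident the reports are not real races:

```
# tsan.supp -- only once the reports are confirmed to be false positives
race:smr_name_compare
race:smr_send_name
```

Then run the reproducer with TSAN_OPTIONS="suppressions=tsan.supp" set in the environment.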

@piotrchmiel (Contributor, Author) commented:

Thanks for the update and the detailed explanation regarding the ThreadSanitizer behavior with atomics. I understand that handling these cases can be tricky and prone to false positives.

However, could you perhaps provide a short explanation, with references to the relevant sections of the code, describing how the data race warning is avoided in this case? Specifically, I’m interested in the synchronization approach that ensures a data race should never occur between, for example, smr_name_compare and smr_send_name.

Understanding the synchronization mechanism in play—such as mutexes or atomic operations—would help clarify why this warning might be a false positive. Any insights or pointers to the relevant parts of the code would be greatly appreciated.

@aingerson (Contributor) commented:

@piotrchmiel Absolutely!
The shm communication is all controlled by a circular command queue (cmd_queue) whose writes and reads are serialized with atomics. All buffers are specified through this queue. In other words, I cannot access any buffer in your memory unless you tell me which one I am allowed to read/write from/to, in the form of a command. The claims, inserts, reads, and discards of this queue are all specified in this header.
The flow on the sender side is:

  • next: claims a command slot on the receiver
  • commit: submits it to the receiver to be read

and on the receive side:

  • head: polls the head for a command that has been committed (it will not return a command that is claimed but not yet committed)
  • release: moves the head forward when done with a read command (allowing another writer to claim that slot if needed)

This flow happens on every single send/receive, including the start-up flow, which involves exchanging name information in order to do the shm mapping (if needed). This second part is where send_name and name_compare come into play. The sender claims a command, copies its own name into a local inject buffer, and then commits it to the receiver's command queue. So by the time we get to this commit in send_name, we have specified a unique buffer which no one else can claim (see the freestack implementation for that control). The receiver reads that committed command and then inserts and maps it. The receiver keeps a map of names->region, so it looks in that map to see whether it already knows about this peer (that's where name_compare happens). Once that's all done, it releases the tx_buffer and the command.

So in your backtrace, it looks like ThreadSanitizer is complaining that the tx_buf (which holds the name being sent and then compared on the receive side) is racy between the read and the write. But the flow is claim (atomics), write, commit (atomics), poll (atomics), read, release (atomics), so we should never be reading and writing that buffer at the same time; the command queue atomics control access to it.
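
To make that claim/commit/poll/release pattern concrete, here is a hedged, self-contained sketch of the same general technique using C11 atomics over a bounded ring (a Vyukov-style queue). It illustrates how such a queue serializes access to slot payloads; it is not the actual smr_cmd_queue code, and all names in it (cmd_queue, cq_claim, cq_commit, cq_head, cq_release) are local to the sketch.

```c
/* Hedged sketch of the claim/commit/poll/release pattern described above,
 * using C11 atomics over a bounded ring (Vyukov-style). It is an illustration
 * of the general technique, not the smr_cmd_queue code in libfabric. */
#include <stdatomic.h>
#include <stdint.h>

#define RING_SIZE 16U                       /* must be a power of two */
#define RING_MASK (RING_SIZE - 1)

struct cmd {
    _Atomic uint64_t seq;                   /* slot state: free / claimed / committed */
    char name[64];                          /* payload, e.g. the sender's name */
};

struct cmd_queue {
    _Atomic uint64_t tail;                  /* senders claim here  */
    _Atomic uint64_t head;                  /* receiver polls here */
    struct cmd slot[RING_SIZE];
};

static void cq_init(struct cmd_queue *q)
{
    atomic_init(&q->tail, 0);
    atomic_init(&q->head, 0);
    for (uint64_t i = 0; i < RING_SIZE; i++)
        atomic_init(&q->slot[i].seq, i);
}

/* "next": a sender claims a slot; only the claimer may write its payload. */
static struct cmd *cq_claim(struct cmd_queue *q, uint64_t *pos_out)
{
    uint64_t pos = atomic_load_explicit(&q->tail, memory_order_relaxed);
    for (;;) {
        struct cmd *c = &q->slot[pos & RING_MASK];
        uint64_t seq = atomic_load_explicit(&c->seq, memory_order_acquire);
        if (seq == pos) {
            if (atomic_compare_exchange_weak_explicit(&q->tail, &pos, pos + 1,
                    memory_order_relaxed, memory_order_relaxed)) {
                *pos_out = pos;
                return c;                   /* claimed but not yet visible */
            }
        } else if (seq < pos) {
            return NULL;                    /* queue full */
        } else {
            pos = atomic_load_explicit(&q->tail, memory_order_relaxed);
        }
    }
}

/* "commit": publish the filled slot; the release store orders the payload
 * write before the receiver's acquire load in cq_head(). */
static void cq_commit(struct cmd *c, uint64_t pos)
{
    atomic_store_explicit(&c->seq, pos + 1, memory_order_release);
}

/* "head": the single receiver polls for a committed (not merely claimed)
 * slot; returns NULL if nothing has been committed yet. */
static struct cmd *cq_head(struct cmd_queue *q)
{
    uint64_t pos = atomic_load_explicit(&q->head, memory_order_relaxed);
    struct cmd *c = &q->slot[pos & RING_MASK];
    uint64_t seq = atomic_load_explicit(&c->seq, memory_order_acquire);
    return (seq == pos + 1) ? c : NULL;
}

/* "release": hand the consumed slot back so another sender can claim it. */
static void cq_release(struct cmd_queue *q)
{
    uint64_t pos = atomic_fetch_add_explicit(&q->head, 1, memory_order_relaxed);
    atomic_store_explicit(&q->slot[pos & RING_MASK].seq, pos + RING_SIZE,
                          memory_order_release);
}
```

In this sketch, the release store in cq_commit() pairs with the acquire load in cq_head(), which is what orders the sender's payload write before the receiver's read. That ordering is exactly what the backtraces above are questioning, and ThreadSanitizer only recognizes it when the synchronization is done with atomics it instruments.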
