Fix: uatomic arm32: add missing release barrier before uatomic_xchg
authorMathieu Desnoyers <mathieu.desnoyers@efficios.com>
Mon, 5 Dec 2016 14:35:34 +0000 (09:35 -0500)
committerMathieu Desnoyers <mathieu.desnoyers@efficios.com>
Mon, 5 Dec 2016 14:36:38 +0000 (09:36 -0500)
__sync_lock_test_and_set() only imply a release barrier, but
uatomic_xchg() guarantees both acquire and release barrier semantics.
Therefore, add the missing release barrier.

Signed-off-by: Mathieu Desnoyers <mathieu.desnoyers@efficios.com>
include/urcu/uatomic/arm.h

index fd8fe6abfc6c22348a354e49db5e0690a609d514..95f32f3d47b497c79373c7fcbf71bb65c8aa5dda 100644 (file)
 
 #include <urcu/compiler.h>
 #include <urcu/system.h>
+#include <urcu/arch.h>
 
 #ifdef __cplusplus
 extern "C" {
 #endif
 
 /* xchg */
-#define uatomic_xchg(addr, v) __sync_lock_test_and_set(addr, v)
+
+/*
+ * Based on [1], __sync_lock_test_and_set() is not a full barrier, but
+ * instead only an acquire barrier. Given that uatomic_xchg() acts as
+ * both release and acquire barriers, we therefore need to have our own
+ * release barrier before this operation.
+ *
+ * [1] https://gcc.gnu.org/onlinedocs/gcc-4.1.0/gcc/Atomic-Builtins.html
+ */
+#define uatomic_xchg(addr, v)                          \
+       ({                                              \
+               cmm_smp_mb();                           \
+               __sync_lock_test_and_set(addr, v);      \
+       })
 
 #ifdef __cplusplus
 }
This page took 0.02661 seconds and 4 git commands to generate.