Commits · 5c7e3e9b6e7607cebe5dc9e476e3368d84a7ce6d · Lehrstuhl für Informatik 4 (Systemsoftware) / manycore / emper

Dec 10, 2021

[meson] introduce dependencies to io configuration options · 5c7e3e9b
Florian Fischer authored 3 years ago

5c7e3e9b

Introduce waitfree workstealing · 1c538024

Florian Fischer authored 3 years ago

Waitfree work stealing is configured with the meson option
'waitfree_work_stealing'.

The retry logic is intentionally left in the Queues and not lifted to
the scheduler to reuse the load of an unsuccessful CAS.

Consider the following pseudo code examples:

steal() -> bool:                       steal() -> res
  load                                   load
loop:                                    if empty return EMPTY
  if empty return EMPTY                  cas
  cas                                    return cas ? STOLEN : LOST_RACE
  if not WAITFREE and not cas:
    goto loop                          outer():
  return cas ? STOLEN : LOST_RACE      loop:
                                         res = steal()
outer():                                 if not WAITFREE and res == LOST_RACE:
  steal()                                  goto loop

In the right example the value loaded by a possible unsuccessful CAS
can not be reused. And a loop of unsuccessful CAS' will result in
double loads.

The retries are configurable through a template variable maxRetries.
* maxRetries < 0: indefinitely retries
* maxRetries >= 0: maxRetries

1c538024

Dec 09, 2021
- [IoContext] use a static_assert instead of runtime assert · 49e4d36f
  Florian Fischer authored 3 years ago
  
  49e4d36f
- Merge branch 'stealing-changes' into 'master' · c7670d99
  Florian Schmaus authored 3 years ago
  
  Multiple changes to improve IO stealing See merge request !288
  c7670d99
Dec 08, 2021
- [IoContext] make number of cqes to reap as template parameter · a336d96f
  Florian Fischer authored 3 years ago
  
  This has the benefit of adequat sized intermediate arrays reducing the needed stack size.
  a336d96f
- Merge branch 'cache-meson-packagecache' into 'master' · 4f25e7f2
  Florian Schmaus authored 3 years ago
  
  [gitlab-ci] Cache subprojects/packagecache See merge request !287
  4f25e7f2
- [io/Stats] fix comment referring to non present members · 58aceeb4
  Florian Fischer authored 3 years ago
  
  58aceeb4
- [IoContext] add debug helper to track the requests in an io_uring · 799e5055
  Florian Fischer authored 3 years ago
  
  799e5055
- [IoContext] use intermediate c-array during reaping · 84cf13b8
  Florian Fischer authored 3 years ago
  
  This removes the rather expensive (reported by perf) initialization of the std::arrays.
  84cf13b8
- [IoContext] add waitfree reaping of a single CQE · 3d1f2608
  Florian Fischer authored 3 years ago
  
  To distinguish the outcomes of the waitfree reap attempt a new enum StealingResult::{Empty, LostRace, Stolen} is introduced.
  3d1f2608
- Merge branch 'iwyu-0.17' into 'master' · 7e460851
  Florian Schmaus authored 3 years ago
  
  Fix includes (as reported by IWYU 0.17) and update CI container image See merge request !285
  7e460851
- [gitlab-ci] Cache subprojects/packagecache · c3472767
  Florian Schmaus authored 3 years ago
  
  c3472767
Dec 06, 2021

Merge branch 'auto-check-aq-while-stealing' into 'master' · f5278a2a
Florian Schmaus authored 3 years ago
```
[meson] set check_anywhere_queue_while_steal automatic

See merge request !286
```
f5278a2a

[meson] set check_anywhere_queue_while_stealing automatic · 7da8e687

Florian Fischer authored 3 years ago

We introduced the check_anywhere_queue_while_steal configuration
as an optimization to get the IO completions reaped by the completer
faster into the normal WSQ.
But now the emper has configurations where we don't use a completer
thus making this optimization useless or rather harmful.

By default automatically decide the value of
check_anywhere_queue_while_stealing based on the value of
io_completer_behavior.

7da8e687

Merge branch 'do-not-record-io-steal-attempts' into 'master' · 3b2d6f8e
Florian Schmaus authored 3 years ago
```
[io/Stats] do not record steal attempts

See merge request !284
```
3b2d6f8e
[gitlab-ci] Bump container image to 1.17 · ba55e55a
Florian Schmaus authored 3 years ago

ba55e55a
Fix includes (as reported by IWYU 0.17) · 42f5a853
Florian Schmaus authored 3 years ago

42f5a853

Dec 05, 2021

[io/Stats] do not record steal attempts · 100f8532

Florian Fischer authored 3 years ago

Recording every steal attempt is rather excessive and we are not doing
it for normal work.
Flamegraphs have show io-stealing takes considerable more time
than normal work stealing because of the recording of steal attempts,
especially if we are using atomics to aggregate them.

100f8532

Dec 03, 2021

Merge branch 'adjust-test-load' into 'master' · 2548d32c
Florian Schmaus authored 3 years ago
```
reduce test load when logging

See merge request !282
```
2548d32c

reduce test load when logging · 905fb18b

Florian Fischer authored 3 years ago

I suspect some test which scale whith the number of CPUs to timeout
mostly on jenkins2.
This patch reduces the load when logging is active and increases the
load when logging is off.
Therefore our test build with debugoptimized will do less and hopefully
only timeout when they actually hung and the release test will do
more.

905fb18b

Merge branch 'optimize-lockless-stealing' into 'master' · d829d688
Florian Schmaus authored 3 years ago
```
load CQ->tail only once during lockless stealing

See merge request !281
```
d829d688

Dec 02, 2021

load CQ->tail only once during lockless stealing · 2abd53e0

Florian Fischer authored 3 years ago

Currently we load the CQ->tail with acquire semantic to determine
if we should steal from teh victim and load it again in the actual
stealing logic which will also immediately abort if there are no
CQEs to steal.

Keep the optimization for the locked case.

2abd53e0

Merge branch 'improve-echo-client-help' into 'master' · 8e7c2676
Florian Schmaus authored 3 years ago
```
EchoClient: improve the help message

See merge request !280
```
8e7c2676

Nov 29, 2021
- EchoClient: improve the help message · 992b6813
  Florian Fischer authored 3 years ago
  
  992b6813
Nov 24, 2021
- Merge branch 'add-bps-test' into 'master' · a892f8f1
  Florian Schmaus authored 3 years ago
  
  add concurrent BPS test See merge request !279
  a892f8f1
- Merge branch 'echoclient-add-debug-state' into 'master' · 743686d4
  Florian Schmaus authored 3 years ago
  
  echoclient: add a state variable for debugging See merge request !278
  743686d4
Nov 23, 2021

echoclient: add a state variable for debugging · 127f6296
Florian Fischer authored 3 years ago

127f6296

add concurrent BPS test · 65a593bc

Florian Fischer authored 3 years ago

The test introduces multiple cycles of Semaphores and
a Fiber for each semaphore blocking and signaling the next.
Through work-stealing the fibers from a cycle should be spread
across different workers and thus test concurrent use of
BinaryPrivateSemaphores.

Cycle of length 3: Sem A -> Sem B -> Sem C -> Sem A -> ...
Algorithm:
	if isFirstInCycle
		signal next

	wait

	if not isFirstInCycle
		signal next

65a593bc

Nov 15, 2021

Merge branch 'fix_pipe_sleep_notifyFromAnywhere' into 'master' · cc63bd70
Florian Schmaus authored 3 years ago
```
[PipeSleepStrategy] fix notifyFromAnywhere

See merge request !277
```
cc63bd70

[PipeSleepStrategy] fix notifyFromAnywhere · d31442ad

Florian Fischer authored 3 years ago

Don't decrease the sleeper count in the CAS loop further than
-count, which is the threshold we need to ensure that the notification
will be observed.
Decreasing it further than our threshold is not faulty it just results
in unnecessary skipped sleeps.

Don't call writeNotifications with a negative count.
Which will be interpreted as an unsigned value and thus results
in writing way to much hints to the pipe, jamming it.
If the original value before a successfully CAS is already negative
we called writeNotifications with this negative value.
This is fixed by using max(toWakeup, 0).

d31442ad

Nov 11, 2021
- Merge branch 'configure-work-stealing-victim-count' into 'master' · c58a9143
  Florian Schmaus authored 3 years ago
  
  make the victim count in work-stealing configurable See merge request !276
  c58a9143
Nov 10, 2021

make the victim count in work-stealing configurable · cd06496d

Florian Fischer authored 3 years ago

Add two new mutual exclusive meson_options:
* work_stealing_victim_count: Which sets an absolute number of victims
* work_stealing_victim_denominator: Set victim count to #workers/denominator

cd06496d

Merge branch 'update-check-format' into 'master' · 189075cd
Florian Schmaus authored 3 years ago
```
[tools] Update check-format (from Mazstab)

See merge request !275
```
189075cd
Merge branch 'clang-tidy-13' into 'master' · 75eb34a5
Florian Schmaus authored 3 years ago
```
Fixes for clang-tidy 13

See merge request !274
```
75eb34a5
[tools] Update check-format (from Mazstab) · 6adb7047
Florian Schmaus authored 3 years ago
```
Sync tools/check-format of EMPER and Mazstab by using the newer
Mazstab version of the script.
```
6adb7047

Fixes for clang-tidy 13 · dfa64867

Florian Schmaus authored 3 years ago

While we do not have yet LLVM 13 in the gitlab-ci, I use it on my
systems. So fix the new warnings found with clang-tidy 13.

dfa64867

Oct 29, 2021
- Merge branch 'random-computation-echoserver' into 'master' · 6e9c20de
  Florian Schmaus authored 3 years ago
  
  Random computation echoserver See merge request !272
  6e9c20de
Oct 28, 2021

Merge branch 'ci-debian-testing-dev-bump' into 'master' · fbe0b98f
Florian Schmaus authored 3 years ago
```
Ci debian testing dev bump

See merge request !273
```
fbe0b98f
[gitlab-ci] Update debian-testing-dev from 1.14 to 1.15 · e41ddcc1
Florian Schmaus authored 3 years ago

e41ddcc1

[EchoSever] implement random computations variants · 964278bc

Florian Fischer authored 3 years ago

Now three variants of computation are available:

* fixed (echoserver <port> <computation>:
   This will always consume computation us before sending the echo
   back to the client.
* random range (echoserver <port> <computation> <computation-max>:
   This will consume a random computation uniformly selected
   from the interval [computation, computation-max] us.
* random min-max (echoserver <port> <computation> <computation-max> <max-probability>
   This will either consume computation or computation-max us.
   The max computation is randomly chosen with the specified probability.

All random values are generated with a thread_local mt19937 generator
and uniformly distributed with uniform_{int,real}_distribution.

964278bc