Commits · 4c9a76c311ed1c3103567a1ed3e50a0e9f98de6a · Florian Schmaus / emper

Apr 25, 2022
- Merge branch 'configure-sleep-sem-threhold' into 'master' · 4c9a76c3
  Florian Schmaus authored 2 years ago
  
  make sleep semaphore threshold configurable See merge request i4/manycore/emper!378
  4c9a76c3
Apr 24, 2022
- make sleep semaphore threshold configurable for mazstab · 0da9832c
  Florian Fischer authored 2 years ago
  
  0da9832c
Apr 23, 2022

Merge branch 'inc-sleep-sem-threshold' into 'master' · 2ab1777a
Florian Schmaus authored 2 years ago
```
increase the sleep semaphore threshold

See merge request i4/manycore/emper!377
```
2ab1777a
Merge branch 'pulse-eval' into 'master' · fca60937
Florian Schmaus authored 2 years ago
```
Pulse: initial pulse evaluation commit

See merge request i4/manycore/emper!376
```
fca60937

Merge branch 'fsearch-finer-fiber-control' into 'master' · 75a7ff00

Florian Schmaus authored 2 years ago

fsearch: add more fine grained control about the used fiber throttles

See merge request i4/manycore/emper!375

75a7ff00

increase the sleep semaphore threshold · c54a6bd4

Florian Fischer authored 2 years ago

Also remove the negation of the condition (!> equals <=).

We currently use the semaphore of our sleep strategy very greedy.
This is due to skipping the semaphore's V() operation if we are sure
that it does not kill our progress guaranty.

When scheduling new work from within the runtime we skip the wakeup if
we observe nobody sleeping. This is fine in terms of progress and
limits the amount of atomic operations on global state to a minimum.

Using a threshold of 0 (observe nobody sleeping) however introduces a
race between inserting new work and going to sleep which harm the
latency when the worker goes to sleep and is not notified about the
new work.

This race is common and can be observed in the pulse micro benchmark.
A emper with a threshold of 0 shows high latency compared to using
an io-based sleep strategy or when increasing the threshold.

$ build-release/eval/pulse | python -c "<calc mean>"
Starting pulse evaluation with pulse=1, iterations=30 and utilization=80
mean: 1721970116.425

$ build-increased-sem-threshold/eval/pulse | python -c "<calc mean"
Starting pulse evaluation with pulse=1, iterations=30 and utilization=80
mean: 1000023942.15

$ build-pipe-release/eval/pulse | python -c "<calc mean>
Starting pulse evaluation with pulse=1, iterations=30 and utilization=80
mean: 1000030557.0861111

$ build-pipe-no-completer/eval/pulse | python -c "<calc mean>"
Starting pulse evaluation with pulse=1, iterations=30 and utilization=80
mean: 1000021514.1805556

I could not measure any significant overhead due to the more atomics
on my 16 core machine using the fs-eval on a SSD.

$ ./eval.py -r 50 -i emper-vanilla emper-inc-sem-threshold emper-pipe emper-pipe-no-completer
...
$ ./summarize.py results/1599f44-dirty-pasture/<date>/ -f '{target}-{median} (+- {std})'
duration_time:u:
emper-vanilla-0.202106189 (+- 0.016075981812486713)
emper-inc-sem-threshold-0.2049344115 (+- 0.015348506939891596)
emper-pipe-0.21689131 (+- 0.015198438371285145)
emper-pipe-no-completer-0.1372724185 (+- 0.005865720218833998)

c54a6bd4

Apr 21, 2022
- Pulse: initial pulse evaluation commit · 70871060
  Florian Fischer authored 2 years ago
  
  70871060
Apr 14, 2022
- fsearch: add more fine grained control about the used fiber throttles · 86c71e53
  Florian Fischer authored 2 years ago
  
  86c71e53
Apr 10, 2022

Merge branch 'io-serialization' into 'master' · 1957be38
Florian Schmaus authored 2 years ago
```
add io synchronous option

See merge request i4/manycore/emper!373
```
1957be38
CI: add synchronous IO test · 3c3c48df
Florian Fischer authored 2 years ago

3c3c48df
test/io: skip tests not supported when using synchronous IO · 323e1d77
Florian Fischer authored 2 years ago

323e1d77

ReuseFutureTest: do not use an offset when writing to an eventfd · f4ca362d

Florian Fischer authored 2 years ago

Eventfd's does not support seeking. This is also invalid code:

int main() {
    int64_t b = 42;
    int efd = eventfd(0, 0);
    if (efd < 0) err(EXIT_FAILURE, "creating eventfd failed");
    ssize_t res = pwrite(efd, &b, sizeof(b), 0);
    if (res != sizeof(b)) err(EXIT_FAILURE, "pwrite to evfd failed");
}

f4ca362d

add synchronous io option · 94a00a78

Florian Fischer authored 2 years ago

When EMPER is build with -Dio_synchronous each Future will be
completed synchronously when calling Future::wait().

94a00a78

Apr 08, 2022
- Merge branch 'throttle-recursive-dir-walk' into 'master' · 7c341849
  Florian Schmaus authored 2 years ago
  
  io: improve recursive_directory_walk See merge request i4/manycore/emper!374
  7c341849
Apr 06, 2022

io: improve recursive_directory_walk · d07a81c4

Florian Fischer authored 2 years ago

* Add optional throttle Semaphore pointer to limit the number
  of spawned fn as well as directory walk fibers
* Use const references to the passed functions instead of values
* fsearch: Use max_running as fn and recursion throttle

d07a81c4

Apr 04, 2022
- Merge branch 'improve-echoservers' into 'master' · 819a0246
  Florian Fischer authored 2 years ago
  
  [EchoServers] improve the callback based echoserver See merge request i4/manycore/emper!371
  819a0246
- [EchoServers] improve the callback based echoserver · 0bb7c145
  Florian Fischer authored 2 years ago
  
  * Pass the IO results on the stack instead of storing them in the client object. * Terminate the runtime on quit to print stats. * Free Client objects.
  0bb7c145
- Merge branch 'run-clang-tidy-parallel' into 'master' · e45b2bc4
  Florian Schmaus authored 2 years ago
  
  Run clang tidy in parallel See merge request i4/manycore/emper!372
  e45b2bc4
- [run-clang-tidy] Correctly quote RUN_CLANG_TIDY_CANDIDATES · 89dd8fae
  Florian Schmaus authored 2 years ago
  
  89dd8fae
- [run-clang-tidy] Use -v to check if RUN_CLANG_TIDY is set · d2d289c9
  Florian Schmaus authored 2 years ago
  
  d2d289c9
- [run-clang-tidy] Set -euo pipefail · b9cb98fe
  Florian Schmaus authored 2 years ago
  
  b9cb98fe
- [run-clang-tidy] Run clang-tidy in parallel · 1e90d499
  Florian Schmaus authored 2 years ago
  
  1e90d499
Mar 30, 2022

Merge branch 'test-fixes' into 'master' · 3afd04de

Florian Schmaus authored 2 years ago

[tests] Infer test name from first test source file

See merge request i4/manycore/emper!370

3afd04de

Merge branch 'cancel-future-test-add-assert-msg' into 'master' · f2281308
Florian Schmaus authored 2 years ago
```
add assert message in future callback

See merge request i4/manycore/emper!369
```
f2281308

[tests] Infer test name from first test source file · 40aebf36

Florian Schmaus authored 2 years ago

This also fixes a bug in
	 source = test_dict['source'][0]
so the source(s) used for the test was always only the first source
file. I did/does not matter, as we do not have tests that span
multiple source files (and I am not sure if we ever will have).

40aebf36

add assert message in future callback · 67418925

Florian Fischer authored 2 years ago

Add a message to help investigate CI failures for single-uring
configurations like:
https://gitlab.cs.fau.de/i4/manycore/emper/-/jobs/574828

67418925

Mar 24, 2022
- Merge branch 'meson-full-build-option' into 'master' · 5537bde5
  Florian Schmaus authored 2 years ago
  
  [meson] Add build_only_emper_dep option See merge request i4/manycore/emper!368
  5537bde5
- [meson] Add build_only_emper_dep option · c119e77f
  Florian Schmaus authored 2 years ago
  
  c119e77f
Mar 23, 2022

Merge branch 'assert-in-rts' into 'master' · 8c99ee45

Florian Schmaus authored 2 years ago

[AbstractWorkStealingScheduler] assert in runtime system in pushBottom

See merge request i4/manycore/emper!367

8c99ee45

Mar 21, 2022
- [AbstractWorkStealingScheduler] assert in runtime system in pushBottom · 42d2c275
  Florian Schmaus authored 2 years ago
  
  42d2c275
- Merge branch 'fix-forgotten-futures-on-linux-lt-5.17' into 'master' · 61a0da33
  Florian Schmaus authored 2 years ago
  
  io: do not use IO_FORGOTTEN_FUTURES_SKIP_CQE on linux <= 5.17 See merge request i4/manycore/emper!366
  61a0da33
Mar 17, 2022

io: do not use IO_FORGOTTEN_FUTURES_SKIP_CQE on linux <= 5.17 · ecd0d112

Florian Fischer authored 3 years ago

Apparantly my assumption that io_uring ignores unknown cqe flags is
wrong. And io_uring < Linux 5.17 completes forgotten futures with
-EINVAL.

Fixes: e140759d

ecd0d112

Mar 16, 2022

Merge branch 'overflow-queue-warn' into 'master' · faf89ab5

Florian Schmaus authored 3 years ago

[AbstractWorkStealingScheduler] Emit warning if overflow queue is used

See merge request i4/manycore/emper!365

faf89ab5

Mar 15, 2022

Merge branch 'register-io_urings' into 'master' · 7eb9700e
Florian Schmaus authored 3 years ago
```
incorporate new io_uring features

See merge request i4/manycore/emper!364
```
7eb9700e
[AbstractWorkStealingScheduler] Emit warning if overflow queue is used · 62197ce3
Florian Schmaus authored 3 years ago

62197ce3
warn about failing forgotten futures · 22fa63ba
Florian Fischer authored 3 years ago

22fa63ba

skip cqes for sucessfull forgotten Futures · e140759d

Florian Fischer authored 3 years ago

We can not deal with cqes for forgotten Futures and not creating them
prevents the kernel and userspace overhead introduced by them.

e140759d

LinuxVersion: fix compare of versions with not the same amount of parts · f5c1c97d

Florian Fischer authored 3 years ago

LinuxVersion::compare reported two versions as equal if it could
find a new '.' in the first version but not in the second.
But this is clearly wrong because it skips comparision of the valid last
part.

The comparision 5.16.12 >= 5.18 returned true because compare
reported the version as equal after comparing the first parts and
finding a second '.' in the first but not the second.

Fix this behavior by marking the current position as the last but
do not skip its comparision.

Add tests for the desired behaviour.

f5c1c97d

IoContext: register the worker io_uring fds on Linux >= 5.18 · 8ae1f189

Florian Fischer authored 3 years ago

Linux 5.18 introduces IORING_REGISTER_RING_FDS with
e7a6c00dc77aedf27a601738ea509f1caea6d673.

Registering the io_uring's fd prevents having to look it up for each
io_uring_enter call reducing contention on the process file table.
Jens Axboe reports good results in his fio based benchmarks and
I see no reason for EMPER to not register the io_uring fds, especially
because we never pass or share rings.

Do not register the global io_uring since it is shared in the SINGLE_URING
case or it is passed by the main thread to the completer thread breaking the
assumption liburing has about the registered io_uring fd.

8ae1f189

depend on liburing 2.2 · 171ae9d4

Florian Fischer authored 3 years ago

Liburing 2.2 and Linux 5.18 support IORING_REGISTER_RING_FDS, preventing
the fget(ring_fd) overhead for each io_uring_enter call, as well as
IORING_OP_MSG_RING, greatly simplifying the IO-based sleep strategy code.

171ae9d4