  1. Jan 15, 2022
  2. Dec 14, 2021
  3. Dec 13, 2021
  4. Dec 10, 2021
    • [CI] print the number of available CPUs · 2653df7e
      Florian Fischer authored
    • Introduce waitfree workstealing · 1c538024
      Florian Fischer authored
      Waitfree work stealing is configured with the meson option
      'waitfree_work_stealing'.
      
      The retry logic is intentionally left in the Queues and not lifted to
      the scheduler to reuse the load of an unsuccessful CAS.
      
      Consider the following pseudo code examples:
      
      Retry inside the queue (EMPER's approach):

        steal() -> bool:
          load
        loop:
          if empty return EMPTY
          cas
          if not WAITFREE and not cas:
            goto loop
          return cas ? STOLEN : LOST_RACE

        outer():
          steal()

      Retry lifted to the scheduler:

        steal() -> res:
          load
          if empty return EMPTY
          cas
          return cas ? STOLEN : LOST_RACE

        outer():
        loop:
          res = steal()
          if not WAITFREE and res == LOST_RACE:
            goto loop
      
      In the second example the value loaded by an unsuccessful CAS cannot be
      reused, and a loop of unsuccessful CAS attempts results in double loads.
      
      The number of retries is configurable through the template variable maxRetries:
      * maxRetries < 0: retry indefinitely
      * maxRetries >= 0: retry at most maxRetries times
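
      Below is a minimal C++ sketch of such a bounded retry loop around the CAS.
      The queue layout, member names and the StealResult values are illustrative
      assumptions, not EMPER's actual code; only the retry semantics follow the
      description above.

        #include <atomic>
        #include <cstddef>
        #include <cstdint>

        enum class StealResult { Stolen, Empty, LostRace };

        template <typename Item, int maxRetries>
        class StealableQueue {
         public:
          StealResult steal(Item& out) {
            int retries = 0;
            uint64_t top = _top.load(std::memory_order_acquire);
            for (;;) {
              if (top == _bottom.load(std::memory_order_acquire)) return StealResult::Empty;
              out = _items[top % kCapacity];
              // A failed CAS writes the freshly observed value back into 'top',
              // so the next iteration reuses that load instead of issuing a new one.
              if (_top.compare_exchange_weak(top, top + 1, std::memory_order_seq_cst))
                return StealResult::Stolen;
              // maxRetries < 0: retry indefinitely; otherwise give up after maxRetries retries.
              if (maxRetries >= 0 && ++retries > maxRetries) return StealResult::LostRace;
            }
          }

         private:
          static constexpr std::size_t kCapacity = 1024;
          std::atomic<uint64_t> _top{0};
          std::atomic<uint64_t> _bottom{0};
          Item _items[kCapacity];
        };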
  5. Dec 08, 2021
  6. Dec 06, 2021
  7. Oct 28, 2021
  8. Oct 13, 2021
    • [meson] introduce lockless memory order and rename lockless option · 67b0c77a
      Florian Fischer authored
      The lockless algorithm can now be configured by setting -Dio_lockless_cq=true
      and the used memory ordering by setting -Dio_lockless_memory_order={weak,strong}.
      
      io_lockless_memory_order=weak:
          read with acquire
          write with release
      
      io_lockless_memory_order=strong:
          read with seq_cst
          write with seq_cst
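
      A short C++ sketch of how the two settings could map onto std::memory_order
      constants; the preprocessor define is a hypothetical stand-in for whatever
      the meson option actually generates:

        #include <atomic>

        // Hypothetical define selected by -Dio_lockless_memory_order=weak.
        #ifdef IO_LOCKLESS_MEMORY_ORDER_WEAK
        inline constexpr std::memory_order kCqRead = std::memory_order_acquire;
        inline constexpr std::memory_order kCqWrite = std::memory_order_release;
        #else  // io_lockless_memory_order=strong
        inline constexpr std::memory_order kCqRead = std::memory_order_seq_cst;
        inline constexpr std::memory_order kCqWrite = std::memory_order_seq_cst;
        #endif

        // Usage when reaping completions (cqHead/cqTail are the ring's atomic indices):
        //   unsigned head = cqHead->load(kCqRead);
        //   ... consume CQEs between head and cqTail ...
        //   cqHead->store(newHead, kCqWrite);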
  9. Oct 11, 2021
    • [IoContext] implement lockless CQ reaping · d9d350d9
      Florian Fischer authored
      TODO: think about stats and possible ring buffer pointer overflow and ABA.
    • implement IO stealing · 0abc29ad
      Florian Fischer authored
      IO stealing is analogous to work-stealing: worker threads without work
      try to steal IO completions (CQEs) from other workers' IoContexts. The
      work-stealing algorithm is modified to check a victim's CQ after finding
      its work queue empty.
      
      This approach, in combination with future additions (global notifications
      on IO completions and lock-free CQE consumption), is a realistic candidate
      to replace the completer thread without losing its benefits.
      
      To allow IO stealing the CQ must be synchronized, which is already the
      case with the IoContext::cq_lock.
      Currently stealing workers always try to pop a single CQE (this could
      be configurable).
      Steal attempts are recorded in the IoContext's Stats object and
      successfully stolen IO continuations in the AbstractWorkStealingWorkerStats.
      
      I moved the code transforming CQEs into continuation Fibers from
      reapCompletions into a separate function to make the rather complicated
      function more readable and thus easier to understand.
      
      Remove the default CallerEnvironment template arguments to make
      the code more explicit and prevent easy errors (not propagating
      the caller environment or forgetting that the function takes one).
      
      io::Stats now need to use atomics because multiple threads may increment
      them in parallel from EMPER and the OWNER.
      And since using std::atomic<T*> in std::map is not easily possible we
      use the compiler's __atomic_* builtins.
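
      A minimal sketch of the builtin-based increment on plain counters kept in a
      std::map; the map and function names here are illustrative only:

        #include <cstdint>
        #include <map>
        #include <string>

        // Plain integers in the map, incremented atomically via the GCC/Clang
        // __atomic_* builtins, because std::atomic is neither copyable nor movable
        // and therefore awkward to keep in a std::map.
        std::map<std::string, uint64_t> stealAttempts;

        void recordStealAttempt(const std::string& victim) {
          // Entries are created up front, so at() never modifies the map structure;
          // only the counter itself is accessed concurrently by owner and stealers.
          __atomic_fetch_add(&stealAttempts.at(victim), 1, __ATOMIC_RELAXED);
        }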
      
      Add, adjust and fix some comments.
  10. Oct 04, 2021
    • [WakeupStrategy] fix the throttle algorithm for notifications from anywhere · baedc874
      Florian Fischer authored
      The throttle algorithm had the same problem as our sleep algorithms:
      notifications from anywhere may race with a worker going to sleep,
      resulting in lost wakeups.
      In the sleep strategy we prevent those races by preventing sleep attempts
      when notifying from anywhere.
      The throttle algorithm now does exactly the same. A notifier from anywhere
      always sets the WakeupStrategy state to notified.
      If the state was previously pending, this does not differ from the
      previous behavior and a sleeping worker will be notified.
      If the state was waking, the waking worker skips its sleep when it
      observes the WakeupStrategy state as notified.
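
      A rough sketch of the state handling described above; the state names and
      helper functions are paraphrased assumptions, not EMPER's actual interface:

        #include <atomic>

        enum class WakeupState { Pending, Waking, Notified };
        std::atomic<WakeupState> state{WakeupState::Pending};

        void wakeOneWorker();       // hypothetical: post the worker wakeup semaphore
        void sleepUntilNotified();  // hypothetical: block on the semaphore

        void notifyFromAnywhere() {
          // Always move to Notified so a concurrently waking worker cannot miss it;
          // if the state was Pending this matches the previous behavior and a
          // sleeping worker gets woken.
          WakeupState previous = state.exchange(WakeupState::Notified);
          if (previous == WakeupState::Pending) wakeOneWorker();
        }

        void throttleSleep() {
          // Runs in the worker that previously set the state to Waking.
          // If a notification arrived in the meantime, skip the sleep entirely.
          WakeupState expected = WakeupState::Waking;
          if (!state.compare_exchange_strong(expected, WakeupState::Pending)) {
            // The CAS failed because the state is Notified: consume it and return.
            state.store(WakeupState::Pending);
            return;
          }
          sleepUntilNotified();
        }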
  11. Sep 27, 2021
    • [log] improve timestamp scalability and increase LogBuffer size · 442ead84
      Florian Fischer authored
      std::localtime takes a global lock and is therefore not scalable and
      unsuitable for analyzing timing-sensitive bugs.
      Introduce a new option to use UTC timestamps instead. On my system this
      allows doubling the CPU load while using mmapped logging.
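
      A small sketch of the difference; the formatting function is illustrative,
      not the actual logging code:

        #include <cstddef>
        #include <ctime>
        #include <time.h>  // gmtime_r (POSIX)

        // std::localtime converts through the timezone database behind a global
        // lock and returns a pointer to shared static storage:
        //   std::tm* local = std::localtime(&now);
        // UTC timestamps can instead use the reentrant gmtime_r, which needs no
        // timezone conversion and writes into caller-provided storage:
        void formatUtcTimestamp(char* buf, std::size_t len) {
          std::time_t now = std::time(nullptr);
          std::tm utc{};
          gmtime_r(&now, &utc);
          std::strftime(buf, len, "%H:%M:%S", &utc);
        }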
      
      Also increase the LogBuffer size from 1MB to 1GB because I had some
      crashes where a renewed buffer was still used.
  12. Sep 24, 2021
  13. Sep 21, 2021
  14. Sep 13, 2021
    • [Debug] implement logging to a memory-mapped log file · ad10eb3a
      Florian Fischer authored
      
      When setting the environment variable EMPER_LOG_FILE=<logfile> EMPER
      will write its log messages to <logfile> instead of stderr.
      This removes the need for the mutex protecting std::cerr as well as
      the multiple write calls used to flush std::cerr.
      
      To efficiently write log messages to the log file the algorithm uses
      three 1 MiB memory-mapped views into <logfile> to store the log messages.
      One buffer is active, one is next, and one is old.
      The next buffer ensures that threads can write log messages even if the
      active buffer overflows.
      The old buffer allows slower threads to still write safely while everyone
      else uses the active buffer.
      When a thread changes from the active buffer to the next buffer it
      is responsible for renewing the current old buffer and rotating the
      roles of the buffers:
      
      * active -> old
      * next -> active
      * old -> next
      
      This buffer scheme allows wait-free logging.
      But the approach is NOT sound because delayed thread may still use the
      old buffer which gets renewed by the first thread touching the next buffer.
      But the likeliness for this situation decreases with bigger sizes of the
      buffers.
      
      ATTENTION: Using SCHED_IDLE for the completer may break this likelihood
      assumption.
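
      A condensed sketch of the role rotation only; the struct and names are
      illustrative, and all synchronization around the rotation is left out:

        #include <cstddef>

        struct MappedView {
          char* base;        // start of the mmapped 1 MiB window into <logfile>
          std::size_t used;  // bytes already claimed by log writers
        };

        MappedView* active;  // current target of log writes
        MappedView* next;    // takes over once 'active' runs full
        MappedView* old;     // still writable for threads that claimed space earlier

        // Called by the thread that switches from the active to the next buffer.
        void rotateBuffers() {
          MappedView* previousOld = old;
          old = active;        // active -> old
          active = next;       // next -> active
          next = previousOld;  // old -> next, after remapping it further into <logfile>
          // remap(previousOld) would advance its window here (omitted).
        }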
      
      Add a new CI test job with mmapped log files.
      
      This contains code cleanups.
      Suggested-By: Florian Schmaus <flow@cs.fau.de>
  15. Jul 26, 2021
  16. May 28, 2021
  17. May 20, 2021
  18. May 05, 2021
  19. Apr 12, 2021
  20. Mar 22, 2021
  21. Mar 01, 2021
  22. Feb 25, 2021
  23. Feb 23, 2021
    • [WorkerWakeupSemaphore] add three possible implementations · 3cde3e16
      Florian Fischer authored
      LockedSemaphore is the already existing Semaphore using
      a mutex and a condition variable.
      PosixMutex is a thin wrapper around a POSIX semaphore.
      SpuriousFutexSemaphore is an atomic/futex-based implementation
      prone to spurious wakeups, which is fine for the worker wakeup use case.
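
      A compact sketch of the atomic/futex flavour, using C++20 atomic wait/notify
      as a stand-in for the raw futex syscall; the class name and layout are
      illustrative, not the actual implementation:

        #include <atomic>
        #include <cstdint>

        // Eventcount-style wakeup "semaphore": wakeups may be spurious (a woken
        // worker can find no work), which the worker-wakeup use case tolerates.
        class SpuriousSemaphoreSketch {
          std::atomic<uint32_t> epoch{0};

         public:
          // Take a ticket first, then re-check for work, then wait on the ticket.
          uint32_t prepareWait() { return epoch.load(std::memory_order_acquire); }

          void wait(uint32_t ticket) {
            epoch.wait(ticket, std::memory_order_acquire);  // blocks while epoch == ticket
          }

          void notify() {
            epoch.fetch_add(1, std::memory_order_release);
            epoch.notify_one();  // futex wake on Linux
          }
        };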
  24. Feb 20, 2021
    • [Makefile] fix smoke-test/static-analysis target · 1b39754b
      Florian Schmaus authored
      This adds yet another target "smoke-test-suite", which just runs all
      tests in the 'smoke' test suite. The static-analysis target was
      changed to include *all* static analysis checks we have, even 'doc',
      as Doxygen also does some checks that the documentation is
      "correct".
      
      The smoke-test target is also kept, as it allows developers to simply
      run all smoke tests. Furthermore, this adds some missing PHONY
      declarations in the Makefile.
      
      The gitlab-ci now runs the smoke-test-suite and static-analysis
      targets in two different jobs. Previously the smoke-test would also
      run the static-analysis target, which was not intended.
  25. Feb 02, 2021
  26. Feb 01, 2021
  27. Jan 26, 2021
  28. Jan 14, 2021
  29. Jan 13, 2021
  30. Jan 11, 2021
  31. Dec 17, 2020
  32. Dec 05, 2020