Commits · 4c9a76c311ed1c3103567a1ed3e50a0e9f98de6a · Lehrstuhl für Informatik 4 (Systemsoftware) / manycore / emper

Apr 10, 2022
- CI: add synchronous IO test · 3c3c48df
  Florian Fischer authored 2 years ago
  
  3c3c48df
Mar 24, 2022
- [meson] Add build_only_emper_dep option · c119e77f
  Florian Schmaus authored 3 years ago
  
  c119e77f
Feb 28, 2022
- Add stats_blocked_context(_count) · 2d43c259
  Florian Schmaus authored 3 years ago
  
  This further split up the stats machinery into smaller parts.
  2d43c259
Feb 26, 2022
- fix build of test waitfree io-stealing · 7d341a2e
  Florian Fischer authored 3 years ago
  
  7d341a2e
Feb 24, 2022
- Add WsClv3Queue and WsClv4Queue · 85b0451c
  Florian Schmaus authored 3 years ago
  
  85b0451c
Feb 18, 2022
- Make the implementations of the work-stealing queue(s) selectable · c7f953bf
  Florian Schmaus authored 3 years ago
  
  This required to break an include cycle between Fibril and LockedQueue.
  c7f953bf
Feb 11, 2022
- [Context] Add option for guard page at the bottom of stack · 7fb99a46
  Florian Schmaus authored 3 years ago
  
  7fb99a46
Feb 07, 2022

Add support for continuation stealing · cf3ac3ed

Florian Schmaus authored 3 years ago


Thanks to Nicolas Pfeiffer for writing the initial prototypical
implementation of continuation stealing and the cactus stack
mechanism, on which this is based.

Co-authored-by: Nicolas Pfeiffer <pfeiffer@cs.fau.de>

cf3ac3ed

Jan 27, 2022
- [CI] Move iwyu and clang-tidy out of the smoke-test stage · 1b662ec6
  Florian Schmaus authored 3 years ago
  
  1b662ec6
- [CI] Bump container image to 1.20 · 70227007
  Florian Schmaus authored 3 years ago
  
  70227007
Jan 22, 2022
- [CI] add test without IO · d3073b6c
  Florian Fischer authored 3 years ago
  
  d3073b6c
Jan 21, 2022

disable throttle wakeup strategy until it works with scheduleOn · 8cb084c9
Florian Fischer authored 3 years ago

8cb084c9

add semaphore using futex_waitv(2) supporting notify_specific · 96a846a1

Florian Fischer authored 3 years ago

The SpuriousFutex2Semaphore is able to notify a specific worker
by using two futexes two wait on.

One working like a normal semaphore used for global non specific
notifications via notify() and notify_many().

And a second one per worker which is based on a SleeperState.
To notify a specific worker we change its SleeperState to Notified
and call FUTEX_WAKE if needed.

96a846a1

Jan 15, 2022
- Optionally build with libc++ · 932ded46
  Florian Schmaus authored 3 years ago
  
  932ded46
Dec 14, 2021
- [CI] build sqpoll variants · c59fb81b
  Florian Fischer authored 3 years ago
  
  c59fb81b
Dec 13, 2021

[CI] add fast checks for various emper variants · a740f980

Florian Fischer authored 3 years ago

A "fast check" consists of our smoke tests and the fast static analysis
this ensures that the emper variants even build successfully and
are not totally broken.

a740f980

Dec 10, 2021

[CI] print the number of available CPUs · 2653df7e
Florian Fischer authored 3 years ago

2653df7e

Introduce waitfree workstealing · 1c538024

Florian Fischer authored 3 years ago

Waitfree work stealing is configured with the meson option
'waitfree_work_stealing'.

The retry logic is intentionally left in the Queues and not lifted to
the scheduler to reuse the load of an unsuccessful CAS.

Consider the following pseudo code examples:

steal() -> bool:                       steal() -> res
  load                                   load
loop:                                    if empty return EMPTY
  if empty return EMPTY                  cas
  cas                                    return cas ? STOLEN : LOST_RACE
  if not WAITFREE and not cas:
    goto loop                          outer():
  return cas ? STOLEN : LOST_RACE      loop:
                                         res = steal()
outer():                                 if not WAITFREE and res == LOST_RACE:
  steal()                                  goto loop

In the right example the value loaded by a possible unsuccessful CAS
can not be reused. And a loop of unsuccessful CAS' will result in
double loads.

The retries are configurable through a template variable maxRetries.
* maxRetries < 0: indefinitely retries
* maxRetries >= 0: maxRetries

1c538024

Dec 08, 2021
- [gitlab-ci] Cache subprojects/packagecache · c3472767
  Florian Schmaus authored 3 years ago
  
  c3472767
Dec 06, 2021
- [gitlab-ci] Bump container image to 1.17 · ba55e55a
  Florian Schmaus authored 3 years ago
  
  ba55e55a
Oct 28, 2021
- [gitlab-ci] Update debian-testing-dev from 1.14 to 1.15 · e41ddcc1
  Florian Schmaus authored 3 years ago
  
  e41ddcc1
Oct 13, 2021

[meson] introduce lockless memory order and rename lockless option · 67b0c77a

Florian Fischer authored 3 years ago

The lockless algorithm can now be configured by setting -Dio_lockless_cq=true
and the used memory ordering by setting -Dio_lockless_memory_order={weak,strong}.

io_lockless_memory_order=weak:
    read with acquire
    write with release

io_lockless_memory_order=strong:
    read with seq_cst
    write with seq_cst

67b0c77a

Oct 11, 2021

[IoContext] implement lockless CQ reaping · d9d350d9
Florian Fischer authored 3 years ago
```
TODO: think about stats and possible ring buffer pointers overflow and ABA.
```
d9d350d9

implement IO stealing · 0abc29ad

Florian Fischer authored 3 years ago

IO stealing is analog to work-stealing and means that worker thread
without work will try to steal IO completions (CQEs) from other worker's
IoContexts. The work stealing algorithm is modified to check a victims
CQ after findig their work queue empty.

This approach in combination with future additions (global notifications
on IO completions, and lock free CQE consumption) are a realistic candidate
to replace the completer thread without loosing its benefits.

To allow IO stealing the CQ must be synchronized which is already the
case with the IoContext::cq_lock.
Currently stealing workers always try to pop a single CQE (this could
be configurable).
Steal attempts are recorded in the IoContext's Stats object and
successfully stolen IO continuations in the AbstractWorkStealingWorkerStats.

I moved the code transforming CQEs into continuation Fibers from
reapCompletions into a seperate function to make the rather complicated
function more readable and thus easier to understand.

Remove the default CallerEnvironment template arguments to make
the code more explicit and prevent easy errors (not propagating
the caller environment or forgetting the function takes a caller environment).

io::Stats now need to use atomics because multiple thread may increment
them in parallel from EMPER and the OWNER.
And since using std::atomic<T*> in std::map is not easily possible we
use the compiler __atomic_* builtins.

Add, adjust and fix some comments.

0abc29ad

Oct 04, 2021

[WakeupStrategy] fix the throttle algorithm for notifiaction from anywhere · baedc874

Florian Fischer authored 3 years ago

The throttle algorithm had the same problem like our sleep algorithms
where notifications from anywhere may race with a worker going to
sleep resulting in lost wakeups.
In the sleep strategy we prevent those races by preventing sleep attempts
when notifing from anywhere.
The throttle algorithm also does now exactly this. A notifier from anywhere
will now always set the WakeupStrategy state to notified.
If the state was previously pending this new approach does not differ from
the previous behavior and a sleeping worker will be notified.
If the state was waking the waking worker skips its sleep if it observes
the WakeupStrategy state as notified.

baedc874

Sep 27, 2021

[log] improve timestamp scalability and increase LogBuffer size · 442ead84

Florian Fischer authored 3 years ago

std::localtime takes a global lock and is therefore not scalable and
inapplicable for analyzing timing sensible bugs.
Introduce a new option to add UTC timestamps. This allows on my system
to double the CPU load while using mmapped logging.

Also increase the LogBuffer size from 1MB to 1GB because I had some
crashes where a renewed buffer was still used.

442ead84

Sep 24, 2021
- [CI] disable throttle test until it is fixed · 13191760
  Florian Fischer authored 3 years ago
  
  13191760
Sep 21, 2021
- [CI] add test for the throttle wakeup strategy · a1713408
  Florian Fischer authored 3 years ago
  
  a1713408
- [CI] add more test for various io configuration · c6d29327
  Florian Fischer authored 3 years ago
  
  * single io_uring * pipe sleep strategy * pipe sleep strategy without completer
  c6d29327
- [CI] print ulimit · b8350c34
  Florian Fischer authored 3 years ago
  
  b8350c34
- [CI] enable IO in the now buster based CI · ba51a599
  Florian Fischer authored 3 years ago
  
  ba51a599
Sep 13, 2021

[Debug] implement logging to a memory-mapped log file · ad10eb3a

Florian Fischer authored 3 years ago


When setting the environment variable EMPER_LOG_FILE=<logfile> EMPER
will write its log messages to <logfile> instead of stderr.
This removes the need for the mutex protecting std::cerr as well as
multiple write calls to flush std:cerr.

To efficiently write log messages to the log file the algorithm uses
three memory 1MiB mapped views into <logfile> to store the log messages.
One buffer is active, one is new, and one is old.
The next buffer ensures that threads can write log messages even if the
active buffer would overflows.
The old buffer allows slower threads to still write safely while everyone
else uses the active buffer.
When a thread changes from the active buffer to the new buffer it
is responsible to renew the current old buffer and changing the semantic
of the buffers:

* active -> old
* next -> active
* old -> next

This buffer scheme allows wait-free logging.
But the approach is NOT sound because delayed thread may still use the
old buffer which gets renewed by the first thread touching the next buffer.
But the likeliness for this situation decreases with bigger sizes of the
buffers.

ATTENTION: Using SCHED_IDLE for the completer may break this likeliness
assumption.

Add new CI test job with mmaped log files.

This contains code cleanups
Suggested-By: Florian Schmaus <flow@cs.fau.de>

ad10eb3a

Jul 26, 2021
- [CI] bump docker image to 1.14 · 933860f7
  Florian Fischer authored 3 years ago
  
  933860f7
May 28, 2021
- [gitlab-ci] Bump flowdalic/debian-testing-dev to 1.12 · c1eb0f12
  Florian Schmaus authored 3 years ago
  
  c1eb0f12
May 20, 2021
- [gitlab-ci] Bump debian-testing-dev image to 1.10 · 1ff04e8e
  Florian Schmaus authored 3 years ago
  
  1ff04e8e
May 05, 2021
- [Blockable] Set affinity on block · 2680c470
  Florian Schmaus authored 3 years ago
  
  2680c470
Apr 12, 2021
- [gitlab-ci] Update flowdalic/debian-testing-dev to 1.8 · af4cf471
  Florian Schmaus authored 3 years ago
  
  af4cf471
- [gitlab-ci] Fix test-with-stats · 1468eb91
  Florian Schmaus authored 3 years ago
  
  The EMPER option Meson option for stats is called 'stats' not 'worker_stats'.
  1468eb91
- [iwyu] Take load average into consideration on CI · ed93dc6e
  Florian Schmaus authored 3 years ago
  
  This required that we backport iwyu_tool.py from https://github.com/include-what-you-use/include-what-you-use/pull/891 into tools/, which supports --load.
  ed93dc6e
Mar 22, 2021
- [gitlab-ci] Save meson-logs as artifact, add meson junit report · 6bad2b9c
  Florian Schmaus authored 4 years ago
  
  6bad2b9c