- Dec 06, 2021
-
-
Florian Schmaus authored
-
Florian Schmaus authored
-
- Dec 03, 2021
-
-
Florian Schmaus authored
reduce test load when logging See merge request !282
-
Florian Fischer authored
I suspect some tests which scale with the number of CPUs time out, mostly on jenkins2. This patch reduces the load when logging is active and increases the load when logging is off. Therefore our test builds with debugoptimized will do less work and hopefully only time out when they actually hang, and the release tests will do more.
-
Florian Schmaus authored
load CQ->tail only once during lockless stealing See merge request !281
-
- Dec 02, 2021
-
-
Florian Fischer authored
Currently we load CQ->tail with acquire semantics to determine if we should steal from the victim, and load it again in the actual stealing logic, which also immediately aborts if there are no CQEs to steal. Keep the optimization for the locked case.
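A minimal sketch of the single-load idea; CompletionQueue and tryStealCqe are illustrative names, not EMPER's actual identifiers:

    // Hypothetical reconstruction -- not EMPER's actual implementation.
    #include <atomic>

    struct CompletionQueue {
        std::atomic<unsigned> tail;  // written by the CQ owner / io_uring
        unsigned head = 0;           // consumption point
    };

    // Load the tail once with acquire semantics and reuse the value for both
    // the "anything to steal?" check and the actual steal, instead of loading
    // it a second time inside the stealing logic.
    auto tryStealCqe(CompletionQueue& cq) -> bool {
        const unsigned tail = cq.tail.load(std::memory_order_acquire);
        if (tail == cq.head) return false;  // no CQEs to steal, abort immediately
        // ... consume the CQE at cq.head, using the already-loaded tail ...
        ++cq.head;
        return true;
    }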
-
Florian Schmaus authored
EchoClient: improve the help message See merge request !280
-
- Nov 29, 2021
-
-
Florian Fischer authored
-
- Nov 24, 2021
-
-
Florian Schmaus authored
add concurrent BPS test See merge request !279
-
Florian Schmaus authored
echoclient: add a state variable for debugging See merge request !278
-
- Nov 23, 2021
-
-
Florian Fischer authored
-
Florian Fischer authored
The test introduces multiple cycles of semaphores and a Fiber for each semaphore, blocking on its own semaphore and signaling the next. Through work-stealing the fibers of a cycle should be spread across different workers and thus test concurrent use of BinaryPrivateSemaphores.

Cycle of length 3: Sem A -> Sem B -> Sem C -> Sem A -> ...

Algorithm:
    if isFirstInCycle: signal next
    wait
    if not isFirstInCycle: signal next
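A self-contained sketch of the cycle pattern, using plain C++20 threads and semaphores instead of EMPER Fibers and BinaryPrivateSemaphores (all names illustrative):

    // Hypothetical illustration of the cycle algorithm, not the actual test.
    #include <semaphore>
    #include <thread>
    #include <vector>

    int main() {
        constexpr int cycleLen = 3;  // must match the initializer count below
        constexpr int rounds = 1000;
        // One semaphore per position in the cycle, all initially blocked.
        std::binary_semaphore sems[cycleLen] = {
            std::binary_semaphore{0}, std::binary_semaphore{0}, std::binary_semaphore{0}};

        std::vector<std::thread> threads;
        for (int i = 0; i < cycleLen; ++i) {
            threads.emplace_back([&, i] {
                const bool isFirstInCycle = i == 0;
                auto& next = sems[(i + 1) % cycleLen];
                for (int r = 0; r < rounds; ++r) {
                    if (isFirstInCycle) next.release();   // start the round
                    sems[i].acquire();                    // wait for our predecessor
                    if (!isFirstInCycle) next.release();  // pass the signal along
                }
            });
        }
        for (auto& t : threads) t.join();
    }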
-
- Nov 15, 2021
-
-
Florian Schmaus authored
[PipeSleepStrategy] fix notifyFromAnywhere See merge request !277
-
Florian Fischer authored
Don't decrease the sleeper count in the CAS loop further than -count, which is the threshold we need to ensure that the notification will be observed. Decreasing it further than our threshold is not faulty; it just results in unnecessarily skipped sleeps.

Don't call writeNotifications with a negative count, which would be interpreted as an unsigned value and thus result in writing way too many hints to the pipe, jamming it. If the original value before a successful CAS was already negative, we called writeNotifications with this negative value. This is fixed by using max(toWakeup, 0).
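A hedged sketch of the fixed CAS loop; sleeperCount, writeNotifications and the exact clamping are illustrative reconstructions, not EMPER's actual code:

    // Hypothetical reconstruction of the described fix.
    #include <algorithm>
    #include <atomic>
    #include <cstdint>

    std::atomic<std::int64_t> sleeperCount;
    void writeNotifications(std::uint64_t count);  // writes `count` hints into the pipe

    void notifyFromAnywhere(std::int64_t count) {
        std::int64_t oldValue = sleeperCount.load();
        std::int64_t newValue;
        do {
            // -count is the threshold needed so the notification is observed;
            // decreasing further would only cause unnecessarily skipped sleeps.
            if (oldValue <= -count) return;
            newValue = std::max(oldValue - count, -count);
        } while (!sleeperCount.compare_exchange_weak(oldValue, newValue));

        // oldValue may already have been negative; never pass a negative count,
        // it would be reinterpreted as a huge unsigned value and jam the pipe.
        const std::int64_t toWakeup = std::max(std::min(oldValue, count), std::int64_t{0});
        if (toWakeup > 0) writeNotifications(static_cast<std::uint64_t>(toWakeup));
    }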
-
- Nov 11, 2021
-
-
Florian Schmaus authored
make the victim count in work-stealing configurable See merge request !276
-
- Nov 10, 2021
-
-
Florian Fischer authored
Add two new mutually exclusive meson options:

* work_stealing_victim_count: sets an absolute number of victims
* work_stealing_victim_denominator: sets the victim count to #workers/denominator
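A sketch of how these options might resolve to an actual victim count at compile time; the macro names below are hypothetical stand-ins for whatever the meson options generate:

    // Hypothetical mapping of the options, not EMPER's actual code.
    #include <cstddef>

    auto getVictimCount(std::size_t workerCount) -> std::size_t {
    #if defined(WORK_STEALING_VICTIM_COUNT)
        return WORK_STEALING_VICTIM_COUNT;  // absolute number of victims
    #elif defined(WORK_STEALING_VICTIM_DENOMINATOR)
        return workerCount / WORK_STEALING_VICTIM_DENOMINATOR;
    #else
        return workerCount;  // default: consider all other workers
    #endif
    }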
-
Florian Schmaus authored
[tools] Update check-format (from Mazstab) See merge request !275
-
Florian Schmaus authored
Fixes for clang-tidy 13 See merge request !274
-
Florian Schmaus authored
Sync tools/check-format of EMPER and Mazstab by using the newer Mazstab version of the script.
-
Florian Schmaus authored
While we do not yet have LLVM 13 in the gitlab-ci, I use it on my systems. So fix the new warnings found by clang-tidy 13.
-
- Oct 29, 2021
-
-
Florian Schmaus authored
Random computation echoserver See merge request !272
-
- Oct 28, 2021
-
-
Florian Schmaus authored
Ci debian testing dev bump See merge request !273
-
Florian Schmaus authored
-
Florian Fischer authored
Now three variants of computation are available:

* fixed (echoserver <port> <computation>): always consume <computation> us before sending the echo back to the client.
* random range (echoserver <port> <computation> <computation-max>): consume a random computation uniformly selected from the interval [computation, computation-max] us.
* random min-max (echoserver <port> <computation> <computation-max> <max-probability>): consume either <computation> or <computation-max> us. The max computation is randomly chosen with the specified probability.

All random values are generated with a thread_local mt19937 generator and uniformly distributed with uniform_{int,real}_distribution.
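A minimal sketch of the two random variants as described (thread_local mt19937 plus uniform distributions); the function names and signatures are illustrative:

    // Hypothetical illustration, not the echoserver's actual code.
    #include <chrono>
    #include <cstdint>
    #include <random>

    using Microseconds = std::chrono::microseconds;

    // "random range": uniformly pick a computation time from [minUs, maxUs],
    // using one generator per worker thread.
    auto randomRangeComputation(std::uint64_t minUs, std::uint64_t maxUs) -> Microseconds {
        thread_local std::mt19937 gen{std::random_device{}()};
        std::uniform_int_distribution<std::uint64_t> dist{minUs, maxUs};
        return Microseconds{dist(gen)};
    }

    // "random min-max": take maxUs with probability maxProbability, else minUs.
    auto minMaxComputation(std::uint64_t minUs, std::uint64_t maxUs,
                           double maxProbability) -> Microseconds {
        thread_local std::mt19937 gen{std::random_device{}()};
        std::uniform_real_distribution<double> dist{0.0, 1.0};
        return Microseconds{dist(gen) < maxProbability ? maxUs : minUs};
    }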
-
Florian Fischer authored
This new builder pattern, in addition to a more powerful Runtime constructor, allows the user to pass additional new-worker hooks. This is useful, for example, if an application wants to initialize thread-local variables on each worker. Current code does not need any modification and sees no semantic changes. Future code can use the new RuntimeBuilder class to construct more sophisticated Runtime objects.
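A hedged sketch of the builder idea; RuntimeBuilder is named in the commit, but the hook signature and method names below are hypothetical:

    // Hypothetical sketch, not EMPER's actual RuntimeBuilder API.
    #include <functional>
    #include <utility>
    #include <vector>

    class Runtime { /* ... */ };

    class RuntimeBuilder {
        std::vector<std::function<void(unsigned /*workerId*/)>> newWorkerHooks;

     public:
        // Register a hook run once on every newly started worker, e.g. to
        // initialize thread_local variables.
        auto withNewWorkerHook(std::function<void(unsigned)> hook) -> RuntimeBuilder& {
            newWorkerHooks.push_back(std::move(hook));
            return *this;
        }

        // Hand the collected hooks to the (more powerful) Runtime constructor.
        auto build() -> Runtime { return Runtime{/* ..., newWorkerHooks */}; }
    };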
-
Florian Fischer authored
-
Florian Fischer authored
-
- Oct 13, 2021
-
-
Florian Schmaus authored
[RFC] IO-stealing analogue to work-stealing See merge request !260
-
Florian Fischer authored
The lockless algorithm can now be configured by setting -Dio_lockless_cq=true and the used memory ordering by setting -Dio_lockless_memory_order={weak,strong}.

* io_lockless_memory_order=weak: read with acquire, write with release
* io_lockless_memory_order=strong: read with seq_cst, write with seq_cst
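A sketch of what the two orderings could map to in code; the constant and macro names are illustrative:

    // Hypothetical mapping of the meson option, not EMPER's actual code.
    #include <atomic>

    #ifdef IO_LOCKLESS_MEMORY_ORDER_WEAK
    // weak: pair the consumer's acquire load with the producer's release store.
    inline constexpr std::memory_order cqLoadOrder = std::memory_order_acquire;
    inline constexpr std::memory_order cqStoreOrder = std::memory_order_release;
    #else
    // strong: sequentially consistent loads and stores.
    inline constexpr std::memory_order cqLoadOrder = std::memory_order_seq_cst;
    inline constexpr std::memory_order cqStoreOrder = std::memory_order_seq_cst;
    #endif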
-
Florian Fischer authored
In a running gdb process use:

    source tools/gdb/dump_runtime_state.py

to dump the state of all threads, all WSL queues and all worker IoContexts.
-
Florian Schmaus authored
[fsearch] remove obsolete offset tracking code See merge request !271
-
- Oct 12, 2021
-
-
Florian Fischer authored
-
- Oct 11, 2021
-
-
Florian Fischer authored
This is fixed by using a normal lock instead of the try lock in the OWNER case.
-
Florian Fischer authored
TODO: think about stats and possible ring buffer pointer overflow and ABA.
-
Florian Fischer authored
IO stealing is analogous to work-stealing and means that worker threads without work will try to steal IO completions (CQEs) from other workers' IoContexts. The work-stealing algorithm is modified to check a victim's CQ after finding their work queue empty.

This approach in combination with future additions (global notifications on IO completions, and lock-free CQE consumption) is a realistic candidate to replace the completer thread without losing its benefits.

To allow IO stealing the CQ must be synchronized, which is already the case with the IoContext::cq_lock. Currently stealing workers always try to pop a single CQE (this could be configurable). Steal attempts are recorded in the IoContext's Stats object and successfully stolen IO continuations in the AbstractWorkStealingWorkerStats.

I moved the code transforming CQEs into continuation Fibers from reapCompletions into a separate function to make the rather complicated function more readable and thus easier to understand.

Remove the default CallerEnvironment template arguments to make the code more explicit and prevent easy errors (not propagating the caller environment or forgetting the function takes a caller environment).

io::Stats now need to use atomics because multiple threads may increment them in parallel from EMPER and the OWNER. And since using std::atomic<T*> in std::map is not easily possible, we use the compiler __atomic_* builtins.

Add, adjust and fix some comments.
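An illustrative sketch of the extended stealing loop; every type and method name below is a hypothetical stand-in for EMPER's actual code:

    // Hypothetical reconstruction of the described algorithm.
    #include <optional>

    struct Fiber {};

    struct Victim {
        auto tryStealWork() -> std::optional<Fiber>;  // classic work-stealing
        auto tryStealCqe() -> std::optional<Fiber>;   // pop one CQE, build its continuation
    };

    void schedule(Fiber fiber);

    auto stealFrom(Victim& victim) -> bool {
        // First the classic path: steal a fiber from the victim's work queue.
        if (auto fiber = victim.tryStealWork()) {
            schedule(*fiber);
            return true;
        }
        // New: the victim's work queue is empty, so try to steal a single IO
        // completion (CQE) from its IoContext instead.
        if (auto continuation = victim.tryStealCqe()) {
            schedule(*continuation);
            return true;
        }
        return false;
    }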
-
Florian Fischer authored
The OWNER caller environment can be used when the executed algorithm should differ if the current worker owns the objects it touches. For example, a worker reaping completions on a foreign IoContext may use the EMPER caller environment, while the worker owning the IoContext uses OWNER. Also implement the stream operator to print caller environments.
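A minimal sketch of such an enum with a stream operator; EMPER and OWNER appear in the commits, the rest is illustrative:

    // Hypothetical sketch, not EMPER's actual CallerEnvironment definition.
    #include <ostream>

    enum class CallerEnvironment { EMPER, OWNER };

    inline auto operator<<(std::ostream& os, CallerEnvironment env) -> std::ostream& {
        switch (env) {
            case CallerEnvironment::EMPER: return os << "EMPER";
            case CallerEnvironment::OWNER: return os << "OWNER";
        }
        return os;
    }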
-
Florian Schmaus authored
print runtime stats to the environment variable EMPER_STATS_FILE See merge request !269
-
Florian Fischer authored
* Make all stats print methods accept a std::ostream as output.
* Move the printing of runtime component stats into Runtime::printStats.
* Use Runtime::printStats instead of Runtime::printLastRuntimeStats in ~Runtime, because we are already in a runtime which may differ from Runtime::currentRuntime.
* Write the stats in ~Runtime to a possible file passed in the environment variable EMPER_STATS_FILE.
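A sketch of the environment-variable sink; EMPER_STATS_FILE comes from the commit, the surrounding function is illustrative:

    // Hypothetical illustration, not EMPER's actual code.
    #include <cstdlib>
    #include <fstream>
    #include <iostream>

    void printStats(std::ostream& out);  // all stats print methods take an ostream

    void printStatsToConfiguredSink() {
        if (const char* path = std::getenv("EMPER_STATS_FILE")) {
            std::ofstream file{path};
            printStats(file);
        } else {
            printStats(std::cout);
        }
    }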
-
Florian Schmaus authored
[sleep_strategy/Stats] remove old obsolete stats member See merge request !270
-
Florian Schmaus authored
[Common.hpp] fix CACHE_LINE_EXCLUSIVE macro See merge request !268
-