Commits · 96d27755d3cef892aef112730b4d6177095e88fc · Lehrstuhl für Informatik 4 (Systemsoftware) / manycore / emper

Aug 11, 2021

Merge branch 'random-worker-id' into 'master' · 96d27755
Florian Schmaus authored 3 years ago
```
[AbstractWorkStealingScheduler] Get rid of "rand() % workerCount"

See merge request !229
```
96d27755

[AbstractWorkStealingScheduler] Get rid of "rand() % workerCount" · bf8cf516

Florian Schmaus authored 3 years ago

The "rand() % workerCount" constructed used in the work-stealing
scheduler is flawed. It has a bias toward lower worker IDs due the
modulo operation. This is something I always wanted to get rid of, but
never found the time to do it. Until know.

Get rid of it and replace it with
std::uniform_int_distribution<workerid_t> (as field the Worker
instance).

The main changes in AbstractWorkStealingScheduler are
- use currentWorker->nextRandomWorkerId() (instead of the flawed construct)
- currentWorker->getWorkerId() (instead of Runtime::getWorkerId())

bf8cf516

Aug 10, 2021
- Merge branch 'stats-blocked-contexts-high-water' into 'master' · 381f6f39
  Florian Schmaus authored 3 years ago
  
  [stats] blocked contexts See merge request !224
  381f6f39
Aug 09, 2021
- [stats] Add stats for amount, purpose, and affinity of blocked contexts · 8a7c0c22
  Florian Schmaus authored 3 years ago
  
  8a7c0c22
- [LockedSet] Add insertAndGetSize() · 67f93b95
  Florian Schmaus authored 3 years ago
  
  67f93b95
- [WorkerLocalData] Reserve space in std::vector · 379ea507
  Florian Schmaus authored 3 years ago
  
  379ea507
- Merge branch 'distributed-echo-client' into 'master' · 987c17e1
  Florian Schmaus authored 3 years ago
  
  Support distributing multiple echoclient over the network See merge request !228
  987c17e1
- add submit variant using iterators to submit multiple Futures at once · ad3d2fc1
  Florian Fischer authored 3 years ago
  
  ad3d2fc1
Aug 08, 2021
- [EchoClient] add network barrier support to synchronize multiple clients · 174c2326
  Florian Fischer authored 3 years ago
  
  174c2326
- implement a Coordinator · 16ed71a6
  Florian Fischer authored 3 years ago
  
  The Coordinator is used for our echo evaluation and implements a barrier style synchronization mechanism for processes spread across the network.
  16ed71a6
Aug 02, 2021
- Merge branch 'build-without-filesystem' into 'master' · ed1b0c8f
  Florian Schmaus authored 3 years ago
  
  [meson] allow building EMPER on systems whithout <filesystem> See merge request !221
  ed1b0c8f
- Merge branch 'work-stealing-improvements' into 'master' · 64b617a5
  Florian Schmaus authored 3 years ago
  
  Add meson option for "check anywhere queue while stealing" See merge request !207
  64b617a5
- Add check_anywhere_queue_while_stealing meson option · 94c099e2
  Florian Schmaus authored 3 years ago
  
  94c099e2
- Merge branch 'add_more_timeouts' into 'master' · d317c7b0
  Florian Schmaus authored 3 years ago
  
  [io.hpp] add blocking functions using timeouts See merge request !226
  d317c7b0
- [AbstractWorkStealingScheduler] Increase DEQUEUE_FROM_ANYWHERE_MAX to 512 · 8338d9a2
  Florian Schmaus authored 3 years ago
  
  8338d9a2
- Merge branch 'stats-queue-high-water' into 'master' · c6721c50
  Florian Schmaus authored 3 years ago
  
  [stats] Add max-queue-length stats to AbstractWorkStealingScheduler See merge request !223
  c6721c50
- [stats] Add max-queue-length stats to AbstractWorkStealingScheduler · a448bfb1
  Florian Schmaus authored 3 years ago
  
  a448bfb1
Jul 29, 2021
- [io.hpp] add blocking functions using timeouts · 5f29bfc8
  Florian Fischer authored 3 years ago
  
  5f29bfc8
- Merge branch 'echoserver-print-setup' into 'master' · d6e35882
  Florian Fischer authored 3 years ago
  
  [echoclient] print a short description of parameters See merge request !225
  d6e35882
Jul 28, 2021
- [echoclient] print a short description of parameters · 07590a87
  Florian Fischer authored 3 years ago
  
  07590a87
Jul 27, 2021
- Merge branch 'add_docker_tooling' into 'master' · 6bded656
  Florian Schmaus authored 3 years ago
  
  Add docker tooling See merge request !219
  6bded656
Jul 26, 2021

[meson] allow building EMPER on systems whithout <filesystem> · 6753d982

Florian Fischer authored 3 years ago

Check if std::filesystem::recursive_directory_iterator and std::filesystem::path
are available before using those in EMPER code.

We do not export the symbols using the not supported filesystem features
in our public headers using preprocessor ifdef.

But the code in the cpp files using it not removed using the preprocessor.
To allow linkage we use a constexpr which throws a logic_error on runtime
rendering the rest of the code dead und thus prevents its generation by
the compiler.
This methods allows the compiler to see the code in its analysis passes
but does not fail during linking.

Allow meson.build files in emper/ subdirectories add configuration options
by consuming the conf_data object after all subdirectories were visited.

Introduce a quasi naming standard for cpp feature flags in meson code:
	cpp_has_<namespace>_<feature>

Examples:
	cpp_has_fs_path

6753d982

[CI] bump docker image to 1.14 · 933860f7
Florian Fischer authored 3 years ago

933860f7

add docker tooling · 5798f15c

Florian Fischer authored 3 years ago

Usage run "docker.sh <your command>" to execute <your command> in the
docker image extracted from .gitlab-ci.yml in the emper root directory

NOTE: seccomp filtering is disabled for now since io_uring_* syscalls
are not working everywhere as expected.

5798f15c

Merge branch 'fsearch-allow-stats-reporting' into 'master' · c60a2484
Florian Schmaus authored 3 years ago
```
[fsearch] gracefully terminate the runtime to print the collected stats

See merge request !222
```
c60a2484
[fsearch] gracefully terminate the runtime to print the collected stats · b5634c12
Florian Fischer authored 3 years ago

b5634c12
Merge branch 'fix-PipeSleepStrategy-destructor' into 'master' · 884b9563
Florian Schmaus authored 3 years ago
```
[PipeSleepStrategy] use C++ smart ptrs instead of manual memory management

See merge request !220
```
884b9563

[PipeSleepStrategy] use C++ smart ptrs instead of manual memory management · b7f93c8c

Florian Fischer authored 3 years ago

The destructor of PipeSleepStrategy caused segmentation faults when
running optimized. Because the stats pointer is not initialized it was
possibly to pass a garbage pointer to delete. Now we use a well defined
C++ smart pointer which fixes the problem and is more idiomatic anyway.

b7f93c8c

Merge branch 'improve-reap-completions' into 'master' · 29736cf9
Florian Schmaus authored 3 years ago
```
[IoContext] wrap CQ locking in if constexpr

See merge request !218
```
29736cf9

[IoContext] wrap CQ locking in if constexpr · 1055a7aa

Florian Fischer authored 3 years ago

We don't need to pay the overhead of the atomic operations on each
dispatch loop if there is no concurrent access to worker CQs.

1055a7aa

Jul 23, 2021
- Merge branch 'improve-fsearch' into 'master' · 1e495961
  Florian Schmaus authored 3 years ago
  
  Further improve fsearch See merge request !217
  1e495961
- add and use emper native recursive directory walk · 41d31c71
  Florian Fischer authored 3 years ago
  
  41d31c71
Jul 21, 2021
- [fsearch] use emper IO instead of printf · dbd489ff
  Florian Fischer authored 3 years ago
  
  dbd489ff
- Merge branch 'improve-fsearch' into 'master' · d3080cef
  Florian Schmaus authored 3 years ago
  
  Improve fsearch See merge request !215
  d3080cef
- Merge branch 'add_ioc_prefix_to_log' into 'master' · 46cc46b0
  Florian Schmaus authored 3 years ago
  
  [Debug] prefix log messages from the I/O completer thread with "IOC" See merge request !216
  46cc46b0
- [fsearch] add global semaphore to limit the amount of concurrent requests · 08bbef86
  Florian Fischer authored 3 years ago
  
  08bbef86
- [fsearch] close files after searching them · 11aa4a38
  Florian Fischer authored 3 years ago
  
  11aa4a38
- [Debug] prefix log messages from the I/O completer thread with "IOC" · 5a8bdbd3
  Florian Fischer authored 3 years ago
  
  5a8bdbd3
Jul 15, 2021
- Merge branch 'pipe-sleep-strategy' into 'master' · 7f845e8a
  Florian Schmaus authored 3 years ago
  
  Implement sleep strategy using the IO subsystem See merge request !214
  7f845e8a
Jul 14, 2021

implement a pipe based sleep strategy using the IO subsystem · 4ec30fd4

Florian Fischer authored 3 years ago

Design goals
============

* Wakeup either on external newWork notifications or on local IO completions
  -> Sleep strategy is sound without the IO completer
* Do as less as possible in a system saturated with work
* Pass a hint where to find new work to suspended workers

Algorithm
=========

Data:
	Global:
		hint pipe
		sleepers count
	Per worker:
		dispatch hint buffer
		in flight flag

Sleep:
	if we have no sleep request in flight
		Atomic increment sleep count
		Remember that we are sleeping
		Prepare read cqe from the hint pipe to dispatch hint buffer
	Prevent the completer from reaping completions on this worker's IoContext
	Wait until IO completions occurred

NotifyEmper(n):
	if observed sleepers <= 0
		return

	// Determine how many we are responsible to wake
	do
		toWakeup = min(observed sleepers, n)
	while (!CAS(sleepers, toWakeup))

	write toWakeup hints to the hint pipe

NotifyAnywhere(n):
	// Ensure all n notifications take effect
	while (!CAS(sleepers, observed sleepers - n))
		if observed sleeping <= -n
			return

	toWakeup = min(observed sleeping, n)
	write toWakeup hints to the hint pipe

onNewWorkCompletion:
	reset in flight flag
	allow completer to reap completions on this IoContext

Notes
=====

* We must decrement the sleepers count on the notifier side to
  prevent multiple notifiers to observe all the same amount of sleepers,
  trying to wake up the same sleepers by writing to the pipe and jamming it up
  with unconsumed hints and thus blocking in the notify write resulting
  in a deadlock.
* The CAS loops on the notifier side are needed because decrementing
  and incrementing the excess is racy: Two notifier can observe the
  sum of both their excess decrement and increment to much resulting in a
  broken counter.
* Add the dispatch hint code in AbstractWorkStealingScheduler::nextFiber.
  This allows workers to check the dispatch hint after there
  where no local work to execute.
  This is a trade-off where we trade slower wakeup - a just awoken worker
  will check for local work - against a faster dispatch hot path when
  we have work to do in our local WSQ.
* The completer tread must not reap completions on the IoContexts of
  sleeping workers because this introduces a race for cqes and a possible
  lost wakeup if the completer consumes the completions before the worker
  is actually waiting for them.
* When notifying sleeping workers from anywhere we must ensure that all
  notifications take effect. This is needed for example when terminating
  the runtime to prevent sleep attempt from worker thread which are
  about to sleep but have not incremented the sleeper count yet.
  We achieve this by always decrementing the sleeper count by the notification
  count.

Thanks to Florian Schmaus <flow@cs.fau.de> for spotting bugs and suggesting
improvements.

4ec30fd4