opendc.git - The OpenDC repository.

Age	Commit message	Author
2022-11-27	refactor(compute/service): Do not split interface and implementation	Fabian Mastenbroek
	This change inlines the implementation of the compute service into the `ComputeService` interface. We do not intend to provide multiple implementations of the service. In addition, this approach makes more sense for a Java implementation.
2022-11-27	refactor(compute/service): Expose state directly to clients	Fabian Mastenbroek
	This change updates the implementation of the compute service to expose state to clients created by the compute service.
2022-11-27	refactor(compute/api): Do not suspend in compute API	Fabian Mastenbroek
	This change updates the API interface of the OpenDC Compute service to not suspend execution using Kotlin Coroutines. The suspending modifiers were introduced in case the ComputeClient would communicate with the service over a network connection. However, the main use-case has been together with the ComputeService, where the suspending modifiers only frustrate the user experience when writing experiments. Furthermore, with the advent of Project Loom, it is not necessarily a problem to block the (virtual) thread during network communications.
2022-11-13	refactor: Replace use of CoroutineContext by Dispatcher	Fabian Mastenbroek
	This change replaces the use of `CoroutineContext` for passing the `SimulationDispatcher` across the different modules of OpenDC by the lightweight `Dispatcher` interface of the OpenDC common module.
2022-11-13	refactor(sim/core): Re-implement SimulationScheduler as Dispatcher	Fabian Mastenbroek
	This change updates the `SimulationScheduler` class to implement the `Dispatcher` interface from the OpenDC Common module, so that OpenDC modules only need to depend on the common module for dispatching future tasks (possibly in simulation).
2022-11-13	refactor: Use InstantSource as time source	Fabian Mastenbroek
	This change updates the modules of OpenDC to always accept the `InstantSource` interface as source of time. Previously we used `java.time.Clock`, but this class is bound to a time zone which does not make sense for our use-cases. Since `java.time.Clock` implements `java.time.InstantSource`, it can be used in places that require an `InstantSource` as parameter. Conversion from `InstantSource` to `Clock` is also possible by invoking `InstantSource#withZone`.
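The relationship between `Clock` and `InstantSource` described above can be illustrated with a minimal sketch that uses only the standard `java.time` API (Java 17 or later). The class `TimeSourceSketch` and the helper `elapsedMillis` are hypothetical names chosen for illustration and are not part of OpenDC.

```java
import java.time.Clock;
import java.time.Instant;
import java.time.InstantSource;
import java.time.ZoneOffset;

final class TimeSourceSketch {
    /** Hypothetical consumer that only needs instants, not a time zone. */
    static long elapsedMillis(InstantSource source, Instant start) {
        return source.millis() - start.toEpochMilli();
    }

    public static void main(String[] args) {
        Instant start = Instant.parse("2022-11-13T00:00:00Z");

        // A fixed source stands in for a simulated time source here.
        InstantSource fixed = InstantSource.fixed(start.plusSeconds(5));
        System.out.println(elapsedMillis(fixed, start)); // 5000

        // java.time.Clock implements InstantSource, so it can be passed directly.
        Clock clock = Clock.fixed(start, ZoneOffset.UTC);
        System.out.println(elapsedMillis(clock, start)); // 0

        // Converting back to a Clock requires choosing a zone explicitly.
        Clock zoned = fixed.withZone(ZoneOffset.UTC);
        System.out.println(zoned.instant().equals(fixed.instant())); // true
    }
}
```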
2022-11-04	refactor: Use RandomGenerator as randomness source	Fabian Mastenbroek
	This change updates the modules of OpenDC to always accept the `RandomGenerator` interface as source of randomness. This interface is implemented by the slower `java.util.Random` class, but also by the faster `java.util.SplittableRandom` class.
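As a rough illustration of why accepting `RandomGenerator` is convenient, the sketch below passes both a `java.util.Random` and a `java.util.SplittableRandom` to the same method (Java 17 or later). The class `RandomSourceSketch` and the helper `pickIndex` are hypothetical and do not appear in OpenDC.

```java
import java.util.Random;
import java.util.SplittableRandom;
import java.util.random.RandomGenerator;

final class RandomSourceSketch {
    /** Hypothetical helper: picks an index in [0, size) with the supplied generator. */
    static int pickIndex(RandomGenerator random, int size) {
        return random.nextInt(size);
    }

    public static void main(String[] args) {
        // Both classes implement RandomGenerator, so callers choose the implementation.
        RandomGenerator legacy = new Random(42);
        RandomGenerator splittable = new SplittableRandom(42);

        System.out.println(pickIndex(legacy, 10));
        System.out.println(pickIndex(splittable, 10));
    }
}
```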
2022-10-31	feat(sim/compute): Add support for snapshotting workloads	Fabian Mastenbroek
	This change updates the interface of `SimWorkload` to support snapshotting workloads. We introduce a new method `snapshot()` to this interface which returns a new `SimWorkload` that can be started at a later point in time and on another `SimMachine`, which continues progress from the moment the workload was snapshotted.
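The idea of snapshotting can be sketched with a toy workload that tracks remaining work. This is only an illustration under simplifying assumptions: `CountdownWorkload`, `advance`, and `remaining` are hypothetical names, and the real `SimWorkload` interface is more involved.

```java
final class SnapshotSketch {
    /** Hypothetical workload that counts down remaining work units. */
    static final class CountdownWorkload {
        private long remaining;

        CountdownWorkload(long remaining) {
            this.remaining = remaining;
        }

        /** Performs up to the given number of work units. */
        void advance(long units) {
            remaining = Math.max(0, remaining - units);
        }

        long remaining() {
            return remaining;
        }

        /** Returns an independent copy that resumes from the current progress. */
        CountdownWorkload snapshot() {
            return new CountdownWorkload(remaining);
        }
    }

    public static void main(String[] args) {
        CountdownWorkload original = new CountdownWorkload(100);
        original.advance(30);

        // The snapshot could later be started elsewhere; it keeps the progress made so far.
        CountdownWorkload copy = original.snapshot();
        original.advance(50);

        System.out.println(original.remaining()); // 20
        System.out.println(copy.remaining());     // 70
    }
}
```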
2022-10-28	perf(compute/sim): Use static logger field	Fabian Mastenbroek
	This change updates the `Guest` class implementation to use a static logger field instead of allocating a new logger for every guest.
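The pattern can be sketched as follows. To keep the example self-contained it uses `java.util.logging` from the standard library, which is an assumption made for illustration and not necessarily the logging API used by OpenDC; `GuestSketch` is a hypothetical stand-in for the `Guest` class.

```java
import java.util.logging.Logger;

final class GuestSketch {
    // One logger shared by all instances, looked up once when the class is initialized.
    private static final Logger LOGGER = Logger.getLogger(GuestSketch.class.getName());

    private final String name;

    GuestSketch(String name) {
        this.name = name;
    }

    void start() {
        // The supplier defers building the message until the level is enabled.
        LOGGER.fine(() -> "Starting guest " + name);
    }

    /** Exposed only so the sketch can show that the logger is shared. */
    static Logger logger() {
        return LOGGER;
    }

    public static void main(String[] args) {
        new GuestSketch("vm-1").start();
        new GuestSketch("vm-2").start();
    }
}
```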
2022-10-28	refactor(compute/service): Do not suspend on guest start	Fabian Mastenbroek
	This change updates the `Host` interface to remove the suspend modifiers from the start, stop, spawn, and delete methods of this interface. We now assume that the host immediately launches the guest on invocation of these methods.
2022-10-28	feat(compute/sim): Model host boot time	Fabian Mastenbroek
	This change updates `SimHost` to support modeling the time and resource consumption it takes to boot the host. The boot procedure is modeled as a `SimWorkload`.
2022-10-28	refactor(compute/sim): Use workload chaining for boot delay	Fabian Mastenbroek
	This change updates the implementation of `SimHost` to use workload chaining for modelling boot delays. Previously, this was implemented by sleeping 1 millisecond using Kotlin coroutines. With this change, we remove the need for coroutines and instead use the `SimDurationWorkload` to model the boot delay. In the future, we envision a user-supplied stochastic boot model to model the boot delay for VM instances.
2022-10-28	feat(sim/compute): Add completion parameter to startWorkload	Fabian Mastenbroek
	This change updates the interface of `SimMachine#startWorkload` to introduce a parameter `completion` that is invoked when the workload completes either successfully or due to failure. This functionality has often been implemented by wrapping a `SimWorkload` and catching its exceptions. However, since this functionality is used in all usages of `SimMachine#startWorkload` we instead embed it into `SimMachine` itself.
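A completion parameter of this kind might look roughly like the sketch below. It is a simplified, synchronous illustration: the `Workload` interface, the `Consumer<Throwable>` callback type, and the convention that `null` signals success are all assumptions and do not reflect the actual signature of `SimMachine#startWorkload`.

```java
import java.util.function.Consumer;

final class CompletionSketch {
    /** Hypothetical workload abstraction that may fail with an exception. */
    interface Workload {
        void run() throws Exception;
    }

    /** Runs the workload and reports the outcome: null on success, the exception on failure. */
    static void startWorkload(Workload workload, Consumer<Throwable> completion) {
        Throwable cause = null;
        try {
            workload.run();
        } catch (Exception e) {
            cause = e;
        }
        // Callers no longer need to wrap the workload to observe its outcome.
        completion.accept(cause);
    }

    public static void main(String[] args) {
        startWorkload(() -> { }, cause -> System.out.println("ok: " + (cause == null)));
        startWorkload(
                () -> { throw new IllegalStateException("boom"); },
                cause -> System.out.println("failed: " + cause.getMessage()));
    }
}
```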
2022-10-21	fix: Add log4j-core dependency	Fabian Mastenbroek
	This change adds the log4j-core dependency to various modules of OpenDC using log4j2, to ensure logging keeps working. The upgrade to SLF4J 2.0 broke the Log4j2 functionality, since the log4j-core artifact is not automatically shipped with the SLF4J implementation.
2022-10-21	refactor(sim/compute): Re-implement using flow2	Fabian Mastenbroek
	This change re-implements the OpenDC compute simulator framework using the new flow2 framework for modelling multi-edge flow networks. The re-implementation is written in Java and focusses on performance and clean API surface.
2022-10-10	fix(compute/service): Expose number of registered servers	Fabian Mastenbroek
	This change updates the compute service telemetry to also expose the number of servers that are registered with the service.
2022-10-06	build: Switch to Spotless for formatting	Fabian Mastenbroek
	This change updates the build configuration to use Spotless for code formatting of both Kotlin and Java.
2022-10-06	style: Eliminate use of wildcard imports	Fabian Mastenbroek
	This change updates the repository to remove the use of wildcard imports everywhere. Wildcard imports are not allowed by default by Ktlint as well as Google's Java style guide.
2022-10-05	refactor(sim/core): Rename runBlockingSimulation to runSimulation	Fabian Mastenbroek
	This change renames the method `runBlockingSimulation` to `runSimulation` to put more emphasis on the simulation part of the method. The blocking part is not that important, but this behavior is still described in the method documentation.
2022-10-05	refactor(sim/core): Use SimulationScheduler in coroutine dispatcher	Fabian Mastenbroek
	This change updates the implementation of `SimulationDispatcher` to use a (possibly user-provided) `SimulationScheduler` for managing the execution of the simulation and future tasks.
2022-10-03	refactor(exp/compute): Integrate compute workload classes	Fabian Mastenbroek
	This change integrates the classes from the old `opendc-compute-workload` module into the `opendc-experiments-compute` module. This new module contains helper classes for setting up experiments with the OpenDC compute service.
2022-10-03	feat(exp/compute): Add provisioners for compute service	Fabian Mastenbroek
	This change adds a new module `opendc-experiments-compute` that provides provisioner implementations for experiments to use for setting up the compute service of OpenDC and provisioning (simulated) hosts.
2022-09-23	refactor(compute): Provide access to instances in compute service	Fabian Mastenbroek
	This change updates the interface of `ComputeService` to provide access to the instances (servers) that have been registered with the compute service. This allows metric collectors to query the metrics of the servers that are currently running.
2022-09-22	refactor(compute): Pass failure model during workload evaluation	Fabian Mastenbroek
	This change updates the `ComputeServiceHelper` class to provide the failure model via a parameter to the `run` method instead of constructor parameter. This separates the construction of the topology from the simulation of the workload.
2022-09-22	refactor(sim/compute): Simplify SimHypervisor class	Fabian Mastenbroek
	This change simplifies the SimHypervisor class into a single implementation. Previously, it was implemented as an abstract class with multiple implementations for each multiplexer type. We now pass the multiplexer type as parameter to the SimHypervisor constructor.
2022-09-22	refactor(sim/compute): Make interference domain independent of profile	Fabian Mastenbroek
	This change updates the virtual machine performance interference model so that the interference domain can be constructed independently of the interference profile. As a consequence, the construction of the topology no longer depends on the interference profile.
2022-09-22	refactor(compute): Simplify constructor of SimHost	Fabian Mastenbroek
	This change updates the constructor of SimHost to receive a `SimBareMetalMachine` and `SimHypervisor` directly instead of constructing these objects itself. This ensures better testability and also simplifies the constructor of this class, especially when future changes to `SimBareMetalMachine` or `SimHypervisor` change their constructors.
2022-09-22	refactor(compute): Add separate error host state	Fabian Mastenbroek
	This change adds a new HostState to indicate that the host is in an error state as opposed to being purposefully unavailable.
2022-09-22	refactor(sim/compute): Extract Random dependency from interference model	Fabian Mastenbroek
	This change moves the Random dependency outside the interference model, to allow the interference model to be completely immutable and passable between different simulations.
2022-09-21	refactor(sim/compute): Move interference logic into VmInterferenceMember	Fabian Mastenbroek
	This change updates the design of the VM interference model, where we move more of the logic into the `VmInterferenceMember` interface. This removes the dependency on the `VmInterferenceModel` for the hypervisor interface.
2022-09-21	refactor(sim/compute): Pass interference key via parameter	Fabian Mastenbroek
	This change updates the signature of the `SimHypervisor` interface to accept a `VmInterferenceKey` when creating a new virtual machine, instead of providing a string identifier. This is in preparation for removing the dependency on the `VmInterferenceModel` in the `SimAbstractHypervisor` class.
2022-09-21	refactor(sim/compute): Remove timestamp parameter from SimTrace	Fabian Mastenbroek
	This change removes the timestamp parameter from `SimTrace`. Instead, it is now assumed that the trace is continuous and the end of a fragment starts a new fragment, in order to simplify replaying of the trace.
2022-09-21	feat(compute): Add support for affinity scheduling (#101)	Fabian Mastenbroek
	This change adds support for (anti-)affinity scheduling of servers onto hosts, which happens at the compute service level. In the future, we might add support for server groups, which also enables soft (anti-)affinity scheduling. Implements #26
	## Implementation Notes
	* Add `DifferentHostFilter` to schedule instances on different hosts from a set of instances.
	* Add `SameHostFilter` to schedule instances on the same hosts as a set of instances.
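The two filters can be pictured as predicates over candidate hosts. The sketch below is a simplified illustration in which hosts are plain strings and the set of hosts already occupied by the reference instances is given up front; `AffinitySketch`, `differentHost`, `sameHost`, and `candidates` are hypothetical names, and the real `DifferentHostFilter` and `SameHostFilter` operate on OpenDC's own host and server types.

```java
import java.util.List;
import java.util.Set;
import java.util.function.Predicate;
import java.util.stream.Collectors;

final class AffinitySketch {
    /** Anti-affinity: accept only hosts that run none of the reference instances. */
    static Predicate<String> differentHost(Set<String> occupiedHosts) {
        return host -> !occupiedHosts.contains(host);
    }

    /** Affinity: accept only hosts that already run one of the reference instances. */
    static Predicate<String> sameHost(Set<String> occupiedHosts) {
        return occupiedHosts::contains;
    }

    /** Applies a filter to the list of hosts, preserving their order. */
    static List<String> candidates(List<String> hosts, Predicate<String> filter) {
        return hosts.stream().filter(filter).collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<String> hosts = List.of("host-a", "host-b", "host-c");
        Set<String> occupied = Set.of("host-b");

        System.out.println(candidates(hosts, differentHost(occupied))); // [host-a, host-c]
        System.out.println(candidates(hosts, sameHost(occupied)));      // [host-b]
    }
}
```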
2022-06-23	bug(compute/workload): Fix conversion from UUID to Binary	Fabian Mastenbroek
	This change fixes an issue with the metric exporting code in OpenDC where a UUID is not converted correctly into a `Binary` object that is consumed by the Apache Parquet library.
2022-06-07	refactor(trace/api): Introduce type system for trace API	Fabian Mastenbroek
	This change updates the trace API by introducing a limited type system for the table columns. Previously, the table columns could have any possible type representable by the JVM. With this change, we limit the available types to a small type system.
2022-05-06	refactor(telemetry): Remove dependency on OpenTelemetry SDK	Fabian Mastenbroek
	This change removes the dependency on the OpenTelemetry SDK. Instead, we'll only expose metrics via the OpenTelemetry API in the future via adapter classes.
2022-05-06	refactor(compute/service): Remove OpenTelemetry from "compute" modules	Fabian Mastenbroek
	This change removes the OpenTelemetry integration from the OpenDC Compute modules. Previously, we chose to integrate OpenTelemetry to provide a unified way to report metrics to the users. Although this worked as expected, the overhead of OpenTelemetry when collecting metrics during simulation was considerable and it offered few further optimization opportunities (other than providing a separate API implementation). Furthermore, since we were tied to OpenTelemetry's SDK implementation, we experienced issues with throttling and registering multiple instruments. We will instead use another approach, where we expose the core metrics in OpenDC via specialized interfaces (see the commits before) such that access is fast and can be done without having to interface with OpenTelemetry. In addition, we will provide an adapter that is able to forward these metrics to OpenTelemetry implementations, so we can still integrate with the wider ecosystem.
2022-05-06	refactor(telemetry/compute): Support direct metric access	Fabian Mastenbroek
	This change introduces a `ComputeMetricReader` class that can be used as a replacement for the `CoroutineMetricReader` class when reading metrics from the Compute service. This implementation operates directly on a `ComputeService` instance, providing better performance.
2022-05-04	refactor(compute): Directly expose scheduler stats to user	Fabian Mastenbroek
	This change updates the `ComputeService` interface to directly expose statistics about the scheduler to the user, such that they do not necessarily have to interact with OpenTelemetry to obtain these values.
2022-05-04	feat(compute): Add support for looking up hosts	Fabian Mastenbroek
	This change adds the ability for users to look up the `Host` on which a `Server` is hosted (if any). This allows the user to potentially interact with the `Host` directly, e.g., in order to obtain advanced metrics.
2022-05-03	refactor(compute): Expose CPU and system stats via Host interface	Fabian Mastenbroek
	This change updates the `Host` interface to directly expose CPU and system stats to be used by components that interface with the `Host` interface. Previously, this would require the user to interact with the OpenTelemetry SDK. Although that is still possible for more advanced usage cases, users can use the following methods to easily access common host and guest statistics.
2022-05-02	refactor(compute): Do not use Avro when exporting experiment data	Fabian Mastenbroek
	This change updates the `ParquetDataWriter` class to not use the `parquet-avro` library for exporting experiment data, but instead to use the low-level APIs to directly write the data in Parquet format.
2022-05-01	refactor(trace/parquet): Support custom ReadSupport implementations	Fabian Mastenbroek
	This change updates the `LocalParquetReader` implementation to support custom `ReadSupport` implementations, so we do not have to rely on the Avro implementation necessarily.
2022-04-24	build: Move modules into subgroups	Fabian Mastenbroek
	This change updates the Gradle build configuration of the project to publish the different types of modules (e.g., opendc-compute, opendc-simulator) into their own groups.
2022-04-23	build: Enable testing for all library modules	Fabian Mastenbroek
	This change updates the Gradle build configuration to ensure that all library modules (that will be published) use testing and are included in coverage reports. This should ensure the public modules remain well tested.
2022-04-22	refactor(trace/api): Move conventions into separate package	Fabian Mastenbroek
	This change moves the trace conventions (such as table and column names) into a separate `conv` package, so that they are separated from the main API. This also allows for a potential move into a separate module in the future.
2022-04-22	refactor(compute): Load interference model via trace library	Fabian Mastenbroek
	This change updates the compute support library to load the VM interference model via the OpenDC trace library, which provides a generic interface for reading interference models associated with workload traces.
2022-02-18	fix(simulator): Flush results before accessing counters	Fabian Mastenbroek
	This change updates the simulator implementation to flush the active progress when accessing the hypervisor counters. Previously, if the counters were accessed while the mux or consumer was in progress, the counter values were not accurate.
2022-02-18	fix(compute): Disallow duplicate UIDs for SimHost	Fabian Mastenbroek
	This change fixes an issue with the ComputeServiceHelper where it allowed users to register multiple SimHost objects with the same UID. See this issue for more information: https://github.com/atlarge-research/opendc/issues/51
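A guard against duplicate UIDs can be sketched with a map keyed by UID. This is only an illustration of the general technique: `HostRegistrySketch` and its methods are hypothetical, hosts are represented by their names, and the choice of `IllegalArgumentException` is an assumption rather than the behavior of `ComputeServiceHelper`.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.UUID;

final class HostRegistrySketch {
    private final Map<UUID, String> hosts = new HashMap<>();

    /** Registers a host and rejects a UID that is already in use. */
    void register(UUID uid, String name) {
        // putIfAbsent leaves the existing entry untouched and returns it if present.
        if (hosts.putIfAbsent(uid, name) != null) {
            throw new IllegalArgumentException("Host with UID " + uid + " is already registered");
        }
    }

    int size() {
        return hosts.size();
    }

    public static void main(String[] args) {
        HostRegistrySketch registry = new HostRegistrySketch();
        UUID uid = new UUID(0, 1);
        registry.register(uid, "host-a");
        try {
            registry.register(uid, "host-b");
        } catch (IllegalArgumentException e) {
            System.out.println("rejected duplicate");
        }
        System.out.println(registry.size()); // 1
    }
}
```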
2022-02-18	build: Remove opendc-platform module	Fabian Mastenbroek
	This change removes the opendc-platform module from the project. This module represented a Java platform which was previously used for sharing a set of dependency versions between subprojects. However, with the version catalogue that was added by Gradle, we currently do not use the platform anymore.