| author | mjkwiatkowski <mati.rewa@gmail.com> | 2026-06-02 19:09:38 +0200 |
|---|---|---|
| committer | mjkwiatkowski <mati.rewa@gmail.com> | 2026-06-02 19:09:38 +0200 |
| commit | ffb1045c8d27b48868ff582551a0b41dd37afe45 (patch) | |
| tree | bd10b5891e874fdd7238a4db62322a3a53c0e5a9 /content/background.tex | |
| parent | 9c07f07c50ef6a93ac72a593ac087b01de7ec125 (diff) | |
feat: added more text to the background, refactor is still needed
Diffstat (limited to 'content/background.tex')
| -rw-r--r-- | content/background.tex | 87 |
1 files changed, 85 insertions, 2 deletions
diff --git a/content/background.tex b/content/background.tex
index 85f3c57..c88367f 100644
--- a/content/background.tex
+++ b/content/background.tex
@@ -2,10 +2,38 @@
 \section{Datacenters}\label{ss:datacenters}
 Explain the high risk phenomena that occur in datacenters, which includes failures.
 
-\subsection{Failures}
+\subsection{Failures}\label{sss:failures}
+
 
 \section{Digital Twinning}\label{ss:digital-twinning}
 
+
+\gls{ed} is an open-source framework for developing digital twins of supercomputers.
+It consists of 3 modules:
+\begin{enumerate*}[label=(\arabic*)]
+    \item resource allocator and power simulator
+    \item thermal cooling model
+    \item augmented reality 3D model
+\end{enumerate*}
+of the supercomputer.
+\gls{ed} has been used at the Frontier supercomputer at the Oak Ridge National Laboratory in the USA, successfully predicting potential energy losses at the supercomputer.
+Brewer \etal include alongside the framework architecture an open-source artifact and a set of extensive verification and validation experiments.
+The authors differentiate between different digital twins within \gls{ed}, such as \begin{enumerate*}[label=(\arabic*)]
+    \item descriptive twin
+    \item informative twin
+    \item predictive twin
+    \item comprehensive twin
+    \item autonomous twin
+\end{enumerate*}
+that together form the \gls{ed}.
+The \emph{predictive twin} leverages data driven operational analytics to create \gls{ml} models. Authors argue that alongside simulation, \gls{ml} models should also have a significant role for modeling system workloads in \eg application fingerprinting.
+Within the \emph{autonomous twin} the authors use \gls{rl} to train agents that can be used to make control decisions in order to optimize different processes.
+In order to model the cooling system the authors use the Modelica software, and to predict energy power draw they coded a Python script.
+The authors provide a intuitive way to interact with the system using a visual dashboard, and an advanced augmented reality model.
+The authors posit that the best way to address the 3V's of data (velocity, volume and variety) is to use augmented reality coupled with dashboards.
+
+
+
 Predictive modelling uses statistics to predict outcomes.
 When deployed commercially, for example in datacenters, predictive modelling is often referred to as predictive analytics~\cite{Wikipedia:PredictiveModelling}.
 Almost any statistical model can be used for prediction purposes, but nowadays predictive analysis is synonymous with machine learning.
@@ -23,6 +51,62 @@ The process of inference from data to provide the best explanation is called abd
 %Include something about data-preprocessing in the pipeline.
 %See the article by Fei Tao
 
+\subsection{Datacenter simulation}\label{sss:simulation}
+
+\begin{table}[h]
+    \centering
+    \renewcommand{\arraystretch}{1.4}
+    \begin{tabular}{m{0.7\linewidth}cc}
+    \toprule
+    Feature & \gls{ed} & \\
+    \midrule
+    Virtual Prototyping & & \\
+    Scenario Exploration & & \\
+    3D Facility Modelling & & \\
+    Predictive maintenance & & \\
+    Predictive energy modelling & & \\
+    Reliability and availability modeling & & \\
+    Cooling modelling & & \\
+    Network modelling & & \\
+    Predictive modelling & & \\
+    Power consumption modelling & & \\
+    Visual analytics dashboard & & \\
+    Forensic analysis and diagnostics & & \\
+    Failure detection & & \\
+    Operational optimization & & \\
+    Resource allocation & & \\
+    \midrule
+    \end{tabular}
+    \caption{Comparison of selected features of existing datacenter digital twins.}
+\end{table}
+
+
+\begin{table}[h]
+    \centering
+    \renewcommand{\arraystretch}{1.4}
+    \begin{tabular}{cccm{0.3\linewidth}c}
+    \toprule
+    Project & Environment & Stakeholders & Highlighted Features & GUI \\
+    \midrule
+
+    CloudSim & Cloud, Fog, Edge & Research & VC\textsuperscript{$\star$}, N, S, E, WF, FD, EXP, CM, PI & \ding{51}\textsuperscript{$\dagger$} \\
+    \midrule
+    SimGrid & Grid, P3P, Cloud & Research, Edu. & VC\textsuperscript{$\star$}, N\textsuperscript{$\star$}, S, E\textsuperscript{$\star$}, WF\textsuperscript{$\star$} & \ding{51}\textsuperscript{$\dagger$} \\
+    \midrule
+    DGSim & Grid & Research & WF, F, EXP & \ding{55} \\
+    \midrule
+    GroudSim & Grid, Cloud & Research & WF, CM, F & \ding{55} \\
+    \midrule
+    iCanCloud & Cloud & Research & VC, N\textsuperscript{$\star$}, S, CM & \ding{51}\textsuperscript{$\star$} \\
+    \midrule
+    \textbf{OpenDC} & Cloud & Research, Edu. & VC\textsuperscript{$\star$}, N, S, E\textsuperscript{$\star$},, CM, FS\textsuperscript{$\star$}, ML, WF, F\textsuperscript{$\star$}, PI, EXP\textsuperscript{$\star$} & \ding{51}\textsuperscript{$\star$} \\
+    \bottomrule
+    \end{tabular}
+    \caption{Comparison of selected datacenter simulators. \textbf{Models:} VC = VMs and containers; N = Network, S = Storage, E = Energy, CM = Cost Models, FS = FaaS, ML = Machine Learning, WF = Workflows, FD = Federation; \textbf{Phenomena:} F = Failures, PI = Performance interface; \textbf{Tools:} EXP = Experiment automation; \textbf{Support:} \ding{51} = Yes, \ding{55} = No; $\dagger$ = extension, not integrated; $\star$ = advanced, carefully calibrated feature. Adapted form Mastenbroek \etal}
+\end{table}
+
+
+
 One of the key arguments that speak for a datacenter digital twin is that datacenters already connect hundreds of monitoring sensors and data coming from them.
 Monitoring of server racks, VM's, CPU profiling and all that give us lots of data.
 
@@ -32,7 +116,6 @@ ODA can predict failures, help maintain the equipment, save bills, cut costs.
 But currently one of the key challenges is to somehow connect the physical and virtual spaces.
 The answer to how to do this is a digital twin.
 
-
 %[citation needed]
 As of 2026, there is a lack of consensus of what is a digital twin.
 
