summaryrefslogtreecommitdiff
path: root/script
diff options
context:
space:
mode:
Diffstat (limited to 'script')
-rw-r--r--script/indent.log22
-rw-r--r--script/main.tex102
2 files changed, 124 insertions, 0 deletions
diff --git a/script/indent.log b/script/indent.log
new file mode 100644
index 0000000..61d5c27
--- /dev/null
+++ b/script/indent.log
@@ -0,0 +1,22 @@
+INFO: latexindent version 3.24.7, 2025-08-15, a script to indent .tex files
+ latexindent lives here: /usr/share/texmf-dist/scripts/latexindent/
+ Sat Jun 27 14:25:37 2026
+ Reading input from STDIN
+INFO: Processing switches:
+INFO: Directory for backup files and log file indent.log:
+ .
+INFO: YAML settings read: defaultSettings.yaml
+ Reading defaultSettings.yaml from /usr/share/texmf-dist/scripts/latexindent/defaultSettings.yaml
+INFO: YAML reading settings
+ Home directory is /home/matt
+ latexindent.pl didn't find indentconfig.yaml or .indentconfig.yaml
+ see all possible locations: https://latexindentpl.readthedocs.io/en/latest/sec-appendices.html#indentconfig-options)
+INFO: Phase 1: searching for objects
+INFO: Phase 2: finding surrounding indentation
+INFO: Phase 3: indenting objects
+INFO: Phase 4: final indentation check
+INFO: Output routine:
+ Not outputting to file; see -w and -o switches for more options.
+ --------------
+INFO: Please direct all communication/issues to:
+ https://github.com/cmhughes/latexindent.pl
diff --git a/script/main.tex b/script/main.tex
new file mode 100644
index 0000000..8acb153
--- /dev/null
+++ b/script/main.tex
@@ -0,0 +1,102 @@
+\documentclass[12pt, a4paper]{article}
+\usepackage{palatino, enumitem, parskip, xspace}
+\usepackage[top=1.5cm, bottom=1.5cm, left=2cm, right=2cm]{geometry}
+\usepackage[dvipsnames]{xcolor}
+\newcommand{\eg}{\emph{e.g.,}\xspace}
+\newcommand{\todo}[1]{\textcolor{Blue}{\textbf{TODO(#1)}}}
+\newcommand{\etal}{\emph{et~al.}\xspace}
+\begin{document}
+\begin{center}
+ \Large My BSc Defence Script
+\end{center}
+\begin{enumerate}[label=\textbf{Slide \arabic*.}]
+ \item \textbf{Introduction}\\
+ Good morning everyone, my name is Mateusz and today I will present to you my project \emph{Sunfish: Enabling Predictive Analytics For Datacenters Through Digital Twinning}.
+
+ At a top level, my project is about trying to ease datacenter management by trying to pave the way to predicting unexpected events.
+
+ \item \textbf{Societal Impact}\\
+ As you know and as you will likely see in the upcoming presentations today, datacenters are important.
+ But, I would like to shortly mention this myself.
+
+ A single GPU is already very complex.
+ Within a Google Datacenter, there are hundreds of server racks, with tens of such GPUs.
+ This begs the question: How are we going to manage this large of a datacenter, that has so many \emph{layers of complexity}?
+
+ We cannot let these systems go down or experience big failures, because \eg in Netherlands over 3 million professionals depend daily on the cloud.
+ \todo{Read the slide box.}
+ As such, we must do something to manage datacenters well.
+
+
+ \item \textbf{Problem Statement}\\
+ Digital Twinning pairs complex objects (like datacenters) via a two-way connection with their virtual representation.
+ \todo{Give example about the airplane from aviation.}
+ \emph{It a method to manage complex systems.}
+
+ However, in digital twinning, specifically datacenter digital twinning a lot of elements are still shifting about and there are a lot of ways to create the virtual models and there seems to not be a fully functioning DCDT out there (\emph{that meets the official NASM definition}).
+
+ DCDT's lack mandatory features one of which is predictive analytics.
+ Predictive Analytics is a type of ODA that draws insights into the future based on current data, \eg telling when a host failure might happen before it does (\emph{and yet it is NOT present in existing DCDTs}).
+
+ \item \textbf{Research Questions}\\
+ We wish to enable the development of predictive analysis components for DCDT's by designing a predictive DCDT.
+ We ask the following research questions. \todo{Read from slide boxes and explain for each (1) describe why it's important (2) say why it's challenging (3) say what makes it scientific.}
+
+ \item \textbf{Literature Survey}\\
+ This is the most exciting part of the thesis for me.
+ To answer \textbf{RQ1} we conduct a comprehensive literature survey.
+ We did not conduct the systematic literature survey by Kitchenham \etal, instead we relied heavily on snowballing and manual search for works in Google Scholar and DBLP.
+
+ Google Scholar referred us to ACM Digital Library, IEEExplore, Science Direct and others.
+ We used structured queries such a ``datacenters'' \texttt{AND} ``digital twinning'' or plainly ``datacenter digital twins''.
+
+ To filter out relevant work we read the abstract, introduction and conclusion and afterwards decided whether to include the article.
+ The results are in \textbf{Table 1.1}.
+ \todo{"Read the slide box."}
+ \item \textbf{System Model}\\
+ We also created a holistic system model of DCDTs.
+ We decided to make a system model instead of a taxonomy, because we discuss the design of a set of systems, and there are not that many to consider making a full \emph{Linnaeus} tree and a taxonomy.
+
+ The system model is in \textbf{Fig. 1.3}, and what I found to be the most interesting while reading the literature was the lack of the connection between the two twins.
+ As such, what makes this design special is the \emph{Digital Thread}. \todo{Read the slide box.}
+
+ \item \textbf{Reference Architecture and Prototype}\\
+ From the literature survey, we gathered the potential use-cases of our system, which we omit for brevity.
+ From the use-cases we developed as set of functional and non-functional requirements, based on which we created the reference architecture.
+
+ The most innovative part of the data pipeline is the use of both in-band and out-of-band data pipelines, by including both a short-term cache and a long-term database.
+
+ The most interesting thing that I devised myself, is the predictive analytics component.
+ \todo{Go through the elements in the plot.}
+
+ Given this reference architecture, we created a prototype, called \emph{Sunfish}.
+ We evaluate this prototype in the following slides.
+
+
+ \item \textbf{Novel Evaluation Method}\\
+ Now we go to the most difficult part.
+ In order to evaluate a prototype, we propose a novel approach.
+ Many researchers do not have a real facility to experiment with.
+ We propose to use a second simulator to act as the real datacenter.
+
+ \todo{Say in order to not cram content into the presentation, we omit the technical setup, and include it in extra slides.}
+
+
+ \item \textbf{Experiment 1: Red and Yellow Alarms}\\
+ For Experiment 1 we copy the idea of Milojicic \etal for different ways a DCDT can notify the datacenter.
+
+ Imagine a scenario: a datacenter will soon run a workload.
+ We want to detect and differentiate between failures that are big and unexpected and failures we anticipated would occur.
+
+ To achieve this: the DCDT runs the workload using the simulator.
+ We cannot know what kind of failures we can expect, so we use a statistical distribution to approximate what might occur in practice.
+ In result, we get a picture of what kind of problems we might expect.
+
+ We now use the real-time feedback loop to notify the DC operators that what is happening in reality is different from simulation.
+ If we get within 80\% of the predicted threshold for number of failures we send a yellow alarm.
+ If we get within 90\% we send a red alarm.
+ \item \textbf{Experiment 2: Conceptual Experiment}\\
+ \item \textbf{Key Takeaways}\\
+ \todo{Read from the slide.}
+\end{enumerate}
+\end{document}