2 files changed, 124 insertions, 0 deletions
diff --git a/script/indent.log b/script/indent.log
new file mode 100644
index 0000000..61d5c27
--- /dev/null
+++ b/script/indent.log
@@ -0,0 +1,22 @@
+INFO:  latexindent version 3.24.7, 2025-08-15, a script to indent .tex files
+       latexindent lives here: /usr/share/texmf-dist/scripts/latexindent/
+       Sat Jun 27 14:25:37 2026
+       Reading input from STDIN
+INFO:  Processing switches:
+INFO:  Directory for backup files and log file indent.log:
+       .
+INFO:  YAML settings read: defaultSettings.yaml
+       Reading defaultSettings.yaml from /usr/share/texmf-dist/scripts/latexindent/defaultSettings.yaml
+INFO:  YAML reading settings
+       Home directory is /home/matt
+       latexindent.pl didn't find indentconfig.yaml or .indentconfig.yaml
+       see all possible locations: https://latexindentpl.readthedocs.io/en/latest/sec-appendices.html#indentconfig-options)
+INFO:  Phase 1: searching for objects
+INFO:  Phase 2: finding surrounding indentation
+INFO:  Phase 3: indenting objects
+INFO:  Phase 4: final indentation check
+INFO:  Output routine:
+       Not outputting to file; see -w and -o switches for more options.
+       --------------
+INFO:  Please direct all communication/issues to:
+        https://github.com/cmhughes/latexindent.pl
diff --git a/script/main.tex b/script/main.tex
new file mode 100644
index 0000000..8acb153
--- /dev/null
+++ b/script/main.tex
@@ -0,0 +1,102 @@
+\documentclass[12pt, a4paper]{article}
+\usepackage{palatino, enumitem, parskip, xspace}
+\usepackage[top=1.5cm, bottom=1.5cm, left=2cm, right=2cm]{geometry}
+\usepackage[dvipsnames]{xcolor}
+\newcommand{\eg}{\emph{e.g.,}\xspace}
+\newcommand{\todo}[1]{\textcolor{Blue}{\textbf{TODO(#1)}}}
+\newcommand{\etal}{\emph{et~al.}\xspace}
+\begin{document}
+\begin{center}
+	\Large My BSc Defence Script
+\end{center}
+\begin{enumerate}[label=\textbf{Slide \arabic*.}]
+	\item \textbf{Introduction}\\
+	      Good morning everyone, my name is Mateusz and today I will present to you my project \emph{Sunfish: Enabling Predictive Analytics For Datacenters Through Digital Twinning}.
+
+	      At a top level, my project is about trying to ease datacenter management by trying to pave the way to predicting unexpected events.
+
+	\item \textbf{Societal Impact}\\
+	      As you know and as you will likely see in the upcoming presentations today, datacenters are important.
+	      But, I would like to shortly mention this myself.
+
+	      A single GPU is already very complex.
+	      Within a Google Datacenter, there are hundreds of server racks, with tens of such GPUs.
+	      This begs the question: How are we going to manage this large of a datacenter, that has so many \emph{layers of complexity}?
+
+	      We cannot let these systems go down or experience big failures, because \eg in Netherlands over 3 million professionals depend daily on the cloud.
+	      \todo{Read the slide box.}
+	      As such, we must do something to manage datacenters well.
+
+
+	\item \textbf{Problem Statement}\\
+	      Digital Twinning pairs complex objects (like datacenters) via a two-way connection with their virtual representation.
+	      \todo{Give example about the airplane from aviation.}
+	      \emph{It a method to manage complex systems.}
+
+	      However, in digital twinning, specifically datacenter digital twinning a lot of elements are still shifting about and there are a lot of ways to create the virtual models and there seems to not be a fully functioning DCDT out there (\emph{that meets the official NASM definition}).
+
+	      DCDT's lack mandatory features one of which is predictive analytics.
+	      Predictive Analytics is a type of ODA that draws insights into the future based on current data, \eg telling when a host failure might happen before it does (\emph{and yet it is NOT present in existing DCDTs}).
+
+	\item \textbf{Research Questions}\\
+	      We wish to enable the development of predictive analysis components for DCDT's by designing a predictive DCDT.
+	      We ask the following research questions. \todo{Read from slide boxes and explain for each (1) describe why it's important (2) say why it's challenging (3) say what makes it scientific.}
+
+	\item \textbf{Literature Survey}\\
+	      This is the most exciting part of the thesis for me.
+	      To answer \textbf{RQ1} we conduct a comprehensive literature survey.
+	      We did not conduct the systematic literature survey by Kitchenham \etal, instead we relied heavily on snowballing and manual search for works in Google Scholar and DBLP.
+
+	      Google Scholar referred us to ACM Digital Library, IEEExplore, Science Direct and others.
+	      We used structured queries such a ``datacenters'' \texttt{AND} ``digital twinning'' or plainly ``datacenter digital twins''.
+
+	      To filter out relevant work we read the abstract, introduction and conclusion and afterwards decided whether to include the article.
+	      The results are in \textbf{Table 1.1}.
+	      \todo{"Read the slide box."}
+	\item \textbf{System Model}\\
+	      We also created a holistic system model of DCDTs.
+	      We decided to make a system model instead of a taxonomy, because we discuss the design of a set of systems, and there are not that many to consider making a full \emph{Linnaeus} tree and a taxonomy.
+
+	      The system model is in \textbf{Fig. 1.3}, and what I found to be the most interesting while reading the literature was the lack of the connection between the two twins.
+	      As such, what makes this design special is the \emph{Digital Thread}.      \todo{Read the slide box.}
+
+	\item \textbf{Reference Architecture and Prototype}\\
+	      From the literature survey, we gathered the potential use-cases of our system, which we omit for brevity.
+	      From the use-cases we developed as set of functional and non-functional requirements, based on which we created the reference architecture.
+
+	      The most innovative part of the data pipeline is the use of both in-band and out-of-band data pipelines, by including both a short-term cache and a long-term database.
+
+	      The most interesting thing that I devised myself, is the predictive analytics component.
+	      \todo{Go through the elements in the plot.}
+
+	      Given this reference architecture, we created a prototype, called \emph{Sunfish}.
+	      We evaluate this prototype in the following slides.
+
+
+	\item \textbf{Novel Evaluation Method}\\
+	      Now we go to the most difficult part.
+	      In order to evaluate a prototype, we propose a novel approach.
+	      Many researchers do not have a real facility to experiment with.
+	      We propose to use a second simulator to act as the real datacenter.
+
+	      \todo{Say in order to not cram content into the presentation, we omit the technical setup, and include it in extra slides.}
+
+
+	\item \textbf{Experiment 1: Red and Yellow Alarms}\\
+	      For Experiment 1 we copy the idea of Milojicic \etal for different ways a DCDT can notify the datacenter.
+
+	      Imagine a scenario: a datacenter will soon run a workload.
+	      We want to detect and differentiate between failures that are big and unexpected and failures we anticipated would occur.
+
+	      To achieve this: the DCDT runs the workload using the simulator.
+	      We cannot know what kind of failures we can expect, so we use a statistical distribution to approximate what might occur in practice.
+	      In result, we get a picture of what kind of problems we might expect.
+
+	      We now use the real-time feedback loop to notify the DC operators that what is happening in reality is different from simulation.
+	      If we get within 80\% of the predicted threshold for number of failures we send a yellow alarm.
+	      If we get within 90\% we send a red alarm.
+	\item \textbf{Experiment 2: Conceptual Experiment}\\
+	\item \textbf{Key Takeaways}\\
+	      \todo{Read from the slide.}
+\end{enumerate}
+\end{document}