---
title: Assignment 2, **Question 4** - FIT3155
author: Alexander Occhipinti - 29994705
date: Week 8 of Semester 2, 2021
geometry: margin=2cm
header-includes:
- \usepackage{cancel}
- \usepackage{qtree}
---
# Part a
Firstly, the sum of the amortised cost for every state $i$ of $D$ can be expressed as:
$$
\sum_{i=1}^n ac_{i} = \sum_{i=1}^n tc_{i} + \sum_{i=1}^n \big[\Phi(D_{i}) - \Phi(D_{i-1})\big]
$$ {#eq:1}
This is equation 3 from the assignment paper, summed over all $n$ operations and split into separate sums.
Since we know that $0 < i \leq n$, we can expand out $\sum_{i=1}^n \Phi(D_{i})$ into:
$$
\sum_{i=1}^n \Phi(D_{i}) = \Phi(D_{n}) + \Phi(D_{n-1}) + \Phi(D_{n-2}) + \ldots + \Phi(D_{i}) + \Phi(D_{i-1}) + \ldots + \Phi(D_{1})
$${#eq:2}
$D_{i-1}$ is defined as the stage of the data structure before $D_{i}$. This means we can expand the right-hand side of (@eq:1) as:
\begin{align*}
\sum_{i=1}^n \big[\Phi(D_{i}) - \Phi(D_{i-1})\big] = \Phi(D_{n})\;+\;&\cancel{\Phi(D_{n-1})} + \cancel{\Phi(D_{n-2})} + \ldots + \cancel{\Phi(D_{i})} + \cancel{\Phi(D_{i-1})} + \ldots + \cancel{\Phi(D_{1})} \\
-\;&\cancel{\Phi(D_{n-1})} - \cancel{\Phi(D_{n-2})} - \ldots - \cancel{\Phi(D_{i})} - \cancel{\Phi(D_{i-1})} - \ldots - \cancel{\Phi(D_{1})} - \Phi(D_{0})
\end{align*}
These expanded terms of $\Phi(D_{i})$ and $\Phi(D_{i-1})$ then cancel out to become $\Phi(D_{n}) - \Phi(D_{0})$. Therefore, it can be concluded that:
$$\sum_{i=1}^n ac_{i} = \sum_{i=1}^n tc_{i} + \sum_{i=1}^n \big[\Phi(D_{i}) - \Phi(D_{i-1})\big] = \sum_{i=1}^n tc_{i} + \Phi(D_{n}) - \Phi(D_{0})$$
This is the derived right-hand side.
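As a quick aside, the cancellation can be checked numerically. The following Python sketch (my own, using made-up `tc` and `phi` values, none of which come from the assignment) confirms that the intermediate potential terms telescope away:

```python
# A minimal numeric sketch: for arbitrary true costs tc_i and an arbitrary
# potential Phi, summing ac_i = tc_i + Phi(D_i) - Phi(D_{i-1}) collapses to
# sum(tc_i) + Phi(D_n) - Phi(D_0).
import random

n = 10
tc = [random.randint(1, 5) for _ in range(n)]          # hypothetical true costs tc_1..tc_n
phi = [random.randint(0, 20) for _ in range(n + 1)]    # hypothetical potentials Phi(D_0)..Phi(D_n)

ac = [tc[i] + phi[i + 1] - phi[i] for i in range(n)]   # amortised cost of each operation

lhs = sum(ac)
rhs = sum(tc) + phi[n] - phi[0]
assert lhs == rhs, (lhs, rhs)
print(lhs, rhs)  # equal: the intermediate potentials telescope away
```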
# Part b
The true cost of inserting an element into a Fibonacci heap is $O(1)$.
This is because adding a new element is as simple as appending it to the end of the root list and checking whether it is the new minimum. Both of these operations take $O(1)$ time, so the insert as a whole is $O(1)$.
Let $T(H)$ denote the number of nodes in the root list of a heap $H$, and let the heap be in state $H_{i-1}$ before the insert.
When we insert a node, it is placed directly into the root list, so the state after the insert has one more root than before.
This means:
$$\Phi(D_{i}) - \Phi(D_{i-1}) = \big(T(H_{i-1}) + 1\big) - T(H_{i-1}) = 1 $$
So the potential function increases by only 1 unit per insert.
This then shows that the amortised cost is:
\begin{align*}
ac_i &= O(1) + 1 \\
&= O(1).
\end{align*}
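To illustrate, here is a minimal Python sketch of the insert path (the class and field names are my own assumptions, and a plain list stands in for the circular doubly linked root list). The point is that the work done is independent of the heap's size:

```python
# A minimal sketch, not a full Fibonacci heap: insert only appends to the
# root list and updates the minimum pointer, so it does a constant amount
# of work regardless of how many nodes the heap already holds.
class Node:
    def __init__(self, key):
        self.key = key
        self.children = []      # child subtrees (unused by insert itself)
        self.marked = False

class FibHeapSketch:
    def __init__(self):
        self.roots = []         # root list; len(self.roots) is T(H)
        self.min_node = None

    def insert(self, key):
        node = Node(key)
        self.roots.append(node)                    # O(1): add to root list
        if self.min_node is None or key < self.min_node.key:
            self.min_node = node                   # O(1): update min pointer
        return node

h = FibHeapSketch()
for k in [7, 3, 9]:
    h.insert(k)
print(h.min_node.key, len(h.roots))   # 3 3 -> T(H) grew by one per insert
```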
# Part c
## True cost
When `extract_min` is performed, three operations take place:
1. `find_min` (shown to be $O(1)$ in true and amortised cost in the question.)
2. Delete the min and promote its children
3. Consolidate.
Deleting the minimum node (which is in the root list) takes at worst $O(D_{max}(N))$ operations.
This is because $O(1)$ work is done for each of the minimum's children, and it can have at most $D_{max}(N)$ children, since a root of degree $k$ has exactly $k$ children.
Consolidating requires going over every root in the root list and checking whether it needs to be merged with another or not. This requires $O(1)$ work for $T(H)$ nodes, resulting in $O(T(H))$ complexity.
Adding the time taken by these three operations results in:
\begin{align*}
tc_i &= O(1) + O(D_{max}(N)) + O(T(H)) \\
&= O\big(D_{max}(N) + T(H)\big) \text{ time.}
\end{align*}
## Potential cost
The potential function $\Phi$ we are using is the number of nodes in the root list.
When we delete the minimum element from the heap, its children become roots themselves. Since $D_{max}(N)$ is the maximum number of children a node can have, the potential increases by up to
$D_{max}(N)$ (minus one for the removed minimum, but this does not matter for the cost function).
Therefore, the change in potential is:
$$
\Phi(D_{i}) - \Phi(D_{i-1}) = D_{max}(N)
$$
### Scaling units of potential
Plugging this into the original amortised cost equation:
\begin{align*}
ac_i &= tc_i + m[\Phi(D_{i}) - \Phi(D_{i-1})] \\
&= O\big(D_{max}(N) + T(H)\big) + m[\Phi(D_{i}) - \Phi(D_{i-1})] \\
&= O\big(D_{max}(N) + T(H)\big) + m[D_{max}(N)]
\end{align*}
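For reference, here is a rough Python sketch of these three steps (the structure and helper names are my own assumptions, and Python lists stand in for the constant-time pointer operations of the real structure):

```python
# A rough sketch of extract_min: remove the minimum, promote its children to
# the root list (O(D_max) work), then consolidate by linking roots of equal
# degree (passing over the O(T(H)) roots).
class Node:
    def __init__(self, key):
        self.key = key
        self.children = []

    @property
    def degree(self):
        return len(self.children)

def link(a, b):
    """Make the larger-keyed root a child of the smaller-keyed root."""
    if b.key < a.key:
        a, b = b, a
    a.children.append(b)
    return a

def extract_min(roots):
    min_node = min(roots, key=lambda r: r.key)   # find_min: O(1) with a min pointer in the real heap
    roots.remove(min_node)
    roots.extend(min_node.children)              # promote children: O(D_max)
    by_degree = {}                               # consolidate: at most one root per degree remains
    for root in roots:
        while root.degree in by_degree:
            root = link(root, by_degree.pop(root.degree))
        by_degree[root.degree] = root
    roots[:] = by_degree.values()
    return min_node.key

roots = [Node(k) for k in [4, 1, 6, 3]]
print(extract_min(roots), len(roots))   # 1 2 -> the root list shrinks after consolidation
```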
# Part d
Consider a Fibonacci heap named $H_3$ where $D_{max}(N) = 3$.
To fill $H_3$ with the greatest number of nodes possible without pushing $D_{max}(N)$ to $4$,
we need exactly one root node of each degree from $0$ to $3$ in the root list. This looks like so:
$$\Tree [.H_3 0 [.1 0 ] [.2 [.1 0 ] 0 ] [.3 [.2 [.1 0 ] 0 ] [.1 0 ] 0 ] ]$$
If we were to consolidate a tree filled like this, it would remain the same. If we add just one more node and consolidate, the whole heap will collapse in on itself and create a heap with a single root node of degree 4.
A tree whose root has degree $k$, with nothing deleted below it, contains $2^k$ nodes.
In $H_3$'s case, it has 15 nodes:
$$N = 2^0 + 2^1 + 2^2 + 2^3 = 1 + 2 + 4 + 8 = 15 $$
As mentioned earlier, adding one more node (16 nodes total) turns the heap into a single degree-4 root in the root list. This makes sense since $2^4 = 16$.
For $H_3$ the assertion that $D_{max}(N) \leq \lfloor\log_2(N)\rfloor$ holds true since:
\begin{align*}
3 &\leq \lfloor\log_2(15)\rfloor \\
&= \lfloor 3.907\ldots \rfloor \\
&= 3
\end{align*}
More generally, consider a Fibonacci heap where $D_{max}(N) = k$.
This heap is maximally filled in the same way as the above example: it consists of exactly one root node of each degree from $k$ down to $0$, i.e. $k$, $k-1$, $k-2$, \ldots, $0$.
$$\Tree [.H 0 [.1 0 ] {...} \qroof{\; ... \;}.k-1 \qroof{\; ... \;}.k ]$$
If we were to add one more node and consolidate, the heap will consist of a root list containing just a degree $k+1$ node. This would no longer be a $D_{max}(N) = k$ heap.
Given that a tree whose root has degree $d$ (with nothing deleted below it) contains $2^d$ nodes, we can calculate the maximum number of nodes in a heap of max degree $k$:
\begin{align*}
&N = \sum_{d=0}^{k} \; 2^{d} = 2^{k+1} - 1 \\
&\therefore \; k \leq \lfloor\log_2(N)\rfloor \quad \text{since } 2^{k} \leq N < 2^{k+1}
\end{align*}
Substituting $D_{max}(N)$ back in for $k$:
$$
D_{max}(N) \leq \lfloor\log_2(N)\rfloor.
$$
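As a quick numeric check (my own sketch, not part of the proof), the bound holds with equality for a maximally filled heap at every small $k$:

```python
# Numeric check of the bound above: a maximally filled heap of max degree k
# has N = sum_{d=0}^{k} 2^d = 2^(k+1) - 1 nodes, and floor(log2(N)) = k.
import math

for k in range(0, 10):
    N = sum(2 ** d for d in range(k + 1))   # one tree per degree 0..k
    assert N == 2 ** (k + 1) - 1
    assert k <= math.floor(math.log2(N))    # D_max(N) <= floor(log2(N))
print("D_max(N) <= floor(log2(N)) holds for k = 0..9")
```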
# Part e
Let $N_k$ denote the number of nodes in a *maximally decreased* tree of degree $k$.
By definition, a node of degree $k$ must have exactly $k$ children, and by the rules of Fibonacci heaps and binomial trees, each of these children is the root of a subtree with its own degree.
By looking at the maximally decreased trees shown in Figure 1 on the assignment sheet, it can be seen that the number of nodes in a tree follows the Fibonacci sequence.
This is evident from the fact that $N_0$, $N_1$, $N_2$ and $N_3$ are 1, 2, 3 and 5 nodes respectively. This also holds true for $N_4$ which has 8 nodes.
Let $F_n$ represent the $n^{\text{th}}$ number in the Fibonacci sequence, defined in the same way as Figure 1 in the assignment sheet. (*i.e.* $F_0 = 0$, $F_1 = 1$, $F_2 = 1$ etc\ldots)
The number of nodes in a maximally decreased tree of degree $k$ is: $N_k = F_{k+2}$.
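As a small check (my own sketch), the recurrence $N_k = N_{k-1} + N_{k-2}$ with $N_0 = 1$ and $N_1 = 2$ reproduces exactly $F_{k+2}$:

```python
# Check that the counts quoted above follow the Fibonacci sequence:
# with N_0 = 1 and N_1 = 2, N_k = N_{k-1} + N_{k-2} equals F_{k+2}
# for F_0 = 0, F_1 = 1.
def fib(n):
    a, b = 0, 1                # F_0, F_1
    for _ in range(n):
        a, b = b, a + b
    return a

N = [1, 2]                     # N_0, N_1: maximally decreased trees of degree 0 and 1
for k in range(2, 10):
    N.append(N[k - 1] + N[k - 2])

for k, nodes in enumerate(N):
    assert nodes == fib(k + 2), (k, nodes)
print(N[:5])                   # [1, 2, 3, 5, 8] as in Figure 1
```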
# Part f
Using the definitions of $N_k$ and $F_n$ defined in Part e, we want to show that $N_k = F_{k+2}$.
## Base cases
For $N_0$ this holds true since there is 1 node in a maximally decreased tree of degree 0: the root. If you were to delete this node, the tree would not exist. 1 is the $0+2 = 2^{\text{nd}}$ number in the Fibonacci sequence.
Therefore: $N_0 = F_{2} = 1$.
This also holds true for $N_1$: since deleting the only child of a degree-1 tree would decrement its degree, a tree of degree 1 is already maximally decreased. The number of nodes in this tree is $2$, and $N_1 = F_{1+2} = F_{3} = 2$.
## Induction
To define a maximally decreased tree of degree $k$, we must delete as many nodes as we can from that tree while preserving the degree of the tree's root.
The root must have $k$ direct children at all times in order to remain a tree of degree $k$.
Assume that this property holds for a maximally decreased tree $T$ of degree $k$ (*i.e.* $N_k = F_{k+2}$) *and* for a maximally decreased tree of degree $k-1$ (*i.e.* $N_{k-1} = F_{k+1}$).
This means that $T$ must have $k$ children, and that each of those subtrees must also be maximally decreased. This extends down to every node in $T$.
To obtain a maximally decreased tree of degree $k$, we must ask each of the root's children to delete the node with the highest degree out of its children, then tell each of its children to do the same.
Nodes of degree $0$ have no children, so they remain unaffected.
This recursively applies the operation to every node in the tree.
This forms a maximally decreased tree because each node deletes the largest subtree it can with a single delete action, marking that node in the process.
We know that a tree of degree $k+1$ can be split into two trees of degree $k$, joined by the link between the root and the node of degree $k$. Let us name the side without the root as $\alpha$ and the one with the root as $\beta$.
When $\alpha$ is maximally decreased, we delete the node of degree $k-1$, marking the node of degree $k$ of $\alpha$. The same is done to all of its children. This results in a maximally decreased tree of degree $k-1$, since the “root” of $\alpha$ was allowed to be marked.
Due to our assumption of $N_{k-1} = F_{k+1}$, we assume $\alpha$ has $F_{k+1}$ nodes.
In $\beta$'s case: we do **not** want to mark the root node, since $\beta$'s root is the root of the whole tree of degree $k+1$. If you were to attempt to mark the root, it would delete one of the direct children of the $k+1$ degree tree, decrementing its degree (violating the rule of maximally decreasing).
Instead, we can treat $\beta$ as a normal tree of degree $k$ and maximally decrease it as such. Due to our assumption of $N_k = F_{k+2}$, we assume $\beta$ has $F_{k+2}$ nodes.
## Conclusion
Given what is described above, the size of the degree-$(k+1)$ tree is the sum of its two halves, $\alpha$ and $\beta$.
Since the definition of the Fibonacci sequence is $F_n = F_{n-1} + F_{n-2}$, it can be said that:
\begin{align*}
N_{k+1} &= \alpha + \beta \\
&= F_{k+1} + F_{k+2} \\
&= F_{k+3}
\end{align*}
Therefore by the Principle of Mathematical Induction: $N_{k} = F_{k+2}$. $\blacksquare$
# Part g
Since we know that a maximally decreased tree of degree $k$ contains $F_{k+2}$ nodes, we can define $g(k) = F_{k+2}$ and substitute it into the assignment sheet's equation 7.
Since we also know that $\mathbf{size}(r) \geq F_{k+2}$ and that $F_{k+2} \geq \phi^k$ (from the sheet's equation 8), we know that $\mathbf{size}(r) \geq \phi^k$.
That is, $\phi^k$ is a lower bound on $\mathbf{size}(r)$.
Taking the $\log$ base $\phi$ of both sides we get:
$$
\log_\phi(\mathbf{size}(r)) \geq k
$$
This means that $\log_\phi(\mathbf{size}(r))$ is an upper bound on the degree $k$ of that tree, so $k$ is bounded by $O(\log({\mathbf{size}}(r)))$.
$D_{max}(N)$ is the largest degree $k$ in the heap, and $N \geq \mathbf{size}(r)$ because the number of nodes in the whole heap is at least the number of nodes in its biggest tree. Plugging these values into the above inequality yields:
\begin{align*}
\log_\phi(\mathbf{size}(r)) &\geq D_{max}(N) \\
\log_\phi(N) &\geq D_{max}(N)
\end{align*}
From this, it can be concluded that $D_{max}(N)$ for `extract_min` grows at $O(\log(N))$.
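As a numeric sanity check (my own sketch, not part of the argument), the chain $\mathbf{size}(r) \geq F_{k+2} \geq \phi^k$ and the resulting bound $k \leq \log_\phi(\mathbf{size}(r))$ hold for small $k$:

```python
# Check the inequality used above: F_{k+2} >= phi^k, hence any degree-k tree
# with size(r) >= F_{k+2} satisfies k <= log_phi(size(r)).
import math

phi = (1 + math.sqrt(5)) / 2

def fib(n):
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a

for k in range(0, 30):
    size_lower_bound = fib(k + 2)          # size(r) of a degree-k tree is at least F_{k+2}
    assert size_lower_bound >= phi ** k
    assert k <= math.log(size_lower_bound, phi)
print("k <= log_phi(size(r)) holds for k = 0..29")
```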
# Part h
We can calculate the amortised cost of `decrease_key` using the equation $ac_i = tc_i + \Phi(D_{i}) - \Phi(D_{i-1})$.
## True cost
The true cost of a single `decrease_key` operation is $O(1)$. This covers changing the key value, checking its parent and marking/unmarking nodes.
These operations take constant time because none of them depend on the size of the data structure.
## Potential cost
The potential function for this question is defined as $\Phi = T(H)$, where $T(H)$ is the number of nodes in the root list.
As described before, at most $D_{max}(N)$ cuts are performed on the heap for a single `decrease_key` operation (a sketch of this loop is given after the equation below).
For each cascading cut, the newly orphaned node is added to the root list. This increases the number of roots by 1 per cut, which means up to $D_{max}(N)$ new roots.
In other words:
\begin{align*}
\Phi(D_{i}) - \Phi(D_{i-1}) &= \big(T(H) + D_{max}(N)\big) - T(H) \\
&= D_{max}(N)
\end{align*}
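For concreteness, here is a compact Python sketch of the cascading-cut loop (the names and fields are my own assumptions, not the unit's reference code). Each cut does $O(1)$ work and appends exactly one node to the root list, which is what drives the $D_{max}(N)$ term:

```python
# A compact sketch of decrease_key's cascading cuts: each cut is O(1) and
# appends exactly one node to the root list.
class Node:
    def __init__(self, key):
        self.key = key
        self.parent = None
        self.children = []
        self.marked = False

def cut(roots, node):
    """Detach node from its parent and make it a root (O(1) work)."""
    node.parent.children.remove(node)
    node.parent = None
    node.marked = False
    roots.append(node)              # one more root per cut

def decrease_key(roots, node, new_key):
    node.key = new_key
    parent = node.parent
    if parent is not None and node.key < parent.key:
        cut(roots, node)
        # cascading cuts: keep cutting already-marked ancestors
        while parent.parent is not None and parent.marked:
            grandparent = parent.parent
            cut(roots, parent)
            parent = grandparent
        if parent.parent is not None:
            parent.marked = True    # first loss of a child: mark and stop

# tiny chain a <- b <- c to exercise a cascading cut
a, b, c = Node(1), Node(5), Node(9)
b.parent, c.parent = a, b
a.children, b.children = [b], [c]
b.marked = True                      # b has already lost a child
roots = [a]
decrease_key(roots, c, 0)            # cuts c, then cascades and cuts b
print([n.key for n in roots])        # [1, 0, 5]
```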
## Amortised cost
Plugging the two findings into the amortised cost function yields:
\begin{align*}
ac_i &= O(1) + D_{max}(N) \\
&= O(D_{max}(N)) \\
&= O(\log(N)) \;\; \textit{from part g}
\end{align*}
This is not what I expected, since the amortised cost should account for the fact that not every `decrease_key` operation does $O(D_{max}(N))$ work.
In many cases, a single operation does not need to walk through every level of the deepest tree; some operations consist of just decreasing the key without cutting anything at all. I would expect something closer to constant time.
# Part i