Corrections

This commit is contained in:
Antti H S Laaksonen 2017-02-04 00:47:45 +02:00
parent d8a26a6274
commit 660e6f6e14
1 changed files with 121 additions and 112 deletions

View File

@ -2,40 +2,45 @@
\index{amortized analysis} \index{amortized analysis}
Often the time complexity of an algorithm The time complexity of an algorithm
is easy to analyze by looking at the structure is often easy to analyze
just by examining the structure
of the algorithm: of the algorithm:
what loops there are and how many times what loops does the algorithm contain,
they are performed. and how many times the loops are performed.
However, sometimes a straightforward analysis However, sometimes a straightforward analysis
doesn't give a true picture of the efficiency of the algorithm. does not give a true picture of the efficiency of the algorithm.
\key{Amortized analysis} can be used for analyzing \key{Amortized analysis} can be used for analyzing
an algorithm that contains an operation whose algorithms that contain operations whose
time complexity varies. time complexity varies.
The idea is to consider all such operations during the The idea is to estimate the total time used for
execution of the algorithm instead of a single operation, all such operations during the
and estimate the total time complexity of the operations. execution of the algorithm, instead of focusing
on individual operations.
\section{Two pointers method} \section{Two pointers method}
\index{two pointers method} \index{two pointers method}
In the \key{two pointers method}, In the \key{two pointers method},
two pointers iterate through the elements in an array. two pointers are used for
iterating through the elements in an array.
Both pointers can move during the algorithm, Both pointers can move during the algorithm,
but the restriction is that each pointer can move but each pointer can move to one direction only.
to only one direction. This restriction ensures that the algorithm works efficiently.
This ensures that the algorithm works efficiently.
We will next discuss two problems that can be solved We will next discuss two problems that can be solved
using the two pointers method. using the two pointers method.
\subsubsection{Subarray sum} \subsubsection{Subarray sum}
Given an array that contains $n$ positive integers, As the first example,
our task is to find out if there is a subarray we consider a problem where we are
where the sum of the elements is $x$. given an array of $n$ positive numbers
and a target sum $x$,
and we should find a subarray whose sum is $x$
or report that there is no such subarray.
For example, the array For example, the array
\begin{center} \begin{center}
\begin{tikzpicture}[scale=0.7] \begin{tikzpicture}[scale=0.7]
@ -61,7 +66,7 @@ For example, the array
\node at (7.5,1.4) {$8$}; \node at (7.5,1.4) {$8$};
\end{tikzpicture} \end{tikzpicture}
\end{center} \end{center}
contains a subarray with sum 8: contains a subarray whose sum is 8:
\begin{center} \begin{center}
\begin{tikzpicture}[scale=0.7] \begin{tikzpicture}[scale=0.7]
\fill[color=lightgray] (2,0) rectangle (5,1); \fill[color=lightgray] (2,0) rectangle (5,1);
@ -89,16 +94,17 @@ contains a subarray with sum 8:
\end{center} \end{center}
It turns out that the problem can be solved in It turns out that the problem can be solved in
$O(n)$ time using the two pointers method. $O(n)$ time by using the two pointers method.
The idea is to iterate through the array The idea is that
using two pointers that define a range in the array. the left and right pointer indicate the
first and last element of an subarray.
On each turn, the left pointer moves one step On each turn, the left pointer moves one step
forward, and the right pointer moves forward forward, and the right pointer moves forward
as long as the sum is at most $x$. as long as the subarray sum is at most $x$.
If the sum of the range becomes exactly $x$, If the sum becomes exactly $x$,
we have found a solution. we have found a solution.
As an example, we consider the following array As an example, consider the following array
with target sum $x=8$: with target sum $x=8$:
\begin{center} \begin{center}
\begin{tikzpicture}[scale=0.7] \begin{tikzpicture}[scale=0.7]
@ -125,10 +131,10 @@ with target sum $x=8$:
\end{tikzpicture} \end{tikzpicture}
\end{center} \end{center}
First, the pointers define a range with sum $1+3+2=6$. Initially, the subarray contains the elements
The range can't be larger 1, 3 and 2, and the sum of the subarray is 6.
because the next number 5 would make the sum The subarray cannot be larger because
larger than $x$. the next element 5 would make the sum larger than $x$.
\begin{center} \begin{center}
\begin{tikzpicture}[scale=0.7] \begin{tikzpicture}[scale=0.7]
@ -159,8 +165,8 @@ larger than $x$.
\end{tikzpicture} \end{tikzpicture}
\end{center} \end{center}
After this, the left pointer moves one step forward. Then, the left pointer moves one step forward.
The right pointer doesn't move because otherwise The right pointer does not move because otherwise
the sum would become too large. the sum would become too large.
\begin{center} \begin{center}
@ -232,37 +238,38 @@ the number of steps the right pointer moves.
There is no upper bound how many steps the There is no upper bound how many steps the
pointer can move on a single turn. pointer can move on a single turn.
However, the pointer moves \emph{a total of} However, the pointer moves \emph{a total of}
$O(n)$ steps during the algorithm $O(n)$ steps during the algorithm,
because it only moves forward. because it only moves forward.
Since both the left and the right pointer Since both the left and right pointer
move $O(n)$ steps during the algorithm, move $O(n)$ steps during the algorithm,
the time complexity is $O(n)$. the time complexity is $O(n)$.
\subsubsection{Sum of two numbers} \subsubsection{2SUM-problem}
\index{2SUM problem} \index{2SUM-problem}
Given an array of $n$ integers and an integer $x$, Another problem that can be solved using
our task is to find two numbers in array the two pointers method is the following problem,
whose sum is $x$ or report that there are no such numbers. also known as the \key{2SUM-problem}:
This problem is known as the \key{2SUM} problem, We are given an array of $n$ numbers and
and it can be solved efficiently using the a target sum $x$, and our task is to find two numbers
two pointers method. in the array such that their sum is $x$,
or report that no such numbers exist.
First, we sort the numbers in the array in To solve the problem, we first
increasing order. sort the numbers in the array in increasing order.
After this, we iterate through the array using After that, we iterate through the array using
two pointers that begin at both ends of the array. two pointers initially located at both ends of the array.
The left pointer begins from the first element The left pointer begins at the first element
and moves one step forward on each turn. and moves one step forward on each turn.
The right pointer begins from the last element The right pointer begins at the last element
and always moves backward until the sum of the range and always moves backward until the sum of the
defined by the pointers is at most $x$. subarray is at most $x$.
If the sum is exactly $x$, we have found a solution. If the sum is exactly $x$, we have found a solution.
For example, consider the following array when For example, consider the following array
our task is to find two elements whose sum is $x=12$: with target sum $x=12$:
\begin{center} \begin{center}
\begin{tikzpicture}[scale=0.7] \begin{tikzpicture}[scale=0.7]
\draw (0,0) grid (8,1); \draw (0,0) grid (8,1);
@ -358,7 +365,7 @@ and the sum becomes $4+7=11$.
\end{center} \end{center}
After this, the left pointer moves one step forward again. After this, the left pointer moves one step forward again.
The right pointer doesn't move, and the solution The right pointer does not move, and a solution
$5+7=12$ has been found. $5+7=12$ has been found.
\begin{center} \begin{center}
@ -391,16 +398,16 @@ $5+7=12$ has been found.
\end{tikzpicture} \end{tikzpicture}
\end{center} \end{center}
At the beginning of the algorithm, The algorithm consists of two phases:
the sorting takes $O(n \log n)$ time. First, sorting the array takes $O(n \log n)$ time.
After this, the left pointer moves $O(n)$ steps After this, the left pointer moves $O(n)$ steps
forward, and the right pointer moves $O(n)$ steps forward, and the right pointer moves $O(n)$ steps
backward. Thus, the total time complexity backward. Thus, the total time complexity
of the algorithm is $O(n \log n)$. of the algorithm is $O(n \log n)$.
Note that it is possible to solve Note that it is possible to solve the problem
in another way in $O(n \log n)$ time using binary search. in another way in $O(n \log n)$ time using binary search.
In this solution, we iterate through the array In such a solution, we iterate through the array
and for each number, we try to find another and for each number, we try to find another
number such that the sum is $x$. number such that the sum is $x$.
This can be done by performing $n$ binary searches, This can be done by performing $n$ binary searches,
@ -408,8 +415,9 @@ and each search takes $O(\log n)$ time.
\index{3SUM-problem} \index{3SUM-problem}
A somewhat more difficult problem is A somewhat more difficult problem is
the \key{3SUM} problem where our task is the \key{3SUM-problem} that asks to
to find \emph{three} numbers whose sum is $x$. find \emph{three} numbers in the array
such that their sum is $x$.
This problem can be solved in $O(n^2)$ time. This problem can be solved in $O(n^2)$ time.
Can you see how it is possible? Can you see how it is possible?
@ -421,34 +429,33 @@ Amortized analysis is often used for
estimating the number of operations estimating the number of operations
performed for a data structure. performed for a data structure.
The operations may be distributed unevenly so The operations may be distributed unevenly so
that the most operations appear during a that most operations occur during a
certain phase in the algorithm, but the total certain phase of the algorithm, but the total
number of the operations is limited. number of the operations is limited.
As an example, let us consider a problem As an example, consider the problem
where our task is to find for each element of finding for each element
in an array the in an array the
\key{nearest smaller element}, i.e., \key{nearest smaller element}, i.e.,
the nearest smaller element that precedes the first smaller element that precedes
the element in the array. the element in the array.
It is possible that no such element exists, It is possible that no such element exists,
and the algorithm should notice this. in which case the algorithm should report this.
It turns out that the problem can be efficiently solved It turns out that the problem can be solved
in $O(n)$ time using a suitable data structure. in $O(n)$ time using an appropriate data structure.
An efficient solution for the problem is to An efficient solution to the problem is to
iterate through the array from the left to the right, iterate through the array from left to right,
and maintain a chain of elements where the and maintain a chain of elements where the
first element is the active element in the array first element is the current element,
and each following element is the nearest smaller and each following element is the nearest smaller
element of the previous element. element of the previous element.
If the chain only contains one element, If the chain only contains one element,
the active element doesn't have a nearest smaller element. the current element does not have the nearest smaller element.
At each step, we remove elements from the chain At each step, elements are removed from the chain
until the first element is smaller until the first element is smaller
than the active element, or the chain is empty. than the current element, or the chain is empty.
After this, the active element becomes the first element After this, the current element is added to the chain.
in the chain.
As an example, consider the following array: As an example, consider the following array:
\begin{center} \begin{center}
@ -476,11 +483,10 @@ As an example, consider the following array:
\end{tikzpicture} \end{tikzpicture}
\end{center} \end{center}
First, numbers 1, 3 and 4 are added to the chain First, the numbers 1, 3 and 4 are added to the chain,
because each element is larger than the previous element. because each number is larger than the previous number.
This means that the nearest smaller element of Thus, the nearest smaller element of 4 is 3,
number 4 is number 3 whose nearest smaller element and the nearest smaller element of 3 is 1.
is number 1.
\begin{center} \begin{center}
\begin{tikzpicture}[scale=0.7] \begin{tikzpicture}[scale=0.7]
\fill[color=lightgray] (2,0) rectangle (3,1); \fill[color=lightgray] (2,0) rectangle (3,1);
@ -510,10 +516,11 @@ is number 1.
\end{tikzpicture} \end{tikzpicture}
\end{center} \end{center}
The next number 2 is smaller than two first numbers in the chain. The next number 2 is smaller than the two first numbers in the chain.
Thus, numbers 4 and 3 are removed, and then number 2 becomes Thus, the numbers 4 and 3 are removed from the chain,
the first element in the chain. and then the number 2
Its nearest smaller element is number 1: is added to the chain.
Its nearest smaller element is 1:
\begin{center} \begin{center}
\begin{tikzpicture}[scale=0.7] \begin{tikzpicture}[scale=0.7]
\fill[color=lightgray] (3,0) rectangle (4,1); \fill[color=lightgray] (3,0) rectangle (4,1);
@ -542,9 +549,9 @@ Its nearest smaller element is number 1:
\end{tikzpicture} \end{tikzpicture}
\end{center} \end{center}
After this, number 5 is larger than number 2, After this, the number 5 is larger than the number 2,
so it will be added to the chain and so it will be added to the chain, and
its nearest smaller element is number 2: its nearest smaller element is 2:
\begin{center} \begin{center}
\begin{tikzpicture}[scale=0.7] \begin{tikzpicture}[scale=0.7]
\fill[color=lightgray] (4,0) rectangle (5,1); \fill[color=lightgray] (4,0) rectangle (5,1);
@ -574,22 +581,22 @@ its nearest smaller element is number 2:
\end{tikzpicture} \end{tikzpicture}
\end{center} \end{center}
Algorithm continues in a similar way The algorithm continues in the same way,
and finds out the nearest smaller element and finds the nearest smaller element
for each number in the array. for each number in the array.
But how efficient is the algorithm? But how efficient is the algorithm?
The efficiency of the algorithm depends on The efficiency of the algorithm depends on
the total time used for manipulating the chain. the total time used for manipulating the chain.
If an element is larger than the first If an element is larger than the first
element in the chain, it will only be inserted element in the chain, it is directly added
to the beginning of the chain which is efficient. to the chain, which is efficient.
However, sometimes the chain can contain several However, sometimes the chain can contain several
larger elements and it takes time to remove them. larger elements, and it takes time to remove them.
Still, each element is added exactly once to the chain Still, each element is added exactly once to the chain
and removed at most once. and removed at most once from the chain.
Thus, each element causes $O(1)$ operations Thus, each element causes $O(1)$ chain operations,
to the chain, and the total time complexity and the total time complexity
of the algorithm is $O(n)$. of the algorithm is $O(n)$.
\section{Sliding window minimum} \section{Sliding window minimum}
@ -597,33 +604,33 @@ of the algorithm is $O(n)$.
\index{sliding window} \index{sliding window}
\index{sliding window minimum} \index{sliding window minimum}
A \key{sliding window} is an active subarray A \key{sliding window} is a constant-size subarray
that moves through the array whose size is constant. that moves through the array.
At each position of the window, At each position of the window,
we typically want to calculate some information we typically want to calculate some information
about the elements inside the window. about the elements inside the window.
An interesting problem is to maintain An interesting problem is to maintain
the \key{sliding window minimum}. the \key{sliding window minimum},
This means that at each position of the window, which means that at each position of the window,
we should report the smallest element inside the window. we should report the smallest element inside the window.
The sliding window minima can be calculated The sliding window minimum can be calculated
using the same idea that we used for calculating using the same idea that we used for calculating
the nearest smaller elements. the nearest smaller elements.
The idea is to maintain a chain whose We maintain a chain whose
first element is the last element in the window, first element is always the last element in the window,
and each element is smaller than the previous element. and each element in the chain is smaller than the previous element.
The last element in the chain is always the The last element in the chain is always the
smallest element inside the window. smallest element inside the window.
When the sliding window moves forward and When the sliding window moves forward and
a new element appears, we remove all elements a new element appears, we remove from the chain
from the chain that are larger than the new element. all elements that are larger than the new element.
After this, we add the new number to the chain. After this, we add the new element to the chain.
In addition, if the last element in the chain Finally, if the last element in the chain
doesn't belong to the window anymore, it is removed from the chain. does not belong to the window anymore,
it is removed from the chain.
As an example, consider the following array As an example, consider the following array:
when the window size is $k=4$:
\begin{center} \begin{center}
\begin{tikzpicture}[scale=0.7] \begin{tikzpicture}[scale=0.7]
@ -650,7 +657,7 @@ when the window size is $k=4$:
\end{tikzpicture} \end{tikzpicture}
\end{center} \end{center}
The sliding window begins from the left border of the array. Suppose that the size of the sliding window is 4.
At the first window position, the smallest element is 1: At the first window position, the smallest element is 1:
\begin{center} \begin{center}
\begin{tikzpicture}[scale=0.7] \begin{tikzpicture}[scale=0.7]
@ -685,8 +692,9 @@ At the first window position, the smallest element is 1:
Then the window moves one step forward. Then the window moves one step forward.
The new number 3 is smaller than the numbers The new number 3 is smaller than the numbers
5 and 4 in the chain, so the numbers 5 and 4 5 and 4 in the chain, so the numbers 5 and 4
are removed and the number 3 is added to the chain. are removed from the chain
The smallest element is 1 as before. and the number 3 is added to the chain.
The smallest element is still 1.
\begin{center} \begin{center}
\begin{tikzpicture}[scale=0.7] \begin{tikzpicture}[scale=0.7]
\fill[color=lightgray] (1,0) rectangle (5,1); \fill[color=lightgray] (1,0) rectangle (5,1);
@ -716,8 +724,9 @@ The smallest element is 1 as before.
\end{tikzpicture} \end{tikzpicture}
\end{center} \end{center}
After this, the window moves again and the smallest element 1 After this, the window moves again,
doesn't belong to the window anymore. and the smallest element 1
does not belong to the window anymore.
Thus, it is removed from the chain and the smallest Thus, it is removed from the chain and the smallest
element is now 3. In addition, the new number 4 element is now 3. In addition, the new number 4
is added to the chain. is added to the chain.