Revision continues

2017-05-25 12:36:59 +03:00 · 2017-05-25 12:36:59 +03:00 · 0b998fc638
parent ace73e0bf5
commit 0b998fc638
1 changed files with 138 additions and 103 deletions
--- a/chapter10.tex
+++ b/chapter10.tex
@ -492,20 +492,35 @@ contains two such subgrids:
 There is an $O(n^3)$ time algorithm for solving the problem:
 go through all $O(n^2)$ pairs of rows and for each pair
-calculate the number of columns that contain a black
+$(a,b)$ calculate the number of columns that contain a black
 square in both rows in $O(n)$ time.
-Then, if there are $c$ such columns for a fixed row pair,
+The following code assumes that $\texttt{color}[y][x]$
-they account for $c(c-1)/2$ subgrids with black corners.
+denotes the color in row $y$ in column $x$:
 \begin{lstlisting}
 int count = 0;
 for (int i = 0; i < n; i++) {
    if (color[a][i] == BLACK && color[b][i] == BLACK) count++;
 }
 \end{lstlisting}
 Then, those columns
 account for $\texttt{count}(\texttt{count}-1)/2$ subgrids with black corners,
 because we can choose any two of them to form a subgrid.
 To optimize this algorithm, we divide the grid into blocks
 of columns such that each block consists of $N$
 consecutive columns. Then, each row is stored as
 a list of $N$-bit numbers that describe the colors
-of the squares.
+of the squares. Now we can process $N$ columns at the same time
-Using this representation,
+using bit operations. In the following code,
-we can process $N$ columns at the same time
+$\texttt{color}[y][k]$ represents
-using bit operations, and the resulting algorithm
+a block of $N$ colors as bits (0 is white and 1 is black).
-works in $O(n^3/N)$ time.
+\begin{lstlisting}
 int count = 0;
 for (int i = 0; i <= n/N; i++) {
    count += __builtin_popcount(color[a][i]&color[b][i]);
 }
 \end{lstlisting}
 The resulting algorithm works in $O(n^3/N)$ time.
 We generated a random grid of size $2500 \times 2500$
 and compared the original and bit-optimized implementation.
@ -520,110 +535,109 @@ Bit operations provide an efficient and convenient
 way to implement dynamic programming algorithms
 whose states contain subsets of elements,
 because such states can be stored as integers.
-Next we discuss some examples of combining
+Next we discuss examples of combining
 bit operations and dynamic programming.
-\subsubsection{Paths in a grid}
+\subsubsection{Optimal selection}
-As a first example, consider
+As a first example, consider the following problem:
-an $n \times n$ grid where
+We are given the prices of $k$ products
-each square contains an integer.
+over $n$ days, and we want to buy each product
-Our task is to check if there is a path from the upper-left
+exactly once.
-corner to the lower-right corner
+However, we are allowed to buy at most one product
-such that we only move right and down
+in a day.
-and each integer between 0 and $k$ appears on the path.
+What is the minimum total price?
-
+For example, consider the following scenario ($k=3$ and $n=8$):
 For example, in the following grid,
 there is a path that contains all
 integers between 0 and $k$:
 \begin{center}
 \begin{tikzpicture}[scale=.65]
-  \begin{scope}
+    \draw (0, 0) grid (8,3);
-    \fill [color=lightgray] (0, 9) rectangle (1, 8);
+    \node at (-2.5,2.5) {product $A$};
-    \fill [color=lightgray] (0, 8) rectangle (1, 7);
+    \node at (-2.5,1.5) {product $B$};
-    \fill [color=lightgray] (1, 8) rectangle (2, 7);
+    \node at (-2.5,0.5) {product $C$};
-    \fill [color=lightgray] (1, 7) rectangle (2, 6);
+
-    \fill [color=lightgray] (2, 7) rectangle (3, 6);
+    \foreach \x/\v in {0/6,1/9,2/5,3/2,4/8,5/9,6/1,7/6}
-    \fill [color=lightgray] (3, 7) rectangle (4, 6);
+        {\node at (\x+0.5,2.5) {\v};}
-    \fill [color=lightgray] (4, 7) rectangle (5, 6);
+    \foreach \x/\v in {0/8,1/2,2/6,3/2,4/7,5/5,6/7,7/2}
-    \fill [color=lightgray] (4, 6) rectangle (5, 5);
+        {\node at (\x+0.5,1.5) {\v};}
-    \fill [color=lightgray] (4, 5) rectangle (5, 4);
+    \foreach \x/\v in {0/5,1/3,2/9,3/7,4/3,5/5,6/1,7/4}
-    \draw (0, 4) grid (5, 9);
+        {\node at (\x+0.5,0.5) {\v};}
-    \node at (0.5,8.5) {2};
+\end{tikzpicture}
-    \node at (1.5,8.5) {0};
+\end{center}
-    \node at (2.5,8.5) {3};
+In this scenario, the minimum total price is $5$:
-    \node at (3.5,8.5) {1};
+\begin{center}
-    \node at (4.5,8.5) {1};
+\begin{tikzpicture}[scale=.65]
-    \node at (0.5,7.5) {0};
+    \fill [color=lightgray] (1, 1) rectangle (2, 2);
-    \node at (1.5,7.5) {1};
+    \fill [color=lightgray] (3, 2) rectangle (4, 3);
-    \node at (2.5,7.5) {4};
+    \fill [color=lightgray] (6, 0) rectangle (7, 1);
-    \node at (3.5,7.5) {2};
+    \draw (0, 0) grid (8,3);
-    \node at (4.5,7.5) {0};
+    \node at (-2.5,2.5) {product $A$};
-    \node at (0.5,6.5) {3};
+    \node at (-2.5,1.5) {product $B$};
-    \node at (1.5,6.5) {0};
+    \node at (-2.5,0.5) {product $C$};
-    \node at (2.5,6.5) {2};
+
-    \node at (3.5,6.5) {4};
+    \foreach \x/\v in {0/6,1/9,2/5,3/2,4/8,5/9,6/1,7/6}
-    \node at (4.5,6.5) {1};
+        {\node at (\x+0.5,2.5) {\v};}
-    \node at (0.5,5.5) {2};
+    \foreach \x/\v in {0/8,1/2,2/6,3/2,4/7,5/5,6/7,7/2}
-    \node at (1.5,5.5) {1};
+        {\node at (\x+0.5,1.5) {\v};}
-    \node at (2.5,5.5) {0};
+    \foreach \x/\v in {0/5,1/3,2/9,3/7,4/3,5/5,6/1,7/4}
-    \node at (3.5,5.5) {1};
+        {\node at (\x+0.5,0.5) {\v};}
    \node at (4.5,5.5) {3};
    \node at (0.5,4.5) {0};
    \node at (1.5,4.5) {2};
    \node at (2.5,4.5) {3};
    \node at (3.5,4.5) {3};
    \node at (4.5,4.5) {1};
  \end{scope}
 \end{tikzpicture}
 \end{center}
-We assume that the rows and columns are numbered
+Let $\texttt{price}[x][d]$ denote the price of product $x$
-from 1 to $n$, and $\texttt{value}[x][y]$
+on day $d$ where $0 \le x < k$ and $0 \le d < n$.
-denotes the value at position $(x,y)$.
+For example, in the above scenario $\texttt{price}[2][3] = 7$.
-It turns out that the problem can be solved in $O(n^2 2^k)$ time
+Then, let $\texttt{total}(S,d)$ denote the minimum total
-using dynamic programming.
+price for buying a subset $S$ of products by day $d$.
 Using this function, the solution to the problem is
 $\texttt{total}(\{0 \ldots k-1\},n-1)$.
-Let $\texttt{possible}(x,y,S)$ be a function
+First, $\texttt{total}(\emptyset,d) = 0$,
-that indicates whether there is a path
+because it does not cost anything to buy an empty set,
-from the upper-left square to square $(x,y)$ such that
+and $\texttt{total}(\{x\},0) = \texttt{price}[x][0]$,
-all values in $S$ appear on the path.
+because there is one way to buy one product on the first day.
-Thus, $\texttt{possible}(n,n,\{0 \ldots k\})$
+Then, the following recurrence can be used:
 tells us whether a desired path exists.
 The values of the function can be calculated using
 the following recurrence:
 \begin{equation*}
 \begin{aligned}
-\texttt{possible}(x,y,\emptyset) & = & \textrm{true} \\
+\texttt{total}(S,d) & = \min( & \texttt{total}(S,d-1), \\
-\texttt{possible}(x,y,S) & = & \texttt{possible}(x-1,y,S \setminus \texttt{value}[x][y]) \lor \\
+& & \min_{x \in S} \texttt{total}(S \setminus x,d-1)+\texttt{price}[x][d]))
                    &  & \texttt{possible}(x,y-1,S \setminus \texttt{value}[x][y]) \\
 \end{aligned}
 \end{equation*}
-The base case states that there is always a path that
+This means that we either do not buy any product on day $d$
-does not contain any integers.
+or buy a product $x$ that belongs to $S$.
-Then, in the recursive case we consider both directions
+In the latter case, we remove $x$ from $S$ and add the
-and remove $\texttt{value}[x][y]$
+price of $x$ to the total price.
 from $S$, because we can collect it in square $(x,y)$.
-We can implement the dynamic programming using an array
+The next step is to calculate the values of the function
 using dynamic programming.
 To store the function values, we declare an array
 \begin{lstlisting}
-bool possible[N][N][1<<K];
+int total[1<<K][N];
 \end{lstlisting}
-where $N$ and $K$ are suitably large constants.
+where $K$ and $N$ are suitably large constants.
-Then, we can calculate the values
+The first dimension of the array corresponds to a bit
-of the function as follows:
+representation of a subset.
 First, the cases where $d=0$ can be processed as follows:
 \begin{lstlisting}
-for (int x = 0; x <= n; x++) {
+for (int x = 0; x < k; x++) {
-    for (int y = 0; y <= n; y++) {
+    total[1<<x][0] = price[x][0];
-        possible[x][y][0] = true;
+}
-        if (x == 0 || y == 0) continue;
+\end{lstlisting}
-        for (int b = 1; b < (1<<k); b++) {
+Then, the recurrence translates into the following code:
-            possible[x][y][b] = possible[x-1][y][b&~value[x][y]] ||
+\begin{lstlisting}
-                                  possible[x][y-1][b&~value[x][y]];
+for (int d = 1; d < n; d++) {
    for (int s = 0; s < (1<<k); s++) {
        total[s][d] = total[s][d-1];
        for (int x = 0; x < k; x++) {
            if (s&(1<<x)) {
                total[s][d] = min(total[s][d],
                                    total[s^(1<<x)][d-1]+price[x][d]);
            }
        }
    }
 }
 \end{lstlisting}
 The time complexity of the algorithm is $O(n 2^k k)$.
 \subsubsection{From permutations to subsets}
@ -631,14 +645,14 @@ Using dynamic programming, it is often possible
 to change an iteration over permutations into
 an iteration over subsets\footnote{This technique was introduced in 1962
 by M. Held and R. M. Karp \cite{hel62}.}.
-The benefit of this technique is that
+The benefit of this is that
 $n!$, the number of permutations of an $n$ element set,
 is much larger than $2^n$, the number of subsets
 of the same set.
 For example, if $n=20$, then
 $n! \approx 2.4 \cdot 10^{18}$ and $2^n \approx 10^6$.
 Thus, for certain values of $n$,
-we can efficiently go through subsets but not through permutations.
+we can efficiently go through the subsets but not through the permutations.
 As an example, consider the following problem:
 There is an elevator with maximum weight $x$,
@ -675,25 +689,25 @@ The idea is to calculate for each subset of people
 two values: the minimum number of rides needed and
 the minimum weight of people who ride in the last group.
-Let $\texttt{rides}(X)$ and $\texttt{weight}(X)$ denote
+Let $\texttt{rides}(S)$ and $\texttt{weight}(S)$ denote
 the minimum number of rides and the minimum
-weight of the last group, where $X$ is a subset
+weight of the last group, where $S$ is a subset
 of people. For example,
 \[ \texttt{rides}(\{B,D,E\})=2 \hspace{10px} \textrm{and}
 \hspace{10px} \texttt{weight}(\{B,D,E\})=5,\]
 because the optimal rides are $\{B,E\}$ and $\{D\}$,
 and the second ride has weight 5.
 Of course, our final goal is to calculate the value
-of $\texttt{rides}(\{A,B,C,D,E\})$ that is the solution
+of $\texttt{rides}(X)$ where $X$ is a set that
-to the problem.
+contains all the people.
 We can calculate the values
 of the functions recursively and then apply
 dynamic programming.
 The idea is to go through all people
-that belong to $X$ and optimally
+who belong to $S$ and optimally
 choose the last person who enters the elevator.
-For example, if $X=\{B,D,E\}$,
+For example, if $S=\{B,D,E\}$,
 one of $B$, $D$ and $E$ is the last person
 who enters the elevator.
 Each such choice yields a subproblem
@ -706,19 +720,40 @@ for (int b = 0; b < (1<<n); b++) {
 }
 \end{lstlisting}
 to go through the subsets,
-because if $X$ and $Y$ are two subsets
+because if $S_1$ and $S_2$ are two subsets
-and $X \subset Y$,
+and $S_1 \subset S_2$,
-then $X$ comes before $Y$ in the above order.
+then $S_1$ comes before $S_2$ in the above order.
 \subsubsection{Counting subsets}
 Our last problem in this chapter is as follows:
-Let $X=\{0,1,\ldots,n-1\}$, and each subset $S \subset X$,
+Let $X=\{0 \ldots n-1\}$, and each subset $S \subset X$,
 is assigned an integer $\texttt{value}(S)$.
 Our task is to calculate for each $S$
 \[\texttt{sum}(S) = \sum_{A \subset S} \texttt{value}(A),\]
 i.e., the sum of values of subsets of $S$.
 For example, suppose that $n=3$ and the values are as follows:
 \begin{multicols}{2}
 \begin{itemize}
 \item $\texttt{value}(\emptyset) = 3$
 \item $\texttt{value}(\{0\}) = 1$
 \item $\texttt{value}(\{1\}) = 4$
 \item $\texttt{value}(\{0,1\}) = 5$
 \item $\texttt{value}(\{2\}) = 5$
 \item $\texttt{value}(\{0,2\}) = 1$
 \item $\texttt{value}(\{1,2\}) = 3$
 \item $\texttt{value}(\{0,1,2\}) = 3$
 \end{itemize}
 \end{multicols}
 In this case, for example,
 \begin{equation*}
 \begin{split}
 \texttt{sum}(\{0,2\}) &= \texttt{value}(\emptyset)+\texttt{value}(\{0\})+\texttt{value}(\{2\})+\texttt{value}(\{0,2\}) \\ 
                      &= 3 + 1 + 5 + 1 = 10.
 \end{split}
 \end{equation*}
 Because there are a total of $2^n$ subsets,
 one possible solution is to go through all
 pairs of subsets in $O(2^{2n})$ time.