Data Structures and Algorithms: CHAPTER 1: Design and Analysis of Algorithms

Finally, a note on our typesetting conventions for programs. Pascal reserved words are in boldface, types are in roman, and procedure, function, and variable names are in italic. We distinguish between upper and lower case letters.

Exercises

1.1	There are six teams in the football league: the Vultures, the Lions, the Eagles, the Beavers, the Tigers, and the Skunks. The Vultures have already played the Lions and the Eagles; the Lions have also played the Beavers and Skunks. The Tigers have played the Eagles and Skunks. Each team plays one game per week. Find a schedule so that all teams will have played each other in the fewest number of weeks. Hint. Create a graph whose vertices are the pairs of teams that have not yet played each other. What should the edges be so that in a legal coloring of the graph, each color can represent the games played in one week?
*1.2	Consider a robot arm that is fixed at one end. The arm contains two elbows at each of which it is possible to rotate the arm 90 degrees up and down in a vertical plane. How would you mathematically model the possible movements of the end of the arm? Describe an algorithm to move the end of the robot arm from one permissible position to another.
*1.3	Suppose we wish to multiply four matrices of real numbers M₁ × M₂ × M₃ × M₄ where M₁ is 10 by 20, M₂ is 20 by 50, M₃ is 50 by 1, and M₄ is 1 by 100. Assume that the multiplication of a p × q matrix by a q × r matrix requires pqr scalar operations, as it does in the usual matrix multiplication algorithm. Find the optimal order in which to multiply the matrices so as to minimize the total number of scalar operations. How would you find this optimal ordering if there are an arbitrary number of matrices?
**1.4	Suppose we wish to partition the square roots of the integers from 1 to 100 into two piles of fifty numbers each, such that the sum of the numbers in the first pile is as close as possible to the sum of the numbers in the second pile. If we could use two minutes of computer time to help answer this question, what computations would you perform in those two minutes?
1.5	Describe a greedy algorithm for playing chess. Would you expect it to perform very well?
1.6	In Section 1.2 we considered an ADT SET, with operations MAKE-NULL, UNION, and SIZE. Suppose for convenience that we assume all sets are subsets of {0, 1, . . . , 31} and let the ADT SET be interpreted as the Pascal data type set of 0..31. Write Pascal procedures for these operations using this implementation of SET.
1.7	The greatest common divisor of two integers p and q is the largest integer d that divides both p and q evenly. We wish to develop a program for computing the greatest common divisor of two integers p and q using the following algorithm. Let r be the remainder of p divided by q. If r is 0, then q is the greatest common divisor. Otherwise, set p equal to q, then q equal to r, and repeat the process. Show that this process does find the correct greatest common divisor. Refine this algorithm into a pseudo-language program. Convert your pseudo-language program into a Pascal program.
1.8	We want to develop a program for a text formatter that will place words on lines that are both left and right justified. The program will have a word buffer and a line buffer. Initially both are empty. A word is read into the word buffer. If there is sufficient room in the line buffer, the word is transferred to the line buffer. Otherwise, additional spaces are inserted between words in the line buffer to fill out the line, and then the line buffer is emptied by printing the line. Refine this algorithm into a pseudo-language program. Convert your pseudo-language program to a Pascal program.
1.9	Consider a set of n cities and a table of distances between pairs of cities. Write a pseudo-language program for finding a short path that goes through each city exactly once and returns to the city from which it started. There is no known method for obtaining the shortest such tour except by exhaustive searching. Thus try to find an efficient algorithm for this problem using some reasonable heuristic.
1.10	Consider the following functions of n: Indicate for each distinct pair i and j whether f_i(n) is O(f_j(n)) and whether f_i(n) is Ω(f_j(n)).
1.11	Consider the following functions of n: Indicate for each distinct pair i and j whether g_i(n) is O(g_j(n)) and whether g_i(n) is Ω(g_j(n)).
1.12	Give, using "big oh" notation, the worst case running times of the following procedures as a function of n.

    a)
    procedure matmpy ( n: integer );
        var
            i, j, k: integer;
        begin
            for i := 1 to n do
                for j := 1 to n do begin
                    C[i, j] := 0;
                    for k := 1 to n do
                        C[i, j] := C[i, j] + A[i, k] * B[k, j]
                end
        end

    b)
    procedure mystery ( n: integer );
        var
            i, j, k: integer;
        begin
            for i := 1 to n-1 do
                for j := i + 1 to n do
                    for k := 1 to j do
                        { some statement requiring O(1) time }
        end

    c)
    procedure veryodd ( n: integer );
        var
            i, j, x, y: integer;
        begin
            for i := 1 to n do
                if odd(i) then begin
                    for j := i to n do
                        x := x + 1;
                    for j := 1 to i do
                        y := y + 1
                end
        end

    d)
    function recursive ( n: integer ): integer;
        begin
            if n <= 1 then
                return (1)
            else
                return (recursive(n-1) + recursive(n-1))
        end
1.13	Show that the following statements are true. a) 17 is O(1). b) n(n-1)/2 is O(n²). c) max(n³, 10n²) is O(n³). e) If p(x) is any kth degree polynomial with a positive leading coefficient, then p(n) is O(n^k) and Ω(n^k).
*1.14	Suppose T₁(n) is Ω(f(n)) and T₂(n) is Ω(g(n)). Which of the following statements are true? a) T₁(n) + T₂(n) is Ω(max(f(n), g(n))). b) T₁(n)T₂(n) is Ω(f(n)g(n)).
*1.15	Some authors define big omega by saying f(n) is Ω(g(n)) if there is some n₀ and c > 0 such that for all n ≥ n₀ we have f(n) ≥ cg(n). a) Is it true for this definition that f(n) is Ω(g(n)) if and only if g(n) is O(f(n))? b) Is (a) true for the definition of big omega in Section 1.4? c) Does Exercise 1.14(a) or (b) hold for this definition of big omega?
1.16	Order the following functions by growth rate: (a) n, (b) √n, (c) log n, (d) log log n, (e) log² n, (f) n/log n, (g) √n log² n, (h) (1/3)ⁿ, (i) (3/2)ⁿ, (j) 17.
1.17	Assume the parameter n in the procedure below is a positive power of 2, i.e., n = 2, 4, 8, 16, . . . . Give the formula that expresses the value of the variable count in terms of the value of n when the procedure terminates.

    procedure mystery ( n: integer );
        var
            x, count: integer;
        begin
            count := 0;
            x := 2;
            while x < n do begin
                x := 2 * x;
                count := count + 1
            end;
            writeln(count)
        end
1.18	Here is a function max(i, n) that returns the largest element in positions i through i+n-1 of an integer array A. You may assume for convenience that n is a power of 2.

    function max ( i, n: integer ): integer;
        var
            m1, m2: integer;
        begin
            if n = 1 then
                return (A[i])
            else begin
                m1 := max(i, n div 2);
                m2 := max(i+n div 2, n div 2);
                if m1 < m2 then
                    return (m2)
                else
                    return (m1)
            end
        end

	Let T(n) be the worst-case time taken by max with second argument n. That is, n is the number of elements of which the largest is found. Write an equation expressing T(n) in terms of T(j) for one or more values of j less than n and a constant or constants that represent the times taken by individual statements of the max program. Give a tight big oh upper bound on T(n). Your answer should be equal to the big omega lower bound, and be as simple as possible.
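
As an illustration of the cost model used in Exercise 1.3, the following sketch prices a given multiplication order at pqr scalar operations per product. It is written in Python rather than the book's Pascal purely for brevity. The names multiply_cost, chain_cost, and SHAPES, and the encoding of an order as nested pairs of matrix indices, are assumptions made here for illustration. The sketch only evaluates an order handed to it; it does not search for the optimal one.

```python
def multiply_cost(shape_a, shape_b):
    """Cost and result shape of multiplying a p x q matrix by a q x r matrix."""
    p, q = shape_a
    q_other, r = shape_b
    if q != q_other:
        raise ValueError("inner dimensions do not match")
    return p * q * r, (p, r)


def chain_cost(order, shapes):
    """Return (total scalar operations, result shape) for a parenthesization.

    `order` is either a matrix index (int) or a pair (left, right) of
    sub-orders; this encoding is an assumption of the sketch.
    """
    if isinstance(order, int):
        return 0, shapes[order]          # a single matrix costs nothing
    left, right = order
    left_cost, left_shape = chain_cost(left, shapes)
    right_cost, right_shape = chain_cost(right, shapes)
    product_cost, shape = multiply_cost(left_shape, right_shape)
    return left_cost + right_cost + product_cost, shape


# Dimensions from Exercise 1.3: M1 is 10x20, M2 is 20x50, M3 is 50x1, M4 is 1x100.
SHAPES = {1: (10, 20), 2: (20, 50), 3: (50, 1), 4: (1, 100)}

left_to_right = (((1, 2), 3), 4)   # ((M1 M2) M3) M4
right_to_left = (1, (2, (3, 4)))   # M1 (M2 (M3 M4))

print(chain_cost(left_to_right, SHAPES)[0])   # 11500
print(chain_cost(right_to_left, SHAPES)[0])   # 125000
```

The two orders shown differ by more than a factor of ten, which is why the choice of order is worth computing. Neither is claimed to be the best of the five possible orders.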

Bibliographic Notes

The concept of an abstract data type can be traced to the class type in the language SIMULA 67 (Birtwistle et al. [1973]). Since that time, a variety of other languages that support abstract data types have been developed, including Alphard (Shaw, Wulf, and London [1977]), C with classes (Stroustrup [1982]), CLU (Liskov et al. [1977]), MESA (Geschke, Morris, and Satterthwaite [1977]), and Russell (Demers and Donahue [1979]). The ADT concept is further discussed in works such as Gotlieb and Gotlieb [1978] and Wulf et al. [1981].

Knuth [1968] was the first major work to advocate the systematic study of the running time of programs. Aho, Hopcroft, and Ullman [1974] relate the time and space complexity of algorithms to various models of computation, such as Turing machines and random-access machines. See also the bibliographic notes to Chapter 9 for more references to the subject of analysis of algorithms and programs.

For additional material on structured programming see Hoare, Dahl, and Dijkstra [1972], Wirth [1973], Kernighan and Plauger [1974], and Yourdon and Constantine [1975]. Organizational and psychological problems arising in the development of large software projects are discussed in Brooks [1974] and Weinberg [1971]. Kernighan and Plauger [1981] show how to build useful software tools for a programming environment.

‡ We distinguish the abstract data type SET from the built-in set type of Pascal.

† The record has no known name because it was created by a call new(header), which made header point to this newly-created record. Internal to the machine, however, there is a memory address that can be used to locate the cell.

† Note the asymmetry between big-oh and big-omega notation. The reason such asymmetry is often useful is that there are many times when an algorithm is fast on many but not all inputs. For example, there are algorithms to test whether their input is of prime length that run very fast whenever that length is even, so we could not get a good lower bound on running time that held for all n ≥ n₀.
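
The behavior described in the footnote above can be observed with a small sketch. It uses plain trial division, which is only one possible test and is chosen here as an assumption for illustration, and counts how many divisibility checks are made. The function names and the step counter are hypothetical. It is written in Python rather than the book's Pascal for brevity.

```python
def is_prime_with_steps(n):
    """Trial division; returns (is_prime, number of divisibility checks made)."""
    if n < 2:
        return False, 0
    steps = 0
    d = 2
    while d * d <= n:
        steps += 1
        if n % d == 0:
            return False, steps      # found a divisor, stop early
        d += 1
    return True, steps


def has_prime_length(text):
    """Test whether the length of the input is prime, as in the footnote."""
    return is_prime_with_steps(len(text))


print(is_prime_with_steps(1000000))   # even length: (False, 1)
print(is_prime_with_steps(97))        # prime length: (True, 8)
```

Every even length greater than 2 is rejected after a single check, however large it is, while a prime length forces the loop to run up to the square root. A lower bound that must hold for all sufficiently large n is therefore dragged down by the even cases.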
