Class Notes: COP3530

Class Notes: Data Structures and Algorithms

Summer-C 1999 - M WRF 2nd Period CSE/E119

Instructor: M.S. Schmalz -- TAs: TA Mailing List

Project 5 Assignment

Project 5 consists of (a) writing a Java application that performs timing measurement on sorting and tree traversal algorithms, then (b) summarizing experimental results and analyses in a 4-5 page report.

Important Note: This should not entail writing of new sorting or tree traversal code, as you already did this for Projects 3 and 4. Hence, the effort for this project involves (i) inserting timing calls into your code, (ii), measuring runtime for different-sized inputs, (iii) benchmarking various operations in Java on your computer, (iv) constructing a complexity budget and prediction of runtime from your previous analysis of algorithms (homework and class notes), and (v) comparing the measured and predicted performance.

Answers to student questions are included below in red typeface, to distinguish them from the assignment text. Note the clarification of total computation time versus total algorithm runtime.

Goal. The purpose of this project is to provide basic hands-on experience with Java programming for timing measurement, and intermediate-level experience with algorithm performance analysis. In particular, we will be tying together the principles and techniques we have learned thus far, to produce a practical result, and to teach hands-on measurement and analysis skills that will be valuable in your industrial or academic careers.
The program you develop as Project 5 will (a) read a sequence of integers from stdin as in Project 1, (b) store the sequence in an array or linked list, then (c) according to two command-line switches, run a sorting or tree traversal algorithm using various data structures, as you did for Projects 3 and 4. While running the program, you will (d) measure timing data for enough iterations of the algorithm to ensure 99 percent accuracy (one percent error) given the one millisecond resolution of the UNIX timing function(s). Your program will (e) output the correct information for the algorithm or data structure selected, per Programming Assignments 3 and 4 specification, then will (f) output the measured or computed time intervals (specified below), obtained by dividing the measured times by the number of iterations. An example of the input file for Project #5 to be used for all algorithms in this project, is given below.
When the TAs test the program, the numbers will be piped in from a text file (denoted by inFile), and the command line will be as follows:
java Proj5 dswitch aswitch < inFile
where dswitch is one of the following switches: -avl for AVL search tree or -bst for BST, -arr for array or -dll for doubly-linked list; and aswitch is one of the following switches: -b for BFS or -d for DFS, -h for histogram-sort, -i for insertion-sort, or -m for merge-sort.
Clearly, BFS and DFS can only be used with AVL search tree or BST, and array and DLL can only be used with one of the sorting algorithms. If you want to have a better chance of getting points for the quality of your code, then construct an error detection and reporting method that will detect mismatches between the dswitch and aswitch values (i.e., conditions that do not meet the preceding structure-method associations).
Note that inFile, a text file that you enter using a word processor, will be stored on disk. Your program will read the input sequence by command-line piping of that file to stdin.
More information about the test procedure is posted below under Example Test Procedure.
Features. We will be making a modular program with object-oriented design and implementation, as well as resuable Java code, as follows.
Functionality. The program implemented as a Java Class called Proj5 (must be capitalized) that will perform the following steps:
Example Test Procedure. Type up a text file "inFile" in either Dos or Unix with one value per line, like this:
```
22          [Input number 1]
36          [Input number 2]
54          [Input number 3]
14          [Input number 4]
17          [Input number 5]
65          [Input number 6]
E           [Specifies end of input sequence]
14          [Number to be deleted from tree -- ignored by sorting methods]
```
When making a big file (e.g., 10,000 inputs) you should write a program to generate unique random numbers. You can do this with two nested for loops whose indices reference primes. Hint: Recall the Fundamental Theorem of Mathematics from Discrete Math class.
Now run your program this way:
java Proj5 dswitch aswitch < inFile

where dswitch and aswitch were described previously.
Required Programming Procedure. Define each class in a separate .java file, with your master file as Proj5.java, where Proj5, which has a main method, is the class name that the TAs will use to run your program. (It is o.k. to have a main method in each class - the TAs are primarily interested in the main method in the Proj3 class.)
Other Hints on Programming Procedure. Use the following steps to guide your implementation of the Java version of the preceding program.
Documentation. Illustrate your program with in-line comments, so the TAs know what you are doing. Consult your textbook and other Java programming references provided on the course Web page and elsewhere (e.g., Java books in library) for style guides.

Evaluation. Maximum score = 150 points, as follows:

  For each of the ten structure-method pairs (six for sorting, four for trees):
    Code present and clear                  1 points max.
    Code compiles correctly                 1 points max.
    Cmd-line switches work correctly        1 points max.
    Screen output displayed correctly       1 points max.
    Timing calls inserted correctly         3 points max.
    Correct computation of timing data      2 points max.
    Correct display of timing data          1 points max.
  TOTAL POINTS PER DataStructure/Method Pair:        10 points max.
  ----------------------------------------------------------------
  TOTAL PROGRAM POINTS                              100 points max.

  For the Report:  (For each of ten structure-method pairs)
    Tabular data for structure-method pair  1 points max.
    Graph of timing data for each s-m pair  1 points max.
    Work budget for structure-method pair   1 points max.
    Complexity prediction for each sm pair  1 points max.
    Comparison of prediction and data       1 points max.
  ---------------------------------------------------------------
  TOTAL POINTS PER DataStructure/Method Pair:        10 points max.
  TOTAL REPORT POINTS                                50 points max.

Note that we have "raised the bar" on this assignment. For example, if your code is present as specified, but doesn't compile and isn't documented, you would get 1 point per structure-method pair for which the code is present (10 points max.) If the code is present, well documented, compiles, reads in data, and outputs the result correctly (but doesn't do any timing measurement or data display) then you could get as many as 40 points total.

Submission Procedure. Please submit only your *.java files. You must be logged in to your CISE account to perform the submission procedure, which is described in detail at this link.
In summary, type in the following commands within your directory to submit all the java files for Project 5:
turnin -c cop3530 -p proj5 *.java {Return}
where {Return} denotes the carriage return key.
Important Note: In case you have developed your programs on any environment other than the CISE Unix system, please test your program on a CISE Unix machine before submission. We will only be testing programs for grading purposes on the CISE Unix machines. The excuse that you could not get into the CISE Lab will not be accepted, since you can remote log-in to all CISE and CIRCA labs 24 hours per day.
Special Note from Lloyd Noronha:
Before starting any programming project, please make sure that there are no unnecessary files in your "dev" directory. Make sure you only submit the files necessary for compiling and running your project. You will lose points for compilation errors in any file you submit, even if it is not relevant. We will compile and run your project as follows :
```
javac *.java
java Proj5 -avl -i < inFile
```
So please make sure that you test run your project as above in your directory just before your submission.
Due Date. Submit the completed project by MIDNIGHT on Thursday 5 August 1999. No late submissions will be permitted, except for documented excuses (medical, police, fire, judicial, death of immediate family member, or military service).
This means that the submission program will be turned off on midnite of the due date. Due to tight grade submission deadlines for the Summer-C Semester, we have to allow enough time to grade the assignments (and final exams).

This concludes the description of Project #5. Use the E-mail link at the top of this Web page to ask the TAs, if you have any questions about programming or grading.