CS140 -- Lab 5
- CS140 -- Data Structures
- James S. Plank
- Spring, 2014
- Directory for files, makefile, executables and gradescript:
/home/plank/cs140/Labs/Lab5
- This is a two week lab, but parts of it are due after week 1.
- For week 1, cases 1-28 and 101-120 must work (you might find that
a few other cases work, but that will be pure luck).
You are not allowed...
You are not allowed to modify bitmatrix.h or
bitmatrix_editor.cpp.
Bit-matrices
This lab is all about bit-matrices. These are matrices whose values can only be
zero or one, and arithmetic is modulo 2. You would be surprised how powerful
these are. They are used in fault-tolerant storage applications, among other
things, which is why I am fond of them.
Let's take some simple examples. Here are four bit-matrices:
When you add bit-matrices, you simply add elements in corresponding rows and columns, modulo
two. So, for example:
and
Bit-matrix multiplication is just like standard matrix multiplication,
except that addition and multiplication of the individual elements is
done modulo two. For example, multiplying
RV.txt and
CV.txt will yield a 1 X 1 matrix whose value is:
(1*1) + (0*0) + (1*1) + (0*1) + (0*1) + (0*0) + (0*0) + (1*1)
This is 1+0+1+0+0+0+0+1 which equals 3%2 which equals 1. Thus:
Similarly:
and
That last example is interesting -- when the product of two square matrices
is equal to the identity matrix, then these two matrices are inverses
of each other. Some matrices, like the one below, have no inverses:
Bitmatrix.h
You have a big job. Behold
bitmatrix.h:
#include <string>
#include <iostream>
#include <vector>
using namespace std;
class Bitmatrix {
public:
Bitmatrix(string fn); // Read from a file
Bitmatrix(int rows, int cols); // Create an empty bitmatrix
void Print(int w); // Print it on standard ouptput with spaces & blank lines
void Write(string fn); // Write it to a file either as hex or zeros & ones
void PGM(string fn, int pixels, int border); // Write it to a pgm file
int Rows();
int Cols();
void Set(int row, int col, char val); // Set the specified element to val
char Val(int row, int col); // Return the specified element
void Swap_Rows(int r1, int r2);
void R1_Plus_Equals_R2(int r1, int r2);
Bitmatrix *Copy();
protected:
vector <string> M; // The matrix. Elements are '0' or '1'.
};
Bitmatrix *Sum(Bitmatrix *a1, Bitmatrix *a2);
Bitmatrix *Product(Bitmatrix *a1, Bitmatrix *a2);
Bitmatrix *Sub_Matrix(Bitmatrix *a1, vector <int> &rows);
Bitmatrix *Inverse(Bitmatrix *m);
class HTE {
public:
string key;
Bitmatrix *bm;
};
typedef vector <HTE *> HTVec;
class BM_Hash {
public:
BM_Hash(int size);
void Store(string &key, Bitmatrix *bm);
Bitmatrix *Recall(string &key);
HTVec All();
protected:
vector <HTVec> table;
};
|
The Bitmatrix class implements methods that allow you to create, read, write
and manipulate bit-matrices. Here are the methods:
- Bitmatrix(int rows, int cols): Create a bitmatrix of the specified size
whose entries are all zeros. If rows or cols is less than or equal to
zero, print an error message, but still create a 1 X 1 bit-matrix whose entry is zero.
- Bitmatrix(string fn): Read a bit-matrix from a file. The file should contain
only zeros, ones, spaces and newlines, and each line should either be blank, or it
should contain the same number of non-spaces as all other lines. I've written this
one for you, so you don't need to sweat about it too much.
- void Write(string fn): Write the bit-matrix to a file. Here, just write
out the zeros and ones with no spaces. Each row of the bit-matrix gets one row of output.
- void Print(int w): Print the matrix on standard output. If w is less than
or equal to zero, print each row on its own line with no spaces. Otherwise, print a space
after every w columns and a blank line after every w rows. I've written
this one for you.
- void PGM(string fn, int pixels, int border): Create a PGM file from the bit-matrix.
This is how I made the pictures above. The zero entries are white (255) and the
one entries are gray (100). Each entry is a pixels by pixels square. If border
is greater than zero, then there should be a black border of that many pixels separating each
square and around the whole matrix. The above matrices were created with PGM(xxx, 10, 2).
- int Rows(): Return the number of rows in the bit-matrix.
- int Cols(): Return the number of columns in the bit-matrix.
- void Set(int row, int col, char val): Set the element in the specified row and
column (zero-indexed, of course) to the given value. Val can be either 0 or '0' to
specify zero, and either 1 or '1' to specify one. The character your program
actually stores should be a '0' or a '1'. Hence if val is either the
integer 0 or 1, as opposed to the integer 48 or 49, which represent '0' and
'1', then
you need to convert it to '0' or '1'. You can do so by adding '0' to val.
If row or col are bad,
simply do nothing -- your code should not be erroneous if given bad values.
- char Val(int row, int col): Return the specified value -- '0' should
be returned as the integer 0 and '1' should be returned as the integer 1. Even
though Val returns a char, you can have your return statement return the
integer 0 or the integer 1 and it will work properly.
This also means that when you use Val to retrieve a value from
a matrix, and you want to store its return value back into another matrix
(e.g., the Copy method will need to do this),
you will need to first convert the return value to the character
'0' or '1'. You can do this by
adding '0' to the return value.
Val() should return -1 if given bad input.
- void Swap_Rows(int r1, int r2): Swap the specified rows. This should do nothing
if given bad values.
- void R1_Plus_Equals_R2(int r1, int r2) Set row r1 to be the sum of
rows r1 and r2. Again, this should do nothing if given bad values. It's
ok for r2 to equal r1.
- Bitmatrix *Copy() Create a new bit-matrix that is a copy of the given bit-matrix
and return a pointer to it. To do this, you'll have to create a new empty bit-matrix of
the proper size, and then use Set() to set its elements properly.
The only piece of data in a bitmatrix is a vector of strings named M.
If M is storing a r by c matrix, then it will contain r
strings, each of which has c characters. The characters are either '0' for zero
or '1' for one.
So, you have to implement 10 methods (there are 12, but I've implemented two of them
for you), and only one of them (PGM()) is difficult. Were I you, I'd wait until the
end to do PGM().
You have to implement four procedures which operate on pointers. Since these are not
part of the data structure, you have to use Rows() Cols(), Set(),
and Val() to implement them. You should not modify the input matrices
in any way.
- Bitmatrix *Sum(Bitmatrix *a1, Bitmatrix *a2): This creates a new bit-matrix which is the sum of a1 and a2. If a1 and a2 are
not the same size, return NULL.
- Bitmatrix *Product(Bitmatrix *a1, Bitmatrix *a2): This creates a new bit-matrix which is the product of a1 and a2. This product will have a1->Rows()
rows and a2->Cols() columns. If a1->Cols()
and a2->Rows() do not match, then return NULL.
- Bitmatrix *Sub_Matrix(Bitmatrix *a1, vector <int> &rows): This
creates a new bit-matrix composed of the specified rows of the given bit-matrix.
It is ok to repeat entries in rows. However, if rows is empty or contains
bad indices, return NULL.
- Bitmatrix *Inverse(Bitmatrix *a1): Create and return the inverse of a1.
To do this, you should also use the Swap_Rows() and R1_Plus_Equals_R2() methods.
I'll go into more detail on how to invert a bit-matrix below. If a1 is not square or
not invertible, return NULL.
The first three of these are very easy. Inverse() will be tougher.
Finally, you are also going to implement a hash table to store bit-matrices with keys
that are strings. You should use the djb_hash() function from class as the
hash function and you should use separate chaining as the collision resolution
mechanism. Your hash table is a vector of vectors of pointers to HTE's (hash
table entries). Each hash table entry stores a key and a pointer to a bit-matrix.
The BM_Hash class has four methods:
- BM_Hash(int size): The constructor specifies the size of the table.
- void Store(string &key, Bitmatrix *bm): Store the given key and bit-matrix
in the hash table. If the key is already there, replace the bit-matrix with the
given one.
- Bitmatrix *Recall(string &key); return the bit-matrix corresponding to the
given key. If the key is not in the hash table, then return NULL.
- HTVec All(): Return a vector of all hash table entries in the table. The
vector should be ordered just like the hash table. In other words, suppose "A" hashes to 5,
"B" hashes to 1 and "C" hashes to 1. And suppose that "B" was added to the table before "C".
Then All() should return the HTE's in the order "B", "C", "A".
You should not call new or delete on bit-matrices when you implement any
of the hash table methods. You should call new when you create a new HTE.
That is the only time that you will call new in the hash table methods.
Starter Code
The file
bitmatrix_start.cpp
provides dummy implementations for everything in the lab, with the exception of
Bitmatrix(string fn) and Print(). The nice thing about it is that
it will compile with any program that uses bitmatrix.h. However, it won't
run correctly. It's a good starting place for you. Make sure that you
copy bitmatrix_start.cpp to bitmatrix.cpp before you submit your code!.
The testing program matrix_editor.cpp
I've written
bitmatrix_editor.cpp as a program that uses
all of the features of bitmatrix.h. You run it with an optional prompt
as a command line argument. If you don't specify a prompt, it will not print a
prompt. This is nice because you can treat the editor as an interactive editor,
or you can write scripts for it.
Copy bitmatrix_editor.cpp, bitmatrix.h and makefile to your
directory. Then
copy bitmatrix_start.cpp to bitmatrix.cpp and type make:
UNIX> make
g++ -c bitmatrix_editor.cpp
g++ -c bitmatrix.cpp
g++ -o bitmatrix_editor bitmatrix_editor.o bitmatrix.o
UNIX>
Now you have a version of bitmatrix_editor that only works for what
I have implemented. However, let's explore those. Go ahead and copy
the following matrix files too:
C003.txt,
C244.txt,
CM4_3.txt,
CV.txt,
RV.txt,
t3.txt and
t4.txt.
Bitmatrix_editor reads lines of text from standard input. Blank lines
and lines that begin with '#' are ignored. Otherwise, the first word on a line
is a command and the remaining words are arguments.
At all times, there is one "current matrix." You may also store and recall matrices
with single-word keys. Three simple commands are:
- READ filename: This reads a bit-matrix from a given file using the Read()
method. If there is an error, it will return a single-element matrix whose element is zero.
- PRINT [w]: This calls the Print() method on the current matrix.
If an argument is specified, it is the value of w that is passed to the Print()
method. Otherwise, 0 is passed.
- QUIT: Exits. You can also simply close standard input with CNTL-D.
Since I have implemented these for you in bitmatrix_start.cpp, they will work
without any extra effort from you:
UNIX> bitmatrix_editor "BM-Editor>"
BM-Editor> READ C003.txt
BM-Editor> PRINT
10000001
11000000
01100001
00110001
00011001
00001100
00000110
00000011
BM-Editor> PRINT 4
1000 0001
1100 0000
0110 0001
0011 0001
0001 1001
0000 1100
0000 0110
0000 0011
BM-Editor> READ RV.txt
BM-Editor> PRINT
10100001
BM-Editor> PRINT 4
1010 0001
BM-Editor> QUIT
UNIX>
bitmatrix_editor also implements the following commands, which of course will only
work when you implement the proper methods and procedures:
- EMPTY rows cols: Creates a bit-matrix of the given size whose
values are all zeros. It sets the current matrix to this matrix.
- SET row col value: Sets the given element of the current
matrix to the given value.
- VAL row col: Returns the value of the given element.
- SWAP r1 r2: Swaps the given rows of the current matrix using Swap_Rows().
- += r1 r2: Sets r1 of the current matrix
equal to r1+r2 using the
R1_Plus_Equals_R2() method.
- WRITE fn: Calls the Write() method to write the current matrix
to the specified file.
- PGM fn pixels border: Calls the PGM() method
to create a PGM picture of the current matrix.
- STORE key: Makes a copy of the current matrix and stores it into the
hash table with the given key. If the key is already in the hash table, it will delete
the bit-matrix before putting the copy into the hash table.
- RECALL key: If the key is in the hash table, this will set the
current matrix to a copy of the bit-matrix stored in the hash table.
- ALL: This calls All() and prints out each key, along with
the dimensions of the bit-matrix stored there.
- SUM key1 key2: This adds the two matrices stored with the
given keys and puts the sum into the current matrix.
- PRODUCT key1 key2: This multiplies the two matrices stored with the
given keys and puts the product into the current matrix.
- SUBMATRIX row1 row2 ...: This creates a submatrix of the
current matrix with the specified rows, and replaces the current matrix with the submatrix.
- INVERT: This inverts the current matrix and replaces it with the inverse.
With these last four commands, bitmatrix_editor checks to make sure that you haven't
modified the input matrices.
Some examples with the above matrices:
UNIX> bitmatrix_editor 'BM-Editor>'
BM-Editor> READ C003.txt
BM-Editor> STORE C003
BM-Editor> READ C244.txt
BM-Editor> STORE C244
BM-Editor> READ CV.txt
BM-Editor> STORE CV
BM-Editor> READ RV.txt
BM-Editor> STORE RV
BM-Editor> READ t3.txt
BM-Editor> STORE t3
BM-Editor> READ t4.txt
BM-Editor> STORE t4
BM-Editor> RECALL RV
BM-Editor> PRINT
10100001
BM-Editor> ALL
CV 8 X 1
RV 1 X 8
t3 16 X 8
t4 8 X 16
C003 8 X 8
C244 8 X 8
BM-Editor> SUM C003 C003
BM-Editor> PRINT
00000000
00000000
00000000
00000000
00000000
00000000
00000000
00000000
BM-Editor> SUM C003 C244
BM-Editor> PRINT
11111110
11111111
10000001
00111110
11100001
11110000
11111000
11111100
BM-Editor> PRODUCT RV CV
BM-Editor> PRINT
1
BM-Editor> PRODUCT C003 C244
BM-Editor> PRINT
10000000
01000000
00100000
00010000
00001000
00000100
00000010
00000001
BM-Editor> PRODUCT C003 CV
BM-Editor> PRINT
0
1
0
1
1
1
0
1
BM-Editor> RECALL t3
BM-Editor> PRINT 8
10000001
11000000
01100001
00110001
00011001
00001100
00000110
00000011
01111111
00111111
11100000
00001111
11111000
11111100
11111110
11111111
BM-Editor> RECALL t4
BM-Editor> PRINT 8
10000001 01111111
11000000 00111111
01100001 11100000
00110001 00001111
00011001 11111000
00001100 11111100
00000110 11111110
00000011 11111111
BM-Editor> PRODUCT t3 t4
BM-Editor> PRINT 8
10000010 10000000
01000001 01000000
10100010 00100000
01010011 00010000
00101011 00001000
00010101 00000100
00001010 00000010
00000101 00000001
10000000 11010101
01000000 11101010
00100000 10100000
00010000 00000101
00001000 01010111
00000100 10101011
00000010 01010101
00000001 10101010
BM-Editor> PRODUCT t4 t3
BM-Editor> PRINT 8
01010111
10101011
00000010
01010110
01111100
10111110
01011111
10101111
BM-Editor> RECALL C003
BM-Editor> INVERT
BM-Editor> PRINT
01111111
00111111
11100000
00001111
11111000
11111100
11111110
11111111
BM-Editor> RECALL C244
BM-Editor> INVERT
BM-Editor> PRINT
10000001
11000000
01100001
00110001
00011001
00001100
00000110
00000011
BM-Editor> PRODUCT CV RV
BM-Editor> PRINT
10100001
00000000
10100001
10100001
10100001
00000000
00000000
10100001
BM-Editor> INVERT
Matrix not invertible.
BM-Editor> QUIT
UNIX>
Inverting a bit-matrix
Inverting a bit-matrix is easier than doing a general matrix inversion.
However, the steps are the same. Suppose you want to invert the square
matrix M. What you do is make a copy of M, and create
an identity matrix of the same size as M. Call this matrix Inv.
Then you perform "SWAP"
and "+=" operations on the copy of M to turn it into an identity matrix.
You perform the exact same operations on Inv. When you're done, the
inverse of the original matrix is in Inv
Let's perform an example on the matrix in
Inv-Ex.txt:
10110
01011
10011
11001
01100
|
You first go through each row of M from the first to the last, doing the
following steps:
- Suppose you are at row i. If M[i][i] is not one, then find
a row j where j > i such that M[j][i] equals one, and
swap rows i and j. If you can't find such a row, the matrix is
not invertible.
- Now, look at every row j such that j > i. If M[j][i] equals
one, then set row j equal to the sum of rows i and j.
We'll do this for our example:
Action |
M |
Inv |
Start |
|
|
i=0
No swap necesary.
Set row 2 = row 2 + row 0. |
|
|
i=0 still.
Set row 3 = row 3 + row 0. |
|
|
i=1
No swap necesary.
Set row 3 = row 3 + row 1. |
|
|
i=1 still.
Set row 4 = row 4 + row 1. |
|
|
i=2
Swap row 2 and row 3 |
|
|
i=2 still.
Set row 4 = row 4 + row 2. |
|
|
i=3
Swap row 3 and row 4 |
|
|
When you're done with this pass, M will be an upper-triangular matrix.
Now, you start with the last row and go to the first row. Suppose you are
in row i:
- If there is any j > i where M[i][j] equals one, replace
row i with the sum of row i and row j.
When you are done with this step, M will be the identity matrix, and Inv
will be the inverse of the original M:
Action |
M |
Inv |
Start:
i=4
No action necessary. |
|
|
i=3
Set row 3 = row 3 + row 4. |
|
|
i=2
No action necessary. |
|
|
i=1
Set row 1 = row 1 + row 3. |
|
|
i=1 still.
Set row 1 = row 1 + row 4. |
|
|
i=0
Set row 0 = row 0 + row 2. |
|
|
i=0 still.
Set row 0 = row 0 + row 3. |
|
|
Finally, let's double-check ourselves:
BM-Editor> READ Inv-Ex.txt
BM-Editor> STORE Inv-Ex
BM-Editor> INVERT
BM-Editor> PRINT
01101
11011
11010
00111
10100
BM-Editor> STORE Inv
BM-Editor> PRODUCT Inv Inv-Ex
BM-Editor> PRINT
10000
01000
00100
00010
00001
BM-Editor>
Gradescript Rubric
First, this lab will count double.
The gradescript will have up to 200 test cases. 30 percent of them will test matrix inversion
and 20 percent of them will test the PGM file. Your PGM doesn't have to match mine in format;
however, its size and pixels must match mine exactly.