Assignment 1: Buffer Manager
Due: October 7, 1999
50 Points
Instructor: Rich Maclin
You will carry out this assignment, and subsequent ones, in teams of two. The teams will be selected by the instructor.
You should begin by reading the chapter on Disks and Files, to get an overview of buffer management. This material will also be covered in class. In addition, HTML documentation is available for Minibase, which you can read using Netscape. Follow this link to the Minibase home page. In particular, you should read the description of the DB class, which you will call extensively in this assignment. The header file for the DB class is in db.h
Each student doing the assignment should set their MINIBASE_ROOT envirornment variable to point at the Minibase installation. The homework assignment directories are located relative to it. Use this command to set the MINIBASE_ROOT environment variable
setenv MINIBASE_ROOT /usr/local/minibase/minibase-2.0The following command can be used to set the path
set path = (/usr/local/minibase/bin $path)To make a local copy of the files, the student should:
1) Copy the appropriate Makefile from the 'src' directory to the directory they will implement the assignment in.The 'src' file is located in /usr/local/minibase/mini_hwk/assign/BufMgr/src directory
2) Execute: make setup to copy the files over. This will give an error if they have not set their MINIBASE_ROOT variable.
3) Implement the assignment, typing 'make' to attempt to build the test program.Important Note: The Minibase buffer manager interface differs from what you have to implement (described below) in that it contains methods that are required to support concurrency control and recovery.
The methods that you have to implement are described below:
class BufMgr { public: // This is made public just because we need it in your // driver test.C . It could be private for real use. Page* bufPool; // The physical buffer pool of pages. public: BufMgr(int numbuf); // Allocate pages (frames) for the pool in main memory. ~BufMgr(); // Should flush all dirty pages in the pool to // disk before shutting down and deallocate the // buffer pool in main memory. Status pinPage(PageId PageId_in_a_DB, Page*& page,int emptyPage=0); // Check if this page is in buffer pool. If it is, increment the pin_count // and return a pointer to this page. If the pin_count was 0 before the // call, the page was a replacement candidate, but is no longer a candidate; // be sure to remove this from the LRU list of candidates. // If the page is not in the pool, choose a frame (from the set of replacement // candidates) to hold this page, read the page (using // the appropriate DB class method) and pin it. // Also, must write out the old page in chosen frame if it is dirty // before reading new page. (You can assume that emptyPage == 0 for // this assignment.) Status unpinPage(PageId globalPageId_in_a_DB, int dirty=FALSE); // Should be called with dirty==TRUE if the client has // modified the page. If so, this call should set the dirty bit // for this frame. Further, if pin_count>0 should decrement it, and if it // becomes zero, should update the LRU list by adding an entry for this frame. // If pin_count=0 before this call, return error. Status newPage(PageId& firstPageId, Page*& firstpage,int howmany=1); // Call DB object to allocate a run of new pages and // find a frame in the buffer pool for the first page // and pin it. (This call allows a client of the Buffer Manager // to allocate pages on disk.) If buffer is full, i.e., you // can't find a frame for the first page, ask DB to deallocate // all these pages, and return error. Status freePage(PageId globalPageId); // This method should be called to delete a page that is on disk. // This routine must call the DB class to deallocate the page. Status flushPage(int pageId); // Used to flush a particular page of the buffer pool to disk // Should call the write_page method of the DB class };
The buffer pool is a collection of frames (page-sized sequence of main memory bytes) that is managed by the Buffer Manager. It should be stored as an array bufPool[numbuf] of Page objects.
In addition, you should maintain an array bufDescr[numbuf] of descriptors, one per frame. Each descriptor is a record with the following fields:
The pin_count field is an
integer, page number is a PageId object, and dirtybit
is a boolean. This describes the page that is stored in the corresponding frame.
A page is identified by a page number that is
generated by the DB class when the page is allocated, and is unique over all
pages in the database.
The PageId type is defined as an integer type in minirel.h.
A simple hash table should be used to figure out what frame a given disk page occupies. The hash table should be implemented (entirely in main memory) by using an array of pointers to lists of <page number, frame number> pairs. The array is called the directory and each list of pairs is called a bucket. Given a page number, you should apply a hash function to find the directory entry pointing to the bucket that contains the frame number for this page, if the page is in the buffer pool. If you search the bucket and don't find a pair containing this page number, the page is not in the pool. If you find such a pair, it will tell you the frame in which the page resides. This is illustrated in Figure 1.
The hash function must distribute values in the domain of the search field uniformly over the collection of buckets. If we have HTSIZE buckets, numbered 0 through M-1, a hash function h of the form h(value) = (a*value+b) mod HTSIZE works well in practice. HTSIZE should be chosen to be a prime number.
When a page is requested the buffer manager should do the following:
To implement the LRU replacement policy, maintain a queue of frame numbers. When a frame is to be chosen for replacement, you should pick the first frame in this list. A frame number is added to the end of the queue when the pin_count for the frame is decremented to 0. A frame number is removed from the queue if the pin_count becomes non-zero: this could happen if there is a request for the page currently in the frame!
http://www.cs.wisc.edu/coral/minibase/system/error.html
You should turn in copies of your code together with copies of the output produced by running the tests provided by the TAs. The Solution will be made public after the due date. Solutions turned at a later date will not receive any credit. So be sure to turn in whatever you have, even if it is not fully working at that time.