CS 8761 Natural Language Processing - Fall 2002 - MRD + Web => Corpus
Final Project - Stage I - Due, Wed Nov 27, noon
This may be revised in response to your questions. Last update Wed
Nov 20 7:00 pm
Objectives
Revise you Beta submission in response to any comments I may have made to
your group individually, and also the following general requests that
apply to all teams.
Specification
Make sure that perldoc shows me the syntax of your functions. This can
be via the SYNOPSIS feature or you can provide it via some other
mechanism.
Provide at least 20 cases that will be executed via "make test".
Provide a simple menu driven interface that allows a user to use your
module and query the dictionary without having to write code. Provide
options that allow output to be written to a file. Make sure that all the
major functionality of your program is supported, including the features
below.
Add the following functionality to your modules if they don't already
have it.
- LDOCE and Big Mac: Find all the definitions that include a given set
of words. The set will consist of 1 or more words. "Label" each
definition with the headword, part of speech, and THES or subject code
that pertain to that definition.
- LDOCE: Given a word, find all the other words that share the same
subject code. If a word has multiple subject codes, keep a separate list
of words for each. "Label" each list with the given word and the subject
code. (The subject codes are designated as FIELD).
- Big Mac: Given a word, find all the other words that share the same
value. If a word has multiple values, keep a separate list
of words for each. "Label" each list with the given word and the
value.
To summarize then, by noon on Wed Nov 27, you should submit a revised
version of both of your interfaces that reflects the following:
1) syntax/examples of ALL supported functions included in perldoc
documentation.
2) "make test" should execute at least 20 cases for both LDOCE and Big Mac.
3) provide interface program to LDOCE and Big Mac modules.
4) add functionality described above.
Submission Guidelines
Submit your modules to the web drop in two distinct tar files named for
your team. Please submit your entire package. Once unpacked things should
be structured such that I can install your module using the standard "3
step CPAN" install. The three steps are as follows.
perl Makefile.PL
make
make test
I should not have to do anything else to get each of your modules installed. I should be able
to include it in a program via the use command. Please provide some example usages in your
documentation, and of course your test files will show how to use the code as well (see
QueryData again as an example).
Make sure you team name, individual names, date, etc. are included in your source code.
Your code may well end being distributed via CPAN so provide appropriate info about
copyrights, distribution, etc.
Submit your LDOCE and Big Mac interfaces separately. Only submit 2 per team. Coordinate with
your teammates so you don't have multiple submissions.
This is a team assignment. You are strongly advised to divide up the
work of the project into tasks that can be carried out in parallel by
various team members. All team members should be acknowledged in the
comments, etc. and all teammates will receive the same grade. Do not work
with other teams. Each team should operate independent of all the
other teams. Make your own decisions as a team and do not be
influenced by the decisions of other teams if you happen to hear of
them accidentally. You are free to work with your teammates as closely
as is necessary.
by:
Ted Pedersen
- tpederse@umn.edu