Task #19
Organizing team:
Collin Baker (International Computer Science Institute, Berkeley)
Michael Ellsworth (International Computer Science Institute, Berkeley)
Katrin Erk (U Texas, Austin)
We have been asked whether we could extend the deadline for submission of participants' results on Task 19, Frame Semantic Structure Extraction. We believe that we can do a fast turnaround and get the evaluations out quickly, so we are extending the deadline for submission of results to Friday, April 6th, at 06:59 GMT. Note that this is 7 hours later than the extension announced previously.
We should still be able to give back evaluations to participants by April 10th, leaving a week to write up the papers on each system.
We recognize that this is a difficult task, and we hope that the extension will make it possible for participants to produce more complete results. Please note that the extension does not override the 21/7 day limits for the training/testing data; it just means that participants will have until the 21/7 day limit or April 6th at 06:59 GMT, whichever is earlier.
If there are any questions about the format for the submission of results, the nature of the evaluation, etc., please feel free to contact us, either by email or by posting a question on the SemEval section of the FrameNet forum on the public website. (Note that in order to post to the forum, you will need to create a login for yourself on the public website; this login is separate from any other account, but creating it only takes a few minutes.)
This task consists of recognizing words and phrases that evoke
semantic frames of the sort defined in the FrameNet project (http://framenet.icsi.berkeley.edu),
and their semantic dependents, which are usually, but not always,
their syntactic dependents (including subjects). For example, in the
sentence
A number of automatic systems have been built to do this sort of annotation, which can be divided into three parts:
The evaluation will measure precision and recall for frames and frame elements, with partial credit for incorrect but closely related frames. This is a more advanced version of the Automatic Semantic Role Labeling task of Senseval-3 (Litkowski 2004).
Shalmaneser comes with pre-trained parameter sets based on the
FrameNet Release 1.3 data, and has recently been modified to read FN
full-text annotation as well. Downloads of Shalmaneser are available
at the SALSA website:
http://www.coli.uni-saarland.de/projects/salsa/shal/
Note that the output from Shalmaneser is in SALSA/TIGER
XML, rather than FrameNet full-text XML.
Task-specific Software
N.B. The versions of the software on this website are more recent
than those on the SemEval website. Please see the SemEval
forum for a discussion of the minimal file format required.
Here is the DTD that is needed for validation; you will need to
save it to a file and place this file in a directory called "docs",
one level up from the directory containing the full-text annotation
files:
corpusV1_2.dtd
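As an illustration of the layout described above, the following Python sketch computes where the DTD file would be expected, given the directory that holds the full-text annotation files. The function name and the example path are hypothetical; only the directory name "docs" and the file name corpusV1_2.dtd come from the description above.

```python
from pathlib import PurePosixPath


def expected_dtd_path(annotation_dir: str,
                      dtd_name: str = "corpusV1_2.dtd") -> PurePosixPath:
    """Return the location of the DTD: a "docs" directory one level up
    from the directory containing the full-text annotation files."""
    return PurePosixPath(annotation_dir).parent / "docs" / dtd_name


# Hypothetical example directory; substitute your own.
print(expected_dtd_path("/data/semeval/fulltext"))
```

With the example directory above, the DTD would be saved as /data/semeval/docs/corpusV1_2.dtd.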
fttosem.pl: Reads the FrameNet full-text annotation XML and converts it to the dependency XML that is the basis of the semantic dependency evaluation.
SynSemTreeMaker.pm: Required by fttosem.pl. Last revised 23 Mar, 2007.
SynSemDocumentation.txt: Documentation for the above.
There is a separate script,
semDep2dot,
that reads the semantic dependency XML and converts it to the
"dot" format used by Graphviz (www.graphviz.org), making it
possible to produce graphics that are easier for humans to read than
the underlying XML.
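To give a rough idea of what such a conversion produces, here is a minimal Python sketch that writes a list of labeled dependency edges in "dot" syntax. It is not semDep2dot and does not parse the semantic dependency XML; the (head, label, dependent) triples, the function name, and the graph name are assumptions made up for illustration.

```python
from typing import Iterable, Tuple


def to_dot(edges: Iterable[Tuple[str, str, str]], name: str = "semdep") -> str:
    """Render (head, label, dependent) triples as a Graphviz digraph."""

    def quote(text: str) -> str:
        # Escape backslashes and double quotes so the dot ID stays valid.
        return '"' + text.replace("\\", "\\\\").replace('"', '\\"') + '"'

    lines = [f"digraph {name} {{"]
    for head, label, dependent in edges:
        lines.append(
            f"  {quote(head)} -> {quote(dependent)} [label={quote(label)}];"
        )
    lines.append("}")
    return "\n".join(lines)


# Hypothetical edges: a frame-evoking word and two of its dependents.
example_edges = [("bought", "Buyer", "Kim"), ("bought", "Goods", "car")]
print(to_dot(example_edges))
```

Saving the printed text to a file such as example.dot and running `dot -Tpng example.dot -o example.png` would render it as an image.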
We have written a script,
fssScore.pl,
for scoring your results against the training data as described in
the FS dependency evaluation specification. It calculates precision,
recall, and F1 score, per file (or optionally per sentence).
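For readers unfamiliar with these measures, the following Python sketch shows how precision, recall, and F1 are computed from exact matches between gold-standard and proposed items. It is not the fssScore.pl implementation; the function name and the (sentence, word, label) triples are assumptions chosen only for illustration.

```python
from typing import Hashable, Iterable, Tuple


def precision_recall_f1(gold: Iterable[Hashable],
                        predicted: Iterable[Hashable]) -> Tuple[float, float, float]:
    """Exact-match precision, recall, and F1 over two collections of items."""
    gold_set, pred_set = set(gold), set(predicted)
    matched = len(gold_set & pred_set)
    precision = matched / len(pred_set) if pred_set else 0.0
    recall = matched / len(gold_set) if gold_set else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical items: (sentence id, word, label) triples.
gold = [(1, "bought", "Commerce_buy"), (1, "Kim", "Buyer"), (1, "car", "Goods")]
pred = [(1, "bought", "Commerce_buy"), (1, "Kim", "Buyer"), (1, "car", "Theme")]
p, r, f = precision_recall_f1(gold, pred)
print(f"P={p:.3f} R={r:.3f} F1={f:.3f}")  # P=0.667 R=0.667 F1=0.667
```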
There is now also a script which calculates precision, recall, and
F1 score for the flat, labels-only evaluation:
srlScore.pl. This implementation
is quite simple, and requires exact matches for both boundaries and
labels of the target itself (i.e. the frame label) and the FEs (i.e. the
FE labels). We are in the process of implementing a new algorithm that
gives partial credit when the boundaries are correct and the proposed
frame, although not the gold-standard frame, is close to it according
to the frame-to-frame relations in frRelation.xml.
The amount of partial credit will depend on the number of links
between the proposed frame and the gold-standard one.
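One way such link-based partial credit could work is sketched below in Python. This is only an illustration under stated assumptions, not the algorithm being implemented: it treats frame-to-frame relations as undirected links, counts the links on the shortest path between the proposed and gold-standard frames, and multiplies the credit by a hypothetical decay factor of 0.8 per link. The frame names are placeholders, and the sketch does not read frRelation.xml.

```python
from collections import deque
from typing import Dict, Iterable, Optional, Set, Tuple


def build_graph(relations: Iterable[Tuple[str, str]]) -> Dict[str, Set[str]]:
    """Build an undirected adjacency map from (frame, frame) relation pairs."""
    graph: Dict[str, Set[str]] = {}
    for a, b in relations:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def link_distance(graph: Dict[str, Set[str]], start: str, goal: str) -> Optional[int]:
    """Breadth-first search: number of links on the shortest path, or None."""
    if start == goal:
        return 0
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        for neighbour in graph.get(node, ()):
            if neighbour == goal:
                return dist + 1
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append((neighbour, dist + 1))
    return None


def frame_credit(graph: Dict[str, Set[str]], proposed: str, gold: str,
                 decay: float = 0.8) -> float:
    """Full credit for the gold frame, decayed credit per link otherwise.
    The decay value is an assumption, not taken from the task definition."""
    dist = link_distance(graph, proposed, gold)
    return 0.0 if dist is None else decay ** dist


# Placeholder frames and relations, for illustration only.
toy_graph = build_graph([("Frame_A", "Frame_B"), ("Frame_B", "Frame_C")])
for proposed in ("Frame_A", "Frame_B", "Frame_C", "Frame_D"):
    print(proposed, round(frame_credit(toy_graph, proposed, "Frame_A"), 2))
```

In this toy graph, a proposed frame one link away from the gold-standard frame earns 0.8, one two links away earns 0.64, and an unconnected frame earns nothing.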
Note that both fssScore and srlScore require a copy of
frames.xml, part of the regular FN data release. Both scoring scripts
include a flag (-n) causing them to ignore named entities in scoring
precision and recall. This has been implemented with minimal change
to the script, that is, by simply setting the value of NE matches to
0; if the "verbose" option is used, named entities will be listed as a
category, but their total will always be zero.
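The effect of the flag on the per-category totals can be pictured with the short Python sketch below. It only mimics the behaviour described above and is not taken from the scoring scripts; the function name and the category labels ("Frame", "FE", "NE") are assumptions.

```python
from collections import Counter
from typing import Dict, Iterable


def tally_matches(match_categories: Iterable[str],
                  ignore_named_entities: bool = False,
                  ne_category: str = "NE") -> Dict[str, int]:
    """Count matches per category; optionally zero out named entities.

    With the flag set, the NE category is still listed (as in the
    "verbose" output described above), but its total is always 0.
    """
    totals = Counter(match_categories)
    if ignore_named_entities:
        totals[ne_category] = 0
    return dict(totals)


# Hypothetical list of matched items, one category label per match.
matches = ["Frame", "NE", "FE", "NE"]
print(tally_matches(matches))
print(tally_matches(matches, ignore_named_entities=True))
```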
Mailing list and Forum
A Mailman mailing
list has been set up at ICSI for those interested in this task;
posting to the list is limited to those who have subscribed. Please
go to
http://mailman.icsi.berkeley.edu/mailman/listinfo/fn-semeval to
subscribe.
A forum
has been set up within the regular FrameNet website for SemEval
questions and answers and comments. Please ask questions there if
possible; this will help us to get the answers out both to registered
participants and to others who are interested, but not actually
participating.
Systems and Results
Summary information about systems and results will be filled in here
after the testing period.
Bibliography
Coming soon.
For more information, visit the SemEval-2007 home page.
Last modified: Tue Apr 3 15:28:55 PDT 2007