Selected Publications
These eight publications are my answer to the following question :
"What should I read to find out about your current research
interests, and what you are likely to be working on in the next
few years?" I made these selections in 2008, so more recent work isn't
included, and it's possible my perspective has changed a bit since then.
If you are interested in the more current answer please ask!
2008
-
Empiricism is Not a Matter of Faith (Pedersen),
Computational Linguistics,
Volume 34, Number 3, pp. 465-470, September 2008.
[Journal Citation Reports Index Factor 2007: 2.367]
[This explains why we release our
software,
and how we develop it so
that it's both possible and productive to do so. The goal is to
promote greater sharing of software so that experimental results
can be reproduced reliably and quickly.]
2007
2006
-
A Comparative Study of Supervised Learning as Applied to Acronym
Expansion in Clinical Reports (Joshi, Pakhomov, Pedersen, and
Chute)
- Appears in the Proceedings of the Annual Symposium of the American
Medical Informatics Association, pp. 399-403, Nov 11-16, 2006,
Washington, DC. [acceptance rate 41%]
[This shows that acronym expansion in medical text can be
effectively handled using
supervised
learning
techniques first developed
for word sense disambiguation of general English text.]
-
Unsupervised Corpus Based Methods for WSD (Pedersen), In Agirre, E.
and Edmonds, P. (Editors), Word Sense
Disambiguation : Algorithms and Applications, June 2006, pp.
133-166, Springer.
[This describes the foundations of
SenseClusters,
which clusters written contexts based on their lexical similarity. Various
related approaches are also discussed, including Latent Semantic Analysis.]
2005
-
Name
Discrimination by Clustering Similar Contexts (Pedersen, Purandare,
and Kulkarni) - Appears in the Proceedings of the Sixth International
Conference on Intelligent Text Processing and Computational
Linguistics, p. 220-231, February 13-19, 2005, Mexico City. [acceptance
rate 37%]
[We show how to use
SenseClusters to
discover the identities associated with a name mentioned multiple
times in a text.]
2004
2003
-
Extended Gloss Overlaps as a Measure of Semantic Relatedness
(Banerjee and Pedersen) - Appears in the Proceedings of the Eighteenth
International Joint Conference on Artificial Intelligence,
pp. 805-810, August 9-15, 2003, Acapulco, Mexico. [acceptance rate
21%]
[
We introduce the extended gloss overlap measure (aka
lesk in WordNet::Similarity). The goal is to measure how related two concepts are based on the similarity of their definitions as found
in WordNet.
]
-
Using Measures of Semantic Relatedness for Word Sense Disambiguation
(Patwardhan, Banerjee and Pedersen) - Appears in the Proceedings of the
Fourth International Conference on Intelligent Text Processing and
Computational Linguistics,
pp. 241-257, February 17-21, 2003, Mexico City. [acceptance rate 46%]
[
We introduce
WordNet::Similarity and
WordNet::SenseRelate,
both of which remain very active projects. The goal is to measure the
similarity of concepts based on WordNet,
and to use that information to assign each word in a running text the sense
that is most related to its neighbors.
]
By:
Ted Pedersen
- tpederse AT d umn edu