Text summarization

A topic for collecting resources about text summarization resources.



MEAD is the most elaborate publicly available platform for multi-lingual summarization and evaluation.The platform implements multiple summarization algorithms such as position-based, centroid-based, largest common subsequence, and keywords.

The methods for evaluating the quality of the summaries are both intrinsic and extrinsic. MEAD implements a battery of summarization algorithms, including baselines (lead-based and random) as well as centroid-based and query-based methods.




meaddoc.pdf (248.6 KB)

More details

MEAD is a publicly available toolkit for multi-lingual summarization and evaluation. The toolkit implements multiple summarization algorithms (at arbitrary compression rates) such as position-based, Centroid[RJB00], TF*IDF, and query-based methods. Methods for evaluating the quality of the summaries include co-selection (precision/recall, kappa, and relative utility) and content-based measures (cosine, word overlap, bigram overlap).

MEAD v1.0 and v2.0 were developed at the University of Michigan in 2000 and early 2001. MEAD v3.01 – v3.06 were written in the summer of 2001, an eight-week summer workshop on Text Summarization was held at Johns Hopkins University.

MEAD is written in Perl and requires several XML-related Perl modules and an external software package to run. Also, a number of other modules and packages can be used in various extensions of MEAD. A full list of these software packages is included in the Downloading and Installation Section.

MEAD is intended to be used with multiple languages. The MEAD system can summarize English documents on all POSIX-conforming operating systems (Unix, Linux, etc.). MEAD can also summarize Mandarin Chinese documents on some POSIX operating systems.