A Multi-level Annotated Corpus of Scientific Papers for Scientific Document Summarization


Related work sections or literature reviews are an essential part of every scientific article being crucial for paper reviewing and assessment. The automatic generation of related work sections can be considered an instance of the multi-document summarization problem. In order to allow the study of this specific problem, we have developed a manually annotated, machine readable data-set of related work sections, cited papers (e.g. references) and sentences, together with an additional layer of papers citing the references.

You can download the corpus from the resources page.

For more information, please contact us.