A Dataset of Reader-Aware Multi-Document Summarization (RA-MDS):
http://www1.se.cuhk.edu.hk/~textmine/dataset/ra-mds/


We are delighted to announce the release of a dataset for Reader-Aware
Multi-
Document Summarization. In the RA-MDS task, one should jointly consider
news
documents and news reader comments for generating the summaries.
This dataset contains 45 topics from those 6 predefined categories. Each
topic
contains 10 news documents and 4 model summaries. Definitely, this dataset
could also be used for conventional MDS. Moreover, the human-written
summaries
are abstractive, thus, it is also a good dataset for text generation.

This is a *HIGH-QUALITY* dataset. We put significant effort in preparing
the dataset
in the past three years, and spent tens of thousands US dollars.
During the preparation, we employed experts with particular background,
such as
Journalist and English Literature, to conduct the tasks of data collection,
aspect
annotation, and summary writing, as well as scrutinizing. Thus, we are
pretty
confident that this dataset will bring in interesting new things to the
researchers
in our community. Please enjoy the dataset, and forward us questions if
any.

Best,

-- 
Name: 李丕绩, Piji Li
Email: *p...@se.cuhk.edu.hk <p...@se.cuhk.edu.hk>*
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Dept. of Systems Engineering & Engineering Management,
Rm 711, William M. W. Mong Engineering Building,
The Chinese University of Hong Kong
Shatin, NT, Hong Kong
_______________________________________________
uai mailing list
uai@ENGR.ORST.EDU
https://secure.engr.oregonstate.edu/mailman/listinfo/uai

Reply via email to