Assessing the Effect of Inconsistent Assessors on Summarization Evaluation

Karolina K. Owczarzak; Peter Rankel; Hoa T. Dang; John M. Conroy

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

PUBLICATIONS

Assessing the Effect of Inconsistent Assessors on Summarization Evaluation

Published

June 8, 2012

Author(s)

Karolina K. Owczarzak, Peter Rankel, Hoa T. Dang, John M. Conroy

Abstract

We investigate the consistency of human assessors involved in summarization evaluation to understand its effect on system ranking and automatic evaluation techniques. Using Text Analysis Conference data, we measure annotator consistency based on human scoring of summaries for Responsiveness, Readability, and Pyramid scoring. We identify inconsistencies in the data and measure to what extent these inconsistencies affect the ranking of automatic summarization systems. Finally, we examine the stability of automatic scoring metrics (ROUGE and CLASSY) with respect to the inconsistent assessments.

Proceedings Title

Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics

Conference Dates

July 8-14, 2012

Conference Location

Jeju, KR

Pub Type

Conferences

Download Paper

Local Download

Keywords

Evaluation, Summarization

Metrology and Data and informatics

Citation

Owczarzak, K. , Rankel, P. , Dang, H. and Conroy, J. (2012), Assessing the Effect of Inconsistent Assessors on Summarization Evaluation, Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, KR, [online], https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=911315 (Accessed December 3, 2024)

Issues

If you have any questions about this publication or are having problems accessing it, please contact reflib@nist.gov.

Created June 7, 2012, Updated October 12, 2021

Assessing the Effect of Inconsistent Assessors on Summarization Evaluation

Author(s)

Abstract

Download Paper

Keywords

Citation

Additional citation formats

Issues