Braggaar, A., Tomas, F., Blomsma, P., Hommes, S., Braun, N., van Miltenburg, E., van der Lee, C., Goudbeek, M., & Krahmer, E. (2022). A reproduction study of methods for evaluating dialogue system output: Replicating Santhanam and Shaikh (2019). In Proceedings of the 15th International Conference on Natural Language Generation: Generation Challenges (pp. 86-93). Association for Computational Linguistics. https://aclanthology.org/2022.inlg-genchal.13