Best practices for the human evaluation of automatically generated text
Currently, there is little agreement as to how Natural Language Generation (NLG) systems
should be evaluated. While there is some agreement regarding automatic metrics, there is a …
[PDF] Best practices for the human evaluation of automatically generated text
C van der Lee, A Gatt, E van Miltenburg, S Wubben… - researchgate.net
Currently, there is little agreement as to how Natural Language Generation (NLG) systems
should be evaluated, with a particularly high degree of variation in the way that human …
Best practices for the human evaluation of automatically generated text
C van der Lee, A Gatt… - 12th International …, 2019 - research.tilburguniversity.edu
Currently, there is little agreement as to how Natural Language Generation (NLG) systems
should be evaluated. While there is some agreement regarding automatic metrics, there is a …
[PDF] Best practices for the human evaluation of automatically generated text
C van der Lee, A Gatt, E van Miltenburg, S Wubben… - inlg2019.com
Currently, there is little agreement as to how Natural Language Generation (NLG) systems
should be evaluated, with a particularly high degree of variation in the way that human …