Publication | Open Access
Revisiting Challenges in Data-to-Text Generation with Fact Grounding
34
Citations
32
References
2019
Year
Unknown Venue
Data-to-text generation models face challenges in ensuring data fidelity by referring to the correct input source. To inspire studies in this area, However, limited attempts have been made in this direction and the challenges remain. We observe a prominent bottleneck in the corpus where only about 60% of the summary contents can be grounded to the boxscore records. Such information deficiency tends to misguide a conditioned language model to produce unconditioned random facts and thus leads to factual hallucinations. In this work, we restore the information balance and revamp this task to focus on fact-grounded data-to-text generation. We introduce a purified and larger-scale dataset, RotoWire-FG (Fact-Grounding), with 50% more data from the year 2017-19 and enriched input tables, hoping to attract more research focuses in this direction. Moreover, we achieve improved data fidelity over the stateof-the-art models by integrating a new form of table reconstruction as an auxiliary task to boost the generation quality.
| Year | Citations | |
|---|---|---|
Page 1
Page 1