I summarise the kinds of evaluations that are needed for a structured data generation task.