scholarly journals Deep Graph Convolutional Encoders for Structured Data to Text Generation

Author(s):  
Diego Marcheggiani ◽  
Laura Perez-Beltrachini
2018 ◽  
Author(s):  
Longxu Dou ◽  
Guanghui Qin ◽  
Jinpeng Wang ◽  
Jin-Ge Yao ◽  
Chin-Yew Lin

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Yang Yang ◽  
Juan Cao ◽  
Yujun Wen ◽  
Pengzhou Zhang

AbstractGenerating fluent, coherent, and informative text from structured data is called table-to-text generation. Copying words from the table is a common method to solve the “out-of-vocabulary” problem, but it’s difficult to achieve accurate copying. In order to overcome this problem, we invent an auto-regressive framework based on the transformer that combines a copying mechanism and language modeling to generate target texts. Firstly, to make the model better learn the semantic relevance between table and text, we apply a word transformation method, which incorporates the field and position information into the target text to acquire the position of where to copy. Then we propose two auxiliary learning objectives, namely table-text constraint loss and copy loss. Table-text constraint loss is used to effectively model table inputs, whereas copy loss is exploited to precisely copy word fragments from a table. Furthermore, we improve the text search strategy to reduce the probability of generating incoherent and repetitive sentences. The model is verified by experiments on two datasets and better results are obtained than the baseline model. On WIKIBIO, the result is improved from 45.47 to 46.87 on BLEU and from 41.54 to 42.28 on ROUGE. On ROTOWIRE, the result is increased by 4.29% on CO metric, and 1.93 points higher on BLEU.


2019 ◽  
Vol 1 (1) ◽  
pp. 47-66 ◽  
Author(s):  
Stefanie Sirén-Heikel ◽  
Leo Leppänen ◽  
Carl-Gustav Lindén ◽  
Asta Bäck

AbstractNews automation is an emerging field within journalism, with the potential to transform newswork. Increasing access to data, combined with developing technology, will allow further inquiries into automated journalism. Producing news text using NLG (natural language generation) is currently largely undertaken in specific, predictable news domains, such as sports or finance. This interdisciplinary study investigates how elite media representatives from Finland, Europe and the US imagine the affordances of this emerging technology for their organization. Our analysis shows how the affordances of news automation are imagined as providing efficiency, increasing output and aiding in reallocating resources to pursue quality journalism. The affordances are, however, constrained by such factors as access to structured data, the quality of automation and a lack of relevant skills. In its current form, automated text generation is seen as providing only limited benefits to news organizations that are already imagining further possibilities of automation.


2021 ◽  
Author(s):  
Linyong Nan ◽  
Dragomir Radev ◽  
Rui Zhang ◽  
Amrit Rau ◽  
Abhinand Sivaprasad ◽  
...  

1994 ◽  
Vol 33 (05) ◽  
pp. 454-463 ◽  
Author(s):  
A. M. van Ginneken ◽  
J. van der Lei ◽  
J. H. van Bemmel ◽  
P. W. Moorman

Abstract:Clinical narratives in patient records are usually recorded in free text, limiting the use of this information for research, quality assessment, and decision support. This study focuses on the capture of clinical narratives in a structured format by supporting physicians with structured data entry (SDE). We analyzed and made explicit which requirements SDE should meet to be acceptable for the physician on the one hand, and generate unambiguous patient data on the other. Starting from these requirements, we found that in order to support SDE, the knowledge on which it is based needs to be made explicit: we refer to this knowledge as descriptional knowledge. We articulate the nature of this knowledge, and propose a model in which it can be formally represented. The model allows the construction of specific knowledge bases, each representing the knowledge needed to support SDE within a circumscribed domain. Data entry is made possible through a general entry program, of which the behavior is determined by a combination of user input and the content of the applicable domain knowledge base. We clarify how descriptional knowledge is represented, modeled, and used for data entry to achieve SDE, which meets the proposed requirements.


1992 ◽  
Vol 31 (04) ◽  
pp. 268-274 ◽  
Author(s):  
W. Gaus ◽  
J. G. Wechsler ◽  
P. Janowitz ◽  
J. Tudyka ◽  
W. Kratzer ◽  
...  

Abstract:A system using structured reporting of findings was developed for the preparation of medical reports and for clinical documentation purposes in upper abdominal sonography, and evaluated in the course of routine use. The evaluation focussed on the following parameters: completeness and correctness of the entered data, the proportion of free text, the validity and objectivity of the documentation, user acceptance, and time required. The completeness in the case of two clinically relevant parameters could be compared with an already existing database containing freely dictated reports. The results confirmed the hypothesis that, for the description of results of a technical examination, structured data reporting is a viable alternative to free-text dictation. For the application evaluated, there is even evidence of the superiority of a structured approach. The system can be put to use in related areas of application.


1996 ◽  
Vol 35 (03) ◽  
pp. 261-264 ◽  
Author(s):  
T. Schromm ◽  
T. Frankewitsch ◽  
M. Giehl ◽  
F. Keller ◽  
D. Zellner

Abstract:A pharmacokinetic database was constructed that is as free of errors as possible. Pharmacokinetic parameters were derived from the literature using a text-processing system and a database system. A random data sample from each system was compared with the original literature. The estimated error frequencies using statistical methods differed significantly between the two systems. The estimated error frequency in the text-processing system was 7.2%, that in the database system 2.7%. Compared with the original values in the literature, the estimated probability of error for identical pharmacokinetic parameters recorded in both systems is 2.4% and is not significantly different from the error frequency in the database. Parallel data entry with a text-processing system and a database system is, therefore, not significantly better than structured data entry for reducing the error frequency.


Sign in / Sign up

Export Citation Format

Share Document