Toward Better Storylines with Sentence-Level Language Models

Author(s): Daphne Ippolito, David Grangier, Douglas Eck, Chris Callison-Burch


Generating Sentences by Editing Prototypes

Author(s): Kelvin Guu, Tatsunori B. Hashimoto, Yonatan Oren, Percy Liang

We propose a new generative language model for sentences that first samples a prototype sentence from the training corpus and then edits it into a new sentence. Compared to traditional language models that generate from scratch either left-to-right or by first sampling a latent sentence vector, our prototype-then-edit model improves perplexity on language modeling and generates higher quality outputs according to human evaluation. Furthermore, the model gives rise to a latent edit vector that captures interpretable semantics such as sentence similarity and sentence-level analogies.
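
To make the two-step sampling procedure concrete, here is a minimal Python sketch of prototype-then-edit generation. The StubEditor class and the Gaussian stand-in prior are placeholders for illustration only, not the authors' released model, which uses a neural sequence-to-sequence editor and a latent edit vector learned from data.

    import random

    class StubEditor:
        """Placeholder editor; the real model conditions a neural seq2seq
        decoder on the prototype sentence and a latent edit vector z."""
        def decode(self, prototype, z):
            # Toy 'edit': shuffle the prototype's words (illustration only).
            words = prototype.split()
            random.shuffle(words)
            return " ".join(words)

    def sample_edit_vector(dim=128):
        """Draw a latent edit vector z from a simple stand-in prior."""
        return [random.gauss(0.0, 1.0) for _ in range(dim)]

    def generate_sentence(corpus, editor):
        """Prototype-then-edit generation: sample a prototype, then edit it."""
        prototype = random.choice(corpus)    # step 1: sample a prototype sentence
        z = sample_edit_vector()             # step 2: sample a latent edit vector
        return editor.decode(prototype, z)   # step 3: decode the edited sentence

    corpus = ["the service was friendly but slow", "the pasta was overcooked"]
    print(generate_sentence(corpus, StubEditor()))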


2014, Vol 102 (1), pp. 5-16
Author(s): Eleftherios Avramidis, Lukas Poustka, Sven Schmeier

Abstract: “Qualitative” is a Python toolkit for ranking and selecting sentence-level output from different MT systems using Quality Estimation. The toolkit implements a basic pipeline for annotating the given sentences with black-box features. It then applies a machine learning mechanism to rank the data with models pre-trained on human preferences. The preprocessing pipeline includes support for language models, PCFG parsing, language-checking tools, and various other pre-processors and feature generators. The code follows the principles of object-oriented programming to allow modularity and extensibility. The tool can operate on both batch files and single sentences. An XML-RPC interface is provided for integration with web services, and a graphical, animated web-based interface demonstrates its potential online use.
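
As a rough illustration of such a pipeline (feature annotation followed by preference-based ranking), the sketch below uses invented names (blackbox_features, StubRanker) rather than the toolkit's actual classes; it only shows the shape of the workflow.

    class StubRanker:
        """Stand-in for a model pre-trained on human preference judgements."""
        def predict(self, features):
            # Toy score: prefer translations whose length ratio is closest to 1.
            return -abs(features[0] - 1.0)

    def blackbox_features(source, translation):
        """Toy black-box features; the real pipeline adds LM scores, PCFG parses,
        language-checker output and other feature generators."""
        src_len = max(len(source.split()), 1)
        tgt_tokens = translation.split()
        return [len(tgt_tokens) / src_len,
                sum(len(w) for w in tgt_tokens) / max(len(tgt_tokens), 1)]

    def rank_outputs(source, translations, ranker):
        """Rank sentence-level MT outputs by estimated quality (best first)."""
        scored = [(ranker.predict(blackbox_features(source, t)), t) for t in translations]
        return [t for _, t in sorted(scored, key=lambda pair: pair[0], reverse=True)]

    outputs = ["the cat sits on the mat", "cat mat sits the on on the"]
    print(rank_outputs("die Katze sitzt auf der Matte", outputs, StubRanker()))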


2020, Vol 34 (10), pp. 13917-13918
Author(s): Dean L. Slack, Mariann Hardey, Noura Al Moubayed

Contextual word embeddings produced by neural language models, such as BERT or ELMo, have seen widespread application and performance gains across many Natural Language Processing tasks, suggesting rich linguistic features encoded in their representations. This work aims to investigate to what extent any linguistic hierarchical information is encoded into a single contextual embedding. Using labelled constituency trees, we train simple linear classifiers on top of single contextualised word representations for ancestor sentiment analysis tasks at multiple constituency levels of a sentence. To assess the presence of hierarchical information throughout the networks, the linear classifiers are trained using representations produced by each intermediate layer of BERT and ELMo variants. We show that with no fine-tuning, a single contextualised representation encodes enough syntactic and semantic sentence-level information to significantly outperform a non-contextual baseline for classifying 5-class sentiment of its ancestor constituents at multiple levels of the constituency tree. Additionally, we show that both LSTM and transformer architectures trained on similarly sized datasets achieve similar levels of performance on these tasks. Future work looks to expand the analysis to a wider range of NLP tasks and contextualisers.
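
A hedged sketch of this probing setup with Hugging Face Transformers and scikit-learn: freeze the contextualiser, take one word's representation from a chosen intermediate layer, and fit a linear classifier on top. The layer index, the toy training sentences, and their 5-class labels are placeholders, and sub-word alignment is handled only crudely.

    import torch
    from transformers import AutoModel, AutoTokenizer
    from sklearn.linear_model import LogisticRegression

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
    encoder = AutoModel.from_pretrained("bert-base-uncased", output_hidden_states=True)
    encoder.eval()

    def word_representation(sentence, token_index, layer=8):
        """Frozen contextual embedding of one token from an intermediate layer."""
        enc = tokenizer(sentence, return_tensors="pt")
        with torch.no_grad():
            hidden = encoder(**enc).hidden_states[layer][0]  # (num_wordpieces, 768)
        return hidden[token_index + 1].numpy()               # +1 skips [CLS]; crude alignment

    # Toy probing data: (sentence, token position, 5-class ancestor sentiment label)
    examples = [("a thoroughly enjoyable film", 1, 4),
                ("a dull and lifeless plot", 1, 0)]
    X = [word_representation(s, i) for s, i, _ in examples]
    y = [label for _, _, label in examples]
    probe = LogisticRegression(max_iter=1000).fit(X, y)      # the linear probe
    print(probe.predict([word_representation("a surprisingly moving story", 2)]))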


2020
Author(s): Hongchao Fang, Pengtao Xie

Pretrained language models such as BERT and GPT have shown great effectiveness in language understanding. However, the auxiliary predictive tasks in existing pretraining approaches are mostly defined on tokens and thus may not capture sentence-level semantics well. To address this issue, we propose CERT: Contrastive self-supervised Encoder Representations from Transformers, which pretrains language representation models using contrastive self-supervised learning at the sentence level. CERT creates augmentations of original sentences using back-translation. It then finetunes a pretrained language encoder (e.g., BERT) by predicting whether two augmented sentences originate from the same sentence. CERT is simple to use and can be flexibly plugged into any pretraining-finetuning NLP pipeline. We evaluate CERT on three language understanding tasks: CoLA, RTE, and QNLI. CERT outperforms BERT significantly.
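
The sentence-pair objective described above can be sketched as follows. This is a simplified reading of the abstract (binary same-origin prediction over back-translated pairs), not the authors' exact training code, and back_translate is a stub that a real pipeline would replace with round-trip machine translation.

    import random
    import torch
    from transformers import AutoTokenizer, AutoModelForSequenceClassification

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
    # Binary head: do the two augmented sentences come from the same original sentence?
    model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)

    def back_translate(sentence):
        """Stub augmentation; CERT uses round-trip translation through another language."""
        return sentence

    def make_pairs(sentences):
        pairs, labels = [], []
        for s in sentences:
            pairs.append((back_translate(s), back_translate(s)))      # same origin
            labels.append(1)
            other = random.choice([t for t in sentences if t != s])
            pairs.append((back_translate(s), back_translate(other)))  # different origin
            labels.append(0)
        return pairs, labels

    sentences = ["the movie was fun", "the court denied the appeal", "rain is expected today"]
    pairs, labels = make_pairs(sentences)
    batch = tokenizer([a for a, _ in pairs], [b for _, b in pairs],
                      padding=True, truncation=True, return_tensors="pt")
    loss = model(**batch, labels=torch.tensor(labels)).loss   # one training step's loss
    loss.backward()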


2021, Vol 22 (1)
Author(s): Morteza Pourreza Shahri, Indika Kahanda

Abstract: Background: Identifying human protein-phenotype relationships has attracted researchers in bioinformatics and biomedical natural language processing because of its importance in uncovering rare and complex diseases. Since experimental validation of protein-phenotype associations is prohibitively expensive, automated tools capable of accurately extracting these associations from biomedical text are in high demand. However, while the manual annotation of protein-phenotype co-mentions required for training such models is highly resource-consuming, extracting millions of unlabeled co-mentions is straightforward. Results: In this study, we propose a novel deep semi-supervised ensemble framework that combines deep neural networks with semi-supervised and ensemble learning for classifying human protein-phenotype co-mentions with the help of unlabeled data. This framework makes it possible to combine an extensive collection of unlabeled sentence-level co-mentions of human proteins and phenotypes with a small labeled dataset to enhance overall performance. We develop PPPredSS, a prototype of our proposed semi-supervised framework that combines sophisticated language models, convolutional networks, and recurrent networks. Our experimental results demonstrate that the proposed approach provides new state-of-the-art performance in classifying human protein-phenotype co-mentions, outperforming other supervised and semi-supervised counterparts. Furthermore, we highlight the utility of PPPredSS in powering a curation assistant system through case studies involving a group of biologists. Conclusions: This article presents a novel approach to human protein-phenotype co-mention classification based on deep, semi-supervised, and ensemble learning. The insights and findings from this work have implications for biomedical researchers, biocurators, and the text mining community working on biomedical relationship extraction.
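
For readers unfamiliar with the general recipe, the sketch below shows plain ensemble self-training: pseudo-labelling confident unlabeled co-mentions and retraining. It is a generic illustration with a stub featurizer, not PPPredSS itself, which instead ensembles language-model, convolutional, and recurrent encoders.

    import numpy as np
    from sklearn.ensemble import VotingClassifier
    from sklearn.linear_model import LogisticRegression
    from sklearn.svm import SVC

    def featurize(sentences):
        """Stub features; the real system encodes sentences with neural models."""
        return np.array([[len(s.split()), s.lower().count("phenotype")]
                         for s in sentences], dtype=float)

    def self_train(labeled, labels, unlabeled, rounds=3, threshold=0.9):
        """Ensemble self-training over sentence-level co-mentions."""
        X, y = featurize(labeled), np.array(labels)
        pool = list(unlabeled)
        ensemble = VotingClassifier(
            [("lr", LogisticRegression(max_iter=1000)), ("svm", SVC(probability=True))],
            voting="soft")
        for _ in range(rounds):
            ensemble.fit(X, y)
            if not pool:
                break
            proba = ensemble.predict_proba(featurize(pool))
            confident = proba.max(axis=1) >= threshold          # high-confidence pseudo-labels
            if not confident.any():
                break
            X = np.vstack([X, featurize(pool)[confident]])
            y = np.concatenate([y, ensemble.classes_[proba.argmax(axis=1)][confident]])
            pool = [s for s, keep in zip(pool, confident) if not keep]
        return ensemble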


2020, Vol 10 (10), pp. 3386
Author(s): Krzysztof Fiok, Waldemar Karwowski, Edgar Gutierrez, Mohammad Reza-Davahli

After the advent of GloVe and word2vec, the dynamic development of language models (LMs) used to generate word embeddings has enabled the creation of better text classifier frameworks. With the vector representations of words generated by newer LMs, embeddings are no longer static but context-aware. However, the quality of results provided by state-of-the-art LMs comes at the price of speed. Our goal was to present a benchmark providing insight into the speed–quality trade-off of a sentence classifier framework based on word embeddings provided by selected LMs. We used a recurrent neural network with gated recurrent units to create sentence-level vector representations from the word embeddings provided by an LM, and a single fully connected layer for classification. Benchmarking was performed on two sentence classification data sets: the Sixth Text REtrieval Conference (TREC-6) set and a 1000-sentence data set of our own design. Our Monte Carlo cross-validated results on these two data sources demonstrated that the newest deep learning LMs provided improvements over GloVe and FastText in terms of weighted Matthews correlation coefficient (MCC) scores. We postulate that progress in LMs is more apparent when more difficult classification tasks are addressed.
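
A minimal sketch of that classifier head: a GRU over pre-computed word embeddings (whichever LM supplies them) followed by a single fully connected layer, evaluated with the Matthews correlation coefficient. The random tensors stand in for real LM embeddings, and the six classes mirror TREC-6.

    import torch
    import torch.nn as nn
    from sklearn.metrics import matthews_corrcoef

    class SentenceClassifier(nn.Module):
        """GRU over word embeddings from an upstream LM, then one linear layer."""
        def __init__(self, embed_dim=300, hidden=128, num_classes=6):
            super().__init__()
            self.gru = nn.GRU(embed_dim, hidden, batch_first=True)
            self.fc = nn.Linear(hidden, num_classes)

        def forward(self, embeddings):               # (batch, seq_len, embed_dim)
            _, last_hidden = self.gru(embeddings)    # (1, batch, hidden)
            return self.fc(last_hidden.squeeze(0))   # (batch, num_classes)

    # Placeholder batch: in the benchmark these vectors come from GloVe, FastText,
    # or a deep contextual LM rather than torch.randn.
    embeddings = torch.randn(8, 12, 300)             # 8 sentences, 12 tokens, 300-d
    labels = torch.randint(0, 6, (8,))               # TREC-6 has six coarse classes
    model = SentenceClassifier()
    logits = model(embeddings)
    nn.functional.cross_entropy(logits, labels).backward()
    print("MCC:", matthews_corrcoef(labels.numpy(), logits.argmax(dim=1).numpy()))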


2020, Vol 63 (7), pp. 2281-2292
Author(s): Ying Zhao, Xinchun Wu, Hongjun Chen, Peng Sun, Ruibo Xie, ...

Purpose: This exploratory study aimed to investigate the potential impact of sentence-level comprehension and sentence-level fluency on passage comprehension of deaf students in elementary school. Method: A total of 159 deaf students, 65 students (mean age = 13.46 years) in Grades 3 and 4 and 94 students (mean age = 14.95 years) in Grades 5 and 6, were assessed for nonverbal intelligence, vocabulary knowledge, sentence-level comprehension, sentence-level fluency, and passage comprehension. Group differences were examined using t tests, whereas the predictive and mediating mechanisms were examined using regression modeling. Results: The regression analyses showed that the effect of sentence-level comprehension on passage comprehension was not significant, whereas sentence-level fluency was an independent predictor in Grades 3–4. Sentence-level comprehension and fluency contributed significant variance to passage comprehension in Grades 5–6. Sentence-level fluency fully mediated the influence of sentence-level comprehension on passage comprehension in Grades 3–4, playing a partial mediating role in Grades 5–6. Conclusions: The relative contributions of sentence-level comprehension and fluency to deaf students' passage comprehension varied, and sentence-level fluency mediated the relationship between sentence-level comprehension and passage comprehension.
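
The mediation pattern reported here follows a standard regression-based analysis. The sketch below illustrates that procedure on simulated scores with statsmodels; it uses made-up coefficients and is not the study's data or exact model.

    import numpy as np
    import statsmodels.api as sm

    rng = np.random.default_rng(0)
    n = 159                                            # matches the study's sample size
    sent_comp = rng.normal(size=n)                     # sentence-level comprehension (simulated)
    sent_flu = 0.6 * sent_comp + rng.normal(scale=0.8, size=n)                # fluency
    passage = 0.1 * sent_comp + 0.5 * sent_flu + rng.normal(scale=0.8, size=n)

    def ols(y, *predictors):
        """Ordinary least squares with an intercept."""
        return sm.OLS(y, sm.add_constant(np.column_stack(predictors))).fit()

    total = ols(passage, sent_comp)                    # path c: comprehension -> passage
    a_path = ols(sent_flu, sent_comp)                  # path a: comprehension -> fluency
    direct = ols(passage, sent_comp, sent_flu)         # paths c' and b (fluency controlled)
    print("total effect c  :", round(total.params[1], 3))
    print("direct effect c':", round(direct.params[1], 3))
    print("indirect a*b    :", round(a_path.params[1] * direct.params[2], 3))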

