language modelling Latest Research Papers

Protein sequence alignment is a key component of most bioinformatics pipelines to study the structures and functions of proteins. Aligning highly divergent sequences remains, however, a difficult task that current algorithms often fail to perform accurately, leaving many proteins or open reading frames poorly annotated. Here, we leverage recent advances in deep learning for language modelling and differentiable programming to propose DEDAL, a flexible model to align protein sequences and detect homologs. DEDAL is a machine learning-based model that learns to align sequences by observing large datasets of raw protein sequences and of correct alignments. Once trained, we show that DEDAL improves by up to two- or three-fold the alignment correctness over existing methods on remote homologs, and better discriminates remote homologs from evolutionarily unrelated sequences, paving the way to improvements on many downstream tasks relying on sequence alignment in structural and functional genomics.

Download Full-text

MultiModal Language Modelling on Knowledge Graphs for Deep Video Understanding

10.1145/3474085.3479220 ◽

2021 ◽

Author(s):

Vishal Anand ◽

Raksha Ramesh ◽

Boshen Jin ◽

Ziyin Wang ◽

Xiaoxiao Lei ◽

...

Keyword(s):

Video Understanding ◽

Multimodal Language ◽

Language Modelling ◽

Knowledge Graphs

Download Full-text

The Zero Resource Speech Challenge 2021: Spoken Language Modelling

10.21437/interspeech.2021-1755 ◽

2021 ◽

Author(s):

Ewan Dunbar ◽

Mathieu Bernard ◽

Nicolas Hamilakis ◽

Tu Anh Nguyen ◽

Maureen de Seyssel ◽

...

Keyword(s):

Spoken Language ◽

Language Modelling

Download Full-text

PhonemeBERT: Joint Language Modelling of Phoneme Sequence and ASR Transcript

10.21437/interspeech.2021-1582 ◽

2021 ◽

Author(s):

Mukuntha Narayanan Sundararaman ◽

Ayush Kumar ◽

Jithendra Vepa

Keyword(s):

Language Modelling

Download Full-text

Aided language modelling, responsive communication and eye-gaze technology as communication intervention for adults with Rett syndrome: three experimental single case studies

Disability and Rehabilitation Assistive Technology ◽

10.1080/17483107.2021.1967469 ◽

2021 ◽

pp. 1-15

Author(s):

H. Wandin ◽

P. Lindberg ◽

K. Sonnander

Keyword(s):

Case Studies ◽

Rett Syndrome ◽

Single Case ◽

Eye Gaze ◽

Communication Intervention ◽

Language Modelling ◽

Responsive Communication

Download Full-text

Set-to-Sequence Methods in Machine Learning: A Review

Journal of Artificial Intelligence Research ◽

10.1613/jair.1.12839 ◽

2021 ◽

Vol 71 ◽

pp. 885-924

Author(s):

Mateusz Jurewicz ◽

Leon Derczynski

Keyword(s):

Machine Learning ◽

Representation Learning ◽

Qualitative Comparison ◽

Machine Learning Methods ◽

Strategy Games ◽

Meta Learning ◽

Grid Optimization ◽

Multi Agent ◽

Language Modelling ◽

Complex Target

Machine learning on sets towards sequential output is an important and ubiquitous task, with applications ranging from language modelling and meta-learning to multi-agent strategy games and power grid optimization. Combining elements of representation learning and structured prediction, its two primary challenges include obtaining a meaningful, permutation invariant set representation and subsequently utilizing this representation to output a complex target permutation. This paper provides a comprehensive introduction to the _eld as well as an overview of important machine learning methods tackling both of these key challenges, with a detailed qualitative comparison of selected model architectures.

Download Full-text