An open-source end-to-end ASR system for Brazilian Portuguese using DNNs built from newly assembled corpora

With the rapid development of speech assistants, adapting server-intended automatic speech recognition (ASR) solutions to a direct device has become crucial. For on-device speech recognition tasks, researchers and industry prefer end-to-end ASR systems as they can be made resource-efficient while maintaining a higher quality compared to hybrid systems. However, building end-to-end models requires a significant amount of speech data. Personalization, which is mainly handling out-of-vocabulary (OOV) words, is another challenging task associated with speech assistants. In this work, we consider building an effective end-to-end ASR system in low-resource setups with a high OOV rate, embodied in Babel Turkish and Babel Georgian tasks. We propose a method of dynamic acoustic unit augmentation based on the Byte Pair Encoding with dropout (BPE-dropout) technique. The method non-deterministically tokenizes utterances to extend the token’s contexts and to regularize their distribution for the model’s recognition of unseen words. It also reduces the need for optimal subword vocabulary size search. The technique provides a steady improvement in regular and personalized (OOV-oriented) speech recognition tasks (at least 6% relative word error rate (WER) and 25% relative F-score) at no additional computational cost. Owing to the BPE-dropout use, our monolingual Turkish Conformer has achieved a competitive result with 22.2% character error rate (CER) and 38.9% WER, which is close to the best published multilingual system.

Download Full-text

End-To-End Computer Vision Framework: An Open-Source Platform for Research and Education

Sensors ◽

10.3390/s21113691 ◽

2021 ◽

Vol 21 (11) ◽

pp. 3691

Author(s):

Ciprian Orhei ◽

Silviu Vert ◽

Muguras Mocofan ◽

Radu Vasiu

Keyword(s):

Machine Learning ◽

Image Processing ◽

Computer Vision ◽

Open Source ◽

Visual Processing ◽

Research Field ◽

Learning Models ◽

Research Activity ◽

End To End ◽

Machine Learning Models

Computer Vision is a cross-research field with the main purpose of understanding the surrounding environment as closely as possible to human perception. The image processing systems is continuously growing and expanding into more complex systems, usually tailored to the certain needs or applications it may serve. To better serve this purpose, research on the architecture and design of such systems is also important. We present the End-to-End Computer Vision Framework, an open-source solution that aims to support researchers and teachers within the image processing vast field. The framework has incorporated Computer Vision features and Machine Learning models that researchers can use. In the continuous need to add new Computer Vision algorithms for a day-to-day research activity, our proposed framework has an advantage given by the configurable and scalar architecture. Even if the main focus of the framework is on the Computer Vision processing pipeline, the framework offers solutions to incorporate even more complex activities, such as training Machine Learning models. EECVF aims to become a useful tool for learning activities in the Computer Vision field, as it allows the learner and the teacher to handle only the topics at hand, and not the interconnection necessary for visual processing flow.

Download Full-text

Open-source RTP Library for End-to-End Encrypted Real-Time Video Streaming Applications

10.1109/ism52913.2021.00023 ◽

2021 ◽

Author(s):

Joni Rasanen ◽

Aaro Altonen ◽

Alexandre Mercat ◽

Jarno Vanne

Keyword(s):

Open Source ◽

Real Time ◽

Video Streaming ◽

Streaming Applications ◽

End To End

Download Full-text

Demo Abstract: 5G End-to-End Open Source Network with Traffic Prioritization Mechanism

IEEE INFOCOM 2019 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS) ◽

10.1109/infcomw.2019.8845291 ◽

2019 ◽

Author(s):

Mohamad Yassin ◽

William Diego ◽

Sofiane Imadali

Keyword(s):

Open Source ◽

End To End ◽

Traffic Prioritization

Download Full-text

End-to-end framework for fault management for open source clusters

Proceedings of the 2010 TeraGrid Conference on - TG '10 ◽

10.1145/1838574.1838583 ◽

2010 ◽

Cited By ~ 14

Author(s):

John L. Hammond ◽

Tommy Minyard ◽

Jim Browne

Keyword(s):

Open Source ◽

Fault Management ◽

End To End

Download Full-text

OpenCyto: An Open Source Infrastructure for Scalable, Robust, Reproducible, and Automated, End-to-End Flow Cytometry Data Analysis

PLoS Computational Biology ◽

10.1371/journal.pcbi.1003806 ◽

2014 ◽

Vol 10 (8) ◽

pp. e1003806 ◽

Cited By ~ 100

Author(s):

Greg Finak ◽

Jacob Frelinger ◽

Wenxin Jiang ◽

Evan W. Newell ◽

John Ramey ◽

...

Keyword(s):

Flow Cytometry ◽

Data Analysis ◽

Open Source ◽

Flow Cytometry Data ◽

End To End

Download Full-text

TiEMPO: Open-source time-dependent end-to-end model for simulating ground-based submillimeter astronomical observations

Millimeter, Submillimeter, and Far-Infrared Detectors and Instrumentation for Astronomy X ◽

10.1117/12.2561014 ◽

2020 ◽

Author(s):

Esmee Huijten ◽

Yannick Roelvink ◽

Stefanie A. Brackenhoff ◽

Akio Taniguchi ◽

Tom Bakx ◽

...

Keyword(s):

Open Source ◽

Time Dependent ◽

Astronomical Observations ◽

End To End

Download Full-text

Espnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp40776.2020.9053512 ◽

2020 ◽

Cited By ~ 5

Author(s):

Tomoki Hayashi ◽

Ryuichi Yamamoto ◽

Katsuki Inoue ◽

Takenori Yoshimura ◽

Shinji Watanabe ◽

...

Keyword(s):

Open Source ◽

Text To Speech ◽

End To End

Download Full-text

Streaming On-Device End-to-End ASR System for Privacy-Sensitive Voice-Typing

10.21437/interspeech.2020-3172 ◽

2020 ◽

Author(s):

Abhinav Garg ◽

Gowtham P. Vadisetti ◽

Dhananjaya Gowda ◽

Sichen Jin ◽

Aditya Jayasimha ◽

...

Keyword(s):

End To End ◽

Asr System

Download Full-text

MobiGlam

Virtual Learning Environments ◽

10.4018/978-1-4666-0011-9.ch209 ◽

2012 ◽

pp. 333-352

Author(s):

Fatma Meawad ◽

Geneen Stubbs

Keyword(s):

Open Source ◽

Learning Environments ◽

Mobile Devices ◽

Mobile Learning ◽

Real World ◽

Virtual Learning Environments ◽

Learning Activities ◽

Wide Acceptance ◽

End To End ◽

Institutional Challenges

This chapter discusses the principles underpinning the design and the development of a framework, MobiGlam, which supports ubiquitous and scalable access to learning activities. The framework allows full end to end interconnectivity among open source virtual learning environments (VLEs) and Java-enabled mobile devices. Through this framework, interoperability and adaptivity techniques are combined to address the technical, pedagogical, and institutional challenges of mobile learning. The discussed framework achieved a level of flexibility and simplicity that resulted in a wide acceptance of the framework institutionally, allowing its use in various real world settings.

Download Full-text