Microcontroller Based Speech to Text Translation System

For the purpose of an effective communication, speech becomes a convenient conduit to convey messages as an important activity in human life. The need for better reception of voice messages between humans and the environment they interact with becomes a trend and an area of interest that needs to be studied for improvements. A speech to text translation system is an embedded based design that convert analogue signals particularly voice from an input into digital signals that a computer or any electronic device can understand and perform a required task or display the equivalent digital signal in text on a screen. Speech translation systems mitigates the bottlenecks to an efficient communication caused by other varieties of communication methods. Even though speech translation and recognition designs haven’t been well explored for electronic integration due to complexity and variation of sound signals from sources, this low cost, simple and portable project was incorporated to serve as a substratum and alternative in speech to text translation designs using microcontroller, in other to bridge the gaps in the world of human communications. This research paper addresses design methodology, limitations, recommendations and applications of the implemented speech to text translation system for improved communication reception.

Download Full-text

Bootstrapping a Statistical Speech Translator From a Rule-Based One

Linguistic Issues in Language Technology ◽

10.33011/lilt.v5i.1231 ◽

2012 ◽

Vol 5 ◽

Author(s):

Manny Rayner ◽

Pierrette Bouillon ◽

Paula Estrella ◽

Yukie Nakao ◽

Gwen Christian

Keyword(s):

Translation System ◽

Hybrid Architecture ◽

Considerable Effort ◽

Speech Translation ◽

Rule Based ◽

Source Language ◽

French And English ◽

Series Of Experiments ◽

Statistical Systems ◽

Translation Systems

We describe a series of experiments in which we start with English to French and English to Japanese versions of a rule-based speech translation system for a medical domain, and bootstrap corresponding statistical systems. Comparative evaluation reveals that the statistical systems are still slightly inferior to the rule-based ones, despite the fact that considerable effort has been invested in tuning both the recognition and translation components; however, a hybrid system is able to deliver a small but significant improvement in performance. In conclusion, we suggest that the hybrid architecture we describe potentially allows construction of limited-domain speech translation systems which combine substantial source-language coverage with high-precision translation.

Download Full-text

Signal processing of C-RTD Sensor output as the input to the instrument of low temperature monitoring using Arduino Uno Rev.3

Indonesian Review of Physics ◽

10.12928/irip.v1i2.809 ◽

2019 ◽

Vol 1 (2) ◽

pp. 48

Author(s):

Rismawan Rismawan ◽

Moh. Toifur

Keyword(s):

Output Signal ◽

Low Cost ◽

Digital Signal ◽

Analog Signal ◽

Digital Signals ◽

Temperature Detector ◽

Detector Output ◽

Coil Resistance ◽

Arduino Uno ◽

Value Changes

The C-RTD (Coil-Resistance Temperature Detector) output signal is an analog signal in the form of a direct voltage. This value changes with changes in RTD temperature. This analog signal can be read by users using a multimeter or similar device but does not directly indicate the RTD temperature. In order to obtain RTD temperature values, an additional device is required. In order to have a useful value and practicality, a device that can convert analog signals into values can be read directly by the user. The microcontroller was chosen as a used device. The selected microcontroller system is Arduino Uno because has been coupled with input and output ports so users only need to enter programs related to the system being created. In the other hand Arduino Uno by considering the low cost and practical. For the measurement system, the RTD output signal must be conditioned into a digital signal using the ADC so that it can be processed by the microcontroller. From testing instrument obtained that the system has been able to convert analog RTD signals into digital signals. The range of measurement is -176°C to 0°C with an accuracy of ± 0.20 / mV.

Download Full-text

Open Source Toolkit for Speech to Text Translation

Prague Bulletin of Mathematical Linguistics ◽

10.2478/pralin-2018-0011 ◽

2018 ◽

Vol 111 (1) ◽

pp. 125-135

Author(s):

Thomas Zenkel ◽

Matthias Sperber ◽

Jan Niehues ◽

Markus Müller ◽

Ngoc-Quan Pham ◽

...

Keyword(s):

Open Source ◽

Recognition System ◽

Translation System ◽

Speech Recognition System ◽

Speech Translation ◽

System A ◽

Sentence Segmentation ◽

Overall Performance ◽

Translation Systems ◽

Ted Talks

Abstract In this paper we introduce an open source toolkit for speech translation. While there already exists a wide variety of open source tools for the essential tasks of a speech translation system, our goal is to provide an easy to use recipe for the complete pipeline of translating speech. We provide a Docker container with a ready to use pipeline of the following components: a neural speech recognition system, a sentence segmentation system and an attention-based translation system. We provide recipes for training and evaluating models for the task of translating English lectures and TED talks to German. Additionally, we provide pre-trained models for this task. With this toolkit we hope to facilitate the development of speech translation systems and to encourage researchers to improve the overall performance of speech translation systems.

Download Full-text

An Investigation into Methodology and Metrics Employed to Evaluate the (Speech-to-Speech) Way in Translation Systems

Modern Applied Science ◽

10.5539/mas.v11n4p55 ◽

2017 ◽

Vol 11 (4) ◽

pp. 55

Author(s):

Parnyan Bahrami Dashtaki

Keyword(s):

Speech Recognition ◽

Machine Translation ◽

Automatic Speech Recognition ◽

Speech Synthesis ◽

Translation System ◽

Speech Translation ◽

Pattern Recognition Problem ◽

Finite State ◽

Training Examples ◽

Translation Systems

Speech-to-speech translation is a challenging problem, due to poor sentence planning typically associated with spontaneous speech, as well as errors caused by automatic speech recognition. Based upon a statistically trained speech translation system, in this study, we try to investigate methodologies and metrics employed to assess the (speech-to-speech) way in translation systems. The speech translation is performed incrementally based on generation of partial hypotheses from speech recognition. Speech-input translation can be properly approached as a pattern recognition problem by means of statistical alignment models and stochastic finite-state transducers. Under this general framework, some specific models are presented. One of the features of such models is their capability of automatically learning from training examples. The speech translation system consists of three modules: automatic speech recognition, machine translation and text to speech synthesis. Many procedures for incorporation of speech recognition and machine translation have been projected. In this research, we want explore methodologies and metrics employed to assess the (speech-to-speech) way in translation systems.

Download Full-text

Data Augmentation for Pipeline-Based Speech Translation

Frontiers in Artificial Intelligence and Applications - Human Language Technologies – The Baltic Perspective ◽

10.3233/faia200605 ◽

2020 ◽

Author(s):

Diego Alves ◽

Askars Salimbajevs ◽

Mārcis Pinnis

Keyword(s):

Speech Processing ◽

Data Augmentation ◽

System Development ◽

Recognition System ◽

Translation System ◽

Speech Translation ◽

Baseline System ◽

System Output ◽

Translation Methods ◽

Translation Systems

Pipeline-based speech translation methods may suffer from errors found in speech recognition system output. Therefore, it is crucial that machine translation systems are trained to be robust against such noise. In this paper, we propose two methods for parallel data augmentation for pipeline-based speech translation system development. The first method utilises a speech processing workflow to introduce errors and the second method generates commonly found suffix errors using a rule-based method. We show that the methods in combination allow significantly improving speech translation quality by 1.87 BLEU points over a baseline system.

Download Full-text

An automatic speech translation system for travel conversation

10.3115/1289189.1289241 ◽

2002 ◽

Cited By ~ 1

Author(s):

kitoshi Okumura ◽

Ken-ichi Iso ◽

Shin-ichi Doi ◽

Kiyoshi Yamabana ◽

Ken Hanazawa ◽

...

Keyword(s):

Translation System ◽

Speech Translation

Download Full-text

A 40 Mb/s VLC System Reusing an Existing Large LED Panel in an Indoor Office Environment

Sensors ◽

10.3390/s21051697 ◽

2021 ◽

Vol 21 (5) ◽

pp. 1697

Author(s):

Xicong Li ◽

Zabih Ghassemlooy ◽

Stanislav Zvánovec ◽

Paul Anthony Haigh

Keyword(s):

Modulation Depth ◽

Light Emitting Diode ◽

Low Cost ◽

Data Communication ◽

Experimental System ◽

Digital Signal ◽

Visible Light Communication ◽

Cost Driver ◽

Limiting Factors ◽

Light Sources

With advances in solid-state lighting, visible light communication (VLC) has emerged as a promising technology to enhance existing light-emitting diode (LED)-based lighting infrastructure by adding data communication capabilities to the illumination functionality. The last decade has witnessed the evolution of the VLC concept through global standardisation and product launches. Deploying VLC systems typically requires replacing existing light sources with new luminaires that are equipped with data communication functionality. To save the investment, it is clearly desirable to make the most of the existing illumination systems. This paper investigates the feasibility of adding data communication functionality to the existing lighting infrastructure. We do this by designing an experimental system in an indoor environment based on an off-the-shelf LED panel typically used in office environments, with the dimensions of 60 × 60 cm2. With minor modifications, the VLC function is implemented, and all of the modules of the LED panel are fully reused. A data rate of 40 Mb/s is supported at a distance of up to 2 m while using the multi-band carrierless amplitude and phase (CAP) modulation. Two main limiting factors for achieving higher data rates are observed. The first factor is the limited bandwidth of the LED string inside the panel. The second is the flicker due to the residual ripple of the bias current that is generated by the panel’s driver. Flicker is introduced by the low-cost driver, which provides bias currents that fluctuate in the low frequency range (less than several kilohertz). This significantly reduces the transmitter’s modulation depth. Concurrently, the driver can also introduce an effect that is similar to baseline wander at the receiver if the flicker is not completely filtered out. We also proposed a solution based on digital signal processing (DSP) to mitigate the flicker issue at the receiver side and its effectiveness has been confirmed.

Download Full-text

Evaluation of 2-way Iraqi Arabic–English speech translation systems using automated metrics

Machine Translation ◽

10.1007/s10590-011-9105-x ◽

2011 ◽

Vol 26 (1-2) ◽

pp. 159-176 ◽

Cited By ~ 3

Author(s):

Sherri Condon ◽

Mark Arehart ◽

Dan Parvaz ◽

Gregory Sanders ◽

Christy Doran ◽

...

Keyword(s):

Speech Translation ◽

Translation Systems

Download Full-text

Methods of Respiratory Virus Detection: Advances towards Point-of-Care for Early Intervention

Micromachines ◽

10.3390/mi12060697 ◽

2021 ◽

Vol 12 (6) ◽

pp. 697

Author(s):

Siming Lu ◽

Sha Lin ◽

Hongrui Zhang ◽

Liguo Liang ◽

Shien Shen

Keyword(s):

Viral Load ◽

Respiratory Virus ◽

Viral Infections ◽

Point Of Care ◽

Human Life ◽

Low Cost ◽

Virus Detection ◽

Frequent Monitoring ◽

User Friendly ◽

New Strategies

Respiratory viral infections threaten human life and inflict an enormous healthcare burden worldwide. Frequent monitoring of viral antibodies and viral load can effectively help to control the spread of the virus and make timely interventions. However, current methods for detecting viral load require dedicated personnel and are time-consuming. Additionally, COVID-19 detection is generally relied on an automated PCR analyzer, which is highly instrument-dependent and expensive. As such, emerging technologies in the development of respiratory viral load assays for point-of-care (POC) testing are urgently needed for viral screening. Recent advances in loop-mediated isothermal amplification (LAMP), biosensors, nanotechnology-based paper strips and microfluidics offer new strategies to develop a rapid, low-cost, and user-friendly respiratory viral monitoring platform. In this review, we summarized the traditional methods in respiratory virus detection and present the state-of-art technologies in the monitoring of respiratory virus at POC.

Download Full-text

DESIGNING OF ALL-OPTICAL TRI-STATE LOGIC SYSTEM WITH THE HELP OF OPTICAL NONLINEAR MATERIAL

Journal of Nonlinear Optical Physics & Materials ◽

10.1142/s0218863508004159 ◽

2008 ◽

Vol 17 (03) ◽

pp. 315-328 ◽

Cited By ~ 23

Author(s):

TANAY CHATTOPADHYAY ◽

GOUTAM KUMAR MAITY ◽

JITENDRA NATH ROY

Keyword(s):

High Speed ◽

Optical Switch ◽

Digital Signal ◽

Optical Signal ◽

Nonlinear Material ◽

Digital Signals ◽

Photonic Networks ◽

Optical Nonlinear Material ◽

All Optical ◽

Logic Operations

Nonlinear optics has been of increased interest for all-optical signal, data and image processing in high speed photonic networks. The application of multi-valued (nonbinary) digital signals can provide considerable relief in transmission, storage and processing of a large amount of information in digital signal processing. Here, we propose the design of an all-optical system for some basic tri-state logic operations (trinary OR, trinary AND, trinary XOR, Inverter, Truth detector, False detector) which exploits the polarization properties of light. Nonlinear material based optical switch can play an important role. Tri-state logic can play a significant role towards carry and borrow free arithmetic operations. The principles and possibilities of the design of nonlinear material based tri-state logic circuits are proposed and described.

Download Full-text