scholarly journals Microcontroller Based Speech to Text Translation System

2019 ◽  
Vol 4 (12) ◽  
pp. 149-154
Author(s):  
Victor Sorochi Uko ◽  
Gachada Benard Dubukumah ◽  
Ibrahim Kafayat Ayosubomi

For the purpose of an effective communication, speech becomes a convenient conduit to convey messages as an important activity in human life. The need for better reception of voice messages between humans and the environment they interact with becomes a trend and an area of interest that needs to be studied for improvements. A speech to text translation system is an embedded based design that convert analogue signals particularly voice from an input into digital signals that a computer or any electronic device can understand and perform a required task or display the equivalent digital signal in text on a screen. Speech translation systems mitigates the bottlenecks to an efficient communication caused by other varieties of communication methods. Even though speech translation and recognition designs haven’t been well explored for electronic integration due to complexity and variation of sound signals from sources, this low cost, simple and portable project was incorporated to serve as a substratum and alternative in speech to text translation designs using microcontroller, in other to bridge the gaps in the world of human communications. This research paper addresses design methodology, limitations, recommendations and applications of the implemented speech to text translation system for improved communication reception.

2012 ◽  
Vol 5 ◽  
Author(s):  
Manny Rayner ◽  
Pierrette Bouillon ◽  
Paula Estrella ◽  
Yukie Nakao ◽  
Gwen Christian

We describe a series of experiments in which we start with English to French and English to Japanese versions of a rule-based speech translation system for a medical domain, and bootstrap corresponding statistical systems. Comparative evaluation reveals that the statistical systems are still slightly inferior to the rule-based ones, despite the fact that considerable effort has been invested in tuning both the recognition and translation components; however, a hybrid system is able to deliver a small but significant improvement in performance. In conclusion, we suggest that the hybrid architecture we describe potentially allows construction of limited-domain speech translation systems which combine substantial source-language coverage with high-precision translation.


2019 ◽  
Vol 1 (2) ◽  
pp. 48
Author(s):  
Rismawan Rismawan ◽  
Moh. Toifur

The C-RTD (Coil-Resistance Temperature Detector) output signal is an analog signal in the form of a direct voltage. This value changes with changes in RTD temperature. This analog signal can be read by users using a multimeter or similar device but does not directly indicate the RTD temperature. In order to obtain RTD temperature values, an additional device is required. In order to have a useful value and practicality, a device that can convert analog signals into values can be read directly by the user. The microcontroller was chosen as a used device. The selected microcontroller system is Arduino Uno because has been coupled with input and output ports so users only need to enter programs related to the system being created. In the other hand Arduino Uno by considering the low cost and practical. For the measurement system, the RTD output signal must be conditioned into a digital signal using the ADC so that it can be processed by the microcontroller. From testing instrument obtained that the system has been able to convert analog RTD signals into digital signals. The range of measurement is -176°C to  0°C with an accuracy of ± 0.20 / mV. 


2018 ◽  
Vol 111 (1) ◽  
pp. 125-135
Author(s):  
Thomas Zenkel ◽  
Matthias Sperber ◽  
Jan Niehues ◽  
Markus Müller ◽  
Ngoc-Quan Pham ◽  
...  

Abstract In this paper we introduce an open source toolkit for speech translation. While there already exists a wide variety of open source tools for the essential tasks of a speech translation system, our goal is to provide an easy to use recipe for the complete pipeline of translating speech. We provide a Docker container with a ready to use pipeline of the following components: a neural speech recognition system, a sentence segmentation system and an attention-based translation system. We provide recipes for training and evaluating models for the task of translating English lectures and TED talks to German. Additionally, we provide pre-trained models for this task. With this toolkit we hope to facilitate the development of speech translation systems and to encourage researchers to improve the overall performance of speech translation systems.


2017 ◽  
Vol 11 (4) ◽  
pp. 55
Author(s):  
Parnyan Bahrami Dashtaki

Speech-to-speech translation is a challenging problem, due to poor sentence planning typically associated with spontaneous speech, as well as errors caused by automatic speech recognition. Based upon a statistically trained speech translation system, in this study, we try to investigate methodologies and metrics employed to assess the (speech-to-speech) way in translation systems. The speech translation is performed incrementally based on generation of partial hypotheses from speech recognition. Speech-input translation can be properly approached as a pattern recognition problem by means of statistical alignment models and stochastic finite-state transducers. Under this general framework, some specific models are presented. One of the features of such models is their capability of automatically learning from training examples. The speech translation system consists of three modules: automatic speech recognition, machine translation and text to speech synthesis. Many procedures for incorporation of speech recognition and machine translation have been projected. In this research, we want explore methodologies and metrics employed to assess the (speech-to-speech) way in translation systems.


Author(s):  
Diego Alves ◽  
Askars Salimbajevs ◽  
Mārcis Pinnis

Pipeline-based speech translation methods may suffer from errors found in speech recognition system output. Therefore, it is crucial that machine translation systems are trained to be robust against such noise. In this paper, we propose two methods for parallel data augmentation for pipeline-based speech translation system development. The first method utilises a speech processing workflow to introduce errors and the second method generates commonly found suffix errors using a rule-based method. We show that the methods in combination allow significantly improving speech translation quality by 1.87 BLEU points over a baseline system.


Author(s):  
kitoshi Okumura ◽  
Ken-ichi Iso ◽  
Shin-ichi Doi ◽  
Kiyoshi Yamabana ◽  
Ken Hanazawa ◽  
...  

Sensors ◽  
2021 ◽  
Vol 21 (5) ◽  
pp. 1697
Author(s):  
Xicong Li ◽  
Zabih Ghassemlooy ◽  
Stanislav Zvánovec ◽  
Paul Anthony Haigh

With advances in solid-state lighting, visible light communication (VLC) has emerged as a promising technology to enhance existing light-emitting diode (LED)-based lighting infrastructure by adding data communication capabilities to the illumination functionality. The last decade has witnessed the evolution of the VLC concept through global standardisation and product launches. Deploying VLC systems typically requires replacing existing light sources with new luminaires that are equipped with data communication functionality. To save the investment, it is clearly desirable to make the most of the existing illumination systems. This paper investigates the feasibility of adding data communication functionality to the existing lighting infrastructure. We do this by designing an experimental system in an indoor environment based on an off-the-shelf LED panel typically used in office environments, with the dimensions of 60 × 60 cm2. With minor modifications, the VLC function is implemented, and all of the modules of the LED panel are fully reused. A data rate of 40 Mb/s is supported at a distance of up to 2 m while using the multi-band carrierless amplitude and phase (CAP) modulation. Two main limiting factors for achieving higher data rates are observed. The first factor is the limited bandwidth of the LED string inside the panel. The second is the flicker due to the residual ripple of the bias current that is generated by the panel’s driver. Flicker is introduced by the low-cost driver, which provides bias currents that fluctuate in the low frequency range (less than several kilohertz). This significantly reduces the transmitter’s modulation depth. Concurrently, the driver can also introduce an effect that is similar to baseline wander at the receiver if the flicker is not completely filtered out. We also proposed a solution based on digital signal processing (DSP) to mitigate the flicker issue at the receiver side and its effectiveness has been confirmed.


2011 ◽  
Vol 26 (1-2) ◽  
pp. 159-176 ◽  
Author(s):  
Sherri Condon ◽  
Mark Arehart ◽  
Dan Parvaz ◽  
Gregory Sanders ◽  
Christy Doran ◽  
...  

Micromachines ◽  
2021 ◽  
Vol 12 (6) ◽  
pp. 697
Author(s):  
Siming Lu ◽  
Sha Lin ◽  
Hongrui Zhang ◽  
Liguo Liang ◽  
Shien Shen

Respiratory viral infections threaten human life and inflict an enormous healthcare burden worldwide. Frequent monitoring of viral antibodies and viral load can effectively help to control the spread of the virus and make timely interventions. However, current methods for detecting viral load require dedicated personnel and are time-consuming. Additionally, COVID-19 detection is generally relied on an automated PCR analyzer, which is highly instrument-dependent and expensive. As such, emerging technologies in the development of respiratory viral load assays for point-of-care (POC) testing are urgently needed for viral screening. Recent advances in loop-mediated isothermal amplification (LAMP), biosensors, nanotechnology-based paper strips and microfluidics offer new strategies to develop a rapid, low-cost, and user-friendly respiratory viral monitoring platform. In this review, we summarized the traditional methods in respiratory virus detection and present the state-of-art technologies in the monitoring of respiratory virus at POC.


2008 ◽  
Vol 17 (03) ◽  
pp. 315-328 ◽  
Author(s):  
TANAY CHATTOPADHYAY ◽  
GOUTAM KUMAR MAITY ◽  
JITENDRA NATH ROY

Nonlinear optics has been of increased interest for all-optical signal, data and image processing in high speed photonic networks. The application of multi-valued (nonbinary) digital signals can provide considerable relief in transmission, storage and processing of a large amount of information in digital signal processing. Here, we propose the design of an all-optical system for some basic tri-state logic operations (trinary OR, trinary AND, trinary XOR, Inverter, Truth detector, False detector) which exploits the polarization properties of light. Nonlinear material based optical switch can play an important role. Tri-state logic can play a significant role towards carry and borrow free arithmetic operations. The principles and possibilities of the design of nonlinear material based tri-state logic circuits are proposed and described.


Sign in / Sign up

Export Citation Format

Share Document