Maximum-a-Posteriori-Based Decoding for End-to-End Acoustic Models

2017 ◽  
Vol 25 (5) ◽  
pp. 1023-1034 ◽  
Author(s):  
Naoyuki Kanda ◽  
Xugang Lu ◽  
Hisashi Kawai
1994 ◽  
Vol 02 (03) ◽  
pp. 251-266 ◽  
Author(s):  
PETER GERSTOFT

The data set from the Workshop on Acoustic Models in Signal Processing (May 1993) is inverted in order to find both the environmental parameters and the source position, Genetic algorithms are used for the optimization. When using genetic algorithms the responses from many environmental parameter sets are computed in order to estimate the solution. All these samples of the parameter space are used to estimate the a posteriori probabilities of the model parameters. Thus the uniqueness and uncertainty of the model parameters are assessed.


Author(s):  
Danny Henry Galatang ◽  
◽  
Suyanto Suyanto ◽  

The syllable-based automatic speech recognition (ASR) systems commonly perform better than the phoneme-based ones. This paper focuses on developing an Indonesian monosyllable-based ASR (MSASR) system using an ASR engine called SPRAAK and comparing it to a phoneme-based one. The Mozilla DeepSpeech-based end-to-end ASR (MDSE2EASR), one of the state-of-the-art models based on character (similar to the phoneme-based model), is also investigated to confirm the result. Besides, a novel Kaituoxu SpeechTransformer (KST) E2EASR is also examined. Testing on the Indonesian speech corpus of 5,439 words shows that the proposed MSASR produces much higher word accuracy (76.57%) than the monophone-based one (63.36%). Its performance is comparable to the character-based MDS-E2EASR, which produces 76.90%, and the character-based KST-E2EASR (78.00%). In the future, this monosyllable-based ASR is possible to be improved to the bisyllable-based one to give higher word accuracy. Nevertheless, extensive bisyllable acoustic models must be handled using an advanced method.


Author(s):  
Arno J. Bleeker ◽  
Mark H.F. Overwijk ◽  
Max T. Otten

With the improvement of the optical properties of the modern TEM objective lenses the point resolution is pushed beyond 0.2 nm. The objective lens of the CM300 UltraTwin combines a Cs of 0. 65 mm with a Cc of 1.4 mm. At 300 kV this results in a point resolution of 0.17 nm. Together with a high-brightness field-emission gun with an energy spread of 0.8 eV the information limit is pushed down to 0.1 nm. The rotationally symmetric part of the phase contrast transfer function (pctf), whose first zero at Scherzer focus determines the point resolution, is mainly determined by the Cs and defocus. Apart from the rotationally symmetric part there is also the non-rotationally symmetric part of the pctf. Here the main contributors are not only two-fold astigmatism and beam tilt but also three-fold astigmatism. The two-fold astigmatism together with the beam tilt can be corrected in a straight-forward way using the coma-free alignment and the objective stigmator. However, this only works well when the coefficient of three-fold astigmatism is negligible compared to the other aberration coefficients. Unfortunately this is not generally the case with the modern high-resolution objective lenses. Measurements done at a CM300 SuperTwin FEG showed a three fold-astigmatism of 1100 nm which is consistent with measurements done by others. A three-fold astigmatism of 1000 nm already sinificantly influences the image at a spatial frequency corresponding to 0.2 nm which is even above the point resolution of the objective lens. In principle it is possible to correct for the three-fold astigmatism a posteriori when through-focus series are taken or when off-axis holography is employed. This is, however not possible for single images. The only possibility is then to correct for the three-fold astigmatism in the microscope by the addition of a hexapole corrector near the objective lens.


VASA ◽  
2016 ◽  
Vol 45 (3) ◽  
pp. 223-228 ◽  
Author(s):  
Jan Paweł Skóra ◽  
Jacek Kurcz ◽  
Krzysztof Korta ◽  
Przemysław Szyber ◽  
Tadeusz Andrzej Dorobisz ◽  
...  

Abstract. Background: We present the methods and results of the surgical management of extracranial carotid artery aneurysms (ECCA). Postoperative complications including early and late neurological events were analysed. Correlation between reconstruction techniques and morphology of ECCA was assessed in this retrospective study. Patients and methods: In total, 32 reconstructions of ECCA were performed in 31 symptomatic patients with a mean age of 59.2 (range 33 - 84) years. The causes of ECCA were divided among atherosclerosis (n = 25; 78.1 %), previous carotid endarterectomy with Dacron patch (n = 4; 12.5 %), iatrogenic injury (n = 2; 6.3 %) and infection (n = 1; 3.1 %). In 23 cases, intervention consisted of carotid bypass. Aneurysmectomy with end-to-end suture was performed in 4 cases. Aneurysmal resection with patching was done in 2 cases and aneurysmorrhaphy without patching in another 2 cases. In 1 case, ligature of the internal carotid artery (ICA) was required. Results: Technical success defined as the preservation of ICA patency was achieved in 31 cases (96.9 %). There was one perioperative death due to major stroke (3.1 %). Two cases of minor stroke occurred in the 30-day observation period (6.3 %). Three patients had a transient hypoglossal nerve palsy that subsided spontaneously (9.4 %). At a mean long-term follow-up of 68 months, there were no major or minor ipsilateral strokes or surgery-related deaths reported. In all 30 surviving patients (96.9 %), long-term clinical outcomes were free from ipsilateral neurological symptoms. Conclusions: Open surgery is a relatively safe method in the therapy of ECCA. Surgical repair of ECCAs can be associated with an acceptable major stroke rate and moderate minor stroke rate. Complication-free long-term outcomes can be achieved in as many as 96.9 % of patients. Aneurysmectomy with end-to-end anastomosis or bypass surgery can be implemented during open repair of ECCA.


Sign in / Sign up

Export Citation Format

Share Document