scholarly journals Speaker verification with a priori threshold determination using kernel-based probabilistic neural networks

Author(s):  
Kwok-kwong Yiu ◽  
Man-Wai Mak ◽  
Sun-Yuan Kung
2021 ◽  
Vol 9 (2) ◽  
pp. T585-T598
Author(s):  
Abidin B. Caf ◽  
John D. Pigott

Extensive dolomitization is prevalent in the platform and periplatform carbonates in the Lower-Middle Permian strata in the Midland and greater Permian Basin. Early workers have found that the platform and shelf-top carbonates were dolomitized, whereas slope and basinal carbonates remained calcitic, proposing a reflux dolomitization model as the possible diagenetic mechanism. More importantly, they underline that this dolomitization pattern controls the porosity and forms an updip seal. These studies are predominately conducted using well logs, cores, and outcrop analogs, and although exhibiting high resolution vertically, such determinations are laterally sparse. We have used supervised Bayesian classification and probabilistic neural networks (PNN) on a 3D seismic volume to create an estimation of the most probable distribution of dolomite and limestone within a subsurface 3D volume petrophysically constrained. Combining this lithologic information with porosity, we then illuminate the diagenetic effects on a seismic scale. We started our workflow by deriving lithology classifications from well-log crossplots of neutron porosity and acoustic impedance to determine the a priori proportions of the lithology and the probability density functions calculation for each lithology type. Then, we applied these probability distributions and a priori proportions to 3D seismic volumes of the acoustic impedance and predicted neutron porosity volume to create a lithology volume and probability volumes for each lithology type. The acoustic impedance volume was obtained by model-based poststack inversion, and the neutron porosity volume was obtained by the PNN. Our results best supported a regional reflux dolomitization model, in which the porosity is increasing from shelf to slope while the dolomitization is decreasing, but with sea-level forcing. With this study, we determined that diagenesis and the corresponding reservoir quality in these platforms and periplatform strata can be directly imaged and mapped on a seismic scale by quantitative seismic interpretation and supervised classification methods.


2002 ◽  
Vol 12 (05) ◽  
pp. 381-397 ◽  
Author(s):  
K. K. YIU ◽  
M. W. MAK ◽  
S. Y. KUNG

This paper compares kernel-based probabilistic neural networks for speaker verification based on 138 speakers of the YOHO corpus. Experimental evaluations using probabilistic decision-based neural networks (PDBNNs), Gaussian mixture models (GMMs) and elliptical basis function networks (EBFNs) as speaker models were conducted. The original training algorithm of PDBNNs was also modified to make PDBNNs appropriate for speaker verification. Results show that the equal error rate obtained by PDBNNs and GMMs is less than that of EBFNs (0.33% vs. 0.48%), suggesting that GMM- and PDBNN-based speaker models outperform the EBFN ones. This work also finds that the globally supervised learning of PDBNNs is able to find decision thresholds that not only maintain the false acceptance rates to a low level but also reduce their variation, whereas the ad-hoc threshold-determination approach used by the EBFNs and GMMs causes a large variation in the error rates. This property makes the performance of PDBNN-based systems more predictable.


2007 ◽  
Vol 70 (7-9) ◽  
pp. 1424-1438 ◽  
Author(s):  
Todor D. Ganchev ◽  
Dimitris K. Tasoulis ◽  
Michael N. Vrahatis ◽  
Nikos D. Fakotakis

1995 ◽  
Vol 31 (22) ◽  
pp. 1930-1931 ◽  
Author(s):  
D. Anguita ◽  
S. Rovetta ◽  
S. Ridella ◽  
R. Zunino

2004 ◽  
Vol 34 (1) ◽  
pp. 37-52
Author(s):  
Wiktor Jassem ◽  
Waldemar Grygiel

The mid-frequencies and bandwidths of formants 1–5 were measured at targets, at plus 0.01 s and at minus 0.01 s off the targets of vowels in a 100-word list read by five male and five female speakers, for a total of 3390 10-variable spectrum specifications. Each of the six Polish vowel phonemes was represented approximately the same number of times. The 3390* 10 original-data matrix was processed by probabilistic neural networks to produce a classification of the spectra with respect to (a) vowel phoneme, (b) identity of the speaker, and (c) speaker gender. For (a) and (b), networks with added input information from another independent variable were also used, as well as matrices of the numerical data appropriately normalized. Mean scores for classification with respect to phonemes in a multi-speaker design in the testing sets were around 95%, and mean speaker-dependent scores for the phonemes varied between 86% and 100%, with two speakers scoring 100% correct. The individual voices were identified between 95% and 96% of the time, and classifications of the spectra for speaker gender were practically 100% correct.


Sign in / Sign up

Export Citation Format

Share Document