scholarly journals Modelling the Microphone-Related Timbral Brightness of Recorded Signals

2021 ◽  
Vol 11 (14) ◽  
pp. 6461
Author(s):  
Andy Pearce ◽  
Tim Brookes ◽  
Russell Mason

Brightness is one of the most common timbral descriptors used for searching audio databases, and is also the timbral attribute of recorded sound that is most affected by microphone choice, making a brightness prediction model desirable for automatic metadata generation. A model, sensitive to microphone-related as well as source-related brightness, was developed based on a novel combination of the spectral centroid and the ratio of the total magnitude of the signal above 500 Hz to that of the full signal. This model performed well on training data (r = 0.922). Validating it on new data showed a slight gradient error but good linear correlation across source types and overall (r = 0.955). On both training and validation data, the new model out-performed metrics previously used for brightness prediction.

2020 ◽  
Vol 41 (Supplement_2) ◽  
Author(s):  
S Gao ◽  
D Stojanovski ◽  
A Parker ◽  
P Marques ◽  
S Heitner ◽  
...  

Abstract Background Correctly identifying views acquired in a 2D echocardiographic examination is paramount to post-processing and quantification steps often performed as part of most clinical workflows. In many exams, particularly in stress echocardiography, microbubble contrast is used which greatly affects the appearance of the cardiac views. Here we present a bespoke, fully automated convolutional neural network (CNN) which identifies apical 2, 3, and 4 chamber, and short axis (SAX) views acquired with and without contrast. The CNN was tested in a completely independent, external dataset with the data acquired in a different country than that used to train the neural network. Methods Training data comprised of 2D echocardiograms was taken from 1014 subjects from a prospective multisite, multi-vendor, UK trial with the number of frames in each view greater than 17,500. Prior to view classification model training, images were processed using standard techniques to ensure homogenous and normalised image inputs to the training pipeline. A bespoke CNN was built using the minimum number of convolutional layers required with batch normalisation, and including dropout for reducing overfitting. Before processing, the data was split into 90% for model training (211,958 frames), and 10% used as a validation dataset (23,946 frames). Image frames from different subjects were separated out entirely amongst the training and validation datasets. Further, a separate trial dataset of 240 studies acquired in the USA was used as an independent test dataset (39,401 frames). Results Figure 1 shows the confusion matrices for both validation data (left) and independent test data (right), with an overall accuracy of 96% and 95% for the validation and test datasets respectively. The accuracy for the non-contrast cardiac views of >99% exceeds that seen in other works. The combined datasets included images acquired across ultrasound manufacturers and models from 12 clinical sites. Conclusion We have developed a CNN capable of automatically accurately identifying all relevant cardiac views used in “real world” echo exams, including views acquired with contrast. Use of the CNN in a routine clinical workflow could improve efficiency of quantification steps performed after image acquisition. This was tested on an independent dataset acquired in a different country to that used to train the model and was found to perform similarly thus indicating the generalisability of the model. Figure 1. Confusion matrices Funding Acknowledgement Type of funding source: Private company. Main funding source(s): Ultromics Ltd.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Zhixiang Yu ◽  
Haiyan He ◽  
Yanan Chen ◽  
Qiuhe Ji ◽  
Min Sun

AbstractOvarian cancer (OV) is a common type of carcinoma in females. Many studies have reported that ferroptosis is associated with the prognosis of OV patients. However, the mechanism by which this occurs is not well understood. We utilized Genotype-Tissue Expression (GTEx) and The Cancer Genome Atlas (TCGA) to identify ferroptosis-related genes in OV. In the present study, we applied Cox regression analysis to select hub genes and used the least absolute shrinkage and selection operator to construct a prognosis prediction model with mRNA expression profiles and clinical data from TCGA. A series of analyses for this signature was performed in TCGA. We then verified the identified signature using International Cancer Genome Consortium (ICGC) data. After a series of analyses, we identified six hub genes (DNAJB6, RB1, VIMP/ SELENOS, STEAP3, BACH1, and ALOX12) that were then used to construct a model using a training data set. The model was then tested using a validation data set and was found to have high sensitivity and specificity. The identified ferroptosis-related hub genes might play a critical role in the mechanism of OV development. The gene signature we identified may be useful for future clinical applications.


2003 ◽  
Vol 69 (7) ◽  
pp. 3996-3998 ◽  
Author(s):  
M. Reeslev ◽  
M. Miller ◽  
K. F. Nielsen

ABSTRACT Two mold species, Stachybotrys chartarum and Aspergillus versicolor, were inoculated onto agar overlaid with cellophane, allowing determination of a direct measurement of biomass density by weighing. Biomass density, ergosterol content, and beta-N-acetylhexosaminidase (3.2.1.52) activity were monitored from inoculation to stationary phase. Regression analysis showed a good linear correlation to biomass density for both ergosterol content and beta-N-acetylhexosaminidase activity. The same two mold species were inoculated onto wallpapered gypsum board, from which a direct biomass measurement was not possible. Growth was measured as an increase in ergosterol content and beta-N-acetylhexosaminidase activity. A good linear correlation was seen between ergosterol content and beta-N-acetylhexosaminidase activity. From the experiments performed on agar medium, conversion factors (CFs) for estimating biomass density from ergosterol content and beta-N-acetylhexosaminidase activity were determined. The CFs were used to estimate the biomass density of the molds grown on gypsum board. The biomass densities estimated from ergosterol content and beta-N-acetylhexosaminidase activity data gave similar results, showing significantly slower growth and lower stationary-phase biomass density on gypsum board than on agar.


2021 ◽  
Vol 8 (5) ◽  
pp. 929
Author(s):  
Hurriyatul Fitriyah ◽  
Rizal Maulana

<p class="Abstrak">Gulma merupakan tanaman pengganggu dalam lahan pertanian. Herbisida merupakan obat yang efektif membunuh gulma tersebut. Penyemprotan herbisida harus tepat sasaran kepada gulma saja dan tidak mengenai tanaman. Penelitian ini membuat sistem yang dapat mendeteksi gulma secara otomatis di antara tanaman pada lahan pertanian riil. Sistem ini menggunakan gambar lahan pertanian riil dimana tanaman tampak utuh (daun dapat lebih dari satu) yang diambil menggunakan kamera dengan posisi vertikal menghadap ke bawah. Algoritma yang dibuat menggunakan segmentasi berdasarkan warna hijau dalam ruang warna HSV untuk mendeteksi daun, baik gulma maupun tanaman pada beragam pencahayaan. Sebanyak tiga fitur bentuk domain spasial digunakan untuk membedakan gulma dengan tanaman yang memiliki karakteristik bentuk daun yang berbeda. Fitur bentuk yang digunakan adalah <em>Rectangularity, Edge-to-Center distances function</em>, dan <em>Distance Transform function</em>. Klasifikasi gulma dan tanaman menggunakan metode Jaringan syaraf tiruan (JST) yang dapat dilatih secara <em>offline. </em>Dari 149 tanaman yang terdeteksi dimana 70% sebagai data training, 15% data validasi dan 15% data uji, didapati akurasi pengujian sebesar 95.46%.</p><p class="Abstrak"><em><strong><br /></strong></em></p><p class="Abstrak"><em><strong>Abstract</strong></em></p><p class="Abstract"><em>Weed is a major challenge in a crop plantation. A herbicide is the most effective substance to kill this unwanted vegetation. Spraying the herbicide must be done carefully to target the weeds only. Here in this research, we develop an algorithm that detects weeds among the plants based on the shape of their leaves. The detection is based on images that were acquired using a camera. The leaves of weeds and plants were detected based on their green color using segmentation in HSV color-space as it is more effective to detect objects in various illumination. Three shape features were extracted, which are Rectangularity that is based on Rectangularity, Edge-to-Center distance function, and Distance Transform function. Those features were fed into a learning algorithm, Artificial Neural Network (ANN), to classify whether it is the plant or the weed. The testing on the weed classification in a real outdoor environment showed 95.46% accuracy using a total of 149 detected plants (70% as training data, 15%  as validation data, and 15% as testing data).<strong></strong></em></p><p class="Abstrak"><em><strong><br /></strong></em></p>


2021 ◽  
Author(s):  
Yash Chauhan ◽  
Prateek Singh

Coins recognition systems have humungous applications from vending and slot machines to banking and management firms which directly translate to a high volume of research regarding the development of methods for such classification. In recent years, academic research has shifted towards a computer vision approach for sorting coins due to the advancement in the field of deep learning. However, most of the documented work utilizes what is known as ‘Transfer Learning’ in which we reuse a pre-trained model of a fixed architecture as a starting point for our training. While such an approach saves us a lot of time and effort, the generic nature of the pre-trained model can often become a bottleneck for performance on a specialized problem such as coin classification. This study develops a convolutional neural network (CNN) model from scratch and tests it against a widely-used general-purpose architecture known as Googlenet. We have shown in this study by comparing the performance of our model with that of Googlenet (documented in various previous studies) that a more straightforward and specialized architecture is more optimal than a more complex general architecture for the coin classification problem. The model developed in this study is trained and tested on 720 and 180 images of Indian coins of different denominations, respectively. The final accuracy gained by the model is 91.62% on the training data, while the accuracy is 90.55% on the validation data.


2021 ◽  
Vol 14 (2) ◽  
pp. 127-135
Author(s):  
Fadhil Yusuf Rahadika ◽  
Novanto Yudistira ◽  
Yuita Arum Sari

During the COVID-19 pandemic, many offline activities are turned into online activities via video meetings to prevent the spread of the COVID 19 virus. In the online video meeting, some micro-interactions are missing when compared to direct social interactions. The use of machines to assist facial expression recognition in online video meetings is expected to increase understanding of the interactions among users. Many studies have shown that CNN-based neural networks are quite effective and accurate in image classification. In this study, some open facial expression datasets were used to train CNN-based neural networks with a total number of training data of 342,497 images. This study gets the best results using ResNet-50 architecture with Mish activation function and Accuracy Booster Plus block. This architecture is trained using the Ranger and Gradient Centralization optimization method for 60000 steps with a batch size of 256. The best results from the training result in accuracy of AffectNet validation data of 0.5972, FERPlus validation data of 0.8636, FERPlus test data of 0.8488, and RAF-DB test data of 0.8879. From this study, the proposed method outperformed plain ResNet in all test scenarios without transfer learning, and there is a potential for better performance with the pre-training model. The code is available at https://github.com/yusufrahadika-facial-expressions-essay.


2019 ◽  
Vol 19 (1) ◽  
pp. 89-113 ◽  
Author(s):  
Yujiao Zhu ◽  
Kai Li ◽  
Yanjie Shen ◽  
Yang Gao ◽  
Xiaohuan Liu ◽  
...  

Abstract. We measured the particle number concentration, size distribution, and new particle formation (NPF) events in the marine atmosphere during six cruise campaigns over the marginal seas of China in 2011–2016 and one campaign from the marginal seas to the Northwest Pacific Ocean (NWPO) in 2014. We observed relatively frequent NPF events in the atmosphere over the marginal seas of China, i.e., on 23 out of 126 observational days, with the highest frequency of occurrence in fall, followed by spring and summer. In total, 22 out of 23 NPF events were found to be associated with the long-range transport of continental pollutants based on 24 h air mass back trajectories and pre-existing particle number concentrations, which largely exceeded the clean marine background, leaving one much weaker NPF event that was likely induced by oceanic precursors alone, as supported by multiple independent pieces of evidence. Although the long-range transport signal of continental pollutants can be clearly observed in the remote marine atmosphere over the NWPO, NPF events were observed on only 2 out of 36 days. The nucleation-mode particles (<30 nm), however, accounted for as high as 35 %±13 % of the total particle number concentration during the NWPO cruise campaign, implying the existence of many undetected NPF events in the near-sea-level atmosphere or above. To better characterize NPF events, we introduce a term called the net maximum increase in the nucleation-mode particle number concentration (NMINP) and correlate it with the formation rate of new particles (FR). We find a moderately good linear correlation between NMINP and FR at FR≤8 cm−3 s−1, but no correlation exists at FR>8 cm−3 s−1. The possible mechanisms are argued in terms of the roles of different vapor precursors. We also find that a ceiling exists for the growth of new particles from 10 nm to larger sizes in most NPF events. We thereby introduce a term called the maximum geometric median diameter of new particles (Dpgmax) and correlate it with the growth rate of new particles (GR). A moderately good linear correlation is also obtained between the Dpgmax and GR, and only GR values larger than 7.9 nm h−1 can lead to new particles growing with a Dpgmax beyond 50 nm based on the equation. By combining simultaneous measurements of the particle number size distributions and cloud condensation nuclei (CCN) at different super saturations (SS), we observed a clear increase in CCN when the Dpg of new particles exceeded 50 nm at SS=0.4 %. However, this case did not occur for SS=0.2 %. Consistent with the results of previous studies in the continental atmosphere, our results imply that particles smaller than 50 nm are unlikely activated as CCN at SS=0.4 % in the marine atmosphere. Moreover, κ decrease from 0.4 to 0.1 during the growth period of new particles, implying that organics likely overwhelm the growth of new particles to CCN size. The chemical analysis of nano-Micro-Orifice Uniform Deposit Impactor (nano-MOUDI) samples reveals that trimethylamine (TMA) and oxalic acid might appreciably contribute to the growth of new particles in some cases.


2018 ◽  
Vol 10 (11) ◽  
pp. 1831 ◽  
Author(s):  
Jianbin Tao ◽  
Deepak Mishra ◽  
David Cotten ◽  
Jessica O’Connell ◽  
Monique Leclerc ◽  
...  

Despite the importance of tidal ecosystems in the global carbon budget, the relationships between environmental drivers and carbon dynamics in these wetlands remain poorly understood. This limited understanding results from the challenges associated with in situ flux studies and their correlation with satellite imagery which can be affected by periodic tidal flooding. Carbon dioxide eddy covariance (EC) towers are installed in only a few wetlands worldwide, and the longest eddy-covariance record from Georgia (GA) wetlands contains only two continuous years of observations. The goals of the present study were to evaluate the performance of existing MODIS Gross Primary Production (GPP) products (MOD17A2) against EC derived GPP and develop a tide-robust Normalized Difference Moisture Index (NDMI) based model to predict GPP within a Spartina alterniflora salt marsh on Sapelo Island, GA. These EC tower-based observations represent a basis to associate CO2 fluxes with canopy reflectance and thus provide the means to use satellite-based reflectance data for broader scale investigations. We demonstrate that Light Use Efficiency (LUE)-based MOD17A2 does not accurately reflect tidal wetland GPP compared to a simple empirical vegetation index-based model where tidal influence was accounted for. The NDMI-based GPP model was capable of predicting changes in wetland CO2 fluxes and explained 46% of the variation in flux-estimated GPP within the training data, and a root mean square error of 6.96 g C m−2 in the validation data. Our investigation is the first to create a MODIS-based wetland GPP estimation procedure that demonstrates the importance of filtering tidal observations from satellite surface reflectance data.


Heart ◽  
2018 ◽  
Vol 104 (23) ◽  
pp. 1921-1928 ◽  
Author(s):  
Ming-Zher Poh ◽  
Yukkee Cheung Poh ◽  
Pak-Hei Chan ◽  
Chun-Ka Wong ◽  
Louise Pun ◽  
...  

ObjectiveTo evaluate the diagnostic performance of a deep learning system for automated detection of atrial fibrillation (AF) in photoplethysmographic (PPG) pulse waveforms.MethodsWe trained a deep convolutional neural network (DCNN) to detect AF in 17 s PPG waveforms using a training data set of 149 048 PPG waveforms constructed from several publicly available PPG databases. The DCNN was validated using an independent test data set of 3039 smartphone-acquired PPG waveforms from adults at high risk of AF at a general outpatient clinic against ECG tracings reviewed by two cardiologists. Six established AF detectors based on handcrafted features were evaluated on the same test data set for performance comparison.ResultsIn the validation data set (3039 PPG waveforms) consisting of three sequential PPG waveforms from 1013 participants (mean (SD) age, 68.4 (12.2) years; 46.8% men), the prevalence of AF was 2.8%. The area under the receiver operating characteristic curve (AUC) of the DCNN for AF detection was 0.997 (95% CI 0.996 to 0.999) and was significantly higher than all the other AF detectors (AUC range: 0.924–0.985). The sensitivity of the DCNN was 95.2% (95% CI 88.3% to 98.7%), specificity was 99.0% (95% CI 98.6% to 99.3%), positive predictive value (PPV) was 72.7% (95% CI 65.1% to 79.3%) and negative predictive value (NPV) was 99.9% (95% CI 99.7% to 100%) using a single 17 s PPG waveform. Using the three sequential PPG waveforms in combination (<1 min in total), the sensitivity was 100.0% (95% CI 87.7% to 100%), specificity was 99.6% (95% CI 99.0% to 99.9%), PPV was 87.5% (95% CI 72.5% to 94.9%) and NPV was 100% (95% CI 99.4% to 100%).ConclusionsIn this evaluation of PPG waveforms from adults screened for AF in a real-world primary care setting, the DCNN had high sensitivity, specificity, PPV and NPV for detecting AF, outperforming other state-of-the-art methods based on handcrafted features.


2018 ◽  
Author(s):  
Carla Márquez-Luna ◽  
Steven Gazal ◽  
Po-Ru Loh ◽  
Samuel S. Kim ◽  
Nicholas Furlotte ◽  
...  

AbstractGenetic variants in functional regions of the genome are enriched for complex trait heritability. Here, we introduce a new method for polygenic prediction, LDpred-funct, that leverages trait-specific functional priors to increase prediction accuracy. We fit priors using the recently developed baseline-LD model, which includes coding, conserved, regulatory and LD-related annotations. We analytically estimate posterior mean causal effect sizes and then use cross-validation to regularize these estimates, improving prediction accuracy for sparse architectures. LDpred-funct attained higher prediction accuracy than other polygenic prediction methods in simulations using real genotypes. We applied LDpred-funct to predict 21 highly heritable traits in the UK Biobank. We used association statistics from British-ancestry samples as training data (avg N=373K) and samples of other European ancestries as validation data (avg N=22K), to minimize confounding. LDpred-funct attained a +4.6% relative improvement in average prediction accuracy (avg prediction R2=0.144; highest R2=0.413 for height) compared to SBayesR (the best method that does not incorporate functional information). For height, meta-analyzing training data from UK Biobank and 23andMe cohorts (total N=1107K; higher heritability in UK Biobank cohort) increased prediction R2 to 0.431. Our results show that incorporating functional priors improves polygenic prediction accuracy, consistent with the functional architecture of complex traits.


Sign in / Sign up

Export Citation Format

Share Document