Localizing category-related information in speech with multi-scale analyses

PLoS ONE ◽  
2021 ◽  
Vol 16 (10) ◽  
pp. e0258178
Author(s):  
Sam Tilsen ◽  
Seung-Eun Kim ◽  
Claire Wang

Measurements of the physical outputs of speech (vocal tract geometry and acoustic energy) are high-dimensional, but linguistic theories posit a low-dimensional set of categories such as phonemes and phrase types. How can it be determined when and where in high-dimensional articulatory and acoustic signals there is information related to theoretical categories? For a variety of reasons, it is problematic to directly quantify mutual information between hypothesized categories and signals. To address this issue, a multi-scale analysis method is proposed for localizing category-related information in an ensemble of speech signals using machine learning algorithms. By analyzing how classification accuracy on unseen data varies as the temporal extent of training input is systematically restricted, inferences can be drawn regarding the temporal distribution of category-related information. The method can also be used to investigate redundancy between subsets of signal dimensions. Two types of theoretical categories are examined in this paper: phonemic/gestural categories and syntactic relative clause categories. Moreover, two different machine learning algorithms are examined: linear discriminant analysis and neural networks with long short-term memory units. Both algorithms detected category-related information earlier and later in the signals than would be expected given standard theoretical assumptions about when linguistic categories should influence speech. The neural network algorithm identified category-related information to a greater extent than the discriminant analyses.
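A minimal sketch of the core idea, assuming simulated signals and the MASS implementation of linear discriminant analysis (the data layout, window scheme, and variable names are illustrative assumptions, not the authors' code): train the classifier only on a restricted temporal window of each signal and track held-out accuracy as the window is shifted.

```r
library(MASS)  # for lda()

set.seed(1)
n <- 200; t_len <- 50
# Simulated ensemble: category-related information concentrated around frames 20-30
category <- factor(rep(c("A", "B"), each = n / 2))
signals <- matrix(rnorm(n * t_len), n, t_len)
signals[category == "B", 20:30] <- signals[category == "B", 20:30] + 1

train_idx <- sample(n, 0.7 * n)

# Accuracy on unseen data as a function of the temporal window used for training
window_width <- 5
starts <- seq(1, t_len - window_width + 1, by = 5)
acc <- sapply(starts, function(s) {
  cols <- s:(s + window_width - 1)
  fit  <- lda(x = signals[train_idx, cols], grouping = category[train_idx])
  pred <- predict(fit, signals[-train_idx, cols])$class
  mean(pred == category[-train_idx])
})
data.frame(window_start = starts, held_out_accuracy = round(acc, 2))
```

Windows that overlap the informative frames should show above-chance held-out accuracy, which is the kind of evidence the paper uses to localize category-related information in time.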

2021 ◽  
Vol 99 (Supplement_3) ◽  
pp. 264-265
Author(s):  
Duy Ngoc Do ◽  
Guoyu Hu ◽  
Younes Miar

Abstract American mink (Neovison vison) is the major source of fur for fur industries worldwide, and Aleutian disease (AD) causes severe financial losses to the mink industry. Different methods have been used to diagnose AD in mink, but a combination of several methods can be the most appropriate approach for the selection of AD-resilient mink. The iodine agglutination test (IAT) and counterimmunoelectrophoresis (CIEP) are commonly employed in a test-and-remove strategy, while enzyme-linked immunosorbent assay (ELISA) and packed-cell volume (PCV) methods are complementary. However, using multiple methods is expensive, which hinders the correct use of AD tests in selection. This research presents an assessment of AD classification based on machine learning algorithms. Aleutian disease was tested on 1,830 individuals using these tests on an AD-positive mink farm (Canadian Centre for Fur Animal Research, NS, Canada). The accuracy of classification for CIEP was evaluated based on sex information and the IAT, ELISA, and PCV test results, implemented in seven machine learning classification algorithms (Random Forest, Artificial Neural Networks, C50Tree, Naive Bayes, Generalized Linear Models, Boost, and Linear Discriminant Analysis) using the caret package in R. The accuracy of prediction varied among the methods. Overall, Random Forest was the best-performing algorithm for the current dataset, with an accuracy of 0.89 in the training data and 0.94 in the testing data. Our work demonstrates the utility and relative ease of using machine learning algorithms to assess the CIEP information, thereby reducing the cost of AD tests. However, further work requires the inclusion of production and reproduction information in the models and the extension of phenotypic collection to increase the accuracy of the current methods.
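As a rough illustration of this kind of caret workflow, the sketch below trains one of the seven listed algorithms (Random Forest) to predict CIEP status from sex and the other test results. The simulated data frame, column names, and resampling settings are assumptions for the sake of the example, not the study's actual variables.

```r
library(caret)

set.seed(42)
# Simulated stand-in for the farm data: CIEP status plus sex and the other test results
n <- 500
mink <- data.frame(
  CIEP  = factor(sample(c("Positive", "Negative"), n, replace = TRUE)),
  Sex   = factor(sample(c("F", "M"), n, replace = TRUE)),
  IAT   = sample(0:3, n, replace = TRUE),
  ELISA = runif(n),
  PCV   = rnorm(n, mean = 45, sd = 5)
)

in_train  <- createDataPartition(mink$CIEP, p = 0.7, list = FALSE)
train_set <- mink[in_train, ]
test_set  <- mink[-in_train, ]

rf_fit <- train(CIEP ~ Sex + IAT + ELISA + PCV, data = train_set,
                method = "rf",                                   # Random Forest via caret
                trControl = trainControl(method = "cv", number = 5))

confusionMatrix(predict(rf_fit, test_set), test_set$CIEP)        # accuracy on the held-out 30%
```

Swapping `method = "rf"` for other caret method strings is how the remaining algorithms in the list would be compared under the same resampling scheme.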


Author(s):  
S. R. Mani Sekhar ◽  
G. M. Siddesh

Machine learning is an important area of computer science. It helps provide optimized solutions to real-world problems by using past knowledge or previous experience data. There are many different types of machine learning algorithms. This chapter provides an overview of selected machine learning algorithms such as linear regression, linear discriminant analysis, support vector machines, naive Bayes classifiers, neural networks, and decision trees. Each of these methods is illustrated in detail with an example and R code, which in turn assists readers in generating their own solutions to the given problems.
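In the spirit of the R examples the chapter describes, the snippet below fits one of the listed methods (linear discriminant analysis) to the built-in iris data and reports held-out accuracy; it is a generic illustration, not the chapter's own code.

```r
library(MASS)  # lda()

set.seed(7)
train_idx <- sample(nrow(iris), 0.7 * nrow(iris))

# Fit a linear discriminant model on the training split
fit <- lda(Species ~ ., data = iris[train_idx, ])

# Classify the held-out flowers and report accuracy
pred <- predict(fit, iris[-train_idx, ])$class
mean(pred == iris$Species[-train_idx])
```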


BMC Materials ◽  
2020 ◽  
Vol 2 (1) ◽  
Author(s):  
Emre Topal ◽  
Zhongquan Liao ◽  
Markus Löffler ◽  
Jürgen Gluch ◽  
Jian Zhang ◽  
...  

Author(s):  
Qianfan Wu ◽  
Adel Boueiz ◽  
Alican Bozkurt ◽  
Arya Masoomi ◽  
Allan Wang ◽  
...  

Predicting disease status for a complex human disease using genomic data is an important, yet challenging, step in personalized medicine. Among many challenges, the so-called curse of dimensionality results in unsatisfactory performance of many state-of-the-art machine learning algorithms. A major recent advance in machine learning is the rapid development of deep learning algorithms that can efficiently extract meaningful features from high-dimensional and complex datasets through a stacked and hierarchical learning process. Deep learning has shown breakthrough performance in several areas, including image recognition, natural language processing, and speech recognition. However, the performance of deep learning in predicting disease status from genomic datasets is still not well studied. In this article, we review the four relevant articles that we found through a thorough literature search. All four articles used auto-encoders to project high-dimensional genomic data to a low-dimensional space and then applied state-of-the-art machine learning algorithms to predict disease status from the low-dimensional representations. This deep learning approach outperformed existing prediction approaches, such as prediction based on probe-wise screening and prediction based on principal component analysis. The limitations of the current deep learning approach and possible improvements are also discussed.
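The common pipeline in the reviewed articles (compress with an auto-encoder, then classify on the learned representation) can be sketched roughly as follows using the keras and MASS packages. The layer sizes, the simulated "genomic" matrix, and the downstream classifier are illustrative assumptions, not the setup of any reviewed study.

```r
library(keras)
library(MASS)

set.seed(3)
n <- 300; p <- 1000                      # many probes, few samples
x <- matrix(rnorm(n * p), n, p)
y <- factor(sample(c("case", "control"), n, replace = TRUE))

# Auto-encoder: project the high-dimensional data to a low-dimensional code
input   <- layer_input(shape = p)
code    <- input %>% layer_dense(64, activation = "relu") %>% layer_dense(16, activation = "relu")
decoded <- code  %>% layer_dense(64, activation = "relu") %>% layer_dense(p, activation = "linear")

autoencoder <- keras_model(input, decoded)
encoder     <- keras_model(input, code)
autoencoder %>% compile(optimizer = "adam", loss = "mse")
autoencoder %>% fit(x, x, epochs = 20, batch_size = 32, verbose = 0)

# Classify disease status from the 16-dimensional representation
z   <- predict(encoder, x)
fit <- lda(x = z, grouping = y)
mean(predict(fit, z)$class == y)         # in-sample accuracy of the sketch
```

The same two-stage structure is what the reviewed articles compared against probe-wise screening and principal component analysis baselines.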


Sensors ◽  
2020 ◽  
Vol 20 (6) ◽  
pp. 1622 ◽  
Author(s):  
Jung-Yeon Kim ◽  
Geunsu Park ◽  
Seong-A Lee ◽  
Yunyoung Nam

Spasticity is a frequently observed symptom in patients with neurological impairments. Spastic movements of the upper and lower limbs are periodically measured to evaluate the functional outcomes of physical rehabilitation, and they are quantified by clinical outcome measures such as the modified Ashworth scale (MAS). This study proposes a method to determine the severity of elbow spasticity by analyzing acceleration and rotation attributes collected from the elbow of the patient's affected side and using machine-learning algorithms to classify the degree of spastic movement; this approach is comparable to assigning an MAS score. We collected inertial data from participants using a wearable device incorporating inertial measurement units during a passive stretch test. Machine-learning algorithms, including decision trees, random forests (RFs), support vector machines, linear discriminant analysis, and multilayer perceptrons, were evaluated in combinations of two segmentation techniques and feature sets. An RF performed well, achieving up to 95.4% accuracy. This work not only demonstrates how wearable technology and machine learning can be used to generate a clinically meaningful index but also offers rehabilitation patients an opportunity to monitor their degree of spasticity, even in non-healthcare settings where the help of clinical professionals is unavailable.
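A minimal version of the classification step might look like the sketch below: per-trial summary features are computed from the accelerometer and gyroscope streams of the passive stretch test, and a random forest maps them to an MAS-like grade. The feature set, grade labels, and data shapes are assumptions, not the study's protocol.

```r
library(randomForest)

set.seed(11)
n_trials <- 120
# Assumed per-trial summary features from the IMU (accelerometer + gyroscope)
features <- data.frame(
  acc_peak   = runif(n_trials, 1, 10),    # peak acceleration magnitude
  acc_range  = runif(n_trials, 1, 15),
  gyro_peak  = runif(n_trials, 10, 300),  # peak angular velocity
  duration_s = runif(n_trials, 0.5, 3)    # duration of the passive stretch
)
mas_grade <- factor(sample(c("MAS0", "MAS1", "MAS2", "MAS3"), n_trials, replace = TRUE))

train_idx <- sample(n_trials, 0.8 * n_trials)
rf <- randomForest(x = features[train_idx, ], y = mas_grade[train_idx], ntree = 500)

pred <- predict(rf, features[-train_idx, ])
mean(pred == mas_grade[-train_idx])       # held-out accuracy of the sketch
```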


Pain Medicine ◽  
2015 ◽  
Vol 16 (7) ◽  
pp. 1386-1401 ◽  
Author(s):  
Patrick J. Tighe ◽  
Christopher A. Harle ◽  
Robert W. Hurley ◽  
Haldun Aytug ◽  
Andre P. Boezaart ◽  
...  

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Fernando Timoteo Fernandes ◽  
Tiago Almeida de Oliveira ◽  
Cristiane Esteves Teixeira ◽  
Andre Filipe de Moraes Batista ◽  
Gabriel Dalla Costa ◽  
...  

Abstract The new coronavirus disease (COVID-19) is a challenge for clinical decision-making and the effective allocation of healthcare resources. An accurate prognostic assessment is necessary to improve the survival of patients, especially in developing countries. This study proposes to predict the risk of developing critical conditions in COVID-19 patients by training multipurpose algorithms. We followed a total of 1040 patients with a positive RT-PCR diagnosis for COVID-19 from a large hospital in São Paulo, Brazil, from March to June 2020, of whom 288 (28%) presented a severe prognosis, i.e. Intensive Care Unit (ICU) admission, use of mechanical ventilation, or death. We used routinely collected laboratory, clinical, and demographic data to train five machine learning algorithms (artificial neural networks, extra trees, random forests, CatBoost, and extreme gradient boosting). We used a random sample of 70% of patients to train the algorithms, and 30% were left for performance assessment, simulating new unseen data. To assess whether the algorithms could capture general severe prognostic patterns, each model was trained by combining two out of three outcomes to predict the other. All algorithms presented very high predictive performance (average AUROC of 0.92, sensitivity of 0.92, and specificity of 0.82). The three most important variables for the multipurpose algorithms were the ratio of lymphocytes to C-reactive protein, C-reactive protein, and the Braden Scale. The results highlight the possibility that machine learning algorithms are able to predict nonspecific negative COVID-19 outcomes from routinely collected data.
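A stripped-down version of the evaluation protocol (70/30 split, train on routinely collected predictors, report AUROC on the held-out 30%) is sketched below with one of the listed learners (random forest) and the pROC package. The simulated predictors and outcome coding are assumptions, not the hospital dataset.

```r
library(randomForest)
library(pROC)

set.seed(2020)
n <- 1040
covid <- data.frame(
  lymph_crp_ratio = rlnorm(n),           # ratio of lymphocytes to C-reactive protein
  crp             = rlnorm(n, 1),        # C-reactive protein
  braden_scale    = sample(6:23, n, replace = TRUE),
  age             = sample(18:95, n, replace = TRUE)
)
severe <- factor(rbinom(n, 1, 0.28), labels = c("no", "yes"))

train_idx <- sample(n, 0.7 * n)          # 70% for training, 30% held out
rf <- randomForest(x = covid[train_idx, ], y = severe[train_idx], ntree = 500)

prob <- predict(rf, covid[-train_idx, ], type = "prob")[, "yes"]
auc(roc(response = severe[-train_idx], predictor = prob))   # AUROC on unseen data
```

In the study's "multipurpose" setup, the same loop would be repeated with the label built from two of the three outcomes (ICU admission, mechanical ventilation, death) and evaluated on the third.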


Author(s):  
Miss. Archana Chaudahri ◽  
Mr. Nilesh Vani

Most data of interest today in data-mining applications is complex and is usually represented by many different features. Such high-dimensional data is by its very nature often quite difficult for conventional machine-learning algorithms to handle. This is considered an aspect of the well-known curse of dimensionality. Consequently, high-dimensional data needs to be processed with care, which is why the design of machine-learning algorithms needs to take these factors into account. Furthermore, it has been observed that some of the resulting high-dimensional properties can in fact be exploited to improve overall algorithm design. One such phenomenon, related to nearest-neighbor learning methods, is known as hubness and refers to the emergence of very influential nodes (hubs) in k-nearest-neighbor graphs. A crisp weighted voting scheme for the k-nearest-neighbor classifier that exploits this notion has recently been proposed.
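One way to make hubness concrete is sketched below: count how often each point appears in the k-NN lists of other points (its k-occurrence), count how often it appears with a mismatched label ("bad" occurrences), and down-weight frequent bad hubs when voting. This is a simplified hubness-weighted kNN in the spirit of the scheme described above; the exact weighting used in the proposed classifier may differ, and the toy data is an assumption.

```r
set.seed(5)
n <- 300; k <- 5
x <- matrix(rnorm(n * 2), n, 2)                      # toy 2-D data (real cases are high-dimensional)
y <- factor(ifelse(x[, 1] + rnorm(n, sd = 0.5) > 0, "A", "B"))

d <- as.matrix(dist(x)); diag(d) <- Inf
knn_idx <- t(apply(d, 1, function(row) order(row)[1:k]))   # k-NN list of every point

# k-occurrence: how often each point appears in others' k-NN lists (hubs score high)
N_k <- tabulate(knn_idx, nbins = n)

# "bad" occurrences: appearances in the lists of differently-labelled points
BN_k <- numeric(n)
for (i in seq_len(n)) {
  for (p in knn_idx[i, ]) {
    if (y[i] != y[p]) BN_k[p] <- BN_k[p] + 1
  }
}

# weight each neighbour's vote down if it is a frequent "bad" hub
w <- exp(-(BN_k - mean(BN_k)) / sd(BN_k))

classify <- function(x_new) {
  nn <- order(sqrt(colSums((t(x) - x_new)^2)))[1:k]
  names(which.max(tapply(w[nn], y[nn], sum)))        # hubness-weighted vote
}
classify(c(1, 0))   # expected to lean towards class "A" in this toy setup
```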

