Automatic Metadata Generation for Fish Specimen Image Collections

Metadata are key descriptors of research data, particularly for researchers seeking to apply machine learning (ML) to the vast collections of digitized specimens. Unfortunately, the available metadata is often sparse and, at times, erroneous. Additionally, it is prohibitively expensive to address these limitations through traditional, manual means. This paper reports on research that applies machine-driven approaches to analyzing digitized fish images and extracting various important features from them. The digitized fish specimens are being analyzed as part of the Biology Guided Neural Networks (BGNN) initiative, which is developing a novel class of artificial neural networks using phylogenies and anatomy ontologies. Automatically generated metadata is crucial for identifying the high-quality images needed for the neural network's predictive analytics. Methods that combine ML and image informatics techniques allow us to rapidly enrich the existing metadata associated with the 7,244 images from the Illinois Natural History Survey (INHS) used in our study. Results show we can accurately generate many key metadata properties relevant to the BGNN project, as well as general image quality metrics (e.g. brightness and contrast). Results also show that we can accurately generate bounding boxes and segmentation masks for fish, which are needed for subsequent machine learning analyses. The automatic process outperforms humans in terms of time and accuracy, and provides a novel solution for leveraging digitized specimens in ML. This research demonstrates the ability of computational methods to enhance the digital library services associated with the tens of thousands of digitized specimens stored in open-access repositories worldwide.

Download Full-text

Predictive Analytics for Maintaining Power System Stability in Smart Energy Communities

10.31265/usps.83 ◽

2021 ◽

Author(s):

Aida Mehdipour Pirbazari

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Energy Supply ◽

Predictive Analytics ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

System Stability ◽

Energy Forecasting ◽

The Stability ◽

Energy Prosumers

Digitalization and decentralization of energy supply have introduced several challenges to emerging power grids known as smart grids. One of the significant challenges, on the demand side, is preserving the stability of the power systems due to locally distributed energy sources such as micro-power generation and storage units among energy prosumers at the household and community levels. In this context, energy prosumers are defined as energy consumers who also generate, store and trade energy. Accurate predictions of energy supply and electric demand of prosuemrs can address the stability issues at local levels. This study aims to develop appropriate forecasting frameworks for such environments to preserve power stability. Building on existing work on energy forecasting at low-aggregated levels, it asks: What factors influence most on consumption and generation patterns of residential customers as energy prosumers. It also investigates how the accuracy of forecasting models at the household and community levels can be improved. Based on a review of the literature on energy forecasting and per- forming empirical study on real datasets, the forecasting frameworks were developed focusing on short-term prediction horizons. These frameworks are built upon predictive analytics including data col- lection, data analysis, data preprocessing, and predictive machine learning algorithms based on statistical learning, artificial neural networks and deep learning. Analysis of experimental results demonstrated that load observa- tions from previous hours (lagged loads) along with air temperature and time variables highly affects the households’ consumption and generation behaviour. The results also indicate that the prediction accuracy of adopted machine learning techniques can be improved by feeding them with highly influential variables and appliance-level data as well as by combining multiple learning algorithms ranging from conventional to deep neural networks. Further research is needed to investigate online approaches that could strengthen the effectiveness of forecasting in time-sensitive energy environments.

Download Full-text

Feature extraction and prediction of Dengue Outbreaks

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit206544 ◽

2020 ◽

pp. 216-222

Author(s):

Kunal Parikh ◽

Tanvi Makadia ◽

Harshil Patel

Keyword(s):

Public Health ◽

Machine Learning ◽

Developing Countries ◽

Feature Extraction ◽

Predictive Analytics ◽

Learning Algorithm ◽

Machine Learning Algorithm ◽

Health Concerns ◽

The World ◽

Dengue Outbreaks

Dengue is unquestionably one of the biggest health concerns in India and for many other developing countries. Unfortunately, many people have lost their lives because of it. Every year, approximately 390 million dengue infections occur around the world among which 500,000 people are seriously infected and 25,000 people have died annually. Many factors could cause dengue such as temperature, humidity, precipitation, inadequate public health, and many others. In this paper, we are proposing a method to perform predictive analytics on dengue’s dataset using KNN: a machine-learning algorithm. This analysis would help in the prediction of future cases and we could save the lives of many.

Download Full-text

Creating adaptive predictions for packaging-critical quality parameters using advanced analytics and machine learning

TAPPI Journal ◽

10.32964/tj18.11.679 ◽

2019 ◽

Vol 18 (11) ◽

pp. 679-689

Author(s):

CYDNEY RECHTIN ◽

CHITTA RANJAN ◽

ANTHONY LEWIS ◽

BETH ANN ZARKO

Keyword(s):

Machine Learning ◽

Real Time ◽

Predictive Analytics ◽

Paper Industry ◽

Quality Parameters ◽

Fast Change ◽

The Face ◽

Model Tuning ◽

Reducing Costs ◽

Time Critical

Packaging manufacturers are challenged to achieve consistent strength targets and maximize production while reducing costs through smarter fiber utilization, chemical optimization, energy reduction, and more. With innovative instrumentation readily accessible, mills are collecting vast amounts of data that provide them with ever increasing visibility into their processes. Turning this visibility into actionable insight is key to successfully exceeding customer expectations and reducing costs. Predictive analytics supported by machine learning can provide real-time quality measures that remain robust and accurate in the face of changing machine conditions. These adaptive quality “soft sensors” allow for more informed, on-the-fly process changes; fast change detection; and process control optimization without requiring periodic model tuning. The use of predictive modeling in the paper industry has increased in recent years; however, little attention has been given to packaging finished quality. The use of machine learning to maintain prediction relevancy under everchanging machine conditions is novel. In this paper, we demonstrate the process of establishing real-time, adaptive quality predictions in an industry focused on reel-to-reel quality control, and we discuss the value created through the availability and use of real-time critical quality.

Download Full-text

MACHINE LEARNING FOR SOFTWARE

International Conference on Technics Technologies and Education ◽

10.15547/ictte.2019.04.061 ◽

2019 ◽

pp. 251-255

Author(s):

Shafagat Mahmudova

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Soft Computing ◽

Computing Technology

The study machine learning for software based on Soft Computing technology. It analyzes Soft Computing components. Their use in software, their advantages and challenges are studied. Machine learning and its features are highlighted. The functions and features of neural networks are clarified, and recommendations were given.

Download Full-text

Nanosecond Photodynamics Simulations of a Cis-Trans Isomerization Are Enabled by Machine Learning

10.26434/chemrxiv.13047863 ◽

2020 ◽

Author(s):

Jingbai Li ◽

Patrick Reiser ◽

André Eberhard ◽

Pascal Friederich ◽

Steven Lopez

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Excited State ◽

Adaptive Sampling ◽

Computational Cost ◽

Ground Truth ◽

Absolute Error ◽

Photochemical Reactions ◽

Computational Techniques ◽

Full Potential

Photochemical reactions are being increasingly used to construct complex molecular architectures with mild and straightforward reaction conditions. Computational techniques are increasingly important to understand the reactivities and chemoselectivities of photochemical isomerization reactions because they offer molecular bonding information along the excited-state(s) of photodynamics. These photodynamics simulations are resource-intensive and are typically limited to 1–10 picoseconds and 1,000 trajectories due to high computational cost. Most organic photochemical reactions have excited-state lifetimes exceeding 1 picosecond, which places them outside possible computational studies. Westermeyr et al. demonstrated that a machine learning approach could significantly lengthen photodynamics simulation times for a model system, methylenimmonium cation (CH2NH2+).We have developed a Python-based code, Python Rapid Artificial Intelligence Ab Initio Molecular Dynamics (PyRAI2MD), to accomplish the unprecedented 10 ns cis-trans photodynamics of trans-hexafluoro-2-butene (CF3–CH=CH–CF3) in 3.5 days. The same simulation would take approximately 58 years with ground-truth multiconfigurational dynamics. We proposed an innovative scheme combining Wigner sampling, geometrical interpolations, and short-time quantum chemical trajectories to effectively sample the initial data, facilitating the adaptive sampling to generate an informative and data-efficient training set with 6,232 data points. Our neural networks achieved chemical accuracy (mean absolute error of 0.032 eV). Our 4,814 trajectories reproduced the S1 half-life (60.5 fs), the photochemical product ratio (trans: cis = 2.3: 1), and autonomously discovered a pathway towards a carbene. The neural networks have also shown the capability of generalizing the full potential energy surface with chemically incomplete data (trans → cis but not cis → trans pathways) that may offer future automated photochemical reaction discoveries.

Download Full-text

Predictive Analytics of Performance of India in the Olympics using Machine Learning Algorithms

International Journal of Emerging Trends in Engineering Research ◽

10.30534/ijeter/2020/57852020 ◽

2020 ◽

Vol 8 (5) ◽

pp. 1829-1833

Author(s):

Varagiri Shailaja

Keyword(s):

Machine Learning ◽

Predictive Analytics ◽

Learning Algorithms ◽

Machine Learning Algorithms

Download Full-text

Big data Predictive Analytics for Apache Spark using Machine Learning

2020 Global Conference on Wireless and Optical Technologies (GCWOT) ◽

10.1109/gcwot49901.2020.9391620 ◽

2020 ◽

Author(s):

Muhammad Junaid ◽

Shiraz Ali Wagan ◽

Nawab Muhammad Faseeh Qureshi ◽

Choon Sung Nam ◽

Dong Ryeol Shin

Keyword(s):

Machine Learning ◽

Big Data ◽

Predictive Analytics ◽

Apache Spark

Download Full-text

Discretization and machine learning approximation of BSDEs with a constraint on the Gains-process

Monte Carlo Methods and Applications ◽

10.1515/mcma-2020-2080 ◽

2021 ◽

Vol 0 (0) ◽

Author(s):

Idris Kharroubi ◽

Thomas Lim ◽

Xavier Warin

Keyword(s):

Neural Network ◽

Machine Learning ◽

Neural Networks ◽

Differential Equations ◽

Numerical Experiments ◽

Optimization Problem ◽

Learning Approach ◽

The Neural Network ◽

Machine Learning Approach ◽

Mesh Grid

AbstractWe study the approximation of backward stochastic differential equations (BSDEs for short) with a constraint on the gains process. We first discretize the constraint by applying a so-called facelift operator at times of a grid. We show that this discretely constrained BSDE converges to the continuously constrained one as the mesh grid converges to zero. We then focus on the approximation of the discretely constrained BSDE. For that we adopt a machine learning approach. We show that the facelift can be approximated by an optimization problem over a class of neural networks under constraints on the neural network and its derivative. We then derive an algorithm converging to the discretely constrained BSDE as the number of neurons goes to infinity. We end by numerical experiments.

Download Full-text

Feasibility of Using Floor Vibration to Detect Human Falls

International Journal of Environmental Research and Public Health ◽

10.3390/ijerph18010200 ◽

2020 ◽

Vol 18 (1) ◽

pp. 200

Author(s):

Yu Shao ◽

Xinyue Wang ◽

Wenjie Song ◽

Sobia Ilyas ◽

Haibo Guo ◽

...

Keyword(s):

Machine Learning ◽

Human Body ◽

Nearest Neighbor ◽

Body Movement ◽

Modern Society ◽

Recognition System ◽

Machine Learning Algorithms ◽

K Nearest Neighbor ◽

Study Results ◽

Floor Vibration

With the increasing aging population in modern society, falls as well as fall-induced injuries in elderly people become one of the major public health problems. This study proposes a classification framework that uses floor vibrations to detect fall events as well as distinguish different fall postures. A scaled 3D-printed model with twelve fully adjustable joints that can simulate human body movement was built to generate human fall data. The mass proportion of a human body takes was carefully studied and was reflected in the model. Object drops, human falling tests were carried out and the vibration signature generated in the floor was recorded for analyses. Machine learning algorithms including K-means algorithm and K nearest neighbor algorithm were introduced in the classification process. Three classifiers (human walking versus human fall, human fall versus object drop, human falls from different postures) were developed in this study. Results showed that the three proposed classifiers can achieve the accuracy of 100, 85, and 91%. This paper developed a framework of using floor vibration to build the pattern recognition system in detecting human falls based on a machine learning approach.

Download Full-text

Machine learning predictive analytics for player movement prediction in NBA

Proceedings of the 2021 ACM Southeast Conference ◽

10.1145/3409334.3452064 ◽

2021 ◽

Author(s):

Dembe Koi Stephanos ◽

Ghaith Husari ◽

Brian T. Bennett ◽

Emma Stephanos

Keyword(s):

Machine Learning ◽

Predictive Analytics ◽

Movement Prediction

Download Full-text