Optical Data Analysis: Decoding Optical Data with Machine Learning (Laser Photonics Rev. 15(2)/2021)

Background: With the advent of data analysis and machine learning, there is a growing impetus of analyzing and generating models on historic data. The data comes in numerous forms and shapes with an abundance of challenges. The most sorted form of data for analysis is the numerical data. With the plethora of algorithms and tools it is quite manageable to deal with such data. Another form of data is of categorical nature, which is subdivided into, ordinal (order wise) and nominal (number wise). This data can be broadly classified as Sequential and Non-Sequential. Sequential data analysis is easier to preprocess using algorithms. Objective: The challenge of applying machine learning algorithms on categorical data of nonsequential nature is dealt in this paper. Methods: Upon implementing several data analysis algorithms on such data, we end up getting a biased result, which makes it impossible to generate a reliable predictive model. In this paper, we will address this problem by walking through a handful of techniques which during our research helped us in dealing with a large categorical data of non-sequential nature. In subsequent sections, we will discuss the possible implementable solutions and shortfalls of these techniques. Results: The methods are applied to sample datasets available in public domain and the results with respect to accuracy of classification are satisfactory. Conclusion: The best pre-processing technique we observed in our research is one hot encoding, which facilitates breaking down the categorical features into binary and feeding it into an Algorithm to predict the outcome. The example that we took is not abstract but it is a real – time production services dataset, which had many complex variations of categorical features. Our Future work includes creating a robust model on such data and deploying it into industry standard applications.

Download Full-text

Measuring Engagement Level in Child-Robot Interaction Using Machine Learning Based Data Analysis

2020 International Conference on Data Analytics for Business and Industry: Way Towards a Sustainable Economy (ICDABI) ◽

10.1109/icdabi51230.2020.9325676 ◽

2020 ◽

Author(s):

George K. Sidiropoulos ◽

George A. Papakostas ◽

Chris Lytridis ◽

Christos Bazinas ◽

Vassilis G. Kaburlasos ◽

...

Keyword(s):

Machine Learning ◽

Data Analysis ◽

Robot Interaction

Download Full-text

Machine learning-assisted production data analysis in liquid-rich Duvernay Formation

Journal of Petroleum Science and Engineering ◽

10.1016/j.petrol.2021.108377 ◽

2021 ◽

Vol 200 ◽

pp. 108377

Author(s):

Bing Kong ◽

Zhuoheng Chen ◽

Shengnan Chen ◽

Tianjie Qin

Keyword(s):

Machine Learning ◽

Data Analysis ◽

Production Data

Download Full-text

Field-Scale Soil Moisture Retrieval Using PALSAR-2 Polarimetric Decomposition and Machine Learning

Agronomy ◽

10.3390/agronomy11010035 ◽

2020 ◽

Vol 11 (1) ◽

pp. 35

Author(s):

Xiaodong Huang ◽

Beth Ziniti ◽

Michael H. Cosh ◽

Michele Reba ◽

Jinfei Wang ◽

...

Keyword(s):

Machine Learning ◽

Soil Moisture ◽

Canopy Cover ◽

The United States ◽

Optical Data ◽

Study Region ◽

Model Based ◽

Key Indicator ◽

Ground Measurement ◽

L Band

Soil moisture is a key indicator to assess cropland drought and irrigation status as well as forecast production. Compared with the optical data which are obscured by the crop canopy cover, the Synthetic Aperture Radar (SAR) is an efficient tool to detect the surface soil moisture under the vegetation cover due to its strong penetration capability. This paper studies the soil moisture retrieval using the L-band polarimetric Phased Array-type L-band SAR 2 (PALSAR-2) data acquired over the study region in Arkansas in the United States. Both two-component model-based decomposition (SAR data alone) and machine learning (SAR + optical indices) methods are tested and compared in this paper. Validation using independent ground measurement shows that the both methods achieved a Root Mean Square Error (RMSE) of less than 10 (vol.%), while the machine learning methods outperform the model-based decomposition, achieving an RMSE of 7.70 (vol.%) and R2 of 0.60.

Download Full-text

Comparison of Capability of SAR and Optical Data in Mapping Forest above Ground Biomass Based on Machine Learning

Environmental Sciences Proceedings ◽

10.3390/iecg2020-07916 ◽

2020 ◽

Vol 5 (1) ◽

pp. 13

Author(s):

Negar Tavasoli ◽

Hossein Arefi

Keyword(s):

Machine Learning ◽

Carbon Stock ◽

Vegetation Indices ◽

Forest Biomass ◽

Principal Component ◽

Optical Data ◽

Above Ground Biomass ◽

Ground Biomass ◽

Texture Characteristics ◽

Sentinel 2

Assessment of forest above ground biomass (AGB) is critical for managing forest and understanding the role of forest as source of carbon fluxes. Recently, satellite remote sensing products offer the chance to map forest biomass and carbon stock. The present study focuses on comparing the potential use of combination of ALOSPALSAR and Sentinel-1 SAR data, with Sentinel-2 optical data to estimate above ground biomass and carbon stock using Genetic-Random forest machine learning (GA-RF) algorithm. Polarimetric decompositions, texture characteristics and backscatter coefficients of ALOSPALSAR and Sentinel-1, and vegetation indices, tasseled cap, texture parameters and principal component analysis (PCA) of Sentinel-2 based on measured AGB samples were used to estimate biomass. The overall coefficient (R2) of AGB modelling using combination of ALOSPALSAR and Sentinel-1 data, and Sentinel-2 data were respectively 0.70 and 0.62. The result showed that Combining ALOSPALSAR and Sentinel-1 data to predict AGB by using GA-RF model performed better than Sentinel-2 data.

Download Full-text

Machine learning based real-time vehicle data analysis for safe driving modeling

Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing - SAC '19 ◽

10.1145/3297280.3297584 ◽

2019 ◽

Cited By ~ 2

Author(s):

Pamul Yadav ◽

Sangsu Jung ◽

Dhananjay Singh

Keyword(s):

Machine Learning ◽

Data Analysis ◽

Real Time ◽

Safe Driving ◽

Vehicle Data

Download Full-text

Topologic Data Analysis and Machine Learning

JACC Cardiovascular Imaging ◽

10.1016/j.jcmg.2021.04.005 ◽

2021 ◽

Author(s):

Rebecca T. Hahn

Keyword(s):

Machine Learning ◽

Data Analysis

Download Full-text

Passenger data analysis of Titanic using machine learning approach in the context of chances of surviving the disaster

IOP Conference Series Materials Science and Engineering ◽

10.1088/1757-899x/1065/1/012042 ◽

2021 ◽

Vol 1065 (1) ◽

pp. 012042

Author(s):

Md Arfinul Haque ◽

G Shivaprasad ◽

G Guruprasad

Keyword(s):

Machine Learning ◽

Data Analysis ◽

Learning Approach ◽

Machine Learning Approach

Download Full-text

Classification of apatite structures via topological data analysis: a framework for a ‘Materials Barcode’ representation of structure maps

Scientific Reports ◽

10.1038/s41598-021-90070-4 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Scott Broderick ◽

Ruhil Dongol ◽

Tianmu Zhang ◽

Krishna Rajan

Keyword(s):

Machine Learning ◽

Data Analysis ◽

Crystal Chemistry ◽

Persistent Homology ◽

Hierarchical Classification ◽

Topological Data Analysis ◽

Learning Tool ◽

Coordination Polyhedra ◽

Machine Learning Tool ◽

Topological Data

AbstractThis paper introduces the use of topological data analysis (TDA) as an unsupervised machine learning tool to uncover classification criteria in complex inorganic crystal chemistries. Using the apatite chemistry as a template, we track through the use of persistent homology the topological connectivity of input crystal chemistry descriptors on defining similarity between different stoichiometries of apatites. It is shown that TDA automatically identifies a hierarchical classification scheme within apatites based on the commonality of the number of discrete coordination polyhedra that constitute the structural building units common among the compounds. This information is presented in the form of a visualization scheme of a barcode of homology classifications, where the persistence of similarity between compounds is tracked. Unlike traditional perspectives of structure maps, this new “Materials Barcode” schema serves as an automated exploratory machine learning tool that can uncover structural associations from crystal chemistry databases, as well as to achieve a more nuanced insight into what defines similarity among homologous compounds.

Download Full-text