Road Accidents Analysis Using Comparative Study & Application of Machine Learning Algorithms

Traffic collisions are one of the major sources of deaths, injuries & property damage every year. Road accidents are one of the most difficult real-world problems to tackle because of their high degree of unpredictability. The persistence & extent of the problem differ from place to place, and its consequences include loss of human life & capital. To avoid this, every place needs to tackle the problem with a customized approach that depends on the causes responsible for its accidents. Even in today's world, where the mass operation of autonomous vehicles remains a distant prospect, predicting a road accident before it takes place is practically impossible. The only approach that can help to decrease the number of road accidents is to analyze the reasons that lead to them. The concepts of Data Analysis, Data Visualization & Machine Learning help to tackle real-world problems by exploring data & deriving valuable insights, which in turn help in taking measures to solve the targeted problem & drive business growth. In this research study, the dataset pertaining to road mishaps that occurred in the UK over the period 2005 - 2015 is analyzed using these concepts. The defined approach can help the concerned authorities & the respective government to take every possible step & amendment, & hence mitigate the identified causes & scenarios that lead to road accidents.


Empirical Comparison of Various Discretization Procedures

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001498000567 ◽

1998 ◽

Vol 12 (07) ◽

pp. 1017-1032 ◽

Cited By ~ 10

Author(s):

Petr Berka ◽

Ivan Bruha

Keyword(s):

Machine Learning ◽

Real World ◽

Learning Algorithm ◽

Machine Learning Algorithms ◽

The Other ◽

Machine Learning Algorithm ◽

Empirical Comparison ◽

Numerical Attributes ◽

Real World Problems ◽

Discretization Procedure

Genuinely symbolic machine learning (ML) algorithms are capable of processing only symbolic, categorical data. However, real-world problems, e.g. in medicine or finance, involve both symbolic and numerical attributes. Discretizing (categorizing) numerical attributes is therefore an important issue in ML. Quite a few discretization procedures exist in the ML field. This paper describes two newer algorithms for categorization (discretization) of numerical attributes. The first one is implemented in KEX (Knowledge EXplorer) as its preprocessing procedure. Its idea is to discretize the numerical attributes in such a way that the resulting categorization corresponds to the KEX knowledge acquisition algorithm. Since the categorization for KEX is done "off-line" before using the KEX machine learning algorithm, it can be used as a preprocessing step for other machine learning algorithms, too. The other discretization procedure is implemented in CN4, a large extension of the well-known CN2 machine learning algorithm. The range of numerical attributes is divided into intervals that may form a complex generated by the algorithm as a part of the class description. Experimental results show a comparison of performance of KEX and CN4 on some well-known ML databases. To make the comparison more exhibitory, we also used the discretization procedure of the MLC++ library. Other ML algorithms such as ID3 and C4.5 were run under our experiments, too. Then, the results are compared and discussed.
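
To make the idea of discretizing a numerical attribute concrete, here is a minimal Python sketch of two generic baseline schemes, equal-width and equal-frequency binning. These are not the KEX or CN4 procedures described in the abstract, which are not specified here; the function names and the `ages` data are made up for illustration.

```python
from bisect import bisect_right


def equal_width_cuts(values, n_bins):
    """Return n_bins - 1 cut points splitting [min, max] into equal-width intervals."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(1, n_bins)]


def equal_frequency_cuts(values, n_bins):
    """Return cut points so that each interval holds roughly the same number of values."""
    ordered = sorted(values)
    n = len(ordered)
    return [ordered[(i * n) // n_bins] for i in range(1, n_bins)]


def discretize(value, cuts):
    """Map a numeric value to the index of the interval it falls into."""
    return bisect_right(cuts, value)


# Hypothetical numerical attribute (illustrative values only).
ages = [23, 25, 31, 35, 42, 47, 51, 58, 64, 90]

width_cuts = equal_width_cuts(ages, 3)
freq_cuts = equal_frequency_cuts(ages, 3)

width_bins = [discretize(a, width_cuts) for a in ages]
freq_bins = [discretize(a, freq_cuts) for a in ages]

print(width_bins)  # the outlier 90 ends up alone in the last interval
print(freq_bins)   # intervals hold 3, 3 and 4 values
```

Both schemes ignore the class label. Supervised procedures, such as those the paper compares, choose cut points with the target class in mind, which is why an empirical comparison is needed.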


Predicting Future Occurrence of Acute Hypotensive Episodes Using Noninvasive and Invasive Features

Military Medicine ◽

10.1093/milmed/usaa418 ◽

2021 ◽

Vol 186 (Supplement_1) ◽

pp. 445-451

Author(s):

Yifei Sun ◽

Navid Rashedi ◽

Vikrant Vaze ◽

Parikshit Shah ◽

Ryan Halter ◽

...

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Real World ◽

Short Term Memory ◽

Model Performance ◽

Learning Technologies ◽

Machine Learning Algorithms ◽

Support Vector ◽

K Nearest Neighbor ◽

Continuous Map

Introduction: Early prediction of the acute hypotensive episode (AHE) in critically ill patients has the potential to improve outcomes. In this study, we apply different machine learning algorithms to the MIMIC III Physionet dataset, containing more than 60,000 real-world intensive care unit records, to test commonly used machine learning technologies and compare their performances. Materials and Methods: Five classification methods including K-nearest neighbor, logistic regression, support vector machine, random forest, and a deep learning method called long short-term memory are applied to predict an AHE 30 minutes in advance. An analysis comparing model performance when including versus excluding invasive features was conducted. To further study the pattern of the underlying mean arterial pressure (MAP), we apply a regression method to predict the continuous MAP values using linear regression over the next 60 minutes. Results: Support vector machine yields the best performance in terms of recall (84%). Including the invasive features in the classification improves the performance significantly with both recall and precision increasing by more than 20 percentage points. We were able to predict the MAP with a root mean square error (a frequently used measure of the differences between the predicted values and the observed values) of 10 mmHg 60 minutes in the future. After converting continuous MAP predictions into AHE binary predictions, we achieve a 91% recall and 68% precision. In addition to predicting AHE, the MAP predictions provide clinically useful information regarding the timing and severity of the AHE occurrence. Conclusion: We were able to predict AHE with precision and recall above 80% 30 minutes in advance with the large real-world dataset. The predictions of the regression model can provide a more fine-grained, interpretable signal to practitioners. Model performance is improved by the inclusion of invasive features in predicting AHE, when compared to predicting the AHE based on only the available, restricted set of noninvasive technologies. This demonstrates the importance of exploring more noninvasive technologies for AHE prediction.
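
The metrics named in the abstract above (root mean square error, precision, recall) and the step of turning continuous MAP predictions into binary AHE predictions can be sketched in Python as follows. This is a rough illustration only: the 60 mmHg cut-off and all MAP values are assumptions made up for the example, not the study's AHE definition or data.

```python
from math import sqrt


def rmse(predicted, observed):
    """Root mean square error between predicted and observed values."""
    squared = [(p - o) ** 2 for p, o in zip(predicted, observed)]
    return sqrt(sum(squared) / len(squared))


def precision_recall(predicted_labels, true_labels):
    """Precision and recall for binary labels (True = positive class)."""
    pairs = list(zip(predicted_labels, true_labels))
    tp = sum(1 for p, t in pairs if p and t)
    fp = sum(1 for p, t in pairs if p and not t)
    fn = sum(1 for p, t in pairs if not p and t)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical cut-off, assumed purely for illustration.
AHE_THRESHOLD_MMHG = 60.0

# Made-up observed and predicted MAP values in mmHg.
observed_map = [72.0, 58.0, 65.0, 55.0, 80.0, 59.0]
predicted_map = [70.0, 61.0, 57.0, 54.0, 78.0, 56.0]

error = rmse(predicted_map, observed_map)

# Convert continuous MAP values into binary "episode" labels.
true_ahe = [m < AHE_THRESHOLD_MMHG for m in observed_map]
pred_ahe = [m < AHE_THRESHOLD_MMHG for m in predicted_map]

precision, recall = precision_recall(pred_ahe, true_ahe)
print(f"RMSE = {error:.2f} mmHg, precision = {precision:.2f}, recall = {recall:.2f}")
```

Recall measures how many true episodes are caught, while precision measures how many alarms are real, so a regression model with a modest RMSE can still trade one against the other depending on where the cut-off sits.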


Systematic literature review of machine learning methods used in the analysis of real-world data for patient-provider decision making

BMC Medical Informatics and Decision Making ◽

10.1186/s12911-021-01403-2 ◽

2021 ◽

Vol 21 (1) ◽

Author(s):

Alan Brnabic ◽

Lisa M. Hess

Keyword(s):

Machine Learning ◽

Decision Making ◽

Literature Review ◽

Systematic Literature Review ◽

Real World ◽

Learning Algorithms ◽

External Validation ◽

Machine Learning Algorithms ◽

Learning Methods ◽

Machine Learning Methods

Background: Machine learning is a broad term encompassing a number of methods that allow the investigator to learn from the data. These methods may permit large real-world databases to be more rapidly translated to applications to inform patient-provider decision making. Methods: This systematic literature review was conducted to identify published observational research that employed machine learning to inform decision making at the patient-provider level. The search strategy was implemented and studies meeting eligibility criteria were evaluated by two independent reviewers. Relevant data related to study design, statistical methods and strengths and limitations were identified; study quality was assessed using a modified version of the Luo checklist. Results: A total of 34 publications from January 2014 to September 2020 were identified and evaluated for this review. There were diverse methods, statistical packages and approaches used across identified studies. The most common methods included decision tree and random forest approaches. Most studies applied internal validation but only two conducted external validation. Most studies utilized one algorithm, and only eight studies applied multiple machine learning algorithms to the data. Seven items on the Luo checklist failed to be met by more than 50% of published studies. Conclusions: A wide variety of approaches, algorithms, statistical software, and validation strategies were employed in the application of machine learning methods to inform patient-provider decision making. There is a need to ensure that multiple machine learning approaches are used and that the model selection strategy is clearly defined, and both internal and external validation are necessary to be sure that decisions for patient care are being made with the highest quality evidence. Future work should routinely employ ensemble methods incorporating multiple machine learning algorithms.


Machine Learning Approaches to Traffic Accident Analysis and Hotspot Prediction

Computers ◽

10.3390/computers10120157 ◽

2021 ◽

Vol 10 (12) ◽

pp. 157

Author(s):

Daniel Santos ◽

José Saias ◽

Paulo Quaresma ◽

Vítor Beires Nogueira

Keyword(s):

Machine Learning ◽

Predictive Model ◽

Road Accident ◽

Influential Factors ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Economic Losses ◽

Learning Approaches ◽

Road Accidents ◽

Accident Data

Traffic accidents are one of the most important concerns of the world, since they result in numerous casualties, injuries, and fatalities each year, as well as significant economic losses. There are many factors that are responsible for causing road accidents. If these factors can be better understood and predicted, it might be possible to take measures to mitigate the damages and their severity. The purpose of this work is to identify these factors using accident data from 2016 to 2019 from the district of Setúbal, Portugal. This work aims at developing models that can select a set of influential factors that may be used to classify the severity of an accident, supporting an analysis of the accident data. In addition, this study also proposes a predictive model for future road accidents based on past data. Various machine learning approaches are used to create these models. Supervised machine learning methods such as decision trees (DT), random forests (RF), logistic regression (LR), and naive Bayes (NB) are used, as well as unsupervised machine learning techniques including DBSCAN and hierarchical clustering. Results show that a rule-based model using the C5.0 algorithm is capable of accurately detecting the most relevant factors describing a road accident's severity. Further, the results of the predictive model suggest that the RF model could be a useful tool for forecasting accident hotspots.


Internet of Things-Based Intelligent Smart Home Control System

Security and Communication Networks ◽

10.1155/2021/9928254 ◽

2021 ◽

Vol 2021 ◽

pp. 1-17

Author(s):

Olutosin Taiwo ◽

Absalom E. Ezugwu

Keyword(s):

Machine Learning ◽

Mobile Application ◽

Smart Home ◽

Human Life ◽

Machine Learning Algorithms ◽

Support Vector ◽

Automation System ◽

Home Automation ◽

Area Of Interest ◽

Home Automation System

The smart home is now an established area of interest and research that contributes to comfort in modern homes. With the Internet being an essential part of broad communication in modern life, IoT has allowed homes to go beyond buildings to interactive abodes. In many spheres of human life, the IoT has grown exponentially, including monitoring ecological factors, controlling the home and its appliances, and storing data generated by devices in the house in the cloud. A smart home includes multiple components, technologies, and devices that generate valuable data for predicting home and environment activities. This work presents the design and development of a ubiquitous, cloud-based intelligent home automation system. The system controls, monitors, and oversees the security of a home and its environment via an Android mobile application. One module controls and monitors electrical appliances and environmental factors, while another module oversees the home’s security by detecting motion and capturing images. Our work uses a camera to capture images of objects triggered by their motion being detected. To avoid false alarms, we used the concept of machine learning to differentiate between images of regular home occupants and those of an intruder. The support vector machine algorithm is proposed in this study to classify the features of the image captured and determine if it is that of a regular home occupant or an intruder before sending an alarm to the user. The design of the mobile application allows a graphical display of the activities in the house. Our work proves that machine learning algorithms can improve home automation system functionality and enhance home security. The work’s prototype was implemented using an ESP8266 board, an ESP32-CAM board, a 5 V four-channel relay module, and sensors.


Machine Learning for Landslides Prevention: A Survey

10.36227/techrxiv.12546098 ◽

2020 ◽

Author(s):

Zhengjing Ma ◽

Gang Mei

Keyword(s):

Machine Learning ◽

State Of The Art ◽

Human Life ◽

Machine Learning Algorithms ◽

Warning Systems ◽

Negative Effects ◽

Relevant Research ◽

Related Information ◽

Urgent Task ◽

Comprehensive Survey

Landslides are one of the most critical categories of natural disasters worldwide and induce severely destructive outcomes to human life and the overall economic system. To reduce their negative effects, landslides prevention has become an urgent task, which includes investigating landslide-related information and predicting potential landslides. Machine learning is a state-of-the-art analytics tool that has been widely used in landslides prevention. This paper presents a comprehensive survey of relevant research on machine learning applied in landslides prevention, mainly focusing on (1) landslides detection based on images, (2) landslides susceptibility assessment, and (3) the development of landslide warning systems. Moreover, this paper discusses the current challenges and potential opportunities in the application of machine learning algorithms for landslides prevention.


Efficient inference for agent-based models of real-world phenomena

10.1101/2021.10.04.462980 ◽

2021 ◽

Author(s):

Andreas Christ Sølvsten Jørgensen ◽

Atiyo Ghosh ◽

Marc Sturrock ◽

Vahid Shahrezaei

Keyword(s):

Machine Learning ◽

Case Studies ◽

Parameter Space ◽

Real World ◽

Autonomous Agents ◽

Stochastic Simulations ◽

Model Parameters ◽

Learning Approaches ◽

Real World Applications ◽

Real World Problems

Abstract: The modelling of many real-world problems relies on computationally heavy simulations. Since statistical inference rests on repeated simulations to sample the parameter space, the high computational expense of these simulations can become a stumbling block. In this paper, we compare two ways to mitigate this issue based on machine learning methods. One approach is to construct lightweight surrogate models to substitute the simulations used in inference. Alternatively, one might altogether circumnavigate the need for Bayesian sampling schemes and directly estimate the posterior distribution. We focus on stochastic simulations that track autonomous agents and present two case studies of real-world applications: tumour growths and the spread of infectious diseases. We demonstrate that good accuracy in inference can be achieved with a relatively small number of simulations, making our machine learning approaches orders of magnitude faster than classical simulation-based methods that rely on sampling the parameter space. However, we find that while some methods generally produce more robust results than others, no algorithm offers a one-size-fits-all solution when attempting to infer model parameters from observations. Instead, one must choose the inference technique with the specific real-world application in mind. The stochastic nature of the considered real-world phenomena poses an additional challenge that can become insurmountable for some approaches. Overall, we find machine learning approaches that create direct inference machines to be promising for real-world applications. We present our findings as general guidelines for modelling practitioners.

Author summary: Computer simulations play a vital role in modern science as they are commonly used to compare theory with observations. One can thus infer the properties of an observed system by comparing the data to the predicted behaviour in different scenarios. Each of these scenarios corresponds to a simulation with slightly different settings. However, since real-world problems are highly complex, the simulations often require extensive computational resources, making direct comparisons with data challenging, if not insurmountable. It is, therefore, necessary to resort to inference methods that mitigate this issue, but it is not clear-cut what path to choose for any specific research problem. In this paper, we provide general guidelines for how to make this choice. We do so by studying examples from oncology and epidemiology and by taking advantage of developments in machine learning. More specifically, we focus on simulations that track the behaviour of autonomous agents, such as single cells or individuals. We show that the best way forward is problem-dependent and highlight the methods that yield the most robust results across the different case studies. We demonstrate that these methods are highly promising and produce reliable results in a small fraction of the time required by classic approaches that rely on comparisons between data and individual simulations. Rather than relying on a single inference technique, we recommend employing several methods and selecting the most reliable based on predetermined criteria.


Detection and defense of cyberattacks on the machine learning control of robotic systems

The Journal of Defense Modeling and Simulation Applications Methodology Technology ◽

10.1177/15485129211043874 ◽

2021 ◽

pp. 154851292110438

Author(s):

George W Clark ◽

Todd R Andel ◽

J Todd McDonald ◽

Tom Johnsten ◽

Tom Thomas

Keyword(s):

Machine Learning ◽

Autonomous Vehicles ◽

Defense Mechanisms ◽

Autonomous Vehicle ◽

Learning Algorithms ◽

Learning Model ◽

Machine Learning Algorithms ◽

Robotic Systems ◽

Machine Learning Model ◽

Attack Surface

Robotic systems are no longer simply built and designed to perform sequential repetitive tasks primarily in a static manufacturing environment. Systems such as autonomous vehicles make use of intricate machine learning algorithms to adapt their behavior to dynamic conditions in their operating environment. These machine learning algorithms provide an additional attack surface for an adversary to exploit in order to perform a cyberattack. Since an attack on robotic systems such as autonomous vehicles has the potential to cause great damage and harm to humans, it is essential that detection of and defenses against these attacks be explored. This paper discusses the plausibility of direct and indirect cyberattacks on a machine learning model through the use of a virtual autonomous vehicle operating in a simulation environment using a machine learning model for control. Using this vehicle, this paper proposes various methods of detection of cyberattacks on its machine learning model and discusses possible defense mechanisms to prevent such attacks.


Machine learning in road accident research: decision trees describing road accidents during cross-flow turns

Ergonomics ◽

10.1080/001401398186603 ◽

1998 ◽

Vol 41 (7) ◽

pp. 1060-1079 ◽

Cited By ~ 12

Author(s):

DAVID D. CLARKE ◽

RICHARD FORSYTH ◽

RICHARD WRIGHT

Keyword(s):

Machine Learning ◽

Decision Trees ◽

Cross Flow ◽

Road Accident ◽

Road Accidents ◽

Accident Research ◽

Research Decision


The Potential of Machine Learning Algorithms in Discriminating Chronic Obstructive Pulmonary Disease and Healthy Saliva Samples

Disease and Diagnosis ◽

10.34172/ddj.2021.28 ◽

2021 ◽

Vol 10 (4) ◽

pp. 155-163

Author(s):

Atefeh Goshvarpour ◽

Ateke Goshvarpour

Keyword(s):

Machine Learning ◽

Chronic Obstructive Pulmonary Disease ◽

Pulmonary Disease ◽

Intelligent System ◽

Smoking Status ◽

Human Life ◽

Machine Learning Algorithms ◽

Chronic Obstructive ◽

Obstructive Pulmonary Disease ◽

Age Range

Background: Today, with the spread of tobacco use and increased environmental pollutions, respiratory diseases are considered important factors threatening human life. Chronic obstructive pulmonary disease (COPD) is a kind of inflammatory lung disease. Clinically, COPD is currently diagnosed and monitored by spirometry as the gold-standard technique although spirometry systems encounter some limitations. Thanks to the economical handling and sampling, practicality, and non-invasiveness of saliva biomarkers, it is promising for the testing environment. Accordingly, the current analytic observational study aimed to propose an intelligent system for COPD detection. Materials and Methods: To this end, 40 COPD (8 females and 32 males in the age range of 71.67±8.27 years) and 40 controls (17 females and 23 males within the age range of 38.23±14.05 years) were considered in this study. The samples were characterized by absolute minimum value and the average value of the real and imaginary parts of saliva permittivity. Additionally, the age, gender, and smoking status of the participants were determined, and then the performance of various classifiers was evaluated by adjusting k in k-fold cross-validation (CV) and classifier parameterization. Results: The results showed that the k-nearest neighbor outperformed other classifiers. Using both 8- and 10-fold CV, the maximum classification rates of 100% were achieved for all k values. On the other hand, increasing the k in k-fold CV improved classification performances. The positive role of parameterization was revealed as well. Conclusions: Overall, these findings authenticated the potential of machine learning (ML) algorithms in the diagnosis of COPD using subjects’ saliva features and demographic information.
