Estimation of Traffic Incident Duration: A Comparative Study of Decision Tree Models

2020 ◽  
Vol 45 (10) ◽  
pp. 8099-8110
Author(s):  
Abdulsamet Saracoglu ◽  
Halit Ozen
2021 ◽  
Vol 54 (1) ◽  
pp. 1-38
Author(s):  
Víctor Adrián Sosa Hernández ◽  
Raúl Monroy ◽  
Miguel Angel Medina-Pérez ◽  
Octavio Loyola-González ◽  
Francisco Herrera

Experts from different domains have resorted to machine learning techniques to produce explainable models that support decision-making. Among existing techniques, decision trees have been useful in many application domains for classification. Decision trees can make decisions in a language that is closer to that of the experts. Many researchers have attempted to create better decision tree models by improving the components of the induction algorithm. One of the main components that have been studied and improved is the evaluation measure for candidate splits. In this article, we introduce a tutorial that explains decision tree induction. Then, we present an experimental framework to assess the performance of 21 evaluation measures that produce different C4.5 variants considering 110 databases, two performance measures, and 10× 10-fold cross-validation. Furthermore, we compare and rank the evaluation measures by using a Bayesian statistical analysis. From our experimental results, we present the first two performance rankings in the literature of C4.5 variants. Moreover, we organize the evaluation measures into two groups according to their performance. Finally, we introduce meta-models that automatically determine the group of evaluation measures to produce a C4.5 variant for a new database and some further opportunities for decision tree models.


Sensors ◽  
2021 ◽  
Vol 21 (8) ◽  
pp. 2849
Author(s):  
Sungbum Jun

Due to the recent advance in the industrial Internet of Things (IoT) in manufacturing, the vast amount of data from sensors has triggered the need for leveraging such big data for fault detection. In particular, interpretable machine learning techniques, such as tree-based algorithms, have drawn attention to the need to implement reliable manufacturing systems, and identify the root causes of faults. However, despite the high interpretability of decision trees, tree-based models make a trade-off between accuracy and interpretability. In order to improve the tree’s performance while maintaining its interpretability, an evolutionary algorithm for discretization of multiple attributes, called Decision tree Improved by Multiple sPLits with Evolutionary algorithm for Discretization (DIMPLED), is proposed. The experimental results with two real-world datasets from sensors showed that the decision tree improved by DIMPLED outperformed the performances of single-decision-tree models (C4.5 and CART) that are widely used in practice, and it proved competitive compared to the ensemble methods, which have multiple decision trees. Even though the ensemble methods could produce slightly better performances, the proposed DIMPLED has a more interpretable structure, while maintaining an appropriate performance level.


2014 ◽  
Vol 2014 ◽  
pp. 1-16 ◽  
Author(s):  
Qingchao Liu ◽  
Jian Lu ◽  
Shuyan Chen ◽  
Kangjia Zhao

This study presents the applicability of the Naïve Bayes classifier ensemble for traffic incident detection. The standard Naive Bayes (NB) has been applied to traffic incident detection and has achieved good results. However, the detection result of the practically implemented NB depends on the choice of the optimal threshold, which is determined mathematically by using Bayesian concepts in the incident-detection process. To avoid the burden of choosing the optimal threshold and tuning the parameters and, furthermore, to improve the limited classification performance of the NB and to enhance the detection performance, we propose an NB classifier ensemble for incident detection. In addition, we also propose to combine the Naïve Bayes and decision tree (NBTree) to detect incidents. In this paper, we discuss extensive experiments that were performed to evaluate the performances of three algorithms: standard NB, NB ensemble, and NBTree. The experimental results indicate that the performances of five rules of the NB classifier ensemble are significantly better than those of standard NB and slightly better than those of NBTree in terms of some indicators. More importantly, the performances of the NB classifier ensemble are very stable.


Author(s):  
Prashansa Agrawal ◽  
Antony Franklin ◽  
Digvijay Pawar ◽  
Srijith PK

Author(s):  
Haozhe Cong ◽  
Cong Chen ◽  
Pei-Sung Lin ◽  
Guohui Zhang ◽  
John Milton ◽  
...  

Highway traffic incidents induce a significant loss of life, economy, and productivity through injuries and fatalities, extended travel time and delay, and excessive energy consumption and air pollution. Traffic emergency management during incident conditions is the core element of active traffic management, and it is of practical significance to accurately understand the duration time distribution for typical traffic incident types and the factors that influence incident duration. This study proposes a dual-learning Bayesian network (BN) model to estimate traffic incident duration and to examine the influence of heterogeneous factors on the length of duration based on expert knowledge of traffic incident management and highway incident data collected in Zhejiang Province, China. Fifteen variables related to three aspects of traffic incidents, including incident information, incident consequences, and rescue resources, were included in the analysis. The trained BN model achieves favorable performance in several areas, including classification accuracy, the receiver operating characteristic (ROC) curve, and the area under curve (AUC) value. A classification matrix, and significant variables and their heterogeneous influences are identified accordingly. The research findings from this study provide beneficial reference to the understanding of decision-making in traffic incident response and process, active traffic incident management, and intelligent transportation systems.


Sign in / Sign up

Export Citation Format

Share Document