IDENTIFICATION OF NAMES AND ACTIONS OF PRINCIPAL OBJECTS IN TV PROGRAM SEGMENTS USING CLOSED CAPTIONS

This paper proposes a method for automatically extracting principal video objects that appear in TV program segments and their actions using linguistic analysis of closed captions. We focus on features based on the text style of the closed captions by using Quinlan's C4.5 decision-tree learning algorithm. We extract a noun describing a video object and a verb describing an action for each video shot. To show the effectiveness of the method, we conducted experiments on the extraction of video segments in which animals appear and perform actions in twenty episodes of a Nature program. We obtained F-values of 0.609 on the extraction of video segments in which animals appear and 0.699 on extracting the action of "eating." We applied our method to a further 20 episodes, and generated a multimedia encyclopedia of animals. This provided a total of 387 video clips of 105 kinds of animals and 261 video clips of 56 kinds of actions.

Download Full-text

Evaluating Seismic Soil Liquefaction Potential Using Bayesian Belief Network and C4.5 Decision Tree Approaches

Applied Sciences ◽

10.3390/app9204226 ◽

2019 ◽

Vol 9 (20) ◽

pp. 4226 ◽

Cited By ~ 2

Author(s):

Mahmood Ahmad ◽

Xiao-Wei Tang ◽

Jiang-Nan Qiu ◽

Feezan Ahmad

Keyword(s):

Decision Tree ◽

Domain Knowledge ◽

Probabilistic Reasoning ◽

Learning Algorithm ◽

Bayesian Belief Network ◽

Soil Liquefaction ◽

Point Of View ◽

Liquefaction Potential ◽

Belief Network ◽

C4.5 Decision Tree

Liquefaction is considered a damaging phenomenon of earthquakes and a major cause of concern in civil engineering. Therefore, its predictory assessment is an essential task for geotechnical experts. This paper investigates the performance of Bayesian belief network (BBN) and C4.5 decision tree (DT) models to evaluate seismic soil liquefaction potential based on the updated and relatively large cone penetration test (CPT) dataset (which includes 251 case histories), comparing them to a simplified procedure and an evolutionary-based approach. The BBN model was developed using the K2 machine learning algorithm and domain knowledge (DK) with data fusion methodology, while the DT model was created using a C4.5 algorithm. This study shows that the BBN model is preferred over the others for evaluation of seismic soil liquefaction potential. Owing to its overall performance, simplicity in practice, data-driven characteristics, and ability to map interactions between variables, the use of a BBN model in assessing seismic soil liquefaction is quite promising. The results of a sensitivity analysis show that ‘equivalent clean sand penetration resistance’ is the most significant factor affecting liquefaction potential. This study also interprets the probabilistic reasoning of the robust BBN model and most probable explanation (MPE) of seismic soil liquefied sites, based on an engineering point of view.

Download Full-text

Automatic Generation of a Multimedia Encyclopedia from TV Programs by Using Closed Captions and Detecting Principal Video Objects

Eighth IEEE International Symposium on Multimedia (ISM'06) ◽

10.1109/ism.2006.36 ◽

2006 ◽

Cited By ~ 6

Author(s):

Kikuka Miura ◽

Ichiro Yamada ◽

Hideki Sumiyoshi ◽

Nobuyuki Yagi

Keyword(s):

Automatic Generation ◽

Video Objects ◽

Closed Captions ◽

Multimedia Encyclopedia

Download Full-text

Utilizing the Genetic Algorithm to Pruning the C4.5 Decision Tree Algorithm

Asian Journal of Applied Sciences ◽

10.24203/ajas.v9i1.6503 ◽

2021 ◽

Vol 9 (1) ◽

Author(s):

Maad M. Mijwil ◽

Rana A. Abttan

Keyword(s):

Machine Learning ◽

Genetic Algorithm ◽

Decision Tree ◽

Learning Algorithm ◽

Machine Learning Algorithms ◽

Continuous Data ◽

Decision Tree Algorithm ◽

Data Set ◽

Confidence Factor ◽

C4.5 Decision Tree

A decision tree (DTs) is one of the most popular machine learning algorithms that divide data repeatedly to form groups or classes. It is a supervised learning algorithm that can be used on discrete or continuous data for classification or regression. The most traditional classifier in this algorithm is the C4.5 decision tree, which is the point of this research. This classifier has the advantage of building a vast data set and does not stop until it reaches the desired goal. The problem with this classifier is that there are unnecessary nodes and branches leading to overfitting. This overfitting can negatively affect the classification process. In this context, the authors suggest utilizing a genetic algorithm to prune the effect of overfitting. This dataset study consists of four datasets: IRIS, Car Evaluation, GLASS, and WINE collected from UC Irvine (UCI) machine learning repository. The experimental results have confirmed the effectiveness of the genetic algorithm in pruning the effect of overfitting on the four datasets and optimizing confidence factor (CF) of the C4.5 decision tree. The proposed method has reached about 92% accuracy in this work.

Download Full-text

Improvisasi Teknik Oversampling MWMOTE Untuk Penanganan Data Tidak Seimbang

JURNAL MEDIA INFORMATIKA BUDIDARMA ◽

10.30865/mib.v5i2.2811 ◽

2021 ◽

Vol 5 (2) ◽

pp. 398

Author(s):

Pramana Yoga Saputra ◽

Moch Zawaruddin Abdullah ◽

Annisa Puspa Kirana

Keyword(s):

Machine Learning ◽

Decision Tree ◽

Learning Algorithm ◽

Synthetic Data ◽

Classification Performance ◽

Machine Learning Algorithm ◽

Minority Class ◽

Imbalance Data ◽

Decision Tree Classification ◽

C4.5 Decision Tree

Imbalance data is a condition which there is a distinction in the quantity of data that results withinside the majority class (classes with very many members) and minority class (classes with very few members). It can complicate the classification process since the machine learning algorithm method is designed to classify already balanced data. The oversampling process technique is used to resolve data imbalance by applying synthetic data to the minority class in such a manner that it has the same volume of data as the majority class. MWMOTE is an oversampling technique that generates synthetic data based on members of the minority class clusters that are close to the majority class. This approach is capable of generating synthetic data well. The resulting synthesis data remains in the nearby majority region and too dense on the border of the cluster. It is hence permitting the resulting synthetic data to go into the majority class classification. This study is objectives to improve the process of generating synthetic data on MWMOTE so that the resulting data is extensively dispensed withinside the minority class. The outcomes of the test show that the proposed method is capable of enhancing the classification performance for KNN and C4.5 Decision Tree classification sequentially by 0.46% and 0.96% compared to MWMOTE

Download Full-text

C4.5 Decision Tree Machine Learning Algorithm Based GIS Route Identification

2018 Tenth International Conference on Ubiquitous and Future Networks (ICUFN) ◽

10.1109/icufn.2018.8436994 ◽

2018 ◽

Author(s):

Pankaj Kumar Dalela ◽

Prashant Bansal ◽

Arun Yadav ◽

Sabyasachi Majumdar ◽

Anurag Yadav ◽

...

Keyword(s):

Machine Learning ◽

Decision Tree ◽

Learning Algorithm ◽

Machine Learning Algorithm ◽

C4.5 Decision Tree

Download Full-text

DIFFERENTIAL EVOLUTION IN THE DECISION TREE LEARNING ALGORITHM

Siberian Journal of Science and Technology ◽

10.31772/2587-6066-2019-20-3-312-319 ◽

2019 ◽

Vol 20 (3) ◽

pp. 312-319

Author(s):

S. A. Mitrofanov ◽

◽

E. S. Semenkin ◽

Keyword(s):

Decision Tree ◽

Differential Evolution ◽

Learning Algorithm ◽

Decision Tree Learning

Download Full-text

Internet Traffic Classification Using C4.5 Decision Tree

Journal of Software ◽

10.3724/sp.j.1001.2009.03444 ◽

2009 ◽

Vol 20 (10) ◽

pp. 2692-2704 ◽

Cited By ~ 26

Author(s):

Peng XU ◽

Sen LIN

Keyword(s):

Decision Tree ◽

Internet Traffic ◽

Traffic Classification ◽

C4.5 Decision Tree ◽

Internet Traffic Classification

Download Full-text

174 A comparison of machine learning algorithms in the classification of beef steers finished in feedlot

Journal of Animal Science ◽

10.1093/jas/skaa278.231 ◽

2020 ◽

Vol 98 (Supplement_4) ◽

pp. 126-127

Author(s):

Lucas S Lopes ◽

Christine F Baes ◽

Dan Tulpan ◽

Luis Artur Loyola Chardulo ◽

Otavio Machado Neto ◽

...

Keyword(s):

Machine Learning ◽

Decision Tree ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Final Decision ◽

Relevant Parameter ◽

Good Prediction ◽

Quality Traits ◽

C4.5 Decision Tree

Abstract The aim of this project is to compare some of the state-of-the-art machine learning algorithms on the classification of steers finished in feedlots based on performance, carcass and meat quality traits. The precise classification of animals allows for fast, real-time decision making in animal food industry, such as culling or retention of herd animals. Beef production presents high variability in its numerous carcass and beef quality traits. Machine learning algorithms and software provide an opportunity to evaluate the interactions between traits to better classify animals. Four different treatment levels of wet distiller’s grain were applied to 97 Angus-Nellore animals and used as features for the classification problem. The C4.5 decision tree, Naïve Bayes (NB), Random Forest (RF) and Multilayer Perceptron (MLP) Artificial Neural Network algorithms were used to predict and classify the animals based on recorded traits measurements, which include initial and final weights, sheer force and meat color. The top performing classifier was the C4.5 decision tree algorithm with a classification accuracy of 96.90%, while the RF, the MLP and NB classifiers had accuracies of 55.67%, 39.17% and 29.89% respectively. We observed that the final decision tree model constructed with C4.5 selected only the dry matter intake (DMI) feature as a differentiator. When DMI was removed, no other feature or combination of features was sufficiently strong to provide good prediction accuracies for any of the classifiers. We plan to investigate in a follow-up study on a significantly larger sample size, the reasons behind DMI being a more relevant parameter than the other measurements.

Download Full-text

A DECISION TREE-BASED CLASSIFICATION APPROACH TO RULE EXTRACTION FOR SECURITY ANALYSIS

International Journal of Information Technology & Decision Making ◽

10.1142/s0219622006001824 ◽

2006 ◽

Vol 05 (01) ◽

pp. 227-240 ◽

Cited By ~ 15

Author(s):

N. REN ◽

M. ZARGHAM ◽

S. RAHIMI

Keyword(s):

Decision Tree ◽

High Performance ◽

Security Analysis ◽

Predictive Performance ◽

Classification Model ◽

Stock Selection ◽

Selection Rules ◽

Stock Portfolios ◽

Decision Tree Classification ◽

C4.5 Decision Tree

Stock selection rules are extensively utilized as the guideline to construct high performance stock portfolios. However, the predictive performance of the rules developed by some economic experts in the past has decreased dramatically for the current stock market. In this paper, C4.5 decision tree classification method was adopted to construct a model for stock prediction based on the fundamental stock data, from which a set of stock selection rules was derived. The experimental results showed that the generated rules have exceptional predictive performance. Moreover, it also demonstrated that the C4.5 decision tree classification model can work efficiently on the high noise stock data domain.

Download Full-text