Speaker recognition using adaptively boosted decision tree classifier

People share their views, arguments and emotions about everyday life on social media (SM) platforms such as Twitter and Facebook. Twitter is an international micro-blogging service built around brief messages called tweets. Freestyle writing, incorrect grammar, typographical errors and abbreviations are some of the noise that occurs in this text. Sentiment analysis (SA) of user tweets and opinion mining (OM) of customer reviews are well-known research topics. The texts are gathered from users' tweets by means of OM and automatic SA based on a ternary classification: positive, neutral and negative. Ascertaining sentiment in Twitter data is challenging because of the limited message size, misspellings, unstructured nature, abbreviations and slang. This paper proposes an efficient SA and sentiment classification (SC) of Twitter data using the Gradient Boosted Decision Tree (GBDT) classifier. First, the Twitter data undergoes pre-processing. Next, the pre-processed data is processed using HDFS MapReduce. Features are then extracted from the processed data, and efficient features are selected using the Improved Elephant Herd Optimization (I-EHO) technique. Score values are calculated for each of the chosen features and given to the classifier. Finally, the GBDT classifier labels the data as negative, positive, or neutral. Experimental results are analyzed and contrasted with other conventional techniques to show the higher performance of the proposed method.
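The abstract does not spell out its pre-processing steps, so the following is only a minimal sketch of how the noise it lists (abbreviations, slang, informal writing) might be normalized before feature extraction. The function name `clean_tweet`, the `SLANG` table and the specific cleaning rules are illustrative assumptions, not the authors' method.

```python
import re

# Hypothetical abbreviation/slang table; the paper does not publish its own.
SLANG = {"u": "you", "gr8": "great", "thx": "thanks", "pls": "please"}


def clean_tweet(text):
    """Normalize one tweet into a list of lowercase tokens (assumed rules)."""
    text = text.lower()
    text = re.sub(r"https?://\S+", " ", text)   # drop URLs
    text = re.sub(r"@\w+", " ", text)           # drop user mentions
    text = text.replace("#", " ")               # keep the hashtag word, drop the marker
    text = re.sub(r"(.)\1{2,}", r"\1\1", text)  # squeeze long runs: "soooo" -> "soo"
    tokens = re.findall(r"[a-z0-9']+", text)    # split on anything that is not a word character
    return [SLANG.get(tok, tok) for tok in tokens]  # expand known abbreviations
```

The resulting token lists could then feed whatever feature extraction and selection stage is used; here that stage, and the GBDT classifier itself, are left out.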

An Efficient Detection of HCC-recurrence in Clinical Data Processing using Boosted Decision Tree Classifier

Procedia Computer Science ◽ 10.1016/j.procs.2020.03.196 ◽ 2020 ◽ Vol 167 ◽ pp. 193-204

Author(s): P. Radha ◽ R. Divya

Keyword(s): Decision Tree ◽ Data Processing ◽ Clinical Data ◽ Decision Tree Classifier ◽ Efficient Detection ◽ Tree Classifier ◽ Boosted Decision Tree ◽ HCC Recurrence

A Boosted Decision Tree Model for Predicting Loan Default in P2P Lending Communities

International Journal of Engineering and Advanced Technology - Regular Issue ◽ 10.35940/ijeat.a9626.109119 ◽ 2019 ◽ Vol 9 (1) ◽ pp. 1257-1261

Keyword(s): Small Business ◽ Decision Tree ◽ Decision Tree Classifier ◽ Tree Model ◽ Loan Default ◽ Accuracy Profile ◽ Default Prediction ◽ Tree Classifier ◽ Social Lending ◽ Boosted Decision Tree

Loan default prediction for social lending is an emerging area of research in predictive analytics. The need for large amounts of data and the few available studies on current loan default prediction models for social lending suggest that other viable and easily implementable models should be investigated and developed. In view of this, this study developed a data mining model for predicting loan default among social lending patrons, specifically small business owners, using a boosted decision tree model. The United States Small Business Administration (USBA) publicly available loan administration dataset of 27 features and 899164 data instances was split 80:20 for training and testing the model. 16 data features were finally used as predictors after data cleaning and feature engineering. The gradient boosting decision tree classifier recorded 99% accuracy, compared with 98% for the basic decision tree classifier. The model is further evaluated with (a) receiver operating characteristics (ROC) and area under curve (AUC), (b) cumulative accuracy profile (CAP), and (c) CAP under AUC. Each of these model performance evaluation metrics, especially ROC-AUC, showed a relationship between the true positives and false positives that implies the model is a good fit.
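To make the evaluation terms above concrete, here is a minimal pure-Python sketch of an 80:20 split and of ROC-AUC computed as the probability that a randomly chosen positive (defaulted) case is scored above a randomly chosen negative one. The function names `split_80_20` and `roc_auc`, the shuffling seed and the toy labels and scores are assumptions for illustration; they do not come from the study or its dataset.

```python
import random


def split_80_20(rows, test_fraction=0.2, seed=0):
    """Shuffle rows and split them into (train, test) by the given fraction."""
    rows = list(rows)
    random.Random(seed).shuffle(rows)  # fixed seed so the split is repeatable
    cut = int(round(len(rows) * (1 - test_fraction)))
    return rows[:cut], rows[cut:]


def roc_auc(labels, scores):
    """ROC-AUC via pairwise comparison; ties between scores count as 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))
```

For example, `roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])` gives 0.75, because three of the four positive/negative pairs are ranked correctly. The pairwise form is quadratic in the number of cases, so on a dataset of the size reported here a rank-based implementation would be the practical choice.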
