scholarly journals ONLINE PURCHASING PLATFORM USING CROWD SOURCING WITH IMPROVISATION OF CLASSIFICATION ACCURACY

2021 ◽  
Vol 9 (1) ◽  
pp. 1123-1134
Author(s):  
P. Gokulakrishnan, D. Suresh, S. Satheesbabu

Crowd-sourcing is a prototype where persons cum organisations acquire facts such as ideas, micro-tasks, financial, vote casting associated to items and offerings from individuals of large, open and rapidly-evolving nature. It entails utilization of web acquired and distribute work between members to get a collective result. The software of classification tasks in crowd-sourcing is a counter step due to the inclined reputation of crowd-sourcing market. Dynamic Label Acquisition and Answer Aggregation (DLTA) crowd-sourcing framework accomplishes the classification assignment in a promising manner. But most of the current works are now not in a position to supply an budget allocation for labels due to the fact they do  not make the most the Label inference and acquisition phase. In addition, label mismatch and multi-label tasks are the different issues encountered in the current works. To overcome, it is proposed to undertake Random Forest Algorithm (RFA) for classification in crowd-sourcing. The goal of this work is to enhance the crowd-sourcing classification task efficiency with Dynamic Resource Algorithm. RFA is activated by means of developing a multitude of  decision tree at training time and consequences with the training and it applies a bagging approach to produce the last end result with more accuracy.

2021 ◽  
pp. 1-13
Author(s):  
Tiebo Sun ◽  
Jinhao Liu ◽  
Jiangming Kan ◽  
Tingting Sui

Aiming at the problem of automatic classification of point cloud in the investigation of vegetation resources in the straw checkerboard barriers region, an improved random forest point cloud classification algorithm was proposed. According to the problems of decision tree redundancy and absolute majority voting in the existing random forest algorithm, first the similarity of the decision tree was calculated based on the tree edit distance, further clustered reduction based on the maximum and minimum distance algorithm, and then introduced classification accuracy of decision tree to construct weight matrix to implement weighted voting at the voting stage. Before random forest classification, based on the characteristics of point cloud data, a total of 20 point cloud single-point features and multi-point statistical features were selected to participate in point cloud classification, based on the point cloud data spatial distribution characteristics, three different scales for selecting point cloud neighborhoods were set based on the point cloud density, point cloud classification feature sets at different scales were constructed, optimizing important features of point cloud to participate in point cloud classification calculation after variable importance scored. The experimental results showed that the point cloud classification based on the optimized random forest algorithm in this paper achieved a total classification accuracy of 94.15% in dataset 1 acquired by lidar, the overall accuracy of classification on dataset 2 obtained by dense matching reaches 92.03%, both were higher than the unoptimized random forest algorithm and MRF, SVM point cloud classification method, and dimensionality reduction through feature optimization can greatly improve the efficiency of the algorithm.


2016 ◽  
Vol 26 (03) ◽  
pp. 1750007 ◽  
Author(s):  
S. Dinakaran ◽  
P. Ranjit Jeba Thangaiah

This article introduces a novel ensemble method named eAdaBoost (Effective Adaptive Boosting) is a meta classifier which is developed by enhancing the existing AdaBoost algorithm and to handle the time complexity and also to produce the best classification accuracy. The eAdaBoost reduces the error rate when compared with the existing methods and generates the best accuracy by reweighing each feature for further process. The comparison results of an extensive experimental evaluation of the proposed method are explained using the UCI machine learning repository datasets. The accuracy of the classifiers and statistical test comparisons are made with various boosting algorithms. The proposed eAdaBoost has been also implemented with different decision tree classifiers like C4.5, Decision Stump, NB Tree and Random Forest. The algorithm has been computed with various dataset, with different weight thresholds and the performance is analyzed. The proposed method produces better results using random forest and NB tree as base classifier than the decision stump and C4.5 classifiers for few datasets. The eAdaBoost gives better classification accuracy, and prediction accuracy, and execution time is also less when compared with other classifiers.


Nowadays, heart disease is the main cause of several deaths among all other diseases. Due to the lack of resources in the medical field, the prediction of heart diseases becomes a major problem. For early diagnosis and treatment, some classification algorithms such as Decision Tree and Random Forest Algorithm are used. The data mining techniques compare the accuracy of the algorithm and predict heart diseases. The main aim of this paper is to predict heart disease based on the dataset values. In this paper we are comparing the accuracy of above two algorithms. To implement these methods the following steps are used. In first phase, a dataset of 13 attributes is collected and it was applied on classification techniques using the Decision tree and Random Forest Algorithms. Finally, the accuracy is collected for both the algorithms. In this paper we observed that random forest is generating better results than decision tree in prediction of heart diseases.


2017 ◽  
Vol 2017 ◽  
pp. 1-13 ◽  
Author(s):  
Jae-Hee Hur ◽  
Sun-Young Ihm ◽  
Young-Ho Park

Recently, the importance of mobile cloud computing has increased. Mobile devices can collect personal data from various sensors within a shorter period of time and sensor-based data consists of valuable information from users. Advanced computation power and data analysis technology based on cloud computing provide an opportunity to classify massive sensor data into given labels. Random forest algorithm is known as black box model which is hardly able to interpret the hidden process inside. In this paper, we propose a method that analyzes the variable impact in random forest algorithm to clarify which variable affects classification accuracy the most. We apply Shapley Value with random forest to analyze the variable impact. Under the assumption that every variable cooperates as players in the cooperative game situation, Shapley Value fairly distributes the payoff of variables. Our proposed method calculates the relative contributions of the variables within its classification process. In this paper, we analyze the influence of variables and list the priority of variables that affect classification accuracy result. Our proposed method proves its suitability for data interpretation in black box model like a random forest so that the algorithm is applicable in mobile cloud computing environment.


Author(s):  
H. Sahu ◽  
D. Haldar ◽  
A. Danodia ◽  
S. Kumar

<p><strong>Abstract.</strong> A study was conducted in Saharanpur District of Uttar Pradesh to asses the potential of Sentinel-1A SAR Data in orchard crop classification. The objective of the study was to evaluate three different classifiers that are maximum likelihood classifier, decision tree algorithm and random forest algorithm in Sentinel-1A SAR Data. An attempt is made to study Sentinel-1A SAR Data to classify orchard crop using this approach. Here the rule-based classifiers such as decision tree algorithm and random forest algorithm are compared with conventional maximum likelihood classifier. Statistical analysis of the classification show that the distribution of the crop, forest orchard, settlement and waterbody was 17.47<span class="thinspace"></span>%, 0.47<span class="thinspace"></span>%, 28.3<span class="thinspace"></span>%, 28.3<span class="thinspace"></span>% and 25.5<span class="thinspace"></span>% respectively in all the classification algorithm but root mean square error for maximum likelihood classifier (1.278) is more than decision tree algorithm (1.196) and random forest algorithm (1.193). Out of three, a percentage correct prediction is highest in case of decision tree algorithm (73.4) than random forest algorithm (72.5) and least for maximum likelihood classifier (66.8) in December 2017. The accuracy for orchard class is 0.81 for maximum likelihood classifier, 0.80 for decision tree algorithm and 0.78 for random forest algorithm. Thus Sentinel-1A SAR Data was effectively utilized for the classification of orchard crops.</p>


Sign in / Sign up

Export Citation Format

Share Document