Assessment of machine learning algorithms in cloud computing frameworks

Author(s):  
Kevin Li ◽  
C. Gibson ◽  
D. Ho ◽  
Qi Zhou ◽  
J. Kim ◽  
...  
2020 ◽  
Vol 17 (8) ◽  
pp. 3765-3769
Author(s):  
N. P. Ponnuviji ◽  
M. Vigilson Prem

Cloud Computing has revolutionized the Information Technology by allowing the users to use variety number of resources in different applications in a less expensive manner. The resources are allocated to access by providing scalability flexible on-demand access in a virtual manner, reduced maintenance with less infrastructure cost. The majority of resources are handled and managed by the organizations over the internet by using different standards and formats of the networking protocols. Various research and statistics have proved that the available and existing technologies are prone to threats and vulnerabilities in the protocols legacy in the form of bugs that pave way for intrusion in different ways by the attackers. The most common among attacks is the Distributed Denial of Service (DDoS) attack. This attack targets the cloud’s performance and cause serious damage to the entire cloud computing environment. In the DDoS attack scenario, the compromised computers are targeted. The attacks are done by transmitting a large number of packets injected with known and unknown bugs to a server. A huge portion of the network bandwidth of the users’ cloud infrastructure is affected by consuming enormous time of their servers. In this paper, we have proposed a DDoS Attack detection scheme based on Random Forest algorithm to mitigate the DDoS threat. This algorithm is used along with the signature detection techniques and generates a decision tree. This helps in the detection of signature attacks for the DDoS flooding attacks. We have also used other machine learning algorithms and analyzed based on the yielded results.


Author(s):  
B. Praveen ◽  
S. Mustak ◽  
Pritee Sharma

<p><strong>Abstract.</strong> Mapping of agricultural land use/cover was initiated since the past several decades for land use planning, change detection analysis, crop yield monitoring etc. using earth observation datasets and traditional parametric classifiers. Recently, machine learning, cloud computing, Google Earth Engine (GEE) and open source earth observation datasets widely used for fast, cost-efficient and precise agricultural land use/cover mapping and change detection analysis. Main objective of this study was to assess the transferability of the machine learning algorithms for land use/cover mapping using cloud computing and open source earth observation datasets. In this study, the Landsat TM (L5, L8) of 2018, 2009 and 1998 were selected and median reflectance of spectral bands in Kharif and Rabi season were used for the classification. In addition, three important machine learning algorithms such as Support Vector Machine with Radial Basis Function (SVM-RBF), Random forest (RF) and Classification and Regression Tree (CART) were selected to evaluate the performance in transferability for agricultural land use classification using GEE. Seven land use/cover classes such as built-up, cropland, fallow land, vegetation etc. were selected based on literature review and local land use classification scheme. In this classification, several strategies were employed such as feature extraction, feature selection, parameter tuning, sensitivity analysis on size of training samples, transferability analysis to assess the performance of the selected machine learning algorithms for land use/cover classification. The result shows that SVM-RBF outperforms the RF and CART for both spatial and temporal transferability analysis. This result is very helpful for agriculture and remote sensing scientist to suggest promising guideline to land use planner and policy-makers for efficient land use mapping, change detection analysis, land use planning and natural resource management.</p>


2019 ◽  
Vol 11 (24) ◽  
pp. 2928 ◽  
Author(s):  
Pinki Mondal ◽  
Xue Liu ◽  
Temilola E. Fatoyinbo ◽  
David Lagomasino

Creating a national baseline for natural resources, such as mangrove forests, and monitoring them regularly often requires a consistent and robust methodology. With freely available satellite data archives and cloud computing resources, it is now more accessible to conduct such large-scale monitoring and assessment. Yet, few studies examine the reproducibility of such mangrove monitoring frameworks, especially in terms of generating consistent spatial extent. Our objective was to evaluate a combination of image processing approaches to classify mangrove forests along the coast of Senegal and The Gambia. We used freely available global satellite data (Sentinel-2), and cloud computing platform (Google Earth Engine) to run two machine learning algorithms, random forest (RF), and classification and regression trees (CART). We calibrated and validated the algorithms using 800 reference points collected using high-resolution images. We further re-ran 10 iterations for each algorithm, utilizing unique subsets of the initial training data. While all iterations resulted in thematic mangrove maps with over 90% accuracy, the mangrove extent ranges between 827–2807 km2 for Senegal and 245–1271 km2 for The Gambia with one outlier for each country. We further report “Places of Agreement” (PoA) to identify areas where all iterations for both methods agree (506.6 km2 and 129.6 km2 for Senegal and The Gambia, respectively), thus have a high confidence in predicting mangrove extent. While we acknowledge the time- and cost-effectiveness of such methods for the landscape managers, we recommend utilizing them with utmost caution, as well as post-classification on-the-ground checks, especially for decision making.


2020 ◽  
Vol 39 (5) ◽  
pp. 6579-6590
Author(s):  
Sandy Çağlıyor ◽  
Başar Öztayşi ◽  
Selime Sezgin

The motion picture industry is one of the largest industries worldwide and has significant importance in the global economy. Considering the high stakes and high risks in the industry, forecast models and decision support systems are gaining importance. Several attempts have been made to estimate the theatrical performance of a movie before or at the early stages of its release. Nevertheless, these models are mostly used for predicting domestic performances and the industry still struggles to predict box office performances in overseas markets. In this study, the aim is to design a forecast model using different machine learning algorithms to estimate the theatrical success of US movies in Turkey. From various sources, a dataset of 1559 movies is constructed. Firstly, independent variables are grouped as pre-release, distributor type, and international distribution based on their characteristic. The number of attendances is discretized into three classes. Four popular machine learning algorithms, artificial neural networks, decision tree regression and gradient boosting tree and random forest are employed, and the impact of each group is observed by compared by the performance models. Then the number of target classes is increased into five and eight and results are compared with the previously developed models in the literature.


2020 ◽  
pp. 1-11
Author(s):  
Jie Liu ◽  
Lin Lin ◽  
Xiufang Liang

The online English teaching system has certain requirements for the intelligent scoring system, and the most difficult stage of intelligent scoring in the English test is to score the English composition through the intelligent model. In order to improve the intelligence of English composition scoring, based on machine learning algorithms, this study combines intelligent image recognition technology to improve machine learning algorithms, and proposes an improved MSER-based character candidate region extraction algorithm and a convolutional neural network-based pseudo-character region filtering algorithm. In addition, in order to verify whether the algorithm model proposed in this paper meets the requirements of the group text, that is, to verify the feasibility of the algorithm, the performance of the model proposed in this study is analyzed through design experiments. Moreover, the basic conditions for composition scoring are input into the model as a constraint model. The research results show that the algorithm proposed in this paper has a certain practical effect, and it can be applied to the English assessment system and the online assessment system of the homework evaluation system algorithm system.


2019 ◽  
Vol 1 (2) ◽  
pp. 78-80
Author(s):  
Eric Holloway

Detecting some patterns is a simple task for humans, but nearly impossible for current machine learning algorithms.  Here, the "checkerboard" pattern is examined, where human prediction nears 100% and machine prediction drops significantly below 50%.


Sign in / Sign up

Export Citation Format

Share Document