Using Machine Learning Image Recognition for Code Reviews

Mapping Intimacies ◽

10.5121/csit.2020.101514 ◽

2020 ◽

Author(s):

Michael Dorin ◽

Trang Le ◽

Rajkumar Kolakaluri ◽

Sergio Montenegro

Keyword(s):

Machine Learning ◽

Image Recognition ◽

Source Code ◽

Cost Effective ◽

Software Developers ◽

Development Cycle ◽

Software Release ◽

Software Engineers

It is commonly understood that code reviews are a cost-effective way of finding faults early in the development cycle. However, many modern software developers are too busy to do them. Skipping code reviews means a loss of opportunity to detect expensive faults prior to software release. Software engineers can be pushed in many directions and reviewing code is very often considered an undesirable task, especially when time is wasted reviewing programs that are not ready. In this study, we wish to ascertain the potential for using machine learning and image recognition to detect immature software source code prior to a review. We show that it is possible to use machine learning to detect software problems visually and allow code reviews to focus on application details. The results are promising and are an indication that further research could be valuable.

Download Full-text

A development cycle for automated self-exploration of robot behaviors

AI Perspectives ◽

10.1186/s42467-021-00008-9 ◽

2021 ◽

Vol 3 (1) ◽

Author(s):

Thomas M. Roehr ◽

Daniel Harnack ◽

Hendrik Wöhrle ◽

Felix Wiebe ◽

Moritz Schilling ◽

...

Keyword(s):

Machine Learning ◽

Semantic Annotation ◽

Robotic Systems ◽

Integrative Approach ◽

Software Components ◽

Proof Of Concept ◽

Software Developers ◽

Development Cycle ◽

Development Processes ◽

Application Requirements

AbstractIn this paper we introduce Q-Rock, a development cycle for the automated self-exploration and qualification of robot behaviors. With Q-Rock, we suggest a novel, integrative approach to automate robot development processes. Q-Rock combines several machine learning and reasoning techniques to deal with the increasing complexity in the design of robotic systems. The Q-Rock development cycle consists of three complementary processes: (1) automated exploration of capabilities that a given robotic hardware provides, (2) classification and semantic annotation of these capabilities to generate more complex behaviors, and (3) mapping between application requirements and available behaviors. These processes are based on a graph-based representation of a robot’s structure, including hardware and software components. A central, scalable knowledge base enables collaboration of robot designers including mechanical, electrical and systems engineers, software developers and machine learning experts. In this paper we formalize Q-Rock’s integrative development cycle and highlight its benefits with a proof-of-concept implementation and a use case demonstration.

Download Full-text

Software Source Code Plagiarism and Direction Detection Based on PDG

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.373-375.1172 ◽

2013 ◽

Vol 373-375 ◽

pp. 1172-1177

Author(s):

Bo Shu ◽

Xiao Jun Du

Keyword(s):

Software Development ◽

Open Source ◽

Open Source Software ◽

Process Design ◽

Source Code ◽

Detection Algorithm ◽

Plagiarism Detection ◽

Software Developers ◽

Development Cycle ◽

Direction Detection

Because of the complexity of the software development, some software developers may plagiarize source code that comes from other projects or open source software in order to shorten development cycle. Usually the copyist would modify and disguise the source code copied to escape plagiarism detection. So far, most algorithms cant completely detect the source disguised by the copyist, especially cant exactly distinguish between the source code and the plagiaristic code. In this paper, we summarize and analyze the effect of disguised source to the detection process, design the strategy to remove the effect of disguised source, and propose a PDG-based software source code plagiarism detection algorithm. The algorithm can detect the existence of disguised source, so as to find out source code plagiarism. And we propose a heuristic rule to make the detection algorithm have the ability to give the plagiarism direction. Any existing algorithm does not have this function. We prove the availability of the algorithm by experiment.

Download Full-text

Recognizing lines of code violating company-specific coding guidelines using machine learning

Empirical Software Engineering ◽

10.1007/s10664-019-09769-8 ◽

2019 ◽

Vol 25 (1) ◽

pp. 220-265 ◽

Cited By ~ 1

Author(s):

Miroslaw Ochodek ◽

Regina Hebig ◽

Wilhelm Meding ◽

Gert Frost ◽

Miroslaw Staron

Keyword(s):

Machine Learning ◽

Action Research ◽

Software Development ◽

Source Code ◽

Design Guidelines ◽

Code Design ◽

Action Research Project ◽

Software Developers ◽

Code Analysis

AbstractSoftware developers in big and medium-size companies are working with millions of lines of code in their codebases. Assuring the quality of this code has shifted from simple defect management to proactive assurance of internal code quality. Although static code analysis and code reviews have been at the forefront of research and practice in this area, code reviews are still an effort-intensive and interpretation-prone activity. The aim of this research is to support code reviews by automatically recognizing company-specific code guidelines violations in large-scale, industrial source code. In our action research project, we constructed a machine-learning-based tool for code analysis where software developers and architects in big and medium-sized companies can use a few examples of source code lines violating code/design guidelines (up to 700 lines of code) to train decision-tree classifiers to find similar violations in their codebases (up to 3 million lines of code). Our action research project consisted of (i) understanding the challenges of two large software development companies, (ii) applying the machine-learning-based tool to detect violations of Sun’s and Google’s coding conventions in the code of three large open source projects implemented in Java, (iii) evaluating the tool on evolving industrial codebase, and (iv) finding the best learning strategies to reduce the cost of training the classifiers. We were able to achieve the average accuracy of over 99% and the average F-score of 0.80 for open source projects when using ca. 40K lines for training the tool. We obtained a similar average F-score of 0.78 for the industrial code but this time using only up to 700 lines of code as a training dataset. Finally, we observed the tool performed visibly better for the rules requiring to understand a single line of code or the context of a few lines (often allowing to reach the F-score of 0.90 or higher). Based on these results, we could observe that this approach can provide modern software development companies with the ability to use examples to teach an algorithm to recognize violations of code/design guidelines and thus increase the number of reviews conducted before the product release. This, in turn, leads to the increased quality of the final software.

Download Full-text

Application of Machine Learning Approaches for the Design and Study of Anticancer Drugs

Current Drug Targets ◽

10.2174/1389450119666180809122244 ◽

2019 ◽

Vol 20 (5) ◽

pp. 488-500 ◽

Cited By ~ 6

Author(s):

Yan Hu ◽

Yi Lu ◽

Shuo Wang ◽

Mengying Zhang ◽

Xiaosheng Qu ◽

...

Keyword(s):

Machine Learning ◽

Drug Design ◽

Anticancer Drugs ◽

Nearest Neighbor ◽

Cost Effective ◽

Support Vector ◽

Learning Approaches ◽

K Nearest Neighbor ◽

Activity Prediction ◽

Linear Discriminant

Background: Globally the number of cancer patients and deaths are continuing to increase yearly, and cancer has, therefore, become one of the world's highest causes of morbidity and mortality. In recent years, the study of anticancer drugs has become one of the most popular medical topics. Objective: In this review, in order to study the application of machine learning in predicting anticancer drugs activity, some machine learning approaches such as Linear Discriminant Analysis (LDA), Principal components analysis (PCA), Support Vector Machine (SVM), Random forest (RF), k-Nearest Neighbor (kNN), and Naïve Bayes (NB) were selected, and the examples of their applications in anticancer drugs design are listed. Results: Machine learning contributes a lot to anticancer drugs design and helps researchers by saving time and is cost effective. However, it can only be an assisting tool for drug design. Conclusion: This paper introduces the application of machine learning approaches in anticancer drug design. Many examples of success in identification and prediction in the area of anticancer drugs activity prediction are discussed, and the anticancer drugs research is still in active progress. Moreover, the merits of some web servers related to anticancer drugs are mentioned.

Download Full-text

Knee Muscle Force Estimating Model Using Machine Learning Approach

The Computer Journal ◽

10.1093/comjnl/bxaa160 ◽

2020 ◽

Author(s):

Anurag Sohane ◽

Ravinder Agarwal

Keyword(s):

Machine Learning ◽

Random Forest ◽

Muscle Force ◽

Vastus Lateralis ◽

Input Parameter ◽

Research Work ◽

Cost Effective ◽

Coefficient Of Determination ◽

Muscle Forces ◽

Knee Muscle

Abstract Various simulation type tools and conventional algorithms are being used to determine knee muscle forces of human during dynamic movement. These all may be good for clinical uses, but have some drawbacks, such as higher computational times, muscle redundancy and less cost-effective solution. Recently, there has been an interest to develop supervised learning-based prediction model for the computationally demanding process. The present research work is used to develop a cost-effective and efficient machine learning (ML) based models to predict knee muscle force for clinical interventions for the given input parameter like height, mass and angle. A dataset of 500 human musculoskeletal, have been trained and tested using four different ML models to predict knee muscle force. This dataset has obtained from anybody modeling software using AnyPyTools, where human musculoskeletal has been utilized to perform squatting movement during inverse dynamic analysis. The result based on the datasets predicts that the random forest ML model outperforms than the other selected models: neural network, generalized linear model, decision tree in terms of mean square error (MSE), coefficient of determination (R2), and Correlation (r). The MSE of predicted vs actual muscle forces obtained from the random forest model for Biceps Femoris, Rectus Femoris, Vastus Medialis, Vastus Lateralis are 19.92, 9.06, 5.97, 5.46, Correlation are 0.94, 0.92, 0.92, 0.94 and R2 are 0.88, 0.84, 0.84 and 0.89 for the test dataset, respectively.

Download Full-text

A cost-effective trilateration-based radio localization algorithm using machine learning and sequential least-square programming optimization

Computer Communications ◽

10.1016/j.comcom.2021.06.005 ◽

2021 ◽

Author(s):

João Paulo P.G. Marques ◽

Daniel C. Cunha ◽

Lucas M.F. Harada ◽

Lizandro N. Silva ◽

Igor D. Silva

Keyword(s):

Machine Learning ◽

Cost Effective ◽

Least Square ◽

Localization Algorithm ◽

Radio Localization

Download Full-text

A hybrid machine learning–based multi-objective supervisory control strategy of a full-scale wastewater treatment for cost-effective and sustainable operation under varying influent conditions

Journal of Cleaner Production ◽

10.1016/j.jclepro.2021.125853 ◽

2021 ◽

Vol 291 ◽

pp. 125853 ◽

Cited By ~ 1

Author(s):

SungKu Heo ◽

KiJeon Nam ◽

Shahzeb Tariq ◽

Juin Yau Lim ◽

Junkyu Park ◽

...

Keyword(s):

Machine Learning ◽

Wastewater Treatment ◽

Control Strategy ◽

Supervisory Control ◽

Cost Effective ◽

Full Scale ◽

Multi Objective ◽

Sustainable Operation ◽

Hybrid Machine

Download Full-text

Detection of Strawberry Diseases Using a Convolutional Neural Network

Plants ◽

10.3390/plants10010031 ◽

2020 ◽

Vol 10 (1) ◽

pp. 31

Author(s):

Jia-Rong Xiao ◽

Pei-Che Chung ◽

Hung-Yi Wu ◽

Quoc-Hung Phan ◽

Jer-Liang Andrew Yeh ◽

...

Keyword(s):

Neural Network ◽

Powdery Mildew ◽

Convolutional Neural Network ◽

Image Recognition ◽

Cost Effective ◽

Gray Mold ◽

Leaf Blight ◽

Crown Rot ◽

Accuracy Rate ◽

Fruit Disease

The strawberry (Fragaria × ananassa Duch.) is a high-value crop with an annual cultivated area of ~500 ha in Taiwan. Over 90% of strawberry cultivation is in Miaoli County. Unfortunately, various diseases significantly decrease strawberry production. The leaf and fruit disease became an epidemic in 1986. From 2010 to 2016, anthracnose crown rot caused the loss of 30–40% of seedlings and ~20% of plants after transplanting. The automation of agriculture and image recognition techniques are indispensable for detecting strawberry diseases. We developed an image recognition technique for the detection of strawberry diseases using a convolutional neural network (CNN) model. CNN is a powerful deep learning approach that has been used to enhance image recognition. In the proposed technique, two different datasets containing the original and feature images are used for detecting the following strawberry diseases—leaf blight, gray mold, and powdery mildew. Specifically, leaf blight may affect the crown, leaf, and fruit and show different symptoms. By using the ResNet50 model with a training period of 20 epochs for 1306 feature images, the proposed CNN model achieves a classification accuracy rate of 100% for leaf blight cases affecting the crown, leaf, and fruit; 98% for gray mold cases, and 98% for powdery mildew cases. In 20 epochs, the accuracy rate of 99.60% obtained from the feature image dataset was higher than that of 1.53% obtained from the original one. This proposed model provides a simple, reliable, and cost-effective technique for detecting strawberry diseases.

Download Full-text

Accelerated discovery of high-strength aluminum alloys by machine learning

Communications Materials ◽

10.1038/s43246-020-00074-2 ◽

2020 ◽

Vol 1 (1) ◽

Author(s):

Jiaheng Li ◽

Yingbo Zhang ◽

Xinyu Cao ◽

Qi Zeng ◽

Ye Zhuang ◽

...

Keyword(s):

Machine Learning ◽

Aluminum Alloys ◽

Mechanical Performance ◽

High Strength ◽

Cost Effective ◽

Alloy System ◽

Processing Route ◽

Specific Strength ◽

Cu Alloy ◽

High Strength Aluminum Alloys

Abstract Aluminum alloys are attractive for a number of applications due to their high specific strength, and developing new compositions is a major goal in the structural materials community. Here, we investigate the Al-Zn-Mg-Cu alloy system (7xxx series) by machine learning-based composition and process optimization. The discovered optimized alloy is compositionally lean with a high ultimate tensile strength of 952 MPa and 6.3% elongation following a cost-effective processing route. We find that the Al8Cu4Y phase in wrought 7xxx-T6 alloys exists in the form of a nanoscale network structure along sub-grain boundaries besides the common irregular-shaped particles. Our study demonstrates the feasibility of using machine learning to search for 7xxx alloys with good mechanical performance.

Download Full-text

Machine Learning Models for Finger Bend Evaluation using Implemented Low cost Flex Sensor

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.35742 ◽

2021 ◽

Vol 9 (VI) ◽

pp. 3605-3611

Author(s):

Pratyush Kaware

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Low Cost ◽

Learning Algorithms ◽

Cost Effective ◽

Machine Learning Algorithms ◽

Support Vector ◽

Learning Models ◽

Machine Learning Models

In this paper a cost-effective sensor has been implemented to read finger bend signals, by attaching the sensor to a finger, so as to classify them based on the degree of bent as well as the joint about which the finger was being bent. This was done by testing with various machine learning algorithms to get the most accurate and consistent classifier. Finally, we found that Support Vector Machine was the best algorithm suited to classify our data, using we were able predict live state of a finger, i.e., the degree of bent and the joints involved. The live voltage values from the sensor were transmitted using a NodeMCU micro-controller which were converted to digital and uploaded on a database for analysis.

Download Full-text