Evaluating Impact of Race in Facial Recognition across Machine Learning and Deep Learning Algorithms

James Coe; Mustafa Atay

doi:10.3390/computers10090113

Evaluating Impact of Race in Facial Recognition across Machine Learning and Deep Learning Algorithms

Computers ◽

10.3390/computers10090113 ◽

2021 ◽

Vol 10 (9) ◽

pp. 113

Author(s):

James Coe ◽

Mustafa Atay

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Racial Bias ◽

Learning Algorithm ◽

Facial Recognition ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Design Development ◽

Deep Learning Algorithm ◽

The Impact

The research aims to evaluate the impact of race in facial recognition across two types of algorithms. We give a general insight into facial recognition and discuss four problems related to facial recognition. We review our system design, development, and architectures and give an in-depth evaluation plan for each type of algorithm, dataset, and a look into the software and its architecture. We thoroughly explain the results and findings of our experimentation and provide analysis for the machine learning algorithms and deep learning algorithms. Concluding the investigation, we compare the results of two kinds of algorithms and compare their accuracy, metrics, miss rates, and performances to observe which algorithms mitigate racial bias the most. We evaluate racial bias across five machine learning algorithms and three deep learning algorithms using racially imbalanced and balanced datasets. We evaluate and compare the accuracy and miss rates between all tested algorithms and report that SVC is the superior machine learning algorithm and VGG16 is the best deep learning algorithm based on our experimental study. Our findings conclude the algorithm that mitigates the bias the most is VGG16, and all our deep learning algorithms outperformed their machine learning counterparts.

Download Full-text

Stock price prediction using DEEP learning algorithm and its comparison with machine learning algorithms

Intelligent Systems in Accounting Finance & Management ◽

10.1002/isaf.1459 ◽

2019 ◽

Vol 26 (4) ◽

pp. 164-174 ◽

Cited By ~ 3

Author(s):

Mahla Nikou ◽

Gholamreza Mansourfar ◽

Jamshid Bagherzadeh

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Stock Price ◽

Learning Algorithm ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Stock Price Prediction ◽

Price Prediction ◽

Deep Learning Algorithm

Download Full-text

A Deep Learning Algorithm to Predict Hazardous Drinkers and the Severity of Alcohol-Related Problems Using K-NHANES

Frontiers in Psychiatry ◽

10.3389/fpsyt.2021.684406 ◽

2021 ◽

Vol 12 ◽

Author(s):

Suk-Young Kim ◽

Taesung Park ◽

Kwonyoung Kim ◽

Jihoon Oh ◽

Yoonjae Park ◽

...

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Large Scale ◽

Learning Algorithm ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Deep Learning Algorithm ◽

Conventional Machine ◽

Large Scale Survey ◽

Alcohol Related Problems

Purpose: The number of patients with alcohol-related problems is steadily increasing. A large-scale survey of alcohol-related problems has been conducted. However, studies that predict hazardous drinkers and identify which factors contribute to the prediction are limited. Thus, the purpose of this study was to predict hazardous drinkers and the severity of alcohol-related problems of patients using a deep learning algorithm based on a large-scale survey data.Materials and Methods: Datasets of National Health and Nutrition Examination Survey of South Korea (K-NHANES), a nationally representative survey for the entire South Korean population, were used to train deep learning and conventional machine learning algorithms. Datasets from 69,187 and 45,672 participants were used to predict hazardous drinkers and the severity of alcohol-related problems, respectively. Based on the degree of contribution of each variable to deep learning, it was possible to determine which variable contributed significantly to the prediction of hazardous drinkers.Results: Deep learning showed the higher performance than conventional machine learning algorithms. It predicted hazardous drinkers with an AUC (Area under the receiver operating characteristic curve) of 0.870 (Logistic regression: 0.858, Linear SVM: 0.849, Random forest classifier: 0.810, K-nearest neighbors: 0.740). Among 325 variables for predicting hazardous drinkers, energy intake was a factor showing the greatest contribution to the prediction, followed by carbohydrate intake. Participants were classified into Zone I, Zone II, Zone III, and Zone IV based on the degree of alcohol-related problems, showing AUCs of 0.881, 0.774, 0.853, and 0.879, respectively.Conclusion: Hazardous drinking groups could be effectively predicted and individuals could be classified according to the degree of alcohol-related problems using a deep learning algorithm. This algorithm could be used to screen people who need treatment for alcohol-related problems among the general population or hospital visitors.

Download Full-text

Identification of Network Traffic over IOT Platforms

Revue d intelligence artificielle ◽

10.18280/ria.350410 ◽

2021 ◽

Vol 35 (4) ◽

pp. 349-357

Author(s):

Shilpa P. Khedkar ◽

Aroul Canessane Ramalingam

Keyword(s):

Machine Learning ◽

Deep Neural Networks ◽

Learning Algorithm ◽

Heavy Traffic ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Deep Learning Algorithm ◽

Iot Platforms ◽

The Internet Of Things

The Internet of Things (IoT) is a rising infrastructure of 21st century. The classification of traffic over IoT networks is attained significance importance due to rapid growth of users and devices. It is need of the hour to isolate the normal traffic from the malicious traffic and to assign the normal traffic to the proper destination to suffice the QoS requirements of the IoT users. Detection of malicious traffic can be done by continuously monitoring traffic for suspicious links, files, connection created and received, unrecognised protocol/port numbers, and suspicious Destination/Source IP combinations. A proficient classification mechanism in IoT environment should be capable enough to classify the heavy traffic in a fast manner, to deflect the malevolent traffic on time and to transmit the benign traffic to the designated nodes for serving the needs of the users. In this work, adaboost and Xgboost machine learning algorithms and Deep Neural Networks approach are proposed to separate the IoT traffic which eventually enhances the throughput of IoT networks and reduces the congestion over IoT channels. The result of experiment indicates a deep learning algorithm achieves higher accuracy compared to machine learning algorithms.

Download Full-text

Wheat Lodging Detection from UAS Imagery Using Machine Learning Algorithms

Remote Sensing ◽

10.3390/rs12111838 ◽

2020 ◽

Vol 12 (11) ◽

pp. 1838 ◽

Cited By ~ 8

Author(s):

Zhao Zhang ◽

Paulo Flores ◽

C. Igathinathane ◽

Dayakar L. Naik ◽

Ravi Kiran ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Learning ◽

Standard Deviation ◽

Learning Algorithm ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Superior Performance ◽

Support Vector ◽

Unmanned Aerial Systems

The current mainstream approach of using manual measurements and visual inspections for crop lodging detection is inefficient, time-consuming, and subjective. An innovative method for wheat lodging detection that can overcome or alleviate these shortcomings would be welcomed. This study proposed a systematic approach for wheat lodging detection in research plots (372 experimental plots), which consisted of using unmanned aerial systems (UAS) for aerial imagery acquisition, manual field evaluation, and machine learning algorithms to detect the occurrence or not of lodging. UAS imagery was collected on three different dates (23 and 30 July 2019, and 8 August 2019) after lodging occurred. Traditional machine learning and deep learning were evaluated and compared in this study in terms of classification accuracy and standard deviation. For traditional machine learning, five types of features (i.e. gray level co-occurrence matrix, local binary pattern, Gabor, intensity, and Hu-moment) were extracted and fed into three traditional machine learning algorithms (i.e., random forest (RF), neural network, and support vector machine) for detecting lodged plots. For the datasets on each imagery collection date, the accuracies of the three algorithms were not significantly different from each other. For any of the three algorithms, accuracies on the first and last date datasets had the lowest and highest values, respectively. Incorporating standard deviation as a measurement of performance robustness, RF was determined as the most satisfactory. Regarding deep learning, three different convolutional neural networks (simple convolutional neural network, VGG-16, and GoogLeNet) were tested. For any of the single date datasets, GoogLeNet consistently had superior performance over the other two methods. Further comparisons between RF and GoogLeNet demonstrated that the detection accuracies of the two methods were not significantly different from each other (p > 0.05); hence, the choice of any of the two would not affect the final detection accuracies. However, considering the fact that the average accuracy of GoogLeNet (93%) was larger than RF (91%), it was recommended to use GoogLeNet for wheat lodging detection. This research demonstrated that UAS RGB imagery, coupled with the GoogLeNet machine learning algorithm, can be a novel, reliable, objective, simple, low-cost, and effective (accuracy > 90%) tool for wheat lodging detection.

Download Full-text

Research on Classification Method of Maize Seed Defect Based on Machine Vision

Journal of Sensors ◽

10.1155/2019/2716975 ◽

2019 ◽

Vol 2019 ◽

pp. 1-9

Author(s):

Sheng Huang ◽

Xiaofei Fan ◽

Lei Sun ◽

Yanlu Shen ◽

Xuesong Suo

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Learning Algorithm ◽

Machine Learning Algorithms ◽

Heat Map ◽

Deep Learning Algorithm ◽

Quality Classification ◽

Visualization Technology ◽

Better Than

Traditionally, the classification of seed defects mainly relies on the characteristics of color, shape, and texture. This method requires repeated extraction of a large amount of feature information, which is not efficiently used in detection. In recent years, deep learning has performed well in the field of image recognition. We introduced convolutional neural networks (CNNs) and transfer learning into the quality classification of seeds and compared them with traditional machine learning algorithms. Experiments showed that deep learning algorithm was significantly better than the machine learning algorithm with an accuracy of 95% (GoogLeNet) vs. 79.2% (SURF+SVM). We used three classifiers in GoogLeNet to demonstrate that network accuracy increases as the depth of the network increases. We used the visualization technology to obtain the feature map of each layer of the network in CNNs and used the heat map to represent the probability distribution of the inference results. As an end-to-end network, CNNs can be easily applied for automated seed manufacturing.

Download Full-text

Construction of Innovation and Entrepreneurship Platform Based on Deep Learning Algorithm

Scientific Programming ◽

10.1155/2021/1833979 ◽

2021 ◽

Vol 2021 ◽

pp. 1-7

Author(s):

Jian Li ◽

Yongyan Zhao

Keyword(s):

Machine Learning ◽

College Students ◽

Neural Networks ◽

Deep Learning ◽

National Economy ◽

Learning Algorithm ◽

Learning Algorithms ◽

Computing Power ◽

Deep Learning Algorithm ◽

Innovation And Entrepreneurship

As the national economy has entered a stage of rapid development, the national economy and social development have also ushered in the “14th Five-Year Plan,” and the country has also issued support policies to encourage and guide college students to start their own businesses. Therefore, the establishment of an innovation and entrepreneurship platform has a significant impact on China’s economy. This gives college students great support and help in starting a business. The theory of deep learning algorithms originated from the development of artificial neural networks and is another important field of machine learning. As the computing power of computers has been greatly improved, especially the computing power of GPU can quickly train deep neural networks, deep learning algorithms have become an important research direction. The deep learning algorithm is a nonlinear network structure and a standard modeling method in the field of machine learning. After modeling various templates, they can be identified and implemented. This article uses a combination of theoretical research and empirical research, based on the views and research content of some scholars in recent years, and introduces the basic framework and research content of this article. Then, deep learning algorithms are used to analyze the experimental data. Data analysis is performed, and relevant concepts of deep learning algorithms are combined. This article focuses on exploring the construction of an IAE (innovation and entrepreneurship) education platform and making full use of the role of deep learning algorithms to realize the construction of innovation and entrepreneurship platforms. Traditional methods need to extract features through manual design, then perform feature classification, and finally realize the function of recognition. The deep learning algorithm has strong data image processing capabilities and can quickly process large-scale data. Research data show that 49.5% of college students and 35.2% of undergraduates expressed their interest in entrepreneurship. Entrepreneurship is a good choice to relieve employment pressure.

Download Full-text

Comparative study on machine learning algorithms for early fire forest detection system using geodata.

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v10i5.pp5507-5513 ◽

2020 ◽

Vol 10 (5) ◽

pp. 5507

Author(s):

Zouiten Mohammed ◽

Chaaouan Hanae ◽

Setti Larbi

Keyword(s):

Machine Learning ◽

Forest Fire ◽

Forest Fires ◽

Learning Algorithm ◽

Detection System ◽

Learning Algorithms ◽

Spatial Prediction ◽

Machine Learning Algorithms ◽

Geographical Information ◽

Deep Learning Algorithm

Forest fires have caused considerable losses to ecologies, societies and economies worldwide. To minimize these losses and reduce forest fires, modeling and predicting the occurrence of forest fires are meaningful because they can support forest fire prevention and management. In recent years, the convolutional neural network (CNN) has become an important state-of-the-art deep learning algorithm, and its implementation has enriched many fields. Therefore, a competitive spatial prediction model for automatic early detection of wild forest fire using machine learning algorithms can be proposed. This model can help researchers to predict forest fires and identify risk zonas. System using machine learning algorithm on geodata will be able to notify in real time the interested parts and authorities by providing alerts and presenting on maps based on geographical treatments for more efficacity and analyzing of the situation. This research extends the application of machine learning algorithms for early fire forest prediction to detection and representation in geographical information system (GIS) maps.

Download Full-text

DeepKhib: a deep-learning framework for lysine 2-hydroxyisobutyrylation sites prediction

10.1101/2020.08.14.250712 ◽

2020 ◽

Author(s):

Luna Zhang ◽

Yang Zou ◽

Ningning He ◽

Yu Chen ◽

Zhen Chen ◽

...

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Learning Algorithm ◽

Machine Learning Algorithms ◽

Prediction Algorithm ◽

Learning Models ◽

Post Translational Modification ◽

Deep Learning Algorithm ◽

Multiple Species ◽

Species Specific

AbstractAs a novel type of post-translational modification, lysine 2-Hydroxyisobutyrylation (Khib) plays an important role in gene transcription and signal transduction. In order to understand its regulatory mechanism, the essential step is the recognition of Khib sites. Thousands of Khib sites have been experimentally verified across five different species. However, there are only a couple traditional machine-learning algorithms developed to predict Khib sites for limited species, lacking a general prediction algorithm. We constructed a deep-learning algorithm based on convolutional neural network with the one-hot encoding approach, dubbed CNNOH. It performs favorably to the traditional machine-learning models and other deep-learning models across different species, in terms of cross-validation and independent test. The area under the ROC curve (AUC) values for CNNOH ranged from 0.82 to 0.87 for different organisms, which is superior to the currently-available Khib predictors. Moreover, we developed the general model based on the integrated data from multiple species and it showed great universality and effectiveness with the AUC values in the range of 0.79 to 0.87. Accordingly, we constructed the on-line prediction tool dubbed DeepKhib for easily identifying Khib sites, which includes both species-specific and general models. DeepKhib is available at http://www.bioinfogo.org/DeepKhib.

Download Full-text

AERIAL POINT CLOUD CLASSIFICATION WITH DEEP LEARNING AND MACHINE LEARNING ALGORITHMS

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-4-w18-843-2019 ◽

2019 ◽

Vol XLII-4/W18 ◽

pp. 843-849

Author(s):

E. Özdemir ◽

F. Remondino ◽

A. Golkar

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Learning Algorithm ◽

Learning Algorithms ◽

Point Clouds ◽

Machine Learning Algorithms ◽

Geometric Features ◽

Semantic Classes ◽

3D Point Clouds ◽

City Models

Abstract. With recent advances in technology, 3D point clouds are getting more and more frequently requested and used, not only for visualization needs but also e.g. by public administrations for urban planning and management. 3D point clouds are also a very frequent source for generating 3D city models which became recently more available for many applications, such as urban development plans, energy evaluation, navigation, visibility analysis and numerous other GIS studies. While the main data sources remained the same (namely aerial photogrammetry and LiDAR), the way these city models are generated have been evolving towards automation with different approaches. As most of these approaches are based on point clouds with proper semantic classes, our aim is to classify aerial point clouds into meaningful semantic classes, e.g. ground level objects (GLO, including roads and pavements), vegetation, buildings’ facades and buildings’ roofs. In this study we tested and evaluated various machine learning algorithms for classification, including three deep learning algorithms and one machine learning algorithm. In the experiments, several hand-crafted geometric features depending on the dataset are used and, unconventionally, these geometric features are used also for deep learning.

Download Full-text

Waste Management Using Machine Learning and Deep Learning Algorithms

International Journal on Perceptive and Cognitive Computing ◽

10.31436/ijpcc.v6i2.165 ◽

2020 ◽

Vol 6 (2) ◽

pp. 97-106

Author(s):

Khan Nasik Sami ◽

Zian Md Afique Amin ◽

Raini Hassan

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Random Forest ◽

Decision Tree ◽

Waste Management ◽

Learning Algorithm ◽

Learning Algorithms ◽

Classification Problem ◽

Deep Learning Algorithm ◽

Unskilled Workers

Waste Management is one of the essential issues that the world is currently facing does not matter if the country is developed or under developing. The key issue in this waste segregation is that the trash bin at open spots gets flooded well ahead of time before the beginning of the following cleaning process. The isolation of waste is done by unskilled workers which is less effective, time-consuming, and not plausible because of a lot of waste. So, we are proposing an automated waste classification problem utilizing Machine Learning and Deep Learning algorithms. The goal of this task is to gather a dataset and arrange it into six classes consisting of glass, paper, and metal, plastic, cardboard, and waste. The model that we have used are classification models. For our research we did comparisons between four algorithms, those are CNN, SVM, Random Forest, and Decision Tree. As our concern is a classification problem, we have used several machine learning and deep learning algorithm that best fits for classification solutions. For our model, CNN accomplished high characterization on accuracy around 90%, while SVM additionally indicated an excellent transformation to various kinds of waste which were 85%, and Random Forest and Decision Tree have accomplished 55% and 65% respectively

Download Full-text