This Person Does Exist

Temes de Disseny ◽

10.46467/tdd37.2021.214-225 ◽

2021 ◽

pp. 214-225

Author(s):

Matthias Schäfer

Keyword(s):

Machine Learning ◽

Public Discourse ◽

Ethical Issues ◽

Machine Learning Techniques ◽

Artistic Practice ◽

Found Objects ◽

Creative Commons ◽

Learning Techniques ◽

Machine Learning Model ◽

Popular Image

This Person Does Exist is an artistic approach to exploring a large dataset of photographic portraits in a randomised manner. The dataset was originally created by Nvidia Research Lab, which has scraped and analysed creative commons images from the popular image hosting platform Flickr. These pictures were then used to train a machine learning model which can create new stochastic images of faces. In contrast to a popular website that showcases the computer generated images, I am displaying random faces from the dataset with their corresponding metadata. This essay looks into extractivist mechanisms in current machine learning techniques, using the internet to populate and refine databases, while focusing on artistic approaches that expose them. I make the case for Dataset Art as an emerging field which reframes scientific corpora by placing them into galleries and exhibiting them as found objects online. Finally, I argue that this artistic practice is a legitimate way of opening up a larger public discourse, although artists working with human data must be aware of ethical issues and responsibilities regarding privacy and consent.

Download Full-text

Leveraging Machine Learning in Financial Fraud Forensics in the Age of Cybersecurity

10.4018/978-1-7998-8386-9.ch010 ◽

2022 ◽

pp. 220-249

Author(s):

Md Ariful Haque ◽

Sachin Shetty

Keyword(s):

Machine Learning ◽

Financial Institutions ◽

Machine Learning Techniques ◽

Cyber Attack ◽

Financial Industry ◽

Security Breaches ◽

Financial Gain ◽

Learning Techniques ◽

Machine Learning Model ◽

Machine Learning Models

Financial sectors are lucrative cyber-attack targets because of their immediate financial gain. As a result, financial institutions face challenges in developing systems that can automatically identify security breaches and separate fraudulent transactions from legitimate transactions. Today, organizations widely use machine learning techniques to identify any fraudulent behavior in customers' transactions. However, machine learning techniques are often challenging because of financial institutions' confidentiality policy, leading to not sharing the customer transaction data. This chapter discusses some crucial challenges of handling cybersecurity and fraud in the financial industry and building machine learning-based models to address those challenges. The authors utilize an open-source e-commerce transaction dataset to illustrate the forensic processes by creating a machine learning model to classify fraudulent transactions. Overall, the chapter focuses on how the machine learning models can help detect and prevent fraudulent activities in the financial sector in the age of cybersecurity.

Download Full-text

Machine Learning Generalisation across Different 3D Architectural Heritage

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi9060379 ◽

2020 ◽

Vol 9 (6) ◽

pp. 379 ◽

Cited By ~ 4

Author(s):

Eleonora Grilli ◽

Fabio Remondino

Keyword(s):

Machine Learning ◽

Point Cloud ◽

Machine Learning Techniques ◽

Training Dataset ◽

High Complexity ◽

Architectural Heritage ◽

Learning Techniques ◽

Machine Learning Model ◽

Point Cloud Classification

The use of machine learning techniques for point cloud classification has been investigated extensively in the last decade in the geospatial community, while in the cultural heritage field it has only recently started to be explored. The high complexity and heterogeneity of 3D heritage data, the diversity of the possible scenarios, and the different classification purposes that each case study might present, makes it difficult to realise a large training dataset for learning purposes. An important practical issue that has not been explored yet, is the application of a single machine learning model across large and different architectural datasets. This paper tackles this issue presenting a methodology able to successfully generalise to unseen scenarios a random forest model trained on a specific dataset. This is achieved looking for the best features suitable to identify the classes of interest (e.g., wall, windows, roof and columns).

Download Full-text

Classification Framework for Healthy Hairs and Alopecia Areata: A Machine Learning (ML) Approach

Computational and Mathematical Methods in Medicine ◽

10.1155/2021/1102083 ◽

2021 ◽

Vol 2021 ◽

pp. 1-10

Author(s):

Choudhary Sobhan Shakeel ◽

Saad Jawaid Khan ◽

Beenish Chaudhry ◽

Syeda Fatima Aijaz ◽

Umer Hassan

Keyword(s):

Machine Learning ◽

Alopecia Areata ◽

Nearest Neighbor ◽

Autoimmune Disorder ◽

Machine Learning Techniques ◽

Support Vector ◽

Classification Framework ◽

Learning Techniques ◽

Machine Learning Model ◽

Image Set

Alopecia areata is defined as an autoimmune disorder that results in hair loss. The latest worldwide statistics have exhibited that alopecia areata has a prevalence of 1 in 1000 and has an incidence of 2%. Machine learning techniques have demonstrated potential in different areas of dermatology and may play a significant role in classifying alopecia areata for better prediction and diagnosis. We propose a framework pertaining to the classification of healthy hairs and alopecia areata. We used 200 images of healthy hairs from the Figaro1k dataset and 68 hair images of alopecia areata from the Dermnet dataset to undergo image preprocessing including enhancement and segmentation. This was followed by feature extraction including texture, shape, and color. Two classification techniques, i.e., support vector machine (SVM) and k -nearest neighbor (KNN), are then applied to train a machine learning model with 70% of the images. The remaining image set was used for the testing phase. With a 10-fold cross-validation, the reported accuracies of SVM and KNN are 91.4% and 88.9%, respectively. Paired sample T -test showed significant differences between the two accuracies with a p < 0.001 . SVM generated higher accuracy (91.4%) as compared to KNN (88.9%). The findings of our study demonstrate potential for better prediction in the field of dermatology.

Download Full-text

Machine learning techniques to predict daily rainfall amount

Journal Of Big Data ◽

10.1186/s40537-021-00545-4 ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Chalachew Muluken Liyew ◽

Haileyesus Amsaya Melese

Keyword(s):

Machine Learning ◽

Pearson Correlation ◽

Daily Rainfall ◽

Learning Model ◽

Machine Learning Techniques ◽

Gradient Boosting ◽

Correlation Technique ◽

Learning Techniques ◽

Machine Learning Model ◽

Extreme Gradient Boosting

AbstractPredicting the amount of daily rainfall improves agricultural productivity and secures food and water supply to keep citizens healthy. To predict rainfall, several types of research have been conducted using data mining and machine learning techniques of different countries’ environmental datasets. An erratic rainfall distribution in the country affects the agriculture on which the economy of the country depends on. Wise use of rainfall water should be planned and practiced in the country to minimize the problem of the drought and flood occurred in the country. The main objective of this study is to identify the relevant atmospheric features that cause rainfall and predict the intensity of daily rainfall using machine learning techniques. The Pearson correlation technique was used to select relevant environmental variables which were used as an input for the machine learning model. The dataset was collected from the local meteorological office at Bahir Dar City, Ethiopia to measure the performance of three machine learning techniques (Multivariate Linear Regression, Random Forest, and Extreme Gradient Boost). Root mean squared error and Mean absolute Error methods were used to measure the performance of the machine learning model. The result of the study revealed that the Extreme Gradient Boosting machine learning algorithm performed better than others.

Download Full-text

Identification and Implementation Perspective on the Stratification Algorithms in the Prognostication of Heart Disease using Machine Learning Techniques

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.c8663.019320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 1223-1225

Keyword(s):

Machine Learning ◽

Heart Disease ◽

Heart Diseases ◽

Classification Algorithm ◽

Machine Learning Techniques ◽

Data Set ◽

Learning Techniques ◽

Machine Learning Model ◽

Combination Of Classifiers ◽

Great Idea

Analysis of patient’s data is always a great idea to get accurate results on using classifiers. A combination of classifiers would give an accurate result than using a single classifier because one single classifier does not give accurate results but always appropriate ones. The aim is to predict the outcome feature of the data set. The “outcome” can contain only two values that is 0 and 1. 0 means patient doesn’t have heart disease and 1 means patient have heart diseases. So, there is a need to build a classification algorithm that can predict the Outcome feature of the test dataset with good accuracy. For this understanding the data is important, and then various classification algorithm can be tested. Then the best model can be selected which gives highest accuracy among all. The built model can then be given to the software developer for building the end user application using the selected machine learning model that will be able to predict the heart disease in a patient.

Download Full-text

Public Opinions and Concerns Regarding the Canadian Prime Minister’s Daily COVID-19 Briefing: Longitudinal Study of YouTube Comments Using Machine Learning Techniques

Journal of Medical Internet Research ◽

10.2196/23957 ◽

2021 ◽

Vol 23 (2) ◽

pp. e23957

Author(s):

Chengda Zheng ◽

Jia Xue ◽

Yumin Sun ◽

Tingshao Zhu

Keyword(s):

Machine Learning ◽

Language Processing ◽

Public Discourse ◽

Latent Dirichlet Allocation ◽

Prime Minister ◽

Machine Learning Techniques ◽

Future Health ◽

The Public ◽

Learning Techniques ◽

The Government

Background During the COVID-19 pandemic in Canada, Prime Minister Justin Trudeau provided updates on the novel coronavirus and the government’s responses to the pandemic in his daily briefings from March 13 to May 22, 2020, delivered on the official Canadian Broadcasting Corporation (CBC) YouTube channel. Objective The aim of this study was to examine comments on Canadian Prime Minister Trudeau’s COVID-19 daily briefings by YouTube users and track these comments to extract the changing dynamics of the opinions and concerns of the public over time. Methods We used machine learning techniques to longitudinally analyze a total of 46,732 English YouTube comments that were retrieved from 57 videos of Prime Minister Trudeau’s COVID-19 daily briefings from March 13 to May 22, 2020. A natural language processing model, latent Dirichlet allocation, was used to choose salient topics among the sampled comments for each of the 57 videos. Thematic analysis was used to classify and summarize these salient topics into different prominent themes. Results We found 11 prominent themes, including strict border measures, public responses to Prime Minister Trudeau’s policies, essential work and frontline workers, individuals’ financial challenges, rental and mortgage subsidies, quarantine, government financial aid for enterprises and individuals, personal protective equipment, Canada and China’s relationship, vaccines, and reopening. Conclusions This study is the first to longitudinally investigate public discourse and concerns related to Prime Minister Trudeau’s daily COVID-19 briefings in Canada. This study contributes to establishing a real-time feedback loop between the public and public health officials on social media. Hearing and reacting to real concerns from the public can enhance trust between the government and the public to prepare for future health emergencies.

Download Full-text

Big Five Personality Prediction from Social Media Data using Machine Learning Techniques

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.d7946.049420 ◽

2020 ◽

Vol 9 (4) ◽

pp. 2412-2417

Keyword(s):

Machine Learning ◽

Big Five ◽

Machine Learning Techniques ◽

Big Five Personality ◽

Learning Techniques ◽

Machine Learning Model ◽

Personality Prediction ◽

Twitter Users ◽

Bayes Algorithm ◽

Media Data

Personality has been important for a number of types of cooperation; it has useful in predicting job achievement, expert and emotional relationship achievement, and even tendency towards a variety of interfaces. To accurately examine the characters of users, a personality test must be carried out. In numerous areas of online life it is usually impractical to use character research. . We used SVM classification, Random Forest algorithm, Naïve Bayes Algorithm and Logistic regression to comparatively predict the user’s personality accurately. The main goal of the paper is to evaluate the machine learning models using the four parameters- accuracy, precision, recall, f1 score and basing upon these parameters the best machine learning model will be used to classify the big five personality traits of the twitter users.

Download Full-text

Clinical Data Analysis for Prediction of Cardiovascular Disease Using Machine Learning Techniques

Computational Intelligence and Neuroscience ◽

10.1155/2022/2973324 ◽

2022 ◽

Vol 2022 ◽

pp. 1-13

Author(s):

Rajkumar Gangappa Nadakinamani ◽

A. Reyana ◽

Sandeep Kautish ◽

A. S. Vibith ◽

Yogita Gupta ◽

...

Keyword(s):

Machine Learning ◽

Cardiovascular Disease ◽

Cardiac Risk ◽

Machine Learning Algorithms ◽

Random Tree ◽

Machine Learning Techniques ◽

Learning Technology ◽

Tree Model ◽

Learning Techniques ◽

Machine Learning Model

Cardiovascular disease is difficult to detect due to several risk factors, including high blood pressure, cholesterol, and an abnormal pulse rate. Accurate decision-making and optimal treatment are required to address cardiac risk. As machine learning technology advances, the healthcare industry’s clinical practice is likely to change. As a result, researchers and clinicians must recognize the importance of machine learning techniques. The main objective of this research is to recommend a machine learning-based cardiovascular disease prediction system that is highly accurate. In contrast, modern machine learning algorithms such as REP Tree, M5P Tree, Random Tree, Linear Regression, Naive Bayes, J48, and JRIP are used to classify popular cardiovascular datasets. The proposed CDPS’s performance was evaluated using a variety of metrics to identify the best suitable machine learning model. When it came to predicting cardiovascular disease patients, the Random Tree model performed admirably, with the highest accuracy of 100%, the lowest MAE of 0.0011, the lowest RMSE of 0.0231, and the fastest prediction time of 0.01 seconds.

Download Full-text

Machine learning distributions of quantum ansatz with hierarchical structure

International Journal of Modern Physics B ◽

10.1142/s0217979220501969 ◽

2020 ◽

Vol 34 (20) ◽

pp. 2050196

Author(s):

Haozhen Situ ◽

Zhimin He

Keyword(s):

Machine Learning ◽

Quantum Systems ◽

Machine Learning Techniques ◽

Learning Techniques ◽

Machine Learning Model ◽

Variational Autoencoder ◽

Systems Learning ◽

Near Term ◽

Learning Measurement

Machine learning techniques can help to represent and solve quantum systems. Learning measurement outcome distribution of quantum ansatz is useful for characterization of near-term quantum computing devices. In this work, we use the popular unsupervised machine learning model, variational autoencoder (VAE), to reconstruct the measurement outcome distribution of quantum ansatz. The number of parameters in the VAE are compared with the number of measurement outcomes. The numerical results show that VAE can efficiently learn the measurement outcome distribution with few parameters. The influence of entanglement on the task is also revealed.

Download Full-text

Development of accurate classification of heavenly bodies using novel machine learning techniques

Soft Computing ◽

10.1007/s00500-021-05687-4 ◽

2021 ◽

Vol 25 (10) ◽

pp. 7213-7228

Author(s):

Michał Wierzbiński ◽

Paweł Pławiak ◽

Mohamed Hammad ◽

U. Rajendra Acharya

Keyword(s):

Machine Learning ◽

Outer Space ◽

Principal Component ◽

Sloan Digital Sky Survey ◽

Machine Learning Techniques ◽

Evolutionary System ◽

Stellar Spectra ◽

Learning Techniques ◽

Machine Learning Model

AbstractThe heavenly bodies are objects that swim in the outer space. The classification of these objects is a challenging task for astronomers. This article presents a novel methodology that enables an efficient and accurate classification of cosmic objects (3 classes) based on evolutionary optimization of classifiers. This research collected the data from Sloan Digital Sky Survey database. In this work, we are proposing to develop a novel machine learning model to classify stellar spectra of stars, quasars and galaxies. First, the input data are normalized and then subjected to principal component analysis to reduce the dimensionality. Then, the genetic algorithm is implemented on the data which helps to find the optimal parameters for the classifiers. We have used 21 classifiers to develop an accurate and robust classification with fivefold cross-validation strategy. Our developed model has achieved an improvement in the accuracy using nineteen out of twenty-one models. We have obtained the highest classification accuracy of 99.16%, precision of 98.78%, recall of 98.08% and F1-score of 98.32% using evolutionary system based on voting classifier. The developed machine learning prototype can help the astronomers to make accurate classification of heavenly bodies in the sky. Proposed evolutionary system can be used in other areas where accurate classification of many classes is required.

Download Full-text