A Framework for Privacy Quantification: Measuring the Impact of Privacy Techniques Through Mutual Information, Distance Mapping, and Machine Learning

Yoan Miche; Wei Ren; Ian Oliver; Silke Holtmanns; Amaury Lendasse

doi:10.1007/s12559-018-9604-7

Forecasting US movies box office performances in Turkey using machine learning algorithms

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189120 ◽

2020 ◽

Vol 39 (5) ◽

pp. 6579-6590

Author(s):

Sandy Çağlıyor ◽

Başar Öztayşi ◽

Selime Sezgin

Keyword(s):

Machine Learning ◽

Global Economy ◽

Learning Algorithms ◽

Forecast Model ◽

Machine Learning Algorithms ◽

Gradient Boosting ◽

High Stakes ◽

Box Office ◽

Industry Forecast ◽

The Impact

The motion picture industry is one of the largest industries worldwide and has significant importance in the global economy. Considering the high stakes and high risks in the industry, forecast models and decision support systems are gaining importance. Several attempts have been made to estimate the theatrical performance of a movie before or at the early stages of its release. Nevertheless, these models are mostly used for predicting domestic performance, and the industry still struggles to predict box office performance in overseas markets. In this study, the aim is to design a forecast model using different machine learning algorithms to estimate the theatrical success of US movies in Turkey. A dataset of 1559 movies is constructed from various sources. First, independent variables are grouped as pre-release, distributor type, and international distribution based on their characteristics. The number of attendances is discretized into three classes. Four popular machine learning algorithms (artificial neural networks, decision tree regression, gradient boosting trees, and random forests) are employed, and the impact of each group is observed by comparing the performance of the models. Then the number of target classes is increased to five and eight, and the results are compared with previously developed models in the literature.
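The abstract does not say how the attendance figures were discretized. As a minimal sketch, assuming equal-frequency (quantile) binning, the step could look like the following; the function name `discretize_by_quantiles` and the sample values are hypothetical and not taken from the study.

```python
import numpy as np


def discretize_by_quantiles(values, n_classes=3):
    """Map continuous values to class labels 0..n_classes-1 using quantile cut points.

    Assumption: equal-frequency binning. The study may have used other thresholds.
    """
    values = np.asarray(values, dtype=float)
    # Interior cut points, e.g. the 1/3 and 2/3 quantiles for three classes.
    cut_points = np.quantile(values, np.linspace(0.0, 1.0, n_classes + 1)[1:-1])
    # A value equal to a cut point falls into the lower class (right=True).
    return np.digitize(values, cut_points, right=True)


# Hypothetical attendance counts, for illustration only.
attendance = [10, 20, 30, 40, 50, 60]
labels = discretize_by_quantiles(attendance, n_classes=3)
```

Raising `n_classes` to five or eight gives the finer-grained targets mentioned at the end of the abstract, under the same binning assumption.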


The impact of economic plans on the Chinese education system: a machine learning approach

CADMO ◽

10.3280/cad2018-001005 ◽

2018 ◽

pp. 37-49

Author(s):

Wenjun Lin ◽

Xuefu Xu ◽

Francesco Dell’Anna

Keyword(s):

Machine Learning ◽

Education System ◽

Learning Approach ◽

Chinese Education ◽

System A ◽

Machine Learning Approach ◽

The Impact


Image scrambling effect evaluation method based on mutual information distance of difference image

Journal of Computer Applications ◽

10.3724/sp.j.1087.2009.01293 ◽

2009 ◽

Vol 29 (5) ◽

pp. 1293-1296 ◽

Cited By ~ 3

Author(s):

Cheng-mao WU ◽

Xiao-ping TIAN ◽

Tie-niu TAN

Keyword(s):

Mutual Information ◽

Evaluation Method ◽

Effect Evaluation ◽

Image Scrambling ◽

Difference Image ◽

Information Distance


PODC 2020 Review

ACM SIGACT News ◽

10.1145/3444815.3444827 ◽

2021 ◽

Vol 51 (4) ◽

pp. 75-81

Author(s):

Ahad Mirza Baig ◽

Alkida Balliu ◽

Peter Davies ◽

Michal Dory

Keyword(s):

Machine Learning ◽

Distributed Computing ◽

Keynote Speaker ◽

Lively Discussion ◽

Theoretical Understanding ◽

New Directions ◽

New Ideas ◽

New Challenges ◽

The Impact ◽

Distributed Machine Learning

Rachid Guerraoui was the first keynote speaker, and he got things off to a great start by discussing the broad relevance of the research done in our community relative to both industry and academia. He first argued that, in some sense, the fact that distributed computing is so pervasive nowadays could end up stifling progress in our community by inducing people to work on marginal problems, and becoming isolated. His first suggestion was to try to understand and incorporate new ideas coming from applied fields into our research, and argued that this has been historically very successful. He illustrated this point via the distributed payment problem, which appears in the context of blockchains, in particular Bitcoin, but then turned out to be very theoretically interesting; furthermore, the theoretical understanding of the problem inspired new practical protocols. He then went further to discuss new directions in distributed computing, such as the COVID tracing problem, and new challenges in Byzantine-resilient distributed machine learning. Another source of innovation Rachid suggested was hardware innovations, which he illustrated with work studying the impact of RDMA-based primitives on fundamental problems in distributed computing. The talk concluded with a very lively discussion.


Machine Learning Based Device Simulation Using Multi-variable Non-linear Regression to Assess the Impact of Device Parameter Variability on Threshold Voltage of Double Gate-All-Around (DGAA) MOSFET

2020 IEEE 2nd International Conference on Circuits and Systems (ICCS) ◽

10.1109/iccs51219.2020.9336608 ◽

2020 ◽

Author(s):

Sandeep Moparthi ◽

Chandan Yadav ◽

Gopi Krishna Saramekala ◽

Pramod Kumar Tiwari

Keyword(s):

Machine Learning ◽

Linear Regression ◽

Threshold Voltage ◽

Device Simulation ◽

Double Gate ◽

Device Parameter ◽

Non Linear ◽

The Impact


A Geometric Perspective on Information Plane Analysis

Entropy ◽

10.3390/e23060711 ◽

2021 ◽

Vol 23 (6) ◽

pp. 711

Author(s):

Mina Basirat ◽

Bernhard C. Geiger ◽

Peter M. Roth

Keyword(s):

Neural Network ◽

Mutual Information ◽

Geometric Interpretation ◽

Neural Network Training ◽

Neural Network Learning ◽

Network Learning ◽

Plane Analysis ◽

Network Training ◽

Hidden Layer ◽

The Impact

Information plane analysis, which describes the mutual information between the input and a hidden layer and between a hidden layer and the target over time, has recently been proposed to analyze the training of neural networks. Since the activations of a hidden layer are typically continuous-valued, this mutual information cannot be computed analytically and must thus be estimated, resulting in apparently inconsistent or even contradictory results in the literature. The goal of this paper is to demonstrate how information plane analysis can still be a valuable tool for analyzing neural network training. To this end, we complement the prevailing binning estimator for mutual information with a geometric interpretation. With this geometric interpretation in mind, we evaluate the impact of regularization and interpret phenomena such as underfitting and overfitting. In addition, we investigate neural network learning in the presence of noisy data and noisy labels.
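For readers unfamiliar with the binning estimator mentioned in the abstract, here is a minimal sketch for two scalar variables. It assumes equal-width bins and a plug-in estimate from the joint histogram; the function name `binned_mutual_information` and the default of 10 bins are illustrative choices, not the paper's setup, which deals with whole hidden-layer activations.

```python
import numpy as np


def binned_mutual_information(x, y, bins=10):
    """Plug-in estimate of I(X;Y) in bits after equal-width binning of x and y."""
    joint_counts, _, _ = np.histogram2d(x, y, bins=bins)
    p_xy = joint_counts / joint_counts.sum()   # joint distribution over bin pairs
    p_x = p_xy.sum(axis=1, keepdims=True)      # marginal of x, shape (bins, 1)
    p_y = p_xy.sum(axis=0, keepdims=True)      # marginal of y, shape (1, bins)
    occupied = p_xy > 0                        # skip empty cells to avoid log(0)
    ratio = p_xy[occupied] / (p_x @ p_y)[occupied]
    return float(np.sum(p_xy[occupied] * np.log2(ratio)))


rng = np.random.default_rng(0)
x = rng.normal(size=5000)
y_independent = rng.normal(size=5000)

mi_independent = binned_mutual_information(x, y_independent)  # close to zero
mi_self = binned_mutual_information(x, x)                     # entropy of binned x
```

The estimate depends on the number of bins and the sample size, which is one plausible source of the inconsistent results the abstract refers to.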


Individualized embryo selection strategy developed by stacking machine learning model for better in vitro fertilization outcomes: an application study

Reproductive Biology and Endocrinology ◽

10.1186/s12958-021-00734-z ◽

2021 ◽

Vol 19 (1) ◽

Author(s):

Qingsong Xi ◽

Qiyu Yang ◽

Meng Wang ◽

Bo Huang ◽

Bo Zhang ◽

...

Keyword(s):

Machine Learning ◽

In Vitro Fertilization ◽

Endometrial Thickness ◽

Learning System ◽

Embryo Selection ◽

Selection Strategy ◽

Application Study ◽

Vitro Fertilization ◽

The Impact

Background: Significant efforts have been made to minimize the rate of in vitro fertilization (IVF)-associated multiple-embryo gestation. Previous studies related to machine learning in IVF mainly focused on selecting the top-quality embryos to improve outcomes; however, in patients with sub-optimal prognosis or with medium- or inferior-quality embryos, the selection between single embryo transfer (SET) and double embryo transfer (DET) could be perplexing. Methods: This was an application study including 9211 patients with 10,076 embryos treated from 2016 to 2018 in Tongji Hospital, Wuhan, China. A hierarchical model was established using the machine learning system XGBoost to learn embryo implantation potential and the impact of DET simultaneously. The performance of the model was evaluated with the AUC of the ROC curve. Multiple regression analyses were also conducted on the 19 selected features to demonstrate the differences between feature importance for prediction and statistical relationship with outcomes. Results: For SET pregnancy, the following variables remained significant: age, attempts at IVF, estradiol level on hCG day, and endometrial thickness. For DET pregnancy, age, attempts at IVF, endometrial thickness, and the newly added P1 + P2 remained significant. For DET twin risk, age, attempts at IVF, 2PN/MII, and P1 × P2 remained significant. The algorithm was repeated 30 times, and average AUCs of 0.7945, 0.8385, and 0.7229 were achieved for SET pregnancy, DET pregnancy, and DET twin risk, respectively. The trends of predicted and observed rates for both pregnancy and twin risk were essentially identical. XGBoost outperformed the other two algorithms: logistic regression and classification and regression tree. Conclusion: Artificial intelligence based on determinant-weighting analysis could offer an individualized embryo selection strategy for any given patient and predict clinical pregnancy rate and twin risk, thereby optimizing clinical outcomes.
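The AUC used to evaluate the model can be read as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case. A minimal sketch of that pairwise definition follows; the function name `roc_auc` and the toy labels and scores are hypothetical and unrelated to the study's data.

```python
import numpy as np


def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison of positives and negatives.

    Ties between a positive and a negative score count as half a correct ranking.
    """
    labels = np.asarray(labels).astype(bool)
    scores = np.asarray(scores, dtype=float)
    positives = scores[labels]
    negatives = scores[~labels]
    if positives.size == 0 or negatives.size == 0:
        raise ValueError("AUC needs at least one positive and one negative case")
    # Matrix of score differences for every (positive, negative) pair.
    diff = positives[:, None] - negatives[None, :]
    return float(np.mean((diff > 0) + 0.5 * (diff == 0)))


# Toy example: three of the four positive/negative pairs are ranked correctly.
auc = roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
```

Averaging this value over repeated train/test runs would give a figure comparable in kind to the averaged AUCs quoted in the abstract, although the study's exact resampling procedure is not described here.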


The Impact of COVID-19 Epidemic on Indian Economy Unleashed By Machine Learning

IOP Conference Series Materials Science and Engineering ◽

10.1088/1757-899x/1022/1/012085 ◽

2021 ◽

Vol 1022 ◽

pp. 012085

Author(s):

Kamal Deep Garg ◽

Manik Gupta ◽

Munish Kumar

Keyword(s):

Machine Learning ◽

Indian Economy ◽

The Impact


Evaluation of Urban-Scale Building Energy-Use Models and Tools—Application for the City of Fribourg, Switzerland

Sustainability ◽

10.3390/su13041595 ◽

2021 ◽

Vol 13 (4) ◽

pp. 1595

Author(s):

Valeria Todeschi ◽

Roberto Boghetti ◽

Jérôme H. Kämpf ◽

Guglielmina Mutani

Keyword(s):

Machine Learning ◽

Energy Use ◽

Residential Buildings ◽

Building Energy ◽

Energy Performance ◽

Space Heating ◽

Engineering Model ◽

Building Energy Use ◽

The Impact ◽

The City

Building energy-use models and tools can simulate and represent the distribution of energy consumption of buildings located in an urban area. The aim of these models is to simulate the energy performance of buildings at multiple temporal and spatial scales, taking into account both the building shape and the surrounding urban context. This paper investigates existing models by simulating the hourly space heating consumption of residential buildings in an urban environment. Existing bottom-up urban-energy models were applied to the city of Fribourg in order to evaluate the accuracy and flexibility of energy simulations. Two common energy-use models (a machine learning model and a GIS-based engineering model) were compared and evaluated against anonymized monitoring data. The study shows that the simulations were quite precise, with an annual mean absolute percentage error of 12.8% and 19.3% for the machine learning and the GIS-based engineering model, respectively, on residential buildings built in different periods of construction. Moreover, a sensitivity analysis using the Morris method was carried out on the GIS-based engineering model in order to assess the impact of input variables on space heating consumption and to identify possible optimization opportunities for the existing model.
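The mean absolute percentage error (MAPE) quoted above compares simulated values with measured ones. A minimal sketch of the standard formula is given below; the function name and the sample consumption values are hypothetical, and the paper's exact aggregation (for example per building or per year) is not specified in the abstract.

```python
import numpy as np


def mean_absolute_percentage_error(measured, simulated):
    """Return MAPE in percent: mean of |measured - simulated| / |measured| * 100."""
    measured = np.asarray(measured, dtype=float)
    simulated = np.asarray(simulated, dtype=float)
    if np.any(measured == 0):
        raise ValueError("MAPE is undefined when a measured value is zero")
    return float(np.mean(np.abs((measured - simulated) / measured)) * 100.0)


# Hypothetical annual space heating consumption values, for illustration only.
measured = [100.0, 200.0, 400.0]
simulated = [110.0, 180.0, 400.0]
mape = mean_absolute_percentage_error(measured, simulated)
```

Because each error is divided by the measured value, buildings with very low consumption can dominate the average, which is worth keeping in mind when comparing models.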
