On the Impact of Multi-language Development in Machine Learning Frameworks

The motion picture industry is one of the largest industries worldwide and has significant importance in the global economy. Considering the high stakes and high risks in the industry, forecast models and decision support systems are gaining importance. Several attempts have been made to estimate the theatrical performance of a movie before or at the early stages of its release. Nevertheless, these models are mostly used for predicting domestic performance, and the industry still struggles to predict box office performance in overseas markets. In this study, the aim is to design a forecast model using different machine learning algorithms to estimate the theatrical success of US movies in Turkey. From various sources, a dataset of 1559 movies is constructed. First, independent variables are grouped as pre-release, distributor type, and international distribution based on their characteristics. The number of attendances is discretized into three classes. Four popular machine learning algorithms (artificial neural networks, decision tree regression, gradient boosting trees, and random forests) are employed, and the impact of each variable group is observed by comparing the performance of the resulting models. Then the number of target classes is increased to five and eight, and the results are compared with models previously developed in the literature.
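The discretization step in the abstract above can be illustrated with a minimal sketch. The abstract does not say how the class boundaries were chosen, so equal-frequency (quantile) binning is an assumption here, and the attendance figures and the helper names `quantile_edges` and `discretize` are hypothetical.

```python
def quantile_edges(values, n_classes):
    """Return n_classes - 1 cut points that split values into roughly equal-sized groups."""
    ordered = sorted(values)
    n = len(ordered)
    return [ordered[(i * n) // n_classes] for i in range(1, n_classes)]


def discretize(value, edges):
    """Map a value to a class index: the number of cut points it meets or exceeds."""
    return sum(value >= edge for edge in edges)


# Made-up attendance counts, for illustration only (not data from the study).
attendance = [1200, 5400, 800, 150000, 23000, 47000, 310000, 9600, 76000]

# Assumption: equal-frequency binning into three classes (0 = low, 1 = medium, 2 = high).
edges = quantile_edges(attendance, 3)
labels = [discretize(a, edges) for a in attendance]
print(edges, labels)
```

Raising `n_classes` to five or eight gives the finer-grained targets mentioned in the abstract, at the cost of fewer examples per class.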


The impact of economic plans on the Chinese education system: a machine learning approach

CADMO ◽

10.3280/cad2018-001005 ◽

2018 ◽

pp. 37-49

Author(s):

Wenjun Lin ◽

Xuefu Xu ◽

Francesco Dell’Anna

Keyword(s):

Machine Learning ◽

Education System ◽

Learning Approach ◽

Chinese Education ◽

System A ◽

Machine Learning Approach ◽

The Impact


PODC 2020 Review

ACM SIGACT News ◽

10.1145/3444815.3444827 ◽

2021 ◽

Vol 51 (4) ◽

pp. 75-81

Author(s):

Ahad Mirza Baig ◽

Alkida Balliu ◽

Peter Davies ◽

Michal Dory

Keyword(s):

Machine Learning ◽

Distributed Computing ◽

Keynote Speaker ◽

Lively Discussion ◽

Theoretical Understanding ◽

New Directions ◽

New Ideas ◽

New Challenges ◽

The Impact ◽

Distributed Machine Learning

Rachid Guerraoui was the first keynote speaker, and he got things off to a great start by discussing the broad relevance of the research done in our community relative to both industry and academia. He first argued that, in some sense, the fact that distributed computing is so pervasive nowadays could end up stifling progress in our community by inducing people to work on marginal problems, and becoming isolated. His first suggestion was to try to understand and incorporate new ideas coming from applied fields into our research, and argued that this has been historically very successful. He illustrated this point via the distributed payment problem, which appears in the context of blockchains, in particular Bitcoin, but then turned out to be very theoretically interesting; furthermore, the theoretical understanding of the problem inspired new practical protocols. He then went further to discuss new directions in distributed computing, such as the COVID tracing problem, and new challenges in Byzantine-resilient distributed machine learning. Another source of innovation Rachid suggested was hardware innovations, which he illustrated with work studying the impact of RDMA-based primitives on fundamental problems in distributed computing. The talk concluded with a very lively discussion.


Machine Learning Based Device Simulation Using Multi-variable Non-linear Regression to Assess the Impact of Device Parameter Variability on Threshold Voltage of Double Gate-All-Around (DGAA) MOSFET

2020 IEEE 2nd International Conference on Circuits and Systems (ICCS) ◽

10.1109/iccs51219.2020.9336608 ◽

2020 ◽

Author(s):

Sandeep Moparthi ◽

Chandan Yadav ◽

Gopi Krishna Saramekala ◽

Pramod Kumar Tiwari

Keyword(s):

Machine Learning ◽

Linear Regression ◽

Threshold Voltage ◽

Device Simulation ◽

Double Gate ◽

Device Parameter ◽

Non Linear ◽

The Impact


Review on Machine Learning Frameworks in Drivers’ Physiological Signal Analysis to Detect Stress

2021 7th International Conference on Control, Instrumentation and Automation (ICCIA) ◽

10.1109/iccia52082.2021.9403605 ◽

2021 ◽

Author(s):

Maryam Memar ◽

Amin Mokaribolhassan ◽

Amir Aminzadeh Ghavifekr

Keyword(s):

Machine Learning ◽

Signal Analysis ◽

Physiological Signal ◽

Learning Frameworks


Individualized embryo selection strategy developed by stacking machine learning model for better in vitro fertilization outcomes: an application study

Reproductive Biology and Endocrinology ◽

10.1186/s12958-021-00734-z ◽

2021 ◽

Vol 19 (1) ◽

Author(s):

Qingsong Xi ◽

Qiyu Yang ◽

Meng Wang ◽

Bo Huang ◽

Bo Zhang ◽

...

Keyword(s):

Machine Learning ◽

In Vitro Fertilization ◽

Endometrial Thickness ◽

Learning System ◽

Embryo Selection ◽

Selection Strategy ◽

Application Study ◽

Vitro Fertilization ◽

The Impact

Abstract
Background: Significant efforts have been made to minimize the rate of in vitro fertilization (IVF)-associated multiple-embryo gestation. Previous studies of machine learning in IVF mainly focused on selecting top-quality embryos to improve outcomes. However, in patients with sub-optimal prognosis or with medium- or inferior-quality embryos, the choice between single embryo transfer (SET) and double embryo transfer (DET) could be perplexing.
Methods: This was an application study including 9211 patients with 10,076 embryos treated from 2016 to 2018 in Tongji Hospital, Wuhan, China. A hierarchical model was established using the machine learning system XGBoost to learn embryo implantation potential and the impact of DET simultaneously. The performance of the model was evaluated with the AUC of the ROC curve. Multiple regression analyses were also conducted on the 19 selected features to demonstrate the differences between feature importance for prediction and statistical relationship with outcomes.
Results: For SET pregnancy, the following variables remained significant: age, attempts at IVF, estradiol level on hCG day, and endometrial thickness. For DET pregnancy, age, attempts at IVF, endometrial thickness, and the newly added P1 + P2 remained significant. For DET twin risk, age, attempts at IVF, 2PN/MII, and P1 × P2 remained significant. The algorithm was repeated 30 times, and averaged AUCs of 0.7945, 0.8385, and 0.7229 were achieved for SET pregnancy, DET pregnancy, and DET twin risk, respectively. The trends of predicted and observed rates, for both pregnancy and twin risk, were essentially identical. XGBoost outperformed the other two algorithms, logistic regression and classification and regression tree.
Conclusion: Artificial intelligence based on determinant-weighting analysis could offer an individualized embryo selection strategy for any given patient and predict clinical pregnancy rate and twin risk, thereby optimizing clinical outcomes.
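The evaluation described above (AUC of the ROC curve, averaged over repeated runs) can be sketched in plain Python. This is a minimal illustration, not the authors' pipeline: the AUC is computed through its rank interpretation (the probability that a randomly chosen positive case scores higher than a randomly chosen negative one), and the labels, scores, and function names are hypothetical.

```python
from statistics import mean


def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count as 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def averaged_auc(runs):
    """Average the AUC over repeated runs, each given as a (labels, scores) pair."""
    return mean(roc_auc(y, s) for y, s in runs)


# Two made-up runs; the study reports an average over 30 repetitions.
runs = [
    ([1, 0, 1, 0], [0.9, 0.2, 0.4, 0.6]),
    ([1, 1, 0, 0], [0.8, 0.7, 0.3, 0.1]),
]
print(averaged_auc(runs))
```

The pairwise form is quadratic in the number of cases, which is fine for a small illustration; a sort-based computation would be the usual choice for a dataset of several thousand patients.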


The Impact of COVID-19 Epidemic on Indian Economy Unleashed By Machine Learning

IOP Conference Series Materials Science and Engineering ◽

10.1088/1757-899x/1022/1/012085 ◽

2021 ◽

Vol 1022 ◽

pp. 012085

Author(s):

Kamal Deep Garg ◽

Manik Gupta ◽

Munish Kumar

Keyword(s):

Machine Learning ◽

Indian Economy ◽

The Impact


Evaluation of Urban-Scale Building Energy-Use Models and Tools—Application for the City of Fribourg, Switzerland

Sustainability ◽

10.3390/su13041595 ◽

2021 ◽

Vol 13 (4) ◽

pp. 1595

Author(s):

Valeria Todeschi ◽

Roberto Boghetti ◽

Jérôme H. Kämpf ◽

Guglielmina Mutani

Keyword(s):

Machine Learning ◽

Energy Use ◽

Residential Buildings ◽

Building Energy ◽

Energy Performance ◽

Space Heating ◽

Engineering Model ◽

Building Energy Use ◽

The Impact ◽

The City

Building energy-use models and tools can simulate and represent the distribution of energy consumption of buildings located in an urban area. The aim of these models is to simulate the energy performance of buildings at multiple temporal and spatial scales, taking into account both the building shape and the surrounding urban context. This paper investigates existing models by simulating the hourly space heating consumption of residential buildings in an urban environment. Existing bottom-up urban-energy models were applied to the city of Fribourg in order to evaluate the accuracy and flexibility of energy simulations. Two common energy-use models, a machine learning model and a GIS-based engineering model, were compared and evaluated against anonymized monitoring data. The study shows that the simulations were quite precise, with an annual mean absolute percentage error of 12.8% and 19.3% for the machine learning and the GIS-based engineering model, respectively, on residential buildings built in different periods of construction. Moreover, a sensitivity analysis using the Morris method was carried out on the GIS-based engineering model in order to assess the impact of input variables on space heating consumption and to identify possible optimization opportunities of the existing model.
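For reference, the mean absolute percentage error used above to compare the two models can be computed as in the following sketch. The abstract does not state how the hourly simulations were aggregated into an annual figure, so this shows only the standard definition; the consumption values and the function name `mape` are hypothetical.

```python
def mape(observed, predicted):
    """Mean absolute percentage error, in percent, relative to the observed values."""
    if not observed or len(observed) != len(predicted):
        raise ValueError("inputs must be non-empty and of equal length")
    if any(o == 0 for o in observed):
        raise ValueError("MAPE is undefined when an observed value is zero")
    return 100.0 * sum(abs((o - p) / o) for o, p in zip(observed, predicted)) / len(observed)


# Made-up annual space heating consumption per building, in kWh.
monitored = [100.0, 200.0, 400.0]
simulated = [90.0, 220.0, 400.0]
print(mape(monitored, simulated))
```

Because each error is divided by the observed value, buildings with very low monitored consumption can dominate the average, which is worth keeping in mind when comparing models across construction periods.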


Machine Learning-Based Assessment of the Impact of the Manufacturing Process on Battery Electrode Heterogeneity

Energy and AI ◽

10.1016/j.egyai.2021.100090 ◽

2021 ◽

pp. 100090

Author(s):

Marc Duquesnoy ◽

Iker Boyano ◽

Larraitz Ganborena ◽

Pablo Cereijo ◽

Elixabete Ayerbe ◽

...

Keyword(s):

Machine Learning ◽

Manufacturing Process ◽

Battery Electrode ◽

The Impact


Classification of multiwavelength transients with Machine Learning

Monthly Notices of the Royal Astronomical Society ◽

10.1093/mnras/staa3873 ◽

2020 ◽

Author(s):

K Sooknunan ◽

M Lochner ◽

Bruce A Bassett ◽

H V Peiris ◽

R Fender ◽

...

Keyword(s):

Machine Learning ◽

Small Sample ◽

Light Curves ◽

Machine Learning Techniques ◽

Optical Data ◽

Test Time ◽

Test Accuracy ◽

Training Set ◽

The Impact

Abstract With the advent of powerful telescopes such as the Square Kilometer Array and the Vera C. Rubin Observatory, we are entering an era of multiwavelength transient astronomy that will lead to a dramatic increase in data volume. Machine learning techniques are well suited to address this data challenge and rapidly classify newly detected transients. We present a multiwavelength classification algorithm consisting of three steps: (1) interpolation and augmentation of the data using Gaussian processes; (2) feature extraction using wavelets; (3) classification with random forests. Augmentation provides improved performance at test time by balancing the classes and adding diversity into the training set. In the first application of machine learning to the classification of real radio transient data, we apply our technique to the Green Bank Interferometer and other radio light curves. We find we are able to accurately classify most of the eleven classes of radio variables and transients after just eight hours of observations, achieving an overall test accuracy of 78%. We fully investigate the impact of the small sample size of 82 publicly available light curves and use data augmentation techniques to mitigate the effect. We also show that, on a significantly larger simulated representative training set, the algorithm achieves an overall accuracy of 97%, illustrating that the method is likely to provide excellent performance on future surveys. Finally, we demonstrate the effectiveness of simultaneous multiwavelength observations by showing how incorporating just one optical data point into the analysis improves the accuracy of the worst performing class by 19%.
