Automated Assessment of Quality of Jupyter Notebooks Using Artificial Intelligence and Big Code

Author(s):  
Priti Oli ◽  
Rabin Banjade ◽  
Lasang Jimba Tamang ◽  
Vasile Rus

We present in this paper an automated method to assess the quality of Jupyter notebooks. The quality of notebooks is assessed in terms of reproducibility and executability. Specifically, we automatically extract a number of expert-defined features for each notebook, perform a feature selection step, and then train supervised binary classifiers to predict whether a notebook is reproducible and executable, respectively. We also experimented with semantic code embeddings to capture the notebooks' semantics. We evaluated these methods on a dataset of 306,539 notebooks and achieved an F1 score of 0.87 for reproducibility and 0.96 for executability (using expert-defined features) and an F1 score of 0.81 for reproducibility and 0.78 for executability (using code embeddings). Our results suggest that semantic code embeddings can be used to determine the reproducibility and executability of Jupyter notebooks with good performance, and since they can be derived automatically, they have the advantage of requiring no expert involvement to define features.
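The F1 scores reported above combine precision and recall for a binary classifier. As a minimal sketch of how such a score is computed, assuming labels where 1 means "executable" and 0 means "not executable" (the label encoding and the toy data below are hypothetical and not taken from the paper):

```python
def f1_score(y_true, y_pred):
    """F1 for the positive class (label 1) of a binary classifier."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    if tp == 0:
        # No correct positive prediction: precision or recall is zero.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical labels: 1 = notebook executes, 0 = it does not.
y_true = [1, 1, 1, 0, 0, 1, 0, 1]
y_pred = [1, 1, 0, 0, 1, 1, 0, 1]
score = f1_score(y_true, y_pred)
print(round(score, 2))
```

Here precision and recall are both 4/5, so the harmonic mean is 0.8.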

Author(s):  
Kenong Su ◽  
Tianwei Yu ◽  
Hao Wu

Cell clustering is one of the most important and commonly performed tasks in single-cell RNA sequencing (scRNA-seq) data analysis. An important step in cell clustering is to select a subset of genes (referred to as ‘features’), whose expression patterns will then be used for downstream clustering. A good set of features should include the ones that distinguish different cell types, and the quality of such a set could have a significant impact on the clustering accuracy. All existing scRNA-seq clustering tools include a feature selection step relying on some simple unsupervised feature selection methods, mostly based on the statistical moments of gene-wise expression distributions. In this work, we carefully evaluate the impact of feature selection on cell clustering accuracy. In addition, we develop a feature selection algorithm named FEAture SelecTion (FEAST), which provides more representative features. We apply the method to 12 public scRNA-seq datasets and demonstrate that using features selected by FEAST with existing clustering tools significantly improves the clustering accuracy.
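To illustrate the kind of simple moment-based selection the abstract attributes to existing tools, the sketch below ranks genes by dispersion (variance divided by mean) and keeps the top k. This is not FEAST; the function name, the choice of dispersion as the score, and the toy expression values are all assumptions made for illustration.

```python
from statistics import mean, pvariance


def top_genes_by_dispersion(expr, k):
    """Rank genes by variance/mean across cells and return the top k.

    expr: dict mapping gene name -> list of expression values, one per cell.
    """
    scores = {}
    for gene, values in expr.items():
        m = mean(values)
        # Genes with zero mean carry no signal; give them the lowest score.
        scores[gene] = pvariance(values) / m if m > 0 else 0.0
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Hypothetical toy data: geneA is on in half the cells, geneB is flat.
expr = {
    "geneA": [0, 0, 10, 12, 0, 11],
    "geneB": [5, 5, 6, 5, 5, 6],
    "geneC": [1, 0, 2, 1, 0, 1],
}
selected = top_genes_by_dispersion(expr, 2)
print(selected)
```

A gene that is strongly expressed in only some cells scores high, while a uniformly expressed gene scores low, which is why such scores serve as a cheap unsupervised proxy for cell-type markers.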


2020 ◽  
Vol 17 (6) ◽  
pp. 76-91
Author(s):  
E. D. Solozhentsev

The scientific problem of economics “Managing the quality of human life” is formulated on the basis of artificial intelligence, the algebra of logic and logical-probabilistic calculus. Managing the quality of human life is represented as managing the processes of a person's treatment, training and decision making. Events in these processes and the corresponding logical variables relate to the behavior of the person, other persons and infrastructure. The processes of the quality of human life are modeled, analyzed and managed with the participation of the person himself. Scenarios and structural, logical and probabilistic models of managing the quality of human life are given. Special software for quality management is described. The relationship between human quality of life and the digital economy is examined. We consider the role of public opinion in the management of the “bottom” based on a synthesis of many studies on the management of the economy and the state. The bottom management is also feedback from the top management.


AI Magazine ◽  
2012 ◽  
Vol 34 (1) ◽  
pp. 10 ◽  
Author(s):  
Steve Kelling ◽  
Jeff Gerbracht ◽  
Daniel Fink ◽  
Carl Lagoze ◽  
Weng-Keen Wong ◽  
...  

In this paper we describe eBird, a citizen-science project that takes advantage of the human observational capacity to identify birds to species, which is then used to accurately represent patterns of bird occurrences across broad spatial and temporal extents. eBird employs artificial intelligence techniques such as machine learning to improve data quality by taking advantage of the synergies between human computation and mechanical computation. We call this a Human-Computer Learning Network, whose core is an active learning feedback loop between humans and machines that dramatically improves the quality of both, and thereby continually improves the effectiveness of the network as a whole. In this paper we explore how Human-Computer Learning Networks can leverage the contributions of a broad recruitment of human observers and process their contributed data with Artificial Intelligence algorithms, leading to a computational power that far exceeds the sum of the individual parts.


2021 ◽  
Vol 11 (4) ◽  
pp. 1880
Author(s):  
Roberta Fusco ◽  
Adele Piccirillo ◽  
Mario Sansone ◽  
Vincenza Granata ◽  
Paolo Vallone ◽  
...  

Purpose: The aim of the study was to estimate the diagnostic accuracy of textural, morphological and dynamic features, extracted from dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) images, by carrying out univariate and multivariate statistical analyses including artificial intelligence approaches. Methods: In total, 85 patients with known breast lesions were enrolled in this retrospective study according to regulations issued by the local Institutional Review Board. All patients underwent DCE-MRI examination. The reference standard was pathology from a surgical specimen for malignant lesions and pathology from a surgical specimen or fine needle aspiration cytology, core or Tru-Cut needle biopsy for benign lesions. In total, 91 samples of 85 patients were analyzed. Furthermore, 48 textural metrics, 15 morphological and 81 dynamic parameters were extracted by manually segmenting regions of interest. Statistical analyses including univariate and multivariate approaches were performed: non-parametric Wilcoxon–Mann–Whitney test; receiver operating characteristic (ROC), linear classifier (LDA), decision tree (DT), k-nearest neighbors (KNN), and support vector machine (SVM) were utilized. A balancing approach and feature selection methods were used. Results: The univariate analysis showed low accuracy and area under the curve (AUC) for all considered features. Instead, in the multivariate textural analysis, the best performance (accuracy (ACC) = 0.78; AUC = 0.78) was reached with all 48 metrics and an LDA trained with balanced data. The best performance (ACC = 0.75; AUC = 0.80) using morphological features was reached with an SVM trained with 10-fold cross-validation (CV) and balanced data (with adaptive synthetic (ADASYN) function) and a subset of five robust morphological features (circularity, rectangularity, sphericity, gleaning and surface). The best performance (ACC = 0.82; AUC = 0.83) using dynamic features was reached with a trained SVM and balanced data (with ADASYN function). Conclusion: Multivariate analyses using pattern recognition approaches, including all morphological, textural and dynamic features, optimized by adaptive synthetic sampling and feature selection operations, obtained the best results and showed the best performance in the discrimination of benign and malignant lesions.
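The abstract reports both a Wilcoxon–Mann–Whitney test and AUC values; the two are closely related, since the AUC equals the Mann–Whitney U statistic divided by the number of positive-negative pairs. The following is a minimal sketch of that relationship, assuming a single feature or classifier score where higher values indicate malignancy. The function name and the toy scores are hypothetical and are not data from the study.

```python
def auc_mann_whitney(scores_pos, scores_neg):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties count as half a correct ranking, matching the Mann-Whitney U
    statistic normalized by len(scores_pos) * len(scores_neg).
    """
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Hypothetical scores for four malignant and four benign lesions.
malignant = [0.9, 0.8, 0.6, 0.55]
benign = [0.7, 0.4, 0.3, 0.2]
auc = auc_mann_whitney(malignant, benign)
print(auc)
```

In this toy case 14 of the 16 malignant-benign pairs are ordered correctly, giving an AUC of 0.875. The pairwise loop is quadratic in the number of samples, which is acceptable for a cohort of this size; a rank-based formula would be preferable for large datasets.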


2020 ◽  
pp. 1-12
Author(s):  
Yingli Duan

Curriculum is the basis of vocational training; its development level and teaching efficiency determine the realization of vocational training objectives, as well as the quality and level of major vocational academic training. The development of curriculum is therefore an important issue, and it affects the building of the school's teaching capacity. The analysis of the latest developments in the main courses shows that there are some deviations or irrationalities in the curriculum in some colleges and universities, along with general problems in understanding the latest courses, such as a lack of solid foundation in curriculum setting, unclear direction of objectives, unclear reform ideas, inadequate and unsystematic construction measures, and a lack of attention to the quality of education. This paper explains the rules for the establishment of first-level courses, clarifies the ideas and priorities of architecture, and explores strategies for building university-level courses using knowledge of artificial intelligence and neural network algorithms in order to gain experience from them.


2014 ◽  
Vol 571-572 ◽  
pp. 105-108
Author(s):  
Lin Xu

This paper proposes a new framework that combines reinforcement learning with a cloud computing digital library. Unified self-learning algorithms, which include reinforcement learning and artificial intelligence, among others, have led to many essential advances. Given the current status of highly-available models, analysts urgently desire the deployment of write-ahead logging. In this paper we examine how DNS can be applied to the investigation of superblocks, and introduce reinforcement learning to improve the quality of the current cloud computing digital library. The experimental results show that the method works more efficiently.


2021 ◽  
Author(s):  
Tiancheng Yang ◽  
Shah Nazir

With the development and advancement of information technology, artificial intelligence (AI) and machine learning (ML) are applied in every sector of life. Among these applications, music is one that has gained attention in the last couple of years. The music industry has been revolutionized by AI-based innovative and intelligent techniques. It is very convenient for composers to compose music of high quality using these technologies. Artificial intelligence and Music (AIM) is one of the emerging fields used to generate and manage sounds for different media like the Internet, games, etc. Sounds in games are very effective and can be made more attractive by implementing AI approaches. The quality of sounds in a game directly impacts the productivity and experience of the player. With computer-assisted technologies, game designers can create sounds for different scenarios or situations like horror and suspense and provide information to the gamer. The practical and productive audio of a game can guide visually impaired people during other events in the game. For the better creation and composition of music, good knowledge of musicology is essential. Due to AIM, there are a lot of intelligent and interactive tools available for the efficient and effective learning of music. Learners can be provided with a very reliable and interactive environment based on artificial intelligence. The current study presents a detailed overview of the literature available in the area of research. The study analyzes the literature from various perspectives, which will serve as evidence for researchers to devise novel solutions in the field.

