ModelSet: a dataset for machine learning in model-driven engineering

AbstractThe application of machine learning (ML) algorithms to address problems related to model-driven engineering (MDE) is currently hindered by the lack of curated datasets of software models. There are several reasons for this, including the lack of large collections of good quality models, the difficulty to label models due to the required domain expertise, and the relative immaturity of the application of ML to MDE. In this work, we present ModelSet, a labelled dataset of software models intended to enable the application of ML to address software modelling problems. To create it we have devised a method designed to facilitate the exploration and labelling of model datasets by interactively grouping similar models using off-the-shelf technologies like a search engine. We have built an Eclipse plug-in to support the labelling process, which we have used to label 5,466 Ecore meta-models and 5,120 UML models with its category as the main label plus additional secondary labels of interest. We have evaluated the ability of our labelling method to create meaningful groups of models in order to speed up the process, improving the effectiveness of classical clustering methods. We showcase the usefulness of the dataset by applying it in a real scenario: enhancing the MAR search engine. We use ModelSet to train models able to infer useful metadata to navigate search results. The dataset and the tooling are available at https://figshare.com/s/5a6c02fa8ed20782935c and a live version at http://modelset.github.io.

Download Full-text

A Model-Driven Engineering Approach for Monitoring Machine Learning Models

10.1109/models-c53483.2021.00028 ◽

2021 ◽

Author(s):

Panagiotis Kourouklidis ◽

Dimitris Kolovos ◽

Joost Noppen ◽

Nicholas Matragkas

Keyword(s):

Machine Learning ◽

Model Driven Engineering ◽

Learning Models ◽

Model Driven ◽

Engineering Approach ◽

Machine Learning Models

Download Full-text

Maturity of software modelling and model driven engineering: a survey in the Italian industry

16th International Conference on Evaluation & Assessment in Software Engineering (EASE 2012) ◽

10.1049/ic.2012.0012 ◽

2012 ◽

Cited By ~ 15

Author(s):

F. Tomassetti ◽

M. Torchiano ◽

A. Tiso ◽

F. Ricca ◽

G. Reggio

Keyword(s):

Model Driven Engineering ◽

Model Driven ◽

Software Modelling

Download Full-text

Data Analytics and Machine Learning Methods, Techniques and Tool for Model-Driven Engineering of Smart IoT Services

2021 IEEE/ACM 43rd International Conference on Software Engineering: Companion Proceedings (ICSE-Companion) ◽

10.1109/icse-companion52605.2021.00130 ◽

2021 ◽

Author(s):

Armin Moin

Keyword(s):

Machine Learning ◽

Data Analytics ◽

Model Driven Engineering ◽

Learning Methods ◽

Model Driven ◽

Machine Learning Methods

Download Full-text

Model-Driven Data Warehouse Automation

Advances in Systems Analysis, Software Engineering, and High Performance Computing - Advances and Applications in Model-Driven Engineering ◽

10.4018/978-1-4666-4494-6.ch011 ◽

2014 ◽

pp. 240-267

Author(s):

Moez Essaidi ◽

Aomar Osmani ◽

Céline Rouveirol

Keyword(s):

Machine Learning ◽

Data Warehouse ◽

Inductive Logic Programming ◽

Inductive Logic ◽

Machine Learning Techniques ◽

Learning Approach ◽

Model Driven Engineering ◽

Model Driven ◽

Transformation Rules ◽

Learning Techniques

Transformation design is a key step in model-driven engineering, and it is a very challenging task, particularly in context of the model-driven data warehouse. Currently, this process is ensured by human experts. The authors propose a new methodology using machine learning techniques to automatically derive these transformation rules. The main goal is to automatically derive the transformation rules to be applied in the model-driven data warehouse process. The proposed solution allows for a simple design of the decision support systems and the reduction of time and costs of development. The authors use the inductive logic programming framework to learn these transformation rules from examples of previous projects. Then, they find that in model-driven data warehouse application, dependencies exist between transformations. Therefore, the authors investigate a new machine learning methodology, learning dependent-concepts, that is suitable to solve this kind of problem. The experimental evaluation shows that the dependent-concept learning approach gives significantly better results.

Download Full-text

Towards Model-Driven Engineering for Big Data Analytics -- An Exploratory Analysis of Domain-Specific Languages for Machine Learning

2014 47th Hawaii International Conference on System Sciences ◽

10.1109/hicss.2014.101 ◽

2014 ◽

Cited By ~ 7

Author(s):

Dominic Breuker

Keyword(s):

Machine Learning ◽

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Exploratory Analysis ◽

Model Driven Engineering ◽

Domain Specific Languages ◽

Model Driven ◽

Domain Specific

Download Full-text

Generating Graphical User Interfaces Based on Model Driven Engineering

International Review on Computers and Software (IRECOS) ◽

10.15866/irecos.v10i5.6303 ◽

2015 ◽

Vol 10 (5) ◽

pp. 520 ◽

Cited By ~ 1

Author(s):

Sarra Roubi ◽

Mohammed Erramdani ◽

Samir Mbarki

Keyword(s):

User Interfaces ◽

Graphical User Interfaces ◽

Model Driven Engineering ◽

Model Driven

Download Full-text

Proceedings of the 23rd ACM/IEEE International Conference on Model Driven Engineering Languages and Systems

10.1145/3365438 ◽

2020 ◽

Keyword(s):

Model Driven Engineering ◽

International Conference ◽

Model Driven ◽

Ieee International Conference

Download Full-text

Foreword: Quality in model driven engineering

2012 Eighth International Conference on the Quality of Information and Communications Technology ◽

10.1109/quatic.2012.77 ◽

2012 ◽

Author(s):

Vasco Amaral

Keyword(s):

Model Driven Engineering ◽

Model Driven

Download Full-text

Model-driven Engineering for High-Performance Computing Applications

Modeling Simulation and Optimization - Focus on Applications ◽

10.5772/8969 ◽

2010 ◽

Cited By ~ 5

Author(s):

David Lugato ◽

Jean-Michel Bruel ◽

Ileana Ober

Keyword(s):

High Performance Computing ◽

High Performance ◽

Model Driven Engineering ◽

Model Driven ◽

Performance Computing

Download Full-text

A Review on Human–AI Interaction in Machine Learning and Insights for Medical Applications

International Journal of Environmental Research and Public Health ◽

10.3390/ijerph18042121 ◽

2021 ◽

Vol 18 (4) ◽

pp. 2121

Author(s):

Mansoureh Maadi ◽

Hadi Akbarzadeh Khorshidi ◽

Uwe Aickelin

Keyword(s):

Machine Learning ◽

Future Research ◽

Computational Power ◽

Medical Field ◽

Interactive Machine Learning ◽

Human In The Loop ◽

Human Interactions ◽

Scoping Literature Review ◽

Domain Expertise ◽

Expertise Level

Objective: To provide a human–Artificial Intelligence (AI) interaction review for Machine Learning (ML) applications to inform how to best combine both human domain expertise and computational power of ML methods. The review focuses on the medical field, as the medical ML application literature highlights a special necessity of medical experts collaborating with ML approaches. Methods: A scoping literature review is performed on Scopus and Google Scholar using the terms “human in the loop”, “human in the loop machine learning”, and “interactive machine learning”. Peer-reviewed papers published from 2015 to 2020 are included in our review. Results: We design four questions to investigate and describe human–AI interaction in ML applications. These questions are “Why should humans be in the loop?”, “Where does human–AI interaction occur in the ML processes?”, “Who are the humans in the loop?”, and “How do humans interact with ML in Human-In-the-Loop ML (HILML)?”. To answer the first question, we describe three main reasons regarding the importance of human involvement in ML applications. To address the second question, human–AI interaction is investigated in three main algorithmic stages: 1. data producing and pre-processing; 2. ML modelling; and 3. ML evaluation and refinement. The importance of the expertise level of the humans in human–AI interaction is described to answer the third question. The number of human interactions in HILML is grouped into three categories to address the fourth question. We conclude the paper by offering a discussion on open opportunities for future research in HILML.

Download Full-text