Voting with random classifiers (VORACE): theoretical and experimental analysis

Cristina Cornelio; Michele Donini; Andrea Loreggia; Maria Silvia Pini; Francesca Rossi

doi:10.1007/s10458-021-09504-y

Voting with random classifiers (VORACE): theoretical and experimental analysis

Autonomous Agents and Multi-Agent Systems ◽

10.1007/s10458-021-09504-y ◽

2021 ◽

Vol 35 (2) ◽

Author(s):

Cristina Cornelio ◽

Michele Donini ◽

Andrea Loreggia ◽

Maria Silvia Pini ◽

Francesca Rossi

Keyword(s):

State Of The Art ◽

Empirical Evaluation ◽

Parameter Tuning ◽

Voting Rules ◽

Specific Domain ◽

Ensemble Technique ◽

Voting Rule ◽

Input Sample ◽

Learning Scenarios ◽

A New Technique

AbstractIn many machine learning scenarios, looking for the best classifier that fits a particular dataset can be very costly in terms of time and resources. Moreover, it can require deep knowledge of the specific domain. We propose a new technique which does not require profound expertise in the domain and avoids the commonly used strategy of hyper-parameter tuning and model selection. Our method is an innovative ensemble technique that uses voting rules over a set of randomly-generated classifiers. Given a new input sample, we interpret the output of each classifier as a ranking over the set of possible classes. We then aggregate these output rankings using a voting rule, which treats them as preferences over the classes. We show that our approach obtains good results compared to the state-of-the-art, both providing a theoretical analysis and an empirical evaluation of the approach on several datasets.

Download Full-text

Approximate Mining of Frequent -Subgraph Patterns in Evolving Graphs

ACM Transactions on Knowledge Discovery from Data ◽

10.1145/3442590 ◽

2021 ◽

Vol 15 (3) ◽

pp. 1-35

Author(s):

Muhammad Anis Uddin Nasir ◽

Cigdem Aslay ◽

Gianmarco De Francisci Morales ◽

Matteo Riondato

Keyword(s):

Sample Size ◽

State Of The Art ◽

Empirical Evaluation ◽

Connected Subgraph ◽

Dynamic Graphs ◽

High Quality ◽

Complexity Bounds ◽

Approximation Quality ◽

Uniform Sample ◽

Waiting For Godot

“Perhaps he could dance first and think afterwards, if it isn’t too much to ask him.” S. Beckett, Waiting for Godot Given a labeled graph, the collection of -vertex induced connected subgraph patterns that appear in the graph more frequently than a user-specified minimum threshold provides a compact summary of the characteristics of the graph, and finds applications ranging from biology to network science. However, finding these patterns is challenging, even more so for dynamic graphs that evolve over time, due to the streaming nature of the input and the exponential time complexity of the problem. We study this task in both incremental and fully-dynamic streaming settings, where arbitrary edges can be added or removed from the graph. We present TipTap , a suite of algorithms to compute high-quality approximations of the frequent -vertex subgraphs w.r.t. a given threshold, at any time (i.e., point of the stream), with high probability. In contrast to existing state-of-the-art solutions that require iterating over the entire set of subgraphs in the vicinity of the updated edge, TipTap operates by efficiently maintaining a uniform sample of connected -vertex subgraphs, thanks to an optimized neighborhood-exploration procedure. We provide a theoretical analysis of the proposed algorithms in terms of their unbiasedness and of the sample size needed to obtain a desired approximation quality. Our analysis relies on sample-complexity bounds that use Vapnik–Chervonenkis dimension, a key concept from statistical learning theory, which allows us to derive a sufficient sample size that is independent from the size of the graph. The results of our empirical evaluation demonstrates that TipTap returns high-quality results more efficiently and accurately than existing baselines.

Download Full-text

InstantDL: an easy-to-use deep learning pipeline for image segmentation and classification

BMC Bioinformatics ◽

10.1186/s12859-021-04037-3 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Dominik Jens Elias Waibel ◽

Sayedali Shetab Boushehri ◽

Carsten Marr

Keyword(s):

Image Processing ◽

Deep Learning ◽

Specific Problem ◽

State Of The Art ◽

Image Data ◽

Semantic Segmentation ◽

Parameter Tuning ◽

Cellular Processes ◽

Minimal Effort ◽

Instance Segmentation

Abstract Background Deep learning contributes to uncovering molecular and cellular processes with highly performant algorithms. Convolutional neural networks have become the state-of-the-art tool to provide accurate and fast image data processing. However, published algorithms mostly solve only one specific problem and they typically require a considerable coding effort and machine learning background for their application. Results We have thus developed InstantDL, a deep learning pipeline for four common image processing tasks: semantic segmentation, instance segmentation, pixel-wise regression and classification. InstantDL enables researchers with a basic computational background to apply debugged and benchmarked state-of-the-art deep learning algorithms to their own data with minimal effort. To make the pipeline robust, we have automated and standardized workflows and extensively tested it in different scenarios. Moreover, it allows assessing the uncertainty of predictions. We have benchmarked InstantDL on seven publicly available datasets achieving competitive performance without any parameter tuning. For customization of the pipeline to specific tasks, all code is easily accessible and well documented. Conclusions With InstantDL, we hope to empower biomedical researchers to conduct reproducible image processing with a convenient and easy-to-use pipeline.

Download Full-text

Decision Making in Committees: Transparency, Reputation, and Voting Rules

The American Economic Review ◽

10.1257/aer.97.1.150 ◽

2007 ◽

Vol 97 (1) ◽

pp. 150-168 ◽

Cited By ~ 96

Author(s):

Gilat Levy

Keyword(s):

Decision Making ◽

Career Concerns ◽

Decision Making Process ◽

Voting Rules ◽

The Public ◽

Voting Rule ◽

The Right

In this paper I analyze the effect of transparency on decision making in committees. I focus on committees whose members are motivated by career concerns. The main result is that when the decision-making process is secretive (when individual votes are not revealed to the public), committee members comply with preexisting biases. For example, if the voting rule demands a supermajority to accept a reform, individuals vote more often against reforms. Transparent committees are therefore more likely to accept reforms. I also find that coupled with the right voting rule, a secretive procedure may induce better decisions than a transparent one. (JEL D71, D72)

Download Full-text

JKRW Link Prediction – A New Ensemble Technique Based on Merging Other Known Techniques in The Social Network Analysis

International Journal of Interactive Mobile Technologies (iJIM) ◽

10.3991/ijim.v15i12.22831 ◽

2021 ◽

Vol 15 (12) ◽

pp. 125

Author(s):

Aya Taleb ◽

Rizik M. H. Al-Sayyed ◽

Hamed S. Al-Bdour

Keyword(s):

Link Prediction ◽

Preferential Attachment ◽

Large Data ◽

The Other ◽

Jaccard Coefficient ◽

Ensemble Technique ◽

Signed Rank ◽

Proposed Model ◽

Different Populations ◽

A New Technique

In this research, a new technique to improve the accuracy of the link prediction for most of the networks is proposed; it is based on the prediction ensemble approach using the voting merging technique. The new proposed ensemble called Jaccard, Katz, and Random models Wrapper (JKRW), it scales up the prediction accuracy and provides better predictions for different sizes of populations including small, medium, and large data. The proposed model has been tested and evaluated based on the area under curve (AUC) and accuracy (ACC) measures. These measures applied to the other models used in this study that has been built based on the Jaccard Coefficient, Katz, Adamic/Adar, and Preferential attachment. Results from applying the evaluation matrices verify the improvement of JKRW effectiveness and stability in comparison to the other tested models. The results from applying the Wilcoxon signed-rank method (one of the non-parametric paired tests) indicate that JKRW has significant differences compared to the other models in the different populations at <strong>0.95</strong> confident interval.

Download Full-text

HyP-ABC: A Novel Automated Hyper-Parameter Tuning Algorithm Using Evolutionary Optimization

10.36227/techrxiv.14714508.v2 ◽

2021 ◽

Author(s):

Leila Zahedi ◽

Farid Ghareh Mohammadi ◽

M. Hadi Amini

Keyword(s):

Parameter Optimization ◽

Real World ◽

Optimization Problems ◽

State Of The Art ◽

Parameter Tuning ◽

Gradient Boosting ◽

Support Vector ◽

Wide Range ◽

Extreme Gradient Boosting ◽

Art Techniques

Machine learning techniques lend themselves as promising decision-making and analytic tools in a wide range of applications. Different ML algorithms have various hyper-parameters. In order to tailor an ML model towards a specific application, a large number of hyper-parameters should be tuned. Tuning the hyper-parameters directly affects the performance (accuracy and run-time). However, for large-scale search spaces, efficiently exploring the ample number of combinations of hyper-parameters is computationally challenging. Existing automated hyper-parameter tuning techniques suffer from high time complexity. In this paper, we propose HyP-ABC, an automatic innovative hybrid hyper-parameter optimization algorithm using the modified artificial bee colony approach, to measure the classification accuracy of three ML algorithms, namely random forest, extreme gradient boosting, and support vector machine. Compared to the state-of-the-art techniques, HyP-ABC is more efficient and has a limited number of parameters to be tuned, making it worthwhile for real-world hyper-parameter optimization problems. We further compare our proposed HyP-ABC algorithm with state-of-the-art techniques. In order to ensure the robustness of the proposed method, the algorithm takes a wide range of feasible hyper-parameter values, and is tested using a real-world educational dataset.

Download Full-text

Scaling Up AND/OR Abstraction Sampling

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/589 ◽

2020 ◽

Author(s):

Kalev Kask ◽

Bobak Pezeshki ◽

Filjor Broka ◽

Alexander Ihler ◽

Rina Dechter

Keyword(s):

Importance Sampling ◽

State Of The Art ◽

Empirical Evaluation ◽

Search Space ◽

Scaling Up ◽

Sampling Scheme ◽

Search Spaces

Abstraction Sampling (AS) is a recently introduced enhancement of Importance Sampling that exploits stratification by using a notion of abstractions: groupings of similar nodes into abstract states. It was previously shown that AS performs particularly well when sampling over an AND/OR search space; however, existing schemes were limited to ``proper'' abstractions in order to ensure unbiasedness, severely hindering scalability. In this paper, we introduce AOAS, a new Abstraction Sampling scheme on AND/OR search spaces that allow more flexible use of abstractions by circumventing the properness requirement. We analyze the properties of this new algorithm and, in an extensive empirical evaluation on five benchmarks, over 480 problems, and comparing against other state of the art algorithms, illustrate AOAS's properties and show that it provides a far more powerful and competitive Abstraction Sampling framework.

Download Full-text

Textual Deblurring using Convolutional Neural Network

10.36227/techrxiv.16760632 ◽

2021 ◽

Author(s):

Muhammad Shahroz Nadeem ◽

Sibt Hussain ◽

Fatih Kurugollu

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Loss Function ◽

State Of The Art ◽

Empirical Evaluation ◽

The State

This paper solves the textual deblurring problem, In this paper we propose a new loss function, we provide empirical evaluation of the design choices based on which a memory friendly CNN model is proposed, that performs better then the state of the art CNN method.

Download Full-text

Learning Non-Taxonomic Relations of Ontologies

International Journal on Semantic Web and Information Systems ◽

10.4018/ijswis.2021010105 ◽

2021 ◽

Vol 17 (1) ◽

pp. 97-122

Author(s):

Mohamed Hassan Mohamed Ali ◽

Said Fathalla ◽

Mohamed Kholief ◽

Yasser Fouad Hassan

Keyword(s):

Systematic Review ◽

State Of The Art ◽

Research Work ◽

Semantic Knowledge ◽

Future Research ◽

Automatic Extraction ◽

Ontology Learning ◽

Comprehensive Understanding ◽

Specific Domain ◽

Taxonomic Relations

Ontologies, as semantic knowledge representation, have a crucial role in various information systems. The main pitfall of manually building ontologies is effort and time-consuming. Ontology learning is a key solution. Learning Non-Taxonomic Relationships of Ontologies (LNTRO) is the process of automatic/semi-automatic extraction of all possible relationships between concepts in a specific domain, except the hierarchal relations. Most of the research works focused on the extraction of concepts and taxonomic relations in the ontology learning process. This article presents the results of a systematic review of the state-of-the-art approaches for LNTRO. Sixteen approaches have been described and qualitatively analyzed. The solutions they provide are discussed along with their respective positive and negative aspects. The goal is to provide researchers in this area a comprehensive understanding of the drawbacks of the existing work, thereby encouraging further improvement of the research work in this area. Furthermore, this article proposes a set of recommendations for future research.

Download Full-text

A Novel Neural Model with Lateral Interaction for Learning Tasks

Neural Computation ◽

10.1162/neco_a_01345 ◽

2020 ◽

pp. 1-24

Author(s):

Dequan Jin ◽

Ziyan Qin ◽

Murong Yang ◽

Penghe Chen

Keyword(s):

Numerical Experiments ◽

State Of The Art ◽

Neural Model ◽

Parameter Tuning ◽

Small Sample ◽

Lateral Interaction ◽

Learning Tasks ◽

Proposed Model ◽

High Level ◽

Elementary Field

We propose a novel neural model with lateral interaction for learning tasks. The model consists of two functional fields: an elementary field to extract features and a high-level field to store and recognize patterns. Each field is composed of some neurons with lateral interaction, and the neurons in different fields are connected by the rules of synaptic plasticity. The model is established on the current research of cognition and neuroscience, making it more transparent and biologically explainable. Our proposed model is applied to data classification and clustering. The corresponding algorithms share similar processes without requiring any parameter tuning and optimization processes. Numerical experiments validate that the proposed model is feasible in different learning tasks and superior to some state-of-the-art methods, especially in small sample learning, one-shot learning, and clustering.

Download Full-text

Least Squares Optimization: From Theory to Practice

Robotics ◽

10.3390/robotics9030051 ◽

2020 ◽

Vol 9 (3) ◽

pp. 51 ◽

Cited By ~ 2

Author(s):

Giorgio Grisetti ◽

Tiziano Guadagnino ◽

Irvin Aloise ◽

Mirco Colosi ◽

Bartolomeo Della Corte ◽

...

Keyword(s):

Computer Vision ◽

Least Squares ◽

Open Source ◽

State Of The Art ◽

Nonlinear Least Squares ◽

Research Community ◽

Specific Domain ◽

Theory To Practice ◽

Least Squares Optimization ◽

Computer Vision Systems

Nowadays, Nonlinear Least-Squares embodies the foundation of many Robotics and Computer Vision systems. The research community deeply investigated this topic in the last few years, and this resulted in the development of several open-source solvers to approach constantly increasing classes of problems. In this work, we propose a unified methodology to design and develop efficient Least-Squares Optimization algorithms, focusing on the structures and patterns of each specific domain. Furthermore, we present a novel open-source optimization system that addresses problems transparently with a different structure and designed to be easy to extend. The system is written in modern C++ and runs efficiently on embedded systemsWe validated our approach by conducting comparative experiments on several problems using standard datasets. The results show that our system achieves state-of-the-art performances in all tested scenarios.

Download Full-text