AVATAR Modulo Theories

This paper introduces a new technique for reasoning with quantifiers and theories. Traditionally, first-order theorem provers (ATPs) are well suited to reasoning with first-order problems containing many quantifiers and satisfiability modulo theories (SMT) solvers are well suited to reasoning with first-order problems in ground theories such as arithmetic. A recent development in first-order theorem proving has been the AVATAR architecture which uses a SAT solver to guide proof search based on a propositional abstraction of the first-order clause space. The approach turns a single proof search into a sequence of proof searches on (much) smaller sub-problems. This work extends the AVATAR approach to use a SMT solver in place of the SAT solver, with the effect that the first-order solver only needs to consider ground-theory-consistent sub-problems. The new architecture has been implemented using the Vampire theorem prover and Z3 SMT solver. Our experimental results, and the results of recent competitions, show that such a combination can be highly effective.

Download Full-text

νZ - Maximal Satisfaction with Z3

10.29007/jmxj ◽

2018 ◽

Author(s):

Nikolaj Bjorner ◽

Anh-Dung Phan

Keyword(s):

Theorem Proving ◽

Optimization Problems ◽

Linear Optimization ◽

Satisfiability Modulo Theories ◽

Objective Functions ◽

Smt Solver ◽

Smt Solvers ◽

Free Variables ◽

Linear Optimization Problems ◽

Pareto Fronts

Satisfiability Modulo Theories, SMT, solvers are used inmany applications. These applications benefit from thepower of tuned and scalable theorem proving technologiesfor supported logics and specialized theory solvers.SMT solvers are primarily used to determine whether formulas are satisfiable.Furthermore, when formulas are satisfiable, many applications need modelsthat assign values to free variables.Yet, in many cases arbitrary assignments are insufficient,and what is really needed is an <i>optimal</i> assignmentwith respect to objective functions. So far, users of Z3,an SMT solver from Microsoft Research, build custom loopsto achieve objective values. This is no longer necessarywith νZ (new-Z, or max-Z), an extension within Z3 that letsusers formulate objective functions directly with Z3. Under the hood there is aportfolio of approaches for solving linear optimization problems overSMT formulas, MaxSMT, and their combinations. Objective functions are combinedas either Pareto fronts, lexicographically, or each objective is optimized independently.

Download Full-text

Global Subsumption Revisited (Briefly)

10.29007/qcd7 ◽

2018 ◽

Author(s):

Giles Reger ◽

Martin Suda

Keyword(s):

Initial Problem ◽

Theorem Prover ◽

General Idea ◽

Practical Implementation ◽

Global Approach ◽

Smt Solver ◽

First Order ◽

Theorem Provers ◽

Sat Solver

Global subsumption is an existing simplification technique for saturation-based first-order theorem provers. The general idea is that we can replace a clause C by its subclause D if D follows from the initial problem as D will subsume C. The effectiveness of the technique comes from a cheap, global approach for (incompletely) checking whether D is a consequence of the initial problem. The idea is to produce and maintain a set S of ground clauses that follow from the input (e.g. grounded versions of all derived clauses) and to check whether a grounding of D follows from this set. As this is now a propositional problem this check can be performed by a SAT solver, making it efficient. In this paper we review the global subsumption technique and pose a number of questions related to the practical implementation of global subsumption and possible variations of the approach. We consider, for example, which groundings to place in S, how to select the subclause(s) D to check, how to integrate this technique with the AVATAR approach and whether it makes sense to replace the SAT solver with an SMT solver. This discussion takes place within the context of the Vampire theorem prover.

Download Full-text

CSE_E 1.0: An Integrated Automated Theorem Prover for First-Order Logic

Symmetry ◽

10.3390/sym11091142 ◽

2019 ◽

Vol 11 (9) ◽

pp. 1142

Author(s):

Feng Cao ◽

Yang Xu ◽

Jun Liu ◽

Shuwei Chen ◽

Xinran Ning

Keyword(s):

Order Logic ◽

Theorem Prover ◽

First Order Logic ◽

Inference Mechanism ◽

First Order ◽

Proof Search ◽

Interdisciplinary Field ◽

Heuristic Strategies ◽

Automated Theorem Prover ◽

Hard Problems

First-order logic is an important part of mathematical logic, and automated theorem proving is an interdisciplinary field of mathematics and computer science. The paper presents an automated theorem prover for first-order logic, called C S E _ E 1.0, which is a combination of two provers contradiction separation extension (CSE) and E, where CSE is based on the recently-introduced multi-clause standard contradiction separation (S-CS) calculus for first-order logic and E is the well-known equational theorem prover for first-order logic based on superposition and rewriting. The motivation of the combined prover C S E _ E 1.0 is to (1) evaluate the capability, applicability and generality of C S E _ E , and (2) take advantage of novel multi-clause S-CS dynamic deduction of CSE and mature equality handling of E to solve more and harder problems. In contrast to other improvements of E, C S E _ E 1.0 optimizes E mainly from the inference mechanism aspect. The focus of the present work is given to the description of C S E _ E including its S-CS rule, heuristic strategies, and the S-CS dynamic deduction algorithm for implementation. In terms of combination, in order not to lose the capability of E and use C S E _ E to solve some hard problems which are unsolved by E, C S E _ E 1.0 schedules the running of the two provers in time. It runs plain E first, and if E does not find a proof, it runs plain CSE, then if it does not find a proof, some clauses inferred in the CSE run as lemmas are added to the original clause set and the combined clause set handed back to E for further proof search. C S E _ E 1.0 is evaluated through benchmarks, e.g., CASC-26 (2017) and CASC-J9 (2018) competition problems (FOFdivision). Experimental results show that C S E _ E 1.0 indeed enhances the performance of E to a certain extent.

Download Full-text

Deep Network Guided Proof Search

10.29007/8mwc ◽

2018 ◽

Cited By ~ 1

Author(s):

Sarah Loos ◽

Geoffrey Irving ◽

Christian Szegedy ◽

Cezary Kaliszyk

Keyword(s):

Deep Learning ◽

Program Analysis ◽

Network Models ◽

Theorem Prover ◽

Neural Network Models ◽

Two Phase ◽

First Order ◽

Proof Search ◽

System Verification ◽

Theory Reasoning

Deep learning techniques lie at the heart of several significant AI advances in recent years including object recognition and detection, image captioning, machine translation, speech recognition and synthesis, and playing the game of Go.Automated first-order theorem provers can aid in the formalization and verification of mathematical theorems and play a crucial role in program analysis, theory reasoning, security, interpolation, and system verification.Here we suggest deep learning based guidance in the proof search of the theorem prover E. We train and compare several deep neural network models on the traces of existing ATP proofs of Mizar statements and use them to select processed clauses during proof search. We give experimental evidence that with a hybrid, two-phase approach, deep learning based guidance can significantly reduce the average number of proof search steps while increasing the number of theorems proved.Using a few proof guidance strategies that leverage deep neural networks, we have found first-order proofs of 7.36% of the first-order logic translations of the Mizar Mathematical Library theorems that did not previously have ATP generated proofs. This increases the ratio of statements in the corpus with ATP generated proofs from 56% to 59%.

Download Full-text

Understanding LEO-II’s proofs

10.29007/x9c9 ◽

2018 ◽

Author(s):

Nik Sultana ◽

Christoph Benzmüller

Keyword(s):

Theorem Proving ◽

Automated Theorem Proving ◽

Higher Order ◽

Target Audience ◽

Theorem Prover ◽

First Order

The LEO and LEO-II provers have pioneered the integration of higher-order and first-order automated theorem proving. To date, the LEO-II system is, to our knowledge, the only automated higher-order theorem prover which is capable of generating joint higher-order–first-order proof objects in TPTP format. This paper discusses LEO-II’s proof objects. The target audience are practitioners with an interest in using LEO-II proofs within other systems.

Download Full-text

SAT solving experiments in Vampire

10.29007/5l47 ◽

2018 ◽

Author(s):

Armin Biere ◽

Ioan Dragan ◽

Laura Kovács ◽

Andrei Voronkov

Keyword(s):

State Of The Art ◽

Theorem Prover ◽

Sat Solving ◽

First Order ◽

Automated Theorem Prover ◽

Sat Solver ◽

Overall Performance

In order to better understand how well a state of the art SAT solver would behave in the framework of a first-order automated theorem prover we have decided to integrate Lingeling, best performing SAT solver, inside Vampire’s AVATAR framework. In this paper we propose two ways of integrating a SAT solver inside of Vampire and evaluate overall performance of this combination. Our experiments show that by using a state of the art SAT solver in Vampire we manage to solve more problems. Surprisingly though, there are cases where combination of the two solvers does not always prove to generate best results.

Download Full-text

On Recurrent Neural Network Based Theorem Prover For First Order Minimal Logic

JUCS - Journal of Universal Computer Science ◽

10.3897/jucs.76563 ◽

2021 ◽

Vol 27 (11) ◽

pp. 1193-1202

Author(s):

Ashot Baghdasaryan ◽

Hovhannes Bolibekyan

Keyword(s):

Neural Network ◽

Neural Networks ◽

Recurrent Neural Network ◽

Theorem Proving ◽

Recurrent Neural Networks ◽

Selection Problem ◽

Theorem Prover ◽

Minimal Logic ◽

Free System ◽

First Order

There are three main problems for theorem proving with a standard cut-free system for the first order minimal logic. The first problem is the possibility of looping. Secondly, it might generate proofs which are permutations of each other. Finally, during the proof some choice should be made to decide which rules to apply and where to use them. New systems with history mechanisms were introduced for solving the looping problems of automated theorem provers in the first order minimal logic. In order to solve the rule selection problem, recurrent neural networks are deployed and they are used to determine which formula from the context should be used on further steps. As a result, it yields to the reduction of time during theorem proving.

Download Full-text

Integer Induction in Saturation

Automated Deduction – CADE 28 - Lecture Notes in Computer Science ◽

10.1007/978-3-030-79876-5_21 ◽

2021 ◽

pp. 361-377

Author(s):

Petra Hozzová ◽

Laura Kovács ◽

Andrei Voronkov

Keyword(s):

Theorem Proving ◽

Program Analysis ◽

State Of The Art ◽

Inductive Reasoning ◽

Theorem Prover ◽

Inference Rules ◽

First Order ◽

Theorem Provers ◽

Mathematical Properties

AbstractIntegers are ubiquitous in programming and therefore also in applications of program analysis and verification. Such applications often require some sort of inductive reasoning. In this paper we analyze the challenge of automating inductive reasoning with integers. We introduce inference rules for integer induction within the saturation framework of first-order theorem proving. We implemented these rules in the theorem prover Vampire and evaluated our work against other state-of-the-art theorem provers. Our results demonstrate the strength of our approach by solving new problems coming from program analysis and mathematical properties of integers.

Download Full-text

nanoCoP: Natural Non-clausal Theorem Proving

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/695 ◽

2017 ◽

Cited By ~ 1

Author(s):

Jens Otten

Keyword(s):

Conjunctive Normal Form ◽

Order Logic ◽

Theorem Prover ◽

Original Structure ◽

First Order ◽

Proof Search ◽

Automated Theorem Prover ◽

Speed Up ◽

Input Formula ◽

Original Formula

Most efficient fully automated theorem provers implement proof search calculi that require the input formula to be in a clausal form, i.e. disjunctive or conjunctive normal form. The translation into clausal form introduces a significant overhead to the proof search and modifies the structure of the original formula. Translating a proof in clausal form back into a more readable non-clausal proof of the original formula is not straightforward. This paper presents a non-clausal automated theorem prover for classical first-order logic. It is based on a non-clausal connection calculus and implemented with a few lines of Prolog code. Working entirely on the original structure of the input formula yields not only a speed up of the proof search, but the resulting non-clausal proofs are also shorter.

Download Full-text

Concurrency Debugging with MaxSMT

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33011608 ◽

2019 ◽

Vol 33 ◽

pp. 1608-1616 ◽

Cited By ~ 2

Author(s):

Miguel Terra-Neves ◽

Nuno Machado ◽

Ines Lynce ◽

Vasco Manquinho

Keyword(s):

Satisfiability Modulo Theories ◽

Huge Number ◽

Sequential Programs ◽

Concurrency Bugs ◽

Maximum Satisfiability ◽

Smt Solver ◽

Root Cause ◽

Recent Approach ◽

Sat Solver ◽

Current Maximum

Current Maximum Satisfiability (MaxSAT) algorithms based on successive calls to a powerful Satisfiability (SAT) solver are now able to solve real-world instances in many application domains. Moreover, replacing the SAT solver with a Satisfiability Modulo Theories (SMT) solver enables effective MaxSMT algorithms. However, MaxSMT has seldom been used in debugging multi-threaded software.Multi-threaded programs are usually non-deterministic due to the huge number of possible thread operation schedules, which makes them much harder to debug than sequential programs. A recent approach to isolate the root cause of concurrency bugs in multi-threaded software is to produce a report that shows the differences between a failing and a non-failing execution. However, since they rely solely on heuristics, these reports can be unnecessarily large. Hence, reports may contain operations that are not relevant to the bug’s occurrence.This paper proposes the use of MaxSMT for the generation of minimal reports for multi-threaded software with concurrency bugs. The proposed techniques report situations that the existing techniques are not able to identify. Experimental results show that using MaxSMT can significantly improve the accuracy of the generated reports and, consequently, their usefulness in debugging the root cause of concurrency bugs.

Download Full-text