A framework to exploit the structure of and solve set packing problems with a semi-block-angular structure

2019 ◽  
Vol 137 ◽  
pp. 106036
Author(s):  
Maryam Radman ◽  
Kourosh Eshghi
2013 ◽  
Vol 60 (4) ◽  
pp. 1-39 ◽  
Author(s):  
Georg Gottlob ◽  
Gianluigi Greco

2020 ◽  
Vol 34 (02) ◽  
pp. 1593-1602
Author(s):  
Vishnu Suresh Lokhande ◽  
Shaofei Wang ◽  
Maneesh Singh ◽  
Julian Yarkony

In this paper, we introduce a new optimization approach to Entity Resolution. Traditional approaches tackle entity resolution with hierarchical clustering, which does not benefit from a formal optimization formulation. In contrast, we model entity resolution as correlation-clustering, which we treat as a weighted set-packing problem and write as an integer linear program (ILP). In this case, sources in the input data correspond to elements and entities in output data correspond to sets/clusters. We tackle optimization of weighted set packing by relaxing integrality in our ILP formulation. The set of potential sets/clusters can not be explicitly enumerated, thus motivating optimization via column generation. In addition to the novel formulation, we also introduce new dual optimal inequalities (DOI), that we call flexible dual optimal inequalities, which tightly lower-bound dual variables during optimization and accelerate column generation. We apply our formulation to entity resolution (also called de-duplication of records), and achieve state-of-the-art accuracy on two popular benchmark datasets. Our F-DOI can be extended to other weighted set-packing problems.


2004 ◽  
Vol 153 (3) ◽  
pp. 564-580 ◽  
Author(s):  
Xavier Delorme ◽  
Xavier Gandibleux ◽  
Joaquin Rodriguez
Keyword(s):  

2002 ◽  
Vol 16 (1) ◽  
pp. 127-155 ◽  
Author(s):  
Lázaro Cánovas ◽  
Mercedes Landete ◽  
Alfredo Marín
Keyword(s):  

Author(s):  
Tien-Nam Le ◽  
Daniel Lokshtanov ◽  
Saket Saurabh ◽  
Stéphan Thomassé ◽  
Meirav Zehavi

2008 ◽  
Vol 186 (2) ◽  
pp. 504-512 ◽  
Author(s):  
Bahram Alidaee ◽  
Gary Kochenberger ◽  
Karen Lewis ◽  
Mark Lewis ◽  
Haibo Wang

Author(s):  
Klaus Jansen ◽  
Kim-Manuel Klein ◽  
Marten Maack ◽  
Malin Rau

AbstractInteger linear programs of configurations, or configuration IPs, are a classical tool in the design of algorithms for scheduling and packing problems where a set of items has to be placed in multiple target locations. Herein, a configuration describes a possible placement on one of the target locations, and the IP is used to choose suitable configurations covering the items. We give an augmented IP formulation, which we call the module configuration IP. It can be described within the framework of n-fold integer programming and, therefore, be solved efficiently. As an application, we consider scheduling problems with setup times in which a set of jobs has to be scheduled on a set of identical machines with the objective of minimizing the makespan. For instance, we investigate the case that jobs can be split and scheduled on multiple machines. However, before a part of a job can be processed, an uninterrupted setup depending on the job has to be paid. For both of the variants that jobs can be executed in parallel or not, we obtain an efficient polynomial time approximation scheme (EPTAS) of running time $$f(1/\varepsilon )\cdot \mathrm {poly}(|I|)$$ f ( 1 / ε ) · poly ( | I | ) . Previously, only constant factor approximations of 5/3 and $$4/3 + \varepsilon $$ 4 / 3 + ε , respectively, were known. Furthermore, we present an EPTAS for a problem where classes of (non-splittable) jobs are given, and a setup has to be paid for each class of jobs being executed on one machine.


Sign in / Sign up

Export Citation Format

Share Document