scholarly journals Provable randomized rounding for minimum-similarity diversification

Author(s):  
Bruno Ordozgoiti ◽  
Ananth Mahadevan ◽  
Antonis Matakos ◽  
Aristides Gionis

AbstractWhen searching for information in a data collection, we are often interested not only in finding relevant items, but also in assembling a diverse set, so as to explore different concepts that are present in the data. This problem has been researched extensively. However, finding a set of items with minimal pairwise similarities can be computationally challenging, and most existing works striving for quality guarantees assume that item relatedness is measured by a distance function. Given the widespread use of similarity functions in many domains, we believe this to be an important gap in the literature. In this paper we study the problem of finding a diverse set of items, when item relatedness is measured by a similarity function. We formulate the diversification task using a flexible, broadly applicable minimization objective, consisting of the sum of pairwise similarities of the selected items and a relevance penalty term. To find good solutions we adopt a randomized rounding strategy, which is challenging to analyze because of the cardinality constraint present in our formulation. Even though this obstacle can be overcome using dependent rounding, we show that it is possible to obtain provably good solutions using an independent approach, which is faster, simpler to implement and completely parallelizable. Our analysis relies on a novel bound for the ratio of Poisson-Binomial densities, which is of independent interest and has potential implications for other combinatorial-optimization problems. We leverage this result to design an efficient randomized algorithm that provides a lower-order additive approximation guarantee. We validate our method using several benchmark datasets, and show that it consistently outperforms the greedy approaches that are commonly used in the literature.

Author(s):  
ENDER SEVİNÇ ◽  
TANSEL DÖKEROĞLU

Data classification is the process of organizing data by relevant categories. In this way, the data can be understood and used more efficiently by scientists. Numerous studies have been proposed in the literature for the problem of data classification. However, with recently introduced metaheuristics, it has continued to be riveting to revisit this classical problem and investigate the efficiency of new techniques. Teaching-learning-based optimization (TLBO) is a recent metaheuristic that has been reported to be very effective for combinatorial optimization problems. In this study, we propose a novel hybrid TLBO algorithm with extreme learning machines (ELM) for the solution of data classification problems. The proposed algorithm (TLBO-ELM) is tested on a set of UCI benchmark datasets. The performance of TLBO-ELM is observed to be competitive for both binary and multiclass data classification problems compared with state-of-the-art algorithms.


Author(s):  
Jun Zhu ◽  
Yushen Wang

Swarm intelligent algorithms can effectively tackle optimization problems that are difficult to solve by using traditional optimization algorithms. However, with the huge increase in the time and space cost for solving optimization problems, the use of swarm intelligent algorithms suffer from the limitation of overly long computation time. Based on Spark, which is the most popular open-source distributed computing framework, this paper studies specifically using swarm intelligent algorithms to solve combinatorial optimization problems. Based on the characteristics of typical swarm intelligent algorithms, we develop Spark-based parallel implementation of these algorithms to accelerate the population updating and parameter tuning procedures involved in swarm intelligence. Specifically, we first initialize the swarm and generate the initial solution, then perform the distributed iterative evolution procedure, and finally obtain the optimal solution. In addition, in order to improve solution quality, we rely on the Spark platform to perform distributed parameter tuning. The tuning strategy first generates different parameter combinations according to a given parameter list, then execute swarm intelligent algorithms with different parameter combinations in a distributed and parallel manner, and finally determine the optimal parameter combination by comparing the solutions of all algorithms. Experimental results on benchmark datasets show that the distributed algorithms can significantly enhance the computational efficiency without affecting the solution quality.


Mathematics ◽  
2021 ◽  
Vol 9 (3) ◽  
pp. 225
Author(s):  
José García ◽  
Gino Astorga ◽  
Víctor Yepes

The optimization methods and, in particular, metaheuristics must be constantly improved to reduce execution times, improve the results, and thus be able to address broader instances. In particular, addressing combinatorial optimization problems is critical in the areas of operational research and engineering. In this work, a perturbation operator is proposed which uses the k-nearest neighbors technique, and this is studied with the aim of improving the diversification and intensification properties of metaheuristic algorithms in their binary version. Random operators are designed to study the contribution of the perturbation operator. To verify the proposal, large instances of the well-known set covering problem are studied. Box plots, convergence charts, and the Wilcoxon statistical test are used to determine the operator contribution. Furthermore, a comparison is made using metaheuristic techniques that use general binarization mechanisms such as transfer functions or db-scan as binarization methods. The results obtained indicate that the KNN perturbation operator improves significantly the results.


Mathematics ◽  
2021 ◽  
Vol 9 (13) ◽  
pp. 1456
Author(s):  
Stefka Fidanova ◽  
Krassimir Todorov Atanassov

Some of industrial and real life problems are difficult to be solved by traditional methods, because they need exponential number of calculations. As an example, we can mention decision-making problems. They can be defined as optimization problems. Ant Colony Optimization (ACO) is between the best methods, that solves combinatorial optimization problems. The method mimics behavior of the ants in the nature, when they look for a food. One of the algorithm parameters is called pheromone, and it is updated every iteration according quality of the achieved solutions. The intuitionistic fuzzy (propositional) logic was introduced as an extension of Zadeh’s fuzzy logic. In it, each proposition is estimated by two values: degree of validity and degree of non-validity. In this paper, we propose two variants of intuitionistic fuzzy pheromone updating. We apply our ideas on Multiple-Constraint Knapsack Problem (MKP) and compare achieved results with traditional ACO.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Yoshihiko Imanaka ◽  
Toshihisa Anazawa ◽  
Fumiaki Kumasaka ◽  
Hideyuki Jippo

AbstractTailored material is necessary in many industrial applications since material properties directly determine the characteristics of components. However, the conventional trial and error approach is costly and time-consuming. Therefore, materials informatics is expected to overcome these drawbacks. Here, we show a new materials informatics approach applying the Ising model for solving discrete combinatorial optimization problems. In this study, the composition of the composite, aimed at developing a heat sink with three necessary properties: high thermal dissipation, attachability to Si, and a low weight, is optimized. We formulate an energy function equation concerning three objective terms with regard to the thermal conductivity, thermal expansion and specific gravity, with the composition variable and two constrained terms with a quadratic unconstrained binary optimization style equivalent to the Ising model and calculated by a simulated annealing algorithm. The composite properties of the composition selected from ten constituents are verified by the empirical mixture rule of the composite. As a result, an optimized composition with high thermal conductivity, thermal expansion close to that of Si, and a low specific gravity is acquired.


2021 ◽  
Vol 11 (9) ◽  
pp. 4169
Author(s):  
Hirotaka Takano ◽  
Junichi Murata ◽  
Kazuki Morishita ◽  
Hiroshi Asano

The recent growth in the penetration of photovoltaic generation systems (PVs) has brought new difficulties in the operating and planning of electric power distribution networks. This is because operators of the distribution networks normally cannot monitor or control the output of the PVs, which introduces additional uncertainty into the available information that operations must rely on. This paper focuses on the service restoration of the distribution networks, and the authors propose a problem framework and its solution method that finds the optimal restoration configuration under extensive PV installation. The service restoration problems have been formulated as combinatorial optimization problems. They do, however, require accurate information on load sections, which is impractical in distribution networks with extensively installed PVs. A combined framework of robust optimization and two-stage stochastic programming adopted in the proposed problem formulation enables us to deal with the PV-originated uncertainty using readily available information only. In addition, this problem framework can be treated by a traditional solution method with slight extensions. The validity of the authors’ proposal is verified through numerical simulations on a real-scale distribution network model and includes a discussion of their results.


Sign in / Sign up

Export Citation Format

Share Document