Fast paraphrase extraction in Ancient Greek literature

2020 ◽  
Vol 62 (2) ◽  
pp. 75-89
Author(s):  
Marcus Pöckelmann ◽  
Janis Dähne ◽  
Jörg Ritter ◽  
Paul Molitor

AbstractIn this paper,A shorter version of the paper appeared in German in the final report of the Digital Plato project which was funded by the Volkswagen Foundation from 2016 to 2019. [35], [28]. we present a method for paraphrase extraction in Ancient Greek that can be applied to huge text corpora in interactive humanities applications. Since lexical databases and POS tagging are either unavailable or do not achieve sufficient accuracy for ancient languages, our approach is based on pure word embeddings and the word mover’s distance (WMD) [20]. We show how to adapt the WMD approach to paraphrase searching such that the expensive WMD computation has to be computed for a small fraction of the text segments contained in the corpus, only. Formally, the time complexity will be reduced from \mathcal{O}(N\cdot {K^{3}}\cdot \log K) to \mathcal{O}(N+{K^{3}}\cdot \log K), compared to the brute-force approach which computes the WMD between each text segment of the corpus and the search query. N is the length of the corpus and K the size of its vocabulary. The method, which searches not only for paraphrases of the same length as the search query but also for paraphrases of varying lengths, was evaluated on the Thesaurus Linguae Graecae® (TLG®) [25]. The TLG consists of about 75\cdot {10^{6}} Greek words. We searched the whole TLG for paraphrases for given passages of Plato. The experimental results show that our method and the brute-force approach, with only very few exceptions, propose the same text passages in the TLG as possible paraphrases. The computation times of our method are in a range that allows its application in interactive systems and let the humanities scholars work productively and smoothly.

Nature ◽  
2018 ◽  
Vol 560 (7718) ◽  
pp. 293-294
Author(s):  
Davide Castelvecchi

Author(s):  
Rajesh Prasad

Word matching problem is to find all the exact occurrences of a pattern P[0...m-1] in the text T[0...n-1], where P neither contains any white space nor preceded and followed by space. In the parameterized word matching problem, a given word P[0...m-1] is said to match with a sub-word t of the text T[0...n-1], if there exists a one-to-one correspondence between the symbols of P and the symbols of t. Exact Word Matching (EWM) problem has been previously solved by partitioning the text into number of tables in the pre-processing phase and then applying either brute force approach or fast hashing during the searching process. This paper presents an extension of EWM problem for parameterized word matching. It first split the text into number of tables in the pre-processing phase and then applying prev-encoding and bit-parallelism technique, Parameterized Shift-Or (PSO) during the searching phase. Experimental results show that this technique performs better than PSO.


2020 ◽  
Vol 70 (6) ◽  
pp. 612-618
Author(s):  
Maiya Din ◽  
Saibal K. Pal ◽  
S. K. Muttoo ◽  
Sushila Madan

The Playfair cipher is a symmetric key cryptosystem-based on encryption of digrams of letters. The cipher shows higher cryptanalytic complexity compared to mono-alphabetic cipher due to the use of 625 different letter-digrams in encryption instead of 26 letters from Roman alphabets. Population-based techniques like Genetic algorithm (GA) and Swarm intelligence (SI) are more suitable compared to the Brute force approach for cryptanalysis of cipher because of specific and unique structure of its Key Table. This work is an attempt to automate the process of cryptanalysis using hybrid computational intelligence. Multiple particle swarm optimization (MPSO) and GA-based hybrid technique (MPSO-GA) have been proposed and applied in solving Playfair ciphers. The authors have attempted to find the solution key applied in generating Playfair crypts by using the proposed hybrid technique to reduce the exhaustive search space. As per the computed results of the MPSO-GA technique, correct solution was obtained for the Playfair ciphers of 100 to 200 letters length. The proposed technique provided better results compared to either GA or PSO-based technique. Furthermore, the technique was also able to recover partial English text message for short Playfair ciphers of 80 to 120 characters length.


2021 ◽  
Author(s):  
Charles R. Krouse ◽  
Grant O. Musgrove ◽  
Taewoan Kim ◽  
Seungmin Lee ◽  
Muhyoung Lee ◽  
...  

Abstract The Chaboche model is a well-validated non-linear kinematic hardening material model. This material model, like many models, depends on a set of material constants that must be calibrated for it to match the experimental data. Due to the challenge of calibrating these constants, the Chaboche model is often disregarded. The challenge with calibrating the Chaboche constants is that the most reliable method for doing the calibration is a brute force approach, which tests thousands of combinations of constants. Different sampling techniques and optimization schemes can be used to select different combinations of these constants, but ultimately, they all rely on iteratively selecting values and running simulations for each selected set. In the experience of the authors, such brute force methods require roughly 2,500 combinations to be evaluated in order to have confidence that a reasonable solution is found. This process is not efficient. It is time-intensive and labor-intensive. It requires long simulation times, and it requires significant effort to develop the accompanying scripts and algorithms that are used to iterate through combinations of constants and to calculate agreement. A better, more automated method exists for calibrating the Chaboche material constants. In this paper, the authors describe a more efficient, automated method for calibrating Chaboche constants. The method is validated by using it to calibrate Chaboche constants for an IN792 single-crystal material and a CM247 directionally-solidified material. The calibration results using the automated approach were compared to calibration results obtained using a brute force approach. It was determined that the automated method achieves agreeable results that are equivalent to, or supersede, results obtained using the conventional brute force method. After validating the method for cases that only consider a single material orientation, the automated method was extended to multiple off-axis calibrations. The Chaboche model that is available in commercial software, such as ANSYS, will only accept a single set of Chaboche constants for a given temperature. There is no published method for calibrating Chaboche constants that considers multiple material orientations. Therefore, the approach outlined in this paper was extended to include multiple material orientations in a single calibration scheme. The authors concluded that the automated approach can be used to successfully, accurately, and efficiently calibrate multiple material directions. The approach is especially well-suited when off-axis calibration must be considered concomitantly with longitudinal calibration. Overall, the automated Chaboche calibration method yielded results that agreed well with experimental data. Thus, the method can be used with confidence to efficiently and accurately calibrate the Chaboche non-linear kinematic hardening material model.


Author(s):  
Eliot Rudnick-Cohen

Abstract Multi-objective decision making problems can sometimes involve an infinite number of objectives. In this paper, an approach is presented for solving multi-objective optimization problems containing an infinite number of parameterized objectives, termed “infinite objective optimization”. A formulation is given for infinite objective optimization problems and an approach for checking whether a Pareto frontier is a solution to this formulation is detailed. Using this approach, a new sampling based approach is developed for solving infinite objective optimization problems. The new approach is tested on several different example problems and is shown to be faster and perform better than a brute force approach.


Author(s):  
Rafael Labedan ◽  
Christian Talbot ◽  
Jonathan Gagnon ◽  
Francois Gagnon

Author(s):  
Lia Humphrey ◽  
Edward W. Thommes ◽  
Roie Fields ◽  
Naseem Hakim ◽  
Ayman Chit ◽  
...  

In this work we present an analysis of the two major strategies currently implemented around the world in the fight against COVID-19: Social distancing & shelter-in-place measures to protect the susceptible, and testing & contact-tracing to identify, isolate and treat the infected. The majority of countries have principally relied on the former; we consider the examples of Italy, Canada and the United States. By fitting a disease transmission model to daily case report data, we infer that in each of the three countries, the current level of national shutdown is equivalent to about half the population being under quarantine. We demonstrate that in the absence of other measures, scaling back social distancing in such a way as to prevent a second wave will take prohibitively long. In contrast, South Korea, a country that has managed to control and suppress its outbreak principally through mass testing and contact tracing, and has only instated a partial shutdown. For all four countries, we estimate the level of testing which would be required to allow a complete exit from shutdown and a full lifting of social distancing measures, without a resurgence of COVID-19. We find that a “brute-force” approach of untargeted universal testing requires an average testing rate of once every 36 to 48 hours for every individual, depending on the country. If testing is combined with contact tracing, and/or if tests are able to identify latent infection, then an average rate of once every 4 to 5 days is sufficient.Significance StatementWe demonstrate how current quarantine measures can be lifted after the current pandemic wave by large-scale, frequent-testing and contact tracing on the remaining susceptible populations. We present an analysis of the two major strategies currently implemented around the world against COVID-19: Social distancing & shelter-in-place measures to protect the susceptible, and testing & contact-tracing to identify, isolate and treat the infected. We find that a “brute-force” approach of untargeted universal testing requires an average testing rate of once every 36 to 48 hours for every individual, depending on the country. If testing is combined with contact tracing, and/or if tests are able to identify latent infection, then an average rate of once every 4 to 5 days is sufficient.


Sign in / Sign up

Export Citation Format

Share Document