metric matching
Recently Published Documents


TOTAL DOCUMENTS

17
(FIVE YEARS 6)

H-INDEX

4
(FIVE YEARS 0)

2021 ◽  
Vol 2021 ◽  
pp. 1-19
Author(s):  
Yanli Shao ◽  
Jingru Zhao ◽  
Xingqi Wang ◽  
Weiwei Wu ◽  
Jinglong Fang

As the scale and complexity of software increase, software security issues have become the focus of society. Software defect prediction (SDP) is an important means to assist developers in discovering and repairing potential defects that may endanger software security in advance and improving software security and reliability. Currently, cross-project defect prediction (CPDP) and cross-company defect prediction (CCDP) are widely studied to improve the defect prediction performance, but there are still problems such as inconsistent metrics and large differences in data distribution between source and target projects. Therefore, a new CCDP method based on metric matching and sample weight setting is proposed in this study. First, a clustering-based metric matching method is proposed. The multigranularity metric feature vector is extracted to unify the metric dimension while maximally retaining the information contained in the metrics. Then use metric clustering to eliminate metric redundancy and extract representative metrics through principal component analysis (PCA) to support one-to-one metric matching. This strategy not only solves the metric inconsistent and redundancy problem but also transforms the cross-company heterogeneous defect prediction problem into a homogeneous problem. Second, a sample weight setting method is proposed to transform the source data distribution. Wherein the statistical source sample frequency information is set as an impact factor to increase the weight of source samples that are more similar to the target samples, which improves the data distribution similarity between the source and target projects, thereby building a more accurate prediction model. Finally, after the above two-step processing, some classical machine learning methods are applied to build the prediction model, and 12 project datasets in NASA and PROMISE are used for performance comparison. Experimental results prove that the proposed method has superior prediction performance over other mainstream CCDP methods.


Author(s):  
Toshiya Itoh ◽  
Shuichi Miyazaki ◽  
Makoto Satake

In the online metric matching problem, there are servers on a given metric space and requests are given one-by-one. The task of an online algorithm is to match each request immediately and irrevocably with one of the unused servers. In this paper, we pursue competitive analysis for two variants of the online metric matching problem. The first variant is a restriction where each server is placed at one of two positions, which is denoted by OMM([Formula: see text]). We show that a simple greedy algorithm achieves the competitive ratio of 3 for OMM([Formula: see text]). We also show that this greedy algorithm is optimal by showing that the competitive ratio of any deterministic online algorithm for OMM([Formula: see text]) is at least 3. The second variant is the online facility assignment problem on a line. In this problem, the metric space is a line, the servers have capacities, and the distances between any two consecutive servers are the same. We denote this problem by OFAL([Formula: see text]), where [Formula: see text] is the number of servers. We first observe that the upper and lower bounds for OMM([Formula: see text]) also hold for OFAL([Formula: see text]), so the competitive ratio for OFAL([Formula: see text]) is exactly 3. We then show lower bounds on the competitive ratio [Formula: see text] [Formula: see text], [Formula: see text] [Formula: see text] and [Formula: see text] [Formula: see text] for OFAL([Formula: see text]), OFAL([Formula: see text]) and OFAL([Formula: see text]), respectively.


2021 ◽  
Vol 17 (2) ◽  
pp. 55-71
Author(s):  
Rohit Vashisht ◽  
Syed Afzal Murtaza Rizvi

Cross-project defect prediction (CPDP) forecasts flaws in a target project through defect prediction models (DPM) trained by defect data of another project. However, CPDP has a prevalent problem (i.e., distinct projects must have identical features to describe themselves). This article emphasizes on heterogeneous CPDP (HCPDP) modeling that does not require same metric set between two applications and builds DPM based on metrics showing comparable distribution in their values for a given pair of datasets. This paper evaluates empirically and theoretically HCPDP modeling, which comprises of three main phases: feature ranking and feature selection, metric matching, and finally, predicting defects in the target application. The research work has been experimented on 13 benchmarked datasets of three open source projects. Results show that performance of HCPDP is very much comparable to baseline within project defect prediction (WPDP) and XG boosting classification model gives best results when used in conjunction with Kendall's method of correlation as compared to other set of classifiers.


2020 ◽  
pp. 001789692094989
Author(s):  
Qiyang Zhang

Objective: The aim of this study was to investigate the impact of illiteracy on physical health and mental health. Design: Matching methods (nearest neighbour matching, Mahalanobis metric matching, and propensity score matching). Setting: Elderly people at least 65 years old in 22 provinces of China. Methods: The analysis used data from Chinese Longitudinal Healthy Longevity Survey (CLHLS). The independent variable was a dummy variable, which was coded as 1 for illiterate or semiliterate, and 0 for literate. Dependent variables were indicators of physical and mental health derived from the survey results. Matching methodologies controlled for confounding variables including age, sex, living sites, access to tap water and financial support. Results: Illiteracy was found to have a significant impact on physical health, exercise habits, anxiety, loneliness and happiness. On average, illiteracy decreased physical health by 19.9%, decreased exercise habits by 7%, increased anxiety by 11.56%, increased loneliness by 17.6% and decreased happiness by 11.3%. Conclusion: Findings confirm the past literature in which illiteracy has been found to be adversely associated with physical and mental health. The analysis uniquely found that illiteracy had a higher cost on mental health as compared to physical health for elderly people in China.


2018 ◽  
Vol 10 (11) ◽  
pp. 4008 ◽  
Author(s):  
Eunah Jung ◽  
Heeyeun Yoon

In this study, we investigate how far away and for how long past flooding affected single-family housing values in Gyeonggi, South Korea. In order to empirically explore the geographic and temporal extent of the effects, we adopt two analytical methods: random-intercept multilevel modeling and Mahalanobis-metric matching modeling. The analytical results suggest that the geographic extent of the discount effect of a flooding disaster is within 300 m from an inundated area. Market values of housing located 0–100, 100–200, and 200–300 m from inundated areas were lower by 11.0%, 7.4%, and 6.3%, respectively, than counterparts in the control group. The effect lasted only for 12 months after the disaster and then disappeared. During the first month, 1–3 months, and 3–6 months after a flood, housing units in the disaster-influenced area (within 300 m of the inundated area) were worth, on average, 57.6%, 49.2%, and 45.9% less than control units, respectively. Also, within the following 6 months, the discount effects were reduced to 33.2%. On the other hand, the results showed no statistically significant effects on market values more than 12 months after the disaster. By providing insights into how people perceive and respond to natural hazards, this research provides practical lessons for establishing sustainable disaster management and urban resilience strategies.


2012 ◽  
Vol 12 ◽  
pp. 419-430 ◽  
Author(s):  
SHUANG-NAN ZHANG ◽  
SHUXU YI

In Newtonian gravity (NG) it is known that the gravitational field anywhere inside a spherically symmetric distribution of mass is determined only by the enclosed mass. This is also widely believed to be true in general relativity (GR), and the Birkhoff theorem is often invoked to support this analogy between NG and GR. Here we show that such an understanding of the Birkhoff theorem is incorrect and leads to erroneous calculations of light deflection and delay time through matter. The correct metric, matching continuously to the location of an external observer, is determined both by the enclosed mass and mass distribution outside. The effect of the outside mass is to make the interior clock run slower, i.e., a slower speed of light for external observer. We also discuss the relations and differences between NG and GR, in light of the results we obtained in this Lettework. Finally we discuss the Generalized Shapiro delay, caused by the outside mass, and its possible laboratory test.


Sign in / Sign up

Export Citation Format

Share Document