nearest neighborhood
Recently Published Documents


TOTAL DOCUMENTS

149
(FIVE YEARS 53)

H-INDEX

12
(FIVE YEARS 3)

2021 ◽  
Vol 10 (1) ◽  
pp. 94
Author(s):  
Ali Abdolahi ◽  
Vali Nowzari ◽  
Ali Pirzad ◽  
Seyed Ehsan Amirhosseini

Introduction: Health companies need investment for development. Due to the high risk of their activities, it is very difficult to attract investment for this field, but this lack of financial resources leads to the failure of these companies, so providing a model for predicting profits and losses in companies is very important and functional.Materials and Method: In this study, a combination of two logistic regression algorithms and differential analysis were used to design a profit and loss forecasting model. Also, the information of 20 companies in the field of health was used to evaluate the proposed model. 10 profitable companies and 10 loss-making companies were selected and for each company, nine variables independent of the financial information of these companies were collected.Results: The designed prediction model was implemented on the data in this study. To do this, the data were divided into two sets: training and testing. The prediction model was implemented on training data and evaluated by test data and reached 99.65% sensitivity, 94.75% specificity and 96.28% accuracy. The proposed model was then compared with the methods of decision tree C4.5, Bayesian, support vector machine, nearest neighborhood and multilayer neural network and it was found to have a better output.Conclusion: In this study, it was found that the risk in the field of health investment can be reduced, so the profit and loss situation of health companies can be predicted with appropriate accuracy. It was also found that the combination of logistic regression and differential analysis algorithms can increase the accuracy of the prediction model.


2021 ◽  
pp. 1-12
Author(s):  
Chunyan She ◽  
Shaohua Zeng

Outlier detection is a hot issue in data mining, which has plenty of real-world applications. LOF (Local Outlier Factor) can capture the abnormal degree of objects in the dataset with different density levels, and many extended algorithms have been proposed in recent years. However, the LOF needs to search the nearest neighborhood of each object on the whole dataset, which greatly increases the time cost. Most of these extended algorithms only consider the distance between an object and its neighborhood, but ignore the local distribution of an object within its neighborhood, resulting in a high false-positive rate. To improve the running speed, a rough clustering based on triple fusion is proposed, which divides a dataset into several subsets and outlier detection is performed only on each subset. Then, considering the local distribution of an object within its neighborhood, a new local outlier factor is constructed to estimate the abnormal degree of each object. Finally, the experimental results indicate that the proposed algorithm has better performance and lower running time than the others.


2021 ◽  
Vol 1 (3) ◽  
Author(s):  
Tarkono ◽  
As'ad Humam ◽  
As'ad Humam ◽  
Ranti Vidia Mahyunis ◽  
Shofiyyah Fauziah Sayuti ◽  
...  

Abstract   Keteguhan Village is an area that has the highly potential to flood disaster, such as the flash flood incident that occurred on March 30, 2020. Floodprone mapping is needed to map flood prone potentials in SKeteguhan Village with the aim of increasing the alertness and readiness of the Keteguhan Village’s community in dealing that disaster. The used method includes processing the parameters of rainfall, land cover, slope, soil type, land height and land cover, then carried out by a weighted overlay process to form new data in the form of a flood prone potential map. The obtained results are that there are 3 potential areas, namely the lowlands along the river area of Umbul Kunci Street, the river area in the nearest neighborhood of Keteguhan Village and Mushollah Nurul Jannah on Laksamana R.E. Martadinata Street. Based on the area of vulnerability level in Keteguhan Village, the safe category has an area of up to 137,451 Ha with a percentage of 44.6%, the non-prone category has an area of up to 95,5116 Ha with a percentage of 30.01%, the vulnerable category has an area of up to 62.4922 Ha with a percentage of 20.27% and the very vulnerable category has area up to 15.7767 Ha with a percentage of 5.12%.


2021 ◽  
Vol 13 (17) ◽  
pp. 3475
Author(s):  
Yihuan Peng ◽  
Xuetong Xie ◽  
Mingsen Lin ◽  
Lishan Ran ◽  
Feng Yuan ◽  
...  

Rain affects the wind measurement accuracy of the Ku-band spaceborne scatterometer. In order to improve the quality of the retrieved wind field, it is necessary to identify and flag rain-contaminated data. In this study, an HY-2A scatterometer is used to study rain identification. In addition to the conventional parameters, such as the retrieved wind speed, the wind direction relative to the along-track direction, and the normalized beam difference, the experiment expands the mean deviation of the backscattering coefficient, the beam difference between fore and aft, and the node number of the wind vector cell (WVC) as the sensitive parameters according to the microwave scattering characteristics of rain and the actual measurement situation of the HY-2A. Furthermore, a rain identification model for HY2 (HY2RRM) with the K-Nearest Neighborhood (KNN) algorithm was built. After several tests, the accuracy of the selected HY2RRM approach is found to about 88%, and about 70% of rain-contaminated data can be accurately identified. The research results are helpful for better understanding the characteristics of microwave backscattering and provide a possible way to further improve the wind field retrieval accuracy of the HY-2A scatterometer and other Ku-band scatterometers.


Symmetry ◽  
2021 ◽  
Vol 13 (9) ◽  
pp. 1570 ◽  
Author(s):  
Jingcheng Zhu ◽  
Lunwen Wang

Identifying influential nodes in complex networks is of great significance for clearly understanding network structure and maintaining network stability. Researchers have proposed many classical methods to evaluate the propagation impact of nodes, but there is still some room for improvement in the identification accuracy. Degree centrality is widely used because of its simplicity and convenience, but it has certain limitations. We divide the nodes into neighbor layers according to the distance between the surrounding nodes and the measured node. Considering that the node’s neighbor layer information directly affects the identification result, we propose a new node influence identification method by combining degree centrality information about itself and neighbor layer nodes. This method first superimposes the degree centrality of the node itself with neighbor layer nodes to quantify the effect of neighbor nodes, and then takes the nearest neighborhood several times to characterize node influence. In order to evaluate the efficiency of the proposed method, the susceptible–infected–recovered (SIR) model was used to simulate the propagation process of nodes on multiple real networks. These networks are unweighted and undirected networks, and the adjacency matrix of these networks is symmetric. Comparing the calculation results of each method with the results obtained by SIR model, the experimental results show that the proposed method is more effective in determining the node influence than seven other identification methods.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Na Zhao ◽  
Reid T. Powell ◽  
Xueying Yuan ◽  
Goeun Bae ◽  
Kevin P. Roarty ◽  
...  

AbstractThe epithelial-mesenchymal transition (EMT) has been implicated in conferring stem cell properties and therapeutic resistance to cancer cells. Therefore, identification of drugs that can reprogram EMT may provide new therapeutic strategies. Here, we report that cells derived from claudin-low mammary tumors, a mesenchymal subtype of triple-negative breast cancer, exhibit a distinctive organoid structure with extended “spikes” in 3D matrices. Upon a miR-200 induced mesenchymal-epithelial transition (MET), the organoids switch to a smoother round morphology. Based on these observations, we developed a morphological screening method with accompanying analytical pipelines that leverage deep neural networks and nearest neighborhood classification to screen for EMT-reversing drugs. Through screening of a targeted epigenetic drug library, we identified multiple class I HDAC inhibitors and Bromodomain inhibitors that reverse EMT. These data support the use of morphological screening of mesenchymal mammary tumor organoids as a platform to identify drugs that reverse EMT.


2021 ◽  
Vol 10 (7) ◽  
pp. 473
Author(s):  
Qingying Yu ◽  
Chuanming Chen ◽  
Liping Sun ◽  
Xiaoyao Zheng

Urban hotspot area detection is an important issue that needs to be explored for urban planning and traffic management. It is of great significance to mine hotspots from taxi trajectory data, which reflect residents’ travel characteristics and the operational status of urban traffic. The existing clustering methods mainly concentrate on the number of objects contained in an area within a specified size, neglecting the impact of the local density and the tightness between objects. Hence, a novel algorithm is proposed for detecting urban hotspots from taxi trajectory data based on nearest neighborhood-related quality clustering techniques. The proposed spatial clustering algorithm not only considers the maximum clustering in a limited range but also considers the relationship between each cluster center and its nearest neighborhood, effectively addressing the clustering issue of unevenly distributed datasets. As a result, the proposed algorithm obtains high-quality clustering results. The visual representation and simulated experimental results on a real-life cab trajectory dataset show that the proposed algorithm is suitable for inferring urban hotspot areas, and that it obtains better accuracy than traditional density-based methods.


Sign in / Sign up

Export Citation Format

Share Document