nearest neighbor
Recently Published Documents





Parita Shah ◽  
Priya Swaminarayan ◽  
Maitri Patel

<span>Opinion analysis is by a long shot most basic zone of characteristic language handling. It manages the portrayal of information to choose the motivation behind the wellspring of the content. The reason might be of a type of gratefulness (positive) or study (negative). This paper offers a correlation between the outcomes accomplished by applying the calculation arrangement using various classifiers for instance K-nearest neighbor and multinomial naive Bayes. These techniques are utilized to assess a significant assessment with either a positive remark or negative remark. The gathered information considered on the grounds of the extremity film datasets and an association with the results accessible proof has been created for a careful assessment. This paper investigates the word level count vectorizer and term frequency inverse document frequency (TF-IDF) influence on film sentiment analysis. We concluded that multinomial Naive Bayes (MNB) classier generate more accurate result using TF-IDF vectorizer compared to CountVectorizer, K-nearest-neighbors (KNN) classifier has the same accuracy result in case of TF-IDF and CountVectorizer.</span>

2022 ◽  
Vol 9 (2) ◽  
pp. 119-127
Alrige et al. ◽  

This study aims to utilize the machine learning technique to build a model to recommend the suitable wind turbine type based on some variables, such as air speed and air density, as well as visualize the location of the recommended wind turbine selection on a 3D map. Particularly, we applied the K-nearest neighbor model (KNN) to determine the amount of energy produced by a single wind turbine. We applied it on 10 separate wind farms in Saudi Arabia. The results indicate that the model performs very well in predicting the best wind turbine type with the mean accuracy of 88%, where ten wind stations resulted from the optimized model with the suggested turbine type in each station. Adding more wind attributes and other factors may assist in increasing the model mean accuracy. The project’s findings will assist decision-makers in Saudi Arabia to make informed decisions as to what kind of wind turbine is suitable for a specific location. In the long run, this will help to make wind energy-a sustainable source of energy-one of the main goals of the 2030 vision, specifically under National Industrial Development and Logistics Program.

2022 ◽  
Vol 18 (1) ◽  
pp. 1-63
Siu-Wing Cheng ◽  
Man-Kit Lau

We propose a dynamic data structure for the distribution-sensitive point location problem in the plane. Suppose that there is a fixed query distribution within a convex subdivision S , and we are given an oracle that can return in O (1) time the probability of a query point falling into a polygonal region of constant complexity. We can maintain S such that each query is answered in O opt (S) ) expected time, where opt ( S ) is the expected time of the best linear decision tree for answering point location queries in S . The space and construction time are O(n log 2 n ), where n is the number of vertices of S . An update of S as a mixed sequence of k edge insertions and deletions takes O(k log 4 n) amortized time. As a corollary, the randomized incremental construction of the Voronoi diagram of n sites can be performed in O(n log 4 n ) expected time so that, during the incremental construction, a nearest neighbor query at any time can be answered optimally with respect to the intermediate Voronoi diagram at that time.

Minerals ◽  
2022 ◽  
Vol 12 (1) ◽  
pp. 103
Timofey Timkin ◽  
Mahnaz Abedini ◽  
Mansour Ziaii ◽  
Mohammad Reza Ghasemi

In this study, the zonality method has been used to separate geochemical anomalies and to calculate erosional levels in the regional scale for porphyry-Cu deposit, Abrisham-Rud (Semnan province, East of Iran). In geochemical maps of multiplicative haloes, the co-existence of both the supra-ore elements and sub-ore elements local maxima implied blind mineralization in the northwest of the study area. Moreover, considering the calculated zonality indices and two previously presented geochemical models, E and NW of the study have been introduced as ZDM and BM, respectively. For comparison, the geological layer has been created by combining rock units, faults, and alterations utilizing the K-nearest neighbor (KNN) algorithm. The rock units and faults have been identified from the geological map; moreover, alterations have been detected by using remote sensing and ASTER images. In the geological layer map related to E of the study area, many parts have been detected as high potential areas; in addition, both geochemical and geological layer maps only confirmed each other at the south of this area and suggested this part as high potential mineralization. Therefore, high potential areas in the geological layer map could be related to the mineralization or not. Due to the incapability of the geological layer in identifying erosional levels, mineralogy investigation could be used to recognize this level; however, because of the high cost, mineralogy is not recommended for application on a regional scale. The findings demonstrated that the zonality method has successfully distinguished geochemical anomalies including BM and ZDM without dependent on alteration and was able to predict erosional levels. Therefore, this method is more powerful than the geological layer.

2022 ◽  
Vol 12 (2) ◽  
pp. 856
Branislav Dimitrijevic ◽  
Sina Darban Khales ◽  
Roksana Asadi ◽  
Joyoung Lee

Highway crashes, along with the property damage, personal injuries, and fatalities that they cause, continue to present one of the most significant and critical transportation problems. At the same time, provision of safe travel is one of the main goals of any transportation system. For this reason, both in transportation research and practice much attention has been given to the analysis and modeling of traffic crashes, including the development of models that can be applied to predict crash occurrence and crash severity. In general, such models assess short-term crash risks at a given highway facility, thus providing intelligence that can be used to identify and implement traffic operations strategies for crash mitigation and prevention. This paper presents several crash risk and injury severity assessment models applied at a highway segment level, considering the input data that is typically collected or readily available to most transportation agencies in real-time and at a regional network scale, which would render them readily applicable in practice. The input data included roadway geometry characteristics, traffic flow characteristics, and weather condition data. The paper develops, tests, and compares the performance of models that employ Random effects Bayesian Logistics Regression, Gaussian Naïve Bayes, K-Nearest Neighbor, Random Forest, and Gradient Boosting Machine methods. The paper applies random oversampling examples (ROSE) method to deal with the problem of data imbalance associated with the injury severity analysis. The models were trained and tested using a dataset of 10,155 crashes that occurred on two interstate highways in New Jersey over a two-year period. The paper also analyzes the potential improvement in the prediction abilities of the tested models by adding reactive data to the analysis. To that end, traffic crashes were classified in multiple classes based on the driver age and the vehicle age to assess the impact of these attributes on driver injury severity outcomes. The results of this analysis are promising, showing that the simultaneous use of reactive and proactive data can improve the prediction performance of the presented models.

Xiongwen Chen ◽  
Qian Wang ◽  
Ping Wu ◽  
Guanghui Zhou

Abstract We propose an AA-stacked multilayer graphene nanoribbon with two symmetrical armchair edges as a multiple flat-band (FB) material. Using the tight-binding Hamiltonian and Green’s function method, we find that the FBs are complete and merged into many dispersive bands. The FBs cause multiple strongly localized states (SLSs) at the sites of the odd lines in every sublayer and a giant optical absorption (GOA) at energy point 2t, where t is the electronic intralayer hopping energy between two nearest-neighbor sites. By driving an electric field perpendicular to the ribbon plane, the bandgaps of the FBs are tunable. Accordingly, the positions of the SLSs in the energy regime can be shifted. However, the position of the GOA is robust against such field, but its strength exhibits a collapse behavior with a fixed quantization step. On the contrary, by driving an electric field parallel to the ribbon plane, the completeness of FBs is destroyed. Resultantly, the SLSs and GOA are suppressed and even quenched. Therefore, such ribbons may be excellent candidates for the design of the controllable information-transmission and optical-electric nanodevices.

2022 ◽  
Vol 10 (4) ◽  
pp. 476-487
Erysta Risky Rismia ◽  
Tatik Widiharih ◽  
Rukun Santoso

The characteristics of society in choosing contraceptive methods are also the crucial factors for the government to prepare the family planning services needed at Bulakamba District, Brebes Regency, Central Java. In this case, a classification process needs to be done to assist the process of classifying the characteristics of society in the selection of contraceptive methods. Multinomial Logistic Regression classification is good in exploring data information  meanwhile Fuzzy K Nearest Neighbor (FK-NN) classification is good for handling big data and noise. These two methods used in this study because they are relevant to the data applied and will be compared their classification accuracy through APER and Press's Q calculations.The classification accuracy results obtained based on the APER calculation for Multinomial Logistic Regression is 88,25% and Fuzzy K Nearest Neighbor (FK-NN) is 88,92%.  Meanwhile, the Press's Q value of both methods are 9600,945 and 9518,014 greater than χ 2𝛼,1 which is 3,841, so that it is statistically accurate. Based on the results obtained, it can be concluded that Multinomial Logistic Regression classification method has a better classification accuracy than Fuzzy K Nearest Neighbor (FK-NN) method. 

2022 ◽  
Vol 2022 ◽  
pp. 1-8
Xiushan Zhang

Based on the understanding and comparison of various main recommendation algorithms, this paper focuses on the collaborative filtering algorithm and proposes a collaborative filtering recommendation algorithm with improved user model. Firstly, the algorithm considers the score difference caused by different user scoring habits when expressing preferences and adopts the decoupling normalization method to normalize the user scoring data; secondly, considering the forgetting shift of user interest with time, the forgetting function is used to simulate the forgetting law of score, and the weight of time forgetting is introduced into user score to improve the accuracy of recommendation; finally, the similarity calculation is improved when calculating the nearest neighbor set. Based on the Pearson similarity calculation, the effective weight factor is introduced to obtain a more accurate and reliable nearest neighbor set. The algorithm establishes an offline user model, which makes the algorithm have better recommendation efficiency. Two groups of experiments were designed based on the mean absolute error (MAE). One group of experiments tested the parameters in the algorithm, and the other group of experiments compared the proposed algorithm with other algorithms. The experimental results show that the proposed method has better performance in recommendation accuracy and recommendation efficiency.

Sign in / Sign up

Export Citation Format

Share Document