scholarly journals Learning Partial Lexicographic Preference Trees and Forests over Multi-Valued Attributes

10.29007/xtl4 ◽  
2018 ◽  
Author(s):  
Xudong Liu ◽  
Mirek Truszczynski

\tit{Partial lexicographic preference trees}, or \tit{PLP-trees}, form an intuitive formalism for compact representation of qualitative preferences over combinatorial domains. We show that PLP-trees can be used to accurately model preferences arising in practical situations, and that high-accuracy PLP-trees can be effectively learned. We also propose and study learning methods for a variant of our model based on the concept of a PLP-forest, a collection of PLP-trees, where the preference order specified by a PLP-forest is obtained by aggregating the orders of its constituent PLP-trees. Our results demonstrate the potential of both approaches, with learning PLP-forests showing particularly promising behavior.

Author(s):  
Anna Evgenievna Kharitonova ◽  
Alina Alekseevna Sundupey ◽  
Svetlana Skachkova

The article provides a comparative analysis of the results of the Russian Agricultural Census of 2006 and 2016. As a result, there is a decrease in the number of agricultural producers, a decrease in the size of agricultural land and equipment in organizations. Against this background, one can see an increase in the concentration of production in both crop and livestock production. Machine learning models have been built to classify subsidy organizations using Python libraries. The accuracy of the constructed models was up to 86 %, which proves the possibility of their use. In the future, the use of machine learning methods will reduce the number of Russian agricultural census indicators and classify organizations with high accuracy according to qualitative characteristics.


2019 ◽  
Author(s):  
sugianto sugianto

The purpose of this article is to describe profession challenges of the Buddhist religion teacher in Tangerang. The method used in this study is descriptive qualitative. The study was conducted from March to June 2018 to Buddhist religion teacher in Tangerang. Data is collected through interviews. Validity of data by credibility, dependability, confirmability and transferbility tests. Data analysis using the Miles & Huberman model. Based on the results of research profession challenges of the Buddhist religion teacher in Tangerang is related to evocative, quality students, character building, reduced number of students, and educational facilities. To uncover these challenges, teachers will conduct quality learning through cooperative learning methods, the use of contemporary learning media, patience and caution in dealing with students, and provide additional assignments to attend a Buddhist Sunday school or devotional service.


2019 ◽  
Author(s):  
Zhenzhen Du ◽  
Yujie Yang ◽  
Jing Zheng ◽  
Qi Li ◽  
Denan Lin ◽  
...  

BACKGROUND Predictions of cardiovascular disease risks based on health records have long attracted broad research interests. Despite extensive efforts, the prediction accuracy has remained unsatisfactory. This raises the question as to whether the data insufficiency, statistical and machine-learning methods, or intrinsic noise have hindered the performance of previous approaches, and how these issues can be alleviated. OBJECTIVE Based on a large population of patients with hypertension in Shenzhen, China, we aimed to establish a high-precision coronary heart disease (CHD) prediction model through big data and machine-learning METHODS Data from a large cohort of 42,676 patients with hypertension, including 20,156 patients with CHD onset, were investigated from electronic health records (EHRs) 1-3 years prior to CHD onset (for CHD-positive cases) or during a disease-free follow-up period of more than 3 years (for CHD-negative cases). The population was divided evenly into independent training and test datasets. Various machine-learning methods were adopted on the training set to achieve high-accuracy prediction models and the results were compared with traditional statistical methods and well-known risk scales. Comparison analyses were performed to investigate the effects of training sample size, factor sets, and modeling approaches on the prediction performance. RESULTS An ensemble method, XGBoost, achieved high accuracy in predicting 3-year CHD onset for the independent test dataset with an area under the receiver operating characteristic curve (AUC) value of 0.943. Comparison analysis showed that nonlinear models (K-nearest neighbor AUC 0.908, random forest AUC 0.938) outperform linear models (logistic regression AUC 0.865) on the same datasets, and machine-learning methods significantly surpassed traditional risk scales or fixed models (eg, Framingham cardiovascular disease risk models). Further analyses revealed that using time-dependent features obtained from multiple records, including both statistical variables and changing-trend variables, helped to improve the performance compared to using only static features. Subpopulation analysis showed that the impact of feature design had a more significant effect on model accuracy than the population size. Marginal effect analysis showed that both traditional and EHR factors exhibited highly nonlinear characteristics with respect to the risk scores. CONCLUSIONS We demonstrated that accurate risk prediction of CHD from EHRs is possible given a sufficiently large population of training data. Sophisticated machine-learning methods played an important role in tackling the heterogeneity and nonlinear nature of disease prediction. Moreover, accumulated EHR data over multiple time points provided additional features that were valuable for risk prediction. Our study highlights the importance of accumulating big data from EHRs for accurate disease predictions.


Kybernetes ◽  
2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Yuyan Luo ◽  
Zheng Yang ◽  
Yuan Liang ◽  
Xiaoxu Zhang ◽  
Hong Xiao

PurposeBased on climate issues and carbon emissions, this study aims to promote low-carbon consumption and compel consumers to actively shift to energy-saving appliances. In this big data era, online reviews in social and electronic commerce (e-commerce) websites contain valuable product information, which can facilitate firm business strategies and consumer comparison shopping. This study is designed to advance existing research on energy-saving refrigerators by incorporating machine learning models in the analysis of online reviews to provide valuable suggestions to e-commerce platform managers and manufacturers to effectively understand the psychological cognition of consumers.Design/methodology/approachThis study proposes an online e-commerce review mining and management strategy model based on “data acquisition and cleaning, data mining and analysis and strategy formation” through multiple machine learning methods, namely, Bayes networks, support vector machine (SVM), latent Dirichlet allocation (LDA) and importance–performance analysis (IPA), to help managers.FindingsBased on a case study of one of the largest e-commerce platforms in China, this study linguistically analyzes 29,216 online reviews of energy-saving refrigerators. Results indicate that the energy-saving refrigerator features that consumers are generally satisfied with are, in sequential order, logistics, function, price, outlook, after-sales service, brand, quality and space. This study also identifies ten topics with 100 keywords by analyzing 18 different refrigerator models. Finally, based on the IPA, this study allocates different priorities to the features and provides suggestions from the perspective of consumers, the government and manufacturers.Research limitations/implicationsIn terms of limitations, future research may focus on the following points. First, the topics identified in this study derive from specific points in time and reviews; thus, the topics may change with the text data. A machine learning-based online review analysis platform could be developed in the future to dynamically improve consumer satisfaction. Moreover, given that consumers' needs may change over time, e-commerce platform types and consumer characteristics, such as user profiles, can be incorporated into the model to effectively analyze trends in consumers' perceived dimensions.Originality/valueThis study fills the gap in previous research in this field, which uses small-sample data for qualitative analysis, while integrating management ideas and proposes an online e-commerce review mining and management strategy model based on machine learning methods. Moreover, this study considers how consumers' emotional and thematic preferences for products affect their purchase decision-making from the perspective of their psychological perception and linguistically analyzes online reviews of energy-saving refrigerators using the proposed mining model. Through the improved IPA model, this study provides optimizing strategies to help e-commerce platform managers and manufacturers.


2015 ◽  
Vol 32 (2) ◽  
Author(s):  
Jiaqin Huang ◽  
Xianghua Huang ◽  
Tianhong Zhang

AbstractIn the simulation of engine-propeller integrated control system for a turboprop aircraft, a real-time propeller model with high-accuracy is required. A study is conducted to compare the real-time and precision performance of propeller models based on strip theory and lifting surface theory. The emphasis in modeling by strip theory is focused on three points as follows: First, FLUENT is adopted to calculate the lift and drag coefficients of the propeller. Next, a method to calculate the induced velocity which occurs in the ground rig test is presented. Finally, an approximate method is proposed to obtain the downwash angle of the propeller when the conventional algorithm has no solution. An advanced approximation of the velocities induced by helical horseshoe vortices is applied in the model based on lifting surface theory. This approximate method will reduce computing time and remain good accuracy. Comparison between the two modeling techniques shows that the model based on strip theory which owns more advantage on both real-time and high-accuracy can meet the requirement.


2011 ◽  
Vol 121-126 ◽  
pp. 4870-4874
Author(s):  
Miao Li ◽  
Hui Bin Gao

To meet the requirement of high tracking accuracy as well as develop more reasonable evaluation method, in this paper, the General Regression Neural Network (GRNN) has been applied to build the tracking error model of the theodolite. First, we analyze the nonlinear factors in the theodolite. Second, we discuss the principle of GRNN, including its structure, the function as well as its priors. Third, we build the tracking error model based on GRNN and verify the model through the different parameters. The result indicated that the network model based on GRNN has high accuracy and good generalization ability. It could instead the real system to a certain extent. The research in this paper has important value to the engineering practice.


Sign in / Sign up

Export Citation Format

Share Document