Data-driven integration evaluation from the perspective of Adaboost and its application in WeChat public number ranking

2020 ◽  
Vol 309 ◽  
pp. 02017
Author(s):  
Yicheng Gong ◽  
Juan Zhao ◽  
Dongyang Zhang

Traditional comprehensive evaluation is difficult to model when dealing with large data sets that have many parameters and a complex structure, and it cannot adapt as the data are updated. To improve this situation, this paper draws on the adaptive-learning perspective of Adaboost in statistical learning to develop a data-driven integrated evaluation model that updates both the sample weights and the weights of the weak evaluation models with the data. Three weak evaluation models were selected: the data-driven Topsis method, the principal component analysis method and the factor analysis method. Taking the ranking of WeChat public accounts as an example, the results show that the accuracy of the integrated evaluation model is 88.57%, which is 17.14%, 31.43% and 28.57% higher than that of the data-driven Topsis method, the principal component method and the factor analysis method, respectively.
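The weight updates described in this abstract follow the general pattern of a classical AdaBoost round. The following is a minimal sketch of that textbook round, assuming each sample is simply marked as evaluated correctly or incorrectly by one weak model. The function name `adaboost_round` and the toy data are illustrative assumptions; the abstract does not give the paper's own evaluation-specific update rule.

```python
import math


def adaboost_round(weights, correct):
    """One textbook AdaBoost round (illustrative, not the paper's exact rule).

    weights: sample weights that sum to 1.
    correct: booleans, True where the weak model got the sample right.
    Returns (alpha, new_weights): the weak model's weight and the
    renormalized sample weights.
    """
    # Weighted error of the weak model.
    eps = sum(w for w, c in zip(weights, correct) if not c)
    if not 0.0 < eps < 1.0:
        raise ValueError("weighted error must lie strictly between 0 and 1")
    # Model weight: larger when the weighted error is smaller.
    alpha = 0.5 * math.log((1.0 - eps) / eps)
    # Shrink weights of correct samples, grow weights of incorrect ones.
    raw = [w * math.exp(-alpha if c else alpha) for w, c in zip(weights, correct)]
    total = sum(raw)
    return alpha, [w / total for w in raw]


# Toy example (made-up data): four samples, the last one is misjudged.
alpha, new_weights = adaboost_round([0.25] * 4, [True, True, True, False])
```

After one round the misjudged sample carries half of the total weight, so the next weak model is pushed to focus on it.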

2022 ◽  
pp. 146808742110707
Author(s):  
Aran Mohammad ◽  
Reza Rezaei ◽  
Christopher Hayduk ◽  
Thaddaeus Delebinski ◽  
Saeid Shahpouri ◽  
...  

The development of internal combustion engines is shaped by exhaust gas emissions legislation and the drive to increase performance. This calls for engine-out emission models that can be used for engine optimization under real driving emission controls. The prediction capability of physically based and data-driven engine-out emission models is influenced by the system inputs, which are specified by the user, and accuracy can improve as the number of inputs increases. At the same time, irrelevant inputs become more probable; these have a weak functional relation to the emissions and can lead to overfitting. Alternatively, data-driven methods can be used to detect irrelevant and redundant inputs. In this work, thermodynamic states are modeled based on 772 stationary test bench measurements from a commercial vehicle diesel engine. Afterward, 37 measured and modeled variables are fed into a data-driven dimensionality reduction. For this purpose, supervised learning approaches, such as lasso regression and the linear support vector machine, and unsupervised learning methods, such as principal component analysis and factor analysis, are applied to select and extract the relevant features. The selected and extracted features are used for regression by the support vector machine and the feedforward neural network to model the NOx, CO, HC, and soot emissions. This enables an evaluation of the modeling accuracy as a result of the dimensionality reduction. Using the methods in this work, the 37 variables are reduced to 25, 22, 11, and 16 inputs for NOx, CO, HC, and soot emission modeling, respectively, while maintaining the accuracy. The features selected using the lasso algorithm provide more accurate learning of the regression models than the features extracted through principal component analysis and factor analysis. This results in test errors RMSETe for modeling NOx, CO, HC, and soot emissions of 19.22 ppm, 6.46 ppm, 1.29 ppm, and 0.06 FSN, respectively.
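Lasso-based feature selection, as mentioned in this abstract, works because the L1 penalty drives the coefficients of weakly related inputs to exactly zero. The following is a minimal sketch using textbook coordinate descent with soft thresholding on the objective (1/2n)·||y − Xb||² + λ·||b||₁. The functions `soft_threshold` and `lasso_cd`, the penalty value, and the toy data are illustrative assumptions, not the authors' implementation or data.

```python
def soft_threshold(rho, lam):
    """Shrink rho toward zero by lam; values within [-lam, lam] become 0."""
    if rho > lam:
        return rho - lam
    if rho < -lam:
        return rho + lam
    return 0.0


def lasso_cd(X, y, lam, sweeps=100):
    """Textbook coordinate descent for (1/2n)||y - Xb||^2 + lam * ||b||_1."""
    n, p = len(X), len(X[0])
    b = [0.0] * p
    for _ in range(sweeps):
        for j in range(p):
            # Correlation of feature j with the residual that ignores feature j.
            rho = sum(
                X[i][j] * (y[i] - sum(X[i][k] * b[k] for k in range(p) if k != j))
                for i in range(n)
            ) / n
            z = sum(X[i][j] ** 2 for i in range(n)) / n
            b[j] = soft_threshold(rho, lam) / z
    return b


# Toy data (made up): feature 0 drives y strongly, feature 1 only weakly.
X = [[1.0, 1.0], [-1.0, 1.0], [1.0, -1.0], [-1.0, -1.0]]
y = [2.0 * row[0] + 0.05 * row[1] for row in X]
coef = lasso_cd(X, y, lam=0.1)
selected = [j for j, c in enumerate(coef) if c != 0.0]
```

In this toy case the weak input is dropped and only feature 0 is kept, which is the selection behavior the abstract relies on to reduce the 37 variables.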


2015 ◽  
Vol 14 (4) ◽  
pp. 101-108
Author(s):  
Pinchao Meng ◽  
Weishi Yin ◽  
Yanzhong Li

In this paper, 12 economic indices of the software industry in 30 cities/provinces in China are used to set up an evaluation system for the competitiveness of the regional software industry. Using the statistical method of factor analysis, an evaluation model of the comprehensive competitiveness of the software industry for each city/province is built. Taking Beijing and Shanghai as examples, the comprehensive competitiveness and problems of the software industry in Jilin province are compared and analyzed.


2015 ◽  
Vol 12 (2) ◽  
pp. 75 ◽  
Author(s):  
Osman Erol ◽  
Yusuf Levent Şahin ◽  
Eray Yılmaz ◽  
Halil İbrahim Haseski

The aim of this study is to develop a scale for determining internet users' behavior related to cyber security. First, a 26-item pool was created in accordance with expert opinion. To test construct validity, this item pool was administered to 810 people who used an application on the Facebook social network, and exploratory factor analysis was performed. The exploratory factor analysis used principal component analysis, the most commonly used method, with Varimax orthogonal rotation to determine the factor structure of the scale. For the confirmation study, a structural equation modeling approach was applied to data from 292 people who used the same social network application, after excluding those who had already taken the scale. The exploratory factor analysis yielded a scale of 5 factors and 25 items; according to the fit indices obtained from the confirmatory factor analysis, the resulting "Personal Cyber Security Provision Scale" has a good fit.


Author(s):  
Ulil Hamida

Policies related to the automotive industry have become significant for the Ministry of Industry. The difficulty in setting these policies is determining which factors are important for the automotive industry, so that the policies formulated are right on target. These important factors can be searched for using the factor analysis method. So far, no studies have been conducted to examine the factors that influence the growth of the automotive industry. In this study, factor analysis is performed on factors in the automotive industry using the principal component analysis algorithm. The algorithm seeks to describe independently the aspects that become the main factors in determining the automotive industry. Based on an analysis of factors in automotive industry production, the most influential factors are foreign investment, vehicle ownership ratios, and, lastly, the change in GDP.


2018 ◽  
Vol 2 (1) ◽  
pp. 19
Author(s):  
Trisha Gilang Saraswati

Consumer behavior when shopping for furniture differs greatly from shopping for other goods or services, because furniture is expected to be kept and used for a long time. Consequently, consumers tend to prefer shopping in offline stores, where they can see, feel and check quality directly, because many aspects are assessed, such as the quality of materials, models, colors and more. Despite the trend toward buying furniture offline, IKEA continues to innovate to improve the performance of its website so that consumers can shop online. This study therefore analyzes which factors drive consumers to shop for furniture online, specifically on the website of IKEA Indonesia. From the two grounded theories employed in this research, there are 14 factors that can influence consumers to buy online. The data analysis uses Principal Component Analysis (PCA), a factor analysis method that extracts factors by using total variance in the analysis. The calculation shows that there are 8 factors driving consumers to purchase furniture online on IKEA Indonesia’s website: Enjoyment, Perceived Risk, Efficiency, Service & Merchandise Quality, Ease of Navigation, Price Attractiveness, Flexibility and Reliability. By knowing which factors affect consumers when shopping for furniture online, companies, in this case IKEA Indonesia, can optimize the use of their websites in accordance with the influential factors.


2012 ◽  
Vol 109 (9) ◽  
pp. 1662-1669 ◽  
Author(s):  
Hui Zuo ◽  
Zumin Shi ◽  
Baojun Yuan ◽  
Yue Dai ◽  
Xiaoqun Pan ◽  
...  

The aim of the present study was to examine the association between dietary patterns and insulin resistance in Chinese adults without known diabetes. Study subjects were 1070 Chinese adults aged 18 years and above in Jiangsu Province who participated in the 2006 wave of the China Health and Nutrition Survey. Usual dietary intake was assessed by using a validated FFQ. Dietary patterns were identified by factor analysis using a principal component analysis method. Insulin resistance was defined as the highest quartile of the homeostasis model assessment of insulin resistance (HOMA-IR) scores. We derived four dietary patterns in our population by factor analysis: the Western, High-wheat, Traditional and Hedonic patterns. After adjustment for potential confounders, the Western pattern was significantly associated with greater odds of insulin resistance (P for trend = 0·009), while a significant negative association was found between the Hedonic pattern and insulin resistance (P for trend = 0·035). Compared with the lowest quartile of the Western pattern, the highest quartile had higher odds of insulin resistance (adjusted OR 1·89, 95 % CI 1·12, 3·19). There was a 42 % decrease in the odds after adjustment for all covariates in the highest quartile of the Hedonic pattern, compared with the lowest quartile (adjusted OR 0·58, 95 % CI 0·34, 0·99). HOMA-IR levels as a continuous variable also increased across the quartiles of the Western pattern and decreased across the quartiles of the Hedonic pattern. In conclusion, dietary patterns were significantly associated with insulin resistance in Chinese adults without known diabetes.
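Two quantities in this abstract can be checked with simple arithmetic. The sketch below assumes the commonly used HOMA-IR formula (fasting insulin in µU/mL times fasting glucose in mmol/L, divided by 22.5); the abstract itself does not state the formula or units used. It also shows how an odds ratio maps to a percentage change in odds. The function names and the insulin and glucose values are illustrative assumptions.

```python
def homa_ir(fasting_insulin_uU_ml, fasting_glucose_mmol_l):
    """Commonly used HOMA-IR formula (assumed here, not stated in the abstract)."""
    return fasting_insulin_uU_ml * fasting_glucose_mmol_l / 22.5


def percent_change_in_odds(odds_ratio):
    """Percentage change in odds implied by an odds ratio (negative = decrease)."""
    return (odds_ratio - 1.0) * 100.0


# Hypothetical subject: insulin 10 uU/mL, glucose 5 mmol/L.
example_score = homa_ir(10.0, 5.0)

# Odds ratios reported in the abstract.
hedonic_change = percent_change_in_odds(0.58)  # the reported 42 % decrease
western_change = percent_change_in_odds(1.89)  # an 89 % increase in odds
```

The adjusted OR of 0·58 for the Hedonic pattern corresponds to the 42 % decrease in odds quoted in the abstract.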


2013 ◽  
Vol 2 (4) ◽  
pp. 23
Author(s):  
PUTU OKA SURYA ARSANA ◽  
MADE SUSILAWATI ◽  
KETUT JAYANEGARA

Besides being famous for tourism, Bali is also known for its agriculture. One example is subak, which is a world cultural heritage. To cope with this problem, the development of agriculture should be increased. The goals of this research are to identify the factors of agricultural development in Bali, the most dominant factors, and the variables that represent the development of agriculture in Bali. The method of analysis used in this research is factor analysis. Factor analysis is used to reduce or summarize the data: the variables are transformed into new variables called factors, which still retain much of the information contained in the original variables. The extraction method used in the factor analysis is principal component analysis. The number of factors is determined by the eigenvalues. The factor rotation used is varimax rotation. The results give seven factors that together explain 76.417% of the variance. Dryland farming is the most dominant factor, with the largest eigenvalue of 4.564, or 25.356% of the variance, and the variables representing this factor are widely planted potatoes and pulses.
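The eigenvalue and percentage reported in this abstract are linked by a simple relation. When principal component analysis is run on a correlation matrix, each standardized variable contributes one unit of variance, so a factor's share of variance is its eigenvalue divided by the number of variables. The sketch below applies that relation under this assumption. The variable count of 18 is inferred from the reported figures and is not stated in the abstract, and the function names are illustrative.

```python
def explained_share(eigenvalue, n_variables):
    """Share of total variance for one factor, assuming correlation-matrix PCA."""
    return eigenvalue / n_variables


def implied_variable_count(eigenvalue, share):
    """Number of variables implied by an eigenvalue and its variance share."""
    return eigenvalue / share


# Figures reported in the abstract: eigenvalue 4.564, share 25.356 %.
n_implied = round(implied_variable_count(4.564, 0.25356))

# Recomputing the share from the inferred variable count.
share_percent = round(explained_share(4.564, n_implied) * 100.0, 3)
```

Under this assumption the two reported numbers are mutually consistent.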


2014 ◽  
Vol 955-959 ◽  
pp. 4110-4118
Author(s):  
Xiao Jun Wang ◽  
Chao Lian Tu ◽  
Zu Shan Wang

Structuring and measuring eco-civilization construction is an important part of research on eco-civilization construction theory, and it has practical significance for measuring the level of eco-civilization construction and guiding its practice. The evaluation system for the eco-civilization construction level can be structured around five aspects: the quality of ecological economy and society, the development of the ecological base, ecological saving, ecological protection and ecological harmony. Factor analysis of the main parts of these five aspects shows that the quality of ecological economy and society plays the key role in the evaluation model of the eco-civilization construction level. Based on a cluster analysis of 2012 data for inland provinces and municipalities, the eco-civilization construction level of inland China can be grouped into different categories, with an obvious imbalance among different areas.

