scholarly journals Three methods of coding nominal variables in regression analysis

2021 ◽  
Vol 4 ◽  
pp. 35-45
Author(s):  
S.M. Lapach ◽  

The paper compares three methods of coding nominal variables in regression analysis: coding of each level as a separate variable, coding with binary code, numbering of factor levels. Although these methods have existed for a long time and even have a theoretical justification (except for encoding with binary code), there were no recommendations and comparisons for their practical application. The features of the application of each method and the existing limitations are analyzed. In the article, there are considered two examples that provide a detailed comparison of these three methods. Comparative analysis has been carried out in the following areas: the presence of restrictions in use; statistical properties of plans; labour intensity and difficulty of obtaining mathematical models and the final result of their building; convenience of semantic analysis and use. Additionally, there have been made comparisons with models based on Chebyshev orthogonal polynomials. It has been established that different methods of coding nominal variables, when used correctly, lead to regression models that are approximately identical in their properties. Moreover, the method of encoding each level as a separate variable is possible only if there are experiments in which there is no nominal variable as an influence effect. The binary coding method is inconvenient to use with a large number of levels of variation of the nominal variable and inconvenient to analyze. When coding by level numbering, it is necessary that the average response values, according to the dispersion diagram of this factor, are sorted by value in accordance with the assigned numbers. With this encoding method, a natural number of factors is preserved. Sharply distinguishable best results are achieved with this coding method using Chebyshev orthogonal polynomials. The highest accuracy and uniformity of approximation are ensured.

Author(s):  
Germa´n L. Di´az-Cuevas ◽  
Roger F. Ngwompo

A binary encoding method for bond graphs that can be used for genetic algorithms (GAs) applications is presented. The originality of the proposed coding is that it encompasses causal information. This ensures that causal analysis is taken into account in assessing the fitness of topologies generated in GA operations and the suitability of design candidates to meet performance specifications can be tested directly from the binary code as the model equations can be derived from it. The code is suitable for GAs applications on bond graphs (BG) for topology and parameter optimisation in automated synthesis of dynamic systems. The coding method and its possible applications are illustrated through worked examples.


2019 ◽  
Vol 12 (8) ◽  
pp. 4241-4259 ◽  
Author(s):  
Sylke Boyd ◽  
Stephen Sorenson ◽  
Shelby Richard ◽  
Michelle King ◽  
Morton Greenslit

Abstract. Halo displays, in particular the 22∘ halo, have been captured in long time series of images obtained from total sky imagers (TSIs) at various Atmospheric Radiation Measurement (ARM) sites. Halo displays form if smooth-faced hexagonal ice crystals are present in the optical path. We describe an image analysis algorithm for long time series of TSI images which scores images with respect to the presence of 22∘ halos. Each image is assigned an ice halo score (IHS) for 22∘ halos, as well as a photographic sky type (PST), which differentiates cirrostratus (PST-CS), partially cloudy (PST-PCL), cloudy (PST-CLD), or clear (PST-CLR) within a near-solar image analysis area. The color-resolved radial brightness behavior of the near-solar region is used to define the discriminant properties used to classify photographic sky type and assign an ice halo score. The scoring is based on the tools of multivariate Gaussian analysis applied to a standardized sun-centered image produced from the raw TSI image, following a series of calibrations, rotation, and coordinate transformation. The algorithm is trained based on a training set for each class of images. We present test results on halo observations and photographic sky type for the first 4 months of the year 2018, for TSI images obtained at the Southern Great Plains (SGP) ARM site. A detailed comparison of visual and algorithm scores for the month of March 2018 shows that the algorithm is about 90 % reliable in discriminating the four photographic sky types and identifies 86 % of all visual halos correctly. Numerous instances of halo appearances were identified for the period January through April 2018, with persistence times between 5 and 220 min. Varying by month, we found that between 9 % and 22 % of cirrostratus skies exhibited a full or partial 22∘ halo.


2020 ◽  
Vol 8 (4) ◽  
pp. 76
Author(s):  
Muhannad Atmeh ◽  
Mohammad Shaban ◽  
Malek Alsharairi

The relationship between companies and society has been questioned for a long time. However, the effect of the motives behind CSR regarding the companies’ actual engagement with CSR has received little attention, especially in emerging markets. This paper tackles this issue for the first time using a sample of Jordanian companies. We explore the effect of two types of motives on the level of engagement in CSR: extrinsic motive (financial) and intrinsic motives (ethical and altruistic). The relationship between the company’s actual financial performance and CSR is also investigated. Primary data were collected using a questionnaire, distributed to Jordanian company’s managers in five sectors: pharmaceutical, technology and telecommunication, construction, farming, and financial services. Multiple regression analysis was conducted to depict the relationships. Results show that the intrinsic motives have a significant effect on CSR, while the extrinsic motive has none. When intrinsic motives were tested separately, results showed that the ethical motive had a significant effect, while the altruistic had no effect. In both cases, CSR was shown to be more significantly driven by the company’s financial performance. Different stakeholders such as policymakers, entrepreneurs, researchers, and investors may use the results of this study to increase companies’ involvement in CSR.


2019 ◽  
Vol 7 (3) ◽  
pp. 29-38
Author(s):  
M Sinurat

 The people in Samosir believe that pigs are livestock that bring many benefits. That is why many people raise pigs. At this time there is a downward trend in pig population.This research was conducted at the people's farm in Simanindo District, Samosir Regency.   This study was conducted from May until July 2019 and aimed to investigate the factors that affect the rate of decline in the pig population. The result of research will show dominant factors that causes the rate of decline in the pig population and improvement could be conducted to prevent pig population decline trend. The method used is multiple linear regression analysis with the variable of animal feed limitations, farmer experience, long time raising, local environmental customs, religious holiday production. This research was conducted by asking the farmer directly and giving a randomly selected questionnaire. In total there were 84 pig farmers in 4 villages in each district. The results of this study indicated that the limitations of feed, long time raising, environmental customs around and religious day have a very significant influence on the decline in pig population in Simanindo District Samosir Regency.


2019 ◽  
Vol 32 (3) ◽  
pp. 49-62
Author(s):  
S. D. Lysenko

The article republishes the ceramics and bronze decorations of Komarovo and Sosnitsa cultures of the Trzciniec cultural circle (TCC) from the exposition of the Archaeological Museum of the Institute of Archeology of the National Academy of Sciences of Ukraine. Detailed descriptions of the exhibits and their author’s graphic reconstructions are given. These things, which became a textbook long time ago, previously were published only in the form of schematic and inaccurate drawings or in the form of not very high-quality photographs. The materials originate from the sites of the northern part of the forest-steppe eastern Volhynia (Wojciechowka), Kiev (Ukrainka, Zavalovka, Zdvizhevka, Plitovische, Gostomel) and Chernigov (Rudnya) Polesie. The finds refer to different periods of the formation and development of the TСC and date back to 2nd thousand BC. Special attention is paid to vessels discovered in 1956 by N. T. Evstropov at the site Gostomel, Stekol’nyy zavod 1. S. S. Berezanskaya came to the conclusion that this point is not a household site, but «a small soil burial ground with burning», referring to a series of observations of N. T. Evstropova. Detailed comparison of the primary publication of N. T. Evstropov with subsequent re-publication of the site of S. S. Berezanskaya, allow us to call into question conclusions of the latter. One of the reasons for skepticism is the miraculous transformation of «destroyed teeth of a ruminant animal» (in N. T. Evstropov publication) into «worn-out» and «calcified bones» (in S. S. Berezanskaya publications). Fragments of the teeth of a ruminant animal (bull?) are still folded into one of the vessels exhibited at the Archaeological Museum of Institute of Archeology of the National Academy of Sciences of Ukraine. Could the vessels from Gostomel be connected with ritual-funeral practice, such as the vessels found in the ritual-funeral complexes of the Malopolovetskoe, Wojciechowka, Bukovna cemeteries? It is possible, but there is no evidence for this. Similarly, they can be associated with any other rituals, as well as with the remains of an ordinary household complex. N. T. Evstropov did it at the first publication of the site. Analysis of the ceramic complex allows us to attribute the Gostomel, Stekol’nyy zavod 1 point to the turn of the middle and late stages of the TCC Sosnitsa culture and date it to the end of the 13th — the beginning of the 12th centuries BC.


2017 ◽  
Vol II (I) ◽  
pp. 73-84
Author(s):  
Niaz Ali ◽  
Muhammad Tariq ◽  
Asia Baig

This study investigates the business cycle characteristics for Pakistan using three sets of variables namely expenditure components of GDP, nominal variables and real variables. The findings reveal that the volatility of expenditure components are greater than GDP during the full sample of 1973 to 2015. Whereas, in the Pre-SAP and Post-SAP periods i.e. 1973-1988 and 1989-2015, real variables and nominal variables show more volatility than GDP. And, in terms of co-movement, expenditure components of GDP showed strong pro-cyclicality and relationship with GDP against other sets of variables. Moreover, the nominal variables show positive persistence and the business cycles caused by it, lasting for a long time against real variables and expenditure components of GDP. Furthermore, the results show that the correlation between CPI and GDP across all periods is counter cyclical. The stability test results show that business cycles features remained stable during two time periods.


2020 ◽  
Vol 2020 ◽  
pp. 1-8
Author(s):  
Yumi Suzuki ◽  
Sachiko Tsubakino ◽  
Hiromi Fujii

Patients with cerebrovascular disorders are often forced to rest, with early prognosis made by bedside examination. However, overloading, for example, talking for a long time, may worsen the condition. We hypothesized that activities of daily living (ADL) from the Functional Independence Measure (FIM) that were actually performed regularly are useful to predict prognosis. The present study was aimed at determining the predictive items related to predicting prognosis from the status of early motor paralysis and ADL in patients with acute middle cerebral artery (MCA) infarction. We examined 367 patients with MCA infarction for Brunnstrom recovery stage (BRS) and FIM within 4 days of admission and modified the Rankin Scale before onset and just before discharge. Logistic regression analysis was used to compare two groups of patients based on their postdischarge destination (Home/another hospital or facility). The logistic regression analysis showed the following: BRS Hand: odds ratio (OR) 1.641 (95% CI 1.642 (1.336–2.017), p<0.001); FIM Grooming: OR 1.279 (95% CI 1.220–1.807, p<0.001); and FIM Eating: OR 1.280 (95% CI 1.102–1.488, p<0.001). On the other hand, the ROC analysis showed the ROC area for Eating to be 0.830 (95% CI 0.787–0.874), for Grooming to be 0.81 (95% CI 0.765–0.865), and for BRS Hand to be 0.805 (95% CI 0.760–0.851). The BRS Hand and FIM Eating and Grooming domains were identified as predictive factors using the following cutoff points: BRS Hand stage V and FIM scores of 5 for Eating and 4 for Grooming. The cutoff points for the BRS Hand and FIM Eating revealed that, at a minimum, such patients can use the nonaffected hand. The presence of cognitive dysfunction or dysphagia affects these domains. Therefore, these results suggested that Eating and Grooming are appropriate as evaluation items.


2012 ◽  
Vol 4 (2) ◽  
pp. 202 ◽  
Author(s):  
Hussain Alkharusi

The use of categorical variables in regression involves the application of coding methods. The purpose of this paper is to describe how categorical independent variables can be incorporated into regression by virtue of two coding methods: dummy and effect coding. The paper discusses the uses, interpretations, and underlying assumptions of each method. In general, overall results of the regression are unaffected by the methods used for coding the categorical independent variables. In any of the methods, the analysis tests whether group membership is related to the dependent variables. Both methods yield identical R2 and F. However, the interpretations of the intercept and regression coefficients depend on what coding method has been applied and whether the groups have equal sample sizes.


2021 ◽  
Vol 7 (3) ◽  
pp. 9-16
Author(s):  
Millenia Rusbandi ◽  
Imam Fahrur Rozi ◽  
Kadek Suarjuna Batubulan

At present, the number of crimes in Indonesia is quite large. The large number of crimes in Indonesia will have an impact on the number of legal documents that will be handled by law enforcement officials. In understanding legal documents, law enforcement officials such as lawyers, judges, and prosecutors must read the entire document which will take a long time. Therefore a summary is needed so that law enforcement officials can understand it more easily. So that one solution needed is to make a summary of the legal documents where the documents are in PDF form. In terms of summarizing the text, the method that can be used is the Latent Semantic Analysis algorithm. The algorithm is used to describe or analyze the hidden meaning of a language, code or other type of representation in order to obtain important information.From testing the 10 documents summarized by experts, the results of precision, recall, f-measure and accuracy are obtained sequentially on automatic text summarization using the Latent Semantic Analysis method for a compression rate of 75%, namely 53%, 27%, 35% and 71%. for a compression rate of 50%, namely 54%, 56%, 55% and 75%, and for a compression rate of 25%, namely 51%, 79%, 61% and 75%. Based on the results of the research and testing that has been done, it can be concluded that the Latent Semantic Analysis Method can be used to summarize legal documents.


2021 ◽  
pp. 30-43
Author(s):  
Bing Xia ◽  
Jianmin Pang ◽  
Jun Wang ◽  
Fudong Liu ◽  
Feng Yue

Sign in / Sign up

Export Citation Format

Share Document