Factors That Can Trigger Depression

According to data from the last National Health Survey (PNS), conducted in 2013 by the Brazilian Institute of Geography and Statistics (IBGE) in partnership with the Ministry of Health, 7.6% of people aged 18 and over received diagnosis of depression. Therefore, based on this research, the purpose of this study was to identify factors that may be relevant to a possible diagnosis of depression, using machine learning techniques. The binary logistic regression model was chosen as the machine learning technique, with progressive and regressive methods for selecting variables and a model built by the researcher, generating seven different models. The model’s performance evaluation was made by comparing some metrics such as Cox-Snell R2 and Nagelkerke R2, which presented remarkably close results. Based on these models, 37 explanatory variables were selected which were applied to a new logistic regression model. The results showed that some variables significantly increased the chance of a positive diagnosis of depression as well as some variables were indicative of a reduction in the chances of this diagnosis.

Download Full-text

Diagnosing Multicollinearity of Logistic Regression Model

Asian Journal of Probability and Statistics ◽

10.9734/ajpas/2019/v5i230132 ◽

2019 ◽

pp. 1-9 ◽

Cited By ~ 6

Author(s):

N. A. M. R. Senaviratna ◽

T. M. J. A. Cooray

Keyword(s):

Logistic Regression ◽

Regression Model ◽

Sample Size ◽

Logistic Regression Model ◽

Secondary Data ◽

Binary Logistic Regression ◽

Condition Index ◽

Binary Logistic Regression Model ◽

Correlated Variables ◽

Explanatory Variables

One of the key problems arises in binary logistic regression model is that explanatory variables being considered for the logistic regression model are highly correlated among themselves. Multicollinearity will cause unstable estimates and inaccurate variances that affects confidence intervals and hypothesis tests. Aim of this was to discuss some diagnostic measurements to detect multicollinearity namely tolerance, Variance Inflation Factor (VIF), condition index and variance proportions. The adapted diagnostics are illustrated with data based on a study of road accidents. Secondary data used from 2014 to 2016 in this study were acquired from the Traffic Police headquarters, Colombo in Sri Lanka. The response variable is accident severity that consists of two levels particularly grievous and non-grievous. Multicolinearity is identified by correlation matrix, tolerance and VIF values and confirmed by condition index and variance proportions. The range of solutions available for logistic regression such as increasing sample size, dropping one of the correlated variables and combining variables into an index. It is safely concluded that without increasing sample size, to omit one of the correlated variables can reduce multicollinearity considerably.

Download Full-text

Prediction and Classification of Low Birth Weight Data Using Machine Learning Techniques

Indonesian Journal of Science and Technology ◽

10.17509/ijost.v3i1.10799 ◽

2018 ◽

Vol 3 (1) ◽

pp. 18 ◽

Cited By ~ 4

Author(s):

Alfensi Faruk ◽

Endro Setyo Cahyono

Keyword(s):

Machine Learning ◽

Logistic Regression ◽

Birth Weight ◽

Low Birth Weight ◽

Binary Logistic Regression ◽

Machine Learning Techniques ◽

Binary Logistic Regression Model ◽

Data Set ◽

Learning Techniques

Machine learning (ML) is a subject that focuses on the data analysis using various statistical tools and learning processes in order to gain more knowledge from the data. The objective of this research was to apply one of the ML techniques on the low birth weight (LBW) data in Indonesia. This research conducts two ML tasks; including prediction and classification. The binary logistic regression model was firstly employed on the train and the test data. Then; the random approach was also applied to the data set. The results showed that the binary logistic regression had a good performance for prediction; but it was a poor approach for classification. On the other hand; random forest approach has a very good performance for both prediction and classification of the LBW data set

Download Full-text

Application of Machine Learning Techniques to Predict the Occurrence of Distraction-affected Crashes with Phone-Use Data

Transportation Research Record Journal of the Transportation Research Board ◽

10.1177/03611981211045371 ◽

2021 ◽

pp. 036119812110453

Author(s):

Chaolun Ma ◽

Yongxin Peng ◽

Lingtao Wu ◽

Xiaoyu Guo ◽

Xiubin Wang ◽

...

Keyword(s):

Machine Learning ◽

Logistic Regression ◽

Regression Model ◽

Logistic Regression Model ◽

Machine Learning Techniques ◽

Essential Information ◽

Driving Behaviors ◽

Learning Techniques ◽

Learning Technique ◽

Unbalanced Dataset

Distraction occurs when a driver’s attention is diverted from driving to a secondary task. The number of distraction-affected crashes has been increasing in recent years. Accurately predicting distraction-affected crashes is critical for roadway agencies to reduce distracted driving behaviors and distraction-affected crashes. Recently, more and more emerging phone-use data and machine learning techniques are available to safety researchers, and can potentially improve the prediction of distraction-affected crashes. Therefore, this study first examines if phone-use events provide essential information for distraction-affected crashes. The authors apply the machine learning technique (i.e., XGBoost) under two scenarios, with and without phone-use events, and compare their performances with two conventional statistical models: logistic regression model and mixed-effects logistic regression model. The comparison demonstrates the superiority of XGBoost over logistic regression with a high-dimensional unbalanced dataset. Further, this study implements SHAP (SHapley Additive exPlanation) to interpret the results and analyze the importance of individual features related to distraction-affected crashes and tests its ability to improve prediction accuracy. The trained XGBoost model achieves a sensitivity of 91.59%, a specificity of 85.92%, and 88.72% accuracy. The XGBoost and SHAP results suggest that: (1) phone-use information is an important factor associated with the occurrences of distraction-affected crashes; (2) distraction-affected crashes are more likely to occur on roadway segments with higher exposure (i.e., length and traffic volume), unevenness of traffic flow condition, or with medium truck volume.

Download Full-text

Analisis Pengobatan Pasien Menggunakan Model Regresi Logistik Biner

KALBISCIENTIA Jurnal Sains dan Teknologi ◽

10.53008/kalbiscientia.v6i2.42 ◽

2020 ◽

Vol 6 (2) ◽

pp. 92

Author(s):

Krishna Krishna Prafidya Romantica

Keyword(s):

Logistic Regression ◽

Regression Model ◽

Logistic Regression Model ◽

Explanatory Variable ◽

Binary Logistic Regression ◽

Residential Areas ◽

Binary Logistic Regression Model ◽

Dummy Variables ◽

Explanatory Variables ◽

Patient Class

Researcher used patient data spread across two residential areas, namely sector 1 and sector2. The research data consisted of four explanatory variables, namely: the age of the patient, the class ofpatients found in the hospital, the patient’s area of residence, and the findings of the disease suffered by the patient. Class, sector, and disease variables are variables categorized into categories 0 and 1. The researcher considers the dummy variables discussed in the explanatory variable variables. Category 0 indicates that the sample does not meet the criteria in the category. Choosing, category 1 shows that the sample meets the criteria in the category. Next, the researcher will estimate the explanatory parameter variables and dummy variables, then do the partial test to get the parameter significance and model it using the Binary Logistic Regression Model. With the Logistic Regression Model, researcher will calculate the consideration of the patient’s recovery. This probability is used as

Download Full-text

Survey on turnover intention of scientific and technological workers based on the binary logistic regression model—a case study of XPCC

Information Management and Management Engineering ◽

10.2495/imme140591 ◽

2014 ◽

Author(s):

Zhui Liu ◽

Honglu Gou ◽

Lingying Kong

Keyword(s):

Logistic Regression ◽

Regression Model ◽

Turnover Intention ◽

Logistic Regression Model ◽

Binary Logistic Regression ◽

Binary Logistic Regression Model

Download Full-text

The analytic construction ofD-optimal designs for the two-variable binary logistic regression model without interaction

Statistics ◽

10.1080/02331888.2014.937342 ◽

2014 ◽

Vol 49 (5) ◽

pp. 1169-1186 ◽

Cited By ~ 3

Author(s):

Gaëtan M. Kabera ◽

Linda M. Haines ◽

Principal Ndlovu

Keyword(s):

Logistic Regression ◽

Regression Model ◽

Logistic Regression Model ◽

Binary Logistic Regression ◽

Optimal Designs ◽

Binary Logistic Regression Model ◽

Analytic Construction

Download Full-text

Analysis and Prediction of P2P Online Lending Platform—Based on Binary Logistic Regression Model

Proceedings of 2017 the 7th International Workshop on Computer Science and Engineering ◽

10.18178/wcse.2017.06.224 ◽

2017 ◽

Keyword(s):

Logistic Regression ◽

Regression Model ◽

Logistic Regression Model ◽

Binary Logistic Regression ◽

Binary Logistic Regression Model ◽

Online Lending

Download Full-text

FAKTOR-FAKTOR YANG MEMPENGARUHI PEROLEHAN KREDIT OLEH PENGUSAHA MIKRO

Jurnal Ilmiah Ekonomi Bisnis ◽

10.35972/jieb.v4i3.237 ◽

2018 ◽

Vol 4 (3) ◽

Author(s):

Abdul Azis Safii ◽

Tri Suwarno

Keyword(s):

Logistic Regression ◽

Regression Model ◽

Large Scale ◽

Logistic Regression Model ◽

Banking Sector ◽

Binary Logistic Regression ◽

Binary Logistic Regression Model ◽

Access To Credit ◽

The Poor ◽

Micro Enterprises

Abstract: The number of micro-entrepreneurs and the dominant number of micro enterprises compared to medium and large-scale enterprises in Indonesia are not balanced by the provision of access to credit and venture capital for micro businesses. This resulted in a micro-sector sector identical to the poor being vulnerable to exploitation by moneylenders who exploit the difficulties of micro entrepreneurs accessing credit from the banking sector. This study examines the factors that determine the accessibility of credit by micro entrepreneur in Bojonegoro regency. A total sum of 270 micro entrepreneurs who have applied for banking loan were sampled from the study area. With an binary logistic regression model the research resulting that education, skill on entrepreneur, and monthly net profits generated by the microenterprise are significant in determining the accessibility of microcredit. Keywords: micro entrepreneur, microcredit, credit accessibility Abstrak: Perkembangan jumlah pengusaha mikro serta dominannya jumlah usaha mikro dibandingkan dengan usaha menengah dan usaha besar di Indonesia, tidak diimbingi dengan penyediaan akses kredit dan modal usaha bagi para pelaku usaha mikro. Hal tersebut mengakibatkan sektor usaha mikro yang identik dengan masyarakat miskin rentan dieksploitasi oleh rentenir yang memanfaatkan sulitnya para pengusaha mikro mengakses kredit dari sektor perbankan. Penelitian ini menggunakan data primer yang di ambil langsung dari pengusaha mikro dengan teknik kuesioner. Analisis data dengan metode binary logistic regression mendapatkan hasil variabel yang berpengaruh signifikan terhadap akses kredit para pengusaha mikro adalah variabel usia pengusaha, laba bersih usaha tiap bulan, dan jumlah karyawan yang di pekerjakan. Kata kunci : usaha mikro, microcredit, akses kredit

Download Full-text

ESTIMATION OF THE BINARY LOGISTIC REGRESSION MODEL PARAMETER USING BOOTSTRAP RE-SAMPLING

Latin American Applied Research - An international journal ◽

10.52292/j.laar.2018.228 ◽

2018 ◽

Vol 48 (3) ◽

pp. 199-204 ◽

Cited By ~ 1

Author(s):

R. LI ◽

J. ZHOU ◽

L. WANG

Keyword(s):

Logistic Regression ◽

Parameter Estimation ◽

Maximum Likelihood ◽

Regression Model ◽

Logistic Regression Model ◽

Binary Logistic Regression ◽

Parametric Bootstrap ◽

Binary Logistic Regression Model ◽

Bayesian Bootstrap ◽

Non Parametric

In this paper, the non-parametric bootstrap and non-parametric Bayesian bootstrap methods are applied for parameter estimation in the binary logistic regression model. A real data study and a simulation study are conducted to compare the Nonparametric bootstrap, Non-parametric Bayesian bootstrap and the maximum likelihood methods. Study results shows that three methods are all effective ways for parameter estimation in the binary logistic regression model. In small sample case, the non-parametric Bayesian bootstrap method performs relatively better than the non-parametric bootstrap and the maximum likelihood method for parameter estimation in the binary logistic regression model.

Download Full-text

Die Flexion der Indefinita jemand und niemand

Zeitschrift für Germanistische Linguistik ◽

10.1515/zgl-2021-2028 ◽

2021 ◽

Vol 49 (2) ◽

pp. 209-243

Author(s):

Linnéa Weitkamp

Keyword(s):

Logistic Regression ◽

Regression Model ◽

Influencing Factors ◽

Logistic Regression Model ◽

Binary Logistic Regression ◽

Binary Logistic Regression Model ◽

Corpus Study ◽

Frequent Use ◽

Current Variation ◽

Reference Corpus

Abstract This article investigates the inflection of the German indefinite pronouns jemand and niemand in the accusative and dative. The pronouns are used both with inflectional suffix (jemanden/jemandem, niemanden/niemandem) and without (jemand, niemand) and are thus an example of current variation in contemporary German. The grammars take an unusually liberal stance and describe both forms as correct, partially even with preference to the uninflected form. A corpus study which examines conceptually written data of the DeReKo (German reference corpus) and conceptually oral data of the DECOW16B (German web corpus), shows that over 90 % of occurrences are inflected. But almost 10 % of uninflected forms show that these formations are no arbitrary errors either. To find out what influences the presence or absence of the inflectional ending, a binary logistic regression model was calculated. The following factors proved to be significant influencing factors for inflection: the degree of formality (DeReKo vs. DECOW16B), the lexeme (jemand vs. niemand), the case (acc vs. dat), government by preposition vs. government by verb and the following nominalized adjective (jemand anderen). With regard to the different inflectional suffixes, the frequent use of -en in the dative stood out in particular. Although this form is classified as erroneous in all grammars, almost 30 % of the dative occurrences in informal DECOW16B data are formed in this way.

Download Full-text