scholarly journals Combining Data Mining Techniques to Analyse Factors Associated with Allocation of Socioeconomic Resources at IFMG

2021 ◽  
Author(s):  
Eduardo Melo ◽  
Elisa Tuler ◽  
Leonardo Rocha

The granting of socioeconomic assistance to students from Federal Education Institutions is one of the ways found to provide finantial support during their studies, focusing primarily on those who are more socially vulnerable. Institutions carry out selection processes to identify students with a profile of demand and appropriately distribute the grants according to the budget available for this purpose. This article applied Data Mining techniques to a set of information from students who applied to receive scholarships at IFMG - Campus Bambuí, seeking to identify the attributes associated with the distribution of benefits and analyzing the adequacy of the current indicator used by the institution to classify the level of social vulnerability of students. The proposed methodology involved combining different machine learning algorithms, such as data classification and feature selection techniques. In addition to identifying the degree of importance of each attribute in the constructed model, the differential of this article is to present well-founded suggestions for new attributes that could be able to improve the index used by the institution and, consequently, optimize the workload of those involved with the analysis of selective processes. The composition of the institution's index with five new attributes resulted in a gain of around 10% in rating performance.

2019 ◽  
Vol 15 (2) ◽  
pp. 275-280
Author(s):  
Agus Setiyono ◽  
Hilman F Pardede

It is now common for a cellphone to receive spam messages. Great number of received messages making it difficult for human to classify those messages to Spam or no Spam.  One way to overcome this problem is to use Data Mining for automatic classifications. In this paper, we investigate various data mining techniques, named Support Vector Machine, Multinomial Naïve Bayes and Decision Tree for automatic spam detection. Our experimental results show that Support Vector Machine algorithm is the best algorithm over three evaluated algorithms. Support Vector Machine achieves 98.33%, while Multinomial Naïve Bayes achieves 98.13% and Decision Tree is at 97.10 % accuracy.


2022 ◽  
pp. 154-178
Author(s):  
Siddhartha Kumar Arjaria ◽  
Vikas Raj ◽  
Sunil Kumar ◽  
Priyanshu Shrivastava ◽  
Monu Kumar ◽  
...  

Skin disease rates have been increasing over the past few decades. It has led to both fatal and non-fatal disabilities all around the world, especially in those areas where medical resources are not good enough. Early diagnosis of skin diseases increases the chances of cure significantly. Therefore, this work is comparing six machine learning algorithms, namely KNN, random forest, neural network, naïve bayes, logistic regression, and SVM, for the prediction of the skin diseases. The information gain, gain ratio, gini decrease, chi-square, and relieff are used to rank the features. This work comprises the introduction, literature review, and proposed methodology parts. In this research paper, a new method of analyzing skin disease has been proposed in which six different data mining techniques are used to develop an ensemble method that integrates all the six data mining techniques as a single one. The ensemble method used on the dermatology dataset gives improved result with 94% accuracy in comparison to other classifier algorithms and hence is more effective in this area.


Author(s):  
Rafael Jose de Alencar Almeida ◽  
Vinicius Humberto Serapilha Durelli ◽  
Igor Campos Moraes ◽  
Matheus Carvalho Viana ◽  
Elverton Carvalho Fazzion ◽  
...  

2015 ◽  
Vol 9 (1) ◽  
pp. 30 ◽  
Author(s):  
María Messina ◽  
Esther Hochsztain

<p>El Centro de Emprendedurismo CCEEmprende de- sarrolla, desde 2007, un programa de apoyo a emprende- dores. Para mejorar su gestión, resulta de gran importancia analizar, en forma preliminar, los emprendimientos en una de dos categorías: éxito o fracaso. En este artículo se identifican los principales factores asociados al éxito de un emprendimiento y cómo se vincu- lan para anticipar el futuro del emprendimiento. Se presenta un caso de estudio con base en los datos de una encuesta realizada a emprendedores participantes del programa, aplicando técnicas de clasificación. Las dos técnicas utilizadas de data mining son árbol de decisión y regresión logística, en ambas se obtuvieron resultados coincidentes. Los hallazgos muestran que los dos elementos más relevantes para anticipar el éxito de un emprendimiento son contar con financiamiento y que, anteriormente, la situa- ción laboral del emprendedor sea trabajador independiente. Estos primeros resultados obtenidos en el estudio de caso revelan información útil acerca de las mejores formas de apoyo al emprendedor, cómo generar incentivos al em- prendedor y la definición de herramientas o actividades que incidan favorablemente en el éxito de los emprendimientos. Si bien desde la teoría o para otras realidades existe infor- mación sobre los factores que colaboran en la determina- ción del éxito, para la realidad del Uruguay no se identifican estudios similares.</p><p> </p><p><strong>Abstract</strong> </p><p>Since 2007, the CCEE Entrepreneurship Centre has developed a supporting program for entrepreneurs. A preliminary analysis to determine if the venture was successful or a failure is made to improve the program’s management . In this article, the authors identify the main factors associated with entrepreneurship’s success, and how they can anticipate entrepreneurship’s performance. The case study is based on a survey data applied to the Entrepreneurship Program participants. The two data mining techniques are decision trees and logistic regression. The results were consistent across both tech- niques. The findings show that the two most important elements to predict entrepreneurship’s success are fun- ding and previous experience as self-employed. The results provided very useful insight about the best ways to support entrepreneurship, how to encoura- ge entrepreneurs, and define tools or activities to impact positively ventures success in Uruguay, since similar stu- dies have not been developed.</p>


Author(s):  
Minal Shahakar

It might have happened so many times that you or someone yours need doctors help immediately, but they are not available due to some reason. The Heart Disease Prediction application is an end user support to the online. Here, we propose a web application that allows users to get instant guidance on their heart disease through an intelligent system online. The application is fed with various details and the heart disease associated with those details. The applications allows user to share their heart related issues. It then processes user specific details to check for various illnesses that could be associated with it. Here we use some intelligent data mining techniques to the most accurate that could be associated with patient‟s details. Based on result, system automatically shows the result specific doctors for further treatment and the system allows user to view doctor‟s details.


2021 ◽  
Vol 2021 ◽  
pp. 1-14
Author(s):  
Ishleen Kaur ◽  
M. N. Doja ◽  
Tanvir Ahmad ◽  
Musheer Ahmad ◽  
Amir Hussain ◽  
...  

Ovarian cancer is the third most common gynecologic cancers worldwide. Advanced ovarian cancer patients bear a significant mortality rate. Survival estimation is essential for clinicians and patients to understand better and tolerate future outcomes. The present study intends to investigate different survival predictors available for cancer prognosis using data mining techniques. Dataset of 140 advanced ovarian cancer patients containing data from different data profiles (clinical, treatment, and overall life quality) has been collected and used to foresee cancer patients’ survival. Attributes from each data profile have been processed accordingly. Clinical data has been prepared corresponding to missing values and outliers. Treatment data including varying time periods were created using sequence mining techniques to identify the treatments given to the patients. And lastly, different comorbidities were combined into a single factor by computing Charlson Comorbidity Index for each patient. After appropriate preprocessing, the integrated dataset is classified using appropriate machine learning algorithms. The proposed integrated model approach gave the highest accuracy of 76.4% using ensemble technique with sequential pattern mining including time intervals of 2 months between treatments. Thus, the treatment sequences and, most importantly, life quality attributes significantly contribute to the survival prediction of cancer patients.


2013 ◽  
Vol 22 (05) ◽  
pp. 1360011 ◽  
Author(s):  
RANDALL WALD ◽  
TAGHI M. KHOSHGOFTAAR ◽  
JOHN C. SLOAN

One of the most important types of signal found in the area of machine condition monitoring/prognostic health monitoring (MCM/PHM) is the vibration signal, a type of waveform. Many time-frequency domain techniques have been proposed to interpret such signals, including wavelet packet decomposition (WPD). Previous work has shown how to extend the WPD algorithm to operate on streaming signals, but the number of output variables becomes exponential in the number of levels of decomposition, hindering data mining in limited-memory environments. Feature selection techniques, well understood in other areas of data mining, can be used to greatly reduce the number of output variables and speed up the machine learning algorithms. This paper presents a case study comparing two versions of WPD both with and without feature selection, demonstrating that removing most of the features produced by the WPD does not impair its performance within the context of MCM/PHM.


Author(s):  
Christian Gomes ◽  
Alan Cardoso ◽  
Thiago Silveira ◽  
Diego Dias ◽  
Elisa Tuler ◽  
...  

Vaccines ◽  
2021 ◽  
Vol 9 (6) ◽  
pp. 653
Author(s):  
Dongwon Yoon ◽  
Juhwan Kim ◽  
Juyoung Shin

In the context of recent measles outbreaks, substantial factors associated with measles-mumps-rubella (MMR) unvaccination need to be clarified. This study aimed to identify differential demographic and clinical characteristics between MMR vaccinated and unvaccinated groups. We used a large-linked database to identify children born between 2008 and 2016 by combining data from the Korea Immunization Registry Information System and National Health Information database. The MMR vaccination status was ascertained up to the age of 2 to define MMR vaccinated and unvaccinated groups. We conducted a multivariate logistic regression to estimate odds ratios (ORs) with 95% confidence intervals (CIs) to identify factors associated with MMR unvaccination. Of 3,973,253 children, 75,674 (1.9%) did not receive the MMR vaccine. Compared with the MMR vaccinated group, the underutilization of healthcare resources was more notable in the MMR unvaccinated group (number of outpatient visits (5.73 ± 12.1 vs. 25.8 ± 17.06); days hospitalized (1.69 ± 14.5 vs. 2.32 ± 6.90)). Children were less likely to receive the MMR vaccine if they were born with congenital anomaly (OR 2.12; 95% CI 1.90–2.36), were never admitted to an intensive care unit (1.88; 1.78–1.98), or never visited an emergency room (3.57; 3.53–3.72). There were substantial factors associated with MMR unvaccination, underscoring a need to optimize targeted interventions tailored to the subset of children in South Korea.


Sign in / Sign up

Export Citation Format

Share Document