scholarly journals INTERNATIONAL SYSTEM OF KNOWLEDGE EXCHANGE FOR YOUNG SCIENTISTS

2021 ◽  
Vol 5 (1) ◽  
pp. 69-74
Author(s):  
Olesia Barkovska ◽  
Vladyslav Kholiev ◽  
Daria Pyvovarova ◽  
Georgiy Ivaschenko ◽  
Dmytro Rosinskiy

The paper proposes a system which is electronic data storage (of qualification works of students from different countries) and provides the capability to identify and connect young scientists conducting research on a related problem area. The purpose of developing this system is to provide opportunities for knowledge exchange, research in a team on a common problem, as well as to identify scientific trends in different countries. In this paper, the preprocessing methods influence on the work of classifiers such as Logistic Regression, LSTM, BERT, LightGBM was researched. A study was conducted on the speed of classification and F1 assessment. Conclusions. Lemmatization showed to require a shorter operating time compared to steaming by almost twice and a better score by an average of 5 percent, so it was decided to use the Logistic Regression classifier with lemmatization at the stage of text preparation in the subsequent operation of the proposed ISKE.

2018 ◽  
Vol 1 (2) ◽  
pp. 14-24
Author(s):  
Dame Christine Sagala ◽  
Ali Sadikin ◽  
Beni Irawan

The data processing systems is a very necessary way to manipulate a data into useful information. The system makes data storage, adding, changing, scheduling to reporting well integrated, so that it can help parts to exchange information and make decisions quickly. The problems faced by GKPI Pal Merah Jambi are currently still using Microsoft Office Word and in disseminating information such as worship schedules, church activities and other worship routines through paper and wall-based worship services. To print worship and report reports requires substantial operational funds, in addition to data collection and storage there are still deficiencies including recording data on the book, difficulty in processing large amounts of data and stored in only one special place that is passive. Based on the above problems, the author is interested in conducting research with the title Designing Data Processing Systems for Web-Based Churches in the GKPI Pal Merah Church in Jambi. The purpose of this study is to design and produce a data processing system for the church. Using this system can facilitate data processing in the GKPI Pal Merah Jambi Church. This study uses a waterfall development method, a method that provides a systematic and sequential approach to system needs analysis, design, implementation and unit testing, system testing and care. Applications built using the web with MySQL DBMS database, PHP programming language and Laravel.


In today’s modern world, the world population is affected with some kind of heart diseases. With the vast knowledge and advancement in applications, the analysis and the identification of the heart disease still remain as a challenging issue. Due to the lack of awareness in the availability of patient symptoms, the prediction of heart disease is a questionable task. The World Health Organization has released that 33% of population were died due to the attack of heart diseases. With this background, we have used Heart Disease Prediction dataset extracted from UCI Machine Learning Repository for analyzing and the prediction of heart disease by integrating the ensembling methods. The prediction of heart disease classes are achieved in four ways. Firstly, The important features are extracted for the various ensembling methods like Extra Trees Regressor, Ada boost regressor, Gradient booster regress, Random forest regressor and Ada boost classifier. Secondly, the highly importance features of each of the ensembling methods is filtered from the dataset and it is fitted to logistic regression classifier to analyze the performance. Thirdly, the same extracted important features of each of the ensembling methods are subjected to feature scaling and then fitted with logistic regression to analyze the performance. Fourth, the Performance analysis is done with the performance metric such as Mean Squared error (MSE), Mean Absolute error (MAE), R2 Score, Explained Variance Score (EVS) and Mean Squared Log Error (MSLE). The implementation is done using python language under Spyder platform with Anaconda Navigator. Experimental results shows that before applying feature scaling, the feature importance extracted from the Ada boost classifier is found to be effective with the MSE of 0.04, MAE of 0.07, R2 Score of 92%, EVS of 0.86 and MSLE of 0.16 as compared to other ensembling methods. Experimental results shows that after applying feature scaling, the feature importance extracted from the Ada boost classifier is found to be effective with the MSE of 0.09, MAE of 0.13, R2 Score of 91%, EVS of 0.93 and MSLE of 0.18 as compared to other ensembling methods.


2019 ◽  
Vol 11 (2) ◽  
pp. 136 ◽  
Author(s):  
Yuliang Wang ◽  
Huiyi Su ◽  
Mingshi Li

Hyperspectral images (HSIs) provide unique capabilities for urban impervious surfaces (UIS) extraction. This paper proposes a multi-feature extraction model (MFEM) for UIS detection from HSIs. The model is based on a nonlinear dimensionality reduction technique, t-distributed stochastic neighbor embedding (t-SNE), and the deep learning method convolutional deep belief networks (CDBNs). We improved the two methods to create a novel MFEM consisting of improved t-SNE, deep compression CDBNs (d-CDBNs), and a logistic regression classifier. The improved t-SNE method provides dimensionality reduction and spectral feature extraction from the original HSIs and the d-CDBNs algorithm extracts spatial feature and edges using the reduced dimensional datasets. Finally, the extracted features are combined into multi-feature for the impervious surface detection using the logistic regression classifier. After comparing with the commonly used methods, the current experimental results demonstrate that the proposed MFEM model provides better performance for UIS extraction and detection from HSIs.


Author(s):  
Wan-Jui Lee

Air leakage in braking pipes is a commonly encountered mechanical defect on trains. A severe air leakage will lead to braking issues and therefore decrease the reliability and cause train delays or stranding. However, air leakage is difficult to be detected via visual inspection and therefore most air leakage defects are run to fail. In this research we present a framework that not only can detect air leakages but also predicts the severity of air leakages so that action plans can be determined based on the severity level. The proposed contextual anomaly detection method detects air leakages based on the on/off logs of a compressor. Air leakage causes failure in the context when the compressor idle time is short than the compressor run time, that is, the speed of air consumption is faster than air generation. In our method the logistic regression classifier is adopted to model two different classes of compressor behavior for each train separately. The logistic regression classifier defines the boundary separating the two classes under normal situations and models the distribution of the compressor idle time and run time separately using logistic functions. The air leakage anomaly is further detected in the context that when a compressor idle time is erroneously classified as a compressor run time. To distinguish anomalies from outliers and detect anomalies based on the severity degree, a density-based clustering method with a dynamic density threshold is developed for anomaly detection. The results have demonstrated that most air leakages can be detected one to four weeks before the braking failure and therefore can be prevented in time. Most importantly, the contextual anomaly detection method can pre-filter anomaly candidates and therefore avoid generating false alarms. To facilitate the decisionmaking process, the logistic function built on the compressor run time is further used together with the duration of an air leakage to model the severity of the air leakage. By building the prediction model on the severity, the remaining useful life of the air braking pipe until it reaches a certain level of severity can be estimated.


2021 ◽  
Vol 6 (2) ◽  
pp. 120-129
Author(s):  
Nadhif Ikbar Wibowo ◽  
Tri Andika Maulana ◽  
Hamzah Muhammad ◽  
Nur Aini Rakhmawati

Public responses, posted on Twitter reacting to the Tokopedia data leak incident, were used as a data set to compare the performance of three different classifiers, trained using supervised learning modeling, to classify sentiment on the text. All tweets were classified into either positive, negative, or neutral classes. This study compares the performance of Random Forest, Support-Vector Machine, and Logistic Regression classifier. Data was scraped automatically and used to evaluate several models; the SVM-based model has the highest f1-score 0.503583. SVM is the best performing classifier.


2018 ◽  
Vol 09 (01) ◽  
pp. 129-140 ◽  
Author(s):  
Jena Daniels ◽  
Nick Haber ◽  
Catalin Voss ◽  
Jessey Schwartz ◽  
Serena Tamura ◽  
...  

Background Recent advances in computer vision and wearable technology have created an opportunity to introduce mobile therapy systems for autism spectrum disorders (ASD) that can respond to the increasing demand for therapeutic interventions; however, feasibility questions must be answered first. Objective We studied the feasibility of a prototype therapeutic tool for children with ASD using Google Glass, examining whether children with ASD would wear such a device, if providing the emotion classification will improve emotion recognition, and how emotion recognition differs between ASD participants and neurotypical controls (NC). Methods We ran a controlled laboratory experiment with 43 children: 23 with ASD and 20 NC. Children identified static facial images on a computer screen with one of 7 emotions in 3 successive batches: the first with no information about emotion provided to the child, the second with the correct classification from the Glass labeling the emotion, and the third again without emotion information. We then trained a logistic regression classifier on the emotion confusion matrices generated by the two information-free batches to predict ASD versus NC. Results All 43 children were comfortable wearing the Glass. ASD and NC participants who completed the computer task with Glass providing audible emotion labeling (n = 33) showed increased accuracies in emotion labeling, and the logistic regression classifier achieved an accuracy of 72.7%. Further analysis suggests that the ability to recognize surprise, fear, and neutrality may distinguish ASD cases from NC. Conclusion This feasibility study supports the utility of a wearable device for social affective learning in ASD children and demonstrates subtle differences in how ASD and NC children perform on an emotion recognition task.


2019 ◽  
Vol 16 (2) ◽  
pp. 576-579
Author(s):  
T. Narmadha ◽  
J. Gowrishankar ◽  
M. Ramkumar ◽  
K. Vengatesan

Cloud Storage Providers (CSPs) over topographically scattered data stores furnishing a few storage classes with various costs. An imperative issue looked by application providers is the means by which to misuse value contrasts crosswise over data stores to limit the financial cost of utilizations that incorporate problem area objects that are acquired much of the time and spot objects that are regularly becoming to far less. This financial cost comprises of reproduction, creation, storage, Put, Get, and potential migration costs. To advance such costs, we initially propose the ideal arrangement that uses dynamic and direct programming methods with the supposition that the workload on objects is known ahead of time. We likewise star represents a lightweight heuristic arrangement, motivated from a rough algorithm for the Set Covering Problem, which does not make any presumption on the object workload. This arrangement together decides object imitations area, object reproductions migration times, and redirection of Get (read) solicitations to object copies with the goal that the fiscal cost of data storage administration is improved while the client saw idleness is fulfilled.


2021 ◽  
Author(s):  
Tingyan Deng

Autistic Spectrum Disorder (ASD) is a developmental disability, which can affect communication and behavior, causing significant social, communication, and behavior challenge. From a rare childhood disorder, ASD has evolved into a disorder that is found, according to the National Institute of Health, in 1% to 2% of the population in high income countries. A potential early and accurate diagnosis can not only help doctors to find the disease early, leading to a more on time treatment to the patient, but also can save significant healthcare costs for the patients. With the rapid growth of ASD cases, many open-source ASD related datasets were created for scientists and doctors to investigate this disease. Autistic Spectrum Disorder Screening Data for Adult is a well-known dataset, which contains 20 features to be utilized for further analysis on the potential cause and prediction of ASD. In this paper, we developed an Autism classification algorithm based on logistic regression model. Our model starts with featuring engineering to extract deep information from the dataset and then applied a modified logistic regression classifier to the data. The model can predict the ASD in an average F1 score of 0.97, which displays the superiority and feasibility of the proposed model. Besides, the data visualization technique was used to displays several feature distributions images for people to better understand the data and related feature engineering.


Sign in / Sign up

Export Citation Format

Share Document