scholarly journals Statistical analysis of the impact of distortion (correction) on an automated classification of celiac disease

Author(s):  
M. Liedlgruber ◽  
A. Uhl ◽  
A. Vecsei
2021 ◽  
Vol 21 (S2) ◽  
Author(s):  
Kun Zeng ◽  
Yibin Xu ◽  
Ge Lin ◽  
Likeng Liang ◽  
Tianyong Hao

Abstract Background Eligibility criteria are the primary strategy for screening the target participants of a clinical trial. Automated classification of clinical trial eligibility criteria text by using machine learning methods improves recruitment efficiency to reduce the cost of clinical research. However, existing methods suffer from poor classification performance due to the complexity and imbalance of eligibility criteria text data. Methods An ensemble learning-based model with metric learning is proposed for eligibility criteria classification. The model integrates a set of pre-trained models including Bidirectional Encoder Representations from Transformers (BERT), A Robustly Optimized BERT Pretraining Approach (RoBERTa), XLNet, Pre-training Text Encoders as Discriminators Rather Than Generators (ELECTRA), and Enhanced Representation through Knowledge Integration (ERNIE). Focal Loss is used as a loss function to address the data imbalance problem. Metric learning is employed to train the embedding of each base model for feature distinguish. Soft Voting is applied to achieve final classification of the ensemble model. The dataset is from the standard evaluation task 3 of 5th China Health Information Processing Conference containing 38,341 eligibility criteria text in 44 categories. Results Our ensemble method had an accuracy of 0.8497, a precision of 0.8229, and a recall of 0.8216 on the dataset. The macro F1-score was 0.8169, outperforming state-of-the-art baseline methods by 0.84% improvement on average. In addition, the performance improvement had a p-value of 2.152e-07 with a standard t-test, indicating that our model achieved a significant improvement. Conclusions A model for classifying eligibility criteria text of clinical trials based on multi-model ensemble learning and metric learning was proposed. The experiments demonstrated that the classification performance was improved by our ensemble model significantly. In addition, metric learning was able to improve word embedding representation and the focal loss reduced the impact of data imbalance to model performance.


2018 ◽  
Vol 102 ◽  
pp. 221-226 ◽  
Author(s):  
M. Gadermayr ◽  
G. Wimmer ◽  
H. Kogler ◽  
A. Vécsei ◽  
D. Merhof ◽  
...  

2009 ◽  
Vol 95 (2) ◽  
pp. S68-S78 ◽  
Author(s):  
Andreas Vécsei ◽  
Thomas Fuhrmann ◽  
Michael Liedlgruber ◽  
Leonhard Brunauer ◽  
Hannes Payer ◽  
...  

Author(s):  
Igor Ponomarenko ◽  
Kateryna Volovnenko

The subject of the research is a set of approaches to the statistical analysis ofthe activities of small business entities in Ukraine, including micro-enterprises. The purpose of writing this article is to study of the features of functioningof small business entities in Ukraine. Methodology. The research methodology isto use a system-structural and comparative analysis (to study the change in thenumber of small enterprises by major components); monographic (when studyingmethods of statistical analysis of small businesses); economic analysis (when assessing the impact of small business entities on socio-economic phenomena andprocesses in Ukraine). The scientific novelty consists to determine the features ofthe functioning of small businesses in Ukraine in modern conditions. The influenceof the activities of the main socio-economic and political indicators on the activities of small enterprises in recent periods of time has been identified. It has beenestablished that there is flexibility in the development of strategies by small businesses in conditions of significant competition, which makes it possible to quicklyrespond to changing situations in specific markets. Conclusions. The use of acomprehensive statistical analysis of small businesses functioning in Ukraine willallow government agencies to develop a set of measures to optimize the activitiesof these enterprises, which ultimately will positively affect the strengthening oftheir competitiveness and will contribute to the growth of the national economicsystem.


Sign in / Sign up

Export Citation Format

Share Document