The feasibility of creating a checklist for the assessment of the methodological quality both of randomised and non-randomised studies of health care interventions (Preprint)
BACKGROUND According to the World Health Organization (WHO) close to 800,000 people worldwide death by suicidal each year. Many more attempt to do it. In consequence, the WHO recognizes suicide as a global public health priority, which affects not only rich countries, but poor and middle income countries as well. OBJECTIVE The aim of this study is to evaluate several supervised classifiers for detecting messages with suicidal ideation in order to know if these systems can be used in automatic suicide prevention systems. METHODS We used machine learning techniques to make a systematic analysis of 28 supervised classifier algorithms with parameters by defect. The Life Corpus, used in this research, is a bilingual corpus (English and Spanish) oriented to suicide. The corpus was constructed by two annotation experts, retrieving texts from several social networks. The corpus quality was measured using mutual annotation agreement. RESULTS The different experiments determined that the classifier with the best performance was KStar, with the corpus version POS-SYNSETS-NUM; and the cycle with 2 classes Urgent and No Risk was the one that achieved the best results with the PRC-Area metrics of 0,81036 and F-measure of 0,7148. CONCLUSIONS The present research fulfilled the objective of discovering which characteristics are the most suitable for the automatic classification of messages with suicidal ideation, using the Life Corpus. The results of this evaluation demonstrate that the Life Corpus and machine learning techniques could be suitable for detecting suicide ideation messages.