Opening the Knowledge Tombs - Web Based Text Mining as Approach for Re-evaluation of Machine Learning Rules

The automatic recognition of gene names and their associated database identifiers from biomedical text has been widely studied in recent years, as these tasks play an important role in many downstream text-mining applications. Despite significant previous research, only a small number of tools are publicly available and these tools are typically restricted to detecting only mention level gene names or only document level gene identifiers. In this work, we report GNormPlus: an end-to-end and open source system that handles both gene mention and identifier detection. We created a new corpus of 694 PubMed articles to support our development of GNormPlus, containing manual annotations for not only gene names and their identifiers, but also closely related concepts useful for gene name disambiguation, such as gene families and protein domains. GNormPlus integrates several advanced text-mining techniques, including SimConcept for resolving composite gene names. As a result, GNormPlus compares favorably to other state-of-the-art methods when evaluated on two widely used public benchmarking datasets, achieving 86.7% F1-score on the BioCreative II Gene Normalization task dataset and 50.1% F1-score on the BioCreative III Gene Normalization task dataset. The GNormPlus source code and its annotated corpus are freely available, and the results of applying GNormPlus to the entire PubMed are freely accessible through our web-based tool PubTator.

Download Full-text

Development of a Web Based Corruption Case Mapping Using Machine Learning with Artificial Neural Network

2018 International Conference on Information Management and Technology (ICIMTech) ◽

10.1109/icimtech.2018.8528150 ◽

2018 ◽

Author(s):

Noerlina ◽

Retno Dewanti ◽

Tirta Nugraha Mursitama ◽

Sheila Putri Fajrianti ◽

Desi Maya Kristin ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Artificial Neural Network ◽

Web Based ◽

Corruption Case ◽

Artificial Neural

Download Full-text

The Voice of Chinese Health Consumers: A Text Mining Approach to Web-Based Physician Reviews

Journal of Medical Internet Research ◽

10.2196/jmir.4430 ◽

2016 ◽

Vol 18 (5) ◽

pp. e108 ◽

Cited By ~ 41

Author(s):

Haijing Hao ◽

Kunpeng Zhang

Keyword(s):

Text Mining ◽

Web Based ◽

Health Consumers ◽

Physician Reviews ◽

The Voice

Download Full-text

Feature Selection Optimization for Highlighting Opinions Using Supervised and Unsupervised Learning on Arabic Language

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2021/251022021 ◽

2021 ◽

Vol 10 (2) ◽

pp. 636-642

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Text Mining ◽

Unsupervised Learning ◽

Arabic Language ◽

Machine Learning Techniques ◽

Business Decision ◽

Supervised And Unsupervised Learning ◽

Proposed Model ◽

The Impact

Text mining utilizes machine learning (ML) and natural language processing (NLP) for text implicit knowledge recognition, such knowledge serves many domains as translation, media searching, and business decision making. Opinion mining (OM) is one of the promised text mining fields, which are used for polarity discovering via text and has terminus benefits for business. ML techniques are divided into two approaches: supervised and unsupervised learning, since we herein testified an OM feature selection(FS)using four ML techniques. In this paper, we had implemented number of experiments via four machine learning techniques on the same three Arabic language corpora. This paper aims at increasing the accuracy of opinion highlighting on Arabic language, by using enhanced feature selection approaches. FS proposed model is adopted for enhancing opinion highlighting purpose. The experimental results show the outperformance of the proposed approaches in variant levels of supervisory,i.e. different techniques via distinct data domains. Multiple levels of comparison are carried out and discussed for further understanding of the impact of proposed model on several ML techniques.

Download Full-text

Machine learning can differentiate venom toxins from other proteins having non-toxic physiological functions

PeerJ Computer Science ◽

10.7717/peerj-cs.90 ◽

2016 ◽

Vol 2 ◽

pp. e90 ◽

Cited By ~ 24

Author(s):

Ranko Gacesa ◽

David J. Barlow ◽

Paul F. Long

Keyword(s):

Machine Learning ◽

Sequence Data ◽

Biological Data ◽

Biological Databases ◽

Web Based ◽

Physiological Functions ◽

Link Type ◽

Venom Toxins ◽

Venomous Animals ◽

Toxin Protein

Ascribing function to sequence in the absence of biological data is an ongoing challenge in bioinformatics. Differentiating the toxins of venomous animals from homologues having other physiological functions is particularly problematic as there are no universally accepted methods by which to attribute toxin function using sequence data alone. Bioinformatics tools that do exist are difficult to implement for researchers with little bioinformatics training. Here we announce a machine learning tool called ‘ToxClassifier’ that enables simple and consistent discrimination of toxins from non-toxin sequences with >99% accuracy and compare it to commonly used toxin annotation methods. ‘ToxClassifer’ also reports the best-hit annotation allowing placement of a toxin into the most appropriate toxin protein family, or relates it to a non-toxic protein having the closest homology, giving enhanced curation of existing biological databases and new venomics projects. ‘ToxClassifier’ is available for free, either to download (https://github.com/rgacesa/ToxClassifier) or to use on a web-based server (http://bioserv7.bioinfo.pbf.hr/ToxClassifier/).

Download Full-text

Modern Clinical Text Mining: A Guide and Review

Annual Review of Biomedical Data Science ◽

10.1146/annurev-biodatasci-030421-030931 ◽

2021 ◽

Vol 4 (1) ◽

Author(s):

Bethany Percha

Keyword(s):

Machine Learning ◽

Text Mining ◽

Data Science ◽

Annual Review ◽

Publication Date ◽

Biomedical Data ◽

Clinical Text ◽

Quality Improvement Research ◽

Comprehensive Survey ◽

Technical Advances

Electronic health records (EHRs) are becoming a vital source of data for healthcare quality improvement, research, and operations. However, much of the most valuable information contained in EHRs remains buried in unstructured text. The field of clinical text mining has advanced rapidly in recent years, transitioning from rule-based approaches to machine learning and, more recently, deep learning. With new methods come new challenges, however, especially for those new to the field. This review provides an overview of clinical text mining for those who are encountering it for the first time (e.g., physician researchers, operational analytics teams, machine learning scientists from other domains). While not a comprehensive survey, this review describes the state of the art, with a particular focus on new tasks and methods developed over the past few years. It also identifies key barriers between these remarkable technical advances and the practical realities of implementation in health systems and in industry. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 4 is July 2021. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.

Download Full-text

Opening the Knowledge Tombs - Web Based Text Mining as Approach for Re-evaluation of Machine Learning Rules

Text Mining Based Approach to Customer Sentiment Analysis Using Machine Learning

An Advanced Web-Based Hindi Language Interface to Database Using Machine Learning Approach

Applied Difference Techniques of Machine Learning Algorithm and Web-Based Management System for Sickle Cell Disease

Text Mining Approach for Product Quality Enhancement: (Improving Product Quality through Machine Learning)

GNormPlus: An Integrative Approach for Tagging Genes, Gene Families, and Protein Domains

Development of a Web Based Corruption Case Mapping Using Machine Learning with Artificial Neural Network

The Voice of Chinese Health Consumers: A Text Mining Approach to Web-Based Physician Reviews

Feature Selection Optimization for Highlighting Opinions Using Supervised and Unsupervised Learning on Arabic Language

Machine learning can differentiate venom toxins from other proteins having non-toxic physiological functions

Modern Clinical Text Mining: A Guide and Review

Export Citation Format