A Knowledge Discovery from Full-Text Document Collections Using Clustering and Interpretable Genetic-Fuzzy Systems

AbstractPubTator Central (https://www.ncbi.nlm.nih.gov/research/pubtator/) is a web service for viewing and retrieving bioconcept annotations in full text biomedical articles. PubTator Central (PTC) provides automated annotations from state-of-the-art text mining systems for genes/proteins, genetic variants, diseases, chemicals, species and cell lines, all available for immediate download. PTC annotates PubMed (29 million abstracts) and the PMC Text Mining subset (3 million full text articles). The new PTC web interface allows users to build full text document collections and visualize concept annotations in each document. Annotations are downloadable in multiple formats (XML, JSON and tab delimited) via the online interface, a RESTful web service and bulk FTP. Improved concept identification systems and a new disambiguation module based on deep learning increase annotation accuracy, and the new server-side architecture is significantly faster. PTC is synchronized with PubMed and PubMed Central, with new articles added daily. The original PubTator service has served annotated abstracts for ∼300 million requests, enabling third-party research in use cases such as biocuration support, gene prioritization, genetic disease analysis, and literature-based knowledge discovery. We demonstrate the full text results in PTC significantly increase biomedical concept coverage and anticipate this expansion will both enhance existing downstream applications and enable new use cases.

Download Full-text

A knowledge discovery method based on genetic-fuzzy systems for obtaining consumer behaviour patterns. An empirical application to a Web-based trust model

International Journal of Management and Decision Making ◽

10.1504/ijmdm.2009.026685 ◽

2009 ◽

Vol 10 (5/6) ◽

pp. 402 ◽

Cited By ~ 5

Author(s):

Jorge Casillas ◽

Francisco J. Martinez Lopez

Keyword(s):

Knowledge Discovery ◽

Fuzzy Systems ◽

Consumer Behaviour ◽

Trust Model ◽

Web Based ◽

Genetic Fuzzy Systems ◽

Empirical Application ◽

Discovery Method

Download Full-text

Definition and selection of fuzzy sets in genetic‐fuzzy systems using the concept of fuzzimetric arcs

Kybernetes ◽

10.1108/03684920810851069 ◽

2008 ◽

Vol 37 (1) ◽

pp. 166-181 ◽

Cited By ~ 11

Author(s):

Issam Kouatli

Keyword(s):

Fuzzy Sets ◽

Fuzzy Systems ◽

Genetic Fuzzy Systems ◽

Selection Of

Download Full-text

An Algorithm Based on Genetic Fuzzy Systems for the Selection of Routes in Multi-Sink Wireless Sensor Networks

Lecture Notes in Computer Science - Hybrid Artificial Intelligent Systems ◽

10.1007/978-3-642-21219-2_44 ◽

2011 ◽

pp. 347-355

Author(s):

Lliam B. Leal ◽

Marcus Vincius de S. Lemos ◽

Raimir Holanda Filho ◽

Ricardo A. L. Rabelo ◽

Fabio A. S. Borges

Keyword(s):

Wireless Sensor Networks ◽

Sensor Networks ◽

Fuzzy Systems ◽

Wireless Sensor ◽

Genetic Fuzzy Systems ◽

Selection Of

Download Full-text

Fuzzy Systems and Knowledge Discovery

10.1007/11539506 ◽

2005 ◽

Cited By ~ 2

Keyword(s):

Knowledge Discovery ◽

Fuzzy Systems

Download Full-text

Hierarchical Genetic Fuzzy Systems: Accuracy, Interpretability and Design Autonomy

Interpretability Issues in Fuzzy Modeling - Studies in Fuzziness and Soft Computing ◽

10.1007/978-3-540-37057-4_16 ◽

2003 ◽

pp. 379-405 ◽

Cited By ~ 6

Author(s):

Myriam Regattieri Delgado ◽

Fernando Von Zuben ◽

Fernando Gomide

Keyword(s):

Fuzzy Systems ◽

Genetic Fuzzy Systems

Download Full-text

Experimental Evaluation of Resampling Combined with Clustering and Random Oracle Using Genetic Fuzzy Systems

Advances in Intelligent Systems and Computing - Multimedia and Internet Systems: Theory and Practice ◽

10.1007/978-3-642-32335-5_13 ◽

2013 ◽

pp. 131-142

Author(s):

Tadeusz Lasota ◽

Zbigniew Telec ◽

Bogdan Trawiński ◽

Grzegorz Trawiński

Keyword(s):

Experimental Evaluation ◽

Fuzzy Systems ◽

Random Oracle ◽

Genetic Fuzzy Systems

Download Full-text

InfoGuide: A Full-Text Document Retrieval System

Database and Expert Systems Applications ◽

10.1007/978-3-7091-7553-8_3 ◽

1990 ◽

pp. 12-21 ◽

Cited By ~ 5

Author(s):

IJsbrand Jan Aalbersberg ◽

Frans Sijstermans

Keyword(s):

Full Text ◽

Retrieval System ◽

Document Retrieval ◽

Text Document

Download Full-text

Conceptual Clustering of Textual Documents and Some Insights for Knowledge Discovery

Emerging Technologies of Text Mining ◽

10.4018/978-1-59904-373-9.ch011 ◽

2008 ◽

pp. 223-243 ◽

Cited By ~ 2

Author(s):

Leandro Krug Wives ◽

José Palazzo Moreira de Oliveira ◽

Stanley Loh

Keyword(s):

Knowledge Discovery ◽

Real World ◽

Document Clustering ◽

Relevant Information ◽

Conceptual Clustering ◽

Document Collections ◽

Clustering Techniques ◽

Related Information

This chapter introduces a technique to cluster textual documents using concepts. Document clustering is a technique capable of organizing large amounts of documents in clusters of related information, which helps the localization of relevant information. Traditional document clustering techniques use words to represent the contents of the documents and the use of words may cause semantic mistakes. Concepts, instead, represent real world events and objects, and people employ them to express ideas, thoughts, opinions and intentions. Thus, concepts are more appropriate to represent the contents of a document and its use helps the comprehension of large document collections, since it is possible to summarize each cluster and rapidly identify its contents (i.e. concepts). To perform this task, the chapter presents a methodology to cluster documents using concepts and presents some practical experiments in a case study to demonstrate that the proposed approach achieves better results than the use of words.

Download Full-text

Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery

10.1007/978-3-030-32591-6 ◽

2020 ◽

Cited By ~ 1

Keyword(s):

Knowledge Discovery ◽

Fuzzy Systems ◽

Natural Computation

Download Full-text