scholarly journals The articles.ELM resource: simplifying access to protein linear motif literature by annotation, text-mining and classification

Database ◽  
2020 ◽  
Vol 2020 ◽  
Author(s):  
N Palopoli ◽  
J A Iserte ◽  
L B Chemes ◽  
C Marino-Buslje ◽  
G Parisi ◽  
...  

Abstract Modern biology produces data at a staggering rate. Yet, much of these biological data is still isolated in the text, figures, tables and supplementary materials of articles. As a result, biological information created at great expense is significantly underutilised. The protein motif biology field does not have sufficient resources to curate the corpus of motif-related literature and, to date, only a fraction of the available articles have been curated. In this study, we develop a set of tools and a web resource, ‘articles.ELM’, to rapidly identify the motif literature articles pertinent to a researcher’s interest. At the core of the resource is a manually curated set of about 8000 motif-related articles. These articles are automatically annotated with a range of relevant biological data allowing in-depth search functionality. Machine-learning article classification is used to group articles based on their similarity to manually curated motif classes in the Eukaryotic Linear Motif resource. Articles can also be manually classified within the resource. The ‘articles.ELM’ resource permits the rapid and accurate discovery of relevant motif articles thereby improving the visibility of motif literature and simplifying the recovery of valuable biological insights sequestered within scientific articles. Consequently, this web resource removes a critical bottleneck in scientific productivity for the motif biology field. Database URL: http://slim.icr.ac.uk/articles/

Author(s):  
Manjeet Kumar ◽  
Marc Gouw ◽  
Sushama Michael ◽  
Hugo Sámano-Sánchez ◽  
Rita Pancsa ◽  
...  

Abstract The eukaryotic linear motif (ELM) resource is a repository of manually curated experimentally validated short linear motifs (SLiMs). Since the initial release almost 20 years ago, ELM has become an indispensable resource for the molecular biology community for investigating functional regions in many proteins. In this update, we have added 21 novel motif classes, made major revisions to 12 motif classes and added >400 new instances mostly focused on DNA damage, the cytoskeleton, SH2-binding phosphotyrosine motifs and motif mimicry by pathogenic bacterial effector proteins. The current release of the ELM database contains 289 motif classes and 3523 individual protein motif instances manually curated from 3467 scientific publications. ELM is available at: http://elm.eu.org.


Zootaxa ◽  
2021 ◽  
Vol 4963 (1) ◽  
pp. 58-90
Author(s):  
EDUARD JENDEK ◽  
OTO NAKLÁDAL

Two hundred and eighteen taxa of the genus Agrilus (Coleoptera, Buprestidae) mostly from the Palaearctic and Oriental regions are studied and their taxonomic, nomenclatural, distributional or biological data are updated. For some species, the biological information is supplemented by images from the field, or distributional data by maps showing the whole range of the species. The synonymy of following Agrilus is updated: A. blairi Bourgoin, 1925 (A. telawensis Fisher, 1935 syn. nov.); A. lancifer Deyrolle, 1864 (A. perakianus Kerremans, 1900 syn. nov.); A. lestagei Théry, 1930 (A. collartianus Descarpentries & Villiers, 1963 syn. nov.); A. lugubris Kerremans, 1914 (A. achilleus Obenberger, 1935 syn. nov.); A. ocularis Deyrolle, 1864 (A. capitatus Deyrolle, 1864 syn. nov.), A. bidentellus Obenberger, 1924 syn. nov.) and Agrilus velatus Kerremans, 1912 (A. kuchingensis Tôyama, 1987 syn. nov.). The invalid synonyms A. myrmido Kerremans, 1912; A. myrmidonius Obenberger, 1936 and A. miwai Théry, 1936 are moved from the synonymy of A. achilleus Obenberger, 1935 to synonymy of A. lugubris Kerremans, 1914. A. massanensis Schaefer, 1955 is downgraded to subspecies A. pratensis massanensis Schaefer, 1955 status nov. The specific name samyi Baudon, 1968 is resurrected from the synonymy of A. suturaalba Deyrolle, 1864 and revalidated as Agrilus samyi Baudon, 1968 (nomen revalidatum).


2021 ◽  
Author(s):  
Zhihong Zhang ◽  
Sai Hu ◽  
Wei Yan ◽  
Bihai Zhao ◽  
Lei Wang

Abstract BackgroundIdentification of essential proteins is very important for understanding the basic requirements to sustain a living organism. In recent years, various different computational methods have been proposed to identify essential proteins based on protein-protein interaction (PPI) networks. However, there has been reliable evidence that a huge amount of false negatives and false positives exist in PPI data. Therefore, it is necessary to reduce the influence of false data on accuracy of essential proteins prediction by integrating multi-source biological information with PPI networks.ResultsIn this paper, we proposed a non-negative matrix factorization and multiple biological information based model (NDM) for identifying essential proteins. The first stage in this progress was to construct a weighted PPI network by combing the information of protein domain, protein complex and the topology characteristic of the original PPI network. Then, the non-negative matrix factorization technique was used to reconstruct an optimized PPI network with whole enough weight of edges. In the final stage, the ranking score of each protein was computed by the PageRank algorithm in which the initial scores were calculated with homologous and subcellular localization information. In order to verify the effectiveness of the NDM method, we compared the NDM with other state-of-the-art essential proteins prediction methods. The comparison of the results obtained from different methods indicated that our NDM model has better performance in predicting essential proteins.ConclusionEmploying the non-negative matrix factorization and integrating multi-source biological data can effectively improve quality of the PPI network, which resulted in the led to optimization of the performance essential proteins identification. This will also provide a new perspective for other prediction based on protein-protein interaction networks.


Author(s):  
Elena S. Boltanova ◽  
◽  
Maria P. Imekova ◽  

In the world, it is customary to create biological databases of different species. And initially, the databases for the investigation of crimes were widespread. However, later, when their potential and benefits, including for medicine, were assessed, the databases for other areas appeared. Russia was no exception in this regard. Although, in our country, unlike foreign states, the activities of biological databases based on purposes other than the disclosure of crimes are practically not regulated in any way. This article deals with the analysis of legal regulation of biobanks in the Russian Federation and abroad. Special attention is paid to the classification of biobanks. The purpose of the study is to determine the feasibility in the legislative regulation of their activities, as well as the patterns in such a regulation. To achieve this goal, the authors studied extensive regulatory material, which included EU directives and national regulations of the EU member states. The methodological basis of the study was the general scientific and private scientific meth-ods of research. Of course, such private scientific research methods as the comparative-legal method and the formal legal method have been widely used. Due to the comparative legal analysis, it is established that the EU countries have a high level of legislative activity in terms of determining the legal regime of biological databases. All countries recognize the specifics of such a legal regime, which can largely be explained by a special legal nature of biological samples and biological data. In this regard, the following issues related to the activities of biological databases are reflected everywhere in the EU countries at the level of law: the procedure for their creation; the procedure for receiving, processing, storing and transmitting biological samples and the data obtained on their basis; the rights and obligations of database creators and persons who have provided their biological samples and biological data about themselves; a set of measures aimed at protecting the rights and interests of donors and third parties, etc. As it seems, a similar approach to the regulation of the activities of biological bases estab-lished not for the investigation of crimes should be implemented by Russia. At the same time, special attention should be paid to the research of biological databases. In the Russian Federa-tion, they are created, as a rule, at the local level. Their main drawback is that they are sepa-rate sources of limited biological information, functioning independently of each other while comprehensive (concentrated in one place) information can bring invaluable benefits and advantages for Russian science and medicine as a whole. However, this requires the estab-lishment of an appropriate legal framework.


Author(s):  
N. Srinivasan ◽  
G. Agarwal ◽  
R. M. Bhaskara ◽  
R. Gadkari ◽  
O. Krishnadev ◽  
...  

In the post-genomic era, biological databases are growing at a tremendous rate. Despite rapid accumulation of biological information, functions and other biological properties of many putative gene products of various organisms remain either unknown or obscure. This paper examines how strategic integration of large biological databases and combinations of various biological information helps address some of the fundamental questions on protein structure, function and interactions. New developments in function recognition by remote homology detection and strategic use of sequence databases aid recognition of functions of newly discovered proteins. Knowledge of 3-D structures and combined use of sequences and 3-D structures of homologous protein domains expands the ability of remote homology detection enormously. The authors also demonstrate how combined consideration of functions of individual domains of multi-domain proteins helps in recognizing gross biological attributes. This paper also discusses a few cases of combining disparate biological datasets or combination of disparate biological information in obtaining new insights about protein-protein interactions across a host and a pathogen. Finally, the authors discuss how combinations of low resolution structural data, obtained using cryoEM studies, of gigantic multi-component assemblies, and atomic level 3-D structures of the components is effective in inferring finer features in the assembly.


Author(s):  
Marc Gouw ◽  
Jesús Alvarado-Valverde ◽  
Jelena Čalyševa ◽  
Francesca Diella ◽  
Manjeet Kumar ◽  
...  

2009 ◽  
Author(s):  
Francesca Diella ◽  
Allegra Via ◽  
Claudia Chica ◽  
Katja Luck ◽  
Catherin Gould ◽  
...  

Author(s):  
Chigateri M. Vinay ◽  
Sanjay Kannath Udayamanoharan ◽  
Navya Prabhu Basrur ◽  
Bobby Paul ◽  
Padmalatha S. Rai

AbstractPlant metabolome as the downstream product in the biological information of flow starting from genomics is highly complex, and dynamically produces a wide range of primary and secondary metabolites, including ionic inorganic compounds, hydrophilic carbohydrates, amino acids, organic compounds, and compounds associated with hydrophobic lipids. The complex metabolites present in biological samples bring challenges to analytical tools for separating and characterization of the metabolites. Analytical tools such as nuclear magnetic resonance (NMR) and mass spectrometry have recently facilitated the separation, characterization, and quantification of diverse chemical structures. The massive amount of data generated from these analytical tools need to be handled using fast and accurate bioinformatics tools and databases. In this review, we focused on plant metabolomics data acquisition using various analytical tools and freely available workflows from raw data to meaningful biological data to help biologists and chemists to move at the same pace as computational biologists.


Author(s):  
KMS Rana ◽  
K Ahammad ◽  
MA Salam

Bioinformatics is one of the ongoing trends of biological research integrating gene based information and computational technology to produce new knowledge. It works to synthesize complex biological information from multiomics data (results of high throughput technologies) by employing a number of bioinformatics tools (software). User convenience and availability are the determining factors of these tools being widely used in bioinformatics research. BLAST, FASTA (FAST-All), EMBOSS, ClustalW, RasMol and Protein Explorer, Cn3D, Swiss PDB viewer, Hex, Vega, Bioeditor etc. are commonly operated bioinformatics software tools in fisheries and aquaculture research. By default, these software tools mine and analyze a vast biological data set using the available databases. However, aquaculture scientists can use bioinformatics for genomic data manipulation, genome annotation and expression profiling, molecular folding, modeling, and design as well as generating biological network and system biology. Therefore, they can contribute in specified fields of aquaculture such as disease diagnosis and aquatic health management, fish nutritional aspects and culture-able strain development. Although having huge prospects, Bangladesh is still in infancy of applying bioinformatics in aquaculture research with limited resources. Research council at national level should be formed to bring all the enthusiastic scientists and skilled manpower under a single umbrella and facilitate to contribute in a collaborative platform. Besides, fully-fledged bioinformatics degree should be launched at University levels to produce knowledgeable and trained work force for future research. This review was attempted to shed light on bioinformatics, as young integrated field of bio-computational research, and its significance in aquaculture research of Bangladesh. Int. J. Agril. Res. Innov. Tech. 10(2): 137-145, December 2020


2016 ◽  
Author(s):  
Thor-Seng Liew ◽  
Liz Price ◽  
Gopalasamy Reuben Clements

AbstractBiodiversity conservation is now about prioritisation, especially in a world with limited resources and so many habitats and species in need of protection. However, we cannot prioritise effectively if historical and current information on a particular habitat or species remains scattered. Several good platforms have been created to help users to find, use and create biodiversity information. However, good platforms for sharing habitat information for threatened ecosystems are still lacking. Limestone hills are an example of threatened ecosystems that harbour unique biodiversity, but are facing intensifying anthropogenic disturbances. As limestone is a vital resource for the construction industry, it is not possible to completely halt forest degradation and quarrying in developing countries such as Malaysia, where 445 limestone hills have been recorded in the peninsula to date. As such, there is an urgent need to identify which hills must be prioritised for conservation. To make decisions based on sound science, collating spatial and biological information on limestone hills into a publicly accessible database is critical. Here, we compile Malaysia’s first limestone hill GIS map for 445 limestone hills in the peninsula based on information from geological reports and scientific literature. To assist in conservation prioritisation efforts, we quantify characteristics of limestone hills in terms of size, degree of isolation, and spatial distribution patterns and also assessed the degree of habitat disturbance of each limestone hill in terms of buffer area forest degradation and quarrying activity. All this information is stored in a KMZ file and can be accessed through the Google Earth interface. This product should not be viewed as a final output containing basic limestone hill information. Rather, this database is a foundational platform for users to collect, store, update and manipulate spatial and biological data from limestone hills to better inform decisions regarding their management.


Sign in / Sign up

Export Citation Format

Share Document