
-
HGFDB: a collective database of helmeted guinea fowl genomics Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2021-01-08 Li X, Li Z, Shen Q, et al.
As a vigorous, hardy and almost disease-free game bird, the domestic helmeted guinea fowl (Numida meleagris, hereafter HGF) has attracted considerable attention in a large number of genetic study projects. However, none of the current/recent avian databases are related to this agriculturally and commercially important poultry species. To address this data gap, we developed Helmeted Guinea
-
Creating a Metabolic Syndrome Research Resource using the National Health and Nutrition Examination Survey Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-12-31 Willysha S Jenkins; Christian Richardson; Ariel Williams; Clarlynda R Williams-DeVane
Metabolic syndrome (MetS) is multifaceted. Risk factors include visceral adiposity, dyslipidemia, hyperglycemia, hypertension and environmental stimuli. MetS leads to an increased risk of cardiovascular disease, type 2 diabetes and stroke. Comparative studies, however, have identified heterogeneity in the pathology of MetS across groups though the etiology of these differences has yet to be elucidated
-
CorkOakDB—The Cork Oak Genome Database Portal Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-12-31 Cirenia Arias-Baldrich; Marta Contreiras Silva; Filippo Bergeretti; Inês Chaves; Célia Miguel; Nelson J M Saibo; Daniel Sobral; Daniel Faria; Pedro M Barros
Quercus suber (cork oak) is an evergreen tree native to the Mediterranean basin, which plays a key role in the ecology and economy of this area. Over the last decades, this species has gone through an observable decline, mostly due to environmental factors. Deciphering the mechanisms of cork oak’s response to the environment and getting a deep insight into its biology are crucial to counteract biotic
-
HeartBioPortal2.0: new developments and updates for genetic ancestry and cardiometabolic quantitative traits in diverse human populations Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-12-31 Bohdan B Khomtchouk; Christopher S Nelson; Kasra A Vand; Salvator Palmisano; Robert L Grossman
Cardiovascular disease (CVD) is the leading cause of death worldwide for all genders and across most racial and ethnic groups. However, different races and ethnicities exhibit different rates of CVD and its related cardiorenal and metabolic comorbidities, suggesting differences in genetic predisposition and risk of onset, as well as socioeconomic and lifestyle factors (diet, exercise, etc.) that act
-
Novel methods included in SpolLineages tool for fast and precise prediction of Mycobacterium tuberculosis complex spoligotype families Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-12-15 David Couvin; Wilfried Segretier; Erick Stattner; Nalin Rastogi
Bioinformatic tools are currently being developed to better understand the Mycobacterium tuberculosis complex (MTBC). Several approaches already exist for the identification of MTBC lineages using classical genotyping methods such as mycobacterial interspersed repetitive units—variable number of tandem DNA repeats and spoligotyping-based families. In the recently released SITVIT2 proprietary database
-
NPBS database: a chemical data resource with relational data between natural products and biological sources Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-12-11 Tingjun Xu; Weiming Chen; Junhong Zhou; Jingfang Dai; Yingyong Li; Yingli Zhao
NPBS (Natural Products & Biological Sources) database is a chemical data resource with relational data between natural products and biological sources, manually curated from the natural product research literature. The relational data link a specific species to all the natural products derived from it and, conversely, link a specific natural product to all of its biological sources. The biological sources
-
CamRegBase: a gene regulation database for the biofuel crop, Camelina sativa Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-12-11 Fabio Gomez-Cano; Lisa Carey; Kevin Lucas; Tatiana García Navarrete; Eric Mukundi; Steve Lundback; Danny Schnell; Erich Grotewold
Camelina is an annual oilseed plant from the Brassicaceae family that is gaining momentum as a biofuel winter cover crop. However, a significant limitation in further enhancing its utility as a producer of oils that can be used as biofuels, jet fuels or bio-based products is the absence of a repository for all the gene expression and regulatory information that is being rapidly generated by the community
-
CEG 2.0: an updated database of clusters of essential genes including eukaryotic organisms Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-12-11 Shuo Liu; Shu-Xuan Wang; Wei Liu; Chen Wang; Fa-Zhan Zhang; Yuan-Nong Ye; Candy-S Wu; Wen-Xin Zheng; Nini Rao; Feng-Biao Guo
Essential genes are key elements that organisms need to stay alive. Building databases that store essential genes in the form of homologous clusters, rather than storing them as singletons, can provide more enlightening information such as the general essentiality of homologous genes in multiple organisms. In 2013, the first database to store prokaryotic essential genes in clusters, CEG (Clusters
-
Applying graph database technology for analyzing perturbed co-expression networks in cancer Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-12-11 Claire M Simpson; Florian Gnad
Graph representations provide an elegant solution to capture and analyze complex molecular mechanisms in the cell. Co-expression networks are undirected graph representations of transcriptional co-behavior indicating (co-)regulations, functional modules or even physical interactions between the corresponding gene products. The growing avalanche of available RNA sequencing (RNAseq) data fuels the construction
-
Knowledge extraction for assisted curation of summaries of bacterial transcription factor properties Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-12-11 Carlos-Francisco Méndez-Cruz; Antonio Blanchet; Alan Godínez; Ignacio Arroyo-Fernández; Socorro Gama-Castro; Sara Berenice Martínez-Luna; Cristian González-Colín; Julio Collado-Vides
Transcription factors (TFs) play a main role in transcriptional regulation of bacteria, as they regulate transcription of the genetic information encoded in DNA. Thus, the curation of the properties of these regulatory proteins is essential for a better understanding of transcriptional regulation. However, traditional manual curation of article collections to compile descriptions of TF properties takes
-
HAHmiR.DB: a server platform for high-altitude human miRNA–gene coregulatory networks and associated regulatory circuits Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-12-01 Pankaj Khurana; Apoorv Gupta; Ragumani Sugadev; Yogendra Kumar Sharma; Bhuvnesh Kumar
Around 140 million people live in high-altitude (HA) conditions, and an even larger number visit such places for tourism, adventure-seeking or sports training. Rapid ascent to HA can cause severe damage to the body organs and may lead to many fatal disorders. During induction to HA, the human body undergoes various physiological, biochemical, hematological and molecular changes to adapt to the extreme environmental
-
ncVarDB: a manually curated database for pathogenic non-coding variants and benign controls Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-12-01 Harry Biggs; Padmini Parthasarathy; Alexandra Gavryushkina; Paul P Gardner
Variants within the non-coding genome are frequently associated with phenotypes in genome-wide association studies. These non-coding regions may be involved in the regulation of gene expression, encode functional non-coding RNAs, or influence splicing and other cellular functions. We have curated a list of characterized non-coding human genome variants based on the published evidence that indicates
-
A hybrid approach toward biomedical relation extraction training corpora: combining distant supervision with crowdsourcing Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-12-01 Diana Sousa; Andre Lamurias; Francisco M Couto
Biomedical relation extraction (RE) datasets are vital in the construction of knowledge bases and to potentiate the discovery of new interactions. There are several ways to create biomedical RE datasets, some more reliable than others, such as resorting to domain expert annotations. However, the emerging use of crowdsourcing platforms, such as Amazon Mechanical Turk (MTurk), can potentially reduce
-
RegulomePA: a database of transcriptional regulatory interactions in Pseudomonas aeruginosa PAO1 Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-12-01 Edgardo Galán-Vásquez; Beatriz Carely Luna-Olivera; Marcelino Ramírez-Ibáñez; Agustino Martínez-Antonio
We present RegulomePA, a database that contains biological information on regulatory interactions between transcription factors (TFs), sigma factors (SFs) and target genes in Pseudomonas aeruginosa PAO1. RegulomePA consists of 4827 regulatory interactions between 2831 nodes, which represent the interactions of TFs and SFs with their target genes, from the total of predicted RegulomePA including 27.27%
-
ThRSDB: a database of Thai rice starch composition, molecular structure and functionality Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-12-01 Kwanjeera Wanichthanarak; Maysaya Thitisaksakul
As starch properties can affect end product quality in many ways, rice starch from Thai domesticated cultivars and landraces has been the focus of increasing research interest. Increasing knowledge in this area creates a high demand from the research community for better organized information. The Thai Rice Starch Database (ThRSDB) is an online database containing data extensively curated from original
-
VarStack: a web tool for data retrieval to interpret somatic variants in cancer Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-28 Morgan Howard; Bruce Kane; Mary Lepry; Paul Stey; Ashok Ragavendran; Ece D Gamsiz Uzun
Advances in tumor genome sequencing created an urgent need for bioinformatics tools to support the interpretation of the clinical significance of the variants detected. VarStack is a web tool for retrieving somatic variant data relating to cancer from existing databases. VarStack incorporates data from several publicly available databases and presents them with an easy-to-navigate user
-
An informatics research platform to make public gene expression time-course datasets reusable for more scientific discoveries Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-28 Braja Gopal Patra; Babak Soltanalizadeh; Nan Deng; Leqing Wu; Vahed Maroufy; Canglin Wu; W Jim Zheng; Kirk Roberts; Hulin Wu; Ashraf Yaseen
The exponential growth of genomic/genetic data in the era of Big Data demands new solutions for making these data findable, accessible, interoperable and reusable. In this article, we present a web-based platform named Gene Expression Time-Course Research (GETc) Platform that enables the discovery and visualization of time-course gene expression data and analytical results from the NIH/NCBI-sponsored
-
OGDA: a comprehensive organelle genome database for algae Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-28 Tao Liu; Yutong Cui; Xuli Jia; Jing Zhang; Ruoran Li; Yahui Yu; Shangang Jia; Jiangyong Qu; Xumin Wang
Algae are the oldest taxa on Earth, with an evolutionary relationship that spans prokaryotes (Cyanobacteria) and eukaryotes. A long evolutionary history has led to high algal diversity. Their organelle DNAs are characterized by uniparental inheritance and a compact genome structure compared with nuclear genomes; thus, they are efficient molecular tools for the analysis of gene structure, genome structure
-
A curated database reveals trends in single-cell transcriptomics Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-28 Valentine Svensson; Eduardo da Veiga Beltrame; Lior Pachter
The more than 1000 single-cell transcriptomics studies that have been published to date constitute a valuable and vast resource for biological discovery. While various ‘atlas’ projects have collated some of the associated datasets, most questions related to specific tissue types, species or other attributes of studies require identifying papers through manual and challenging literature search. To facilitate
-
BarleyVarDB: a database of barley genomic variation Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-28 Cong Tan; Brett Chapman; Penghao Wang; Qisen Zhang; Gaofeng Zhou; Xiao-qi Zhang; Roberto A Barrero; Matthew I Bellgard; Chengdao Li
Barley (Hordeum vulgare L.) is one of the first domesticated grain crops and represents the fourth most important cereal source for human and animal consumption. BarleyVarDB is a database of barley genomic variation. It is publicly accessible through the website at http://146.118.64.11/BarleyVar. This database mainly provides three sets of information. First, there are 57 754 224 single nuclear
-
KiMoSys 2.0: an upgraded database for submitting, storing and accessing experimental data for kinetic modeling Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-28 Hugo Mochão; Pedro Barahona; Rafael S Costa
The KiMoSys (https://kimosys.org), launched in 2014, is a public repository of published experimental data, which contains concentration data of metabolites, protein abundances and flux data. It offers a web-based interface and upload facility to share data, making it accessible in structured formats, while also integrating associated kinetic models related to the data. In addition, it also supplies
-
WCSdb: a database of wild Coffea species Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-20 Romain Guyot; Perla Hamon; Emmanuel Couturon; Nathalie Raharimalala; Jean-Jacques Rakotomalala; Sreenath Lakkanna; Sylvie Sabatier; Antoine Affouard; Pierre Bonnet
Coffee is a beverage enjoyed by millions of people worldwide and an important commodity for millions of people. Besides the two cultivated species (Coffea arabica and Coffea canephora), the 139 wild coffee species/taxa belonging to the Coffea genus are largely unknown to coffee scientists and breeders, although these species may be crucial for future coffee crop development in the face of climate change. Here
-
EukRef-excavates: seven curated SSU ribosomal RNA gene databases Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-20 Martin Kolisko; Olga Flegontova; Anna Karnkowska; Gordon Lax; Julia M Maritz; Tomáš Pánek; Petr Táborský; Jane M Carlton; Ivan Čepička; Aleš Horák; Julius Lukeš; Alastair G B Simpson; Vera Tai
The small subunit ribosomal RNA (SSU rRNA) gene is a widely used molecular marker to study the diversity of life. Sequencing of SSU rRNA gene amplicons has become a standard approach for the investigation of the ecology and diversity of microbes. However, a well-curated database is necessary for correct classification of these data. While available for many groups of Bacteria and Archaea, such reference
-
Predicted rat interactome database and gene set linkage analysis Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-20 Yu-Tian Tao; Xiao-Bao Ding; Jie Jin; Hai-Bo Zhang; Wen-Ping Guo; Li Ruan; Qiao-Lei Yang; Peng-Cheng Chen; Heng Yao; Xin Chen
Rattus norvegicus, or the rat, has been widely used as an animal model for a diversity of human diseases in the last 150 years. The rat, as a disease model, has the advantage of relatively large body size and highly similar physiology to humans. In drug discovery, rat models are routinely used in drug efficacy and toxicity assessments. To facilitate molecular pharmacology studies in rats, we present
-
Measurement Recorder: developing a useful tool for making species descriptions that produces computable phenotypes Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-20 Hong Cui; Limin Zhang; Bruce Ford; Hsin-liang Cheng; James A Macklin; Anton Reznicek; Julian Starr
To use published phenotype information in computational analyses, there have been efforts to convert descriptions of phenotype characters from human languages to ontologized statements. This postpublication curation process is not only slow and costly, it is also burdened with significant intercurator variation (including curator–author variation), due to different interpretations of a character by
-
GPCR-PEnDB: a database of protein sequences and derived features to facilitate prediction and classification of G protein-coupled receptors Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-20 Khodeza Begum; Jonathon E Mohl; Fredrick Ayivor; Eder E Perez; Ming-Ying Leung
G protein-coupled receptors (GPCRs) constitute the largest group of membrane receptor proteins in eukaryotes. Due to their significant roles in various physiological processes such as vision, smell and inflammation, GPCRs are the targets of many prescription drugs. However, the functional and sequence diversity of GPCRs has kept their prediction and classification based on amino acid sequence data
-
STOREFISH 2.0: a database on the reproductive strategies of teleost fishes Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-20 Stéphane Teletchea; Fabrice Teletchea
Teleost fishes show the most outstanding reproductive diversity of all vertebrates. Yet to date, no one has been able to decisively explain this striking variability nor to perform large-scale phylogenetic analyses of reproductive modes. Here, we describe STrategies Of REproduction in FISH (STOREFISH) 2.0, an online database easing the sharing of an original data set on reproduction published in 2007
-
DPL: a comprehensive database on sequences, structures, sources and functions of peptide ligands Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-20 Fangyu Wang; Ning Li; Chunfeng Wang; Guangxu Xing; Shuai Cao; Qian Xu; Yunshang Zhang; Man Hu; Gaiping Zhang
DPL (http://www.peptide-ligand.cn/) is a comprehensive database of peptide ligands (DPL). DPL1.0 holds 1044 peptide ligand entries and provides references for the study of the polypeptide platform. The data were collected from PubMed-NCBI, PDB, APD3, CAMPR3, etc. The lengths of the base sequences vary from 3 to 78. The DPL database has 923 linear peptides and 88 cyclic peptides. The functions of peptides
-
TopoDB: a novel multifunctional management system for laboratory animal colonies Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-18 Adam Renschen; Atsuko Matsunaga; Jorge R Oksenberg; Adam Santaniello; Alessandro Didonna
Animal models are widely employed in basic research to test mechanistic hypotheses in a complex biological environment as well as to evaluate the therapeutic potential of candidate compounds in preclinical settings. Rodents, and in particular mice, represent the most common in vivo models for their small size, short lifespan and possibility to manipulate their genome. Over time, a typical laboratory
-
OCCAM: prediction of small ORFs in bacterial genomes by means of a target-decoy database approach and machine learning techniques Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-18 Fabio R. Cerqueira; Ana Tereza Ribeiro Vasconcelos
Small open reading frames (ORFs) have been systematically disregarded by automatic genome annotation. The difficulty in finding patterns in tiny sequences is the main reason that small ORFs are overlooked by computational procedures. However, advances in experimental methods show that small proteins can play vital roles in cellular activities. Hence, it is urgent to make progress in the development
-
The IMEx coronavirus interactome: an evolving map of Coronaviridae–host molecular interactions Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-18 L Perfetto; C Pastrello; N del-Toro; M Duesbury; M Iannuccelli; M Kotlyar; L Licata; B Meldal; K Panneerselvam; S Panni; N Rahimzadeh; S Ricard-Blum; L Salwinski; A Shrivastava; G Cesareni; M Pellegrini; S Orchard; I Jurisica; H Hermjakob; P Porras
The current coronavirus disease of 2019 (COVID-19) pandemic, caused by the severe acute respiratory syndrome coronavirus (SARS-CoV)-2, has spurred a wave of research of nearly unprecedented scale. Among the different strategies that are being used to understand the disease and develop effective treatments, the study of physical molecular interactions can provide fine-grained resolution of the mechanisms
-
CRISPR sequences are sometimes erroneously translated and can contaminate public databases with spurious proteins containing spaced repeats Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-18 Alejandro Rubio; Pablo Mier; Miguel A Andrade-Navarro; Andrés Garzón; Juan Jiménez; Antonio J Pérez-Pulido
The genomics era is resulting in the generation of a plethora of biological sequences that are usually stored in public databases. There are many computational tools that facilitate the annotation of these sequences, but sometimes they produce mistakes that enter the databases and can be propagated when erroneous data are used for secondary analyses, such as gene prediction or homology searching. While
-
YQFC: a web tool to compare quantitative biological features between two yeast gene lists Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-11 Wei-Sheng Wu; Lai-Ji Wang; Han-Chen Yen; Yan-Yuan Tseng
Nowadays high-throughput omics technologies are routinely used in biological research. From the omics data, researchers can easily get two gene lists (e.g. stress-induced genes vs. stress-repressed genes) related to their biological question. The next step would be to apply enrichment analysis tools to identify distinct functional/regulatory features between these two gene lists for further investigation
-
WGVD: an integrated web-database for wheat genome variation and selective signatures Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-11 Jierong Wang; Weiwei Fu; Rui Wang; Dexiang Hu; Hong Cheng; Jing Zhao; Yu Jiang; Zhensheng Kang
Bread wheat is one of the most important crops worldwide. With the release of the complete wheat reference genome and the development of next-generation sequencing technology, a mass of genomic data from bread wheat and its progenitors has been generated and has provided genomic resources for wheat genetics research. To conveniently and effectively access and use these data, we established Wheat Genome
-
CitrusKB: a comprehensive knowledge base for transcriptome and interactome of Citrus spp. infected by Xanthomonas citri subsp. citri at different infection stages Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-11 Adriano Ferrasa; Mayara M Murata; Teresa D C G Cofre; Juliana S Cavallini; Gustavo Peron; Maria H M Julião; José Belasque; Henrique Ferreira; Maria Inês T Ferro; Rui P Leite; Helen A Penha; Flávia M S Carvalho; Alessandro M Varani; Roberto H Herai; Jesus A Ferro
Citrus canker type A is a serious disease caused by Xanthomonas citri subsp. citri (X. citri), which is responsible for severe losses to growers and to the citrus industry worldwide. To date, no canker-resistant citrus genotypes are available, and there is limited information regarding the molecular and genetic mechanisms involved in the early stages of the citrus canker development. Here, we present
-
CircR2Cancer: a manually curated database of associations between circRNAs and cancers Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-11 Wei Lan; Mingrui Zhu; Qingfeng Chen; Baoshan Chen; Jin Liu; Min Li; Yi-Ping Phoebe Chen
Accumulating evidence has shown that the deregulation of circRNA is closely associated with many human cancers. However, these experimentally verified circRNA–cancer associations are not collected in any database. Here, we develop a manually curated database (circR2Cancer) that provides experimentally supported associations between circRNAs and cancers. The current version of the circR2Cancer contains
-
A checklist recipe: making species data open and FAIR Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-11 Lien Reyserhove; Peter Desmet; Damiano Oldoni; Tim Adriaens; Diederik Strubbe; Amy J S Davis; Sonia Vanderhoeven; Filip Verloove; Quentin Groom
Species checklists are a crucial source of information for research and policy. Unfortunately, many traditional species checklists vary wildly in their content, format, availability and maintenance. The fact that these are not open, findable, accessible, interoperable and reusable (FAIR) severely hampers fast and efficient information flow to policy and decision-making that are required to tackle the
-
A Collection of Benchmark Data Sets for Knowledge Graph-based Similarity in the Biomedical Domain Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-11 Carlota Cardoso; Rita T Sousa; Sebastian Köhler; Catia Pesquita
The ability to compare entities within a knowledge graph is a cornerstone technique for several applications, ranging from the integration of heterogeneous data to machine learning. It is of particular importance in the biomedical domain, where semantic similarity can be applied to the prediction of protein–protein interactions, associations between diseases and genes, cellular localization of proteins
-
Color Data v2: a user-friendly, open-access database with hereditary cancer and hereditary cardiovascular conditions datasets Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-11 Mark J Berger; Hannah E Williams; Ryan Barrett; Anjali D Zimmer; Wendy McKennon; Huy Hong; Jeremy Ginsberg; Alicia Y Zhou; Cynthia L Neben
Publicly available genetic databases promote data sharing and fuel scientific discoveries for the prevention, treatment and management of disease. In 2018, we built Color Data, a user-friendly, open access database containing genotypic and self-reported phenotypic information from 50 000 individuals who were sequenced for 30 genes associated with hereditary cancer. In a continued effort to promote
-
Exploring functionally annotated transcriptional consensus regulatory elements with CONREL Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-09 Davide Dalfovo; Samuel Valentini; Alessandro Romanel
Understanding the interaction between human genome regulatory elements and transcription factors is fundamental to elucidate the structure of gene regulatory networks. Here we present CONREL, a web application that allows for the exploration of functionally annotated transcriptional ‘consensus’ regulatory elements at different levels of abstraction. CONREL provides an extensive collection of consensus
-
A content-based dataset recommendation system for researchers—a case study on Gene Expression Omnibus (GEO) repository Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-11-12 Braja Gopal Patra; Kirk Roberts; Hulin Wu
It is a growing trend among researchers to make their data publicly available for experimental reproducibility and data reusability. Sharing data with fellow researchers helps in increasing the visibility of the work. On the other hand, there are researchers who are inhibited by the lack of data resources. To overcome this challenge, many repositories and knowledge bases have been established to date
-
LAMP2: a major update of the database linking antimicrobial peptides Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-08-25 Guizi Ye; Hongyu Wu; Jinjiang Huang; Wei Wang; Kuikui Ge; Guodong Li; Jiang Zhong; Qingshan Huang
Antimicrobial peptides (AMPs) have been regarded as a potential weapon to fight against drug-resistant bacteria, which are threatening the globe. Thus, more and more AMPs have been designed or identified. There is a need to integrate them into a platform for researchers to facilitate investigation and analyze existing AMPs. The AMP database has become an important tool for the discovery and transformation
-
FAIR digital objects in environmental and life sciences should comprise workflow operation design data and method information for repeatability of study setups and reproducibility of results Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-08-20 Janno Harjes; Anton Link; Tanja Weibulat; Dagmar Triebel; Gerhard Rambold
Repeatability of study setups and reproducibility of research results by underlying data are major requirements in science. Until now, abstract models for describing the structural logic of studies in environmental sciences are lacking and tools for data management are insufficient. Mandatory for repeatability and reproducibility is the use of sophisticated data management solutions going beyond data
-
NCBI Taxonomy: a comprehensive update on curation, resources and tools Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-08-06 Conrad L Schoch; Stacy Ciufo; Mikhail Domrachev; Carol L Hotton; Sivakumar Kannan; Rogneda Khovanskaya; Detlef Leipe; Richard Mcveigh; Kathleen O’Neill; Barbara Robbertse; Shobha Sharma; Vladimir Soussov; John P Sullivan; Lu Sun; Seán Turner; Ilene Karsch-Mizrachi
The National Center for Biotechnology Information (NCBI) Taxonomy includes organism names and classifications for every sequence in the nucleotide and protein sequence databases of the International Nucleotide Sequence Database Collaboration. Since the last review of this resource in 2012, it has undergone several improvements. Most notable is the shift from a single SQL database to a series of linked
-
SPDB: a specialized database and web-based analysis platform for swine pathogens Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-08-06 Xiaoru Wang; Zongbao Liu; Xiaoying Li; Danwei Li; Jiayu Cai; He Yan
The rapid and accurate diagnosis of swine diseases is indispensable for reducing their negative impacts on the pork industry. Next-generation sequencing (NGS) is a promising diagnostic tool for swine diseases. To support the application of NGS in the diagnosis of swine disease, we established the Swine Pathogen Database (SPDB). The SPDB represents the first comprehensive and highly specialized database
-
FLUTE: Fast and reliable knowledge retrieval from biomedical literature Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-08-06 Emilee Holtzapple; Cheryl A Telmer; Natasa Miskov-Zivanov
State-of-the-art machine reading methods extract, in hours, hundreds of thousands of events from the biomedical literature. However, many of the extracted biomolecular interactions are incorrect or not relevant for computational modeling of a system of interest. Therefore, rapid, automated methods are required to filter and select accurate and useful information. The FiLter for Understanding True Events
-
ConoMode, a database for conopeptide binding modes Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-08-04 Xiao Li; Hao Liu; Chunxiao Gao; Yangyang Li; Dongning Jia; Yanbo Yang; Jinbo Yang; Zhiqiang Wei; Tao Jiang; Rilei Yu
ConoMode is a database for complex three-dimensional (3D) structures of conopeptides binding with their target proteins. Conopeptides, a large family of peptides from the venom of marine snails of the Conus genus, have exceptionally diverse sequences, and their high specificity to block ion channels makes them crucial as drug leads and tools for physiological studies. ConoMode is a specialized archive
-
CerealsDB-new tools for the analysis of the wheat genome: update 2020 Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-08-04 Paul A Wilkinson; Alexandra M Allen; Simon Tyrrell; Luzie U Wingen; Xingdong Bian; Mark O Winfield; Amanda Burridge; Daniel S Shaw; Jan Zaucha; Simon Griffiths; Robert P Davey; Keith J Edwards; Gary L A Barker
CerealsDB (www.cerealsdb.uk.net) is an online repository of mainly hexaploid wheat (Triticum aestivum) single nucleotide polymorphisms (SNPs) and genotyping data. The CerealsDB website has been designed to enable wheat breeders and scientists to select the appropriate markers for research breeding tasks, such as marker-assisted selection. We report a large update of genotyping information for over
-
Gliome database: a comprehensive web-based tool to access and analyze glia secretome data. Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-07-31 Jong-Heon Kim, Su-Hyeong Park, Jin Han, Pan-Woo Ko, Dongseop Kwon, Kyoungho Suk
Glial cells are phenotypically heterogeneous non-neuronal components of the central and peripheral nervous systems. These cells are endowed with diverse functions and molecular machineries to detect and regulate neuronal or their own activities by various secreted mediators, such as proteinaceous factors. In particular, glia-secreted proteins form a basis of a complex network of glia–neuron or glia–glia
-
OPTIK: a database for understanding catchment areas to guide mobilization of cancer center assets. Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-07-28 Dinesh Pal Mudaranthakam, Lisa M Harlan-Williams, Roy A Jensen, Hanluen Kuo, Vandita Garimella, Ronald C Chen, Matthew S Mayo, Hope Krebill
An increasingly diversified demographic landscape in rural and urban America warrants the attention of The University of Kansas Cancer Center (KU Cancer Center) researchers, clinicians, outreach staff and administrators as the institution assesses ways to reach its expansive, bi-state catchment area. Within the counties of the KU Cancer Center catchment area, patient level and public health data are
-
CNSA: a data repository for archiving omics data. Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-07-23 Xueqin Guo, Fengzhen Chen, Fei Gao, Ling Li, Ke Liu, Lijin You, Cong Hua, Fan Yang, Wanliang Liu, Chunhua Peng, Lina Wang, Xiaoxia Yang, Feiyu Zhou, Jiawei Tong, Jia Cai, Zhiyong Li, Bo Wan, Lei Zhang, Tao Yang, Minwen Zhang, Linlin Yang, Yawen Yang, Wenjun Zeng, Bo Wang, Xiaofeng Wei, Xun Xu
With the application and development of high-throughput sequencing technology in life and health sciences, massive multi-omics data brings the problem of efficient management and utilization. Database development and biocuration are the prerequisites for the reuse of these big data. Here, relying on China National GeneBank (CNGB), we present CNGB Sequence Archive (CNSA) for archiving omics data, including
-
WellExplorer: an integrative resource linking hydraulic fracturing chemicals with hormonal pathways and geographic location. Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-07-23 Owen Wetherbee, Jessica R Meeker, Caroline DeVoto, Trevor M Penning, Jason H Moore, Mary Regina Boland
Exposure to hydraulic fracturing fluid in drinking water increases the risk of many adverse health outcomes. Unfortunately, most individuals and researchers are unaware of the health risks posed by a particular well due to the diversity of chemical ingredients used across sites. We constructed WellExplorer (http://WellExplorer.org), an interactive tool for researchers and community members to use for
-
Erratum to: Large expert-curated database for benchmarking document similarity detection in biomedical literature search. Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-07-18 Peter Brown,,Yaoqi Zhou
This manuscript has been updated in order to amend incorrect affiliation indices for several authors. The publisher apologises for any confusion caused.
-
Autophagy and Tumor Database: ATdb, a novel database connecting autophagy and tumor. Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-07-07 Kelie Chen, Dexin Yang, Fan Zhao, Shengchao Wang, Yao Ye, Wenjie Sun, Haohua Lu, Zhi Ruan, Jinming Xu, Tianru Wang, Guang Lu, Liming Wang, Yu Shi, Honghe Zhang, Han Wu, Weiguo Lu, Han-Ming Shen, Dajing Xia, Yihua Wu
Autophagy is an essential cellular process that is closely implicated in diverse pathophysiological processes and a variety of human diseases, especially tumors. Autophagy is regarded as not only an anti-cancer process in tumorigenesis but also a pro-tumor process in progression and metastasis according to current research. It means the role of autophagy in tumor is considered to be complex, controversial
-
Tripal and Galaxy: supporting reproducible scientific workflows for community biological databases. Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-07-04 Shawna Spoor, Connor Wytko, Brian Soto, Ming Chen, Abdullah Almsaeed, Bradford Condon, Nic Herndon, Heidi Hough, Sook Jung, Meg Staton, Jill Wegrzyn, Dorrie Main, F Alex Feltus, Stephen P Ficklin
Online biological databases housing genomics, genetic and breeding data can be constructed using the Tripal toolkit. Tripal is an open-source, internationally developed framework that implements FAIR data principles and is meant to ease the burden of constructing such websites for research communities. Use of a common, open framework improves the sustainability and manageability of such a site. Site
-
SAGER: a database of Symbiodiniaceae and Algal Genomic Resource. Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-07-04 Liying Yu, Tangcheng Li, Ling Li, Xin Lin, Hongfei Li, Chichi Liu, Chentao Guo, Senjie Lin
Symbiodiniaceae dinoflagellates are essential endosymbionts of reef-building corals and some other invertebrates. Information on their genome structure and function is critical for understanding coral symbiosis and bleaching. With the rapid development of sequencing technology, genome draft assemblies of several Symbiodiniaceae species and diverse marine algal genomes have become publicly available
-
RNAWRE: a resource of writers, readers and erasers of RNA modifications. Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-07-01 Fulei Nie, Pengmian Feng, Xiaoming Song, Meng Wu, Qiang Tang, Wei Chen
RNA modifications are involved in various cellular biological processes. Accumulating evidence has demonstrated that the functions of RNA modifications are determined by the effectors that can catalyze, recognize and remove RNA modifications. They are called ‘writers’, ‘readers’ and ‘erasers’. The identification of RNA modification effectors will be helpful for understanding the regulatory
-
CHDGKB: a knowledgebase for systematic understanding of genetic variations associated with non-syndromic congenital heart disease. Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-07-01 Lan Yang, Yang Yang, Xingyun Liu, Yongquan Chen, Yalan Chen, Yuxin Lin, Yan Sun, Bairong Shen
Congenital heart disease (CHD) is one of the most common birth defects, with complex genetic and environmental etiologies. The reports of genetic variation associated with CHD have increased dramatically in recent years due to the revolutionary development of molecular technology. However, CHD is a heterogeneous disease, and its genetic origins remain inconclusive in most patients. Here we present
-
mAML: an automated machine learning pipeline with a microbiome repository for human disease classification. Database J. Biol. Databases Curation (IF 2.593) Pub Date : 2020-06-25 Fenglong Yang, Quan Zou
Due to the concerted efforts to utilize the microbial features to improve disease prediction capabilities, automated machine learning (AutoML) systems aiming to get rid of the tediousness in manually performing ML tasks are in great demand. Here we developed mAML, an ML model-building pipeline, which can automatically and rapidly generate optimized and interpretable models for personalized microbiome-based