Database name: Intensification
Database summary: Intensification
Country/Region: United States
Main database information: Intensification is a database that contains results for 12 repeat protein domains, obtained by amplifying population-genetic signal through the construction of a motif-based multiple sequence alignment (motif-MSA). Because protein-coding regions are typically under high selective constraint, variants in these regions occur at low frequencies, so there are often insufficient statistics for downstream calculations. We make use of the modular structure of repeat motifs to amplify signals of selection from population genetics and traditional inter-species conservation. For each repeat protein domain, we construct a motif-MSA and then accumulate single nucleotide variants (SNVs) across the human population based on the genomic coordinate system of the motif-MSA. This allows us to integrate all the corresponding SNV population-genetic profiles, including enrichment of rare variants, non-synonymous-to-synonymous ratio and delta DAFs, with the amino acid variation across the motif-MSA.
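The accumulation step described above can be illustrated with a minimal Python sketch. It is not the database's actual pipeline: the function names (`column_map`, `accumulate_snvs`, `nonsyn_to_syn_ratio`), the toy sequences, and the SNV records are all hypothetical, and SNVs are indexed by residue position within a repeat rather than by genomic coordinate to keep the example short.

```python
from collections import defaultdict


def column_map(aligned_seq):
    """Map ungapped residue index -> motif-MSA column for one aligned repeat."""
    mapping = {}
    residue = 0
    for col, ch in enumerate(aligned_seq):
        if ch != "-":  # gaps occupy a column but consume no residue
            mapping[residue] = col
            residue += 1
    return mapping


def accumulate_snvs(motif_msa, snvs):
    """Pool SNVs from every repeat instance onto shared motif-MSA columns.

    motif_msa: dict of repeat_id -> aligned sequence (gaps written as '-')
    snvs: iterable of (repeat_id, residue_index, effect),
          where effect is 'nonsyn' or 'syn' (hypothetical record layout)
    """
    maps = {rid: column_map(seq) for rid, seq in motif_msa.items()}
    counts = defaultdict(lambda: {"nonsyn": 0, "syn": 0})
    for rid, residue_index, effect in snvs:
        col = maps[rid][residue_index]
        counts[col][effect] += 1
    return dict(counts)


def nonsyn_to_syn_ratio(col_counts):
    """Non-synonymous-to-synonymous ratio for one column; None if no synonymous SNVs."""
    syn = col_counts["syn"]
    return col_counts["nonsyn"] / syn if syn else None


# Toy motif-MSA with three repeat instances (invented sequences).
motif_msa = {
    "repeat_1": "GA-TL",
    "repeat_2": "GAKTL",
    "repeat_3": "G-KTI",
}
# Toy SNVs: three fall on the aligned T (column 3), one on the aligned G (column 0).
snvs = [
    ("repeat_1", 2, "nonsyn"),
    ("repeat_2", 3, "nonsyn"),
    ("repeat_3", 2, "syn"),
    ("repeat_2", 0, "syn"),
]
pooled = accumulate_snvs(motif_msa, snvs)
```

Each repeat instance alone carries at most one SNV at the aligned T position, but pooling across the motif-MSA column gives three observations there, which is the amplification effect the description refers to.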
Year established: 2017
Contact information:
University/Institution:
Yale University
Address:
Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
City:
Province/State:
Country/Region:
United States
Contact name (PI/Team):
Mark Gerstein
Contact email (PI/Helpdesk):
mark@gersteinlab.org