Introduction to the UNC Systems Genetics Database

Database name: UNC Systems Genetics

Database summary:

Country/Region: United States

Main database information: Here we provide genomic sequences for the Collaborative Cross (CC) mouse strains and the eight CC founder strains in the form of FASTA files for the 19 autosomes, the sex chromosomes (X and Y), and the mitochondrial genome (M). These sequences can be used as reference sequences for high-throughput short-read alignment or for other comparative genomic analyses.

Each genome comes with a companion MOD file, which can be used to remap coordinates from the FASTA sequences back to reference coordinates. This is necessary because gene and genomic annotations are generally specified relative to the reference. MOD files are genome- and version-specific, and should therefore always be downloaded as a set together with their associated FASTA sequence.
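The remapping idea can be illustrated with a minimal sketch. It does not parse the actual MOD file format; it assumes a hypothetical, simplified list of insertions and deletions relative to the reference, and the names `build_blocks` and `strain_to_ref` are illustrative only. The sketch converts the indels into aligned blocks and then looks up a strain coordinate with a binary search.

```python
from bisect import bisect_right

def build_blocks(indels, ref_length):
    """Turn indels into aligned blocks (strain_start, ref_start, length).

    indels: list sorted by reference position, 0-based, in a hypothetical
    simplified form (NOT the real MOD format):
      ("d", p, n): reference bases p..p+n-1 are absent from the strain.
      ("i", p, n): n extra strain bases are inserted before reference base p.
    """
    blocks = []
    ref_cursor = 0
    strain_cursor = 0
    for kind, ref_pos, size in indels:
        span = ref_pos - ref_cursor
        if span > 0:
            # Bases shared by strain and reference up to this indel.
            blocks.append((strain_cursor, ref_cursor, span))
            strain_cursor += span
            ref_cursor = ref_pos
        if kind == "d":
            ref_cursor += size      # skip reference bases missing in strain
        elif kind == "i":
            strain_cursor += size   # skip strain bases with no reference match
        else:
            raise ValueError(f"unknown indel kind: {kind!r}")
    if ref_cursor < ref_length:
        blocks.append((strain_cursor, ref_cursor, ref_length - ref_cursor))
    return blocks

def strain_to_ref(blocks, strain_pos):
    """Map a 0-based strain coordinate to a reference coordinate.

    Returns None when the position lies inside an insertion or is out of range.
    """
    starts = [block[0] for block in blocks]
    i = bisect_right(starts, strain_pos) - 1
    if i < 0:
        return None
    strain_start, ref_start, length = blocks[i]
    if strain_pos < strain_start + length:
        return ref_start + (strain_pos - strain_start)
    return None

# Toy example: a 20-base reference with one deletion and one insertion.
blocks = build_blocks([("d", 5, 2), ("i", 10, 3)], ref_length=20)
```

In this toy example, strain position 5 maps to reference position 7 because two reference bases were deleted before it, while strain positions 8 to 10 fall inside the insertion and have no reference coordinate.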

We supply two types of genomes: sequenced and imputed. Sequenced genomes result from direct DNA sequencing at a minimum of 30x coverage, followed by an iterative alignment process. Imputed genomes are derived from genotype data: we first construct a haplotype mosaic using MegaMUGA genotypes and then assemble an imputed genome from segments of DNA sequence taken from the inferred founders.
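The assembly step for imputed genomes can be pictured with a small sketch. This is not the database's actual pipeline: it assumes, as a simplification, that all founder sequences for a chromosome share one coordinate system, and the function name `assemble_imputed`, the founder labels, and the mosaic layout are hypothetical.

```python
def assemble_imputed(founders, mosaic):
    """Concatenate founder segments according to a haplotype mosaic.

    founders: dict mapping a founder label to its sequence for one chromosome
              (assumed here to share coordinates, which real genomes with
              indels would not).
    mosaic:   list of (start, end, founder_label) half-open intervals that
              tile the chromosome from left to right without gaps.
    """
    pieces = []
    expected_start = 0
    for start, end, founder in mosaic:
        if start != expected_start:
            raise ValueError(f"mosaic has a gap or overlap at position {start}")
        pieces.append(founders[founder][start:end])
        expected_start = end
    return "".join(pieces)

# Toy example with two hypothetical founders and a single recombination point.
toy_founders = {"A": "AAAAAAAAAA", "B": "CCCCCCCCCC"}
toy_mosaic = [(0, 4, "A"), (4, 10, "B")]
imputed = assemble_imputed(toy_founders, toy_mosaic)
```

Here the imputed sequence takes its first four bases from founder A and the remaining six from founder B.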

Year established: 2014

Contact information:

University/Institution:
University of California, Los Angeles

Address:
Department of Computer Science, University of California, Los Angeles, CA 90095, USA

City:

Province/State:

Country/Region:
United States

Contact name (PI/Team):
Wang W

Contact email (PI/Helpdesk):
weiwang@cs.ucla.edu