Pelagophyceae sp. CCMP2097 v1.0


(May 2016) The Pelagophyceae sp. CCMP2097 genome was sequenced with Illumina technology, assembled with AllPathsLG, and annotated with the JGI AnnotationPipeline. The transcriptome was sequenced with Illumina technology and assemblied with Rnnotator. The mitochondrial and plastid genomes were sequenced with Illumina technology and assembled with ARACHNE.

Summary statistics for the Pelagophyceae sp. CCMP2097 v1.0 release are below.
Genome Assembly
Genome Assembly size (Mbp) 85.82
Sequencing read coverage depth 145.8x
# of contigs 8717
# of scaffolds 1716
# of scaffolds >= 2Kbp 1472
Scaffold N50 99
Scaffold L50 (Mbp) 0.19
# of gaps 7001
% of scaffold length in gaps 16.2%
Three largest Scaffolds (Mbp) 1.44, 1.36, 1.32

ESTs Data set # sequences total # mapped to genome % mapped to genome
Ests est.fasta 242967535 223020814 91.8%
Other JGI_RNA_contigs 50498 43799 86.7%

Gene Models FilteredModels1
length (bp) of: average median
gene 1758 1410
transcript 1422 1142
exon 370 194
intron 120 79
protein length (aa) 420 320
exons per gene 3.85 3
# of gene models 19402


The work conducted by the U.S. Department of Energy Joint Genome Institute, a DOE Office of Science User Facility, is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231.