Annotation Statistics
General
Nucleotides (without 'N'): 2244433056
GC %: 37.62
Total number of genes: 97436 (187846726 bp)
Protein coding genes
Number: 90935
Mean gene length: 2048.13 bp
Coding nucleotides: 68991443 bp
% genes with introns: 77
% genes with 5' UTR: 41
% genes with 3' UTR: 41
Comparison with At (TAIR10)
Full Length Best Hits (spanning 80% of the length of the At protein): 13568 Ha predicted peptides => 13906 TAIR10 peptides
Full Length Best Hits (spanning 60% of the length of the At protein): 15852 Ha predicted peptides => 16127 TAIR10 peptides
mRNAs
No EST support: 41655 (45.85%)
EST support >= 80% of the mRNA length: 39050 (42.98%)
mRNA with a short CDS (<100aa)
Number: 23267
No EST support: 11927 (51.26%)
EST support >= 80% of the mRNA length: 8904 (38.27%)
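The percentages given for mRNAs with a short CDS can be reproduced from the counts. The sketch below assumes that both percentages are relative to the 23267 short-CDS mRNAs, which the table does not state explicitly; the constant and function names are illustrative only.

```python
# Counts copied from the "mRNA with a short CDS" entries above.
SHORT_CDS_TOTAL = 23267
NO_EST_SUPPORT = 11927
EST_SUPPORT_80 = 8904


def percent(part: int, total: int) -> float:
    """Return part as a percentage of total, rounded to two decimals."""
    return round(100 * part / total, 2)


no_support_pct = percent(NO_EST_SUPPORT, SHORT_CDS_TOTAL)
supported_pct = percent(EST_SUPPORT_80, SHORT_CDS_TOTAL)

# Assumption: the mRNAs in neither category have some EST support,
# but over less than 80 percent of their length.
remainder = SHORT_CDS_TOTAL - NO_EST_SUPPORT - EST_SUPPORT_80

print(no_support_pct, supported_pct, remainder)
```

Under that assumption the two computed values match the reported 51.26% and 38.27%, and 2436 short-CDS mRNAs fall into neither category.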
Exons
Mean number per gene: 3.75
Mean length (bp): 244.61
GC %: 42.16
Introns
Mean number per gene: 2.75
Mean length (bp): 411.47
GC %: 33.20
CDS
Mean length (bp): 758.69
Min length (bp): 123.00
Max length (bp): 14343.00
GC %: 43.20
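Two of the figures above can be cross-checked against each other. The sketch below assumes that the mean CDS length is the total of coding nucleotides divided by the number of protein coding genes, and that each gene is represented by a single gene model, so that a gene with n exons has n - 1 introns. Neither assumption is stated in the table; the variable names are illustrative only.

```python
# Values copied from the statistics above.
CODING_NUCLEOTIDES = 68991443   # bp, protein coding genes
PROTEIN_CODING_GENES = 90935
MEAN_EXONS_PER_GENE = 3.75

# Assumed definition: mean CDS length = coding nucleotides / gene count.
mean_cds_length = round(CODING_NUCLEOTIDES / PROTEIN_CODING_GENES, 2)

# With one gene model per gene, the mean intron count is the mean
# exon count minus one.
mean_introns_per_gene = MEAN_EXONS_PER_GENE - 1

print(mean_cds_length, mean_introns_per_gene)
```

Both results agree with the reported values (758.69 bp and 2.75 introns per gene), which suggests, but does not prove, that the statistics were derived this way.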
UTRs
5' UTR mean length (bp): 164.29
5' UTR GC %: 40.41
3' UTR mean length (bp): 223.45
3' UTR GC %: 34.83
Annotation Pipeline
Annotation pipeline: J. Gouzy