Basic Statistics
Measure | Value |
---|---|
Filename | ERR840909.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 12401764 |
Sequences flagged as poor quality | 0 |
Sequence length | 24-50 |
%GC | 46 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
TACCTGGTTGATCCTGCCAGTAGCATATGCTTGTCTCAAAGATTAAGCCA | 70606 | 0.5693222351271964 | No Hit |
ATGGCACATGCAGCGCAAGTAGGTCTACAAGACGCTACTTCCCCTATCAT | 28420 | 0.22916094839411555 | No Hit |
CTAAACCTAGCCCCAAACCCACTCCACCTTACTACCAGACAACCTTAGC | 27734 | 0.2236294772259817 | No Hit |
AGCCATTGTGGCTCCGGCCGGTTGCGCGGGCCCTCGGACCCTCAGAGA | 17714 | 0.14283451934740898 | No Hit |
CCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGA | 16117 | 0.1299573189749458 | No Hit |
AGCCATTGTGGCTCCGGCCGGTTGCGCGGGCCCTCGG | 13071 | 0.10539629684938369 | No Hit |
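
The Percentage column above appears to be each Count divided by the Total Sequences value in Basic Statistics (12401764), times 100. The sketch below is a sanity check under that assumption, not FastQC's own code. The names `TOTAL_SEQUENCES`, `ROWS`, and `percentage_of_total` are hypothetical and used only for illustration.

```python
# Assumption: Percentage = 100 * Count / Total Sequences.
TOTAL_SEQUENCES = 12401764  # "Total Sequences" from Basic Statistics

# (count, reported percentage) pairs copied from the table above.
ROWS = [
    (70606, 0.5693222351271964),
    (28420, 0.22916094839411555),
    (27734, 0.2236294772259817),
    (17714, 0.14283451934740898),
    (16117, 0.1299573189749458),
    (13071, 0.10539629684938369),
]


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of the total number of sequences."""
    return 100.0 * count / total


for count, reported in ROWS:
    computed = percentage_of_total(count)
    # Compare the recomputed value with the one printed in the report.
    print(f"{count:>6}  computed={computed:.6f}  reported={reported:.6f}")
```

For display, rounding to about four decimal places (for example 0.5693) keeps the values readable without changing their interpretation.
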
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGAAACG | 75 | 0.0 | 55.63319 | 44 |
TAAGCCA | 10855 | 0.0 | 50.674633 | 44 |
CGGTTAT | 30 | 3.8342732E-6 | 41.418167 | 18 |
TTAAGCC | 10610 | 0.0 | 40.99257 | 43 |
CGCTTAT | 200 | 0.0 | 39.602722 | 19 |
TACGGTA | 90 | 0.0 | 39.346096 | 37 |
GACGCTA | 3720 | 0.0 | 39.333527 | 31 |
CGCTACT | 4095 | 0.0 | 38.366093 | 33 |
CGACTAG | 180 | 0.0 | 37.966656 | 12 |
TTGCGCG | 10545 | 0.0 | 37.858727 | 22 |
GTTGCGC | 11775 | 0.0 | 37.771008 | 21 |
GCGCAAC | 110 | 0.0 | 37.65288 | 12 |
TCGACGA | 110 | 0.0 | 37.65288 | 3 |
TAGCGCA | 40 | 5.016336E-7 | 37.555298 | 24 |
GCCCTCG | 9690 | 0.0 | 37.33081 | 30 |
TCGGACC | 6680 | 0.0 | 37.21476 | 34 |
GCCGGTT | 12270 | 0.0 | 37.029934 | 17 |
ATTAAGC | 10945 | 0.0 | 36.873173 | 42 |
TCGTAGG | 45 | 4.0106897E-8 | 36.81615 | 18 |
TAATCCG | 145 | 0.0 | 36.57337 | 22 |