Basic Statistics
Measure | Value |
---|---|
Filename | ERR840937.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 9679102 |
Sequences flagged as poor quality | 0 |
Sequence length | 24-50 |
%GC | 53 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCC | 28934 | 0.29893269024337177 | No Hit |
CCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGC | 22774 | 0.2352904226032539 | No Hit |
CCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGG | 20512 | 0.21192048601202879 | No Hit |
CCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAG | 11377 | 0.11754189593208131 | No Hit |
AGCCATTGTGGCTCCGGCCGGTTGCGCGGGCCCTCGGACCCTCA | 10734 | 0.11089871767029627 | No Hit |
TTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 10009 | 0.10340835337823695 | No Hit |
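
The Percentage column appears to be each sequence's Count divided by Total Sequences (9679102, from Basic Statistics), times 100. The sketch below recomputes it for the rows above as a sanity check. The function name `percent_of_total` and the hard-coded values are illustrative assumptions, not part of FastQC.

```python
# Minimal sketch: recompute the Percentage column of the
# "Overrepresented sequences" table from Count and Total Sequences.
# Assumption: Percentage = Count / Total Sequences * 100.

TOTAL_SEQUENCES = 9679102  # "Total Sequences" from Basic Statistics

# (count, reported percentage) pairs copied from the table above
ROWS = [
    (28934, 0.29893269024337177),
    (22774, 0.2352904226032539),
    (20512, 0.21192048601202879),
    (11377, 0.11754189593208131),
    (10734, 0.11089871767029627),
    (10009, 0.10340835337823695),
]


def percent_of_total(count: int, total: int = TOTAL_SEQUENCES) -> float:
    """Return count as a percentage of total (hypothetical helper)."""
    return count / total * 100.0


if __name__ == "__main__":
    for count, reported in ROWS:
        recomputed = percent_of_total(count)
        # Compare with a small tolerance rather than exact float equality.
        status = "ok" if abs(recomputed - reported) < 1e-6 else "MISMATCH"
        print(f"{count:>6}  reported={reported:.6f}  recomputed={recomputed:.6f}  {status}")
```

Each listed sequence accounts for just over 0.1% of all reads or more, which is consistent with FastQC's usual reporting threshold for this module.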
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTATCAT | 3295 | 0.0 | 129.96632 | 44 |
TAAGCCA | 4090 | 0.0 | 91.21746 | 44 |
ACGGTAT | 40 | 2.1590658E-6 | 73.315475 | 43 |
CACGATT | 295 | 0.0 | 71.393005 | 44 |
TATGCGC | 85 | 0.0 | 71.36372 | 42 |
TTAAGCC | 3490 | 0.0 | 63.86219 | 43 |
CCTATCA | 4275 | 0.0 | 58.858177 | 43 |
AATCGCA | 445 | 0.0 | 56.342793 | 44 |
TTATACG | 90 | 9.813393E-7 | 55.71676 | 44 |
ATTAAGC | 3865 | 0.0 | 55.126858 | 42 |
ATAGTCG | 110 | 7.2759576E-12 | 54.703728 | 44 |
GTGTTAG | 2295 | 0.0 | 54.435543 | 43 |
AGGCCGA | 11685 | 0.0 | 52.982677 | 42 |
CCCTATC | 4955 | 0.0 | 51.722492 | 42 |
CGAGATT | 905 | 0.0 | 50.97622 | 44 |
TACGCTA | 1545 | 0.0 | 50.794685 | 42 |
TGTGTTA | 2850 | 0.0 | 50.416275 | 42 |
CAACTAG | 300 | 0.0 | 50.14509 | 44 |
ACGACAC | 550 | 0.0 | 49.233356 | 44 |
TCGCAAA | 1095 | 0.0 | 47.6718 | 43 |