Basic Statistics
Measure | Value |
---|---|
Filename | ERR1042225.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 7357939 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 47 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 26291 | 0.3573147317475722 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 24221 | 0.32918185377726017 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 14618 | 0.1986697633671603 | No Hit |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 9491 | 0.12898992503199605 | No Hit |
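
The Percentage column above appears to be each sequence's Count as a share of Total Sequences (7357939 in Basic Statistics). A minimal Python sketch to check that reading against the reported values follows. The name `percentage_of_total` is hypothetical and not part of FastQC.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 7357939

overrepresented = {
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": 26291,
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT": 24221,
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT": 14618,
    "ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 9491,
}


def percentage_of_total(count: int, total: int = TOTAL_SEQUENCES) -> float:
    """Return count as a percentage of total (assumed definition of the column)."""
    return count / total * 100


# Print each sequence with its count and recomputed percentage.
for sequence, count in overrepresented.items():
    print(f"{sequence}\t{count}\t{percentage_of_total(count):.4f}")
```

The recomputed values can be compared with the Percentage column. Agreement to within rounding supports the assumed definition.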
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 7465 | 0.0 | 24.782316 | 1 |
GTATCAA | 10790 | 0.0 | 17.025486 | 2 |
TCTATAC | 1245 | 0.0 | 15.008032 | 3 |
GTATTAG | 1640 | 0.0 | 14.100611 | 1 |
TATACTG | 1775 | 0.0 | 13.861971 | 5 |
GACGGAC | 790 | 0.0 | 13.816456 | 7 |
CGAACGA | 390 | 0.0 | 13.756411 | 16 |
CGCGTAA | 175 | 3.57486E-8 | 13.742857 | 10 |
CTAATAC | 1695 | 0.0 | 13.533924 | 3 |
CGCGCTA | 375 | 0.0 | 13.32 | 24 |
CGACGGT | 530 | 0.0 | 13.264152 | 7 |
CTAGTAC | 555 | 0.0 | 13.0 | 3 |
TAGACTA | 840 | 0.0 | 12.77381 | 5 |
ATCAACG | 14435 | 0.0 | 12.739175 | 4 |
TATACCG | 320 | 0.0 | 12.718751 | 5 |
AATCGTC | 525 | 0.0 | 12.685714 | 28 |
ACGGACC | 890 | 0.0 | 12.679774 | 8 |
TCAACGC | 14550 | 0.0 | 12.651202 | 5 |
CAACGCA | 14675 | 0.0 | 12.556047 | 6 |
ATTAGAC | 855 | 0.0 | 12.549707 | 3 |