Basic Statistics
Measure | Value |
---|---|
Filename | ERR1042257.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 3340671 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 50 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GTACGGGAAGCAGTGGTATCAACGCAGAGTACGGGAAGCAGTG | 4709 | 0.14095970540050187 | No Hit |
GTATTAGAGGCACCGCCTGCCCAGTGACACATGTTTAACGGCC | 4270 | 0.12781863284352157 | No Hit |
GTACGGAAGCAGTGGTATCAACGCAGAGTACGGAAGCAGTGGT | 3848 | 0.11518644008943113 | No Hit |
GGATTACTCCGGTCTGAACTCAGATCACGTAGGACTTTAATCG | 3848 | 0.11518644008943113 | No Hit |
CTCTAATACTGGTGATGCTAGAGGTGATGTTTTTGGTAAACAG | 3341 | 0.10000984832089121 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TATTAGA | 1395 | 0.0 | 16.57706 | 2 |
TATTCCG | 90 | 4.450042E-5 | 16.444445 | 5 |
GTATTAG | 1600 | 0.0 | 15.840624 | 1 |
ATTAGAG | 1310 | 0.0 | 15.393129 | 3 |
TTAACGG | 1210 | 0.0 | 14.983472 | 35 |
TTACGCA | 75 | 0.0041065854 | 14.799999 | 27 |
GAACCGA | 115 | 2.2122289E-5 | 14.47826 | 6 |
TAACGGC | 1255 | 0.0 | 14.298804 | 36 |
TATACAC | 425 | 0.0 | 13.929412 | 3 |
ATCTCGT | 415 | 0.0 | 13.819277 | 37 |
TTAGACT | 255 | 1.8189894E-12 | 13.784313 | 4 |
GTGTAAG | 540 | 0.0 | 13.703704 | 1 |
CACATGT | 1435 | 0.0 | 13.407665 | 28 |
TTTAACG | 1380 | 0.0 | 13.405797 | 34 |
TAATACT | 1180 | 0.0 | 13.326271 | 4 |
GACACAT | 1450 | 0.0 | 13.013793 | 26 |
GTAAGAT | 615 | 0.0 | 12.934959 | 3 |
TATACCG | 130 | 7.005598E-5 | 12.807693 | 5 |
TAGAACA | 405 | 0.0 | 12.790123 | 4 |
ACCGCCT | 1515 | 0.0 | 12.699671 | 12 |