Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041598.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 4034894 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 13533 | 0.33539914555376177 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 10888 | 0.2698459984326726 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 9499 | 0.2354213022696507 | No Hit |
| CCCACGCCCTCTGCATCTTTTTTCTTTTCCCCCACGGATCTTT | 5221 | 0.12939621214336733 | No Hit |
| GTGCAGAGGGCGTGGGGGAAAAGAAAAAAGATCCGTGGGGGAA | 4527 | 0.11219625596112315 | No Hit |
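
The Percentage column above appears to be each sequence's Count as a share of Total Sequences (4034894, from Basic Statistics). A minimal Python sketch, assuming that relationship; the values are copied from the tables above, and the names `TOTAL_SEQUENCES`, `overrepresented`, and `percent_of_total` are illustrative, not part of the report:

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
# All names here are illustrative; they are not produced by the report itself.
TOTAL_SEQUENCES = 4034894

# sequence -> (count, percentage as reported)
overrepresented = {
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT": (13533, 0.33539914555376177),
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": (10888, 0.2698459984326726),
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT": (9499, 0.2354213022696507),
    "CCCACGCCCTCTGCATCTTTTTTCTTTTCCCCCACGGATCTTT": (5221, 0.12939621214336733),
    "GTGCAGAGGGCGTGGGGGAAAAGAAAAAAGATCCGTGGGGGAA": (4527, 0.11219625596112315),
}


def percent_of_total(count, total=TOTAL_SEQUENCES):
    """Assumed relationship: percentage = 100 * count / total."""
    return 100.0 * count / total


if __name__ == "__main__":
    for seq, (count, reported) in overrepresented.items():
        computed = percent_of_total(count)
        # Compare the recomputed value with the one listed in the table.
        print(f"{seq}  computed={computed:.6f}  reported={reported:.6f}")
```

Under this assumption the recomputed values agree with the reported ones to well within rounding, which also suggests the long decimal tails in the table are plain floating-point output rather than meaningful precision.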
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 7095 | 0.0 | 24.458069 | 1 |
| TATACAC | 1510 | 0.0 | 18.132452 | 37 |
| TATAGTC | 715 | 0.0 | 16.3007 | 5 |
| CGCGAAT | 80 | 3.3851998E-4 | 16.1875 | 35 |
| TACCGAC | 190 | 2.7284841E-11 | 15.578948 | 7 |
| GTATCAA | 11290 | 0.0 | 15.337467 | 2 |
| TCTTATA | 6725 | 0.0 | 15.240149 | 37 |
| TAGGTCG | 225 | 1.8189894E-12 | 14.8 | 21 |
| TCTATAC | 415 | 0.0 | 14.710843 | 3 |
| TATTAGA | 1265 | 0.0 | 14.478261 | 2 |
| TATACCG | 270 | 0.0 | 14.388889 | 5 |
| GTACCGT | 90 | 8.280365E-4 | 14.388888 | 6 |
| TTATAGT | 825 | 0.0 | 14.351516 | 4 |
| CTTATAC | 4265 | 0.0 | 14.314185 | 37 |
| ATTAGAG | 1145 | 0.0 | 14.21834 | 3 |
| GTATTAG | 1620 | 0.0 | 14.160494 | 1 |
| TGCGACG | 185 | 4.9094524E-9 | 14.0 | 22 |
| GCGCGAA | 80 | 0.0063022375 | 13.875 | 11 |
| TTAGAGG | 1580 | 0.0 | 13.816456 | 4 |
| TAACCGG | 295 | 0.0 | 13.79661 | 22 |
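
Some of the 7-mers above also occur inside the overrepresented sequences listed earlier. The sketch below is a plain substring search, not the enrichment calculation behind the Obs/Exp columns; it only reports the 1-based positions at which a k-mer starts within a sequence. The helper name `kmer_positions` and the choice of k-mers to check are illustrative assumptions:

```python
# Sequences copied from the Overrepresented sequences table.
overrepresented = [
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT",
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT",
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT",
    "CCCACGCCCTCTGCATCTTTTTTCTTTTCCCCCACGGATCTTT",
    "GTGCAGAGGGCGTGGGGGAAAAGAAAAAAGATCCGTGGGGGAA",
]

# A few k-mers copied from the Kmer Content table (illustrative selection).
kmers = ["GGTATCA", "GTATCAA", "TATACAC", "TCTTATA", "CTTATAC"]


def kmer_positions(kmer, sequence):
    """Return the 1-based start positions of kmer in sequence (overlaps allowed)."""
    positions = []
    start = sequence.find(kmer)
    while start != -1:
        positions.append(start + 1)  # convert 0-based index to 1-based position
        start = sequence.find(kmer, start + 1)
    return positions


if __name__ == "__main__":
    for kmer in kmers:
        for seq in overrepresented:
            hits = kmer_positions(kmer, seq)
            if hits:
                print(f"{kmer} found in {seq} at position(s) {hits}")
```

A match here only shows that a k-mer is contained in a listed sequence; whether that sequence accounts for the k-mer's positional enrichment would need to be checked against the reads themselves.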