Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1042516.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 3837874 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 46 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 17346 | 0.4519689807429843 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 14023 | 0.36538458531989326 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 11284 | 0.29401694792481464 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 6762 | 0.17619129757777352 | No Hit |
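
The Percentage column above appears to be each sequence's Count divided by Total Sequences from Basic Statistics, times 100. The sketch below checks that reading against the reported values. It assumes that relationship rather than taking it from the tool's documentation, and the names `TOTAL_SEQUENCES`, `OVERREPRESENTED`, and `percentage` are illustrative only.

```python
# Assumption: Percentage = 100 * Count / Total Sequences.
# All numbers are copied from the tables in this report.
TOTAL_SEQUENCES = 3837874

OVERREPRESENTED = {
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": 17346,
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT": 14023,
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT": 11284,
    "ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 6762,
}


def percentage(count: int, total: int) -> float:
    """Return count as a percentage of total."""
    return 100.0 * count / total


if __name__ == "__main__":
    for sequence, count in OVERREPRESENTED.items():
        print(f"{sequence}\t{count}\t{percentage(count, TOTAL_SEQUENCES):.4f}")
```

If the assumption holds, the printed values should agree with the Percentage column to the displayed precision.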
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 4705 | 0.0 | 25.439959 | 1 |
| GTATCAA | 7175 | 0.0 | 16.656446 | 2 |
| TATACTG | 895 | 0.0 | 16.329609 | 5 |
| TCTAGCG | 265 | 0.0 | 14.6603775 | 28 |
| CCGGTTA | 90 | 8.280233E-4 | 14.388888 | 25 |
| GTCGTAC | 105 | 1.6572243E-4 | 14.095238 | 1 |
| CGAACGA | 225 | 2.7284841E-11 | 13.9777775 | 16 |
| TAATACT | 920 | 0.0 | 13.875 | 4 |
| CGTAGAC | 175 | 3.573041E-8 | 13.742858 | 3 |
| TCTATAC | 555 | 0.0 | 13.666668 | 3 |
| ACGCGCG | 355 | 0.0 | 13.549295 | 21 |
| CTAGCGG | 290 | 0.0 | 13.396551 | 29 |
| TAGACAG | 740 | 0.0 | 13.25 | 5 |
| CGCGCTA | 395 | 0.0 | 13.113924 | 24 |
| GTACTAG | 285 | 0.0 | 12.982456 | 1 |
| CCGTATA | 100 | 0.0018338382 | 12.950001 | 2 |
| ACGGACC | 415 | 0.0 | 12.927711 | 8 |
| GACGGAC | 405 | 0.0 | 12.790123 | 7 |
| TAGTACC | 400 | 0.0 | 12.4875 | 4 |
| TATACAG | 905 | 0.0 | 12.469613 | 5 |