Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1042020.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 3716786 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 21892 | 0.5890035100218307 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 17303 | 0.4655366222322189 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 15689 | 0.4221120075247808 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 6315 | 0.16990485866014346 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGTATCG | 80 | 1.618255E-5 | 18.5 | 10 |
| GTACTAG | 250 | 0.0 | 18.5 | 1 |
| GGTATCA | 7950 | 0.0 | 17.871698 | 1 |
| GCGTTAT | 530 | 0.0 | 15.009434 | 1 |
| TAAACGC | 605 | 0.0 | 14.677686 | 28 |
| GTAAACG | 610 | 0.0 | 14.557377 | 27 |
| GTATATG | 535 | 0.0 | 14.177571 | 1 |
| GTATTAG | 1005 | 0.0 | 13.99005 | 1 |
| ACGGACC | 655 | 0.0 | 13.839695 | 8 |
| ACGCTTC | 630 | 0.0 | 13.801588 | 31 |
| GTGTAGA | 660 | 0.0 | 13.734848 | 1 |
| TTAGCGA | 135 | 6.575954E-6 | 13.703703 | 27 |
| CTAGCGG | 530 | 0.0 | 13.613208 | 29 |
| CGTTATT | 450 | 0.0 | 13.566667 | 2 |
| GACGGAC | 675 | 0.0 | 13.42963 | 7 |
| TCTATAC | 485 | 0.0 | 13.350515 | 3 |
| CGCAATA | 570 | 0.0 | 13.307018 | 36 |
| CGAACGA | 280 | 0.0 | 13.214285 | 16 |
| TATACTG | 660 | 0.0 | 13.174242 | 5 |
| CGCAAGA | 785 | 0.0 | 12.961783 | 2 |