Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041853.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 1008827 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 55 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 1960 | 0.1942850458998421 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1897 | 0.18804016942449003 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 1309 | 0.1297546556545374 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 1010 | 0.0 | 32.78713 | 1 |
| TAACCGG | 25 | 0.005496029 | 29.6 | 22 |
| TAGTATC | 25 | 0.005496029 | 29.6 | 4 |
| TTATACG | 45 | 4.0076193E-6 | 28.777777 | 35 |
| CGTATGG | 35 | 8.8686397E-4 | 26.428572 | 2 |
| TATATAC | 50 | 2.7017848E-4 | 22.2 | 3 |
| TCTAAGC | 75 | 3.739824E-7 | 22.2 | 3 |
| GTATCAA | 1520 | 0.0 | 21.786184 | 2 |
| CTATACA | 70 | 5.100901E-6 | 21.142859 | 4 |
| GTTTAGG | 105 | 9.822543E-10 | 21.142857 | 1 |
| TTTAGCG | 45 | 0.003825489 | 20.555555 | 26 |
| GCTATAC | 50 | 0.0070343 | 18.5 | 3 |
| CGCACTA | 50 | 0.0070343 | 18.5 | 12 |
| TATACTT | 50 | 0.0070343 | 18.5 | 5 |
| TATACGA | 70 | 1.2190096E-4 | 18.5 | 36 |
| ATACGAA | 85 | 2.7225808E-5 | 17.411764 | 37 |
| TCTATAC | 75 | 2.0669508E-4 | 17.266666 | 3 |
| TTAATAT | 65 | 0.0015797535 | 17.076921 | 3 |
| CTATACT | 110 | 7.8052653E-7 | 16.818182 | 4 |
| CTACTAG | 100 | 5.876356E-6 | 16.650002 | 1 |