Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1042192.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 6471193 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 12802 | 0.1978306009417429 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 11700 | 0.18080128347276925 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 7194 | 0.1111696096840258 | No Hit |
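
The Percentage column above appears consistent with each Count divided by the Total Sequences value from Basic Statistics (6471193), times 100. The following is a minimal sketch of that arithmetic, assuming this is how the column was derived. The names `TOTAL_SEQUENCES`, `overrepresented`, and `percentage_of_total` are illustrative and not part of the report or of any tool.

```python
# Total read count, copied from the Basic Statistics table.
TOTAL_SEQUENCES = 6471193

# (sequence, count) pairs copied from the Overrepresented sequences table.
overrepresented = [
    ("TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT", 12802),
    ("GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT", 11700),
    ("GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT", 7194),
]


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed definition)."""
    return 100.0 * count / total


# Recompute the Percentage column from the counts.
percentages = [percentage_of_total(count) for _, count in overrepresented]

for (sequence, count), pct in zip(overrepresented, percentages):
    print(f"{sequence}\t{count}\t{pct:.4f}")
```

Under that assumption, the recomputed values should agree with the table to within floating-point rounding.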
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 5630 | 0.0 | 17.185614 | 1 |
| ACGGACC | 690 | 0.0 | 16.086956 | 8 |
| GACGGAC | 705 | 0.0 | 15.482269 | 7 |
| GTATTAG | 1765 | 0.0 | 15.303116 | 1 |
| TTAACGG | 395 | 0.0 | 14.987342 | 35 |
| TATACTG | 1105 | 0.0 | 14.733032 | 5 |
| GCGTTAG | 115 | 2.212862E-5 | 14.478261 | 1 |
| TATATCG | 500 | 0.0 | 14.059999 | 5 |
| TAACGGC | 410 | 0.0 | 13.987804 | 36 |
| ATAACGC | 855 | 0.0 | 13.847952 | 3 |
| ATACCGT | 605 | 0.0 | 13.76033 | 6 |
| CTAGCGG | 405 | 0.0 | 13.703704 | 29 |
| TCTATAC | 1085 | 0.0 | 13.470046 | 3 |
| TAGACCG | 180 | 5.171205E-8 | 13.361111 | 5 |
| TACACTG | 2180 | 0.0 | 13.323395 | 5 |
| TAACGCC | 570 | 0.0 | 13.307017 | 4 |
| CTACACT | 1905 | 0.0 | 13.3044615 | 4 |
| TACGACG | 445 | 0.0 | 13.303371 | 5 |
| CGGACCA | 870 | 0.0 | 13.183908 | 9 |
| CGTATTA | 535 | 0.0 | 13.140187 | 15 |