Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1633292.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 537385 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 45 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 2402 | 0.4469793537221918 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1859 | 0.34593447900481034 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 1162 | 0.21623231016868727 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TGCGAAT | 40 | 1.5979676E-6 | 32.375 | 13 |
| AACTGCG | 60 | 4.3137334E-8 | 27.750002 | 10 |
| GGTATCA | 690 | 0.0 | 27.07971 | 1 |
| ACTGCGA | 60 | 1.3351928E-6 | 24.666668 | 11 |
| GCGAATG | 55 | 1.900215E-5 | 23.545454 | 14 |
| CTGCGAA | 70 | 5.0935832E-6 | 21.142859 | 12 |
| CGCTACA | 70 | 5.0935832E-6 | 21.142859 | 2 |
| GTATCAA | 1600 | 0.0 | 20.696877 | 1 |
| CCGGCAG | 145 | 0.0 | 20.413794 | 16 |
| GCCGCTC | 145 | 0.0 | 20.413794 | 27 |
| TTCGCCG | 145 | 0.0 | 20.413794 | 24 |
| CCGCTCT | 140 | 3.6379788E-12 | 19.82143 | 28 |
| CGGCAGC | 160 | 0.0 | 19.65625 | 17 |
| CGAACTA | 95 | 1.6719605E-7 | 19.473684 | 29 |
| AGCTTCG | 160 | 1.8189894E-12 | 18.5 | 21 |
| CGTTAGA | 50 | 0.0070301257 | 18.5 | 25 |
| GCTTCGC | 170 | 0.0 | 18.5 | 22 |
| CTTTCCG | 50 | 0.0070301257 | 18.5 | 21 |
| GCCGGCA | 165 | 3.6379788E-12 | 17.939394 | 15 |
| CGCTCTC | 155 | 2.0008883E-11 | 17.903225 | 29 |