Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR936102_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2589463 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 125 |
| %GC | 44 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 10861 | 0.419430592366062 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 7697 | 0.29724309634854795 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5564 | 0.21487080525962335 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGTATCA | 1290 | 0.0 | 49.142544 | 100-101 |
| CGGTATC | 1005 | 0.0 | 40.867752 | 100-101 |
| TCGAGTA | 535 | 0.0 | 40.61036 | 100-101 |
| GGTATCA | 9145 | 0.0 | 40.023754 | 1 |
| CGTACAT | 2010 | 0.0 | 39.090893 | 100-101 |
| TACGCAG | 1190 | 0.0 | 36.7653 | 100-101 |
| CGAGTAC | 1290 | 0.0 | 36.22244 | 100-101 |
| ACGTATC | 530 | 0.0 | 33.693268 | 100-101 |
| GCGTATC | 365 | 0.0 | 32.61631 | 100-101 |
| CCGTACA | 875 | 0.0 | 31.973307 | 100-101 |
| GCGTACA | 590 | 0.0 | 31.780174 | 100-101 |
| TCGTATC | 470 | 0.0 | 31.662113 | 100-101 |
| GACGCAG | 1055 | 0.0 | 30.749765 | 100-101 |
| CCGTATC | 655 | 0.0 | 30.443968 | 100-101 |
| GTATCAA | 18130 | 0.0 | 30.266315 | 1 |
| TCGTACA | 1110 | 0.0 | 30.030514 | 100-101 |
| CCGGTAT | 505 | 0.0 | 29.46771 | 100-101 |
| ACCGTAC | 335 | 0.0 | 29.314491 | 98-99 |
| AGTATCA | 3610 | 0.0 | 29.10283 | 100-101 |
| AGGTATC | 2395 | 0.0 | 28.457565 | 100-101 |