Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR936033_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2632963 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 125 |
| %GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 6320 | 0.2400337566460296 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 4587 | 0.17421437369230028 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 3350 | 0.12723308303230998 | No Hit |
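
The Percentage column appears to be each sequence's Count as a share of Total Sequences from Basic Statistics. The sketch below checks that reading against the three rows above; the name `percentage` is illustrative and is not part of FastQC.

```python
# Total Sequences, as reported under Basic Statistics.
TOTAL_SEQUENCES = 2632963

# (Count, reported Percentage) pairs copied from the table above.
ROWS = [
    (6320, 0.2400337566460296),
    (4587, 0.17421437369230028),
    (3350, 0.12723308303230998),
]


def percentage(count: int, total: int) -> float:
    """Return count as a percentage of total (assumed formula)."""
    return count / total * 100


for count, reported in ROWS:
    computed = percentage(count, TOTAL_SEQUENCES)
    # Compare the recomputed value with the one printed in the report.
    print(f"{count}: computed={computed:.6f} reported={reported:.6f}")
```

All three rows agree with the reported values under this assumption, so the percentages are relative to the full read count, not to a subsample.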
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGTATCA | 1040 | 0.0 | 44.355927 | 100-101 |
| GGTATCA | 6725 | 0.0 | 37.339752 | 1 |
| GCGTATC | 270 | 0.0 | 30.86367 | 100-101 |
| ACGTATC | 350 | 0.0 | 30.611721 | 100-101 |
| GTATCAA | 12780 | 0.0 | 30.543884 | 1 |
| CGTACAT | 1805 | 0.0 | 30.50337 | 100-101 |
| CGGTATC | 905 | 0.0 | 28.939257 | 100-101 |
| CCGTACA | 680 | 0.0 | 28.886059 | 100-101 |
| TACGCAG | 800 | 0.0 | 28.645344 | 100-101 |
| TCGTATC | 495 | 0.0 | 27.657053 | 100-101 |
| GCGTACA | 615 | 0.0 | 27.583733 | 100-101 |
| CGAGTAC | 1110 | 0.0 | 26.543947 | 100-101 |
| TCGTACA | 890 | 0.0 | 26.417418 | 100-101 |
| AGTATCA | 2855 | 0.0 | 25.643795 | 100-101 |
| ACCGTAC | 275 | 0.0 | 24.889929 | 98-99 |
| CGCATTA | 180 | 0.0 | 24.801163 | 100-101 |
| AGGTATC | 2035 | 0.0 | 24.569605 | 100-101 |
| GGGTATC | 1310 | 0.0 | 23.627367 | 100-101 |
| TCGGTAT | 330 | 0.0 | 23.448372 | 100-101 |
| TATCAAC | 16625 | 0.0 | 23.333479 | 2 |
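
The three k-mers that peak at positions 1 and 2 look like prefixes of the overrepresented sequences listed earlier. As a rough cross-check, the sketch below tests which k-mers from the table occur as substrings of those sequences. It is plain string matching on the values in this report, not a reimplementation of FastQC's Obs/Exp statistic.

```python
# Overrepresented sequences, copied verbatim from the report.
OVERREPRESENTED = [
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT",
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT",
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT",
]

# K-mers from the Kmer Content table, in table order.
KMERS = [
    "CGTATCA", "GGTATCA", "GCGTATC", "ACGTATC", "GTATCAA",
    "CGTACAT", "CGGTATC", "CCGTACA", "TACGCAG", "TCGTATC",
    "GCGTACA", "CGAGTAC", "TCGTACA", "AGTATCA", "ACCGTAC",
    "CGCATTA", "AGGTATC", "GGGTATC", "TCGGTAT", "TATCAAC",
]

# Keep the k-mers that appear inside at least one overrepresented sequence.
shared = [k for k in KMERS if any(k in seq for seq in OVERREPRESENTED)]

print(shared)
```

Only the k-mers with a maximum at position 1 or 2 match. The k-mers enriched near positions 98-101 are not substrings of the listed sequences, so they are not explained by this check.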