Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR936167_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2227252 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 125 |
| %GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 11238 | 0.5045679608773502 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 8325 | 0.3737789886371187 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5940 | 0.26669635945999826 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGTATCA | 1300 | 0.0 | 51.73353 | 100-101 |
| CGTACAT | 1455 | 0.0 | 41.9274 | 100-101 |
| GGTATCA | 8620 | 0.0 | 41.000526 | 1 |
| CGGTATC | 1100 | 0.0 | 38.415165 | 100-101 |
| ACGTATC | 380 | 0.0 | 38.372448 | 100-101 |
| AGTATCA | 2770 | 0.0 | 37.063496 | 100-101 |
| TCGTACA | 805 | 0.0 | 35.85774 | 100-101 |
| CCGGTAT | 395 | 0.0 | 35.40852 | 100-101 |
| GCGTATC | 390 | 0.0 | 33.573383 | 100-101 |
| TCGTATC | 600 | 0.0 | 32.73405 | 100-101 |
| AGGTATC | 2205 | 0.0 | 32.389908 | 100-101 |
| AACGTAC | 175 | 0.0 | 32.30893 | 98-99 |
| GTATCAA | 17170 | 0.0 | 32.19257 | 1 |
| CCGTACA | 630 | 0.0 | 31.647638 | 100-101 |
| TACGCAG | 990 | 0.0 | 30.65999 | 100-101 |
| TCGAGTA | 495 | 0.0 | 30.65999 | 100-101 |
| TATACGT | 295 | 0.0 | 30.260561 | 96-97 |
| ATACACG | 640 | 0.0 | 30.221159 | 96-97 |
| CCTGTCG | 860 | 0.0 | 30.102222 | 96-97 |
| CCGTATC | 645 | 0.0 | 29.98891 | 100-101 |