Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1547166_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 3458779 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 14602 | 0.42217210177348713 | No Hit |
| GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATA | 5720 | 0.16537627873882663 | No Hit |
| GATACTGAAGCTACGAATATACTGACTATGAAGACCTATGCTTTGATTCA | 4275 | 0.12359853000148317 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TCGTTAG | 165 | 0.0 | 44.0 | 1 |
| CGTTTTT | 11560 | 0.0 | 40.99308 | 1 |
| CGTTAGG | 465 | 0.0 | 37.84946 | 2 |
| CGTAAGG | 585 | 0.0 | 37.60684 | 2 |
| TACGAAT | 540 | 0.0 | 37.48148 | 12 |
| GGGCGAT | 7380 | 0.0 | 37.44174 | 7 |
| TAGGGAC | 3070 | 0.0 | 37.335503 | 5 |
| GGCGATA | 1645 | 0.0 | 36.911854 | 8 |
| GACCGAT | 2610 | 0.0 | 36.750957 | 9 |
| CGAATAT | 545 | 0.0 | 36.733944 | 14 |
| CGCATGG | 350 | 0.0 | 36.45714 | 2 |
| ATATGCG | 145 | 0.0 | 36.41379 | 1 |
| CATATGC | 2885 | 0.0 | 36.298096 | 33 |
| TATAGCG | 285 | 0.0 | 36.280704 | 1 |
| GGTACCT | 2925 | 0.0 | 36.027348 | 8 |
| TCACGAC | 410 | 0.0 | 35.951218 | 25 |
| AGGGCGA | 4025 | 0.0 | 35.910557 | 6 |
| CGGTCTA | 415 | 0.0 | 35.518074 | 31 |
| TATACTA | 2275 | 0.0 | 35.49011 | 44 |
| GGACCGA | 2755 | 0.0 | 35.455536 | 8 |