Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1546765_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 3734833 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 41 |
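
The Encoding row reads Illumina 1.5. By convention that means quality characters are stored with an ASCII offset of 64 (Phred+64), not the Sanger offset of 33. The sketch below shows how such a quality string would be decoded under that assumption. The function names and the example quality string are hypothetical and are not taken from SRR1546765_1.fastq.gz.

```python
# Assumption: Illumina 1.5 quality strings use an ASCII offset of 64 (Phred+64).
PHRED_OFFSET = 64


def decode_quality(quality_line, offset=PHRED_OFFSET):
    """Convert an ASCII quality string into a list of Phred scores."""
    return [ord(ch) - offset for ch in quality_line]


def error_probability(q):
    """Convert a Phred score into the implied base-call error probability."""
    return 10 ** (-q / 10)


# Hypothetical quality string, for illustration only.
example = "hhhgfB"
scores = decode_quality(example)
print(scores)
print([round(error_probability(q), 4) for q in scores])
```

Downstream tools that assume Phred+33 would need to be told about the offset, or the file converted, before the scores are interpreted.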
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 22572 | 0.6043643718474159 | No Hit |
| CGTTTTTATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 13653 | 0.36555851359351277 | No Hit |
| GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATA | 3949 | 0.10573431261852939 | No Hit |
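
The Percentage column appears to be each sequence's count divided by Total Sequences (3734833 in Basic Statistics), multiplied by 100. A minimal sketch that checks this assumption against the three rows above follows. The variable and function names are illustrative and are not part of FastQC.

```python
# Assumption being checked: percentage = count / total sequences * 100
TOTAL_SEQUENCES = 3734833  # "Total Sequences" from Basic Statistics

# (count, reported percentage) copied from the Overrepresented sequences table
rows = [
    (22572, 0.6043643718474159),
    (13653, 0.36555851359351277),
    (3949, 0.10573431261852939),
]


def percentage(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of the total number of sequences."""
    return count / total * 100


for count, reported in rows:
    computed = percentage(count)
    # Compare the recomputed value with the value printed in the report.
    print(f"{count}: computed {computed:.6f}, reported {reported:.6f}")
```

Together the two poly-T sequences (22572 + 13653 = 36225 reads) account for just under 1% of the library by this calculation.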
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGTTTTT | 34360 | 0.0 | 43.436558 | 1 |
| GAGCGAT | 2495 | 0.0 | 42.236473 | 7 |
| CGACGGT | 565 | 0.0 | 40.884956 | 28 |
| AGCGATA | 550 | 0.0 | 40.4 | 8 |
| GTCGCGA | 55 | 7.8216544E-11 | 40.0 | 2 |
| TTTCGCG | 105 | 0.0 | 39.80952 | 1 |
| GTTTTTA | 15510 | 0.0 | 39.64539 | 2 |
| CGCGCGA | 50 | 1.3496901E-9 | 39.6 | 2 |
| TCCGCGA | 45 | 2.3534085E-8 | 39.111107 | 2 |
| TATGCGA | 175 | 0.0 | 38.97143 | 2 |
| ACGGTCT | 595 | 0.0 | 38.82353 | 30 |
| CGCGAGT | 80 | 0.0 | 38.500004 | 4 |
| ACGTCGC | 355 | 0.0 | 38.422535 | 20 |
| TGCGCGA | 115 | 0.0 | 38.26087 | 2 |
| GAGACCG | 790 | 0.0 | 38.1519 | 7 |
| CGGTCTA | 610 | 0.0 | 37.868855 | 31 |
| GCGTTAG | 195 | 0.0 | 37.230766 | 1 |
| CGAGGTA | 195 | 0.0 | 37.230766 | 6 |
| GCGATAC | 225 | 0.0 | 37.155556 | 9 |
| AGCGATC | 400 | 0.0 | 36.85 | 8 |