Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1545801_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 4268405 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 45 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 29618 | 0.6938891693735716 | No Hit |
| GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATA | 9137 | 0.21406122427464125 | No Hit |
| AAGGAAGGAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAA | 4912 | 0.11507811465875427 | No Hit |
| GATACTGAAGCTACGAATATACTGACTATGAAGACCTATGCTTTGATTCA | 4910 | 0.11503125874887692 | No Hit |
| AAGGAAGGAAGGAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAA | 4723 | 0.11065023117534536 | No Hit |
| AAGGAAGGAAGGAAGGAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAA | 4623 | 0.10830743568147821 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TCTATCG | 20 | 7.8588846E-4 | 44.000004 | 1 |
| TACCGAT | 20 | 7.8588846E-4 | 44.000004 | 10 |
| GTCGAAC | 35 | 1.4473153E-7 | 44.0 | 9 |
| ATTTACG | 160 | 0.0 | 42.625004 | 1 |
| CGACGGT | 490 | 0.0 | 41.7551 | 28 |
| CGTTTTT | 19015 | 0.0 | 41.246384 | 1 |
| TTACGCT | 65 | 0.0 | 40.615383 | 35 |
| TTACGCG | 120 | 0.0 | 40.333332 | 1 |
| TCACGAC | 515 | 0.0 | 40.15534 | 25 |
| CACGACG | 525 | 0.0 | 39.39048 | 26 |
| TACGAAC | 40 | 4.128051E-7 | 38.500004 | 41 |
| ACGATTG | 145 | 0.0 | 37.931038 | 1 |
| TAACGCG | 140 | 0.0 | 37.714287 | 1 |
| TTAGCGG | 935 | 0.0 | 37.64706 | 2 |
| TACGGGA | 1275 | 0.0 | 37.44314 | 4 |
| TAGGGTC | 1865 | 0.0 | 37.158176 | 5 |
| AGGGCGC | 925 | 0.0 | 37.102703 | 6 |
| TTTAGCG | 255 | 0.0 | 37.09804 | 1 |
| CGACAAT | 305 | 0.0 | 36.786884 | 20 |
| TACGCTA | 90 | 0.0 | 36.666668 | 37 |