Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1547233_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 1774701 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 46 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 31020 | 1.747900068800322 | No Hit |
| CGGTCGGCGTCCCCCAACTTCTTAGAGGGACAAGTGGCGTTCAGCCACCC | 2695 | 0.15185656626102087 | No Hit |
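
The Percentage column above appears to be each sequence's Count divided by the Total Sequences figure from Basic Statistics (1774701), times 100. The report itself does not state this formula, so treat it as an assumption; the function name `percentage_of_total` is hypothetical and used only for illustration. A minimal sketch that recomputes the two rows:

```python
# Assumption: Percentage = Count / Total Sequences * 100.
# The report does not state this; the values below are copied from its tables.

TOTAL_SEQUENCES = 1774701  # "Total Sequences" in Basic Statistics


def percentage_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total (hypothetical helper)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return count / total * 100


# (Count, reported Percentage) for the two overrepresented sequences.
rows = [
    (31020, 1.747900068800322),
    (2695, 0.15185656626102087),
]

for count, reported in rows:
    recomputed = percentage_of_total(count, TOTAL_SEQUENCES)
    print(f"count={count} recomputed={recomputed:.6f} reported={reported:.6f}")
```

If the recomputed and reported values agree, the percentages are relative to the whole file rather than to a subsample of reads.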
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGTTTTT | 17995 | 0.0 | 42.92415 | 1 |
| ACGGGTA | 135 | 0.0 | 40.74074 | 5 |
| TAATGCG | 50 | 1.3496901E-9 | 39.6 | 1 |
| GTTTTTT | 20590 | 0.0 | 38.134045 | 2 |
| AGGGCGA | 2015 | 0.0 | 37.33995 | 6 |
| CGGTCTA | 155 | 0.0 | 36.903225 | 31 |
| TACGGGA | 550 | 0.0 | 36.4 | 4 |
| CGCCGTT | 170 | 0.0 | 36.23529 | 26 |
| TATACTA | 700 | 0.0 | 36.14286 | 44 |
| GCGATAT | 220 | 0.0 | 36.0 | 9 |
| TGGGCGA | 1290 | 0.0 | 35.813953 | 6 |
| ATAGGGC | 1105 | 0.0 | 35.43891 | 4 |
| GTAGGGA | 1845 | 0.0 | 35.41463 | 4 |
| GGCGATA | 870 | 0.0 | 35.402298 | 8 |
| GGGCGAT | 4215 | 0.0 | 35.33571 | 7 |
| GTACGGG | 430 | 0.0 | 35.302326 | 3 |
| TAGGGAC | 1250 | 0.0 | 35.2 | 5 |
| AGTACGG | 150 | 0.0 | 35.2 | 2 |
| ACGGGAC | 345 | 0.0 | 35.072464 | 5 |
| GTTGATC | 1170 | 0.0 | 34.97436 | 16 |