Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1512007_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 1650461 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 25 |
| %GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTT | 3244 | 0.19655114540725288 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTT | 1982 | 0.12008766035671245 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTT | 1956 | 0.1185123429151007 | No Hit |
| GTCCTACAGTGGACATTTCTAAATT | 1947 | 0.11796704072377354 | No Hit |
| GTCCTAAAGTGTGTATTTCTCATTT | 1853 | 0.1122716622810233 | No Hit |
| CTGTAGGACGTGGAATATGGCAAGA | 1794 | 0.10869690347121198 | No Hit |
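
The Percentage column above appears to be each sequence's count divided by the Total Sequences value from Basic Statistics, times 100. The following is a minimal sketch of that arithmetic, not FastQC's own code; the names `TOTAL_SEQUENCES`, `overrepresented`, and `percentage` are hypothetical and introduced only for illustration.

```python
# Total Sequences, copied from the Basic Statistics table.
TOTAL_SEQUENCES = 1650461

# Sequence -> Count, copied from the Overrepresented sequences table.
overrepresented = {
    "GTATCAACGCAGAGTACTTTTTTTT": 3244,
    "GGTATCAACGCAGAGTACTTTTTTT": 1982,
    "TATCAACGCAGAGTACTTTTTTTTT": 1956,
    "GTCCTACAGTGGACATTTCTAAATT": 1947,
    "GTCCTAAAGTGTGTATTTCTCATTT": 1853,
    "CTGTAGGACGTGGAATATGGCAAGA": 1794,
}


def percentage(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed formula)."""
    return 100.0 * count / total


if __name__ == "__main__":
    for seq, count in overrepresented.items():
        # Rounded to four decimals for readability.
        print(f"{seq}\t{count}\t{percentage(count):.4f}")
```

Under this assumption, the recomputed values agree with the reported Percentage column to well within rounding, and every listed sequence sits just above 0.1% of all reads.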
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| AGCGTAC | 35 | 0.0021027469 | 16.362953 | 1 |
| TAGGACC | 415 | 0.0 | 13.51124 | 4 |
| GGTATCA | 815 | 0.0 | 11.828843 | 1 |
| GGCGAGG | 450 | 0.0 | 11.396455 | 19 |
| TGGCGAG | 950 | 0.0 | 10.8946295 | 18 |
| GGACCGT | 70 | 0.0014919633 | 10.856727 | 6 |
| GTATCAA | 1655 | 0.0 | 10.727374 | 1 |
| CCTAGAC | 160 | 5.4023985E-10 | 10.6916275 | 3 |
| ACCGTGC | 80 | 3.7881514E-4 | 10.682235 | 8 |
| GTCCTAC | 1135 | 0.0 | 10.512176 | 1 |
| AGGACCT | 830 | 0.0 | 10.190681 | 5 |
| GGACCTG | 730 | 0.0 | 10.150296 | 6 |
| TGTAGGA | 1210 | 0.0 | 10.13201 | 2 |
| GGCGAGA | 555 | 0.0 | 9.924841 | 19 |
| GTATTAG | 145 | 1.5087426E-7 | 9.874196 | 1 |
| GTATAAC | 155 | 3.7747668E-8 | 9.852961 | 1 |
| CCAGGAC | 455 | 0.0 | 9.816977 | 3 |
| CCTACAG | 1200 | 0.0 | 9.741261 | 3 |
| TATGGCG | 1075 | 0.0 | 9.716141 | 16 |
| TAAGACA | 225 | 1.8189894E-12 | 9.714862 | 4 |
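
Several of the enriched k-mers above are substrings of the overrepresented sequences, and one way to cross-check the two tables is to locate each k-mer inside those sequences and compare the 1-based offset with the Max Obs/Exp Position column. This is a rough sketch under that assumption; `kmer_hits` and the selection of k-mers are hypothetical, and the reported position reflects enrichment across all reads, so agreement here is suggestive rather than conclusive.

```python
# Sequences copied from the Overrepresented sequences table.
overrepresented = [
    "GTATCAACGCAGAGTACTTTTTTTT",
    "GGTATCAACGCAGAGTACTTTTTTT",
    "TATCAACGCAGAGTACTTTTTTTTT",
    "GTCCTACAGTGGACATTTCTAAATT",
    "GTCCTAAAGTGTGTATTTCTCATTT",
    "CTGTAGGACGTGGAATATGGCAAGA",
]

# A subset of k-mers from the Kmer Content table, mapped to the
# reported Max Obs/Exp Position.
reported_position = {
    "GGTATCA": 1,
    "GTATCAA": 1,
    "GTCCTAC": 1,
    "TGTAGGA": 2,
    "CCTACAG": 3,
}


def kmer_hits(kmer, sequences):
    """Return (sequence, 1-based start) for every occurrence of kmer."""
    hits = []
    for seq in sequences:
        start = seq.find(kmer)
        while start != -1:
            hits.append((seq, start + 1))
            start = seq.find(kmer, start + 1)
    return hits


if __name__ == "__main__":
    for kmer, position in reported_position.items():
        found = sorted({pos for _, pos in kmer_hits(kmer, overrepresented)})
        print(f"{kmer}: reported {position}, found at {found}")
```

For the k-mers selected here, the reported position is among the offsets at which the k-mer occurs in at least one overrepresented sequence, which is consistent with those sequences driving the enrichment at the start of the reads.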