Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1512020_2.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2001292 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 25 |
| %GC | 45 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTT | 3269 | 0.1633444794662648 | No Hit |
| GTCCTACAGTGGACATTTCTAAATT | 3024 | 0.1511023878574441 | No Hit |
| CTGTAGGACGTGGAATATGGCAAGA | 3001 | 0.14995313027784052 | No Hit |
| GTCCTAAAGTGTGTATTTCTCATTT | 2672 | 0.13351375011742414 | No Hit |
| CTTTAGGACGTGAAATATGGCGAGG | 2334 | 0.1166246604693368 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTT | 2104 | 0.10513208467330104 | No Hit |
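
The Percentage column above appears to be each sequence's Count as a share of the Total Sequences value (2001292) from Basic Statistics. The following is a minimal sketch of that arithmetic, not the tool's own code; the helper name `percentage_of_total` and the dictionary layout are hypothetical and chosen only for illustration.

```python
def percentage_of_total(count: int, total_sequences: int) -> float:
    """Return count as a percentage of total_sequences (hypothetical helper)."""
    if total_sequences <= 0:
        raise ValueError("total_sequences must be positive")
    return 100.0 * count / total_sequences


# "Total Sequences" as reported under Basic Statistics.
TOTAL_SEQUENCES = 2001292

# Counts copied from the Overrepresented sequences table.
overrepresented = {
    "GTATCAACGCAGAGTACTTTTTTTT": 3269,
    "GTCCTACAGTGGACATTTCTAAATT": 3024,
    "CTGTAGGACGTGGAATATGGCAAGA": 3001,
    "GTCCTAAAGTGTGTATTTCTCATTT": 2672,
    "CTTTAGGACGTGAAATATGGCGAGG": 2334,
    "GGTATCAACGCAGAGTACTTTTTTT": 2104,
}

# Print each sequence with its recomputed percentage for comparison with the table.
for sequence, count in overrepresented.items():
    pct = percentage_of_total(count, TOTAL_SEQUENCES)
    print(f"{sequence}\t{count}\t{pct:.4f}")
```

Recomputing the column this way is a quick consistency check between the two tables. All six listed sequences sit just above 0.1% of the reads.
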
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TTCGCAG | 35 | 0.0022248263 | 16.225536 | 10 |
| ACGGTCC | 50 | 8.995485E-5 | 15.143836 | 8 |
| CGGTAGG | 65 | 2.421817E-6 | 15.08917 | 1 |
| CGTTAGG | 60 | 3.126301E-4 | 13.077282 | 1 |
| CGCTAGA | 45 | 0.009320455 | 13.077281 | 1 |
| AGGCGAG | 60 | 4.2209658E-4 | 12.620494 | 18 |
| CGAGCAA | 60 | 4.2227452E-4 | 12.6198635 | 10 |
| CGGTCCA | 60 | 4.2227452E-4 | 12.6198635 | 9 |
| GGTATCA | 1040 | 0.0 | 12.542873 | 1 |
| TAGTACC | 120 | 1.058288E-8 | 11.831122 | 4 |
| TAGGACC | 670 | 0.0 | 11.725171 | 4 |
| CGTAGGA | 110 | 3.2888238E-7 | 11.591228 | 2 |
| CCTATAC | 175 | 2.1827873E-11 | 10.817025 | 3 |
| CGTGCGC | 70 | 0.0015385037 | 10.817024 | 10 |
| GTAGGAC | 1990 | 0.0 | 10.416143 | 3 |
| CGAAATC | 110 | 6.3198877E-6 | 10.325343 | 13 |
| CCGTAGG | 105 | 2.8902554E-5 | 10.275006 | 1 |
| TAGAAAT | 650 | 0.0 | 10.192966 | 4 |
| TGTAGGA | 1855 | 0.0 | 10.151636 | 2 |
| GACGTGG | 1035 | 0.0 | 10.15076 | 7 |