Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1512021_2.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2213535 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 25 |
| %GC | 45 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTCCTACAGTGGACATTTCTAAATT | 3804 | 0.17185181169486816 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTT | 3596 | 0.16245507751176286 | No Hit |
| CTGTAGGACGTGGAATATGGCAAGA | 3361 | 0.15183857494911984 | No Hit |
| GTCCTAAAGTGTGTATTTCTCATTT | 3000 | 0.1355298199486342 | No Hit |
| CTTTAGGACGTGAAATATGGCGAGG | 2847 | 0.12861779913125385 | No Hit |
| GTCCTACAGTGTGCATTTCTCATTT | 2475 | 0.1118121014576232 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTT | 2250 | 0.10164736496147564 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GCGGTAT | 25 | 0.005024778 | 19.610708 | 1 |
| GACCGCC | 40 | 0.005409545 | 14.198039 | 7 |
| TACACGA | 40 | 0.005409545 | 14.198039 | 5 |
| TCGAACT | 110 | 1.9826984E-10 | 13.7677965 | 19 |
| TAGGACC | 960 | 0.0 | 12.916272 | 4 |
| GGTATCA | 1135 | 0.0 | 12.181101 | 1 |
| CAGTCGA | 55 | 0.0031542643 | 12.046822 | 9 |
| CGGTAGG | 125 | 1.1481461E-8 | 11.766424 | 1 |
| GGCGAGG | 1020 | 0.0 | 11.506908 | 19 |
| TGTAGGA | 2255 | 0.0 | 11.17506 | 2 |
| AGGACCT | 1580 | 0.0 | 11.142765 | 5 |
| GTAGGAC | 2395 | 0.0 | 10.986683 | 3 |
| GTATAGG | 225 | 0.0 | 10.894837 | 1 |
| GTGTAGG | 325 | 0.0 | 10.861315 | 1 |
| GGACCTG | 1450 | 0.0 | 10.836205 | 6 |
| CCTACCG | 70 | 0.0015382773 | 10.817309 | 3 |
| CTGTAGG | 2185 | 0.0 | 10.7253065 | 1 |
| GACGTGG | 900 | 0.0 | 10.622237 | 7 |
| TATGTCG | 90 | 9.877374E-5 | 10.517067 | 16 |
| CGTCTAT | 75 | 0.002036295 | 10.459044 | 1 |