Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR522081_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 32005877 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 44 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| TCTCTGAGCGGGCTGGCAAGGCAGACCGATATCGTATGCCGTCTTCTGCT | 63910 | 0.19968207713852051 | Illumina PCR Primer Index 1 (95% over 24bp) |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 47325 | 0.1478634689497807 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 45116 | 0.14096161151903444 | No Hit |
| TCTCTGAGCGGGCTGGCAAGGCAGACCGATCTCGTATGCCGTCTTCTGCT | 36202 | 0.11311047655404038 | Illumina Paired End PCR Primer 2 (96% over 29bp) |
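
The Percentage column above appears to be each Count divided by Total Sequences from Basic Statistics, times 100. A minimal sketch of that arithmetic follows; the helper name `percent_of_total` is hypothetical and is not part of FastQC.

```python
# Recompute the "Percentage" column of the Overrepresented sequences table
# from the Count column and the Total Sequences value in Basic Statistics.
# Assumption: percentage = 100 * count / total sequences.

TOTAL_SEQUENCES = 32005877  # from the Basic Statistics table

# (sequence, count) pairs copied from the table above
OVERREPRESENTED_COUNTS = [
    ("TCTCTGAGCGGGCTGGCAAGGCAGACCGATATCGTATGCCGTCTTCTGCT", 63910),
    ("TCTCTGAGCGGGCTGGCAAGGCAGACCGATCTCGTATGCCGTCTTCTGCT", 36202),
]


def percent_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total (hypothetical helper)."""
    return 100.0 * count / total


if __name__ == "__main__":
    for sequence, count in OVERREPRESENTED_COUNTS:
        pct = percent_of_total(count, TOTAL_SEQUENCES)
        print(f"{sequence[:12]}...  {count:>6}  {pct:.4f}%")
```

Together the four listed sequences account for roughly 0.6% of all reads.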
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CTTCTGC | 31520 | 0.0 | 43.766163 | 43 |
| TTCTGCT | 31485 | 0.0 | 43.713993 | 44 |
| TCTTCTG | 36445 | 0.0 | 37.76574 | 42 |
| TATGCCG | 40890 | 0.0 | 35.624416 | 35 |
| CCGTCTT | 41270 | 0.0 | 35.2432 | 39 |
| CGTCTTC | 41695 | 0.0 | 35.203007 | 40 |
| GCCGTCT | 41495 | 0.0 | 35.076496 | 38 |
| CGTATGC | 42890 | 0.0 | 33.97034 | 33 |
| TCGTATG | 43470 | 0.0 | 33.92927 | 32 |
| ATGCCGT | 43020 | 0.0 | 33.892418 | 36 |
| TGCCGTC | 43115 | 0.0 | 33.793037 | 37 |
| GACCGAT | 43750 | 0.0 | 33.09156 | 24 |
| AGACCGA | 45500 | 0.0 | 32.61835 | 23 |
| GCGGGCT | 46670 | 0.0 | 32.3102 | 8 |
| ATATCGT | 26470 | 0.0 | 32.272503 | 29 |
| GGTATCA | 54605 | 0.0 | 32.24235 | 1 |
| TGAGCGG | 46975 | 0.0 | 32.03785 | 5 |
| ATCGTAT | 27220 | 0.0 | 31.918694 | 31 |
| TATCGTA | 26835 | 0.0 | 31.835087 | 30 |
| ACCGATA | 26860 | 0.0 | 31.834269 | 25 |
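
The enriched k-mers above line up with the sequences listed under Overrepresented sequences. The sketch below checks this under one assumption: that Max Obs/Exp Position is a 1-based start offset within the 50 bp read. The function `kmer_found_at` is a hypothetical helper, not a FastQC API.

```python
# Check whether each enriched 7-mer occurs in one of the overrepresented
# sequences at the position reported in the Kmer Content table.
# Assumption: "Max Obs/Exp Position" is a 1-based start offset in the read.

OVERREPRESENTED = [
    "TCTCTGAGCGGGCTGGCAAGGCAGACCGATATCGTATGCCGTCTTCTGCT",
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT",
    "TCTCTGAGCGGGCTGGCAAGGCAGACCGATCTCGTATGCCGTCTTCTGCT",
]

# k-mer -> Max Obs/Exp Position, copied from the Kmer Content table
KMER_POSITIONS = {
    "CTTCTGC": 43, "TTCTGCT": 44, "TCTTCTG": 42, "TATGCCG": 35,
    "CCGTCTT": 39, "CGTCTTC": 40, "GCCGTCT": 38, "CGTATGC": 33,
    "TCGTATG": 32, "ATGCCGT": 36, "TGCCGTC": 37, "GACCGAT": 24,
    "AGACCGA": 23, "GCGGGCT": 8, "ATATCGT": 29, "GGTATCA": 1,
    "TGAGCGG": 5, "ATCGTAT": 31, "TATCGTA": 30, "ACCGATA": 25,
}


def kmer_found_at(kmer: str, position: int, sequences: list[str]) -> bool:
    """True if kmer starts at the 1-based position in any of the sequences."""
    start = position - 1
    return any(seq[start:start + len(kmer)] == kmer for seq in sequences)


if __name__ == "__main__":
    for kmer, position in KMER_POSITIONS.items():
        status = "match" if kmer_found_at(kmer, position, OVERREPRESENTED) else "no match"
        print(f"{kmer} @ {position}: {status}")
```

If the offsets agree, the k-mer enrichment is most likely explained by the same primer and poly-T reads flagged in the Overrepresented sequences table, not by an independent signal.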