Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR522104_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 37110023 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 47 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAATGCCG | 76478 | 0.20608448558493211 | Illumina Paired End PCR Primer 2 (100% over 31bp) |
| AAGCAGTGGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 65195 | 0.1756803007101343 | No Hit |
| GATCGGAAGAGCGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAG | 38371 | 0.10339794184444455 | Illumina Paired End PCR Primer 2 (97% over 36bp) |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TCAACGC | 29725 | 0.0 | 26.863487 | 12 |
| ATCAACG | 30295 | 0.0 | 26.452093 | 11 |
| CAACGCA | 31100 | 0.0 | 25.654596 | 13 |
| AACGCAG | 33165 | 0.0 | 24.12345 | 14 |
| ACGCAGA | 33335 | 0.0 | 23.993835 | 15 |
| GTATCAA | 33430 | 0.0 | 23.977749 | 9 |
| AAGCAGT | 34720 | 0.0 | 23.807404 | 1 |
| GGTATCA | 33780 | 0.0 | 23.742321 | 8 |
| CGCAGAG | 33890 | 0.0 | 23.60767 | 16 |
| GTGGTAT | 34405 | 0.0 | 23.400402 | 6 |
| TGGTATC | 34950 | 0.0 | 22.985239 | 7 |
| AGAGTAC | 35560 | 0.0 | 22.157423 | 19 |
| AGTGGTA | 36700 | 0.0 | 22.05685 | 5 |
| TATCAAC | 36955 | 0.0 | 21.768162 | 10 |
| CAGAGTA | 37420 | 0.0 | 21.189667 | 18 |
| GCAGAGT | 38665 | 0.0 | 20.65217 | 17 |
| AGCAGTG | 40900 | 0.0 | 20.108131 | 2 |
| AGATCGG | 19125 | 0.0 | 19.600014 | 20 |
| ATGCCGA | 20145 | 0.0 | 18.859339 | 14 |
| CGAGATC | 20740 | 0.0 | 18.236164 | 18 |