Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR523056_2.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 4941950 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 45 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT | 22586 | 0.4570260727040945 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 20499 | 0.41479577899412173 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 15654 | 0.3167575552160584 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 9625 | 0.19476117726808245 | No Hit |
| GTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTAT | 6349 | 0.1284715547506551 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TATAACG | 410 | 0.0 | 38.982754 | 2 |
| TACCTGG | 4205 | 0.0 | 31.860771 | 2 |
| GTACATG | 19585 | 0.0 | 30.127243 | 1 |
| TACATGG | 20200 | 0.0 | 28.973131 | 2 |
| ATAACGC | 620 | 0.0 | 28.813488 | 3 |
| ACATGGG | 20680 | 0.0 | 27.756779 | 3 |
| ATGGGGG | 10840 | 0.0 | 27.23488 | 5 |
| CATGGGG | 15200 | 0.0 | 27.184862 | 4 |
| GTACCTG | 5390 | 0.0 | 25.993618 | 1 |
| TAACGCA | 670 | 0.0 | 25.960243 | 4 |
| TGGGGGG | 11255 | 0.0 | 23.332512 | 6 |
| ACCTGGG | 5705 | 0.0 | 23.32033 | 3 |
| GGGTTAG | 2020 | 0.0 | 22.576654 | 1 |
| GAGTACT | 16300 | 0.0 | 21.011824 | 12-13 |
| GTATAAC | 1265 | 0.0 | 20.069773 | 1 |
| TATCACG | 560 | 0.0 | 19.307108 | 2 |
| AGAGTAC | 25115 | 0.0 | 19.27481 | 10-11 |
| AGTACTT | 16880 | 0.0 | 18.64774 | 12-13 |
| GTACTTT | 18410 | 0.0 | 18.45126 | 14-15 |
| GAGTACA | 14635 | 0.0 | 18.375643 | 1 |