Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR522881_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 19852851 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 47 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 50802 | 0.2558927178771452 | No Hit |
| GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT | 42291 | 0.21302230092796245 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 41505 | 0.20906317183360715 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 27081 | 0.13640861959826325 | No Hit |
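
The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics (19852851), multiplied by 100. The following is a minimal sketch that recomputes it from the numbers in this report; `percentage_of_total` is a hypothetical helper written for illustration and is not part of FastQC.

```python
# Total read count, copied from the "Total Sequences" row of Basic Statistics.
TOTAL_SEQUENCES = 19852851

# Counts copied from the Overrepresented sequences table.
OVERREPRESENTED_COUNTS = {
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 50802,
    "GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT": 42291,
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 41505,
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 27081,
}


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (hypothetical helper)."""
    return 100.0 * count / total


if __name__ == "__main__":
    for sequence, count in OVERREPRESENTED_COUNTS.items():
        # Print a shortened sequence, its count, and the recomputed percentage.
        print(f"{sequence[:20]}...  {count:>6}  {percentage_of_total(count):.6f}")
```

Comparing the printed values with the table is a quick way to check that the report was not truncated or mixed with rows from another file.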
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TACCTGG | 32940 | 0.0 | 50.30662 | 2 |
| GTACCTG | 38155 | 0.0 | 43.89108 | 1 |
| ACCTGGG | 39040 | 0.0 | 41.756466 | 3 |
| CCTGGGG | 36065 | 0.0 | 32.653095 | 4 |
| CTGGGGG | 22615 | 0.0 | 24.1872 | 5 |
| TGGGGGG | 29990 | 0.0 | 23.880186 | 6 |
| GTACATG | 88155 | 0.0 | 22.959372 | 1 |
| CATGGGG | 54420 | 0.0 | 22.658676 | 4 |
| TACATGG | 90515 | 0.0 | 22.11849 | 2 |
| ATGGGGG | 32425 | 0.0 | 22.101364 | 5 |
| ACATGGG | 92610 | 0.0 | 21.002295 | 3 |
| GTACCCG | 4155 | 0.0 | 19.914776 | 1 |
| TATAACG | 1725 | 0.0 | 19.888414 | 2 |
| GAGTACT | 50805 | 0.0 | 18.356377 | 12-13 |
| GTACACG | 4240 | 0.0 | 17.852287 | 1 |
| GTATCAA | 118455 | 0.0 | 17.793 | 1 |
| AGTACTT | 54040 | 0.0 | 16.931318 | 12-13 |
| TAACGCA | 2335 | 0.0 | 16.703999 | 4 |
| TATCACG | 2160 | 0.0 | 16.53584 | 2 |
| TACCCGG | 4960 | 0.0 | 16.391958 | 2 |