Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR522972_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 5294603 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 17114 | 0.32323481099527196 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 14535 | 0.27452483217344154 | No Hit |
| GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT | 12939 | 0.24438092903282835 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 7607 | 0.14367460600917575 | No Hit |
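
The Percentage column above appears to be each row's Count expressed as a share of Total Sequences (5294603) from Basic Statistics. The following is a minimal sketch of that arithmetic, not FastQC's own code; the function name `percentage_of_total` and the `row1`..`row4` labels are hypothetical and introduced only for illustration.

```python
# Values copied from the report tables above.
TOTAL_SEQUENCES = 5294603  # "Total Sequences" in Basic Statistics

# Counts from the Overrepresented sequences table, in row order.
# The keys are hypothetical labels, not names used by the report.
overrepresented_counts = {
    "row1": 17114,
    "row2": 14535,
    "row3": 12939,
    "row4": 7607,
}


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of the total number of reads."""
    return 100.0 * count / total


# Recompute the Percentage column from Count and Total Sequences.
percentages = {
    label: percentage_of_total(count)
    for label, count in overrepresented_counts.items()
}

for label, pct in percentages.items():
    print(f"{label}: {pct:.4f}%")

# Sum of the listed percentages (the share covered by these four rows).
combined = sum(percentages.values())
print(f"combined: {combined:.4f}%")
```

Rounding to four decimal places, as in the `print` calls, is usually enough for reading the table; the full-precision values in the report carry no extra meaning beyond the integer counts.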
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TACCTGG | 7360 | 0.0 | 43.813393 | 2 |
| TATAACG | 520 | 0.0 | 38.871002 | 2 |
| GTACCTG | 8540 | 0.0 | 38.83215 | 1 |
| ACCTGGG | 8125 | 0.0 | 38.355007 | 3 |
| ATAACGC | 700 | 0.0 | 28.87369 | 3 |
| TAACGCA | 635 | 0.0 | 28.128187 | 4 |
| CCTGGGG | 7270 | 0.0 | 27.348772 | 4 |
| GTACATG | 19980 | 0.0 | 24.414253 | 1 |
| TACATGG | 20395 | 0.0 | 23.071241 | 2 |
| ACATGGG | 20335 | 0.0 | 22.282543 | 3 |
| TATCACG | 540 | 0.0 | 21.762403 | 2 |
| CATGGGG | 12725 | 0.0 | 21.276323 | 4 |
| ATGGGGG | 7880 | 0.0 | 21.235369 | 5 |
| GAGTACT | 17245 | 0.0 | 20.576065 | 12-13 |
| CTGGGGG | 4705 | 0.0 | 19.980501 | 5 |
| GTATAAC | 1665 | 0.0 | 19.493734 | 1 |
| GTATCAA | 35055 | 0.0 | 19.376593 | 1 |
| TGGGGGG | 8065 | 0.0 | 18.880747 | 6 |
| AGTACTT | 17975 | 0.0 | 18.433117 | 12-13 |
| GGTATCA | 27810 | 0.0 | 18.318405 | 1 |