Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR522969_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2989664 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 8386 | 0.2804997484667173 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 6745 | 0.22561063718197094 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 3991 | 0.13349326211908763 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TACCTGG | 4165 | 0.0 | 45.479866 | 2 |
| GTACCTG | 4880 | 0.0 | 38.945213 | 1 |
| ACCTGGG | 4750 | 0.0 | 38.590317 | 3 |
| TATAACG | 320 | 0.0 | 36.721397 | 2 |
| TAACGCA | 390 | 0.0 | 30.128864 | 4 |
| ATAACGC | 400 | 0.0 | 29.375643 | 3 |
| CCTGGGG | 4270 | 0.0 | 28.728966 | 4 |
| TACCGTA | 220 | 3.6379788E-12 | 27.772408 | 7 |
| GTATCAA | 12055 | 0.0 | 25.794481 | 1 |
| GGTATCA | 8655 | 0.0 | 24.893785 | 1 |
| GTACATG | 10455 | 0.0 | 24.74749 | 1 |
| TACATGG | 10665 | 0.0 | 23.711039 | 2 |
| ACATGGG | 10690 | 0.0 | 22.643152 | 3 |
| TCAACGC | 13505 | 0.0 | 22.586945 | 4 |
| CAACGCA | 13670 | 0.0 | 22.314314 | 5 |
| CATGGGG | 6570 | 0.0 | 22.248585 | 4 |
| AACGCAG | 14165 | 0.0 | 21.83207 | 6 |
| ATCAACG | 13985 | 0.0 | 21.778095 | 3 |
| TATCAAC | 14280 | 0.0 | 21.52676 | 2 |
| ATGGGGG | 3575 | 0.0 | 21.035425 | 5 |