Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR522972_2.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 5294603 |
| Sequences flagged as poor quality | 0 |
| Sequence length (bp) | 100 |
| %GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 18241 | 0.3445206373357927 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 14749 | 0.2785666838476841 | No Hit |
| GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT | 13129 | 0.2479694889305204 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 7825 | 0.14779200631284348 | No Hit |
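
The Percentage column above appears to be each sequence's Count divided by Total Sequences (5294603) and multiplied by 100. The sketch below recomputes it for the four listed rows as a sanity check. The formula is an assumption inferred from the numbers in this report, and `percentage_of_total` is a hypothetical helper name, not part of FastQC.

```python
# Recompute the "Percentage" column from "Count" and "Total Sequences".
# Assumption: percentage = count / total_sequences * 100 (inferred from the
# values in this report, not taken from the FastQC source code).

TOTAL_SEQUENCES = 5294603  # "Total Sequences" row in Basic Statistics

# (count, reported percentage) pairs copied from the table above
ROWS = [
    (18241, 0.3445206373357927),
    (14749, 0.2785666838476841),
    (13129, 0.2479694889305204),
    (7825, 0.14779200631284348),
]


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (hypothetical helper)."""
    return count / total * 100


for count, reported in ROWS:
    recomputed = percentage_of_total(count)
    print(f"{count:>6}  reported={reported:.6f}  recomputed={recomputed:.6f}")
```

If the recomputed values agree with the reported ones, the four listed sequences together account for roughly 1% of all reads in this file.
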
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TACCTGG | 6830 | 0.0 | 41.107403 | 2 |
| GTACCTG | 7875 | 0.0 | 36.129932 | 1 |
| ACCTGGG | 7520 | 0.0 | 36.02227 | 3 |
| TATAACG | 580 | 0.0 | 35.67723 | 2 |
| TAACGCA | 745 | 0.0 | 27.7753 | 4 |
| ATAACGC | 750 | 0.0 | 26.963339 | 3 |
| GTACATG | 20370 | 0.0 | 25.0727 | 1 |
| CCTGGGG | 7010 | 0.0 | 24.420002 | 4 |
| TACATGG | 20785 | 0.0 | 24.2103 | 2 |
| ACATGGG | 20930 | 0.0 | 23.323545 | 3 |
| CATGGGG | 12955 | 0.0 | 22.833664 | 4 |
| TATCACG | 510 | 0.0 | 22.131329 | 2 |
| ATGGGGG | 8165 | 0.0 | 21.77157 | 5 |
| GAGTACT | 17130 | 0.0 | 21.195456 | 12-13 |
| GTATCAA | 35120 | 0.0 | 21.157528 | 1 |
| TCAACGC | 38100 | 0.0 | 19.243473 | 4 |
| CAACGCA | 38445 | 0.0 | 19.119356 | 5 |
| ATCAACG | 38665 | 0.0 | 18.98678 | 3 |
| AGTACTT | 17985 | 0.0 | 18.894243 | 12-13 |
| AACGCAG | 39175 | 0.0 | 18.86327 | 6 |