Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR522942_2.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 6647057 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 22831 | 0.3434753154666795 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 18329 | 0.2757460933462734 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 10910 | 0.16413278839041098 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TACCTGG | 9820 | 0.0 | 49.42616 | 2 |
| ACCTGGG | 10990 | 0.0 | 42.795116 | 3 |
| TATAACG | 665 | 0.0 | 38.190975 | 2 |
| GTACCTG | 14325 | 0.0 | 34.801655 | 1 |
| ATAACGC | 790 | 0.0 | 33.33902 | 3 |
| GGTATCA | 17985 | 0.0 | 33.315598 | 1 |
| GTATCAA | 26710 | 0.0 | 30.9552 | 1 |
| TAACGCA | 880 | 0.0 | 29.928896 | 4 |
| CCTGGGG | 10520 | 0.0 | 29.729752 | 4 |
| TCAACGC | 31155 | 0.0 | 26.221495 | 4 |
| CAACGCA | 31535 | 0.0 | 25.874134 | 5 |
| ATCAACG | 31545 | 0.0 | 25.778423 | 3 |
| TATCAAC | 32040 | 0.0 | 25.556118 | 2 |
| AACGCAG | 32490 | 0.0 | 25.320038 | 6 |
| GTACATG | 26005 | 0.0 | 24.43358 | 1 |
| TACATGG | 26235 | 0.0 | 23.64571 | 2 |
| CATGGGG | 15570 | 0.0 | 23.047363 | 4 |
| ATGGGGG | 9675 | 0.0 | 22.894394 | 5 |
| ACGCAGA | 36075 | 0.0 | 22.77605 | 7 |
| ACATGGG | 26215 | 0.0 | 22.515715 | 3 |