Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1042727.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 5206357 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 49 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 24192 | 0.4646627190567224 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 24081 | 0.4625307100531139 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 16872 | 0.32406536854848794 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 11406 | 0.2190783305870112 | No Hit |
| GTACGGGAAGCAGTGGTATCAACGCAGAGTACGGGAAGCAGTG | 8514 | 0.16353085276326612 | No Hit |
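
The Percentage column above appears to be each sequence's count divided by the Total Sequences value from Basic Statistics (5206357), times 100. The following is a minimal sketch that recomputes it from the table. The names `TOTAL_SEQUENCES`, `overrepresented_counts`, and `percentage_of_total` are illustrative and not part of the report.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 5206357

overrepresented_counts = {
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": 24192,
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT": 24081,
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT": 16872,
    "ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 11406,
    "GTACGGGAAGCAGTGGTATCAACGCAGAGTACGGGAAGCAGTG": 8514,
}


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed formula: 100 * count / total)."""
    return 100.0 * count / total


# Recompute the Percentage column for every listed sequence.
percentages = {
    seq: percentage_of_total(count)
    for seq, count in overrepresented_counts.items()
}

# Share of all reads taken up by the five listed sequences together.
combined_percentage = sum(percentages.values())

for seq, pct in percentages.items():
    print(f"{seq}\t{pct:.4f}%")
print(f"combined\t{combined_percentage:.4f}%")
```

Under that assumption the recomputed values agree with the table, and the five sequences together account for roughly 1.6% of all reads.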
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| AATCTCG | 645 | 0.0 | 18.64341 | 36 |
| CGAGACA | 780 | 0.0 | 17.076923 | 23 |
| TTAACGG | 765 | 0.0 | 16.928104 | 35 |
| TAACGGC | 775 | 0.0 | 16.709677 | 36 |
| ATCTCGT | 740 | 0.0 | 16.0 | 37 |
| ACGAGAC | 915 | 0.0 | 15.366121 | 22 |
| CACGAGA | 915 | 0.0 | 14.355192 | 21 |
| TTAGGAC | 845 | 0.0 | 14.23077 | 3 |
| GGACCGT | 390 | 0.0 | 14.23077 | 6 |
| TATACAC | 1075 | 0.0 | 14.111629 | 3 |
| TACACAT | 1220 | 0.0 | 14.102458 | 5 |
| CTAATAC | 1440 | 0.0 | 14.003473 | 3 |
| CCCACGA | 900 | 0.0 | 13.9777775 | 19 |
| GTATTAG | 1530 | 0.0 | 13.784313 | 1 |
| TTTAACG | 980 | 0.0 | 13.780612 | 34 |
| GGGGTTA | 770 | 0.0 | 13.694805 | 6 |
| TAATACT | 1405 | 0.0 | 13.562278 | 4 |
| TCTAATA | 1305 | 0.0 | 13.467433 | 2 |
| GTACTAG | 210 | 2.046363E-9 | 13.214287 | 1 |
| GTTTAGG | 925 | 0.0 | 13.2 | 1 |