Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041990.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2228693 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 46 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 8755 | 0.3928311346605387 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 7703 | 0.34562858141520614 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 5650 | 0.2535118116313014 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2349 | 0.10539809655255344 | No Hit |
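
The Percentage column above appears to be each sequence's Count divided by Total Sequences (2228693, from Basic Statistics), times 100. A minimal sketch for re-deriving it, assuming that relationship holds; the names `TOTAL_SEQUENCES`, `overrepresented_counts`, and `percentage_of_total` are illustrative and not part of FastQC:

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 2228693

overrepresented_counts = {
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": 8755,
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT": 7703,
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT": 5650,
    "ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 2349,
}


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed definition of the column)."""
    return 100.0 * count / total


# Print one tab-separated row per sequence: sequence, count, recomputed percentage.
for seq, count in overrepresented_counts.items():
    print(f"{seq}\t{count}\t{percentage_of_total(count):.4f}")

# Combined share of reads accounted for by the four listed sequences.
combined = percentage_of_total(sum(overrepresented_counts.values()))
print(f"combined\t{combined:.4f}")
```
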
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 2695 | 0.0 | 25.742115 | 1 |
| GTATCAA | 4185 | 0.0 | 16.532856 | 2 |
| TGCGACG | 80 | 3.3843477E-4 | 16.1875 | 22 |
| GAACCGT | 80 | 3.3843477E-4 | 16.1875 | 6 |
| ACGAACG | 150 | 8.1114194E-8 | 14.8 | 29 |
| TCTTATA | 2565 | 0.0 | 14.497076 | 37 |
| CGAACTA | 485 | 0.0 | 14.494845 | 24 |
| GTACGTA | 230 | 1.8189894E-12 | 14.478261 | 13 |
| TACCGAC | 155 | 1.2116288E-7 | 14.32258 | 7 |
| ATACCGT | 105 | 1.6567322E-4 | 14.095238 | 6 |
| GCGAACT | 505 | 0.0 | 13.920792 | 23 |
| GATCGGA | 80 | 0.0063010673 | 13.875001 | 1 |
| GGTCTAA | 80 | 0.0063010673 | 13.875001 | 1 |
| TTTAGCG | 135 | 6.5731256E-6 | 13.703703 | 26 |
| ATACCGA | 135 | 6.5731256E-6 | 13.703703 | 6 |
| TACCCGC | 205 | 1.4260877E-9 | 13.536586 | 11 |
| CTCTAAT | 555 | 0.0 | 13.333333 | 1 |
| GGCACCG | 640 | 0.0 | 13.296876 | 9 |
| TACCCCG | 450 | 0.0 | 13.155556 | 5 |
| ATAGACC | 85 | 0.009408354 | 13.058824 | 4 |