Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1042232.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 759588 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 45 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 4844 | 0.6377141292384819 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 4667 | 0.614412023360032 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2095 | 0.2758074113861725 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 1445 | 0.19023470618282542 | No Hit |
| GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTT | 949 | 0.12493614959688673 | No Hit |
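
The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics (759588), times 100. The following is a minimal sketch of that arithmetic, not the reporting tool's own code. The names `TOTAL_SEQUENCES`, `overrepresented_counts`, and `percentage_of_total` are illustrative, and the counts are copied by hand from the table.

```python
# Total Sequences, copied from the Basic Statistics table.
TOTAL_SEQUENCES = 759588

# Sequence -> Count, copied from the Overrepresented sequences table.
overrepresented_counts = {
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT": 4844,
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": 4667,
    "ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 2095,
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT": 1445,
    "GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTT": 949,
}


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed formula: 100 * count / total)."""
    return 100.0 * count / total


if __name__ == "__main__":
    # Print each sequence with its count and recomputed percentage.
    for sequence, count in overrepresented_counts.items():
        print(f"{sequence}\t{count}\t{percentage_of_total(count):.4f}")
```

Comparing the recomputed values with the Percentage column is a quick way to confirm that the two tables come from the same run.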
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TAGGTCG | 35 | 8.8666135E-4 | 26.428572 | 5 |
| TATACCG | 35 | 8.8666135E-4 | 26.428572 | 5 |
| GGTATCA | 1320 | 0.0 | 25.787878 | 1 |
| ACACCGT | 45 | 1.3226346E-4 | 24.666668 | 6 |
| TAAACGG | 40 | 0.0019305871 | 23.125002 | 30 |
| GCTAGAC | 50 | 2.7009335E-4 | 22.199999 | 3 |
| GACGGAC | 185 | 0.0 | 21.000002 | 7 |
| GAGTACG | 45 | 0.0038246305 | 20.555557 | 1 |
| CGTAGAA | 45 | 0.0038246305 | 20.555557 | 2 |
| CGTACAC | 55 | 5.1408855E-4 | 20.181818 | 3 |
| GTATCAA | 1720 | 0.0 | 19.898256 | 2 |
| ACGGACC | 200 | 0.0 | 19.425 | 8 |
| AGACGGA | 205 | 0.0 | 18.951218 | 6 |
| CGGACCA | 205 | 0.0 | 18.951218 | 9 |
| AAGACGG | 215 | 0.0 | 18.930233 | 5 |
| TACACCG | 90 | 2.1502601E-6 | 18.5 | 5 |
| GGCGTTA | 50 | 0.00703274 | 18.499998 | 31 |
| CGCAAGA | 255 | 0.0 | 18.137257 | 2 |
| GCGCAAG | 235 | 0.0 | 18.106384 | 1 |
| GCAAGAC | 365 | 0.0 | 17.232878 | 3 |