Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041874.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2337170 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 55 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 3428 | 0.14667311320956541 | No Hit |
| CCCTCAGAGAGGCGAGGGTTCGAGGGCACGAGTTCGAGGCCAA | 3397 | 0.14534672274588498 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2930 | 0.1253652922123765 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 1715 | 0.0 | 30.959183 | 1 |
| TATCGCG | 50 | 2.7032604E-4 | 22.2 | 7 |
| GTATACG | 45 | 0.0038269733 | 20.555557 | 1 |
| GTATCAA | 2590 | 0.0 | 20.285715 | 2 |
| TCGCTAA | 80 | 1.6178352E-5 | 18.5 | 14 |
| GTCTTAC | 140 | 9.458745E-11 | 18.5 | 1 |
| TATACGG | 60 | 9.240711E-4 | 18.5 | 2 |
| CGCTATA | 245 | 0.0 | 17.367346 | 2 |
| TGTATAC | 145 | 2.9849616E-9 | 16.586206 | 3 |
| GTCGCTA | 125 | 1.6597551E-7 | 16.279999 | 37 |
| GTATATA | 125 | 1.6597551E-7 | 16.279999 | 1 |
| GTGTTAG | 455 | 0.0 | 16.263735 | 1 |
| CGCTAAG | 70 | 0.0025935695 | 15.857142 | 1 |
| GTGTAAG | 260 | 0.0 | 15.653846 | 1 |
| TTAGACT | 95 | 7.064667E-5 | 15.578948 | 4 |
| TAGACTG | 180 | 2.0190782E-10 | 15.416667 | 5 |
| TCTGTCG | 670 | 0.0 | 15.186566 | 8 |
| TAGACAC | 195 | 4.1836756E-11 | 15.179486 | 5 |
| TCGCTAT | 305 | 0.0 | 15.163935 | 1 |
| TACACAG | 435 | 0.0 | 14.885058 | 5 |