Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041967.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 3307970 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 7639 | 0.23092712449024627 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 3417 | 0.10329597910501001 | No Hit |
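
The Percentage column above appears to be each sequence's Count expressed as a share of Total Sequences from the Basic Statistics table. The following is a minimal sketch of that arithmetic, not FastQC's own code; the helper name `percentage_of_total` is hypothetical, and the counts and total are copied from the tables in this report.

```python
# Total Sequences, copied from the Basic Statistics table.
TOTAL_SEQUENCES = 3307970

# Sequence -> Count, copied from the Overrepresented sequences table.
overrepresented = {
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT": 7639,
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": 3417,
}


def percentage_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total (hypothetical helper)."""
    return 100.0 * count / total


# Recompute the Percentage column from Count and Total Sequences.
percentages = {
    seq: percentage_of_total(count, TOTAL_SEQUENCES)
    for seq, count in overrepresented.items()
}

for seq, pct in percentages.items():
    print(f"{seq}\t{pct:.4f}%")
```

Under this assumption, the recomputed values agree with the reported percentages for both rows, which suggests the two sequences together account for roughly a third of one percent of the reads.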
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CTTATAC | 3870 | 0.0 | 17.639536 | 37 |
| TTTAGCG | 155 | 4.0199666E-10 | 16.709677 | 26 |
| TATACCG | 145 | 2.9849616E-9 | 16.586206 | 5 |
| CTAACGC | 80 | 3.3849684E-4 | 16.1875 | 3 |
| CGCGAAA | 70 | 0.0025938733 | 15.857143 | 15 |
| TCTATAC | 315 | 0.0 | 15.857142 | 3 |
| TATACTG | 415 | 0.0 | 15.156627 | 5 |
| CTCTAAT | 655 | 0.0 | 14.122138 | 1 |
| CTAATAC | 1025 | 0.0 | 14.078049 | 3 |
| TACCCCG | 435 | 0.0 | 14.034483 | 5 |
| TTATGCG | 345 | 0.0 | 13.942029 | 4 |
| TCTTATA | 5855 | 0.0 | 13.934244 | 37 |
| TTGCGAT | 80 | 0.0063019195 | 13.875 | 16 |
| TACGGTC | 120 | 3.3040436E-5 | 13.874999 | 10 |
| ATTAGAC | 255 | 1.8189894E-12 | 13.784313 | 3 |
| CGAGTTC | 445 | 0.0 | 13.719102 | 14 |
| CCGAGTT | 460 | 0.0 | 13.673912 | 13 |
| ATACCTT | 585 | 0.0 | 13.598291 | 6 |
| TCTAATA | 740 | 0.0 | 13.5 | 2 |
| CGAACTA | 460 | 0.0 | 13.271738 | 24 |