Basic Statistics
Measure | Value |
---|---|
Filename | ERR1041967.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 3307970 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 7639 | 0.23092712449024627 | No Hit |
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 3417 | 0.10329597910501001 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTTATAC | 3870 | 0.0 | 17.639536 | 37 |
TTTAGCG | 155 | 4.0199666E-10 | 16.709677 | 26 |
TATACCG | 145 | 2.9849616E-9 | 16.586206 | 5 |
CTAACGC | 80 | 3.3849684E-4 | 16.1875 | 3 |
CGCGAAA | 70 | 0.0025938733 | 15.857143 | 15 |
TCTATAC | 315 | 0.0 | 15.857142 | 3 |
TATACTG | 415 | 0.0 | 15.156627 | 5 |
CTCTAAT | 655 | 0.0 | 14.122138 | 1 |
CTAATAC | 1025 | 0.0 | 14.078049 | 3 |
TACCCCG | 435 | 0.0 | 14.034483 | 5 |
TTATGCG | 345 | 0.0 | 13.942029 | 4 |
TCTTATA | 5855 | 0.0 | 13.934244 | 37 |
TTGCGAT | 80 | 0.0063019195 | 13.875 | 16 |
TACGGTC | 120 | 3.3040436E-5 | 13.874999 | 10 |
ATTAGAC | 255 | 1.8189894E-12 | 13.784313 | 3 |
CGAGTTC | 445 | 0.0 | 13.719102 | 14 |
CCGAGTT | 460 | 0.0 | 13.673912 | 13 |
ATACCTT | 585 | 0.0 | 13.598291 | 6 |
TCTAATA | 740 | 0.0 | 13.5 | 2 |
CGAACTA | 460 | 0.0 | 13.271738 | 24 |