Basic Statistics
Measure | Value |
---|---|
Filename | ERR1042638.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 13354595 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 45 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 167783 | 1.256369062483737 | No Hit |
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 159406 | 1.1936415892806933 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 90262 | 0.6758872133524079 | No Hit |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 65273 | 0.48876809817145334 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 43655 | 0.0 | 24.574848 | 1 |
GTATCAA | 63110 | 0.0 | 17.142767 | 2 |
CGAACGA | 1270 | 0.0 | 15.586615 | 16 |
ACGGACC | 1715 | 0.0 | 15.533529 | 8 |
TATACTG | 2550 | 0.0 | 14.872549 | 5 |
CGCGCTA | 2605 | 0.0 | 14.629559 | 24 |
ACACGCT | 2855 | 0.0 | 14.450088 | 9 |
ACGCGCG | 2660 | 0.0 | 14.396616 | 21 |
CGGACCA | 2005 | 0.0 | 13.840399 | 9 |
GCGCGCT | 2770 | 0.0 | 13.758122 | 23 |
CTAACGC | 350 | 0.0 | 13.742857 | 3 |
AAGACGG | 2350 | 0.0 | 13.619149 | 5 |
GACGGAC | 2065 | 0.0 | 13.527844 | 7 |
TCCGATA | 1520 | 0.0 | 13.266448 | 8 |
CGCGCGC | 2945 | 0.0 | 13.191851 | 22 |
ACGCTGA | 3265 | 0.0 | 13.08882 | 11 |
TAACGCC | 1570 | 0.0 | 12.961783 | 4 |
CGATAAC | 1570 | 0.0 | 12.961783 | 10 |
CTAGCGG | 785 | 0.0 | 12.961783 | 29 |
TCTATAC | 1670 | 0.0 | 12.961078 | 3 |