Basic Statistics
Measure | Value |
---|---|
Filename | ERR1042069.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 2863262 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 13722 | 0.47924360397337024 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 12540 | 0.4379620167487292 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 9307 | 0.3250488428931757 | No Hit |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 4705 | 0.1643230692825176 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 3820 | 0.0 | 25.861258 | 1 |
TTAACGG | 70 | 1.2200034E-4 | 18.5 | 35 |
CGCAACG | 65 | 0.0015807123 | 17.076923 | 12 |
CTAGCGG | 185 | 1.8189894E-12 | 17.000002 | 29 |
ACGGACC | 490 | 0.0 | 16.989796 | 8 |
GTATCAA | 5855 | 0.0 | 16.935951 | 2 |
GACCGTA | 90 | 4.4496825E-5 | 16.444445 | 35 |
GACGGAC | 535 | 0.0 | 15.906543 | 7 |
TAACGGC | 105 | 9.352752E-6 | 15.857144 | 36 |
CGCGTAC | 70 | 0.00259376 | 15.857142 | 10 |
CGGGTAA | 265 | 0.0 | 15.358491 | 24 |
TATACTG | 625 | 0.0 | 15.096 | 5 |
TATACCG | 135 | 3.977857E-7 | 15.074075 | 5 |
TAACCCG | 285 | 0.0 | 14.929824 | 28 |
CGTTATT | 290 | 0.0 | 14.672414 | 2 |
TCTAGCG | 190 | 4.5656634E-10 | 14.605264 | 28 |
ACACGCT | 705 | 0.0 | 14.432625 | 9 |
GCGGGTA | 295 | 0.0 | 14.423729 | 23 |
CGAACGA | 245 | 0.0 | 14.346938 | 16 |
CGGACCA | 610 | 0.0 | 14.254099 | 9 |