Basic Statistics
Measure | Value |
---|---|
Filename | ERR1042073.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 3995698 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 46 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 36704 | 0.9185879413309015 | No Hit |
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 29852 | 0.7471035098248165 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 18333 | 0.4588184592529266 | No Hit |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 8792 | 0.22003664941644738 | No Hit |
GTGTAGCGCGCGTGCAGCCCCGGACATCTAAGGGCATCACAGA | 4780 | 0.11962866062450166 | No Hit |
GGGTAGGCACACGCTGAGCCAGTCAGTGTAGCGCGCGTGCAGC | 4257 | 0.10653958332186267 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TGCGACG | 180 | 0.0 | 21.583332 | 22 |
CAATGCG | 200 | 0.0 | 20.35 | 19 |
CGTTCGG | 145 | 7.2759576E-12 | 19.13793 | 24 |
TGCGTAC | 60 | 9.2422764E-4 | 18.5 | 16 |
GTACTAG | 315 | 0.0 | 18.20635 | 1 |
TAGGTCG | 225 | 0.0 | 18.088888 | 21 |
CGACGAG | 205 | 0.0 | 18.048782 | 24 |
ACGGACC | 290 | 0.0 | 17.862068 | 8 |
TATACTG | 530 | 0.0 | 17.801888 | 5 |
TAACCGG | 220 | 0.0 | 17.65909 | 22 |
CGCGATA | 140 | 1.873559E-9 | 17.178572 | 14 |
TCTATAC | 620 | 0.0 | 17.008064 | 3 |
TTGCGAT | 120 | 1.0430813E-7 | 16.958332 | 16 |
CTAGTAC | 350 | 0.0 | 16.914286 | 3 |
CGGTAAG | 165 | 5.4569682E-11 | 16.818182 | 28 |
ATGTACG | 310 | 0.0 | 16.709679 | 11 |
GTCGTCA | 255 | 0.0 | 16.686274 | 24 |
CGACTAT | 180 | 1.0913936E-11 | 16.444445 | 36 |
GCCGAGT | 305 | 0.0 | 16.37705 | 12 |
CGCTACG | 80 | 3.3851888E-4 | 16.1875 | 16 |