Basic Statistics
Measure | Value |
---|---|
Filename | ERR1042029.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 6235009 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 47 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 30671 | 0.4919158897765825 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 23915 | 0.3835599916535806 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 22354 | 0.35852394118436715 | No Hit |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 9897 | 0.1587327299768132 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 8675 | 0.0 | 25.14294 | 1 |
CTAGCGG | 695 | 0.0 | 16.769785 | 29 |
GTATCAA | 13360 | 0.0 | 16.298279 | 2 |
TCTAGCG | 725 | 0.0 | 16.075863 | 28 |
CGCAAGA | 1175 | 0.0 | 15.114894 | 2 |
TATACTG | 1320 | 0.0 | 14.856062 | 5 |
GACGGAC | 1110 | 0.0 | 14.833333 | 7 |
ACGGACC | 1100 | 0.0 | 14.631818 | 8 |
ATACGAC | 155 | 1.2128294E-7 | 14.32258 | 3 |
AAGACGG | 1280 | 0.0 | 14.308594 | 5 |
CGCAATA | 815 | 0.0 | 14.300613 | 36 |
CGGACCA | 1165 | 0.0 | 14.133047 | 9 |
TTAACGG | 440 | 0.0 | 13.454545 | 35 |
GTATAGG | 1320 | 0.0 | 13.174243 | 1 |
GTGTAAG | 1000 | 0.0 | 13.134999 | 1 |
CGAGCCG | 1015 | 0.0 | 12.9408865 | 15 |
CGACGGT | 660 | 0.0 | 12.89394 | 7 |
GCGCAAG | 1335 | 0.0 | 12.7490635 | 1 |
GTACTAG | 450 | 0.0 | 12.744445 | 1 |
TGCGACG | 160 | 2.7002407E-6 | 12.71875 | 22 |