Basic Statistics
Measure | Value |
---|---|
Filename | ERR1042011.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 10616279 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 47 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 59917 | 0.5643879555162407 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 38621 | 0.36379036383651936 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 37092 | 0.34938795410331625 | No Hit |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 17069 | 0.1607813811223311 | No Hit |
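
The Percentage column appears to be each sequence's count expressed as a share of Total Sequences (10616279, from Basic Statistics). A minimal sketch that recomputes it, assuming that formula; the helper name `percentage_of_total` is hypothetical and not part of FastQC:

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 10616279
overrepresented_counts = [59917, 38621, 37092, 17069]


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of all reads (assumed formula: 100 * count / total)."""
    return 100.0 * count / total


# Print each count next to its recomputed percentage for comparison with the table.
for count in overrepresented_counts:
    print(count, percentage_of_total(count))
```

The recomputed values should agree with the table to within floating-point rounding.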
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 19055 | 0.0 | 21.019417 | 1 |
ACGGACC | 2035 | 0.0 | 15.909091 | 8 |
TCTAGCG | 1240 | 0.0 | 15.814515 | 28 |
GACGGAC | 2015 | 0.0 | 15.60794 | 7 |
CTAGCGG | 1345 | 0.0 | 14.855019 | 29 |
GTATCAA | 27435 | 0.0 | 14.599051 | 2 |
AAGACGG | 2300 | 0.0 | 14.317391 | 5 |
CGCAATA | 1415 | 0.0 | 14.250884 | 36 |
CGGACCA | 2365 | 0.0 | 13.845666 | 9 |
TACGACG | 1440 | 0.0 | 13.618056 | 5 |
GCGCAAG | 2590 | 0.0 | 13.571427 | 1 |
CGCAAGA | 2435 | 0.0 | 13.295689 | 2 |
CAAGACG | 2815 | 0.0 | 12.880995 | 4 |
TAGTACG | 380 | 0.0 | 12.657895 | 4 |
CTAGACA | 1290 | 0.0 | 12.620155 | 4 |
TAGACAG | 1675 | 0.0 | 12.480597 | 5 |
CGACGGT | 1605 | 0.0 | 12.448599 | 7 |
AGACGGA | 2560 | 0.0 | 12.429688 | 6 |
GCGTTAT | 1525 | 0.0 | 12.373771 | 1 |
GCGAAAG | 2530 | 0.0 | 12.357708 | 18 |
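
The two k-mers with the highest counts, GGTATCA (position 1) and GTATCAA (position 2), both occur in the most frequent overrepresented sequence. The sketch below checks where they fall in that sequence, assuming the Max Obs/Exp Position column is 1-based; the poly-T tail is omitted from the string, and the helper name `one_based_position` is hypothetical. A match is consistent with these enrichments coming from the same read prefix, but does not prove it.

```python
# Leading part of the top overrepresented sequence; its poly-T tail is omitted here.
top_prefix = "GGTATCAACGCAGAGTAC"

# k-mer -> Max Obs/Exp Position as reported in the Kmer Content table.
reported_positions = {"GGTATCA": 1, "GTATCAA": 2}


def one_based_position(kmer, sequence):
    """Return the 1-based start of the first occurrence of kmer, or None if absent."""
    index = sequence.find(kmer)
    return index + 1 if index >= 0 else None


# Compare the position found in the sequence with the position FastQC reported.
for kmer, reported in reported_positions.items():
    found = one_based_position(kmer, top_prefix)
    print(kmer, "reported:", reported, "found:", found)
```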