Basic Statistics
Measure | Value |
---|---|
Filename | ERR1042004.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 13157677 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 59645 | 0.45330950136562864 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 57803 | 0.43931006970303343 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 44313 | 0.3367843731078062 | No Hit |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 16462 | 0.12511327037439815 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 28595 | 0.0 | 18.108585 | 1 |
TTAACGG | 2335 | 0.0 | 15.687366 | 35 |
TAACGGC | 2545 | 0.0 | 14.320236 | 36 |
TATTAGA | 3475 | 0.0 | 14.267627 | 2 |
TAATACT | 4235 | 0.0 | 13.804013 | 4 |
ATTAGAG | 3435 | 0.0 | 13.733624 | 3 |
GTATTAG | 5890 | 0.0 | 13.254669 | 1 |
TCTAATA | 3185 | 0.0 | 12.89482 | 2 |
AATACTG | 3855 | 0.0 | 12.813229 | 5 |
GTATCAA | 40640 | 0.0 | 12.732407 | 2 |
GGGGTTA | 2355 | 0.0 | 12.333333 | 6 |
GTACTAT | 2030 | 0.0 | 12.2118225 | 1 |
CGAACTA | 2840 | 0.0 | 12.181338 | 24 |
TTAGGAC | 2240 | 0.0 | 11.5625 | 3 |
TTTAACG | 3400 | 0.0 | 11.535295 | 34 |
GGGTTAG | 2600 | 0.0 | 11.384615 | 7 |
GCGAACT | 3080 | 0.0 | 11.292208 | 23 |
GTGACAC | 3580 | 0.0 | 11.265364 | 24 |
ACCGCCT | 3555 | 0.0 | 11.240506 | 12 |
GGCACCG | 3550 | 0.0 | 11.2042265 | 9 |