Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041555.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 8172623 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 34122 | 0.41751589422392293 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 24053 | 0.29431187514706114 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 23923 | 0.29272119856746115 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 11617 | 0.14214530634779066 | No Hit |
| GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTT | 8591 | 0.10511924996417919 | No Hit |
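
The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics (8172623), multiplied by 100. The following is a minimal sketch of that arithmetic, assuming this formula; the name `percentage_of_total` is illustrative and is not part of any tool's API.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 8172623

overrepresented_counts = {
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": 34122,
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT": 24053,
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT": 23923,
    "ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 11617,
    "GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTT": 8591,
}


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of all reads (assumed formula)."""
    return 100.0 * count / total


# Print each sequence with its count and recomputed percentage.
for sequence, count in overrepresented_counts.items():
    print(f"{sequence}\t{count}\t{percentage_of_total(count):.4f}")
```

Under this assumption, the recomputed values should agree with the Percentage column to within floating-point rounding.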
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| AAGACGG | 1925 | 0.0 | 19.701298 | 5 |
| GACGGAC | 1995 | 0.0 | 18.26817 | 7 |
| TCTAGCG | 1355 | 0.0 | 18.022139 | 28 |
| CTAGCGG | 1400 | 0.0 | 17.574999 | 29 |
| ACGGACC | 2075 | 0.0 | 17.563854 | 8 |
| CGCAAGA | 2100 | 0.0 | 17.442858 | 2 |
| CAAGACG | 2350 | 0.0 | 17.08298 | 4 |
| GGTATCA | 15305 | 0.0 | 16.064358 | 1 |
| GCGCAAG | 2340 | 0.0 | 15.811966 | 1 |
| TTAACGG | 340 | 0.0 | 15.779412 | 35 |
| AGACGGA | 2415 | 0.0 | 15.62733 | 6 |
| CGGACCA | 2310 | 0.0 | 15.616883 | 9 |
| CGCAATA | 1655 | 0.0 | 15.537765 | 36 |
| TCGTTTA | 1475 | 0.0 | 14.925423 | 30 |
| TAGCGGC | 1685 | 0.0 | 14.8219595 | 30 |
| ACGAACG | 725 | 0.0 | 14.8 | 15 |
| CGCGAAA | 190 | 4.5656634E-10 | 14.605264 | 15 |
| CGACGGT | 1395 | 0.0 | 14.587814 | 7 |
| CGTTCGG | 205 | 9.276846E-11 | 14.439024 | 24 |
| TATACTG | 1745 | 0.0 | 14.418339 | 5 |