Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1042173.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 9258736 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 50 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTT | 13476 | 0.14554902526651586 | No Hit |
| GGGTAGGCACACGCTGAGCCAGTCAGTGTAGCGCGCGTGCAGC | 12504 | 0.13505083199261755 | No Hit |
| GTGTAGCGCGCGTGCAGCCCCGGACATCTAAGGGCATCACAGA | 11038 | 0.11921713719885738 | No Hit |
| GAGTATGGTTGCAAAGCTGAAACTTAAAGGAATTGACGGAAGG | 10380 | 0.11211033557928425 | No Hit |
| CCTTAGATGTCCGGGGCTGCACGCGCGCTACACTGACTGGCTC | 10091 | 0.10898895918406142 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 5950 | 0.0 | 20.987394 | 1 |
| AAGACGG | 3040 | 0.0 | 18.621712 | 5 |
| GACGGAC | 3025 | 0.0 | 18.041323 | 7 |
| ACGGACC | 3105 | 0.0 | 17.636072 | 8 |
| TCGTTTA | 2250 | 0.0 | 17.102222 | 30 |
| AGACGGA | 3385 | 0.0 | 16.72378 | 6 |
| CAAGACG | 3485 | 0.0 | 16.456242 | 4 |
| CGAACGA | 1895 | 0.0 | 16.10818 | 16 |
| GCGCAAG | 3510 | 0.0 | 16.075499 | 1 |
| CGCAAGA | 3510 | 0.0 | 16.022793 | 2 |
| ATGGTCG | 2370 | 0.0 | 16.002111 | 36 |
| ACGAACG | 1920 | 0.0 | 15.994791 | 15 |
| CGTTTAT | 2425 | 0.0 | 15.944329 | 31 |
| TAACGCC | 2550 | 0.0 | 15.743136 | 4 |
| TGGTCGG | 2465 | 0.0 | 15.685598 | 37 |
| GCGGGTA | 1345 | 0.0 | 15.54275 | 23 |
| CGGACCA | 3545 | 0.0 | 15.394922 | 9 |
| TATCTAG | 2555 | 0.0 | 15.350294 | 1 |
| TAACGAA | 2045 | 0.0 | 15.288509 | 13 |
| ATAACGC | 2830 | 0.0 | 15.231449 | 3 |