Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041720.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 5946415 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 45 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 35905 | 0.6038091858708146 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 33455 | 0.5626078906366272 | No Hit |
| CTTATACACATCTCCGAGCCCACGAGACAGGCAGAAATCTCGT | 22449 | 0.37752158233153926 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 21552 | 0.36243686321926744 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 13772 | 0.2316017297817256 | No Hit |
| TCTCCGAGCCCACGAGACAGGCAGAAATCTCGTATGCCGTCTT | 7047 | 0.11850837857768085 | RNA PCR Primer, Index 17 (95% over 21bp) |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 10620 | 0.0 | 22.663372 | 1 |
| TACACAT | 5035 | 0.0 | 21.310825 | 5 |
| CCGTCTT | 1385 | 0.0 | 20.703972 | 37 |
| ACACATC | 5305 | 0.0 | 19.319511 | 6 |
| GCCGTCT | 1475 | 0.0 | 19.315256 | 36 |
| ATCTCCG | 5200 | 0.0 | 19.17596 | 10 |
| ACATCTC | 5705 | 0.0 | 18.548641 | 8 |
| ATACACA | 6390 | 0.0 | 17.197184 | 4 |
| CACATCT | 6110 | 0.0 | 16.562193 | 7 |
| ATCTCGT | 5700 | 0.0 | 16.552631 | 37 |
| TATACAC | 6305 | 0.0 | 16.460745 | 3 |
| AATCTCG | 5800 | 0.0 | 16.36293 | 36 |
| CCCACGA | 6025 | 0.0 | 16.273859 | 19 |
| CGAGACA | 6040 | 0.0 | 16.233442 | 23 |
| ACGAGAC | 6040 | 0.0 | 16.233442 | 22 |
| TCTCCGA | 6140 | 0.0 | 16.14984 | 11 |
| TATGCCG | 1910 | 0.0 | 16.078533 | 33 |
| GCCCACG | 6275 | 0.0 | 15.713944 | 18 |
| CGAGCCC | 6305 | 0.0 | 15.639175 | 15 |
| GTATCAA | 15520 | 0.0 | 15.543815 | 2 |