Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041851.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2228225 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 53 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 3654 | 0.16398703003511764 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 3438 | 0.15429321545176095 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 2345 | 0.10524071850912722 | No Hit |
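
The Percentage column above appears to be each sequence's count divided by the Total Sequences value in Basic Statistics. The sketch below reproduces that arithmetic from the numbers in this report. The names `TOTAL_SEQUENCES`, `overrepresented_counts`, and `percentage_of_total` are illustrative and are not part of FastQC.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 2228225

overrepresented_counts = {
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": 3654,
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT": 3438,
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT": 2345,
}


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed definition of the column)."""
    return 100.0 * count / total


percentages = {
    seq: percentage_of_total(count)
    for seq, count in overrepresented_counts.items()
}

for seq, pct in percentages.items():
    print(f"{seq}\t{pct:.4f}%")
```

Under this assumption the computed values agree with the reported percentages to within floating-point rounding.
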
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 1960 | 0.0 | 30.959183 | 1 |
| GTATCAA | 2925 | 0.0 | 20.871796 | 2 |
| TAGACTA | 115 | 3.0540832E-9 | 19.304348 | 5 |
| GTTCTAA | 215 | 0.0 | 18.930233 | 1 |
| TTTAGCG | 90 | 2.1540618E-6 | 18.5 | 26 |
| TTAGACT | 115 | 6.412847E-8 | 17.695652 | 4 |
| GTATTAG | 320 | 0.0 | 16.765625 | 1 |
| CTAGACT | 170 | 8.54925E-11 | 16.32353 | 4 |
| ATAACCG | 80 | 3.384349E-4 | 16.1875 | 5 |
| CTACACT | 255 | 0.0 | 15.960784 | 4 |
| GTCGCTA | 95 | 7.0644295E-5 | 15.578948 | 37 |
| CGGATAA | 190 | 2.7284841E-11 | 15.578948 | 25 |
| TAAGATA | 155 | 7.219569E-9 | 15.516129 | 4 |
| TACACTG | 280 | 0.0 | 15.196429 | 5 |
| ATTACTC | 220 | 1.8189894E-12 | 15.136364 | 3 |
| CCAATAC | 380 | 0.0 | 15.092106 | 3 |
| CCTAGTA | 210 | 9.094947E-12 | 14.97619 | 2 |
| GTCTTAC | 210 | 9.094947E-12 | 14.97619 | 1 |
| ATCAACG | 4130 | 0.0 | 14.826877 | 4 |
| CAACGCA | 4085 | 0.0 | 14.809058 | 6 |
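
Several of the enriched k-mers above look like substrings of the overrepresented sequences listed earlier in this report. The following sketch checks that by plain substring search and reports 1-based start positions. The helper `first_occurrences` is hypothetical, only a subset of the k-mers is included for brevity, and a match here suggests (but does not prove) that the k-mer enrichment comes from the same reads.

```python
# Sequences copied from the Overrepresented sequences table.
overrepresented = [
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT",
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT",
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT",
]

# A subset of k-mers from the Kmer Content table, mapped to the
# reported "Max Obs/Exp Position".
kmers = {
    "GGTATCA": 1,
    "GTATCAA": 2,
    "ATCAACG": 4,
    "CAACGCA": 6,
    "TAGACTA": 5,
    "GTTCTAA": 1,
}


def first_occurrences(kmer, sequences):
    """Return the 1-based first position of kmer in each sequence that contains it."""
    positions = []
    for seq in sequences:
        idx = seq.find(kmer)
        if idx != -1:
            positions.append(idx + 1)
    return positions


matches = {kmer: first_occurrences(kmer, overrepresented) for kmer in kmers}

for kmer, positions in matches.items():
    status = f"found at {positions}" if positions else "not found"
    print(f"{kmer} (reported position {kmers[kmer]}): {status}")
```

In this subset, GGTATCA, GTATCAA, ATCAACG, and CAACGCA each occur in the most abundant overrepresented sequence at the position the k-mer table reports, while TAGACTA and GTTCTAA do not occur in any of the three.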