Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041828.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2039656 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 51 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CTCTAATACTGGTGATGCTAGAGGTGATGTTTTTGGTAAACAG | 2558 | 0.12541330498868436 | No Hit |
| GTATTAGAGGCACCGCCTGCCCAGTGACACATGTTTAACGGCC | 2486 | 0.12188329796789263 | No Hit |
| CCATAGGGTCTTCTCGTCTTGCTGTGTCATGCCCGCCTCTTCA | 2328 | 0.1141368936722663 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TTAACGC | 115 | 0.0 | 33.782608 | 35 |
| GGTATCA | 2065 | 0.0 | 29.474577 | 1 |
| TAACGCT | 140 | 0.0 | 26.428572 | 36 |
| AACGCTG | 210 | 0.0 | 22.02381 | 37 |
| GTATCAA | 3085 | 0.0 | 19.729334 | 2 |
| GAGCGTA | 60 | 9.240162E-4 | 18.5 | 6 |
| TCTAACG | 60 | 9.240162E-4 | 18.5 | 2 |
| AACCTCG | 420 | 0.0 | 18.5 | 19 |
| GCGGAAT | 230 | 0.0 | 18.5 | 19 |
| TATACTG | 145 | 1.546141E-10 | 17.862068 | 5 |
| GCCTTAC | 440 | 0.0 | 17.65909 | 25 |
| ATCGTAC | 105 | 4.801768E-7 | 17.619047 | 25 |
| ACCTCGC | 455 | 0.0 | 17.483517 | 20 |
| TCGCTAA | 460 | 0.0 | 17.293478 | 14 |
| CGTATCT | 65 | 0.0015805012 | 17.076923 | 9 |
| GTACGGA | 260 | 0.0 | 17.076923 | 6 |
| GTATACG | 185 | 1.8189894E-12 | 17.0 | 1 |
| CTCGCTA | 505 | 0.0 | 16.851486 | 13 |
| GTATAAA | 200 | 0.0 | 16.650002 | 1 |
| GTATTAG | 1200 | 0.0 | 16.65 | 1 |