Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1042538.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 5863569 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 52 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 6725 | 0.11469124009626219 | No Hit |
| GGGTAGGCACACGCTGAGCCAGTCAGTGTAGCGCGCGTGCAGC | 6049 | 0.10316242547840744 | No Hit |
| GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTT | 5893 | 0.10050192979736404 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 3990 | 0.0 | 25.454887 | 1 |
| AAGACGG | 1685 | 0.0 | 17.23739 | 5 |
| TCGTTTA | 1080 | 0.0 | 16.787037 | 30 |
| GTATCAA | 6085 | 0.0 | 16.447823 | 2 |
| ACGGACC | 1590 | 0.0 | 16.40566 | 8 |
| GACGGAC | 1590 | 0.0 | 16.172955 | 7 |
| CGTTTAT | 1180 | 0.0 | 15.834745 | 31 |
| AGACGGA | 1820 | 0.0 | 15.247252 | 6 |
| CAAGACG | 1885 | 0.0 | 15.1140585 | 4 |
| GTACCGT | 210 | 9.094947E-12 | 14.97619 | 6 |
| CGGACCA | 1760 | 0.0 | 14.926136 | 9 |
| TATACTG | 865 | 0.0 | 14.543352 | 5 |
| TAACGCC | 1385 | 0.0 | 14.425994 | 4 |
| ATAACGC | 1395 | 0.0 | 14.189964 | 3 |
| CGCAAGA | 1890 | 0.0 | 14.095239 | 2 |
| TACGACG | 1000 | 0.0 | 14.059999 | 5 |
| GCGCAAG | 1905 | 0.0 | 13.984252 | 1 |
| TGGTCGG | 1330 | 0.0 | 13.909774 | 37 |
| GCGAAAG | 1880 | 0.0 | 13.776595 | 18 |
| ATGGTCG | 1270 | 0.0 | 13.692913 | 36 |