Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1633420.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 1019248 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 45 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 5923 | 0.5811147041740578 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 3977 | 0.3901896300017268 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 2334 | 0.22899235514810917 | No Hit |
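
The Percentage values above are consistent with Count divided by Total Sequences (1019248, from Basic Statistics), times 100. The sketch below reproduces them under that assumption; the function name `percent_of_total` and the variable names are illustrative and are not part of the FastQC output.

```python
# Minimal sketch: recompute the Percentage column from Count and Total Sequences.
# Assumption: Percentage = 100 * Count / Total Sequences.

TOTAL_SEQUENCES = 1019248  # "Total Sequences" from Basic Statistics

# (sequence, count) pairs copied from the Overrepresented sequences table
OVERREPRESENTED = [
    ("GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT", 5923),
    ("TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT", 3977),
    ("GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT", 2334),
]


def percent_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * count / total


if __name__ == "__main__":
    for sequence, count in OVERREPRESENTED:
        pct = percent_of_total(count, TOTAL_SEQUENCES)
        print(f"{sequence}\t{count}\t{pct:.4f}")
```

All three sequences share the same core followed by a poly-T run and differ only by a one-base shift at the start, so their counts can also be summed to estimate the combined share of reads.
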
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 1120 | 0.0 | 24.776785 | 1 |
| GTATCAA | 3130 | 0.0 | 23.9377 | 1 |
| TAGACCG | 50 | 2.7018116E-4 | 22.199999 | 5 |
| CGAATTA | 60 | 3.725752E-5 | 21.583334 | 15 |
| CGTATAG | 45 | 0.0038255157 | 20.555555 | 1 |
| CGGTTAT | 45 | 0.0038255157 | 20.555555 | 37 |
| GCGGGTA | 65 | 6.9011476E-5 | 19.923079 | 22 |
| TTCGCCG | 125 | 4.129106E-10 | 19.240002 | 24 |
| ATTTTCG | 80 | 1.6163782E-5 | 18.5 | 15 |
| GCCGGCA | 150 | 1.2732926E-11 | 18.499998 | 15 |
| TATTCCG | 135 | 1.1514203E-9 | 17.814816 | 5 |
| CTAGACA | 115 | 6.402843E-8 | 17.695652 | 4 |
| CGAACTA | 105 | 4.7959475E-7 | 17.619047 | 24 |
| TTAACGG | 85 | 2.7226262E-5 | 17.411764 | 35 |
| CGCTCTC | 150 | 2.5102054E-10 | 17.266666 | 29 |
| TAAGACT | 230 | 0.0 | 16.891304 | 4 |
| ATAATAC | 245 | 0.0 | 16.612244 | 3 |
| TATCAAC | 4545 | 0.0 | 16.363035 | 2 |
| ATCAACG | 4545 | 0.0 | 16.281628 | 3 |
| CGAGTCG | 80 | 3.3820962E-4 | 16.1875 | 21 |