Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1042014.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 3080876 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 46 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 24310 | 0.7890612929569382 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 17608 | 0.5715257608550296 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 15530 | 0.5040774117491259 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 6469 | 0.20997274801063076 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 8155 | 0.0 | 18.670141 | 1 |
| TCTATCG | 60 | 9.2416216E-4 | 18.5 | 31 |
| GTACTAG | 185 | 1.8189894E-11 | 16.0 | 1 |
| TTAACGG | 210 | 9.094947E-12 | 14.9761915 | 35 |
| TTAGCGA | 125 | 2.9624698E-6 | 14.799999 | 27 |
| ATCCGTA | 125 | 2.9624698E-6 | 14.799999 | 12 |
| ACGGACC | 290 | 0.0 | 14.672414 | 8 |
| GCGTTAG | 90 | 8.27958E-4 | 14.388888 | 1 |
| CGAACGA | 245 | 0.0 | 14.346938 | 16 |
| TAACGAA | 265 | 0.0 | 13.962264 | 13 |
| TACTCCG | 400 | 0.0 | 13.875001 | 5 |
| GTACCGA | 80 | 0.006301791 | 13.875 | 6 |
| TTAGACT | 375 | 0.0 | 13.813333 | 4 |
| GACGGAC | 325 | 0.0 | 13.661538 | 7 |
| TATACCG | 190 | 7.1395334E-9 | 13.631579 | 5 |
| TAACGCC | 300 | 0.0 | 13.566667 | 4 |
| GTATCAA | 11245 | 0.0 | 13.506892 | 2 |
| TAACGGC | 235 | 5.638867E-11 | 13.3829775 | 36 |
| TAATACT | 625 | 0.0 | 13.32 | 4 |
| ATAACGA | 265 | 1.8189894E-12 | 13.264152 | 12 |