Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1042240.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 6681585 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 46 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 66235 | 0.9913067034244121 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 57620 | 0.862370231015545 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 32488 | 0.48623193448859814 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 22419 | 0.3355341584369577 | No Hit |
| GAACAGTGGTATCAACGCAAAAAAAAAAAAAAAAAAAAAAAAA | 7724 | 0.11560131316147292 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| ACGGACC | 1045 | 0.0 | 17.70335 | 8 |
| AAGACGG | 1270 | 0.0 | 17.625982 | 5 |
| GACGGAC | 1115 | 0.0 | 17.089685 | 7 |
| CGGACCA | 1140 | 0.0 | 16.552631 | 9 |
| GGTATCA | 24575 | 0.0 | 16.057173 | 1 |
| AGACGGA | 1340 | 0.0 | 15.324627 | 6 |
| CTAACGC | 210 | 9.094947E-12 | 14.97619 | 3 |
| CGCAAGA | 1340 | 0.0 | 14.910448 | 2 |
| CGAACGA | 525 | 0.0 | 14.095238 | 16 |
| TCTAACG | 215 | 1.9826984E-10 | 13.767443 | 2 |
| GTATTAG | 1635 | 0.0 | 13.691133 | 1 |
| TATACTG | 1215 | 0.0 | 13.551441 | 5 |
| GCGCAAG | 1530 | 0.0 | 13.542483 | 1 |
| TACGACG | 855 | 0.0 | 13.415205 | 5 |
| TAACGCC | 925 | 0.0 | 13.4 | 4 |
| GTACTAG | 525 | 0.0 | 13.390475 | 1 |
| TCTATAC | 975 | 0.0 | 13.282051 | 3 |
| GCGAAAG | 1340 | 0.0 | 13.115672 | 18 |
| CGAGCCG | 985 | 0.0 | 12.959391 | 15 |
| CAAGACG | 1785 | 0.0 | 12.955181 | 4 |