Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1042013.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2357191 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 47 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 13591 | 0.5765761026577821 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 10195 | 0.4325063179012647 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 8975 | 0.3807497992313733 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2998 | 0.1271852811248643 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 3570 | 0.0 | 24.303923 | 1 |
| TAGACCG | 70 | 5.1056777E-6 | 21.142859 | 5 |
| ACGTTCG | 60 | 9.240743E-4 | 18.5 | 7 |
| GTATTAG | 710 | 0.0 | 18.239437 | 1 |
| CTAGTAC | 190 | 0.0 | 17.526316 | 3 |
| TAACGCC | 180 | 0.0 | 17.472223 | 4 |
| CGTTCGA | 65 | 0.0015805999 | 17.076923 | 8 |
| ATAACGC | 210 | 0.0 | 16.738096 | 3 |
| TTAGACT | 245 | 0.0 | 16.612244 | 4 |
| TTAGGAC | 440 | 0.0 | 16.397728 | 3 |
| TATACTG | 350 | 0.0 | 16.385714 | 5 |
| GACGGAC | 185 | 1.8189894E-11 | 16.0 | 7 |
| GTATCAA | 5430 | 0.0 | 15.978822 | 2 |
| TAATCGA | 430 | 0.0 | 15.918604 | 21 |
| TCTATAC | 295 | 0.0 | 15.677966 | 3 |
| ATCGATA | 450 | 0.0 | 15.622222 | 23 |
| TAGTACT | 285 | 0.0 | 15.578948 | 4 |
| GTAATCG | 455 | 0.0 | 15.450549 | 20 |
| AATCGAT | 455 | 0.0 | 15.450549 | 22 |
| ACGACCA | 350 | 0.0 | 15.328571 | 11 |