Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041664.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 4786029 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 50 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 8426 | 0.17605409411434825 | No Hit |
| CCCATGTACTCTGCGTTGATACCACTGCTTCCCATGTACTCTG | 7111 | 0.14857828901579995 | No Hit |
| GTACATGGGAAGCAGTGGTATCAACGCAGAGTACATGGGAAGC | 6649 | 0.13892519247167118 | No Hit |
| GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTT | 5915 | 0.12358888757255755 | No Hit |
| GAGTACATGGGAAGCAGTGGTATCAACGCAGAGTACATGGGAA | 5010 | 0.10467968330321442 | No Hit |
| CATGTACTCTGCGTTGATACCACTGCTTCCCATGTACTCTGCG | 4905 | 0.10248579772500334 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| AAGACGG | 1495 | 0.0 | 17.076923 | 5 |
| ACGGACC | 1520 | 0.0 | 16.309212 | 8 |
| TAGAGTG | 750 | 0.0 | 15.786668 | 5 |
| GACGGAC | 1540 | 0.0 | 15.616882 | 7 |
| CGCAAGA | 1650 | 0.0 | 15.472728 | 2 |
| TCGTTTA | 990 | 0.0 | 15.323233 | 30 |
| CGGACCA | 1685 | 0.0 | 15.151337 | 9 |
| GCGCAAG | 1735 | 0.0 | 15.14121 | 1 |
| CAAGACG | 1750 | 0.0 | 14.905715 | 4 |
| TTAGAGT | 740 | 0.0 | 14.5 | 4 |
| TACCGTC | 1200 | 0.0 | 14.491667 | 7 |
| ACCGTCG | 1220 | 0.0 | 14.254098 | 8 |
| CGCATCG | 1250 | 0.0 | 14.06 | 13 |
| AGACGGA | 1755 | 0.0 | 14.019943 | 6 |
| AATAACG | 1255 | 0.0 | 13.856573 | 2 |
| TAACGCC | 1275 | 0.0 | 13.639215 | 4 |
| ATAACGC | 1370 | 0.0 | 13.638687 | 3 |
| TAGACCG | 95 | 0.0012460814 | 13.631579 | 5 |
| ATTAGAG | 910 | 0.0 | 13.620878 | 3 |
| CCGTCGT | 1270 | 0.0 | 13.547243 | 9 |