Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041651.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2150921 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 5291 | 0.24598764901174894 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 3814 | 0.17731939015891332 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 3048 | 0.14170673864823485 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 2045 | 0.0 | 22.254278 | 1 |
| CGCGATA | 65 | 6.9057045E-5 | 19.923077 | 14 |
| TATATCG | 90 | 4.448856E-5 | 16.444445 | 5 |
| CGGATCG | 140 | 3.477362E-8 | 15.857143 | 37 |
| TACCGTC | 205 | 5.456968E-12 | 15.341464 | 7 |
| CGTCGTA | 175 | 2.240995E-9 | 14.8 | 10 |
| GTATCAA | 3075 | 0.0 | 14.679674 | 2 |
| TATACCG | 140 | 6.0013554E-7 | 14.535715 | 5 |
| TAACACT | 425 | 0.0 | 14.364706 | 4 |
| TATACTG | 310 | 0.0 | 14.322581 | 5 |
| TACTCCG | 220 | 1.8189894E-11 | 14.295454 | 5 |
| CAGTACT | 505 | 0.0 | 13.920792 | 4 |
| CTACGTA | 80 | 0.0063009746 | 13.875 | 9 |
| TAGGACC | 350 | 0.0 | 13.742858 | 4 |
| TCTAGAC | 230 | 4.0017767E-11 | 13.673912 | 3 |
| GTACTAG | 180 | 5.1650204E-8 | 13.361111 | 1 |
| ATAACGC | 195 | 1.0271833E-8 | 13.282052 | 3 |
| CGCTATA | 155 | 1.8894953E-6 | 13.129032 | 2 |
| ATACCGA | 155 | 1.8894953E-6 | 13.129032 | 6 |
| CGTACGG | 170 | 3.735313E-7 | 13.058824 | 22 |