Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041951.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 970641 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 1860 | 0.19162594615310913 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1381 | 0.14227711378357188 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 1306 | 0.1345502611161078 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 590 | 0.0 | 25.711866 | 1 |
| TGCGTAC | 40 | 0.0019309719 | 23.125 | 12 |
| ATTAGAC | 45 | 0.0038253856 | 20.555555 | 3 |
| GTTCGTA | 55 | 5.1423034E-4 | 20.181818 | 25 |
| CGCTATT | 50 | 0.0070341127 | 18.5 | 26 |
| GTATCAA | 895 | 0.0 | 17.156425 | 2 |
| TGTATAC | 110 | 7.8045196E-7 | 16.818182 | 3 |
| TAATACG | 70 | 0.002592104 | 15.857143 | 4 |
| CGTGCGT | 70 | 0.002592104 | 15.857143 | 10 |
| TAGGCTG | 245 | 0.0 | 15.102041 | 5 |
| TATACTG | 160 | 1.0953954E-8 | 15.03125 | 5 |
| CCTATAC | 160 | 1.0953954E-8 | 15.03125 | 3 |
| GTCTAAG | 100 | 1.0930853E-4 | 14.8 | 1 |
| CCGCCGT | 150 | 8.0952304E-8 | 14.799999 | 24 |
| CCTGTAA | 855 | 0.0 | 14.497076 | 1 |
| TACACTC | 115 | 2.2090337E-5 | 14.478261 | 5 |
| TCTGTCG | 130 | 4.4419594E-6 | 14.230768 | 8 |
| TGTAATC | 730 | 0.0 | 14.191781 | 3 |
| GTGTAGA | 300 | 0.0 | 14.183332 | 1 |
| TAGATCA | 105 | 1.6552124E-4 | 14.095238 | 4 |