Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR4062106_1.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 1132481 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 50 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTTCATTAAT | 1293 | 0.11417410093414371 | No Hit |
| GAATAGGACCGCGGTTCTATTTTGTTGGTTTTCGGAACTGAGGCCATGAT | 1232 | 0.108787697100437 | No Hit |
| GATTAAGAGGGACGGCCGGGGGCATTCGTATTGCGCCGCTAGAGGTGAAA | 1231 | 0.10869939539824508 | No Hit |
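
The Percentage column above appears to be each sequence's count divided by Total Sequences from Basic Statistics (1132481), times 100. The sketch below checks that arithmetic. The names `TOTAL_SEQUENCES` and `percentage_of_total` are illustrative only and are not part of FastQC.

```python
# Total Sequences from the Basic Statistics table.
TOTAL_SEQUENCES = 1132481


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of the total number of reads."""
    return 100.0 * count / total


# Counts from the Overrepresented sequences table, in row order.
counts = [1293, 1232, 1231]
percentages = [percentage_of_total(c) for c in counts]

for count, pct in zip(counts, percentages):
    print(f"count={count} percentage={pct:.6f}")
```

Each of the three sequences accounts for only slightly more than 0.1% of the reads.
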
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GTATCAA | 1395 | 0.0 | 26.980303 | 1 |
| CGAATGC | 315 | 0.0 | 20.95196 | 43 |
| CTAGCGG | 340 | 0.0 | 20.70638 | 29 |
| AATACGA | 330 | 0.0 | 20.66625 | 39 |
| TAGGACG | 140 | 1.8553692E-10 | 20.428162 | 4 |
| GGTATCA | 655 | 0.0 | 20.162058 | 1 |
| TAGCGGC | 350 | 0.0 | 20.114769 | 30 |
| CGCAATA | 350 | 0.0 | 20.113882 | 36 |
| CAATACG | 345 | 0.0 | 19.76772 | 38 |
| GCGCAAT | 360 | 0.0 | 19.555164 | 35 |
| TCTAGCG | 350 | 0.0 | 19.486183 | 28 |
| CGGATCG | 80 | 8.986273E-5 | 19.250463 | 26 |
| TCTAGAT | 290 | 0.0 | 18.966812 | 2 |
| AACGCAG | 2020 | 0.0 | 18.732298 | 6 |
| TACGAAT | 365 | 0.0 | 18.684555 | 41 |
| ATACGAA | 365 | 0.0 | 18.684555 | 40 |
| TTAGGAC | 225 | 0.0 | 18.577404 | 3 |
| TCAACGC | 2035 | 0.0 | 18.486115 | 4 |
| CAACGCA | 2090 | 0.0 | 18.210161 | 5 |
| TATACTG | 135 | 5.525908E-8 | 17.925568 | 5 |
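
Several of the k-mers above peak within the first few bases of the read, while others peak much later. The sketch below separates the two groups using a subset of the rows. The cutoff of 6 and the helper name `kmers_near_start` are assumptions chosen for illustration, not FastQC settings.

```python
# A subset of rows from the Kmer Content table:
# (sequence, count, obs_exp_max, max_obs_exp_position)
KMER_ROWS = [
    ("GTATCAA", 1395, 26.980303, 1),
    ("CGAATGC", 315, 20.95196, 43),
    ("CTAGCGG", 340, 20.70638, 29),
    ("GGTATCA", 655, 20.162058, 1),
    ("AACGCAG", 2020, 18.732298, 6),
    ("TCAACGC", 2035, 18.486115, 4),
    ("CAACGCA", 2090, 18.210161, 5),
]


def kmers_near_start(rows, max_position=6):
    """Return rows whose peak enrichment position is at or before
    max_position, ordered by that position.

    max_position is a hypothetical cutoff, not a FastQC parameter.
    """
    selected = [row for row in rows if row[3] <= max_position]
    # sorted() is stable, so rows sharing a position keep table order.
    return sorted(selected, key=lambda row: row[3])


near_start = kmers_near_start(KMER_ROWS)
for sequence, count, obs_exp, position in near_start:
    print(f"{sequence} position={position} count={count} obs/exp={obs_exp}")
```

Raising or lowering `max_position` changes which k-mers are treated as read-start enrichment, so the cutoff should be chosen to suit the library in question.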