Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR840941.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 4266828 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 24-50 |
| %GC | 52 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| ATGGCACATGCAGCGCAAGTAGGTCTACAAGACGCTACTTCCCCTATC | 8656 | 0.2028673290791192 | No Hit |
| TTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 7475 | 0.17518868817772829 | No Hit |
| CATCCTGCTTCGTCAGGTTTATACCACTTTATTTGGTGTGCTGTGTTA | 6070 | 0.14226024578445629 | No Hit |
| ATGGCACATGCAGCGCAAGTAGGTCTACAAGACGCTACTTCCCCTA | 4933 | 0.11561281589039915 | No Hit |
| CATCCTGCTTCGTCAGGTTTATACCACTTTATTTGGTGTGCTGTGTT | 4314 | 0.10110555194631703 | No Hit |
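
The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics (4266828), times 100. The following is a minimal sketch for checking that relationship, not FastQC's own code; the function name `percentage` and the constant `TOTAL_SEQUENCES` are illustrative names introduced here.

```python
# Assumption: Percentage = 100 * Count / Total Sequences.
# Values are copied from the tables in this report.

TOTAL_SEQUENCES = 4266828  # "Total Sequences" in Basic Statistics

# Counts from the Overrepresented sequences table, in row order.
OVERREPRESENTED_COUNTS = [8656, 7475, 6070, 4933, 4314]


def percentage(count: int, total: int) -> float:
    """Return count as a percentage of total."""
    return 100.0 * count / total


if __name__ == "__main__":
    for count in OVERREPRESENTED_COUNTS:
        print(f"{count}\t{percentage(count, TOTAL_SEQUENCES):.6f}")
```

Comparing the printed values with the Percentage column is a quick sanity check that the table was not altered or truncated during extraction.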
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CTATCAT | 1995 | 0.0 | 212.04659 | 44 |
| CCGTCCA | 65 | 1.2732926E-11 | 144.16898 | 44 |
| ATACGCT | 140 | 0.0 | 133.87119 | 44 |
| TAAGCCA | 1060 | 0.0 | 128.81946 | 44 |
| ACCGTCA | 90 | 0.0 | 115.725655 | 43 |
| ATTAGTC | 155 | 0.0 | 112.27906 | 44 |
| CAGGTTA | 210 | 0.0 | 108.37192 | 44 |
| TACAATA | 135 | 0.0 | 99.16384 | 44 |
| TATACGC | 130 | 0.0 | 98.968994 | 43 |
| TTACTGT | 275 | 0.0 | 97.36086 | 44 |
| CCCATAC | 525 | 0.0 | 91.797386 | 44 |
| CCTATCA | 2130 | 0.0 | 91.46833 | 43 |
| TCGTCAA | 50 | 7.421477E-10 | 85.77313 | 43 |
| ACCACTC | 470 | 0.0 | 85.44969 | 44 |
| CGTTCGA | 180 | 0.0 | 85.09239 | 43 |
| TCGTTAT | 95 | 2.7643182E-8 | 84.550224 | 44 |
| TCACGTC | 95 | 2.7643182E-8 | 84.550224 | 44 |
| CGTATCT | 30 | 3.6779966E-8 | 81.15324 | 42 |
| CGTCTAA | 20 | 3.811359E-5 | 81.15324 | 42 |
| TATCGTT | 45 | 1.8189894E-12 | 81.15324 | 42 |