Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR840963.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 6170751 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 24-50 |
| %GC | 54 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| ATGGCACATGCAGCGCAAGTAGGTCTACAAGACGCTACTTCCCCTA | 10670 | 0.17291250287039617 | No Hit |
| AGCCATTGTGGCTCCGGCCGGTTGCGCGGG | 10542 | 0.17083820105526865 | No Hit |
| AGCCATTGTGGCTCCGGCCGGTTGCGCGGGCCCTCGGACCCTCA | 9589 | 0.15539437582232696 | No Hit |
| CTTTTCCAAGCGGCTGCCGAAGATGGCGGAGG | 8758 | 0.14192761950692875 | No Hit |
| TTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 6450 | 0.10452536490291053 | No Hit |
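
The Percentage column appears to be each sequence's count divided by Total Sequences (6170751, from Basic Statistics) and multiplied by 100. The sketch below recomputes it from the counts in the table. The helper name `percent_of_total` and the constant names are hypothetical, introduced only for illustration, and the 0.1% cutoff is the reporting level FastQC documents for this module, assumed here to be the default.

```python
TOTAL_SEQUENCES = 6170751  # "Total Sequences" from Basic Statistics

# Assumed default: FastQC lists a sequence once it exceeds 0.1% of all reads.
REPORT_THRESHOLD_PERCENT = 0.1

# (sequence, count) pairs copied from the Overrepresented sequences table.
OVERREPRESENTED = [
    ("ATGGCACATGCAGCGCAAGTAGGTCTACAAGACGCTACTTCCCCTA", 10670),
    ("AGCCATTGTGGCTCCGGCCGGTTGCGCGGG", 10542),
    ("AGCCATTGTGGCTCCGGCCGGTTGCGCGGGCCCTCGGACCCTCA", 9589),
    ("CTTTTCCAAGCGGCTGCCGAAGATGGCGGAGG", 8758),
    ("TTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA", 6450),
]


def percent_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of the total number of reads."""
    return 100.0 * count / total


for seq, count in OVERREPRESENTED:
    pct = percent_of_total(count)
    # Flag whether the sequence clears the assumed reporting threshold.
    flag = "above" if pct > REPORT_THRESHOLD_PERCENT else "below"
    print(f"{seq[:12]}...  count={count}  {pct:.4f}%  ({flag} threshold)")
```

Under these assumptions all five sequences come out above the 0.1% level, the poly-A read only narrowly.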
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGGTATA | 30 | 7.2759576E-12 | 280.3258 | 44 |
| ACGTATG | 20 | 7.9764504E-8 | 280.32578 | 44 |
| CTATCAT | 4925 | 0.0 | 196.93954 | 44 |
| TAAGCCA | 2480 | 0.0 | 167.2912 | 44 |
| ACGACAC | 145 | 0.0 | 164.32892 | 44 |
| TTGACGC | 70 | 0.0 | 160.18617 | 44 |
| ATTGTAG | 140 | 0.0 | 120.139626 | 44 |
| TACGATC | 50 | 7.651965E-6 | 112.13031 | 44 |
| ACTTCTA | 1780 | 0.0 | 106.30332 | 44 |
| AACGACA | 505 | 0.0 | 99.864845 | 43 |
| TGCATGA | 185 | 0.0 | 90.91647 | 44 |
| CGTGTGC | 720 | 0.0 | 89.548515 | 44 |
| ATATATC | 100 | 2.8703653E-8 | 84.09773 | 44 |
| CATACAT | 185 | 0.0 | 83.340096 | 44 |
| AACTACC | 1105 | 0.0 | 81.18032 | 44 |
| ATTAAGC | 2335 | 0.0 | 81.114845 | 42 |
| CTTTACA | 1555 | 0.0 | 79.32048 | 44 |
| CCTATCA | 5695 | 0.0 | 78.24176 | 43 |
| GGTAAGG | 110 | 5.5588316E-8 | 76.452484 | 44 |
| ATAGCCA | 440 | 0.0 | 76.452484 | 44 |