Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR939791_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2449043 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 101 |
| %GC | 45 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 4771 | 0.19481078935731222 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 4102 | 0.1674939966346038 | No Hit |
| GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT | 3483 | 0.14221881771777792 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2723 | 0.11118628786836326 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GTACATG | 9350 | 0.0 | 24.819988 | 1 |
| TACATGG | 9510 | 0.0 | 24.046371 | 2 |
| ACATGGG | 9580 | 0.0 | 22.78741 | 3 |
| GTATCAA | 12160 | 0.0 | 22.730661 | 1 |
| GGTATCA | 9105 | 0.0 | 21.96514 | 1 |
| GAGTACT | 5340 | 0.0 | 20.868864 | 12-13 |
| TCAACGC | 13880 | 0.0 | 19.49853 | 4 |
| CAACGCA | 13975 | 0.0 | 19.399721 | 5 |
| ATCAACG | 13950 | 0.0 | 19.366888 | 3 |
| AACGCAG | 14050 | 0.0 | 19.317488 | 6 |
| CATGGGA | 5475 | 0.0 | 19.204376 | 4 |
| GTACTTT | 5910 | 0.0 | 19.095518 | 14-15 |
| TATCAAC | 14310 | 0.0 | 19.011856 | 2 |
| AGAGTAC | 11310 | 0.0 | 18.164707 | 10-11 |
| CATGGGG | 4085 | 0.0 | 18.005783 | 4 |
| GTCGCGA | 110 | 2.433544E-5 | 17.643318 | 70-71 |
| AGTACTT | 5780 | 0.0 | 17.568245 | 12-13 |
| ACGCAGA | 15385 | 0.0 | 17.51875 | 7 |
| CGCAGAG | 15335 | 0.0 | 17.514414 | 8 |
| GTATAGG | 795 | 0.0 | 17.205963 | 1 |