Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR937571_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 1519082 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 101 |
| %GC | 44 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 8388 | 0.5521755902578004 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 7342 | 0.4833182145532631 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5254 | 0.3458667800684887 | No Hit |
| GTGGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1963 | 0.12922278060038891 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 4010 | 0.0 | 54.16089 | 1 |
| GTATCAA | 7330 | 0.0 | 43.37466 | 1 |
| TATCAAC | 9790 | 0.0 | 32.275063 | 2 |
| ATCAACG | 9775 | 0.0 | 32.130157 | 3 |
| TCAACGC | 9940 | 0.0 | 31.549007 | 4 |
| CAACGCA | 10145 | 0.0 | 30.911499 | 5 |
| AACGCAG | 10420 | 0.0 | 30.221546 | 6 |
| ACGCAGA | 11915 | 0.0 | 26.309996 | 7 |
| CGCAGAG | 11935 | 0.0 | 26.22611 | 8 |
| GTACCGT | 110 | 9.481967E-5 | 25.907736 | 6 |
| GCAGAGT | 12770 | 0.0 | 24.06491 | 9 |
| GAGTACT | 7920 | 0.0 | 23.358942 | 12-13 |
| TAAGGTG | 530 | 0.0 | 23.309107 | 5 |
| GGACCGA | 265 | 3.092282E-11 | 23.300669 | 6 |
| TGGTATC | 1745 | 0.0 | 22.600122 | 2 |
| AGAGTAC | 11795 | 0.0 | 22.067518 | 10-11 |
| AGTACTT | 8345 | 0.0 | 21.742422 | 12-13 |
| GTACTTT | 8505 | 0.0 | 21.584702 | 14-15 |
| CAGAGTA | 12385 | 0.0 | 20.80533 | 10-11 |
| GTGGTAT | 2050 | 0.0 | 20.400593 | 1 |