Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR937020_2.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 760767 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 101 |
| %GC | 44 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 3820 | 0.5021248292841303 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2886 | 0.3793539940612566 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1779 | 0.2338429506011696 | No Hit |
| GTGGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 795 | 0.10449980085887006 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 1895 | 0.0 | 44.899033 | 1 |
| GTATCAA | 3210 | 0.0 | 39.092373 | 1 |
| TCAACGC | 4025 | 0.0 | 30.83061 | 4 |
| ATCAACG | 4030 | 0.0 | 30.79236 | 3 |
| TATCAAC | 4080 | 0.0 | 30.415003 | 2 |
| CAACGCA | 4130 | 0.0 | 30.046783 | 5 |
| AACGCAG | 4260 | 0.0 | 29.34535 | 6 |
| CGCTAAT | 85 | 6.369569E-4 | 27.96782 | 2 |
| CGCAGAG | 4710 | 0.0 | 26.437258 | 8 |
| ACGCAGA | 4740 | 0.0 | 26.368462 | 7 |
| GAGTACT | 2995 | 0.0 | 25.069944 | 12-13 |
| GCAGAGT | 5020 | 0.0 | 24.615326 | 9 |
| TGGTATC | 1005 | 0.0 | 23.181286 | 2 |
| TACATGG | 2475 | 0.0 | 23.052261 | 2 |
| AGAGTAC | 4300 | 0.0 | 22.323471 | 10-11 |
| AGTACTT | 3205 | 0.0 | 22.018694 | 12-13 |
| GTACTTT | 3335 | 0.0 | 22.018257 | 14-15 |
| GTGGTAT | 1090 | 0.0 | 21.804026 | 1 |
| GTATAGG | 175 | 4.022131E-6 | 21.729269 | 1 |
| GTACATG | 2690 | 0.0 | 21.557615 | 1 |