Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR938867_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 4140731 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 101 |
| %GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 15201 | 0.36710909257326785 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 13246 | 0.3198952069091182 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 8537 | 0.20617132578764474 | No Hit |
| GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT | 4661 | 0.11256466551437415 | No Hit |
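
The Percentage column appears to be each Count expressed as a share of the Total Sequences value in Basic Statistics. The following is a minimal sketch that recomputes it from the numbers in this report. The names `TOTAL_SEQUENCES`, `REPORTED`, and `percent_of_total` are illustrative and are not part of FastQC, and the 1e-9 tolerance is an arbitrary assumption.

```python
# Values copied from the report above.
TOTAL_SEQUENCES = 4140731  # "Total Sequences" in Basic Statistics

# (count, percentage as reported) for the four overrepresented
# sequences, in table order.
REPORTED = [
    (15201, 0.36710909257326785),
    (13246, 0.3198952069091182),
    (8537, 0.20617132578764474),
    (4661, 0.11256466551437415),
]


def percent_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total."""
    return 100.0 * count / total


for count, reported in REPORTED:
    computed = percent_of_total(count)
    # Assumption: 1e-9 is an arbitrary tolerance for float comparison.
    status = "ok" if abs(computed - reported) < 1e-9 else "MISMATCH"
    print(f"{count:>6}  computed={computed:.6f}%  reported={reported:.6f}%  {status}")
```

Under this reading, the four listed sequences together account for roughly 1% of the 4140731 reads.
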
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 14255 | 0.0 | 31.543785 | 1 |
| GTATCAA | 20495 | 0.0 | 28.633326 | 1 |
| GTACATG | 16235 | 0.0 | 23.383799 | 1 |
| ATCAACG | 25040 | 0.0 | 23.194231 | 3 |
| TATCAAC | 25370 | 0.0 | 23.11697 | 2 |
| TCAACGC | 25225 | 0.0 | 23.024124 | 4 |
| CAACGCA | 25635 | 0.0 | 22.655882 | 5 |
| TACATGG | 16805 | 0.0 | 22.16481 | 2 |
| AACGCAG | 26230 | 0.0 | 22.123867 | 6 |
| ACATGGG | 16235 | 0.0 | 21.773932 | 3 |
| GAGTACT | 15455 | 0.0 | 20.909441 | 12-13 |
| TAAGGTG | 1670 | 0.0 | 20.173197 | 5 |
| GTACTTT | 16255 | 0.0 | 19.909563 | 14-15 |
| CATGGGG | 8525 | 0.0 | 19.759085 | 4 |
| AGTACTT | 16135 | 0.0 | 19.74883 | 12-13 |
| ACGCAGA | 29205 | 0.0 | 19.740217 | 7 |
| CGCAGAG | 29755 | 0.0 | 19.34344 | 8 |
| AGAGTAC | 24780 | 0.0 | 19.225998 | 10-11 |
| GCAGAGT | 31420 | 0.0 | 18.107193 | 9 |
| GTACACG | 735 | 0.0 | 17.497879 | 1 |
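
Many of the 7-mers above overlap by six bases, which suggests they are windows over a small number of longer sequences near the read start. As a rough sketch (not something FastQC does itself), the k-mers can be chained greedily: starting from a seed, append the last base of the single unused k-mer whose first six bases match the current last six, and stop when there is no match or more than one. The function name `extend_right` and the choice of seeds are assumptions made for illustration.

```python
# 7-mers copied from the Kmer Content table above.
KMERS = [
    "GGTATCA", "GTATCAA", "GTACATG", "ATCAACG", "TATCAAC",
    "TCAACGC", "CAACGCA", "TACATGG", "AACGCAG", "ACATGGG",
    "GAGTACT", "TAAGGTG", "GTACTTT", "CATGGGG", "AGTACTT",
    "ACGCAGA", "CGCAGAG", "AGAGTAC", "GCAGAGT", "GTACACG",
]


def extend_right(seed, kmers):
    """Greedily extend seed rightwards using (k-1)-base overlaps.

    Stops as soon as zero or several unused k-mers could follow,
    so the result is only the unambiguous part of the chain.
    """
    k = len(seed)
    contig = seed
    used = {seed}
    while True:
        suffix = contig[-(k - 1):]
        candidates = [m for m in kmers if m not in used and m.startswith(suffix)]
        if len(candidates) != 1:
            return contig
        used.add(candidates[0])
        contig += candidates[0][-1]


# Hypothetical seeds: k-mers that no other listed k-mer leads into.
for seed in ("GGTATCA", "AGAGTAC", "GTACATG"):
    print(seed, "->", extend_right(seed, KMERS))
```

With these seeds the chains come out as `GGTATCAACGCAGAGT`, `AGAGTACTTT`, and `GTACATGGGG`. The first two are consistent with the start of the overrepresented sequence `GGTATCAACGCAGAGTACTTTT...` listed earlier; the gap between them arises only because `CAGAGTA` is not among the rows shown.
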