Basic Statistics
Measure | Value |
---|---|
Filename | SRR936017_1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 2000710 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 43 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 7586 | 0.37916539628431906 | No Hit |
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5566 | 0.2782012385603111 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 3940 | 0.19693008981811455 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGTATCA | 840 | 0.0 | 47.82651 | 100-101 |
GGTATCA | 6300 | 0.0 | 42.878555 | 1 |
CGGTATC | 585 | 0.0 | 39.67829 | 100-101 |
CGTACAT | 1210 | 0.0 | 33.939697 | 100-101 |
ACGTATC | 360 | 0.0 | 33.065247 | 100-101 |
GCGTATC | 265 | 0.0 | 32.566147 | 100-101 |
CCGCGTA | 75 | 1.4760553E-7 | 31.742634 | 98-99 |
TACGCAG | 700 | 0.0 | 31.034094 | 100-101 |
GTATCAA | 11950 | 0.0 | 30.771267 | 1 |
CCGTATC | 410 | 0.0 | 29.032898 | 100-101 |
CCGTACA | 535 | 0.0 | 28.924362 | 100-101 |
CGAGTAC | 775 | 0.0 | 28.41478 | 100-101 |
TCGAGTA | 370 | 0.0 | 28.150139 | 100-101 |
AGTATCA | 2320 | 0.0 | 28.091206 | 100-101 |
AGGTATC | 1840 | 0.0 | 27.171003 | 100-101 |
TCGGTAT | 260 | 0.0 | 26.325018 | 100-101 |
CCTGTCG | 645 | 0.0 | 26.295443 | 96-97 |
CCGGTAT | 250 | 0.0 | 26.187672 | 100-101 |
CTCGGTA | 280 | 0.0 | 25.507475 | 98-99 |
CGCGTAC | 70 | 1.05445986E-4 | 25.507475 | 98-99 |