Basic Statistics
Measure | Value |
---|---|
Filename | SRR4062098_1.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 769357 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 50 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTTCATTAAT | 910 | 0.11828059015515553 | No Hit |
GATTAAGAGGGACGGCCGGGGGCATTCGTATTGCGCCGCTAGAGGTGAAA | 881 | 0.11451120871065058 | No Hit |
GAATAGGACCGCGGTTCTATTTTGTTGGTTTTCGGAACTGAGGCCATGAT | 829 | 0.10775231784464169 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ACACCGC | 50 | 2.3609045E-6 | 30.798452 | 6 |
GTATCAA | 895 | 0.0 | 24.586163 | 1 |
GGTATCA | 455 | 0.0 | 24.180899 | 1 |
CACACCG | 55 | 1.5934266E-4 | 23.998796 | 5 |
ATACCGT | 215 | 0.0 | 23.5337 | 6 |
TACCGTC | 220 | 0.0 | 22.998846 | 7 |
ACCGTCG | 225 | 0.0 | 22.48776 | 8 |
AGTTCGC | 60 | 2.8711758E-4 | 22.000326 | 32 |
TAATACC | 70 | 3.214641E-5 | 21.998894 | 4 |
TGGTCGA | 50 | 0.0025803456 | 21.998894 | 10 |
TATACTT | 115 | 8.763891E-9 | 21.042421 | 5 |
CCGTCGT | 235 | 0.0 | 20.59471 | 9 |
CGTCGTA | 250 | 0.0 | 20.238985 | 10 |
CTATTCC | 170 | 5.456968E-12 | 19.410791 | 4 |
TAGTCGA | 80 | 8.9821406E-5 | 19.250284 | 36 |
GGACCGT | 80 | 8.986625E-5 | 19.249035 | 6 |
CGGTCCA | 195 | 0.0 | 19.178524 | 10 |
GTATAAT | 105 | 1.7873681E-6 | 18.8611 | 1 |
AGTCGAT | 70 | 8.118852E-4 | 18.85742 | 37 |
CTATACT | 105 | 1.791821E-6 | 18.856195 | 4 |