Basic Statistics
Measure | Value |
---|---|
Filename | SRR522061_1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 20397597 |
Sequences flagged as poor quality | 0 |
Sequence length | 53 |
%GC | 45 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AAGCAGTGGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 354903 | 1.739925541229195 | No Hit |
AAGCAGTGGTATCAACGCAGAGAACTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 59798 | 0.29316198373759417 | No Hit |
AAGCAGTGGTATCAACGCAGAGTACATGGGGGGCTGGTGAGATGGCTCAGTGG | 38676 | 0.18961057030394315 | No Hit |
AAGCAGTGGTATCAACGCAGAGTACATGGGGGGGCTGGTGAGATGGCTCAGTG | 26477 | 0.12980450589351286 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AAGCAGT | 212885 | 0.0 | 44.444508 | 1 |
AGTGGTA | 216730 | 0.0 | 43.490124 | 5 |
ATCAACG | 218750 | 0.0 | 43.435516 | 11 |
GTGGTAT | 216975 | 0.0 | 43.42369 | 6 |
TCAACGC | 219045 | 0.0 | 43.41564 | 12 |
CAACGCA | 219670 | 0.0 | 43.348816 | 13 |
GTATCAA | 218225 | 0.0 | 43.26326 | 9 |
GGTATCA | 218310 | 0.0 | 43.25933 | 8 |
ACGCAGA | 219880 | 0.0 | 43.21764 | 15 |
AGCAGTG | 219615 | 0.0 | 43.18632 | 2 |
AACGCAG | 220540 | 0.0 | 43.170353 | 14 |
TATCAAC | 218850 | 0.0 | 43.133263 | 10 |
TGGTATC | 218460 | 0.0 | 43.11668 | 7 |
GCAGTGG | 220440 | 0.0 | 43.100384 | 3 |
CGCAGAG | 220765 | 0.0 | 43.038002 | 16 |
CAGTGGT | 221615 | 0.0 | 42.585564 | 4 |
AGAGTAC | 188885 | 0.0 | 42.32824 | 19 |
CAGAGTA | 192235 | 0.0 | 41.75564 | 18 |
ACATGGG | 151500 | 0.0 | 41.626797 | 24 |
GAGTACT | 60000 | 0.0 | 41.434414 | 20 |