Basic Statistics
Measure | Value |
---|---|
Filename | SRR522119_1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 28662645 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 42 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 187431 | 0.6539208087739286 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 142362 | 0.49668130767415214 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 91061 | 0.31769922140821266 | No Hit |
TCTCTGAGCGGGCTGGCAAGGCAGACCGATATCGTATGCCGTCTTCTGCT | 69505 | 0.24249332188288975 | Illumina PCR Primer Index 1 (95% over 24bp) |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 34495 | 0.12034827909287507 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 92065 | 0.0 | 37.84613 | 1 |
TATGCCG | 14675 | 0.0 | 33.25229 | 35 |
GCCGTCT | 14875 | 0.0 | 32.221302 | 38 |
CCGTCTT | 15050 | 0.0 | 31.773886 | 39 |
GACCGAT | 15615 | 0.0 | 31.405935 | 24 |
TCGTATG | 15600 | 0.0 | 31.302803 | 32 |
CGTATGC | 15705 | 0.0 | 31.125566 | 33 |
ATGCCGT | 15390 | 0.0 | 31.027218 | 36 |
TATCGTA | 10965 | 0.0 | 30.804695 | 30 |
ATATCGT | 11020 | 0.0 | 30.6297 | 29 |
ACCGATA | 11200 | 0.0 | 30.59117 | 25 |
ATCGTAT | 11375 | 0.0 | 30.335348 | 31 |
AGACCGA | 16900 | 0.0 | 30.305517 | 23 |
TGCCGTC | 15910 | 0.0 | 30.096416 | 37 |
CGATATC | 11390 | 0.0 | 29.965895 | 27 |
GATATCG | 11655 | 0.0 | 29.285326 | 28 |
CGTCTTC | 16465 | 0.0 | 29.09805 | 40 |
CCGATAT | 11835 | 0.0 | 28.838865 | 26 |
GCGGGCT | 18385 | 0.0 | 28.773565 | 8 |
CGGGCTG | 18920 | 0.0 | 28.029663 | 9 |