Basic Statistics
Measure | Value |
---|---|
Filename | SRR522110_1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 32856475 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 43 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 151493 | 0.4610750240249448 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 136069 | 0.41413146115035165 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 78001 | 0.23739917322232526 | No Hit |
TCTCTGAGCGGGCTGGCAAGGCAGACCGATATCGTATGCCGTCTTCTGCT | 41905 | 0.1275395489017005 | Illumina PCR Primer Index 1 (95% over 24bp) |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 39134 | 0.1191058992177341 | No Hit |
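
The Percentage column above appears to be each Count expressed as a share of Total Sequences (32856475). The short Python sketch below reproduces that arithmetic from the values in the tables. The function name `percentage` and the variable names are illustrative only and are not part of FastQC.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
total_sequences = 32856475

counts = {
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 151493,
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 136069,
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 78001,
    "TCTCTGAGCGGGCTGGCAAGGCAGACCGATATCGTATGCCGTCTTCTGCT": 41905,
    "ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 39134,
}


def percentage(count, total):
    """Return count as a percentage of total (hypothetical helper)."""
    return 100.0 * count / total


for sequence, count in counts.items():
    print(f"{sequence}\t{count}\t{percentage(count, total_sequences):.4f}")
```

Summing the five counts the same way gives the combined share of reads taken up by the listed sequences.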
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 66940 | 0.0 | 35.510654 | 1 |
TATGCCG | 9880 | 0.0 | 33.098167 | 35 |
TCGTATG | 10710 | 0.0 | 30.473629 | 32 |
GACCGAT | 11015 | 0.0 | 30.31559 | 24 |
CGTATGC | 11140 | 0.0 | 29.353182 | 33 |
CCGTCTT | 11000 | 0.0 | 28.787561 | 39 |
GCCGTCT | 11030 | 0.0 | 28.451637 | 38 |
GCGGGCT | 13045 | 0.0 | 28.119083 | 8 |
ATATCGT | 7445 | 0.0 | 28.081064 | 29 |
TATCGTA | 7480 | 0.0 | 28.008467 | 30 |
ATGCCGT | 11425 | 0.0 | 27.812826 | 36 |
AGACCGA | 12785 | 0.0 | 27.649544 | 23 |
ACCGATA | 7775 | 0.0 | 27.50314 | 25 |
CGATATC | 7795 | 0.0 | 27.467196 | 27 |
ATCGTAT | 7820 | 0.0 | 27.240433 | 31 |
GATATCG | 7990 | 0.0 | 26.907005 | 28 |
CGGGCTG | 13810 | 0.0 | 26.641056 | 9 |
TGAGCGG | 13985 | 0.0 | 26.592192 | 5 |
CCGATAT | 8075 | 0.0 | 26.460192 | 26 |
TGCCGTC | 12040 | 0.0 | 26.30149 | 37 |
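
The enriched k-mers can be cross-checked against the overrepresented sequences listed earlier. The sketch below assumes that "Max Obs/Exp Position" is the 1-based start of the k-mer within the read; that is an interpretation of the column, not something this report states. The labels `poly_t_read` and `primer_like_read` and the helper `one_based_offsets` are made up for illustration.

```python
# Two sequences copied from the Overrepresented sequences table.
overrepresented = {
    "poly_t_read": "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT",
    "primer_like_read": "TCTCTGAGCGGGCTGGCAAGGCAGACCGATATCGTATGCCGTCTTCTGCT",
}

# k-mer -> reported Max Obs/Exp Position, copied from the Kmer Content table.
kmer_positions = {
    "GGTATCA": 1, "TATGCCG": 35, "TCGTATG": 32, "GACCGAT": 24,
    "CGTATGC": 33, "CCGTCTT": 39, "GCCGTCT": 38, "GCGGGCT": 8,
    "ATATCGT": 29, "TATCGTA": 30, "ATGCCGT": 36, "AGACCGA": 23,
    "ACCGATA": 25, "CGATATC": 27, "ATCGTAT": 31, "GATATCG": 28,
    "CGGGCTG": 9, "TGAGCGG": 5, "CCGATAT": 26, "TGCCGTC": 37,
}


def one_based_offsets(kmer, sequence):
    """Return every 1-based start offset of kmer in sequence."""
    offsets = []
    start = sequence.find(kmer)
    while start != -1:
        offsets.append(start + 1)
        start = sequence.find(kmer, start + 1)
    return offsets


def matching_sources(kmer, reported_position):
    """Labels of sequences that contain kmer at the reported position."""
    return [
        label
        for label, sequence in overrepresented.items()
        if reported_position in one_based_offsets(kmer, sequence)
    ]


for kmer, position in kmer_positions.items():
    print(kmer, position, matching_sources(kmer, position))
```

Under that assumption, GGTATCA at position 1 lines up with the start of the first overrepresented read, and the other k-mers in the table line up with offsets inside the read annotated as Illumina PCR Primer Index 1. This suggests the Kmer Content entries may reflect the same contaminating sequences rather than an independent signal.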