Basic Statistics
Measure | Value |
---|---|
Filename | SRR522081_1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 32005877 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 44 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
TCTCTGAGCGGGCTGGCAAGGCAGACCGATATCGTATGCCGTCTTCTGCT | 63910 | 0.19968207713852051 | Illumina PCR Primer Index 1 (95% over 24bp) |
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 47325 | 0.1478634689497807 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 45116 | 0.14096161151903444 | No Hit |
TCTCTGAGCGGGCTGGCAAGGCAGACCGATCTCGTATGCCGTCTTCTGCT | 36202 | 0.11311047655404038 | Illumina Paired End PCR Primer 2 (96% over 29bp) |
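
The Percentage column above appears to be each sequence's Count divided by Total Sequences from Basic Statistics, times 100. The sketch below reproduces that arithmetic from the values in this report. The formula is an assumption checked against the numbers, not a statement of how the tool computes it internally, and the names `TOTAL_SEQUENCES`, `overrepresented`, and `percentage` are illustrative.

```python
# Values copied from the tables in this report.
TOTAL_SEQUENCES = 32005877

overrepresented = {
    "TCTCTGAGCGGGCTGGCAAGGCAGACCGATATCGTATGCCGTCTTCTGCT": 63910,
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 47325,
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 45116,
    "TCTCTGAGCGGGCTGGCAAGGCAGACCGATCTCGTATGCCGTCTTCTGCT": 36202,
}


def percentage(count, total=TOTAL_SEQUENCES):
    """Share of all reads, in percent (assumed formula: 100 * count / total)."""
    return 100.0 * count / total


if __name__ == "__main__":
    for seq, count in overrepresented.items():
        print(f"{seq}  {count:>6}  {percentage(count):.4f}%")
```

Taken together, the four listed sequences account for well under 1% of the reads.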
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTTCTGC | 31520 | 0.0 | 43.766163 | 43 |
TTCTGCT | 31485 | 0.0 | 43.713993 | 44 |
TCTTCTG | 36445 | 0.0 | 37.76574 | 42 |
TATGCCG | 40890 | 0.0 | 35.624416 | 35 |
CCGTCTT | 41270 | 0.0 | 35.2432 | 39 |
CGTCTTC | 41695 | 0.0 | 35.203007 | 40 |
GCCGTCT | 41495 | 0.0 | 35.076496 | 38 |
CGTATGC | 42890 | 0.0 | 33.97034 | 33 |
TCGTATG | 43470 | 0.0 | 33.92927 | 32 |
ATGCCGT | 43020 | 0.0 | 33.892418 | 36 |
TGCCGTC | 43115 | 0.0 | 33.793037 | 37 |
GACCGAT | 43750 | 0.0 | 33.09156 | 24 |
AGACCGA | 45500 | 0.0 | 32.61835 | 23 |
GCGGGCT | 46670 | 0.0 | 32.3102 | 8 |
ATATCGT | 26470 | 0.0 | 32.272503 | 29 |
GGTATCA | 54605 | 0.0 | 32.24235 | 1 |
TGAGCGG | 46975 | 0.0 | 32.03785 | 5 |
ATCGTAT | 27220 | 0.0 | 31.918694 | 31 |
TATCGTA | 26835 | 0.0 | 31.835087 | 30 |
ACCGATA | 26860 | 0.0 | 31.834269 | 25 |
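
Many of the k-mers above are substrings of the first overrepresented sequence (the Illumina PCR Primer Index 1 hit), and their Max Obs/Exp Position values coincide with where they occur in it. The following sketch checks that for a few rows, assuming positions are 1-based. It is only a consistency check on this report's data, and `one_based_position` is a hypothetical helper, not part of any tool.

```python
# First overrepresented sequence from this report (primer/adapter hit).
PRIMER_HIT = "TCTCTGAGCGGGCTGGCAAGGCAGACCGATATCGTATGCCGTCTTCTGCT"

# k-mer -> Max Obs/Exp Position, copied from the Kmer Content table.
reported_positions = {
    "CTTCTGC": 43,
    "TTCTGCT": 44,
    "TATGCCG": 35,
    "GACCGAT": 24,
    "GCGGGCT": 8,
    "TGAGCGG": 5,
    "ATATCGT": 29,
}


def one_based_position(kmer, sequence):
    """Return the 1-based index of the first occurrence, or None if absent."""
    idx = sequence.find(kmer)
    return idx + 1 if idx >= 0 else None


if __name__ == "__main__":
    for kmer, reported in reported_positions.items():
        found = one_based_position(kmer, PRIMER_HIT)
        status = "matches" if found == reported else "differs"
        print(f"{kmer}: reported {reported}, found {found} ({status})")
```

GGTATCA (position 1) does not occur in this sequence; it is instead the start of the second overrepresented sequence.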