Basic Statistics
Measure | Value |
---|---|
Filename | SRR522100_1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 37601870 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AAGCAGTGGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 111731 | 0.29714213681394036 | No Hit |
GATCGGAAGAGCGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAG | 69387 | 0.18453071615853148 | Illumina Paired End PCR Primer 2 (97% over 36bp) |
CGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAATGCCG | 60085 | 0.1597925847836823 | Illumina Paired End PCR Primer 2 (100% over 31bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TCAACGC | 38640 | 0.0 | 29.713158 | 12 |
ATCAACG | 39280 | 0.0 | 29.23459 | 11 |
CAACGCA | 39760 | 0.0 | 28.726923 | 13 |
ACGCAGA | 40800 | 0.0 | 27.795364 | 15 |
AACGCAG | 41050 | 0.0 | 27.781347 | 14 |
AAGCAGT | 43480 | 0.0 | 27.319805 | 1 |
GTATCAA | 42435 | 0.0 | 27.241247 | 9 |
GGTATCA | 42605 | 0.0 | 27.106726 | 8 |
GTGGTAT | 43310 | 0.0 | 26.736734 | 6 |
CGCAGAG | 42575 | 0.0 | 26.590044 | 16 |
TGGTATC | 43960 | 0.0 | 26.266203 | 7 |
AGTGGTA | 44730 | 0.0 | 26.06033 | 5 |
TATCAAC | 44995 | 0.0 | 25.785149 | 10 |
AGAGTAC | 42825 | 0.0 | 25.504534 | 19 |
CAGAGTA | 45535 | 0.0 | 24.149675 | 18 |
AGCAGTG | 49340 | 0.0 | 24.078188 | 2 |
GCAGAGT | 46775 | 0.0 | 23.844473 | 17 |
TCGTATG | 3170 | 0.0 | 23.212801 | 40 |
GAGTACT | 26305 | 0.0 | 23.07477 | 20 |
TATGCCG | 3300 | 0.0 | 22.233818 | 43 |