Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR522100_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 37601870 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| AAGCAGTGGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 111731 | 0.29714213681394036 | No Hit |
| GATCGGAAGAGCGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAG | 69387 | 0.18453071615853148 | Illumina Paired End PCR Primer 2 (97% over 36bp) |
| CGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAATGCCG | 60085 | 0.1597925847836823 | Illumina Paired End PCR Primer 2 (100% over 31bp) |
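
The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics, times 100. The sketch below recomputes it as a sanity check. The helper name `percent_of_total` is hypothetical, and the formula is an assumption inferred from the numbers in this report, not taken from the tool's source.

```python
# Assumption: Percentage = 100 * Count / Total Sequences.
TOTAL_SEQUENCES = 37601870  # "Total Sequences" row in Basic Statistics


def percent_of_total(count, total=TOTAL_SEQUENCES):
    """Return `count` as a percentage of all reads in the file."""
    return 100.0 * count / total


# Counts copied from the Overrepresented sequences table.
for count in (111731, 69387, 60085):
    print(count, round(percent_of_total(count), 4))
```

Rounded to four decimals, the results agree with the reported percentages, so the three listed sequences together account for roughly 0.64% of reads.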
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TCAACGC | 38640 | 0.0 | 29.713158 | 12 |
| ATCAACG | 39280 | 0.0 | 29.23459 | 11 |
| CAACGCA | 39760 | 0.0 | 28.726923 | 13 |
| ACGCAGA | 40800 | 0.0 | 27.795364 | 15 |
| AACGCAG | 41050 | 0.0 | 27.781347 | 14 |
| AAGCAGT | 43480 | 0.0 | 27.319805 | 1 |
| GTATCAA | 42435 | 0.0 | 27.241247 | 9 |
| GGTATCA | 42605 | 0.0 | 27.106726 | 8 |
| GTGGTAT | 43310 | 0.0 | 26.736734 | 6 |
| CGCAGAG | 42575 | 0.0 | 26.590044 | 16 |
| TGGTATC | 43960 | 0.0 | 26.266203 | 7 |
| AGTGGTA | 44730 | 0.0 | 26.06033 | 5 |
| TATCAAC | 44995 | 0.0 | 25.785149 | 10 |
| AGAGTAC | 42825 | 0.0 | 25.504534 | 19 |
| CAGAGTA | 45535 | 0.0 | 24.149675 | 18 |
| AGCAGTG | 49340 | 0.0 | 24.078188 | 2 |
| GCAGAGT | 46775 | 0.0 | 23.844473 | 17 |
| TCGTATG | 3170 | 0.0 | 23.212801 | 40 |
| GAGTACT | 26305 | 0.0 | 23.07477 | 20 |
| TATGCCG | 3300 | 0.0 | 22.233818 | 43 |
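
Most of the enriched k-mers above look like overlapping windows of the most frequent overrepresented sequence. The sketch below checks this by testing whether each k-mer occurs in that sequence at its reported Max Obs/Exp Position. It assumes positions are 1-based offsets into the read; the names `TOP_SEQ`, `KMER_POSITIONS`, and `kmers_explained_by` are hypothetical.

```python
# Most frequent entry in the Overrepresented sequences table.
TOP_SEQ = "AAGCAGTGGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT"

# k-mer -> Max Obs/Exp Position, copied from the Kmer Content table.
KMER_POSITIONS = {
    "TCAACGC": 12, "ATCAACG": 11, "CAACGCA": 13, "ACGCAGA": 15,
    "AACGCAG": 14, "AAGCAGT": 1, "GTATCAA": 9, "GGTATCA": 8,
    "GTGGTAT": 6, "CGCAGAG": 16, "TGGTATC": 7, "AGTGGTA": 5,
    "TATCAAC": 10, "AGAGTAC": 19, "CAGAGTA": 18, "AGCAGTG": 2,
    "GCAGAGT": 17, "TCGTATG": 40, "GAGTACT": 20, "TATGCCG": 43,
}


def kmers_explained_by(seq, kmer_positions):
    """Return k-mers found in `seq` at their reported position.

    Assumption: positions are 1-based, so position p maps to index p - 1.
    """
    return [
        kmer
        for kmer, pos in kmer_positions.items()
        if seq[pos - 1 : pos - 1 + len(kmer)] == kmer
    ]


explained = kmers_explained_by(TOP_SEQ, KMER_POSITIONS)
unexplained = sorted(set(KMER_POSITIONS) - set(explained))
print(len(explained), "of", len(KMER_POSITIONS), "k-mers match")
print("not explained:", unexplained)
```

Under that assumption, 18 of the 20 k-mers match the first 26 bases of the sequence. TCGTATG and TATGCCG, which peak at positions 40 and 43, do not match and presumably come from a different source.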