Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR522940_2.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 46729016 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 47 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT | 187567 | 0.4013930017272352 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 77089 | 0.164970304531985 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 62543 | 0.13384189386739923 | No Hit |
| GTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTAT | 48560 | 0.10391830206739212 | No Hit |
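
The Percentage column above appears to be each Count divided by the Total Sequences value from Basic Statistics (46729016), times 100. A minimal Python sketch to check that arithmetic follows; the function name `percentage_of_total` and the dictionary layout are illustrative assumptions, not part of the FastQC output.

```python
# Sanity check: recompute the "Percentage" column from "Count" and "Total Sequences".
# Values are copied from the tables in this report; names are hypothetical.

TOTAL_SEQUENCES = 46_729_016  # "Total Sequences" in Basic Statistics

# sequence -> (reported count, reported percentage)
OVERREPRESENTED = {
    "GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT": (187_567, 0.4013930017272352),
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": (77_089, 0.164970304531985),
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": (62_543, 0.13384189386739923),
    "GTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTAT": (48_560, 0.10391830206739212),
}


def percentage_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total."""
    return 100.0 * count / total


if __name__ == "__main__":
    for seq, (count, reported) in OVERREPRESENTED.items():
        computed = percentage_of_total(count, TOTAL_SEQUENCES)
        # Print a shortened sequence, the recomputed value, and the reported value.
        print(f"{seq[:20]}...  computed={computed:.6f}%  reported={reported:.6f}%")
```

If the recomputed and reported values agree to several decimal places, the percentages are relative to all reads in the file rather than to a subsample.
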
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GTACATG | 152570 | 0.0 | 27.381477 | 1 |
| TACATGG | 155160 | 0.0 | 26.40943 | 2 |
| ACATGGG | 158285 | 0.0 | 25.014338 | 3 |
| CATGGGG | 103805 | 0.0 | 23.069338 | 4 |
| TACCTGG | 42145 | 0.0 | 21.291359 | 2 |
| ATGGGGG | 55745 | 0.0 | 18.643547 | 5 |
| GAGTACA | 118150 | 0.0 | 18.059322 | 1 |
| GAGTACT | 85900 | 0.0 | 17.664635 | 12-13 |
| CCGTATC | 12645 | 0.0 | 17.09368 | 94 |
| AGAGTAC | 154760 | 0.0 | 17.025291 | 10-11 |
| AGTACAT | 114375 | 0.0 | 16.99438 | 2 |
| GTACTTT | 91230 | 0.0 | 16.470661 | 14-15 |
| AGTACTT | 88960 | 0.0 | 16.346416 | 12-13 |
| GTACCTG | 58770 | 0.0 | 16.036457 | 1 |
| ACCTGGG | 53425 | 0.0 | 15.977188 | 3 |
| CGCCGTA | 16930 | 0.0 | 15.653762 | 94 |
| TATACTG | 18110 | 0.0 | 15.3985 | 5 |
| GTATAGG | 11250 | 0.0 | 15.3419285 | 1 |
| CGTATCA | 9785 | 0.0 | 15.126773 | 94 |
| TCGCCGT | 24725 | 0.0 | 14.728618 | 94 |