Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1378443.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 374415 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 51 |
| %GC | 44 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| AAGAGCGGTTCAGCAGGAATGCCGAGACCGATATGAATCTCGTATGCCGTC | 2966 | 0.7921691171560968 | Illumina Paired End PCR Primer 2 (97% over 34bp) |
| CGGAAGAGCGGTTCAGCAGGAATGCCGAGACCGATATGAATCTCGTATGCC | 2087 | 0.5574028818289866 | Illumina Paired End PCR Primer 2 (97% over 37bp) |
| AAGAGCGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAAT | 1738 | 0.4641908043214081 | Illumina Paired End PCR Primer 2 (96% over 30bp) |
| TTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAATGCCGAGAC | 741 | 0.1979087376307039 | Illumina Paired End PCR Primer 2 (100% over 35bp) |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TATGCCG | 385 | 0.0 | 38.009354 | 43 |
| TGCCGTC | 400 | 0.0 | 37.14683 | 45 |
| ATGCCGT | 410 | 0.0 | 36.24081 | 44 |
| AGCGTCT | 30 | 0.005138174 | 30.005611 | 41 |
| CGTATGC | 560 | 0.0 | 26.120955 | 41 |
| CCGATAT | 555 | 0.0 | 25.940401 | 28 |
| ACCGATA | 555 | 0.0 | 25.940401 | 27 |
| CTCGTAT | 565 | 0.0 | 25.879425 | 39 |
| TCGTATG | 565 | 0.0 | 25.879425 | 40 |
| CGATATG | 565 | 0.0 | 25.879425 | 29 |
| TCTCGTA | 575 | 0.0 | 25.42935 | 38 |
| AGACCGA | 575 | 0.0 | 25.038128 | 25 |
| GTATGCC | 595 | 0.0 | 24.587715 | 42 |
| ATCTCGT | 595 | 0.0 | 24.57458 | 37 |
| AATCTCG | 595 | 0.0 | 24.57458 | 36 |
| GACCGAT | 590 | 0.0 | 24.401564 | 26 |
| GAGACCG | 595 | 0.0 | 24.196508 | 24 |
| ATGAATC | 620 | 0.0 | 23.58367 | 33 |
| CGAGACC | 620 | 0.0 | 23.220842 | 23 |
| TCGGAAG | 320 | 0.0 | 23.198166 | 28 |