Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1378110.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 651364 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 51 |
| %GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| AAGAGCGGTTCAGCAGGAATGCCGAGACCGTTCTCCATCTCGTATGCCGTC | 2196 | 0.33713868129033847 | Illumina Paired End PCR Primer 2 (97% over 35bp) |
| TCCAGGGATTTATAAGCCGATGACGTCATAACATCCCTGACCCTTTAAATA | 1463 | 0.22460559687056697 | No Hit |
| CGGAAGAGCGGTTCAGCAGGAATGCCGAGACCGTTCTCCATCTCGTATGCC | 1302 | 0.1998882345355285 | Illumina Paired End PCR Primer 2 (97% over 38bp) |
| TCGTTGGAATTCCTCGGGGAATTCGGTATTCCCAGGCGGTCTCCCATCCAA | 1071 | 0.16442419292438637 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GACGTCA | 160 | 0.0 | 43.58951 | 22 |
| TAAGCCG | 160 | 0.0 | 43.58951 | 13 |
| CGTCATA | 160 | 0.0 | 43.58951 | 24 |
| TGCCGTC | 175 | 0.0 | 42.450516 | 45 |
| CCGATGA | 165 | 0.0 | 42.268616 | 17 |
| TTCGGTA | 120 | 0.0 | 41.245987 | 22 |
| TCGGTAT | 120 | 0.0 | 41.245987 | 23 |
| TGACGTC | 170 | 0.0 | 41.02542 | 21 |
| AGCCGAT | 175 | 0.0 | 39.853268 | 15 |
| GCCGATG | 175 | 0.0 | 39.853268 | 16 |
| ATTCGGT | 125 | 0.0 | 39.59615 | 21 |
| ATGACGT | 180 | 0.0 | 38.74623 | 20 |
| CGATGAC | 180 | 0.0 | 38.74623 | 18 |
| AATTCGG | 130 | 0.0 | 38.073223 | 20 |
| TATGCCG | 200 | 0.0 | 37.121387 | 43 |
| AAGCCGA | 195 | 0.0 | 35.765755 | 14 |
| ACGTCAT | 195 | 0.0 | 35.765755 | 23 |
| GATGACG | 195 | 0.0 | 35.765755 | 19 |
| AGGCGGT | 140 | 0.0 | 35.353703 | 34 |
| CTCGGGG | 115 | 0.0 | 35.21397 | 13 |