Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1378821.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 1211832 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 51 |
| %GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| AAGAGCGGTTCAGCAGGAATGCCGAGACCGCTATTGATCTCGTATGCCGTC | 1580 | 0.1303811089325913 | Illumina Paired End PCR Primer 2 (96% over 32bp) |
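
The Percentage column appears to be the sequence count as a share of Total Sequences from Basic Statistics. A minimal sketch of that arithmetic, assuming this is how the figure is derived; the function name `percent_of_total` is illustrative and not part of FastQC:

```python
# Values copied from the tables in this report.
TOTAL_SEQUENCES = 1211832        # Basic Statistics: Total Sequences
OVERREPRESENTED_COUNT = 1580     # Overrepresented sequences: Count


def percent_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total (hypothetical helper)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return count / total * 100


pct = percent_of_total(OVERREPRESENTED_COUNT, TOTAL_SEQUENCES)
print(f"{pct:.4f}% of reads")  # compare with the Percentage column above
```

At roughly 0.13% of reads, this sequence sits just above the 0.1% level at which FastQC lists a sequence as overrepresented.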
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TATGCCG | 105 | 0.0 | 34.284615 | 43 |
| TGCCGTC | 120 | 0.0 | 29.999037 | 45 |
| ATGCCGT | 140 | 0.0 | 25.713463 | 44 |
| TCGGTAC | 55 | 1.3685608E-4 | 24.544666 | 23 |
| ACTGCGC | 60 | 2.4674632E-4 | 22.500208 | 9 |
| ATCGCGC | 55 | 0.0039373916 | 20.453888 | 22 |
| TCGCGCC | 55 | 0.0039373916 | 20.453888 | 23 |
| CCGGCTA | 130 | 2.773595E-8 | 19.037851 | 16 |
| TGCGCCC | 60 | 0.006511902 | 18.749397 | 11 |
| ACCGCTA | 200 | 1.8189894E-12 | 17.999422 | 27 |
| CTCGTAT | 215 | 0.0 | 17.790127 | 39 |
| CCGCTAT | 205 | 3.6379788E-12 | 17.560413 | 28 |
| ACGTATT | 90 | 1.8662412E-4 | 17.499437 | 45 |
| CGCCCGG | 155 | 1.09048415E-8 | 17.418797 | 13 |
| ATGCCGA | 350 | 0.0 | 17.356586 | 19 |
| TCGTATG | 210 | 5.456968E-12 | 17.142307 | 40 |
| GACCGCT | 225 | 1.8189894E-12 | 16.999454 | 26 |
| AGACCGC | 225 | 1.8189894E-12 | 16.999454 | 25 |
| GATCGCG | 80 | 0.0017112851 | 16.874458 | 21 |
| CGTATGC | 215 | 9.094947E-12 | 16.743649 | 41 |
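
Several of the enriched 7-mers look like fragments of the overrepresented sequence reported earlier. The sketch below checks, for each k-mer, whether it occurs in that sequence at exactly its Max Obs/Exp Position. It assumes positions are 1-based read coordinates; the function name `kmers_at_reported_position` is illustrative and not part of FastQC.

```python
# Overrepresented sequence and k-mer rows copied from the tables in this report.
OVERREPRESENTED = "AAGAGCGGTTCAGCAGGAATGCCGAGACCGCTATTGATCTCGTATGCCGTC"

# (k-mer, Max Obs/Exp Position), in table order.
KMERS = [
    ("TATGCCG", 43), ("TGCCGTC", 45), ("ATGCCGT", 44), ("TCGGTAC", 23),
    ("ACTGCGC", 9), ("ATCGCGC", 22), ("TCGCGCC", 23), ("CCGGCTA", 16),
    ("TGCGCCC", 11), ("ACCGCTA", 27), ("CTCGTAT", 39), ("CCGCTAT", 28),
    ("ACGTATT", 45), ("CGCCCGG", 13), ("ATGCCGA", 19), ("TCGTATG", 40),
    ("GACCGCT", 26), ("AGACCGC", 25), ("GATCGCG", 21), ("CGTATGC", 41),
]


def kmers_at_reported_position(seq, kmers):
    """Return the k-mers found in seq at their reported position.

    Positions are assumed to be 1-based, so position p maps to seq[p - 1].
    """
    return [k for k, pos in kmers if seq[pos - 1:pos - 1 + len(k)] == k]


matched = kmers_at_reported_position(OVERREPRESENTED, KMERS)
unmatched = [k for k, _ in KMERS if k not in matched]
print(f"{len(matched)} of {len(KMERS)} k-mers match at their reported position")
print("matched:  ", ", ".join(matched))
print("unmatched:", ", ".join(unmatched))
```

K-mers that match are plausibly explained by the same primer-like read rather than by independent sequence bias. The unmatched ones, mostly GC-rich k-mers peaking between positions 9 and 23, would need a separate explanation.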