Basic Statistics
Measure | Value |
---|---|
Filename | ERR1378103.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 660925 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AAGAGCGGTTCAGCAGGAATGCCGAGACCGTTCGAAATCTCGTATGCCGTC | 1720 | 0.26024132844119985 | Illumina Paired End PCR Primer 2 (96% over 33bp) |
TCCAGGGATTTATAAGCCGATGACGTCATAACATCCCTGACCCTTTAAATA | 1533 | 0.23194764912811588 | No Hit |
TCGTTGGAATTCCTCGGGGAATTCGGTATTCCCAGGCGGTCTCCCATCCAA | 1060 | 0.16038128380678596 | No Hit |
CGGAAGAGCGGTTCAGCAGGAATGCCGAGACCGTTCGAAATCTCGTATGCC | 949 | 0.1435866399364527 | Illumina Paired End PCR Primer 2 (97% over 36bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TAAGCCG | 185 | 0.0 | 42.560482 | 13 |
AGCCGAT | 195 | 0.0 | 40.377895 | 15 |
GCCGATG | 200 | 0.0 | 39.368446 | 16 |
ACGTCAT | 205 | 0.0 | 38.408237 | 23 |
CGTCATA | 205 | 0.0 | 38.408237 | 24 |
GCGGTCT | 130 | 0.0 | 38.070583 | 36 |
ATGCCGT | 185 | 0.0 | 37.696426 | 44 |
CGATGAC | 210 | 0.0 | 37.49376 | 18 |
TATGCCG | 190 | 0.0 | 36.70442 | 43 |
CCGATGA | 215 | 0.0 | 36.62181 | 17 |
GACGTCA | 215 | 0.0 | 36.62181 | 22 |
GATGACG | 215 | 0.0 | 36.62181 | 19 |
TGACGTC | 220 | 0.0 | 35.789497 | 21 |
AAGCCGA | 220 | 0.0 | 35.789497 | 14 |
GGCGGTC | 145 | 0.0 | 35.683716 | 35 |
TTCGGTA | 145 | 0.0 | 35.683716 | 22 |
TCGGTAT | 140 | 0.0 | 35.35126 | 23 |
ATGACGT | 225 | 0.0 | 34.994175 | 20 |
TGCCGTC | 200 | 0.0 | 34.874474 | 45 |
AGGCGGT | 150 | 0.0 | 32.99451 | 34 |