Basic Statistics
Measure | Value |
---|---|
Filename | ERR1378109.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 660374 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage of total sequences (%) | Possible Source |
---|---|---|---|
AAGAGCGGTTCAGCAGGAATGCCGAGACCGGCTCCAATCTCGTATGCCGTC | 2669 | 0.404164912610127 | Illumina Paired End PCR Primer 2 (96% over 30bp) |
TCCAGGGATTTATAAGCCGATGACGTCATAACATCCCTGACCCTTTAAATA | 1544 | 0.2338069033608228 | No Hit |
CGGAAGAGCGGTTCAGCAGGAATGCCGAGACCGGCTCCAATCTCGTATGCC | 1428 | 0.2162410997404501 | Illumina Paired End PCR Primer 2 (96% over 33bp) |
TCGTTGGAATTCCTCGGGGAATTCGGTATTCCCAGGCGGTCTCCCATCCAA | 930 | 0.14082928764609146 | No Hit |
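The Percentage column appears to be each sequence's count divided by the Total Sequences value from Basic Statistics (660374), times 100. The sketch below recomputes it under that assumption; the function name `percentage_of_total` and the dictionary layout are illustrative and are not part of FastQC.

```python
# Recompute the "Percentage" column from the reported counts.
# Assumption: percentage = 100 * count / Total Sequences.

TOTAL_SEQUENCES = 660374  # "Total Sequences" from Basic Statistics

# Counts copied from the Overrepresented sequences table.
OVERREPRESENTED_COUNTS = {
    "AAGAGCGGTTCAGCAGGAATGCCGAGACCGGCTCCAATCTCGTATGCCGTC": 2669,
    "TCCAGGGATTTATAAGCCGATGACGTCATAACATCCCTGACCCTTTAAATA": 1544,
    "CGGAAGAGCGGTTCAGCAGGAATGCCGAGACCGGCTCCAATCTCGTATGCC": 1428,
    "TCGTTGGAATTCCTCGGGGAATTCGGTATTCCCAGGCGGTCTCCCATCCAA": 930,
}


def percentage_of_total(count: int, total: int = TOTAL_SEQUENCES) -> float:
    """Return count as a percentage of total."""
    return 100.0 * count / total


if __name__ == "__main__":
    for sequence, count in OVERREPRESENTED_COUNTS.items():
        print(f"{sequence}\t{count}\t{percentage_of_total(count):.4f}")
```

Summing the four rows the same way gives the share of the library taken up by the listed overrepresented sequences, which is about 1% of reads here.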
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGCGGTC | 140 | 0.0 | 43.38906 | 35 |
CTCGGGG | 130 | 0.0 | 43.26217 | 13 |
ATGACGT | 145 | 0.0 | 41.892883 | 20 |
CGTCATA | 145 | 0.0 | 41.892883 | 24 |
TGACGTC | 150 | 0.0 | 40.496456 | 21 |
AGGCGGT | 150 | 0.0 | 40.496456 | 34 |
CGATGAC | 145 | 0.0 | 40.341297 | 18 |
TGCCGTC | 315 | 0.0 | 40.01165 | 45 |
GACGTCA | 155 | 0.0 | 39.19012 | 22 |
TTCGGTA | 155 | 0.0 | 39.19012 | 22 |
CCGATGA | 150 | 0.0 | 38.99659 | 17 |
GCCGATG | 150 | 0.0 | 38.99659 | 16 |
AGCCGAT | 150 | 0.0 | 38.993637 | 15 |
TAAGCCG | 150 | 0.0 | 38.993637 | 13 |
ACGTCAT | 160 | 0.0 | 37.965427 | 23 |
GATGACG | 155 | 0.0 | 37.738632 | 19 |
ATGCCGT | 340 | 0.0 | 37.052776 | 44 |
GCGGTCT | 165 | 0.0 | 36.81496 | 36 |
CGGTATT | 165 | 0.0 | 36.81496 | 24 |
TATGCCG | 345 | 0.0 | 36.515778 | 43 |
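Several of the enriched k-mers look like substrings of the overrepresented sequences listed earlier. A minimal sketch for checking this is shown below: it finds the 1-based start positions of a k-mer within a read so they can be compared with the Max Obs/Exp Position column. The helper `kmer_positions` is hypothetical, and a matching position is only suggestive of a shared origin, not proof of one.

```python
# Locate k-mers inside overrepresented sequences (1-based positions).
# The helper name and the choice of examples are illustrative only.

from typing import List

# Two sequences copied from the Overrepresented sequences table.
ADAPTER_LIKE = "AAGAGCGGTTCAGCAGGAATGCCGAGACCGGCTCCAATCTCGTATGCCGTC"
NO_HIT = "TCGTTGGAATTCCTCGGGGAATTCGGTATTCCCAGGCGGTCTCCCATCCAA"


def kmer_positions(sequence: str, kmer: str) -> List[int]:
    """Return every 1-based start position of kmer in sequence,
    including overlapping occurrences."""
    positions = []
    start = sequence.find(kmer)
    while start != -1:
        positions.append(start + 1)  # convert 0-based index to 1-based
        start = sequence.find(kmer, start + 1)
    return positions


if __name__ == "__main__":
    for name, seq, kmer in [
        ("NO_HIT", NO_HIT, "GGCGGTC"),
        ("NO_HIT", NO_HIT, "CTCGGGG"),
        ("ADAPTER_LIKE", ADAPTER_LIKE, "TGCCGTC"),
    ]:
        print(name, kmer, kmer_positions(seq, kmer))
```

For these three k-mers the start positions found in the sequences agree with the reported Max Obs/Exp Position values (35, 13, and 45), which is consistent with the k-mer enrichment being driven by the overrepresented reads.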