Basic Statistics
Measure | Value |
---|---|
Filename | ERR1391299.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 576766 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
TCCAGGGATTTATAAGCCGATGACGTCATAACATCCCTGACCCTTTAAATA | 3373 | 0.5848125582992063 | No Hit |
TCGTTGGAATTCCTCGGGGAATTCGGTATTCCCAGGCGGTCTCCCATCCAA | 2416 | 0.41888738240464934 | No Hit |
AAGAGCGGTTCAGCAGGAATGCCGAGACCGAGCGTAATCTCGTATGCCGTC | 931 | 0.16141728187861282 | Illumina Paired End PCR Primer 2 (96% over 33bp) |
CGGAAGAGCGGTTCAGCAGGAATGCCGAGACCGAGCGTAATCTCGTATGCC | 791 | 0.13714400640814473 | Illumina Paired End PCR Primer 2 (97% over 36bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AATTCGG | 325 | 0.0 | 42.9182 | 20 |
GAATTCG | 325 | 0.0 | 42.9182 | 19 |
TCGGGGA | 330 | 0.0 | 42.26792 | 14 |
CGGGGAA | 330 | 0.0 | 42.26792 | 15 |
GCGGTCT | 335 | 0.0 | 41.63706 | 36 |
TTCGGTA | 335 | 0.0 | 41.63706 | 22 |
TCGGTAT | 335 | 0.0 | 41.63706 | 23 |
AAGCCGA | 325 | 0.0 | 41.53374 | 14 |
TAAGCCG | 325 | 0.0 | 41.53374 | 13 |
AGATTCG | 60 | 3.6379788E-12 | 41.25962 | 10 |
CCTCGGG | 285 | 0.0 | 41.047966 | 12 |
TTCGGGG | 55 | 6.002665E-11 | 40.90444 | 13 |
ATGACGT | 325 | 0.0 | 40.841515 | 20 |
TATGCCG | 100 | 0.0 | 40.495396 | 43 |
ATTCGGT | 345 | 0.0 | 40.430183 | 21 |
GGCGGTC | 345 | 0.0 | 40.430183 | 35 |
GCCGATG | 335 | 0.0 | 40.293926 | 16 |
CGGTATT | 350 | 0.0 | 39.856068 | 24 |
CTCGGGG | 295 | 0.0 | 39.656506 | 13 |
TGACGTC | 335 | 0.0 | 39.62236 | 21 |