Basic Statistics
Measure | Value |
---|---|
Filename | ERR1391171.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 739386 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 39 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
TCCAGGGATTTATAAGCCGATGACGTCATAACATCCCTGACCCTTTAAATA | 2839 | 0.3839672376809948 | No Hit |
TCGTTGGAATTCCTCGGGGAATTCGGTATTCCCAGGCGGTCTCCCATCCAA | 1820 | 0.24615018407164865 | No Hit |
AAGAGCGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAAT | 1074 | 0.1452556580730498 | Illumina Paired End PCR Primer 2 (96% over 30bp) |
AAGAGCGGTTCAGCAGGAATGCCGAGACCGAGTAGCATCTCGTATGCCGTC | 770 | 0.10414046249185135 | Illumina Paired End PCR Primer 2 (96% over 31bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GCGGTCT | 280 | 0.0 | 43.392075 | 36 |
CGATGAC | 305 | 0.0 | 42.04842 | 18 |
TAAGCCG | 305 | 0.0 | 42.04842 | 13 |
AGGCGGT | 290 | 0.0 | 41.895794 | 34 |
CGGTATT | 285 | 0.0 | 41.84135 | 24 |
TCGGTAT | 285 | 0.0 | 41.84135 | 23 |
GCCGATG | 310 | 0.0 | 41.370224 | 16 |
CGTCATA | 310 | 0.0 | 41.370224 | 24 |
GGCGGTC | 295 | 0.0 | 41.185696 | 35 |
TTCGGTA | 290 | 0.0 | 41.119946 | 22 |
CCGATGA | 315 | 0.0 | 40.71355 | 17 |
GACGTCA | 315 | 0.0 | 40.71355 | 22 |
AATTCGG | 295 | 0.0 | 40.422997 | 20 |
TGACGTC | 320 | 0.0 | 40.0774 | 21 |
ACGTCAT | 320 | 0.0 | 40.0774 | 23 |
GATGACG | 320 | 0.0 | 40.0774 | 19 |
CAGGCGG | 310 | 0.0 | 39.918636 | 33 |
GATCGGA | 130 | 0.0 | 39.806976 | 26 |
TCGGGGA | 300 | 0.0 | 39.749283 | 14 |
ATTCGGT | 300 | 0.0 | 39.749283 | 21 |