Basic Statistics
Measure | Value |
---|---|
Filename | ERR1378150.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 650791 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGAAGAGCGGTTCAGCAGGAATGCCGAGACCGTCTGTAATCTCGTATGCC | 1264 | 0.1942251813562265 | Illumina Paired End PCR Primer 2 (96% over 33bp) |
TTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAATGCCGAGAC | 746 | 0.11462973519916532 | Illumina Paired End PCR Primer 2 (100% over 35bp) |
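
The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics, times 100. The short sketch below reproduces the two figures under that assumption; the function name `percentage_of_total` is illustrative and not part of FastQC.

```python
# "Total Sequences" value taken from the Basic Statistics table.
TOTAL_SEQUENCES = 650791


def percentage_of_total(count: int, total: int = TOTAL_SEQUENCES) -> float:
    """Return `count` as a percentage of `total` reads (assumed formula)."""
    return count / total * 100


# Counts taken from the Overrepresented sequences table.
for count in (1264, 746):
    print(f"{count} reads -> {percentage_of_total(count):.4f}% of all reads")
```

Both sequences account for well under 1% of the reads, at roughly 0.19% and 0.11%.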
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTCGTAT | 120 | 0.0 | 39.37581 | 42 |
GACCGTC | 120 | 0.0 | 39.37278 | 29 |
AGATCGG | 120 | 0.0 | 39.369755 | 17 |
GATCGGA | 120 | 0.0 | 39.369755 | 18 |
CGTATGC | 130 | 0.0 | 38.0777 | 44 |
TCTCGTA | 125 | 0.0 | 37.800777 | 41 |
AGACCGT | 135 | 0.0 | 36.6646 | 28 |
ATCTCGT | 130 | 0.0 | 36.346897 | 40 |
TCGTATG | 140 | 0.0 | 35.35515 | 43 |
ACCGTCT | 140 | 0.0 | 35.35515 | 30 |
GAGACCG | 135 | 0.0 | 34.998028 | 27 |
ATCGGAA | 140 | 0.0 | 33.745506 | 19 |
TCGGAAG | 135 | 0.0 | 33.328896 | 20 |
CCGTCTG | 160 | 0.0 | 30.938131 | 31 |
CGTCTGT | 165 | 0.0 | 30.000616 | 32 |
GGCTACG | 30 | 0.0051470553 | 29.998308 | 43 |
GTATGCC | 190 | 0.0 | 28.421635 | 45 |
GAGATCG | 180 | 0.0 | 26.246506 | 16 |
CGAGATC | 185 | 0.0 | 25.53714 | 15 |
AATCTCG | 200 | 0.0 | 24.750505 | 39 |
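
Several of the enriched k-mers above are substrings of the first overrepresented sequence (the one matching Illumina Paired End PCR Primer 2), and their 1-based start offsets in that sequence match the reported Max Obs/Exp Position. The sketch below is only a consistency check on the values in this report, not a description of how FastQC computes the column; the helper name `one_based_position` is hypothetical.

```python
# First entry from the Overrepresented sequences table.
OVERREPRESENTED = "CGGAAGAGCGGTTCAGCAGGAATGCCGAGACCGTCTGTAATCTCGTATGCC"

# A few k-mers and their "Max Obs/Exp Position" values from the Kmer Content table.
REPORTED = {"CTCGTAT": 42, "GACCGTC": 29, "CGTATGC": 44, "AGACCGT": 28}


def one_based_position(kmer, sequence=OVERREPRESENTED):
    """Return the 1-based start of the first match of `kmer`, or None if absent."""
    index = sequence.find(kmer)
    return index + 1 if index >= 0 else None


for kmer, reported in REPORTED.items():
    print(kmer, "found at", one_based_position(kmer), "reported at", reported)
```

Not every k-mer in the table occurs in this sequence (`GGCTACG` does not, for example), so the check covers only part of the Kmer Content rows.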