Basic Statistics
Measure | Value |
---|---|
Filename | ERR1378917.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 956228 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 42 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGAAGAGCGGTTCAGCAGGAATGCCGAGACCGGAGTGGATCTCGTATGCC | 12544 | 1.3118210301308892 | Illumina Paired End PCR Primer 2 (96% over 33bp) |
TTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAATGCCGAGAC | 7289 | 0.7622659031109735 | Illumina Paired End PCR Primer 2 (100% over 35bp) |
AAGAGCGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAAT | 1631 | 0.17056601563643817 | Illumina Paired End PCR Primer 2 (96% over 30bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTCGTAT | 1520 | 0.0 | 36.857994 | 42 |
CGGAGTG | 1565 | 0.0 | 36.66079 | 32 |
TCTCGTA | 1535 | 0.0 | 36.49782 | 41 |
CCGGAGT | 1580 | 0.0 | 36.31274 | 31 |
ATCTCGT | 1555 | 0.0 | 35.8837 | 40 |
CGTATGC | 1560 | 0.0 | 35.768692 | 44 |
GACCGGA | 1610 | 0.0 | 35.63611 | 29 |
ACCGGAG | 1620 | 0.0 | 35.555016 | 30 |
TCGTATG | 1580 | 0.0 | 35.458324 | 43 |
AGACCGG | 1630 | 0.0 | 35.198856 | 28 |
GTATGCC | 1580 | 0.0 | 35.17352 | 45 |
GAGACCG | 1640 | 0.0 | 34.847034 | 27 |
GGATCTC | 1635 | 0.0 | 34.265537 | 38 |
GATCTCG | 1650 | 0.0 | 33.954033 | 39 |
AGATCGG | 1105 | 0.0 | 32.375076 | 17 |
CGAGACC | 1800 | 0.0 | 32.37451 | 26 |
GATCGGA | 1120 | 0.0 | 31.94148 | 18 |
TGGATCT | 1785 | 0.0 | 31.764227 | 37 |
AGTGGAT | 1825 | 0.0 | 31.684452 | 35 |
ATCGGAA | 1130 | 0.0 | 31.658815 | 19 |