Basic Statistics
Measure | Value |
---|---|
Filename | ERR1378892.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 938062 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 42 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGAAGAGCGGTTCAGCAGGAATGCCGAGACCGAGCGCTATCTCGTATGCC | 13422 | 1.4308222697433643 | Illumina Paired End PCR Primer 2 (97% over 36bp) |
TTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAATGCCGAGAC | 5697 | 0.6073159343412269 | Illumina Paired End PCR Primer 2 (100% over 35bp) |
AAGAGCGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAAT | 1345 | 0.14338071470755664 | Illumina Paired End PCR Primer 2 (96% over 30bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CCGAGCG | 1670 | 0.0 | 39.340897 | 31 |
CGCTATC | 1675 | 0.0 | 39.22346 | 36 |
CTCGTAT | 1650 | 0.0 | 39.138035 | 42 |
CGAGCGC | 1680 | 0.0 | 39.106724 | 32 |
TATCTCG | 1665 | 0.0 | 38.918503 | 39 |
GCGCTAT | 1695 | 0.0 | 38.76065 | 35 |
TCTCGTA | 1670 | 0.0 | 38.66725 | 41 |
TCGTATG | 1675 | 0.0 | 38.553886 | 43 |
AGCGCTA | 1710 | 0.0 | 38.55222 | 34 |
ATCTCGT | 1675 | 0.0 | 38.551826 | 40 |
CGTATGC | 1700 | 0.0 | 37.986916 | 44 |
GACCGAG | 1750 | 0.0 | 37.671024 | 29 |
AGACCGA | 1750 | 0.0 | 37.671024 | 28 |
ACCGAGC | 1745 | 0.0 | 37.65003 | 30 |
GAGCGCT | 1770 | 0.0 | 37.245365 | 33 |
GAGACCG | 1770 | 0.0 | 37.118248 | 27 |
GTATGCC | 1740 | 0.0 | 36.984337 | 45 |
GCTATCT | 1830 | 0.0 | 35.53241 | 37 |
CGAGACC | 1910 | 0.0 | 34.63314 | 26 |
CTATCTC | 1905 | 0.0 | 34.133495 | 38 |