Basic Statistics
Measure | Value |
---|---|
Filename | ERR1042434.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 207771 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 54 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGGTAGGCACACGCTGAGCCAGTCAGTGTAGCGCGCGTGCAGC | 359 | 0.17278638501042012 | No Hit |
CCTTAGATGTCCGGGGCTGCACGCGCGCTACACTGACTGGCTC | 318 | 0.15305312098416046 | No Hit |
GTGTAGCGCGCGTGCAGCCCCGGACATCTAAGGGCATCACAGA | 301 | 0.14487103590010156 | No Hit |
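
The Percentage column above appears to be each sequence's count divided by the Total Sequences value from Basic Statistics (207771), times 100. The following is a minimal Python sketch of that check, assuming that relationship holds. The names `TOTAL_SEQUENCES`, `overrepresented`, and `percentage` are illustrative and not part of the FastQC output.

```python
# Assumption: Percentage = 100 * Count / Total Sequences.
TOTAL_SEQUENCES = 207771  # "Total Sequences" from Basic Statistics

# (sequence, count) pairs copied from the Overrepresented sequences table.
overrepresented = [
    ("GGGTAGGCACACGCTGAGCCAGTCAGTGTAGCGCGCGTGCAGC", 359),
    ("CCTTAGATGTCCGGGGCTGCACGCGCGCTACACTGACTGGCTC", 318),
    ("GTGTAGCGCGCGTGCAGCCCCGGACATCTAAGGGCATCACAGA", 301),
]


def percentage(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of the total number of reads."""
    return 100.0 * count / total


if __name__ == "__main__":
    for seq, count in overrepresented:
        print(f"{seq}\t{count}\t{percentage(count):.6f}")
```

Recomputing the column this way gives a quick consistency check between the two tables. All three sequences are 43 bases long, matching the reported Sequence length.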
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATAACGA | 30 | 3.589424E-4 | 30.833334 | 12 |
GAGCTAG | 25 | 0.005486112 | 29.6 | 1 |
GTCTAAA | 25 | 0.005486112 | 29.6 | 1 |
GTTCAGA | 25 | 0.005486112 | 29.6 | 6 |
GTTAATT | 35 | 8.8448584E-4 | 26.42857 | 2 |
TGAGCGG | 35 | 8.8448584E-4 | 26.42857 | 10 |
ACGAACG | 35 | 8.8448584E-4 | 26.42857 | 15 |
CGAACGA | 45 | 1.3181212E-4 | 24.666666 | 16 |
TAATCAT | 40 | 0.0019258951 | 23.125 | 5 |
TCCGATA | 40 | 0.0019258951 | 23.125 | 8 |
CCGATAA | 40 | 0.0019258951 | 23.125 | 9 |
GGTTAAT | 40 | 0.0019258951 | 23.125 | 1 |
CGATAAC | 40 | 0.0019258951 | 23.125 | 10 |
CGAGACT | 50 | 2.6917903E-4 | 22.2 | 20 |
GATAACG | 45 | 0.0038154242 | 20.555555 | 11 |
TTCCGAT | 45 | 0.0038154242 | 20.555555 | 7 |
TAACGAA | 45 | 0.0038154242 | 20.555555 | 13 |
CTCGGAG | 45 | 0.0038154242 | 20.555555 | 12 |
AACGAGA | 45 | 0.0038154242 | 20.555555 | 18 |
ACGAGAC | 45 | 0.0038154242 | 20.555555 | 19 |