Basic Statistics
Measure | Value |
---|---|
Filename | ERR1630552.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 1907618 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 46 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 10379 | 0.5440816767298274 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 8784 | 0.46046954893484965 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 8137 | 0.4265529052462285 | No Hit |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5988 | 0.3138993236591393 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGAATTA | 105 | 4.0017767E-11 | 22.90476 | 15 |
TATCCCG | 45 | 0.0038267204 | 20.555555 | 5 |
AGTCGGT | 285 | 0.0 | 18.175438 | 11 |
CAGTCGG | 290 | 0.0 | 17.224138 | 10 |
AGACGTA | 70 | 0.0025933343 | 15.857143 | 5 |
GCAGTCG | 320 | 0.0 | 15.609375 | 9 |
CGGTGAT | 350 | 0.0 | 15.328572 | 14 |
GGTATCA | 5810 | 0.0 | 14.806369 | 1 |
TGCGTTA | 75 | 0.004105725 | 14.8 | 8 |
TCGGTGA | 375 | 0.0 | 14.306667 | 13 |
CGGACCA | 285 | 0.0 | 14.280702 | 9 |
TCTAGCG | 210 | 1.364242E-10 | 14.095238 | 28 |
AATGCGT | 105 | 1.656535E-4 | 14.095238 | 6 |
CGCAATA | 210 | 1.364242E-10 | 14.095238 | 36 |
GTCGGTG | 370 | 0.0 | 14.0 | 12 |
TCTATGG | 530 | 0.0 | 13.962264 | 2 |
AAGACGG | 305 | 0.0 | 13.95082 | 5 |
CTTATAC | 1285 | 0.0 | 13.677043 | 37 |
TAGGACA | 800 | 0.0 | 13.64375 | 4 |
CAAGACG | 340 | 0.0 | 13.6029415 | 4 |