Basic Statistics
Measure | Value |
---|---|
Filename | ERR1042792.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 5391366 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 16101 | 0.29864416550462347 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 15426 | 0.28612414738676617 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 9225 | 0.1711069142773835 | No Hit |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 6328 | 0.11737285133303879 | No Hit |
CTCTAATACTGGTGATGCTAGAGGTGATGTTTTTGGTAAACAG | 5394 | 0.10004885589292214 | No Hit |
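
The Percentage column above appears to be each Count divided by the Total Sequences figure from Basic Statistics, multiplied by 100. A minimal Python sketch that recomputes it from the values reported here follows; the helper name `percentage_of_total` is hypothetical and not part of FastQC, and the relationship is inferred from the numbers in this report rather than taken from FastQC's source.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 5391366

# Sequence -> Count, as reported above.
OVERREPRESENTED = {
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": 16101,
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT": 15426,
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT": 9225,
    "ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 6328,
    "CTCTAATACTGGTGATGCTAGAGGTGATGTTTTTGGTAAACAG": 5394,
}


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of the total number of reads.

    Hypothetical helper: assumes Percentage = 100 * Count / Total Sequences.
    """
    return 100.0 * count / total


if __name__ == "__main__":
    for sequence, count in OVERREPRESENTED.items():
        print(f"{sequence}\t{count}\t{percentage_of_total(count):.4f}")
```

Under that assumption, the recomputed values agree with the reported Percentage column to well within rounding, and each of the five sequences sits at or above 0.1% of all reads.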
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TTAACGG | 1315 | 0.0 | 17.163498 | 35 |
TAACGGC | 1340 | 0.0 | 16.705225 | 36 |
GTATTAG | 2230 | 0.0 | 15.845291 | 1 |
ATTAGAG | 1680 | 0.0 | 15.526785 | 3 |
TTTAACG | 1565 | 0.0 | 14.894569 | 34 |
AACGGCC | 1505 | 0.0 | 14.873754 | 37 |
TATTAGA | 1850 | 0.0 | 14.8 | 2 |
CGTATTA | 455 | 0.0 | 13.824175 | 15 |
GTTTAAC | 1740 | 0.0 | 13.502872 | 33 |
CCGTATT | 480 | 0.0 | 13.104167 | 14 |
CTCTAAT | 1410 | 0.0 | 12.858156 | 1 |
CACATGT | 1850 | 0.0 | 12.8 | 28 |
GGTATCA | 9045 | 0.0 | 12.742399 | 1 |
GACACAT | 1850 | 0.0 | 12.7 | 26 |
TCGCTAA | 205 | 2.0705556E-8 | 12.634147 | 14 |
TTAGAGG | 2540 | 0.0 | 12.600393 | 4 |
TGACACA | 1975 | 0.0 | 12.551898 | 25 |
GTGACAC | 1860 | 0.0 | 12.532258 | 24 |
ACCGCCT | 1825 | 0.0 | 12.367123 | 12 |
GGCACCG | 1915 | 0.0 | 12.365535 | 9 |