Basic Statistics
Measure | Value |
---|---|
Filename | ERR1041851.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 2228225 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 53 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 3654 | 0.16398703003511764 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 3438 | 0.15429321545176095 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 2345 | 0.10524071850912722 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 1960 | 0.0 | 30.959183 | 1 |
GTATCAA | 2925 | 0.0 | 20.871796 | 2 |
TAGACTA | 115 | 3.0540832E-9 | 19.304348 | 5 |
GTTCTAA | 215 | 0.0 | 18.930233 | 1 |
TTTAGCG | 90 | 2.1540618E-6 | 18.5 | 26 |
TTAGACT | 115 | 6.412847E-8 | 17.695652 | 4 |
GTATTAG | 320 | 0.0 | 16.765625 | 1 |
CTAGACT | 170 | 8.54925E-11 | 16.32353 | 4 |
ATAACCG | 80 | 3.384349E-4 | 16.1875 | 5 |
CTACACT | 255 | 0.0 | 15.960784 | 4 |
GTCGCTA | 95 | 7.0644295E-5 | 15.578948 | 37 |
CGGATAA | 190 | 2.7284841E-11 | 15.578948 | 25 |
TAAGATA | 155 | 7.219569E-9 | 15.516129 | 4 |
TACACTG | 280 | 0.0 | 15.196429 | 5 |
ATTACTC | 220 | 1.8189894E-12 | 15.136364 | 3 |
CCAATAC | 380 | 0.0 | 15.092106 | 3 |
CCTAGTA | 210 | 9.094947E-12 | 14.97619 | 2 |
GTCTTAC | 210 | 9.094947E-12 | 14.97619 | 1 |
ATCAACG | 4130 | 0.0 | 14.826877 | 4 |
CAACGCA | 4085 | 0.0 | 14.809058 | 6 |