Basic Statistics
Measure | Value |
---|---|
Filename | ERR1041472.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 9328133 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 44 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 61205 | 0.6561334406359772 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 59982 | 0.6430225641079517 | No Hit |
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 41570 | 0.4456411588471133 | No Hit |
GTACGGAAGCAGTGGTATCAACGCAGAGTACGGAAGCAGTGGT | 20941 | 0.22449293979834978 | No Hit |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 13598 | 0.14577407933613296 | No Hit |
GAGTACGGAAGCAGTGGTATCAACGCAGAGTACGGAAGCAGTG | 11846 | 0.12699218589614877 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 24685 | 0.0 | 15.386065 | 1 |
GTATCAA | 41265 | 0.0 | 14.462862 | 1 |
GCTCGGA | 1665 | 0.0 | 11.888889 | 11 |
AGGTCGC | 1620 | 0.0 | 11.533951 | 35 |
GTCGCCC | 1545 | 0.0 | 11.255664 | 37 |
CGAACTA | 1160 | 0.0 | 11.004311 | 29 |
GTATTAG | 2965 | 0.0 | 10.98145 | 1 |
CCGTTAA | 910 | 0.0 | 10.978022 | 16 |
TGCTCGG | 1980 | 0.0 | 10.838384 | 10 |
GCGAACT | 1185 | 0.0 | 10.772152 | 28 |
GTATTAA | 2275 | 0.0 | 10.652747 | 1 |
GCTCCGA | 1755 | 0.0 | 10.646724 | 29 |
CTATCGC | 600 | 0.0 | 10.483334 | 9 |
TGTACTG | 3720 | 0.0 | 10.393817 | 5 |
TTGTACG | 980 | 0.0 | 10.382653 | 9 |
TTAACGG | 1130 | 0.0 | 10.314159 | 35 |
TATACTG | 2280 | 0.0 | 10.304824 | 5 |
ATAATAC | 3275 | 0.0 | 10.280916 | 3 |
TATCAAC | 57945 | 0.0 | 10.280438 | 2 |
TCACGTA | 1205 | 0.0 | 10.13278 | 25 |