Basic Statistics
Measure | Value |
---|---|
Filename | ERR1041894.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 1902311 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 55 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 6524 | 0.3429512839908932 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5417 | 0.2847589064038425 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 3687 | 0.19381688903654554 | No Hit |
CCCTCAGAGAGGCGAGGGTTCGAGGGCACGAGTTCGAGGCCAA | 2981 | 0.15670413512827291 | No Hit |
GAGTACGGGGGAGCCATTGTGGCTCCGGCCGGTTGCGCGGGCC | 1908 | 0.10029905730451014 | No Hit |
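
The Percentage column above appears to be each Count divided by Total Sequences (1902311), times 100. The minimal sketch below checks that reading against the reported rows. The names `TOTAL_SEQUENCES`, `overrepresented_counts` and `percentage_of_total` are illustrative and not part of the report. The formula is an assumption inferred from the numbers, not taken from the tool's source.

```python
# Values copied from the report tables above.
TOTAL_SEQUENCES = 1902311

overrepresented_counts = {
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": 6524,
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT": 5417,
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT": 3687,
    "CCCTCAGAGAGGCGAGGGTTCGAGGGCACGAGTTCGAGGCCAA": 2981,
    "GAGTACGGGGGAGCCATTGTGGCTCCGGCCGGTTGCGCGGGCC": 1908,
}


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed formula: 100 * count / total)."""
    return 100.0 * count / total


if __name__ == "__main__":
    # Print each sequence with its count and the recomputed percentage.
    for sequence, count in overrepresented_counts.items():
        print(f"{sequence}\t{count}\t{percentage_of_total(count):.4f}")
```

Under this assumption the recomputed values agree with the Percentage column to within floating-point rounding.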
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 2610 | 0.0 | 32.463604 | 1 |
GTATCAA | 3865 | 0.0 | 21.82665 | 2 |
CTAACGC | 65 | 6.90517E-5 | 19.923077 | 3 |
GTACTAG | 65 | 6.90517E-5 | 19.923077 | 1 |
TAGACTG | 90 | 2.1537253E-6 | 18.5 | 5 |
TGTACTA | 90 | 2.1537253E-6 | 18.5 | 5 |
TAATCCG | 60 | 9.239845E-4 | 18.5 | 5 |
TTAACGG | 75 | 2.0681637E-4 | 17.266666 | 35 |
GTGTTAG | 450 | 0.0 | 16.855556 | 1 |
TCTATAC | 115 | 1.2431647E-6 | 16.086956 | 3 |
GTTAGCC | 485 | 0.0 | 15.639175 | 3 |
CTACCCT | 560 | 0.0 | 15.196429 | 4 |
CTATACT | 245 | 0.0 | 15.10204 | 4 |
CATAGGG | 245 | 0.0 | 15.10204 | 2 |
AGTGCGC | 160 | 1.0975782E-8 | 15.031251 | 10 |
ACGCCCT | 770 | 0.0 | 14.896104 | 4 |
TATCAAC | 5735 | 0.0 | 14.806452 | 3 |
ACCTATA | 100 | 1.09396184E-4 | 14.8 | 2 |
ACCGTGT | 225 | 1.8189894E-12 | 14.8 | 8 |
TATACAG | 140 | 6.0003003E-7 | 14.535715 | 5 |