Basic Statistics
Measure | Value |
---|---|
Filename | ERR1042644.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 13676569 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 47 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 54945 | 0.40174549625713873 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 54102 | 0.3955816696424374 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 33611 | 0.24575608107559724 | No Hit |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 20441 | 0.14945999979965735 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 16595 | 0.0 | 25.16089 | 1 |
GTATCAA | 24355 | 0.0 | 17.182098 | 2 |
TATACTG | 3015 | 0.0 | 15.462686 | 5 |
ACGGACC | 1810 | 0.0 | 14.820442 | 8 |
CGAACGA | 865 | 0.0 | 14.543352 | 16 |
TACGACG | 1230 | 0.0 | 14.439024 | 5 |
ACGACGG | 1215 | 0.0 | 14.008231 | 6 |
CGACGGT | 1195 | 0.0 | 13.933054 | 7 |
GACGGAC | 1860 | 0.0 | 13.924731 | 7 |
TCGTTTA | 960 | 0.0 | 13.682292 | 30 |
CGCGCTA | 1465 | 0.0 | 13.259385 | 24 |
TAACGCC | 1140 | 0.0 | 12.982456 | 4 |
GTATTAG | 2325 | 0.0 | 12.9698925 | 1 |
CCGATAA | 985 | 0.0 | 12.959392 | 9 |
ACGAACG | 965 | 0.0 | 12.84456 | 15 |
ACGCGCG | 1515 | 0.0 | 12.821782 | 21 |
TACCGTC | 1420 | 0.0 | 12.637323 | 7 |
CTATACT | 2295 | 0.0 | 12.575164 | 4 |
CGTCGTA | 1160 | 0.0 | 12.439655 | 10 |
GTATAAG | 2390 | 0.0 | 12.384936 | 1 |