Basic Statistics
Measure | Value |
---|---|
Filename | ERR523056_1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 4941950 |
Sequences flagged as poor quality | 0 |
Sequence length | 100 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT | 22779 | 0.4609314137132104 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 18338 | 0.37106810064852946 | No Hit |
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 15052 | 0.30457612885601837 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 8874 | 0.17956474670929493 | No Hit |
GTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTAT | 6176 | 0.12497091229170673 | No Hit |
TATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATC | 4987 | 0.10091158348425217 | No Hit |
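
The Percentage column above appears to be each sequence's count as a share of Total Sequences (4941950, from Basic Statistics). The sketch below is a minimal check of that relationship, assuming the percentage is simply `100 * count / total`. The function name `percentage_of_total` and the row labels are hypothetical, introduced here for illustration only.

```python
# Minimal sketch: recompute the Percentage column from Count and Total Sequences.
# Assumption: percentage = 100 * count / total sequences in the file.

TOTAL_SEQUENCES = 4941950  # "Total Sequences" from Basic Statistics


def percentage_of_total(count: int, total_sequences: int) -> float:
    """Return count as a percentage of total_sequences."""
    if total_sequences <= 0:
        raise ValueError("total_sequences must be positive")
    return 100.0 * count / total_sequences


# Counts copied from the Overrepresented sequences table, in row order.
reported = [
    (22779, 0.4609314137132104),
    (18338, 0.37106810064852946),
    (15052, 0.30457612885601837),
    (8874, 0.17956474670929493),
    (6176, 0.12497091229170673),
    (4987, 0.10091158348425217),
]

for row, (count, reported_pct) in enumerate(reported, start=1):
    computed = percentage_of_total(count, TOTAL_SEQUENCES)
    # Compare with a small tolerance rather than exact float equality.
    agrees = abs(computed - reported_pct) < 1e-9
    print(f"row {row}: count={count} computed={computed:.6f} agrees={agrees}")
```

Comparing with a tolerance avoids false mismatches from floating-point rounding in the last digits.
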
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TACCTGG | 4355 | 0.0 | 32.593872 | 2 |
GTACATG | 19245 | 0.0 | 31.82034 | 1 |
TACATGG | 19665 | 0.0 | 30.66549 | 2 |
ATGGGGG | 10280 | 0.0 | 28.983274 | 5 |
TATAACG | 360 | 0.0 | 28.723501 | 2 |
ACATGGG | 20240 | 0.0 | 28.674995 | 3 |
CATGGGG | 14590 | 0.0 | 28.312935 | 4 |
GTACCTG | 5745 | 0.0 | 25.46139 | 1 |
TGGGGGG | 10210 | 0.0 | 25.177517 | 6 |
ACCTGGG | 5750 | 0.0 | 24.355402 | 3 |
TATCACG | 585 | 0.0 | 22.49673 | 2 |
ATAACGC | 455 | 0.0 | 21.689762 | 3 |
GGGTTAG | 2055 | 0.0 | 21.514326 | 1 |
GAGTACT | 16445 | 0.0 | 21.183098 | 12-13 |
TAACGCA | 485 | 0.0 | 20.348333 | 4 |
AGAGTAC | 24700 | 0.0 | 19.651514 | 10-11 |
AGTACTT | 17090 | 0.0 | 18.856909 | 12-13 |
GTTAAGA | 4995 | 0.0 | 18.816801 | 4 |
TTAAGAG | 5215 | 0.0 | 18.653801 | 5 |
GGTTAAG | 4170 | 0.0 | 18.59494 | 3 |