Basic Statistics
Measure | Value |
---|---|
Filename | ERR1042066.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 1675164 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 47 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 12775 | 0.7626118994916319 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 12472 | 0.7445241182355877 | No Hit |
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5147 | 0.30725349876191227 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 3948 | 0.23567841715796184 | No Hit |
GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTT | 2984 | 0.17813181276579487 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 4430 | 0.0 | 22.049662 | 1 |
TCTATCG | 80 | 6.9640373E-7 | 20.8125 | 29 |
GACGGAC | 620 | 0.0 | 20.58871 | 7 |
AAGACGG | 670 | 0.0 | 20.156717 | 5 |
ACGGACC | 640 | 0.0 | 19.945312 | 8 |
CGGACCA | 655 | 0.0 | 19.770992 | 9 |
GCGCAAG | 705 | 0.0 | 18.631207 | 1 |
GTATCAA | 5245 | 0.0 | 18.623451 | 2 |
CGCAAGA | 690 | 0.0 | 18.5 | 2 |
AGACGGA | 705 | 0.0 | 18.106382 | 6 |
ATGGTCG | 225 | 0.0 | 18.088888 | 36 |
GCGAAAG | 740 | 0.0 | 17.5 | 18 |
ACCGATC | 65 | 0.0015803416 | 17.076923 | 8 |
GTAGCAC | 120 | 1.0420081E-7 | 16.958334 | 3 |
CCGATAA | 230 | 0.0 | 16.891304 | 9 |
CGAAAGC | 740 | 0.0 | 16.75 | 19 |
CCGAGTT | 100 | 5.881022E-6 | 16.65 | 13 |
AGCGAAA | 760 | 0.0 | 16.552631 | 17 |
CGTAGAC | 135 | 2.221168E-8 | 16.444445 | 3 |
CAAGACG | 815 | 0.0 | 16.343557 | 4 |