Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1633338.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 888923 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 46 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 3663 | 0.412071686749021 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2532 | 0.28483906930071556 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 1507 | 0.16953099424809573 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CAAGCGG | 25 | 0.005495681 | 29.6 | 32 |
| GGTATCA | 815 | 0.0 | 27.01227 | 1 |
| TTCGCCG | 100 | 0.0 | 25.900002 | 24 |
| GCCGGCA | 115 | 0.0 | 25.73913 | 15 |
| GTCGATT | 90 | 1.4188117E-10 | 24.666666 | 12 |
| GATAGAC | 60 | 1.3368372E-6 | 24.666666 | 3 |
| TCGATAC | 95 | 2.8012437E-10 | 23.368422 | 1 |
| GTATCAA | 2085 | 0.0 | 22.44844 | 1 |
| TCGATTG | 100 | 5.347829E-10 | 22.2 | 13 |
| ACCGTTT | 60 | 3.7250644E-5 | 21.583332 | 10 |
| GGTCGAT | 105 | 9.822543E-10 | 21.142859 | 11 |
| CGATTGG | 105 | 9.822543E-10 | 21.142859 | 14 |
| TCTATAC | 115 | 1.3460522E-10 | 20.913044 | 3 |
| TAAACGA | 45 | 0.003825137 | 20.555555 | 29 |
| TACACCG | 45 | 0.003825137 | 20.555555 | 5 |
| GCTTCGC | 145 | 0.0 | 20.413794 | 22 |
| GCAATAC | 55 | 5.141835E-4 | 20.181818 | 3 |
| CCGCTCT | 145 | 7.2759576E-12 | 19.13793 | 28 |
| CGCTCTC | 150 | 1.2732926E-11 | 18.499998 | 29 |
| TGGTCGA | 125 | 8.571078E-9 | 17.76 | 10 |