Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1633449.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 1028615 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 45 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 5418 | 0.5267276872299159 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 3707 | 0.3603875113623659 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 2195 | 0.21339373818192423 | No Hit |
| ATCATTAACTGAATCCATAGGTTAATGAGGCGAACCGGGGGAA | 1194 | 0.11607841612265035 | No Hit |
| ATTGAAAGCTGAGTATTTTTAAGACAAAGGTTTCAGGAAGAAA | 1065 | 0.10353728071241426 | No Hit |
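
The Percentage column above appears to be each sequence's Count divided by Total Sequences (1028615, from Basic Statistics), times 100. The sketch below rechecks that arithmetic for the five listed rows. The function name `percent_of_total` and the constant `TOTAL_SEQUENCES` are illustrative names, not part of FastQC, and this assumes the percentage is taken over all reads in the file.

```python
# Recompute the "Percentage" column from Count and Total Sequences.
# Assumption: percentage = 100 * count / total reads in the file.

TOTAL_SEQUENCES = 1028615  # "Total Sequences" from Basic Statistics

# Sequence -> Count, copied from the Overrepresented sequences table.
OVERREPRESENTED = {
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT": 5418,
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT": 3707,
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": 2195,
    "ATCATTAACTGAATCCATAGGTTAATGAGGCGAACCGGGGGAA": 1194,
    "ATTGAAAGCTGAGTATTTTTAAGACAAAGGTTTCAGGAAGAAA": 1065,
}


def percent_of_total(count: int, total_sequences: int) -> float:
    """Return count as a percentage of total_sequences."""
    if total_sequences <= 0:
        raise ValueError("total_sequences must be positive")
    return 100.0 * count / total_sequences


if __name__ == "__main__":
    for sequence, count in OVERREPRESENTED.items():
        pct = percent_of_total(count, TOTAL_SEQUENCES)
        print(f"{sequence}\t{count}\t{pct:.4f}")
```

Under that assumption, the recomputed values agree with the table to within floating-point rounding.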
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGTCCGT | 20 | 0.0018418464 | 37.0 | 8 |
| GCTAACG | 25 | 0.0054960777 | 29.6 | 3 |
| GGTATCA | 1220 | 0.0 | 24.262295 | 1 |
| GTATCAA | 3265 | 0.0 | 23.061255 | 1 |
| GTCGCCA | 145 | 0.0 | 22.965519 | 12 |
| TCGCCAT | 200 | 0.0 | 21.275002 | 13 |
| TCGCAAG | 45 | 0.0038255388 | 20.555555 | 19 |
| GCAACGC | 45 | 0.0038255388 | 20.555555 | 3 |
| CGTCGCA | 45 | 0.0038255388 | 20.555555 | 17 |
| GTCGCAA | 55 | 5.142589E-4 | 20.181818 | 18 |
| GGCCGCA | 190 | 0.0 | 19.473684 | 33 |
| CGAGTCG | 80 | 1.6164018E-5 | 18.5 | 21 |
| ACGTCGC | 50 | 0.0070343907 | 18.5 | 16 |
| TTGGCCG | 185 | 0.0 | 18.0 | 31 |
| TACCCGA | 65 | 0.001579781 | 17.076923 | 30 |
| TTAACGG | 130 | 1.3924364E-8 | 17.076923 | 35 |
| ATCGGGA | 155 | 4.0017767E-10 | 16.709679 | 21 |
| ATTCGTG | 80 | 3.382134E-4 | 16.1875 | 11 |
| TTGTCCG | 115 | 1.2418077E-6 | 16.086956 | 13 |
| ATTACGC | 70 | 0.0025922451 | 15.857143 | 3 |