Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041497.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 7479012 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 45 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 54336 | 0.7265130741868043 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 52338 | 0.6997983156063929 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 38764 | 0.5183037545600943 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 23744 | 0.3174750889556 | No Hit |
| AACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 8518 | 0.11389204884281506 | No Hit |
| GAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 8013 | 0.10713982007249086 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 21910 | 0.0 | 15.696713 | 1 |
| ATCTCGT | 530 | 0.0 | 12.566038 | 37 |
| TACGCTA | 475 | 0.0 | 12.463158 | 31 |
| CTACGCT | 475 | 0.0 | 12.463158 | 30 |
| GACGCTA | 505 | 0.0 | 12.089108 | 26 |
| ACGCTAT | 490 | 0.0 | 12.081633 | 32 |
| GCGTATA | 240 | 1.4895704E-8 | 11.562499 | 9 |
| TATACTG | 1385 | 0.0 | 11.487365 | 5 |
| TTAACGG | 455 | 0.0 | 11.384616 | 35 |
| GTATCAA | 30390 | 0.0 | 11.292366 | 2 |
| CGCTATC | 585 | 0.0 | 10.752137 | 33 |
| AACGCAG | 31815 | 0.0 | 10.565615 | 5 |
| GAGTACG | 19350 | 0.0 | 10.010077 | 1 |
| TTAGACT | 910 | 0.0 | 9.961539 | 4 |
| AGTACGG | 19240 | 0.0 | 9.9326935 | 2 |
| CGCGAAA | 205 | 4.2276137E-5 | 9.92683 | 15 |
| GTATTAG | 1825 | 0.0 | 9.832876 | 1 |
| CGCTACG | 640 | 0.0 | 9.828125 | 28 |
| GAACCGT | 415 | 9.094947E-12 | 9.807229 | 6 |
| GTATAGA | 1165 | 0.0 | 9.527897 | 1 |