Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041847.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 4347280 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 54 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 5180 | 0.11915496586371248 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5048 | 0.11611858449421246 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 2520 | 0.0 | 28.557539 | 1 |
| GTATCAA | 3785 | 0.0 | 19.110964 | 2 |
| TCTATAC | 260 | 0.0 | 17.076923 | 3 |
| CTAGTAC | 250 | 0.0 | 17.02 | 3 |
| GTATATA | 310 | 0.0 | 16.709679 | 1 |
| GTATACG | 80 | 3.3852734E-4 | 16.1875 | 1 |
| GTATTAG | 535 | 0.0 | 15.214952 | 1 |
| TATACCG | 195 | 4.1836756E-11 | 15.179487 | 5 |
| GTACTAG | 190 | 4.5656634E-10 | 14.605264 | 1 |
| TACGTCG | 180 | 3.3378456E-9 | 14.388888 | 5 |
| CCTACAC | 715 | 0.0 | 13.972027 | 3 |
| TACATAC | 430 | 0.0 | 13.767442 | 3 |
| CTACGCT | 190 | 7.1431714E-9 | 13.631579 | 4 |
| GTGTTAG | 750 | 0.0 | 13.566666 | 1 |
| TTATACG | 205 | 1.4279067E-9 | 13.536585 | 35 |
| GTCTTAG | 465 | 0.0 | 13.526883 | 1 |
| CGCTATA | 480 | 0.0 | 13.489583 | 2 |
| TCGCTAT | 590 | 0.0 | 13.48305 | 1 |
| GTATAGG | 385 | 0.0 | 13.454547 | 1 |
| CGCGTAT | 110 | 2.4588258E-4 | 13.454545 | 12 |