Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041854.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 1186112 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 54 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 3524 | 0.29710516376193813 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 3337 | 0.28133936761452544 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 2137 | 0.18016848324610155 | No Hit |
| CCCACGCCCTCTGCATCTTTTTTCTTTTCCCCCACGGATCTTT | 1318 | 0.11111935466465224 | No Hit |
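
The Percentage column above appears to be each sequence's count as a share of Total Sequences (1186112, from Basic Statistics). The following is a minimal sketch that recomputes it under that assumption. The helper name `percentage_of_total` and the tolerance are illustrative choices and are not part of the report or of any tool's API.

```python
import math

# "Total Sequences" as listed under Basic Statistics.
TOTAL_SEQUENCES = 1186112

# (count, percentage as reported) for the four overrepresented sequences,
# in table order.
REPORTED = [
    (3524, 0.29710516376193813),
    (3337, 0.28133936761452544),
    (2137, 0.18016848324610155),
    (1318, 0.11111935466465224),
]


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed definition)."""
    return 100.0 * count / total


def matches_report(count, reported_pct, rel_tol=1e-9):
    """Check a recomputed percentage against the reported value."""
    return math.isclose(percentage_of_total(count), reported_pct, rel_tol=rel_tol)


if __name__ == "__main__":
    for count, reported_pct in REPORTED:
        computed = percentage_of_total(count)
        status = "ok" if matches_report(count, reported_pct) else "differs"
        print(f"{count:>5}  {computed:.4f}%  {status}")
```

Rounding the recomputed values to about four decimal places, as the sketch prints them, is usually enough for reading the table.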
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 1690 | 0.0 | 29.008877 | 1 |
| GTATCAA | 2330 | 0.0 | 20.881973 | 2 |
| TATACTG | 125 | 1.8189894E-11 | 20.72 | 5 |
| GTATACG | 45 | 0.0038258794 | 20.555555 | 1 |
| GTATTGG | 110 | 1.7553248E-9 | 20.181818 | 1 |
| CTTAATA | 70 | 1.21923855E-4 | 18.5 | 2 |
| TTATACC | 70 | 1.21923855E-4 | 18.5 | 4 |
| TTAATAC | 75 | 2.0673365E-4 | 17.266666 | 3 |
| TAATACT | 100 | 5.878108E-6 | 16.650002 | 4 |
| CCTTATA | 80 | 3.3826803E-4 | 16.1875 | 2 |
| TGTCCGT | 80 | 3.3826803E-4 | 16.1875 | 10 |
| GTACTGT | 195 | 1.8189894E-12 | 16.128206 | 6 |
| TACTAGG | 115 | 1.2422006E-6 | 16.086956 | 2 |
| TAATATA | 70 | 0.0025925583 | 15.857143 | 4 |
| TAGCACC | 140 | 3.4728146E-8 | 15.857143 | 4 |
| GTTACAC | 70 | 0.0025925583 | 15.857143 | 3 |
| TTAGGCA | 105 | 9.343525E-6 | 15.857142 | 4 |
| TAAGACT | 105 | 9.343525E-6 | 15.857142 | 4 |
| CCTACAC | 235 | 0.0 | 15.744679 | 3 |
| GTGTACA | 295 | 0.0 | 15.677965 | 1 |