Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1632089.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 1036296 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 50 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2426 | 0.23410299759914155 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 1892 | 0.18257331882010544 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 1845 | 0.17803793510734384 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 515 | 0.0 | 26.223303 | 1 |
| CCGTATA | 40 | 0.0019310598 | 23.125 | 2 |
| GCGGTAA | 50 | 0.007034427 | 18.499998 | 34 |
| CGCTCTA | 50 | 0.007034427 | 18.499998 | 14 |
| CCCGTAT | 50 | 0.007034427 | 18.499998 | 1 |
| GTGCTAT | 105 | 4.7961294E-7 | 17.619047 | 1 |
| GTATTAT | 95 | 3.6056936E-6 | 17.526316 | 1 |
| GCTTATA | 95 | 3.6056936E-6 | 17.526316 | 1 |
| CTTATAC | 1185 | 0.0 | 16.548523 | 37 |
| GTATATA | 150 | 4.667527E-9 | 16.033333 | 1 |
| TACTAGG | 70 | 0.002592262 | 15.857143 | 2 |
| GTCCAAT | 110 | 1.4515146E-5 | 15.136364 | 1 |
| GTATCAA | 870 | 0.0 | 15.097702 | 2 |
| TCGTCTA | 75 | 0.004104043 | 14.8 | 29 |
| TAATACT | 100 | 1.0931989E-4 | 14.799999 | 4 |
| ATCAACG | 1435 | 0.0 | 14.310103 | 2 |
| TCAACGC | 1450 | 0.0 | 14.162069 | 3 |
| CGGAGTG | 145 | 8.906736E-7 | 14.034483 | 25 |
| TATCAAC | 1465 | 0.0 | 14.017065 | 1 |
| CAACGCA | 1495 | 0.0 | 13.983277 | 4 |