Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1042077.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 7539341 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 26989 | 0.35797558433820675 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 23401 | 0.3103852180183918 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 15452 | 0.20495159988120978 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 7770 | 0.1030594053246829 | No Hit |
| GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTT | 7702 | 0.1021574697311078 | No Hit |
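
The Percentage column above appears to be each sequence's count as a share of Total Sequences (7539341) from Basic Statistics. The following is a minimal sketch of that arithmetic, assuming the percentage is simply `count / total * 100`; the function name `percent_of_total` is hypothetical and not part of the report.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 7539341

overrepresented_counts = {
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT": 26989,
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": 23401,
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT": 15452,
    "ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 7770,
    "GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTT": 7702,
}


def percent_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed formula)."""
    return count / total * 100


# Recompute the Percentage column from the counts.
percentages = {seq: percent_of_total(n) for seq, n in overrepresented_counts.items()}

for seq, pct in percentages.items():
    print(f"{seq}\t{pct:.4f}")
```

Under this assumption the recomputed values should agree with the table to within floating-point rounding, which also confirms that each listed sequence sits just above 0.1% of all reads.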
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| ACGGACC | 1695 | 0.0 | 18.554571 | 8 |
| GACGGAC | 1715 | 0.0 | 18.338192 | 7 |
| AAGACGG | 1795 | 0.0 | 18.139275 | 5 |
| TAACGGC | 565 | 0.0 | 16.699116 | 36 |
| CGGACCA | 1960 | 0.0 | 16.517859 | 9 |
| GGTATCA | 12135 | 0.0 | 16.510506 | 1 |
| TTAACGG | 595 | 0.0 | 16.478992 | 35 |
| AGACGGA | 1925 | 0.0 | 16.433765 | 6 |
| CGCAAGA | 2050 | 0.0 | 16.424389 | 2 |
| CTAACGC | 150 | 4.6838977E-9 | 16.033333 | 3 |
| CAAGACG | 2260 | 0.0 | 15.798673 | 4 |
| TAACGCC | 1405 | 0.0 | 15.669039 | 4 |
| GCGCAAG | 2330 | 0.0 | 15.641632 | 1 |
| TCGTTTA | 1265 | 0.0 | 15.355732 | 30 |
| TCTAGCG | 865 | 0.0 | 15.184971 | 28 |
| TACCGTC | 1220 | 0.0 | 15.012294 | 7 |
| CTAGCGG | 885 | 0.0 | 14.841807 | 29 |
| GTATTAG | 2490 | 0.0 | 14.785141 | 1 |
| TATACTG | 1385 | 0.0 | 14.693141 | 5 |
| CGCATCG | 1330 | 0.0 | 14.605263 | 13 |