Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1042371.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 11356624 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 52621 | 0.4633507281741475 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 43670 | 0.3845332908794022 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 36418 | 0.32067628548765903 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 18914 | 0.16654597352170855 | No Hit |
| GTGTAGCGCGCGTGCAGCCCCGGACATCTAAGGGCATCACAGA | 11666 | 0.10272418986487535 | No Hit |
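
The Percentage column above appears to be each Count expressed as a share of Total Sequences (11356624, from Basic Statistics). A minimal Python sketch checking that assumption against the reported rows; the helper name `percent_of_total` is illustrative and not part of FastQC:

```python
def percent_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * count / total


# Total Sequences value taken from the Basic Statistics table.
TOTAL_SEQUENCES = 11356624

# (count, reported percentage) pairs copied from the table above.
reported = [
    (52621, 0.4633507281741475),
    (43670, 0.3845332908794022),
    (11666, 0.10272418986487535),
]

for count, pct in reported:
    computed = percent_of_total(count, TOTAL_SEQUENCES)
    # Compare with a small tolerance rather than exact float equality.
    print(count, computed, abs(computed - pct) < 1e-9)
```

If the assumption holds, each row prints `True` in the last column. The five listed sequences together account for roughly 1.4% of all reads.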
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| ACGGACC | 2485 | 0.0 | 17.197184 | 8 |
| TCTAGCG | 1580 | 0.0 | 16.743671 | 28 |
| GACGGAC | 2510 | 0.0 | 16.731075 | 7 |
| AAGACGG | 2660 | 0.0 | 16.204887 | 5 |
| GGTATCA | 23025 | 0.0 | 16.133768 | 1 |
| CTAGCGG | 1715 | 0.0 | 15.425656 | 29 |
| CGGACCA | 2795 | 0.0 | 14.958855 | 9 |
| AGACGGA | 2870 | 0.0 | 14.761325 | 6 |
| CGCAAGA | 2915 | 0.0 | 14.6603775 | 2 |
| TAACGCC | 2045 | 0.0 | 14.474327 | 4 |
| TTAACGG | 530 | 0.0 | 13.962264 | 35 |
| TCGTTTA | 1980 | 0.0 | 13.921718 | 30 |
| TACGACG | 1955 | 0.0 | 13.815856 | 5 |
| TATACTG | 1895 | 0.0 | 13.569921 | 5 |
| CGCAATA | 2020 | 0.0 | 13.554456 | 36 |
| GCGCAAG | 3295 | 0.0 | 13.36267 | 1 |
| GTATACG | 540 | 0.0 | 13.361111 | 1 |
| GTACTAG | 735 | 0.0 | 13.340137 | 1 |
| CGCATCG | 2115 | 0.0 | 13.295507 | 13 |
| CGAACGA | 1520 | 0.0 | 13.266448 | 16 |