Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1042536.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 9090984 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 50 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 15690 | 0.17258857787011836 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 11740 | 0.12913893589516823 | No Hit |
| GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTT | 11349 | 0.1248379713351162 | No Hit |
| GGGTAGGCACACGCTGAGCCAGTCAGTGTAGCGCGCGTGCAGC | 11138 | 0.12251699045999861 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 10685 | 0.11753403151958028 | No Hit |
| GTGTAGCGCGCGTGCAGCCCCGGACATCTAAGGGCATCACAGA | 10127 | 0.11139608209628353 | No Hit |
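
The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics (9090984), times 100. The following minimal sketch checks that reading against the reported rows. The names `TOTAL_SEQUENCES`, `ROWS`, and `percentage` are illustrative and not part of the report, and the assumption that the tool derives the column in exactly this way is inferred from the numbers rather than stated by the report.

```python
# Sketch: reproduce the "Percentage" column of the Overrepresented
# sequences table from Count and Total Sequences.
# Assumption: percentage = 100 * count / total sequences.

TOTAL_SEQUENCES = 9090984  # "Total Sequences" from Basic Statistics

# (count, percentage as reported in the table), in table order
ROWS = [
    (15690, 0.17258857787011836),
    (11740, 0.12913893589516823),
    (11349, 0.1248379713351162),
    (11138, 0.12251699045999861),
    (10685, 0.11753403151958028),
    (10127, 0.11139608209628353),
]


def percentage(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total."""
    return 100.0 * count / total


if __name__ == "__main__":
    for count, reported in ROWS:
        computed = percentage(count)
        # Print computed and reported values side by side for comparison.
        print(f"{count}\t{computed:.6f}\t{reported:.6f}")
```

Each listed sequence accounts for between roughly 0.11% and 0.17% of the reads.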
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 10945 | 0.0 | 18.102787 | 1 |
| TCGTTTA | 2085 | 0.0 | 17.21343 | 30 |
| AAGACGG | 2695 | 0.0 | 16.749537 | 5 |
| GACGGAC | 2665 | 0.0 | 16.174484 | 7 |
| TCTAGCG | 1615 | 0.0 | 16.037151 | 28 |
| CAAGACG | 2875 | 0.0 | 15.893913 | 4 |
| CGCAAGA | 2815 | 0.0 | 15.772647 | 2 |
| ACGGACC | 2785 | 0.0 | 15.610413 | 8 |
| TAACGCC | 2525 | 0.0 | 15.239604 | 4 |
| CTAGCGG | 1725 | 0.0 | 15.014492 | 29 |
| ATAACGC | 2625 | 0.0 | 14.940952 | 3 |
| CGTTTAT | 2425 | 0.0 | 14.876288 | 31 |
| AGACGGA | 3010 | 0.0 | 14.75083 | 6 |
| CCGGTCG | 2445 | 0.0 | 14.678937 | 20 |
| CGAACGA | 1800 | 0.0 | 14.594444 | 16 |
| CGCAATA | 1790 | 0.0 | 14.572626 | 36 |
| GCGCAAG | 3155 | 0.0 | 14.541997 | 1 |
| CGCATCG | 2505 | 0.0 | 14.401198 | 13 |
| ATGGTCG | 2340 | 0.0 | 14.388888 | 36 |
| AATAACG | 2600 | 0.0 | 14.230769 | 2 |