Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041430.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 4601638 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 46 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 16814 | 0.36539162793770397 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 14331 | 0.31143258118087513 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 12131 | 0.2636235184080104 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 7691 | 0.16713613717550144 | No Hit |
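
The Percentage values above are consistent with each Count taken as a share of Total Sequences (4601638) from Basic Statistics; the first row works out to 16814 / 4601638 × 100 ≈ 0.3654. The following is a minimal Python sketch that recomputes the column from the reported figures. The names `TOTAL_SEQUENCES`, `overrepresented`, and `percentage_of_total` are illustrative assumptions, not part of FastQC, and this is a cross-check rather than FastQC's own implementation.

```python
# Cross-check of the Percentage column, assuming
# Percentage = 100 * Count / Total Sequences.

TOTAL_SEQUENCES = 4601638  # "Total Sequences" from Basic Statistics

# (sequence, count, reported percentage), copied from the table above
overrepresented = [
    ("GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT", 16814, 0.36539162793770397),
    ("TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT", 14331, 0.31143258118087513),
    ("GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT", 12131, 0.2636235184080104),
    ("ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT", 7691, 0.16713613717550144),
]


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total reads."""
    return 100.0 * count / total


for seq, count, reported in overrepresented:
    computed = percentage_of_total(count)
    # Print the recomputed value next to the reported one for comparison.
    print(f"{seq}\t{count}\tcomputed={computed:.6f}\treported={reported:.6f}")
```

Rounding to a fixed number of decimals, as in the `print` call, is only a display choice; the table keeps the full-precision values as reported.
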
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 6715 | 0.0 | 17.549515 | 1 |
| GACGGAC | 645 | 0.0 | 16.348837 | 7 |
| ACGGACC | 655 | 0.0 | 16.099236 | 8 |
| TATACTG | 1355 | 0.0 | 15.154981 | 5 |
| AAGACGG | 820 | 0.0 | 15.115853 | 5 |
| GTCGCTA | 130 | 4.4506633E-6 | 14.230769 | 37 |
| CGGGTCG | 210 | 1.364242E-10 | 14.095238 | 34 |
| CGTATAC | 135 | 6.5767654E-6 | 13.703704 | 3 |
| CTAACGC | 95 | 0.0012460683 | 13.631579 | 3 |
| GTACTAG | 300 | 0.0 | 13.566666 | 1 |
| GTATTAG | 1095 | 0.0 | 13.515983 | 1 |
| TAACACG | 375 | 0.0 | 13.32 | 4 |
| CGGACCA | 820 | 0.0 | 13.085365 | 9 |
| CGCAAGA | 940 | 0.0 | 12.989362 | 2 |
| GCGCAAG | 1040 | 0.0 | 12.985577 | 1 |
| CTAATCG | 200 | 1.4659236E-8 | 12.950001 | 26 |
| TCTAGCG | 360 | 0.0 | 12.847222 | 28 |
| TCTACAC | 780 | 0.0 | 12.807693 | 3 |
| CGACGGT | 435 | 0.0 | 12.758621 | 7 |
| ACGAACG | 385 | 0.0 | 12.493506 | 15 |