Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041561.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 1806218 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 48 |

Per base sequence quality

Per sequence quality scores

Per base sequence content

Per sequence GC content

Per base N content

Sequence Length Distribution

Sequence Duplication Levels

Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5353 | 0.2963651120739578 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 5281 | 0.29237888228331244 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 3819 | 0.21143627181215113 | No Hit |
| GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTT | 1875 | 0.10380806746472462 | No Hit |
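The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics, times 100. The short sketch below recomputes it under that assumption. The names `TOTAL_SEQUENCES`, `OVERREPRESENTED`, and `percentage_of_total` are illustrative and not part of the report.

```python
# Assumption: Percentage = 100 * Count / Total Sequences.
# All values are copied from the tables in this report.

TOTAL_SEQUENCES = 1806218  # "Total Sequences" in Basic Statistics

# (sequence, count) pairs from the Overrepresented sequences table
OVERREPRESENTED = [
    ("TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT", 5353),
    ("GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT", 5281),
    ("GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT", 3819),
    ("GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTT", 1875),
]


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total reads."""
    return 100.0 * count / total


if __name__ == "__main__":
    for sequence, count in OVERREPRESENTED:
        print(f"{sequence}\t{count}\t{percentage_of_total(count):.4f}%")
```

Together the four listed sequences account for 16328 reads, a little under 1% of the file. The top three are the same sequence shifted by one or two bases.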
Adapter Content

Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GTACTAG | 150 | 0.0 | 20.966667 | 1 |
| CTAGCGG | 260 | 0.0 | 18.5 | 29 |
| CTAGTAC | 115 | 6.410846E-8 | 17.695652 | 3 |
| TCTAGCG | 275 | 0.0 | 17.49091 | 28 |
| TGCGACG | 120 | 1.0421354E-7 | 16.958334 | 22 |
| CGTATTA | 100 | 5.881533E-6 | 16.650002 | 15 |
| GCGTTAT | 285 | 0.0 | 16.22807 | 1 |
| CGCAATA | 285 | 0.0 | 16.22807 | 36 |
| CCGTATT | 105 | 9.348936E-6 | 15.857142 | 14 |
| TTAACGG | 105 | 9.348936E-6 | 15.857142 | 35 |
| TTCGCGG | 140 | 3.4764525E-8 | 15.857142 | 32 |
| ATGCGAC | 155 | 7.215931E-9 | 15.5161295 | 21 |
| TTTAGAC | 180 | 2.0190782E-10 | 15.416667 | 3 |
| TACCCCG | 120 | 1.9368927E-6 | 15.416667 | 5 |
| TCGAACG | 245 | 0.0 | 15.10204 | 3 |
| TAGACTC | 135 | 3.9754923E-7 | 15.074075 | 5 |
| GGTATCA | 2720 | 0.0 | 15.03125 | 1 |
| CGAGCCG | 535 | 0.0 | 14.869159 | 15 |
| ATGACCG | 75 | 0.0041056126 | 14.8 | 5 |
| GCATAGG | 150 | 8.108509E-8 | 14.8 | 1 |