Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1632334.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 125945 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 49 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 273 | 0.2167612846877605 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 249 | 0.197705347572353 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 167 | 0.132597562428044 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGGACAT | 25 | 1.2273916E-4 | 37.0 | 21 |
| ATCTAAG | 35 | 2.3715776E-5 | 31.714287 | 26 |
| CATCTAA | 35 | 2.3715776E-5 | 31.714287 | 25 |
| ACATCTA | 35 | 2.3715776E-5 | 31.714287 | 24 |
| GTAGAAC | 25 | 0.0054780017 | 29.6 | 3 |
| CTAAGGG | 45 | 3.9711085E-6 | 28.777779 | 28 |
| CCGGACA | 35 | 8.8254164E-4 | 26.428572 | 20 |
| GACATCT | 35 | 8.8254164E-4 | 26.428572 | 23 |
| GCGCGTG | 45 | 1.3140915E-4 | 24.666668 | 8 |
| AGCGCGC | 45 | 1.3140915E-4 | 24.666668 | 5 |
| GGTATCA | 90 | 1.4006218E-10 | 24.666668 | 1 |
| TAAGGGC | 40 | 0.0019217023 | 23.125002 | 29 |
| GGACATC | 40 | 0.0019217023 | 23.125002 | 22 |
| GCGCGCG | 50 | 2.683628E-4 | 22.2 | 6 |
| CGCGCGT | 50 | 2.683628E-4 | 22.2 | 7 |
| TCTAAGG | 50 | 2.683628E-4 | 22.2 | 27 |
| CGCGTGC | 50 | 2.683628E-4 | 22.2 | 9 |
| TCTTATA | 170 | 0.0 | 21.764706 | 37 |
| CCCGGAC | 45 | 0.0038071973 | 20.555557 | 19 |
| TAGCGCG | 55 | 5.108219E-4 | 20.181818 | 4 |