Basic Statistics
| Measure | Value |
|---|---|
| Filename | ERR1041976.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 1891698 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 43 |
| %GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 6732 | 0.3558707573830495 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 2844 | 0.15034112210299955 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 2519 | 0.13316078993581426 | No Hit |
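
The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics (1891698), expressed as a percentage. The sketch below recomputes it as a sanity check under that assumption. The function name `overrepresented_percentage` and the `counts` dictionary are hypothetical helpers for illustration and are not part of the FastQC output.

```python
# Minimal sketch: recompute the Percentage column of the
# "Overrepresented sequences" table, assuming
# percentage = 100 * count / total sequences.

TOTAL_SEQUENCES = 1891698  # "Total Sequences" from Basic Statistics


def overrepresented_percentage(count: int, total_sequences: int) -> float:
    """Return the share of reads matching one sequence, as a percentage."""
    if total_sequences <= 0:
        raise ValueError("total_sequences must be positive")
    return 100.0 * count / total_sequences


# Counts copied from the table above (hypothetical container name).
counts = {
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT": 6732,
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": 2844,
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT": 2519,
}

percentages = {
    seq: overrepresented_percentage(n, TOTAL_SEQUENCES)
    for seq, n in counts.items()
}

for seq, pct in percentages.items():
    print(f"{seq}\t{pct:.4f}%")
```

The recomputed values can be compared with the reported column. Agreement to many decimal places would suggest the table was produced with the same formula.
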
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TAACGCC | 35 | 8.871521E-4 | 26.428572 | 4 |
| CTAACGC | 40 | 0.0019316458 | 23.125 | 3 |
| CTTATAC | 2460 | 0.0 | 16.845528 | 37 |
| GTCTAAC | 90 | 4.4483997E-5 | 16.444445 | 1 |
| ATTAGAG | 310 | 0.0 | 16.112904 | 3 |
| CGATTAA | 150 | 8.109237E-8 | 14.8 | 22 |
| GTATTAG | 650 | 0.0 | 14.515386 | 1 |
| CGTTTCG | 90 | 8.2774984E-4 | 14.388888 | 23 |
| TATACAC | 805 | 0.0 | 14.018634 | 37 |
| TTAACGG | 240 | 5.456968E-12 | 13.874999 | 35 |
| TATTAGA | 470 | 0.0 | 13.776596 | 2 |
| TTACACT | 260 | 1.8189894E-12 | 13.519231 | 4 |
| GTACTAG | 145 | 1.3729701E-5 | 12.75862 | 1 |
| TAGCGGA | 160 | 2.6974176E-6 | 12.71875 | 17 |
| ACCGTGT | 160 | 2.6974176E-6 | 12.71875 | 8 |
| GTATAGG | 395 | 0.0 | 12.64557 | 1 |
| AACGCAG | 4150 | 0.0 | 12.571083 | 5 |
| TATACTG | 265 | 3.274181E-11 | 12.566038 | 5 |
| GTATTGG | 300 | 1.8189894E-12 | 12.333333 | 1 |
| CCTACAC | 330 | 0.0 | 12.333333 | 3 |