Basic Statistics
Measure | Value |
---|---|
Filename | ERR1041440.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 1746222 |
Sequences flagged as poor quality | 0 |
Sequence length | 43 |
%GC | 49 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT | 2608 | 0.14935099889933812 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2059 | 0.1179116973672305 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT | 1878 | 0.10754646316447737 | No Hit |
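The Percentage column above appears consistent with each sequence's count divided by the Total Sequences value from Basic Statistics, multiplied by 100. The following is a minimal sketch that checks this relationship for the three rows; the variable and function names (`total_sequences`, `overrepresented`, `percentage_of_total`) are illustrative and are not part of the report or of FastQC itself.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
total_sequences = 1746222

overrepresented = {
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTT": 2608,
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTT": 2059,
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT": 1878,
}


def percentage_of_total(count, total):
    """Return count as a percentage of total (assumed formula: 100 * count / total)."""
    return 100.0 * count / total


# Recompute the percentage for every listed sequence.
percentages = {
    seq: percentage_of_total(count, total_sequences)
    for seq, count in overrepresented.items()
}

for seq, pct in percentages.items():
    print(f"{seq}\t{pct:.6f}")
```

Under this assumption, the recomputed values agree with the reported percentages to within floating-point rounding.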
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTAACGC | 60 | 1.3380723E-6 | 24.666666 | 3 |
GGTATCA | 955 | 0.0 | 24.408377 | 1 |
TCTAACG | 85 | 1.246066E-6 | 19.588236 | 2 |
GTACTAT | 190 | 0.0 | 18.5 | 1 |
TCTATAC | 285 | 0.0 | 17.526316 | 3 |
CCGTTTA | 75 | 2.0680415E-4 | 17.266666 | 13 |
TAACGCC | 205 | 0.0 | 17.146341 | 4 |
GTATACT | 195 | 0.0 | 17.076923 | 4 |
GTAATCG | 185 | 1.8189894E-12 | 17.000002 | 20 |
CGTATTA | 110 | 7.813305E-7 | 16.818182 | 15 |
GGGGTTA | 335 | 0.0 | 16.567163 | 6 |
ACCGCGT | 90 | 4.4480847E-5 | 16.444445 | 8 |
GTACTAG | 135 | 2.2213499E-8 | 16.444443 | 1 |
ATCGATA | 205 | 0.0 | 16.243902 | 23 |
GCGTTTA | 240 | 0.0 | 16.1875 | 32 |
TCTAGCG | 195 | 1.8189894E-12 | 16.128206 | 28 |
TAATCGA | 195 | 1.8189894E-12 | 16.128206 | 21 |
ATACCAT | 255 | 0.0 | 15.960784 | 6 |
AATCGAT | 210 | 0.0 | 15.857144 | 22 |
CTATACC | 445 | 0.0 | 15.382022 | 4 |