Basic Statistics
Measure | Value |
---|---|
Filename | SRR3549934_1.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 2284925 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 50 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 6647 | 0.2909067037211287 | No Hit |
CCTGTCTCTTATACACATCTGACGCGTCGTTGATCGTATGCCGTCTTCTGC | 4644 | 0.20324518310229 | No Hit |
GCTGTCTCTTATACACATCTGACGCGTCGTTGATCGTATGCCGTCTTCTGC | 3716 | 0.16263115857194438 | No Hit |
CTGTCTCTTATACACATCTGACGCGTCGTTGATCGTATGCCGTCTTCTGCT | 2919 | 0.12775036379749882 | Illumina Single End Adapter 1 (95% over 21bp) |
CGCTGTCTCTTATACACATCTGACGCGTCGTTGATCGTATGCCGTCTTCTG | 2665 | 0.11663402518682232 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGAATAT | 320 | 0.0 | 39.375 | 14 |
ATAACGC | 170 | 0.0 | 38.38235 | 11 |
GCTACGA | 335 | 0.0 | 37.611942 | 10 |
TCTAGCG | 60 | 1.5643309E-10 | 37.500004 | 1 |
TACGGGA | 790 | 0.0 | 37.02532 | 4 |
CGGTCTA | 250 | 0.0 | 36.9 | 31 |
TATTACG | 55 | 2.752131E-9 | 36.81818 | 1 |
TCGTAAG | 55 | 2.752131E-9 | 36.81818 | 1 |
CGTTTTT | 3195 | 0.0 | 35.774647 | 1 |
TTACGGG | 605 | 0.0 | 35.702477 | 3 |
ATGCGAA | 145 | 0.0 | 35.689655 | 26 |
CGTAAGG | 280 | 0.0 | 35.357143 | 2 |
TAGGGCG | 685 | 0.0 | 35.14599 | 5 |
CGTTCGG | 750 | 0.0 | 35.1 | 45 |
TCATGCG | 90 | 0.0 | 35.000004 | 1 |
TAATGCG | 150 | 0.0 | 34.5 | 24 |
GGGCGAT | 3845 | 0.0 | 34.349804 | 7 |
TGTAACG | 145 | 0.0 | 34.137928 | 1 |
TACGAAT | 370 | 0.0 | 34.054054 | 12 |
TATCACG | 40 | 1.5612399E-5 | 33.75 | 1 |