Basic Statistics
Measure | Value |
---|---|
Filename | SRR3128920.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 2612007 |
Sequences flagged as poor quality | 0 |
Sequence length | 100 |
%GC | 47 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 17475 | 0.669025772136139 | No Hit |
CTGTCTCTTATACACATCTGACGCATTGCGTCTCGTATGCCGTCTTCTGC | 4438 | 0.1699076610437874 | Illumina PCR Primer Index 3 (95% over 23bp) |
CCTGTCTCTTATACACATCTGACGCATTGCGTCTCGTATGCCGTCTTCTG | 4238 | 0.1622507137232021 | TruSeq Adapter, Index 14 (95% over 24bp) |
GCTGTCTCTTATACACATCTGACGCATTGCGTCTCGTATGCCGTCTTCTG | 3439 | 0.1316612091774639 | TruSeq Adapter, Index 14 (95% over 24bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGTTTTT | 7530 | 0.0 | 79.87253 | 1 |
ACGGGTA | 300 | 0.0 | 78.32846 | 4 |
ATAGGGC | 1615 | 0.0 | 74.496925 | 3 |
AGGGTAC | 1835 | 0.0 | 74.27047 | 5 |
TAGGGCA | 1485 | 0.0 | 73.42305 | 4 |
AGGGTAT | 1480 | 0.0 | 72.39813 | 5 |
CGTAGGG | 670 | 0.0 | 72.249245 | 2 |
TAGGGTA | 1095 | 0.0 | 72.10511 | 4 |
AAGGGTA | 2000 | 0.0 | 71.90554 | 4 |
GTAGGGC | 1150 | 0.0 | 71.517296 | 3 |
TAGGGCG | 645 | 0.0 | 71.40642 | 4 |
GTAGGGT | 1090 | 0.0 | 71.14237 | 3 |
GGTAAGG | 1605 | 0.0 | 70.958084 | 1 |
GACCGAT | 280 | 0.0 | 70.49292 | 8 |
TAAGGGA | 2165 | 0.0 | 69.89867 | 3 |
AGATAGG | 1440 | 0.0 | 69.28431 | 1 |
GAATAGG | 1695 | 0.0 | 68.85627 | 1 |
TAGGGTC | 810 | 0.0 | 68.46488 | 4 |
ATAGGGA | 2115 | 0.0 | 68.44019 | 3 |
AGGGATG | 3305 | 0.0 | 67.40019 | 5 |