Basic Statistics
Measure | Value |
---|---|
Filename | SRR3126521.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 2169034 |
Sequences flagged as poor quality | 0 |
Sequence length | 100 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 33087 | 1.5254256042090626 | No Hit |
GGGCAGGGACTTAATCAACGCAAGCTTATGACCCGCACTTACTGGGAATT | 2675 | 0.12332678971376197 | No Hit |
CGCTGTCTCTTATACACATCTGACGCCTACCCTCTCGTATGCCGTCTTCT | 2462 | 0.11350675000945121 | TruSeq Adapter, Index 16 (95% over 22bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGTTTTT | 11755 | 0.0 | 87.83189 | 1 |
AGGGTAC | 1515 | 0.0 | 78.17269 | 5 |
TAGGGTA | 1055 | 0.0 | 74.83832 | 4 |
CGTAGGG | 565 | 0.0 | 73.235504 | 2 |
ATAGGGC | 1495 | 0.0 | 72.931305 | 3 |
AGTAGGG | 3660 | 0.0 | 71.94397 | 2 |
ACGGGTA | 335 | 0.0 | 71.54718 | 4 |
TAGGGCA | 1155 | 0.0 | 70.80019 | 4 |
ATAACGG | 185 | 0.0 | 68.67999 | 1 |
AAGGGTA | 1940 | 0.0 | 68.557014 | 4 |
GAATAGG | 1540 | 0.0 | 68.448746 | 1 |
AGGGCAT | 1770 | 0.0 | 67.9726 | 5 |
TAGGGCG | 360 | 0.0 | 67.884094 | 4 |
GAGGGTA | 1575 | 0.0 | 67.73489 | 4 |
GTACGGG | 530 | 0.0 | 67.42566 | 2 |
ATAGGGA | 1740 | 0.0 | 67.253876 | 3 |
GTAGGGC | 930 | 0.0 | 67.21031 | 3 |
GTAGGGA | 1735 | 0.0 | 67.17681 | 3 |
TAAGAGG | 1505 | 0.0 | 66.91377 | 1 |
GGTAAGG | 1205 | 0.0 | 66.78013 | 1 |