Basic Statistics
Measure | Value |
---|---|
Filename | SRR3127351.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 1384321 |
Sequences flagged as poor quality | 0 |
Sequence length | 100 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 16522 | 1.1935093088958415 | No Hit |
GCTGTCTCTTATACACATCTGACGCCGGAGTTATCGTATGCCGTCTTCTG | 2209 | 0.1595728158425683 | TruSeq Adapter, Index 22 (95% over 21bp) |
CTGTCTCTTATACACATCTGACGCCGGAGTTATCGTATGCCGTCTTCTGC | 2096 | 0.15140996921956684 | No Hit |
TCCGCTACGACCAACTCATACACCTCCTATGAAAAAACTTCCTACCACTC | 1978 | 0.14288593469289276 | No Hit |
CCTGTCTCTTATACACATCTGACGCCGGAGTTATCGTATGCCGTCTTCTG | 1964 | 0.1418746085626094 | TruSeq Adapter, Index 22 (95% over 21bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGTTTTT | 4820 | 0.0 | 84.42496 | 1 |
CGTAGGG | 510 | 0.0 | 78.531006 | 2 |
ATAACGG | 135 | 0.0 | 76.84153 | 1 |
ATAGGGC | 820 | 0.0 | 74.700226 | 3 |
ACGGGTA | 135 | 0.0 | 73.29561 | 4 |
GTACGGG | 265 | 0.0 | 72.90048 | 2 |
AATGCGG | 345 | 0.0 | 72.43757 | 1 |
TAGGGCA | 655 | 0.0 | 71.21743 | 4 |
GAGGGAT | 1575 | 0.0 | 71.20145 | 4 |
AGGGATG | 2020 | 0.0 | 71.189476 | 5 |
TAGGGTA | 530 | 0.0 | 71.12242 | 4 |
TACGCGG | 80 | 0.0 | 70.72913 | 1 |
AGGGCAT | 1305 | 0.0 | 70.58157 | 5 |
AGTAGGG | 2375 | 0.0 | 70.231514 | 2 |
TGTAGGG | 995 | 0.0 | 70.08597 | 2 |
GAATAGG | 870 | 0.0 | 68.2902 | 1 |
AGTAAGG | 970 | 0.0 | 68.05552 | 1 |
AAGAGGG | 4880 | 0.0 | 67.588165 | 2 |
ATGAGGG | 2190 | 0.0 | 67.343025 | 2 |
ATAGGGA | 1375 | 0.0 | 67.16543 | 3 |