Basic Statistics
Measure | Value |
---|---|
Filename | SRR3127803.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 2935735 |
Sequences flagged as poor quality | 0 |
Sequence length | 100 |
%GC | 47 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 32320 | 1.1009168061831194 | No Hit |
GGGCAGGGACTTAATCAACGCAAGCTTATGACCCGCACTTACTGGGAATT | 3875 | 0.13199420247399715 | No Hit |
GCTGTCTCTTATACACATCTGACGCGGACCTTATCGTATGCCGTCTTCTG | 3817 | 0.13001854731438633 | TruSeq Adapter, Index 23 (95% over 21bp) |
CCTGTCTCTTATACACATCTGACGCGGACCTTATCGTATGCCGTCTTCTG | 3706 | 0.1262375520951312 | TruSeq Adapter, Index 23 (95% over 21bp) |
CTGTCTCTTATACACATCTGACGCGGACCTTATCGTATGCCGTCTTCTGC | 3291 | 0.11210139879791602 | TruSeq Adapter, Index 20 (95% over 23bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGTTTTT | 9685 | 0.0 | 85.33797 | 1 |
CGTAGGG | 795 | 0.0 | 76.85444 | 2 |
TAGGGCA | 2210 | 0.0 | 71.8815 | 4 |
TAGGGTC | 910 | 0.0 | 71.79036 | 4 |
TAGGGTA | 865 | 0.0 | 70.635 | 4 |
ATAGGGC | 1900 | 0.0 | 70.00444 | 3 |
GGATAGG | 2000 | 0.0 | 69.38445 | 1 |
TAGAGGG | 5960 | 0.0 | 68.843155 | 2 |
AGGGCAT | 3020 | 0.0 | 68.78728 | 5 |
ATAGGGA | 2365 | 0.0 | 67.56791 | 3 |
AAGGGTA | 2365 | 0.0 | 67.36919 | 4 |
AGTAGGG | 4625 | 0.0 | 67.27273 | 2 |
GAATAGG | 1850 | 0.0 | 66.87351 | 1 |
AGGGATG | 3500 | 0.0 | 66.8735 | 5 |
ATAGAGG | 2245 | 0.0 | 66.84125 | 1 |
AGATAGG | 1830 | 0.0 | 66.83322 | 1 |
AGGGTAT | 1560 | 0.0 | 66.58255 | 5 |
AAGAGGG | 10480 | 0.0 | 65.79026 | 2 |
ATAAGGG | 4220 | 0.0 | 65.48738 | 2 |
ATAACGG | 345 | 0.0 | 65.44739 | 1 |