Basic Statistics
Measure | Value |
---|---|
Filename | SRR3552055_1.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 459810 |
Sequences flagged as poor quality | 0 |
Sequence length | 42-51 |
%GC | 41 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 20948 | 4.555794784802418 | No Hit |
CGCTGTCTCTTATACACATCTGACGCAATCTCGGTCGTATGCCGTCTTCTG | 756 | 0.1644157369348209 | No Hit |
CGGTCGGCGTCCCCCAACTTCTTAGAGGGACAAGTGGCGTTCAGCCACCCG | 726 | 0.15789130292947087 | No Hit |
CGTTTCTGTCTCTTATACACATCTGACGCAATCTCGGTCGTATGCCGTCTT | 580 | 0.12613905743676737 | No Hit |
CGTTTTTTTTCTGTCTCTTATACACATCTGACGCAATCTCGGTCGTATGCC | 485 | 0.10547834975315892 | No Hit |
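
The Percentage column above appears to be each sequence's Count as a share of the Total Sequences value (459810) from Basic Statistics. A minimal sketch that reproduces the column under that assumption follows; the names `TOTAL_SEQUENCES`, `percentage`, and `reported` are illustrative and not part of FastQC.

```python
# Assumption: Percentage = 100 * Count / Total Sequences.
TOTAL_SEQUENCES = 459810  # "Total Sequences" from Basic Statistics


def percentage(count: int, total: int = TOTAL_SEQUENCES) -> float:
    """Return count as a percentage of the total number of reads."""
    return 100.0 * count / total


# (Count, reported Percentage) pairs copied from the table above.
reported = [
    (20948, 4.555794784802418),
    (756, 0.1644157369348209),
    (726, 0.15789130292947087),
    (580, 0.12613905743676737),
    (485, 0.10547834975315892),
]

for count, reported_pct in reported:
    computed = percentage(count)
    # Compare with a small tolerance instead of exact float equality.
    matches = abs(computed - reported_pct) < 1e-6
    print(f"{count:>6}  computed={computed:.6f}  reported={reported_pct:.6f}  match={matches}")
```

Every listed sequence exceeds 0.1% of the reads, which is consistent with FastQC's documented cutoff for reporting a sequence as overrepresented.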
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTCACGA | 30 | 2.193001E-6 | 44.908985 | 24 |
CGTTAGG | 40 | 6.9230737E-9 | 44.908985 | 2 |
AAGCACG | 20 | 7.0993195E-4 | 44.908985 | 1 |
TGCGACG | 30 | 2.193001E-6 | 44.908985 | 1 |
GCTTACG | 20 | 7.0993195E-4 | 44.908985 | 1 |
GCGATAC | 20 | 7.0993195E-4 | 44.908985 | 9 |
CGGTCTA | 30 | 2.193001E-6 | 44.908985 | 31 |
CGTTTTT | 8805 | 0.0 | 44.14392 | 1 |
CCGCCCA | 45 | 1.9586878E-8 | 39.919094 | 36 |
TCGGCGT | 110 | 0.0 | 38.78503 | 4 |
GTCGGCG | 110 | 0.0 | 38.78503 | 3 |
GCGATCA | 35 | 6.3274438E-6 | 38.493412 | 9 |
ACGTAGG | 70 | 0.0 | 38.493412 | 2 |
GTTTTTT | 10370 | 0.0 | 38.477943 | 2 |
TCGCAAC | 30 | 1.15250994E-4 | 37.424152 | 16 |
ATAGCGG | 60 | 1.5825208E-10 | 37.424152 | 2 |
TAACGCC | 30 | 1.15250994E-4 | 37.424152 | 12 |
TATAGCG | 30 | 1.15250994E-4 | 37.424152 | 1 |
ACAACGA | 85 | 0.0 | 36.983868 | 13 |
CGTAGGG | 165 | 0.0 | 36.743713 | 3 |