Basic Statistics
Measure | Value |
---|---|
Filename | SRR1546830_1.fastq.gz |
File type | Conventional base calls |
Encoding | Illumina 1.5 |
Total Sequences | 1369288 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 40 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 50189 | 3.6653355612551923 | No Hit |
CGTTTTCTGTCTCTTATACACATCTGACGCTACCAGTATCGTATGCCGTCT | 2879 | 0.21025525674657194 | No Hit |
CGTTTCTGTCTCTTATACACATCTGACGCTACCAGTATCGTATGCCGTCTT | 2647 | 0.19331214470586172 | No Hit |
GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATAC | 2173 | 0.15869561407096244 | No Hit |
CGTTTTTCTGTCTCTTATACACATCTGACGCTACCAGTATCGTATGCCGTC | 1531 | 0.1118099333376178 | No Hit |
CGTTCTGTCTCTTATACACATCTGACGCTACCAGTATCGTATGCCGTCTTC | 1461 | 0.10669778746326558 | No Hit |
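
The Percentage column above appears to be each Count divided by Total Sequences (1369288, from Basic Statistics), multiplied by 100. A minimal Python sketch of that check, assuming this relationship holds; the helper name `percentage_of_total` is illustrative and not part of any tool:

```python
# "Total Sequences" value copied from the Basic Statistics table
TOTAL_SEQUENCES = 1369288


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (hypothetical helper)."""
    return count / total * 100


# Counts copied from the Overrepresented sequences table
for count in (50189, 2879, 2647, 2173, 1531, 1461):
    print(count, round(percentage_of_total(count), 4))
```

Under this assumption, the six listed sequences together account for roughly 4.4% of all reads, most of it from the single poly-T read.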
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGTTGCG | 30 | 2.165807E-6 | 45.000004 | 1 |
CGGCTAT | 35 | 1.2121563E-7 | 45.000004 | 40 |
CCGGATA | 40 | 6.8175723E-9 | 45.0 | 20 |
CACGACG | 325 | 0.0 | 45.0 | 26 |
CTAACCG | 20 | 7.03376E-4 | 45.0 | 1 |
CGCGTCA | 40 | 6.8175723E-9 | 45.0 | 39 |
TCGGTAC | 40 | 6.8175723E-9 | 45.0 | 36 |
CGTTTTT | 20660 | 0.0 | 44.4228 | 1 |
CGACGGT | 330 | 0.0 | 44.31818 | 28 |
TCACGAC | 340 | 0.0 | 43.014706 | 25 |
ACGGTCT | 340 | 0.0 | 43.014706 | 30 |
GCCGACC | 90 | 0.0 | 42.5 | 14 |
GACGGTC | 345 | 0.0 | 42.3913 | 29 |
CGGTCTA | 345 | 0.0 | 42.3913 | 31 |
CGCGGGT | 85 | 0.0 | 42.35294 | 4 |
CGTTTTC | 400 | 0.0 | 41.625 | 1 |
GTTAGCG | 65 | 0.0 | 41.53846 | 1 |
TCGTTAG | 60 | 3.6379788E-12 | 41.250004 | 1 |
GCGATAT | 155 | 0.0 | 40.64516 | 9 |
CTCACGA | 360 | 0.0 | 40.625 | 24 |