Basic Statistics
Measure | Value |
---|---|
Filename | SRR1545772_1.fastq.gz |
File type | Conventional base calls |
Encoding | Illumina 1.5 |
Total Sequences | 719339 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 45 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGTTTTATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 3618 | 0.5029617468259054 | No Hit |
CGTTATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1540 | 0.21408543120837323 | No Hit |
CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1457 | 0.20254706056532457 | No Hit |
GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATA | 911 | 0.12664404404599222 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGGGTAT | 40 | 8.305506E-9 | 44.0 | 6 |
ACACGAC | 40 | 8.305506E-9 | 44.0 | 26 |
ACGCCCG | 20 | 7.8545156E-4 | 44.0 | 27 |
CCGGATA | 20 | 7.8545156E-4 | 44.0 | 20 |
CGATTAC | 20 | 7.8545156E-4 | 44.0 | 10 |
CGTTCGA | 40 | 8.305506E-9 | 44.0 | 14 |
TTATACG | 20 | 7.8545156E-4 | 44.0 | 1 |
TCTACGG | 30 | 2.5263907E-6 | 44.0 | 2 |
TTGCACG | 25 | 4.441172E-5 | 44.0 | 1 |
CTAGGCG | 40 | 8.305506E-9 | 44.0 | 5 |
CGTTTTA | 1870 | 0.0 | 43.294117 | 1 |
CGGTCTA | 140 | 0.0 | 42.428574 | 31 |
CGTTATT | 1105 | 0.0 | 41.809956 | 1 |
CGAATAT | 80 | 0.0 | 41.25 | 14 |
GTTATTT | 1185 | 0.0 | 41.21519 | 2 |
GACGGTC | 155 | 0.0 | 41.16129 | 29 |
TAAGGAC | 285 | 0.0 | 40.91228 | 5 |
GTTTTAT | 2150 | 0.0 | 40.827908 | 2 |
TTACTGG | 985 | 0.0 | 40.64975 | 40 |
ACGGGTA | 65 | 0.0 | 40.615387 | 5 |