Basic Statistics
Measure | Value |
---|---|
Filename | SRR1547200_1.fastq.gz |
File type | Conventional base calls |
Encoding | Illumina 1.5 |
Total Sequences | 1190739 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5255 | 0.4413225736286458 | No Hit |
GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATA | 1407 | 0.11816191457573826 | No Hit |
GATACTGAAGCTACGAATATACTGACTATGAAGACCTATGCTTTGATTCA | 1318 | 0.11068756461323599 | No Hit |
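The Percentage column above appears to be each sequence's Count divided by Total Sequences from Basic Statistics, times 100. This is an assumption checked only against the three rows reported here. A minimal sketch of that arithmetic, with a hypothetical helper named `percentage` and values copied from the tables:

```python
# Total Sequences, copied from the Basic Statistics table.
TOTAL_SEQUENCES = 1190739

# Sequence -> (Count, reported Percentage), copied from the table above.
OVERREPRESENTED = {
    "CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": (5255, 0.4413225736286458),
    "GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATA": (1407, 0.11816191457573826),
    "GATACTGAAGCTACGAATATACTGACTATGAAGACCTATGCTTTGATTCA": (1318, 0.11068756461323599),
}


def percentage(count: int, total: int) -> float:
    """Share of all reads, in percent (assumed definition, not taken from the tool's source)."""
    return 100.0 * count / total


for sequence, (count, reported) in OVERREPRESENTED.items():
    computed = percentage(count, TOTAL_SEQUENCES)
    # Print the recomputed value next to the reported one for comparison.
    print(f"{sequence[:12]}...  computed={computed:.10f}  reported={reported:.10f}")
```

Under this assumption the three rows together account for well under 1% of the reads.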
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GTTACGC | 20 | 7.856594E-4 | 44.000004 | 12 |
TATTCGC | 20 | 7.856594E-4 | 44.000004 | 12 |
CGCGTAT | 20 | 7.856594E-4 | 44.000004 | 34 |
CGACGTC | 30 | 2.527795E-6 | 44.0 | 18 |
TAATGCG | 60 | 0.0 | 44.0 | 1 |
ATCGAAT | 30 | 2.527795E-6 | 44.0 | 15 |
CGGTCTA | 145 | 0.0 | 40.965515 | 31 |
CGTTTTT | 3695 | 0.0 | 40.725307 | 1 |
TATAGCG | 130 | 0.0 | 40.615387 | 1 |
TACGGGA | 225 | 0.0 | 40.08889 | 4 |
TCGTACA | 55 | 7.8216544E-11 | 40.0 | 34 |
TACGAAT | 195 | 0.0 | 39.48718 | 12 |
GGCGATA | 490 | 0.0 | 38.163265 | 8 |
GCGATAT | 135 | 0.0 | 37.481483 | 9 |
ATAGCGG | 235 | 0.0 | 37.44681 | 2 |
ATTAGCG | 100 | 0.0 | 37.399998 | 1 |
TAAGGGA | 1230 | 0.0 | 37.382114 | 4 |
GGGCGAT | 2525 | 0.0 | 37.37822 | 7 |
CACGACG | 165 | 0.0 | 37.333332 | 26 |
ATAGGGA | 1415 | 0.0 | 36.848057 | 4 |