Basic Statistics
Measure | Value |
---|---|
Filename | SRR1545221_1.fastq.gz |
File type | Conventional base calls |
Encoding | Illumina 1.5 |
Total Sequences | 1933303 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 45 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 7927 | 0.41002367450937593 | No Hit |
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 3965 | 0.20508942467890445 | No Hit |
GATACTGAAGCTACGAATATACTGACTATGAAGACCTATGCTTTGATTCAT | 3252 | 0.168209535701336 | No Hit |
GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATAC | 2014 | 0.10417404824799836 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGCATCG | 50 | 2.1827873E-11 | 45.0 | 21 |
TCGCTAA | 25 | 3.8917977E-5 | 45.0 | 10 |
TTACGCG | 25 | 3.8917977E-5 | 45.0 | 1 |
TTACGAG | 135 | 0.0 | 41.666664 | 1 |
TCTACGG | 60 | 3.6379788E-12 | 41.250004 | 2 |
TCGATAG | 50 | 1.0822987E-9 | 40.5 | 1 |
CGGTCTA | 185 | 0.0 | 40.135136 | 31 |
GACCGTT | 45 | 1.929402E-8 | 40.000004 | 9 |
GCGATCG | 45 | 1.929402E-8 | 40.000004 | 9 |
TTTACGG | 175 | 0.0 | 39.857143 | 2 |
TCACGAC | 205 | 0.0 | 39.512196 | 25 |
ATAGGGC | 805 | 0.0 | 39.130432 | 4 |
TACGATA | 35 | 6.2505023E-6 | 38.571426 | 29 |
TCTAACG | 35 | 6.2505023E-6 | 38.571426 | 1 |
TCGTAAG | 140 | 0.0 | 38.571426 | 1 |
AGGGATC | 1345 | 0.0 | 38.475838 | 6 |
AGGGTAC | 790 | 0.0 | 38.449368 | 6 |
CGTTTTT | 6170 | 0.0 | 38.32658 | 1 |
ACCCGCT | 1155 | 0.0 | 38.18182 | 34 |
GCGCGAC | 1185 | 0.0 | 38.16456 | 9 |