Basic Statistics
Measure | Value |
---|---|
Filename | SRR1547685_1.fastq.gz |
File type | Conventional base calls |
Encoding | Illumina 1.5 |
Total Sequences | 549597 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 46 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1746 | 0.31768732362076213 | No Hit |
GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATAC | 1743 | 0.31714146911282265 | No Hit |
GAGCTGTCTCTTATACACATCTGACGCATGCCTGTTCGTATGCCGTCTTCT | 551 | 0.10025527795821303 | No Hit |
GAGAGCTGTCTCTTATACACATCTGACGCATGCCTGTTCGTATGCCGTCTT | 550 | 0.10007332645556653 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TCGTTTG | 35 | 1.2103737E-7 | 45.000004 | 1 |
ACACGTG | 70 | 0.0 | 45.000004 | 42 |
ACGCCGA | 25 | 3.8881386E-5 | 45.000004 | 12 |
ACCGCTC | 25 | 3.8881386E-5 | 45.000004 | 18 |
TTGAGCG | 25 | 3.8881386E-5 | 45.000004 | 1 |
GTTACAC | 25 | 3.8881386E-5 | 45.000004 | 33 |
GGGCGTA | 25 | 3.8881386E-5 | 45.000004 | 7 |
GTCAACG | 25 | 3.8881386E-5 | 45.000004 | 1 |
CGAATAT | 25 | 3.8881386E-5 | 45.000004 | 14 |
CGTACCA | 25 | 3.8881386E-5 | 45.000004 | 13 |
TAAGGAG | 35 | 1.2103737E-7 | 45.000004 | 1 |
ATAACGG | 25 | 3.8881386E-5 | 45.000004 | 2 |
ATGCACC | 25 | 3.8881386E-5 | 45.000004 | 27 |
GCGTACC | 25 | 3.8881386E-5 | 45.000004 | 12 |
TGCGTTG | 25 | 3.8881386E-5 | 45.000004 | 1 |
AGTCGTG | 25 | 3.8881386E-5 | 45.000004 | 12 |
CCCGAGT | 25 | 3.8881386E-5 | 45.000004 | 18 |
TTTAGCG | 40 | 6.8030204E-9 | 45.0 | 1 |
TCGTTAG | 20 | 7.03007E-4 | 45.0 | 1 |
CGAAAGA | 20 | 7.03007E-4 | 45.0 | 42 |