Basic Statistics
Measure | Value |
---|---|
Filename | SRR3549069_1.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 1536225 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 44 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 8084 | 0.5262249995931586 | No Hit |
GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATAC | 3909 | 0.2544549138309818 | No Hit |
GATACTGAAGCTACGAATATACTGACTATGAAGACCTATGCTTTGATTCAT | 3389 | 0.22060570554443523 | No Hit |
AAGGAAGGAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAG | 1806 | 0.11756090416442905 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGAACCC | 30 | 2.1659816E-6 | 45.000004 | 27 |
CTACGGC | 30 | 2.1659816E-6 | 45.000004 | 18 |
ACCGTAT | 20 | 7.0340285E-4 | 45.000004 | 18 |
TCGACAC | 30 | 2.1659816E-6 | 45.000004 | 34 |
GTCGACA | 35 | 1.2122837E-7 | 45.0 | 14 |
CACGTTA | 45 | 3.8562575E-10 | 45.0 | 13 |
TATACGA | 25 | 3.8914208E-5 | 45.0 | 44 |
TGTACGC | 25 | 3.8914208E-5 | 45.0 | 25 |
CGGTCTA | 150 | 0.0 | 43.500004 | 31 |
TATAGCG | 130 | 0.0 | 43.26923 | 1 |
CGACGGT | 160 | 0.0 | 42.187504 | 28 |
TATACGG | 120 | 0.0 | 41.250004 | 2 |
CGTTTTT | 5565 | 0.0 | 41.19946 | 1 |
TCACGAC | 175 | 0.0 | 41.14286 | 25 |
ACGACGG | 175 | 0.0 | 41.14286 | 27 |
CCGTCGG | 50 | 1.0822987E-9 | 40.5 | 20 |
ACGGGTA | 235 | 0.0 | 40.212765 | 5 |
AGGGATG | 4060 | 0.0 | 40.17857 | 6 |
CGAGGGA | 645 | 0.0 | 40.116276 | 4 |
CGTATGG | 135 | 0.0 | 40.000004 | 2 |