Basic Statistics
Measure | Value |
---|---|
Filename | SRR4064233_1.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 646765 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 47 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GTACATGGGGTGGTATCAACGCAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 1421 | 0.21970885870447535 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1078 | 0.1666756859137399 | No Hit |
GTATCAACGCAGAGTACATGGGGTGGTATCAACGCAAAAAAAAAAAAAAA | 804 | 0.12431099394679675 | No Hit |
GTACATGGGTGGTATCAACGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 737 | 0.11395174445123035 | No Hit |
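
The Percentage column above appears to be each sequence's Count divided by Total Sequences (646765, from Basic Statistics), times 100. The sketch below checks that reading against the four rows. The helper name `percent_of_total` is hypothetical and is not part of FastQC.

```python
# Minimal sketch: recompute the "Percentage" column from "Count" and
# "Total Sequences". Assumes percentage = count / total * 100, which is
# inferred from the reported values, not taken from FastQC's source code.

def percent_of_total(count: int, total_sequences: int) -> float:
    """Return count as a percentage of total_sequences (hypothetical helper)."""
    if total_sequences <= 0:
        raise ValueError("total_sequences must be positive")
    return count / total_sequences * 100


TOTAL_SEQUENCES = 646765  # "Total Sequences" from Basic Statistics

# (count, reported percentage) pairs copied from the table above
REPORTED = [
    (1421, 0.21970885870447535),
    (1078, 0.1666756859137399),
    (804, 0.12431099394679675),
    (737, 0.11395174445123035),
]

if __name__ == "__main__":
    for count, reported in REPORTED:
        recomputed = percent_of_total(count, TOTAL_SEQUENCES)
        # Compare with a small tolerance rather than exact float equality
        matches = abs(recomputed - reported) < 1e-9
        print(f"{count}\t{recomputed:.6f}\t{'ok' if matches else 'differs'}")
```

Together the four listed sequences account for roughly 0.62% of the reads.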
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGCGGGA | 110 | 7.2759576E-12 | 25.99872 | 44 |
TTCGGGC | 130 | 6.730261E-11 | 22.00062 | 35 |
GTAAACG | 130 | 6.730261E-11 | 22.00062 | 27 |
CTAGACC | 50 | 0.0025798897 | 21.998917 | 3 |
TTGCCGA | 60 | 2.8717343E-4 | 21.998917 | 41 |
AGTAAAC | 145 | 1.2732926E-11 | 21.240335 | 26 |
TAAACGC | 135 | 1.1277734E-10 | 21.18578 | 28 |
TAACGGC | 125 | 9.931682E-10 | 21.120594 | 36 |
CGCTTCG | 150 | 2.1827873E-11 | 20.535498 | 32 |
AAACGCT | 145 | 2.9467628E-10 | 19.724693 | 29 |
CGCAATA | 115 | 2.0171865E-7 | 19.130972 | 36 |
GTATATC | 380 | 0.0 | 19.104322 | 3 |
ACGCTTC | 150 | 4.638423E-10 | 19.068676 | 31 |
ACTATAC | 70 | 8.1204314E-4 | 18.856216 | 3 |
TAGACAA | 70 | 8.1204314E-4 | 18.856216 | 5 |
TTATGCC | 60 | 0.0074105742 | 18.332432 | 3 |
TTAACGG | 145 | 6.2882464E-9 | 18.207409 | 35 |
ATTTCGT | 145 | 6.2937033E-9 | 18.206001 | 42 |
AACGCTT | 170 | 1.2551027E-10 | 18.119558 | 30 |
GCATATA | 85 | 1.4300687E-4 | 18.116756 | 2 |