Basic Statistics
Measure | Value |
---|---|
Filename | SRR936074_1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 6704024 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 45 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 10343 | 0.15428047393625083 | No Hit |
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 7655 | 0.11418515208179444 | No Hit |
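
The Percentage column above is consistent with each Count divided by the Total Sequences value from Basic Statistics (6704024). The following is a minimal sketch of that arithmetic, not FastQC's own code. The function name `overrepresented_percentage` and the constant `TOTAL_SEQUENCES` are hypothetical names introduced only for illustration.

```python
# Assumption: Percentage = 100 * Count / Total Sequences.
# TOTAL_SEQUENCES is copied from the Basic Statistics table above.
TOTAL_SEQUENCES = 6704024


def overrepresented_percentage(count, total=TOTAL_SEQUENCES):
    """Return the share of all reads (in percent) taken up by one sequence."""
    return 100.0 * count / total


# Compare the recomputed values with the ones reported in the table.
rows = [
    (10343, 0.15428047393625083),
    (7655, 0.11418515208179444),
]
for count, reported in rows:
    computed = overrepresented_percentage(count)
    print(f"count={count}: computed {computed:.6f}%, reported {reported:.6f}%")
```

Both rows agree with the reported values to well within rounding, which supports the assumption that the denominator is the full read count.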
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGTATCA | 2425 | 0.0 | 45.287148 | 100-101 |
CGGTATC | 2035 | 0.0 | 35.1 | 100-101 |
GGTATCA | 14870 | 0.0 | 34.21954 | 1 |
CGTACAT | 3670 | 0.0 | 34.141006 | 100-101 |
CCTGTCG | 2020 | 0.0 | 29.462587 | 96-97 |
GTATCAA | 28650 | 0.0 | 28.770283 | 1 |
CGAGTAC | 2435 | 0.0 | 27.989607 | 100-101 |
TACGCAG | 1870 | 0.0 | 27.852024 | 100-101 |
GCGTATC | 900 | 0.0 | 27.447062 | 100-101 |
TCGGTAT | 970 | 0.0 | 26.386818 | 100-101 |
CCGTACA | 2000 | 0.0 | 25.892832 | 100-101 |
ACGTATC | 1050 | 0.0 | 25.793625 | 100-101 |
TCGAGTA | 1255 | 0.0 | 25.611814 | 100-101 |
CCGTATC | 1360 | 0.0 | 25.603968 | 100-101 |
TCGTATC | 1160 | 0.0 | 24.887087 | 100-101 |
CCGGTAT | 1025 | 0.0 | 23.8095 | 100-101 |
TCGTACA | 2220 | 0.0 | 23.595001 | 100-101 |
AGTATCA | 6890 | 0.0 | 23.455297 | 100-101 |
ATCAACG | 35600 | 0.0 | 22.711166 | 3 |
TCAACGC | 36020 | 0.0 | 22.44635 | 4 |
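
To illustrate what the Obs/Exp Max and Max Obs/Exp Position columns express, the sketch below counts how often a k-mer starts at each read position and compares that with a uniform expectation across positions. This is a simplified illustration under stated assumptions, not FastQC's actual calculation: FastQC's own module differs in details such as how reads are sampled and how positions are grouped into ranges like 100-101. The function name `kmer_position_enrichment` and the toy reads are hypothetical.

```python
from collections import Counter


def kmer_position_enrichment(reads, kmer):
    """Return (1-based position, observed/expected ratio) for the position
    where `kmer` is most enriched, or None if the k-mer never occurs.

    Assumption: the expected count at a position is the overall hit rate
    multiplied by the number of k-mer windows starting at that position.
    """
    k = len(kmer)
    hits_at = Counter()      # start position -> number of matches
    windows_at = Counter()   # start position -> number of windows examined
    for read in reads:
        for start in range(len(read) - k + 1):
            windows_at[start] += 1
            if read[start:start + k] == kmer:
                hits_at[start] += 1

    total_hits = sum(hits_at.values())
    if total_hits == 0:
        return None
    overall_rate = total_hits / sum(windows_at.values())

    best_position, best_ratio = None, 0.0
    for start, n_windows in windows_at.items():
        expected = overall_rate * n_windows
        ratio = hits_at[start] / expected
        if ratio > best_ratio:
            best_position, best_ratio = start + 1, ratio
    return best_position, best_ratio


# Toy reads (made up for illustration, not taken from SRR936074_1.fastq.gz).
toy_reads = ["GTATCAAAAAAA", "GTATCAACCCCC", "CCCCCGTATCAA"]
print(kmer_position_enrichment(toy_reads, "GTATCAA"))
```

In the toy input the k-mer starts at position 1 in two of the three reads, so position 1 is reported as the most enriched. This is the same kind of pattern the table shows for GTATCAA and GGTATCA at position 1.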