Basic Statistics
Measure | Value |
---|---|
Filename | SRR936168_1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 3967452 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 8343 | 0.2102860979792572 | No Hit |
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5989 | 0.15095330705954352 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 4119 | 0.10381978156257467 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGTATCA | 1310 | 0.0 | 44.52875 | 100-101 |
CGGTATC | 1005 | 0.0 | 35.536194 | 100-101 |
CGTACAT | 2060 | 0.0 | 33.806828 | 100-101 |
GGTATCA | 8400 | 0.0 | 33.721054 | 1 |
GTATCAA | 16755 | 0.0 | 32.319935 | 1 |
ACGTATC | 535 | 0.0 | 28.370838 | 100-101 |
TCGTATC | 640 | 0.0 | 27.901468 | 100-101 |
TACGCAG | 1360 | 0.0 | 26.916708 | 100-101 |
TCGTACA | 1195 | 0.0 | 26.39938 | 100-101 |
CGAGTAC | 1355 | 0.0 | 25.698177 | 100-101 |
ATCAACG | 21005 | 0.0 | 25.407454 | 3 |
TATCAAC | 21340 | 0.0 | 25.176836 | 2 |
TCAACGC | 21230 | 0.0 | 25.110155 | 4 |
TCGTATA | 465 | 0.0 | 24.961311 | 100-101 |
CCGTATA | 455 | 0.0 | 24.855812 | 100-101 |
CCGTACA | 1200 | 0.0 | 24.801302 | 100-101 |
CAACGCA | 21935 | 0.0 | 24.329002 | 5 |
AGTATCA | 3995 | 0.0 | 23.839048 | 100-101 |
GGGTATC | 1865 | 0.0 | 23.777334 | 100-101 |
CGTATAG | 445 | 0.0 | 23.40797 | 100-101 |