Basic Statistics
Measure | Value |
---|---|
Filename | SRR2031472_2.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 490381 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 44 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1556 | 0.3173043001258205 | No Hit |
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1246 | 0.2540881477871288 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 646 | 0.1317343045509512 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 605 | 0.0 | 61.985405 | 1 |
GTATCAA | 1305 | 0.0 | 42.195408 | 1 |
TCAACGC | 1520 | 0.0 | 35.29368 | 4 |
TGGTATC | 285 | 0.0 | 34.981346 | 2 |
CAACGCA | 1575 | 0.0 | 34.664055 | 5 |
ATCAACG | 1550 | 0.0 | 34.304283 | 3 |
AACGCAG | 1640 | 0.0 | 33.28338 | 6 |
TATCAAC | 1660 | 0.0 | 32.03111 | 2 |
GTGGTAT | 360 | 0.0 | 30.327951 | 1 |
GGCCGTA | 65 | 0.005859 | 29.212206 | 1 |
ACGCAGA | 1870 | 0.0 | 29.1897 | 7 |
CGCAGAG | 1890 | 0.0 | 28.880814 | 8 |
TACATGG | 1615 | 0.0 | 28.514204 | 2 |
GTACATG | 1685 | 0.0 | 27.045128 | 1 |
ACATGGG | 1680 | 0.0 | 25.99804 | 3 |
TAGCCCT | 500 | 0.0 | 25.636328 | 4 |
GTATAAT | 315 | 0.0 | 25.61864 | 1 |
GTATAGC | 205 | 9.604264E-10 | 25.47162 | 1 |
CATGGGG | 1025 | 0.0 | 25.011053 | 4 |
GCAGAGT | 2240 | 0.0 | 23.944391 | 9 |