Basic Statistics
Measure | Value |
---|---|
Filename | SRR4062210_1.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 1045925 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 51 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTTCATTAAT | 1232 | 0.11779047254822286 | No Hit |
GATTAAGAGGGACGGCCGGGGGCATTCGTATTGCGCCGCTAGAGGTGAAA | 1122 | 0.10727346607070298 | No Hit |
GAATAGGACCGCGGTTCTATTTTGTTGGTTTTCGGAACTGAGGCCATGAT | 1093 | 0.10450080072662955 | No Hit |
GTTCAAAGCAGGCCCGAGCCGCCTGGATACCGCAGCTAGGAATAATGGAA | 1080 | 0.10325788177928628 | No Hit |
TCGTAGTTCCGACCATAAACGATGCCGACTGGCGATGCGGCGGCGTTATT | 1053 | 0.10067643473480412 | No Hit |
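
The Percentage column above appears to be each sequence's Count divided by Total Sequences (1045925, from Basic Statistics), times 100. The following is a minimal sketch that reproduces the column from the reported counts. The names `TOTAL_SEQUENCES`, `OVERREPRESENTED_COUNTS`, and `percentage_of_total` are hypothetical, chosen for illustration; they are not part of FastQC.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 1045925

OVERREPRESENTED_COUNTS = {
    "GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTTCATTAAT": 1232,
    "GATTAAGAGGGACGGCCGGGGGCATTCGTATTGCGCCGCTAGAGGTGAAA": 1122,
    "GAATAGGACCGCGGTTCTATTTTGTTGGTTTTCGGAACTGAGGCCATGAT": 1093,
    "GTTCAAAGCAGGCCCGAGCCGCCTGGATACCGCAGCTAGGAATAATGGAA": 1080,
    "TCGTAGTTCCGACCATAAACGATGCCGACTGGCGATGCGGCGGCGTTATT": 1053,
}


def percentage_of_total(count, total):
    """Return count as a percentage of total (assumed formula: count / total * 100)."""
    return count / total * 100


if __name__ == "__main__":
    for sequence, count in OVERREPRESENTED_COUNTS.items():
        pct = percentage_of_total(count, TOTAL_SEQUENCES)
        # Show only the first 12 bases to keep the output narrow.
        print(f"{sequence[:12]}...  {count:>5}  {pct:.4f}%")
```

Under this assumption, every listed sequence sits just above 0.1% of all reads, which matches the narrow range of values in the table.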
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GTATCAA | 1345 | 0.0 | 27.156872 | 1 |
GGTATCA | 620 | 0.0 | 23.778095 | 1 |
CGGATCG | 75 | 2.0625976E-6 | 23.468273 | 26 |
ATACGAA | 220 | 0.0 | 21.001436 | 40 |
CGAATGC | 235 | 0.0 | 19.660921 | 43 |
TACGAAT | 255 | 0.0 | 18.118887 | 41 |
GTATTAA | 135 | 5.512993E-8 | 17.928867 | 1 |
TCAACGC | 2020 | 0.0 | 17.642242 | 4 |
CGCAATA | 280 | 0.0 | 17.285246 | 36 |
TAATACC | 140 | 8.3760824E-8 | 17.28442 | 4 |
CAACGCA | 2075 | 0.0 | 17.280632 | 5 |
AACGCAG | 2075 | 0.0 | 17.280632 | 6 |
ATCAACG | 2065 | 0.0 | 17.257788 | 3 |
GTCCTAA | 90 | 2.2122369E-4 | 17.113918 | 1 |
TCGATTT | 90 | 2.2138536E-4 | 17.112284 | 30 |
TCTAGCG | 270 | 0.0 | 17.112282 | 28 |
TATCAAC | 2155 | 0.0 | 16.84409 | 2 |
GTTATAG | 170 | 2.4610927E-9 | 16.82629 | 1 |
CAATACG | 275 | 0.0 | 16.801151 | 38 |
CTAGCGG | 250 | 0.0 | 16.721144 | 29 |