Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR522139_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 35161354 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| TCTCTGAGCGGGCTGGCAAGGCAGACCGATATCGTATGCCGTCTTCTGCT | 317408 | 0.9027183651687589 | Illumina PCR Primer Index 1 (95% over 24bp) |
| TCTCTGAGCGGGCTGGCAAGGCAGACCGATCTCGTATGCCGTCTTCTGCT | 79994 | 0.2275054595451586 | Illumina Paired End PCR Primer 2 (96% over 29bp) |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 58704 | 0.1669560279163311 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 52621 | 0.14965578401787372 | No Hit |
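
The Percentage column above appears to be each sequence's count as a share of Total Sequences from Basic Statistics. The following is a minimal sketch of that arithmetic, using only values copied from this report; the names `TOTAL_SEQUENCES`, `overrepresented_counts`, and `percentage_of_total` are illustrative and are not part of FastQC.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 35161354

overrepresented_counts = {
    "TCTCTGAGCGGGCTGGCAAGGCAGACCGATATCGTATGCCGTCTTCTGCT": 317408,
    "TCTCTGAGCGGGCTGGCAAGGCAGACCGATCTCGTATGCCGTCTTCTGCT": 79994,
    "GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 58704,
    "TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 52621,
}


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed definition of the column)."""
    return 100.0 * count / total


# Recompute the column so it can be compared with the reported values.
percentages = {
    seq: percentage_of_total(count)
    for seq, count in overrepresented_counts.items()
}

for seq, pct in percentages.items():
    print(f"{seq}\t{pct:.4f}%")
```

Under this assumption the recomputed values agree with the reported column to within rounding, and the four listed sequences together account for roughly 1.45% of all reads.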
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GCCGTCT | 54415 | 0.0 | 34.53951 | 38 |
| TATGCCG | 55585 | 0.0 | 34.170692 | 35 |
| CCGTCTT | 56220 | 0.0 | 33.470634 | 39 |
| ATGCCGT | 56150 | 0.0 | 33.454395 | 36 |
| TGCCGTC | 56965 | 0.0 | 32.95502 | 37 |
| CGTCTTC | 57130 | 0.0 | 32.83504 | 40 |
| TATCGTA | 45895 | 0.0 | 32.14626 | 30 |
| ATATCGT | 46010 | 0.0 | 32.05177 | 29 |
| ATCGTAT | 46475 | 0.0 | 31.988777 | 31 |
| TCGTATG | 59355 | 0.0 | 31.959572 | 32 |
| ACCGATA | 46250 | 0.0 | 31.9377 | 25 |
| CCGATAT | 46280 | 0.0 | 31.91686 | 26 |
| CGTATGC | 59770 | 0.0 | 31.807312 | 33 |
| CGATATC | 46660 | 0.0 | 31.733694 | 27 |
| GCGGGCT | 62720 | 0.0 | 31.700056 | 8 |
| GATATCG | 46710 | 0.0 | 31.69917 | 28 |
| TGAGCGG | 63085 | 0.0 | 31.631487 | 5 |
| AGACCGA | 60780 | 0.0 | 31.579397 | 23 |
| GACCGAT | 60245 | 0.0 | 31.354334 | 24 |
| CGGGCTG | 64865 | 0.0 | 30.709314 | 9 |
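
The k-mers listed above can be cross-checked against the most abundant overrepresented sequence. The sketch below looks up each 7-mer in that sequence and compares its 1-based start offset with the reported Max Obs/Exp Position. All values are copied from this report; the helper `one_based_offset` and the variable names are illustrative. Reading the agreement as evidence that these k-mers originate from that primer-like sequence is an interpretation, not something the report states.

```python
# Most abundant entry in the Overrepresented sequences table.
overrepresented = "TCTCTGAGCGGGCTGGCAAGGCAGACCGATATCGTATGCCGTCTTCTGCT"

# k-mer -> Max Obs/Exp Position, copied from the Kmer Content table.
kmer_positions = {
    "GCCGTCT": 38, "TATGCCG": 35, "CCGTCTT": 39, "ATGCCGT": 36,
    "TGCCGTC": 37, "CGTCTTC": 40, "TATCGTA": 30, "ATATCGT": 29,
    "ATCGTAT": 31, "TCGTATG": 32, "ACCGATA": 25, "CCGATAT": 26,
    "CGTATGC": 33, "CGATATC": 27, "GCGGGCT": 8, "GATATCG": 28,
    "TGAGCGG": 5, "AGACCGA": 23, "GACCGAT": 24, "CGGGCTG": 9,
}


def one_based_offset(kmer, sequence):
    """Return the 1-based start of the first occurrence of kmer, or None."""
    index = sequence.find(kmer)
    return index + 1 if index >= 0 else None


# True where the k-mer's offset in the sequence equals the reported position.
matches = {
    kmer: one_based_offset(kmer, overrepresented) == position
    for kmer, position in kmer_positions.items()
}

for kmer, ok in matches.items():
    print(f"{kmer}\treported={kmer_positions[kmer]}\tmatch={ok}")
```

If every k-mer matches, trimming or filtering reads that contain this sequence would be a reasonable next step to consider before downstream analysis.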