Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1547531_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 4593193 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 51 |
| %GC | 42 |
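
The Encoding row reports Illumina 1.5, which conventionally stores quality scores as ASCII characters with an offset of 64 (Phred+64) rather than the offset of 33 used by Sanger and Illumina 1.8+ files. A minimal sketch of decoding such a quality string is given here. The function name `decode_phred64` and the sample quality string are illustrative assumptions and are not taken from this report.

```python
def decode_phred64(quality_string: str, offset: int = 64) -> list:
    """Convert an ASCII quality string to integer Phred scores.

    Assumes a Phred+64 encoding, as conventionally used by Illumina 1.5.
    """
    return [ord(ch) - offset for ch in quality_string]


# Hypothetical quality string, NOT read from SRR1547531_1.fastq.gz.
example_quality = "hhhgB"
print(decode_phred64(example_quality))  # [40, 40, 40, 39, 2]
```

Decoding with the wrong offset (33 instead of 64) would inflate every score by 31, so the encoding is worth confirming before any quality-based trimming.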
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 88693 | 1.9309661057133893 | No Hit |
| CGTTTCTGTCTCTTATACACATCTGACGCGTGTGAGATCGTATGCCGTCTT | 7305 | 0.15903969199639553 | No Hit |
| CGTTTTCTGTCTCTTATACACATCTGACGCGTGTGAGATCGTATGCCGTCT | 5775 | 0.1257295306337008 | No Hit |
| CGTTCTGTCTCTTATACACATCTGACGCGTGTGAGATCGTATGCCGTCTTC | 4604 | 0.10023528295022657 | No Hit |
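
The Percentage column appears to be each Count divided by the Total Sequences value from Basic Statistics (4593193), multiplied by 100. The sketch below recomputes the four values as a consistency check. The helper name `percentage_of_total` is a hypothetical choice for illustration.

```python
TOTAL_SEQUENCES = 4593193  # "Total Sequences" from Basic Statistics

# (count, reported percentage) pairs copied from the table above
overrepresented = [
    (88693, 1.9309661057133893),
    (7305, 0.15903969199639553),
    (5775, 0.1257295306337008),
    (4604, 0.10023528295022657),
]


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (hypothetical helper)."""
    return 100.0 * count / total


for count, reported in overrepresented:
    computed = percentage_of_total(count)
    print(f"{count:>6}  computed={computed:.4f}  reported={reported:.4f}")
```

All four entries sit above 0.1% of the total, which is the level at which FastQC lists a sequence as overrepresented. The last entry clears it only narrowly.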
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGTTTTT | 41750 | 0.0 | 44.067665 | 1 |
| CGACGGT | 510 | 0.0 | 42.35294 | 28 |
| CGGTCTA | 510 | 0.0 | 41.47059 | 31 |
| TACGGGA | 2050 | 0.0 | 40.939026 | 4 |
| GGCGATA | 940 | 0.0 | 40.212765 | 8 |
| GCGCGAC | 640 | 0.0 | 39.375 | 9 |
| TAGGGAC | 4555 | 0.0 | 38.23271 | 5 |
| ACGGGAC | 1615 | 0.0 | 38.173374 | 5 |
| GCGATAC | 195 | 0.0 | 38.076923 | 9 |
| ATTCGCG | 65 | 9.094947E-12 | 38.07692 | 1 |
| TAGGGAT | 7445 | 0.0 | 38.018803 | 5 |
| ACGGGAT | 2175 | 0.0 | 37.965515 | 5 |
| AGGGATC | 6005 | 0.0 | 37.731056 | 6 |
| ATAGGGA | 6435 | 0.0 | 37.692307 | 4 |
| ATTAGCG | 275 | 0.0 | 37.636364 | 1 |
| CGGTCGA | 60 | 1.5643309E-10 | 37.499996 | 27 |
| AGTACGG | 560 | 0.0 | 37.366074 | 2 |
| TAAGGGA | 6455 | 0.0 | 37.017815 | 4 |
| AGGGCGA | 2010 | 0.0 | 36.9403 | 6 |
| GGGCGAT | 3795 | 0.0 | 36.81818 | 7 |