Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1546852_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 4814161 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 51 |
| %GC | 45 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 41391 | 0.8597759817338888 | No Hit |
| GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATAC | 6619 | 0.13749020857424585 | No Hit |
| GAATGATACCTGTCTCTTATACACATCTGACGCCACCTTTCTCGTATGCCG | 6029 | 0.12523469821636626 | No Hit |
| GAATCTGTCTCTTATACACATCTGACGCCACCTTTCTCGTATGCCGTCTTC | 5823 | 0.12095565561683541 | TruSeq Adapter, Index 27 (95% over 22bp) |
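
The Percentage column above appears to be each sequence's Count divided by Total Sequences (4814161, from Basic Statistics), times 100. The sketch below recomputes it under that assumption; the names `TOTAL_SEQUENCES`, `percentage_of_total`, and `counts` are illustrative and are not part of the FastQC output.

```python
# Total Sequences, taken from the Basic Statistics table.
TOTAL_SEQUENCES = 4814161

def percentage_of_total(count, total):
    """Return count as a percentage of total (assumed definition of the column)."""
    return 100.0 * count / total

# Counts of the four overrepresented sequences, in table order.
counts = [41391, 6619, 6029, 5823]
percentages = [percentage_of_total(c, TOTAL_SEQUENCES) for c in counts]

for count, pct in zip(counts, percentages):
    print(f"{count}\t{pct:.4f}%")
```

Taken together, the four listed sequences account for roughly 1.24% of all reads.
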
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGTTTTT | 25600 | 0.0 | 43.13672 | 1 |
| CGACGGT | 515 | 0.0 | 39.757282 | 28 |
| TACGGGT | 820 | 0.0 | 39.512196 | 4 |
| CACGACG | 505 | 0.0 | 38.76238 | 26 |
| TACGGGA | 965 | 0.0 | 38.704662 | 4 |
| TCACGAC | 525 | 0.0 | 37.714287 | 25 |
| GGGCGAT | 7015 | 0.0 | 37.590878 | 7 |
| CTATCGA | 60 | 1.5643309E-10 | 37.499996 | 27 |
| AGGGCGA | 3705 | 0.0 | 37.22672 | 6 |
| TAGGGTA | 3545 | 0.0 | 37.002823 | 5 |
| CGGGTAT | 735 | 0.0 | 36.734695 | 6 |
| CGAGGGA | 2430 | 0.0 | 36.666664 | 4 |
| CGGTCTA | 530 | 0.0 | 36.509434 | 31 |
| GTTTTTT | 32210 | 0.0 | 36.35905 | 2 |
| ATACCGG | 155 | 0.0 | 36.29032 | 2 |
| ACGGGTA | 875 | 0.0 | 36.25714 | 5 |
| CACGACC | 590 | 0.0 | 36.228813 | 27 |
| CGTAGAT | 25 | 0.0021077748 | 36.0 | 16 |
| CGTAATA | 25 | 0.0021077748 | 36.0 | 29 |
| ACACGAC | 595 | 0.0 | 35.92437 | 26 |