Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1545790_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 1224814 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 46 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 4834 | 0.39467217063162247 | No Hit |
| CGTTATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2090 | 0.17063815403808252 | No Hit |
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1889 | 0.15422749903250618 | No Hit |
| GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATA | 1557 | 0.12712134250588253 | No Hit |
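
The Percentage column above appears to be each sequence's count as a share of the Total Sequences value in Basic Statistics. The sketch below reproduces that arithmetic from the reported numbers; the function and variable names are illustrative and are not part of FastQC itself, and the choice of denominator is an assumption that happens to match the values in this report.

```python
# Values copied from the report; names here are illustrative only.
total_sequences = 1224814  # "Total Sequences" from Basic Statistics

# Counts from the Overrepresented sequences table, in row order.
counts = [4834, 2090, 1889, 1557]


def percentage(count: int, total: int) -> float:
    """Return count as a percentage of total (assumed denominator)."""
    return count / total * 100


percentages = [percentage(c, total_sequences) for c in counts]

for c, p in zip(counts, percentages):
    print(f"{c:>5} reads -> {p:.4f}% of all sequences")
```

Together the four listed sequences account for under 1% of the reads, which is the sum of the Percentage column.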
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGCATTA | 20 | 7.856685E-4 | 44.0 | 12 |
| CTATACG | 20 | 7.856685E-4 | 44.0 | 1 |
| TCGCTAA | 20 | 7.856685E-4 | 44.0 | 17 |
| AATTCGC | 70 | 0.0 | 44.0 | 13 |
| CATACGT | 20 | 7.856685E-4 | 44.0 | 30 |
| ATCGGTA | 20 | 7.856685E-4 | 44.0 | 44 |
| CGCAATA | 20 | 7.856685E-4 | 44.0 | 33 |
| CGATACG | 30 | 2.5278532E-6 | 44.0 | 12 |
| CGTTTTA | 2535 | 0.0 | 43.566074 | 1 |
| TTACGGG | 280 | 0.0 | 41.642857 | 3 |
| CGCATGG | 75 | 0.0 | 41.066666 | 2 |
| CGTTATT | 1495 | 0.0 | 41.056858 | 1 |
| TAGGCCG | 70 | 0.0 | 40.857143 | 6 |
| GTTTTAT | 2995 | 0.0 | 40.621037 | 2 |
| ACACGAC | 120 | 0.0 | 40.333332 | 26 |
| TGCGTAA | 60 | 3.6379788E-12 | 40.333332 | 1 |
| CACGACC | 115 | 0.0 | 40.173916 | 27 |
| CGACTAG | 55 | 7.8216544E-11 | 40.000004 | 2 |
| GTACGGG | 355 | 0.0 | 39.661972 | 3 |
| TTACTGG | 1850 | 0.0 | 39.48108 | 40 |