Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1547641_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 1627766 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 51 |
| %GC | 41 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 35330 | 2.170459390354633 | No Hit |
| CGTTTCTGTCTCTTATACACATCTGACGCATACCGCATCGTATGCCGTCTT | 2334 | 0.14338670300276576 | No Hit |
| CGTTTTCTGTCTCTTATACACATCTGACGCATACCGCATCGTATGCCGTCT | 1751 | 0.1075707441978761 | No Hit |
| GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATAC | 1649 | 0.10130448725431052 | No Hit |
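
The Percentage column above is consistent with Count divided by Total Sequences (1627766, from Basic Statistics), times 100. A minimal sketch that checks this against the reported rows follows. The helper name `percentage_of_total` is illustrative only and is not part of FastQC.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 1627766

# (count, reported percentage) pairs, in table order.
REPORTED = [
    (35330, 2.170459390354633),
    (2334, 0.14338670300276576),
    (1751, 0.1075707441978761),
    (1649, 0.10130448725431052),
]


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (hypothetical helper)."""
    return 100.0 * count / total


for count, reported_pct in REPORTED:
    computed = percentage_of_total(count)
    # Print both values side by side so any mismatch is easy to spot.
    print(f"{count:>6}  computed={computed:.6f}  reported={reported_pct:.6f}")
```

Together the four listed sequences account for 41064 reads, roughly 2.5% of the library, with the poly-T read alone contributing most of that.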
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGTTTTT | 16430 | 0.0 | 43.46622 | 1 |
| CCGAACC | 55 | 6.184564E-11 | 40.909092 | 18 |
| GCGATAC | 50 | 1.0822987E-9 | 40.5 | 9 |
| TAACGCG | 45 | 1.9292202E-8 | 40.000004 | 1 |
| TACGCGG | 80 | 0.0 | 39.375 | 2 |
| TCACGAC | 110 | 0.0 | 38.863636 | 25 |
| AATCGGT | 35 | 6.249893E-6 | 38.571426 | 26 |
| GGGCGAT | 2200 | 0.0 | 38.147728 | 7 |
| TAGGGAC | 1540 | 0.0 | 37.694805 | 5 |
| GTTTTTT | 19435 | 0.0 | 37.636997 | 2 |
| CGGGACC | 365 | 0.0 | 37.602737 | 6 |
| AAGGGCG | 485 | 0.0 | 37.57732 | 5 |
| TTACGCG | 30 | 1.1401305E-4 | 37.500004 | 1 |
| TAATGCG | 60 | 1.5643309E-10 | 37.500004 | 1 |
| AGGGATC | 1585 | 0.0 | 37.47634 | 6 |
| GGCACCG | 495 | 0.0 | 37.272728 | 8 |
| GACCGAT | 750 | 0.0 | 37.2 | 9 |
| TATGGGC | 660 | 0.0 | 37.159092 | 4 |
| ATATGCG | 85 | 0.0 | 37.058823 | 1 |
| CTCGTAG | 55 | 2.750312E-9 | 36.81818 | 1 |