Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1544724_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 1174083 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 52 |
| %GC | 43 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 4748 | 0.40440071102298564 | No Hit |
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 4111 | 0.35014560299399616 | No Hit |
| CTGTCTCTTATACACATCTGACGCGGACCTTATCGTATGCCGTCTTCTGCTT | 1703 | 0.1450493704448493 | TruSeq Adapter, Index 20 (96% over 25bp) |
| CCTGTCTCTTATACACATCTGACGCGGACCTTATCGTATGCCGTCTTCTGCT | 1487 | 0.1266520339703411 | TruSeq Adapter, Index 23 (95% over 23bp) |
| GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 1446 | 0.1231599469543465 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CTTTGCG | 75 | 0.0 | 46.000004 | 1 |
| TAGCTAT | 75 | 0.0 | 46.000004 | 11 |
| TCACGAT | 40 | 5.6115823E-9 | 46.0 | 12 |
| TCACGAA | 30 | 1.8614373E-6 | 46.0 | 22 |
| CCTTCGC | 20 | 6.31151E-4 | 46.0 | 42 |
| TCTGCGA | 30 | 1.8614373E-6 | 46.0 | 19 |
| ACAACGT | 45 | 3.092282E-10 | 46.0 | 30 |
| AGCGTTA | 30 | 1.8614373E-6 | 46.0 | 30 |
| CGTGGTA | 20 | 6.31151E-4 | 46.0 | 37 |
| CTTCGAC | 30 | 1.8614373E-6 | 46.0 | 23 |
| TATCACG | 30 | 1.8614373E-6 | 46.0 | 1 |
| CGGAATT | 20 | 6.31151E-4 | 46.0 | 14 |
| AGTCCCG | 20 | 6.31151E-4 | 46.0 | 24 |
| TACCCGC | 20 | 6.31151E-4 | 46.0 | 29 |
| TCGCAAT | 20 | 6.31151E-4 | 46.0 | 1 |
| AAATCGA | 20 | 6.31151E-4 | 46.0 | 30 |
| TCGCAAG | 55 | 1.8189894E-12 | 46.0 | 1 |
| GCGACGC | 20 | 6.31151E-4 | 46.0 | 32 |
| TCCGCGA | 20 | 6.31151E-4 | 46.0 | 30 |
| CTCGTCT | 20 | 6.31151E-4 | 46.0 | 19 |