Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1547921_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 1426779 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 51 |
| %GC | 42 |
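
The Encoding row above reports Illumina 1.5, which stores quality scores as ASCII characters with an offset of 64 (Phred+64) rather than the offset of 33 used by Sanger and Illumina 1.8+ files. As a minimal sketch of what that implies when reading the quality line of a record, assuming a hypothetical helper name `decode_phred64` and a made-up quality string (neither comes from this report):

```python
# Illumina 1.5 quality strings use an ASCII offset of 64 (Phred+64).
ILLUMINA_1_5_OFFSET = 64


def decode_phred64(quality_string):
    """Convert a Phred+64 quality string into a list of integer scores."""
    return [ord(ch) - ILLUMINA_1_5_OFFSET for ch in quality_string]


# Hypothetical quality string for illustration only, not taken from the file.
example_quality = "hhhgfB"
scores = decode_phred64(example_quality)
print(scores)
```

Tools that assume Phred+33 would misread these scores, so the encoding may need to be stated explicitly when passing this file to downstream software.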
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 12181 | 0.8537411890699261 | No Hit |
| GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATAC | 2740 | 0.19204095378471367 | No Hit |
| CTGTCTCTTATACACATCTGACGCTATAGGACTCGTATGCCGTCTTCTGCT | 2179 | 0.1527216198163836 | TruSeq Adapter, Index 21 (95% over 23bp) |
| AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 1932 | 0.13540989880002438 | No Hit |
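
The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics (1426779), expressed as a percentage. A small sketch that reproduces the column under that assumption (the function name `percentage_of_total` is illustrative, not part of FastQC):

```python
# Total Sequences as reported in the Basic Statistics table.
TOTAL_SEQUENCES = 1426779

# Counts from the Overrepresented sequences table, in row order.
overrepresented_counts = [12181, 2740, 2179, 1932]


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of the total number of sequences."""
    return 100.0 * count / total


percentages = [percentage_of_total(c) for c in overrepresented_counts]
for count, pct in zip(overrepresented_counts, percentages):
    # Rounded for display; the report prints the full floating-point value.
    print(f"{count}\t{pct:.4f}%")
```

Taken together, the four listed sequences account for roughly 1.3% of all reads.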
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CCTAGCG | 35 | 1.2122109E-7 | 45.000004 | 1 |
| TAACGAC | 30 | 2.1658707E-6 | 45.000004 | 22 |
| GCGATCG | 30 | 2.1658707E-6 | 45.000004 | 9 |
| TGATCGC | 20 | 7.033859E-4 | 45.0 | 28 |
| CTATGCG | 45 | 3.8562575E-10 | 45.0 | 1 |
| CTCACGG | 40 | 6.8175723E-9 | 45.0 | 2 |
| GACCGTT | 20 | 7.033859E-4 | 45.0 | 23 |
| GCTATCG | 20 | 7.033859E-4 | 45.0 | 41 |
| CCTCGCG | 45 | 3.8562575E-10 | 45.0 | 20 |
| TAGTTCG | 20 | 7.033859E-4 | 45.0 | 1 |
| CCCGTAG | 40 | 6.8175723E-9 | 45.0 | 43 |
| CTAAACG | 55 | 1.8189894E-12 | 45.0 | 1 |
| CTACGCT | 20 | 7.033859E-4 | 45.0 | 40 |
| TGTTACG | 20 | 7.033859E-4 | 45.0 | 1 |
| GCGGCCC | 20 | 7.033859E-4 | 45.0 | 38 |
| CGCCGTT | 20 | 7.033859E-4 | 45.0 | 26 |
| GTGACCG | 20 | 7.033859E-4 | 45.0 | 9 |
| CGATCGT | 20 | 7.033859E-4 | 45.0 | 10 |
| TCGACTG | 20 | 7.033859E-4 | 45.0 | 1 |
| GCGTCGC | 20 | 7.033859E-4 | 45.0 | 9 |