Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1547957_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 2997942 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 51 |
| %GC | 46 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 12600 | 0.4202883177860012 | No Hit |
| GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATAC | 3939 | 0.13139013363167135 | No Hit |
| CCTGTCTCTTATACACATCTGACGCTTCTCTGCTCGTATGCCGTCTTCTGC | 3255 | 0.10857448209471697 | TruSeq Adapter, Index 23 (95% over 24bp) |
| GGGGTTGGGGATTTAGCTCAGTGGTAGAGCGCTTGCCTAGCAAGCGCAAGG | 3204 | 0.10687331509415458 | No Hit |
| CTGTCTCTTATACACATCTGACGCTTCTCTGCTCGTATGCCGTCTTCTGCT | 3202 | 0.10680660266275997 | Illumina Single End Adapter 1 (95% over 22bp) |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CAACGTA | 20 | 7.035104E-4 | 45.0 | 38 |
| TCACGAC | 970 | 0.0 | 42.912373 | 25 |
| CGACGGT | 960 | 0.0 | 42.656246 | 28 |
| CGGTCTA | 970 | 0.0 | 41.75258 | 31 |
| GGCGATA | 910 | 0.0 | 40.796703 | 8 |
| CTCACGA | 1050 | 0.0 | 39.857143 | 24 |
| CGTTTTT | 7430 | 0.0 | 39.82167 | 1 |
| GCGATAC | 130 | 0.0 | 39.807693 | 9 |
| ATAGCGG | 425 | 0.0 | 39.705883 | 2 |
| CGTAAGG | 375 | 0.0 | 39.600002 | 2 |
| CTATGCG | 165 | 0.0 | 39.545452 | 1 |
| TATAGCG | 240 | 0.0 | 39.374996 | 1 |
| GCGATAT | 335 | 0.0 | 38.955223 | 9 |
| TTAGGGA | 5120 | 0.0 | 38.847656 | 4 |
| AGGGTAA | 1480 | 0.0 | 38.76689 | 6 |
| TCGGGTA | 105 | 0.0 | 38.571426 | 5 |
| GGGCGAT | 4125 | 0.0 | 38.4 | 7 |
| CGCATCG | 135 | 0.0 | 38.333336 | 21 |
| GACGGTC | 1070 | 0.0 | 38.271027 | 29 |
| TAAGGGA | 4585 | 0.0 | 38.227917 | 4 |