Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR3550487_1.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 909373 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 51 |
| %GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5233 | 0.575451437418969 | No Hit |
| GCTGTCTCTTATACACATCTGACGCATCGGAACTCGTATGCCGTCTTCTGC | 1406 | 0.15461202388898726 | Illumina Single End Adapter 2 (95% over 22bp) |
| CCTGTCTCTTATACACATCTGACGCATCGGAACTCGTATGCCGTCTTCTGC | 1341 | 0.14746424184575527 | RNA PCR Primer, Index 30 (95% over 24bp) |
| CTGTCTCTTATACACATCTGACGCATCGGAACTCGTATGCCGTCTTCTGCT | 1132 | 0.12448137342982472 | RNA PCR Primer, Index 30 (96% over 25bp) |
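
The Percentage column above appears to be each sequence's count as a share of the Total Sequences value in Basic Statistics (909373). The following is a minimal sketch that recomputes it under that assumption. The names `TOTAL_SEQUENCES`, `overrepresented_counts`, and `percentage_of_total` are hypothetical and are not part of the report or of any tool's API.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 909373

overrepresented_counts = {
    "CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 5233,
    "GCTGTCTCTTATACACATCTGACGCATCGGAACTCGTATGCCGTCTTCTGC": 1406,
    "CCTGTCTCTTATACACATCTGACGCATCGGAACTCGTATGCCGTCTTCTGC": 1341,
    "CTGTCTCTTATACACATCTGACGCATCGGAACTCGTATGCCGTCTTCTGCT": 1132,
}


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed definition of the column)."""
    return 100.0 * count / total


for sequence, count in overrepresented_counts.items():
    # Print a shortened sequence prefix, the raw count, and the recomputed share.
    print(f"{sequence[:12]}...  {count:>5}  {percentage_of_total(count):.4f}%")
```

Comparing the recomputed values against the table is a quick consistency check that the counts and the total come from the same run.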
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CTATGCG | 40 | 6.8121153E-9 | 45.000004 | 1 |
| GCGGTAT | 20 | 7.0325076E-4 | 45.000004 | 16 |
| CGGTCTA | 35 | 1.2115561E-7 | 45.0 | 31 |
| TGCGACG | 30 | 2.1649994E-6 | 44.999996 | 1 |
| CGAATAT | 165 | 0.0 | 43.636368 | 14 |
| CTACGAA | 175 | 0.0 | 41.142857 | 11 |
| CGTTTTT | 2355 | 0.0 | 40.414013 | 1 |
| TACGAAT | 180 | 0.0 | 40.0 | 12 |
| ATAGACG | 40 | 3.457153E-7 | 39.375004 | 1 |
| TCGTAAG | 40 | 3.457153E-7 | 39.375004 | 1 |
| GCTACGA | 185 | 0.0 | 38.91892 | 10 |
| CGTAAGG | 110 | 0.0 | 38.863636 | 2 |
| GCACGTT | 35 | 6.2468516E-6 | 38.571426 | 10 |
| TATTGCG | 70 | 0.0 | 38.571426 | 1 |
| CTAACGG | 70 | 0.0 | 38.571426 | 2 |
| CGCACGG | 95 | 0.0 | 37.894737 | 2 |
| CGATTAT | 30 | 1.13973445E-4 | 37.499996 | 10 |
| TACTAAC | 30 | 1.13973445E-4 | 37.499996 | 25 |
| TATGCGG | 140 | 0.0 | 36.964283 | 2 |
| GCGTAAG | 55 | 2.748493E-9 | 36.81818 | 1 |