Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR3549905_1.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 977950 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 51 |
| %GC | 50 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2125 | 0.21729127255994685 | No Hit |
| GCTGTCTCTTATACACATCTGACGCCTTGTCGATCGTATGCCGTCTTCTGC | 1563 | 0.15982412188762204 | No Hit |
| CCTGTCTCTTATACACATCTGACGCCTTGTCGATCGTATGCCGTCTTCTGC | 1412 | 0.1443836596963035 | No Hit |
| CTGTCTCTTATACACATCTGACGCCTTGTCGATCGTATGCCGTCTTCTGCT | 995 | 0.10174344291630452 | Illumina Single End Adapter 2 (95% over 21bp) |
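The Percentage column appears to be each sequence's count as a share of Total Sequences (977950) from Basic Statistics. The sketch below checks that reading against the four rows above. It assumes the whole file was counted, and the names `TOTAL_SEQUENCES`, `ROWS`, and `percentage` are illustrative, not part of FastQC.

```python
# Assumption: Percentage = 100 * Count / Total Sequences.
TOTAL_SEQUENCES = 977950  # "Total Sequences" from Basic Statistics

# (count, reported percentage) pairs copied from the table above.
ROWS = [
    (2125, 0.21729127255994685),
    (1563, 0.15982412188762204),
    (1412, 0.1443836596963035),
    (995, 0.10174344291630452),
]


def percentage(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total."""
    return 100.0 * count / total


for count, reported in ROWS:
    computed = percentage(count)
    # Compare with a loose tolerance; only agreement is being checked.
    status = "ok" if abs(computed - reported) < 1e-6 else "mismatch"
    print(f"{count:>5}  computed={computed:.6f}  reported={reported:.6f}  {status}")
```

Under this assumption every row agrees with the reported value to within the tolerance used.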
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TCTAGCG | 40 | 6.8139343E-9 | 45.0 | 1 |
| CCCGTAG | 20 | 7.03277E-4 | 45.0 | 27 |
| GTTAACG | 20 | 7.03277E-4 | 45.0 | 1 |
| CGCAATA | 20 | 7.03277E-4 | 45.0 | 32 |
| CGGTCTA | 155 | 0.0 | 42.09677 | 31 |
| GCGTAAG | 95 | 0.0 | 40.26316 | 1 |
| AATAGCG | 40 | 3.457517E-7 | 39.375 | 1 |
| TCACGAC | 175 | 0.0 | 38.571426 | 25 |
| TATAGCG | 65 | 9.094947E-12 | 38.07692 | 1 |
| ATAACGG | 65 | 9.094947E-12 | 38.07692 | 2 |
| ATAGGGT | 220 | 0.0 | 37.840908 | 4 |
| CTCACGA | 185 | 0.0 | 37.7027 | 24 |
| CGTAAGG | 150 | 0.0 | 37.500004 | 2 |
| GGATCGA | 30 | 1.1397974E-4 | 37.499996 | 8 |
| CGCGAGG | 120 | 0.0 | 37.499996 | 2 |
| ATCGACG | 30 | 1.1397974E-4 | 37.499996 | 15 |
| CCCGAAT | 30 | 1.1397974E-4 | 37.499996 | 29 |
| CGTTAGG | 195 | 0.0 | 36.923077 | 2 |
| TACGGGA | 265 | 0.0 | 36.509434 | 4 |
| TAAGGGA | 715 | 0.0 | 36.503498 | 4 |