Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR3550428_1.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 535849 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 51 |
| %GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2794 | 0.5214155480368536 | No Hit |
| CGCTGTCTCTTATACACATCTGACGCTGTGAGACTCGTATGCCGTCTTCTG | 706 | 0.1317535350443875 | No Hit |
| CCTGTCTCTTATACACATCTGACGCTGTGAGACTCGTATGCCGTCTTCTGC | 625 | 0.11663733626450735 | TruSeq Adapter, Index 15 (95% over 22bp) |
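
The Percentage column above appears to be each sequence's count divided by the Total Sequences value from Basic Statistics (535849), times 100. The sketch below recomputes it from the reported counts as a sanity check. The function name `percent_of_total` and the `ROWS` list are illustrative assumptions, not part of the FastQC output.

```python
def percent_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total (assumed formula: 100 * count / total)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * count / total


# Total Sequences, taken from the Basic Statistics table.
TOTAL_SEQUENCES = 535849

# (count, reported percentage) pairs copied from the Overrepresented sequences table.
ROWS = [
    (2794, 0.5214155480368536),
    (706, 0.1317535350443875),
    (625, 0.11663733626450735),
]

for count, reported in ROWS:
    computed = percent_of_total(count, TOTAL_SEQUENCES)
    # Print both values side by side so any mismatch is easy to spot.
    print(f"count={count:>5}  computed={computed:.6f}  reported={reported:.6f}")
```

If the computed and reported values agree, the percentages are relative to all reads in the file rather than to some subsample.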
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| AAGTACG | 25 | 3.888009E-5 | 45.0 | 1 |
| TGCCCGA | 20 | 7.0299115E-4 | 45.0 | 20 |
| AATTGCG | 25 | 3.888009E-5 | 45.0 | 1 |
| TACGAAT | 45 | 3.8380676E-10 | 45.0 | 12 |
| TTACGCG | 20 | 7.0299115E-4 | 45.0 | 1 |
| GCTACGA | 45 | 3.8380676E-10 | 45.0 | 10 |
| GTACGAG | 40 | 6.8030204E-9 | 45.0 | 1 |
| CGTTTTT | 1715 | 0.0 | 40.801746 | 1 |
| CCTCGAT | 50 | 1.0786607E-9 | 40.5 | 15 |
| GCTAGCG | 45 | 1.9250365E-8 | 40.0 | 1 |
| ATTGCGG | 120 | 0.0 | 39.375004 | 2 |
| AACACGT | 240 | 0.0 | 38.437504 | 41 |
| TCAAGCG | 270 | 0.0 | 38.333336 | 17 |
| AACGGGT | 60 | 1.546141E-10 | 37.500004 | 4 |
| AATACGG | 30 | 1.1391084E-4 | 37.500004 | 2 |
| AGTACGG | 90 | 0.0 | 37.5 | 2 |
| CACGACC | 275 | 0.0 | 36.818184 | 27 |
| CTACGAA | 55 | 2.743036E-9 | 36.81818 | 11 |
| GTACGGG | 215 | 0.0 | 36.627907 | 3 |
| TGCGGGA | 425 | 0.0 | 36.52941 | 4 |