Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR3553114_1.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 961414 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 51 |
| %GC | 46 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CTGTCTCTTATACACATCTGACGCTAGGCAGATCGTATGCCGTCTTCTGCT | 4153 | 0.43196791392677863 | Illumina Single End Adapter 1 (95% over 22bp) |
| AATGATACGGCTGTCTCTTATACACATCTGACGCTAGGCAGATCGTATGCC | 2174 | 0.22612526965490412 | No Hit |
| AATGATACCTGTCTCTTATACACATCTGACGCTAGGCAGATCGTATGCCGT | 2008 | 0.2088590347134533 | No Hit |
| AATCTGTCTCTTATACACATCTGACGCTAGGCAGATCGTATGCCGTCTTCT | 1970 | 0.20490652310035012 | No Hit |
| CCTGTCTCTTATACACATCTGACGCTAGGCAGATCGTATGCCGTCTTCTGC | 1099 | 0.11431079638948465 | Illumina Single End Adapter 1 (95% over 21bp) |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GATCGCG | 35 | 1.211647E-7 | 45.000004 | 9 |
| CTATACG | 35 | 1.211647E-7 | 45.000004 | 1 |
| CGCGACG | 60 | 0.0 | 45.000004 | 1 |
| ACCGCAC | 20 | 7.03271E-4 | 45.0 | 10 |
| CCGTACG | 20 | 7.03271E-4 | 45.0 | 1 |
| ACGGAGT | 20 | 7.03271E-4 | 45.0 | 32 |
| TAGTGCG | 20 | 7.03271E-4 | 45.0 | 1 |
| TCGCACG | 45 | 1.927765E-8 | 40.0 | 1 |
| CGGGAAT | 1565 | 0.0 | 39.824284 | 6 |
| CCTCGCG | 40 | 3.457426E-7 | 39.375 | 1 |
| CTAACCG | 160 | 0.0 | 39.375 | 1 |
| AGGGCGT | 200 | 0.0 | 39.375 | 6 |
| TTACGGG | 460 | 0.0 | 38.641308 | 3 |
| AGCACGG | 1240 | 0.0 | 38.46774 | 2 |
| ATGCGGG | 395 | 0.0 | 38.16456 | 3 |
| ACTGCGG | 130 | 0.0 | 38.07692 | 2 |
| CGTACAG | 65 | 9.094947E-12 | 38.07692 | 1 |
| TAGACGG | 155 | 0.0 | 37.741936 | 2 |
| ACGGGAA | 1705 | 0.0 | 37.741936 | 5 |
| GCACGGG | 1525 | 0.0 | 37.62295 | 3 |