Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR3552020_1.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 525707 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 51 |
| %GC | 47 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5796 | 1.1025152794237094 | No Hit |
| GCTGTCTCTTATACACATCTGACGCACTAGCACTCGTATGCCGTCTTCTGC | 1341 | 0.25508505688529926 | TruSeq Adapter, Index 16 (95% over 21bp) |
| CGTTTTTTTTCTGTCTCTTATACACATCTGACGCACTAGCACTCGTATGCC | 1185 | 0.22541073259439193 | No Hit |
| GAATGATACCTGTCTCTTATACACATCTGACGCACTAGCACTCGTATGCCG | 1106 | 0.21038335042143247 | No Hit |
| GAATCTGTCTCTTATACACATCTGACGCACTAGCACTCGTATGCCGTCTTC | 1019 | 0.19383420802842646 | No Hit |
| GAATGATACGGCTGTCTCTTATACACATCTGACGCACTAGCACTCGTATGC | 965 | 0.18356232654311241 | No Hit |
| CTGTCTCTTATACACATCTGACGCACTAGCACTCGTATGCCGTCTTCTGCT | 838 | 0.1594043830498738 | TruSeq Adapter, Index 13 (95% over 22bp) |
| CCTGTCTCTTATACACATCTGACGCACTAGCACTCGTATGCCGTCTTCTGC | 645 | 0.1226919177412513 | TruSeq Adapter, Index 16 (95% over 21bp) |
| GATACTGAAGCTACGAATATACTGACTATGAAGACCTATGCTTTGATTCAT | 558 | 0.1061427753482453 | No Hit |
| TCTGTCTCTTATACACATCTGACGCACTAGCACTCGTATGCCGTCTTCTGC | 549 | 0.10443079510069296 | TruSeq Adapter, Index 16 (95% over 21bp) |
| GAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAA | 548 | 0.10424057507318715 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| ACGTTAG | 30 | 2.1632459E-6 | 45.000004 | 1 |
| TTATGCG | 25 | 3.887908E-5 | 45.0 | 1 |
| TAACGCG | 25 | 3.887908E-5 | 45.0 | 1 |
| CTACGAA | 90 | 0.0 | 45.0 | 11 |
| CGAATAT | 90 | 0.0 | 45.0 | 14 |
| TCATACG | 20 | 7.029791E-4 | 45.0 | 31 |
| TAGTGCG | 25 | 3.887908E-5 | 45.0 | 1 |
| TCGACTC | 20 | 7.029791E-4 | 45.0 | 28 |
| CGTCATA | 20 | 7.029791E-4 | 45.0 | 37 |
| TTCGACT | 20 | 7.029791E-4 | 45.0 | 27 |
| TTCGAAT | 20 | 7.029791E-4 | 45.0 | 16 |
| AGTACGC | 20 | 7.029791E-4 | 45.0 | 1 |
| GCTACGA | 95 | 0.0 | 42.63158 | 10 |
| TACGGGA | 235 | 0.0 | 41.17021 | 4 |
| CGTTTTT | 2280 | 0.0 | 40.55921 | 1 |
| GACCTAT | 85 | 0.0 | 39.705883 | 33 |
| AGGTACG | 40 | 3.4534423E-7 | 39.375 | 1 |
| GGATCGA | 40 | 3.4534423E-7 | 39.375 | 8 |
| AGTACGG | 115 | 0.0 | 39.130432 | 2 |
| TACGAAT | 110 | 0.0 | 38.863636 | 12 |