Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR3128576.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 3148010 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 33556 | 1.0659432466860015 | No Hit |
| CTGTCTCTTATACACATCTGACGCAAGTCTTCTCGTATGCCGTCTTCTGC | 4688 | 0.14891947611348122 | TruSeq Adapter, Index 23 (96% over 25bp) |
| CCTGTCTCTTATACACATCTGACGCAAGTCTTCTCGTATGCCGTCTTCTG | 4346 | 0.13805546996356427 | Illumina PCR Primer Index 7 (95% over 22bp) |
| GCTGTCTCTTATACACATCTGACGCAAGTCTTCTCGTATGCCGTCTTCTG | 4047 | 0.12855740610735036 | Illumina PCR Primer Index 7 (95% over 22bp) |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGTTTTT | 10170 | 0.0 | 85.31653 | 1 |
| ACGGGTA | 350 | 0.0 | 77.88295 | 4 |
| ATAGGGC | 1815 | 0.0 | 73.799034 | 3 |
| TAGGGCA | 1685 | 0.0 | 72.519684 | 4 |
| GACCGAT | 295 | 0.0 | 71.69237 | 8 |
| ATAGCGG | 680 | 0.0 | 70.58045 | 1 |
| GGTAAGG | 1905 | 0.0 | 70.14821 | 1 |
| ACGGGAT | 925 | 0.0 | 70.11643 | 4 |
| TATAGGG | 2650 | 0.0 | 69.35352 | 2 |
| CGTAGGG | 895 | 0.0 | 69.32478 | 2 |
| TAGCGGG | 1745 | 0.0 | 68.9576 | 2 |
| TAGAGGG | 5780 | 0.0 | 68.71736 | 2 |
| AGGGCAT | 2565 | 0.0 | 68.34455 | 5 |
| TAAGGGA | 3050 | 0.0 | 67.64678 | 3 |
| AGTAGGG | 5285 | 0.0 | 67.32689 | 2 |
| GAATAGG | 1980 | 0.0 | 67.253426 | 1 |
| AGGGTAC | 1545 | 0.0 | 67.22739 | 5 |
| GAGGGAT | 3800 | 0.0 | 66.91078 | 4 |
| GTAGGGC | 1485 | 0.0 | 66.77878 | 3 |
| AGGGAAT | 3615 | 0.0 | 66.694725 | 5 |