Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR2079144_1.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 18095794 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 75 |
| %GC | 48 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGCTCATTATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAA | 80769 | 0.44634128792580197 | TruSeq Adapter, Index 2 (97% over 37bp) |
| ATCGGAAGAGCACACGTCTGAACTCCAGTCACCGCTCATTATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAA | 23680 | 0.13085913776427827 | TruSeq Adapter, Index 2 (97% over 36bp) |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TATGCCG | 14370 | 0.0 | 47.83069 | 48 |
| TCGTATG | 15015 | 0.0 | 45.91231 | 45 |
| TATCTCG | 14960 | 0.0 | 45.571068 | 41 |
| CGTATGC | 15230 | 0.0 | 45.151455 | 46 |
| CTCGTAT | 16135 | 0.0 | 42.53204 | 44 |
| ATGCCGT | 16245 | 0.0 | 42.246613 | 49 |
| TGCCGTC | 17050 | 0.0 | 40.494698 | 50 |
| GTATGCC | 17160 | 0.0 | 40.416443 | 47 |
| TCTCGTA | 16945 | 0.0 | 40.29375 | 43 |
| ACCGCTC | 17195 | 0.0 | 40.092937 | 32 |
| GCCGTCT | 17205 | 0.0 | 39.86776 | 51 |
| TCACCGC | 17680 | 0.0 | 39.168877 | 30 |
| ATCTCGT | 17845 | 0.0 | 38.300213 | 42 |
| CGCTCAT | 18065 | 0.0 | 38.181175 | 34 |
| CACCGCT | 18325 | 0.0 | 37.73376 | 31 |
| CCGTCTT | 18515 | 0.0 | 37.158733 | 52 |
| GTCACCG | 18770 | 0.0 | 36.87622 | 29 |
| CCGCTCA | 19100 | 0.0 | 36.148308 | 33 |
| TTATCTC | 19350 | 0.0 | 35.498734 | 40 |
| CATTATC | 20100 | 0.0 | 34.173397 | 38 |