Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR2049460_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 3372547 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 43 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 13107 | 0.38863802342858383 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 12763 | 0.3784380173204406 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 7379 | 0.21879606125578088 | No Hit |
| GTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 4312 | 0.12785589051835305 | No Hit |
| GTGGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 3619 | 0.10730762239933203 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 7105 | 0.0 | 58.503098 | 1 |
| GTATCAA | 12690 | 0.0 | 39.246983 | 1 |
| GTGGTAT | 2790 | 0.0 | 37.11938 | 1 |
| TATAACG | 330 | 0.0 | 34.18648 | 2 |
| TGGTATC | 3095 | 0.0 | 32.350132 | 2 |
| ATCAACG | 15600 | 0.0 | 31.638931 | 3 |
| TCAACGC | 15830 | 0.0 | 31.208935 | 4 |
| TATCAAC | 16235 | 0.0 | 30.430393 | 2 |
| CAACGCA | 16420 | 0.0 | 30.173422 | 5 |
| AACGCAG | 17055 | 0.0 | 29.238153 | 6 |
| ATAACGC | 430 | 0.0 | 27.32931 | 3 |
| ACGCAGA | 19195 | 0.0 | 25.905014 | 7 |
| CGCAGAG | 19410 | 0.0 | 25.642284 | 8 |
| GCAGAGT | 20925 | 0.0 | 23.471298 | 9 |
| CAGAGTA | 20405 | 0.0 | 21.754627 | 10-11 |
| TAACGCA | 620 | 0.0 | 21.228703 | 4 |
| GAGTACT | 16520 | 0.0 | 21.010035 | 12-13 |
| AGAGTAC | 19800 | 0.0 | 20.47294 | 10-11 |
| GTACATG | 8555 | 0.0 | 20.084225 | 1 |
| CTGTCGC | 930 | 0.0 | 19.203789 | 9 |