Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR4050258_1.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 2803752 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 25 |
| %GC | 45 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTT | 17259 | 0.6155679960281794 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTT | 11814 | 0.4213639437439545 | No Hit |
| GTACTTTTTTTTTTTTTTTTTTTTT | 6715 | 0.2395004979042369 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTT | 6540 | 0.23325886169675492 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTT | 5402 | 0.192670393101815 | No Hit |
| GAGTACTTTTTTTTTTTTTTTTTTT | 4633 | 0.1652428602815085 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 1955 | 0.0 | 15.9119835 | 1 |
| GTATCAA | 5855 | 0.0 | 12.793703 | 1 |
| AACCGCG | 120 | 7.512426E-10 | 12.656822 | 7 |
| CCGACCA | 220 | 0.0 | 12.513664 | 9 |
| CGGTCTA | 70 | 1.09890534E-4 | 12.205445 | 10 |
| CGACCAT | 225 | 0.0 | 11.813666 | 10 |
| GTCTAAT | 240 | 0.0 | 11.53029 | 1 |
| CGAACGA | 160 | 4.5474735E-11 | 11.272884 | 16 |
| TATAACG | 120 | 1.2521014E-7 | 11.098867 | 2 |
| AGAACCG | 190 | 0.0 | 11.014828 | 5 |
| CGTCGTA | 165 | 8.185452E-11 | 10.931477 | 10 |
| CGGACGG | 70 | 0.0014753768 | 10.871777 | 5 |
| ATACCGT | 185 | 5.456968E-12 | 10.779633 | 6 |
| CGAGCTC | 185 | 5.456968E-12 | 10.77598 | 10 |
| GCGTTAT | 125 | 2.1090091E-7 | 10.6873865 | 1 |
| ACTGTTC | 1140 | 0.0 | 10.658947 | 8 |
| AATCGCT | 190 | 9.094947E-12 | 10.492401 | 15 |
| AGGCCCG | 245 | 0.0 | 10.46181 | 10 |
| CGAGCCG | 220 | 0.0 | 10.356135 | 15 |
| GATATAC | 1115 | 0.0 | 10.184169 | 1 |