Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR4050235_1.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 3931224 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 25 |
| %GC | 48 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTT | 18425 | 0.46868354487050345 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTT | 12304 | 0.3129814022299416 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTT | 6726 | 0.1710917515766082 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTT | 5728 | 0.14570525617466723 | No Hit |
| GTACTTTTTTTTTTTTTTTTTTTTT | 5373 | 0.1366749897741772 | No Hit |
| GAGTACTTTTTTTTTTTTTTTTTTT | 4153 | 0.10564139820066218 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 2165 | 0.0 | 15.979414 | 1 |
| GTATCAA | 7235 | 0.0 | 13.475635 | 1 |
| TAGATCG | 110 | 3.8100552E-8 | 12.093192 | 5 |
| TTAACCG | 65 | 8.0155017E-4 | 11.694516 | 5 |
| CGTCTTA | 75 | 2.0793203E-4 | 11.397656 | 15 |
| AACCGCG | 270 | 0.0 | 10.553117 | 7 |
| TCGTTAA | 290 | 0.0 | 10.48087 | 12 |
| GCGTTAT | 175 | 2.237357E-10 | 10.347315 | 1 |
| CGTCGTA | 340 | 0.0 | 10.336504 | 10 |
| AGAACCG | 425 | 0.0 | 10.284295 | 5 |
| ATACCGT | 440 | 0.0 | 10.144996 | 6 |
| CCGTCCG | 95 | 1.6480571E-4 | 9.998325 | 9 |
| CGCGTAT | 95 | 1.6491153E-4 | 9.99769 | 7 |
| ACCGTCC | 155 | 4.083995E-8 | 9.804186 | 8 |
| CGACCAT | 375 | 0.0 | 9.625054 | 10 |
| GAACCGC | 420 | 0.0 | 9.497442 | 6 |
| GCGGTAA | 130 | 4.267098E-6 | 9.495751 | 18 |
| CGCCAGT | 420 | 0.0 | 9.495751 | 18 |
| CGGTTCT | 525 | 0.0 | 9.407829 | 12 |
| ATAACGC | 455 | 0.0 | 9.396183 | 3 |