Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR4064166_1.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 625026 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 48 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 983 | 0.15727345742417115 | No Hit |
| GTACATGGGGTGGTATCAACGCAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 821 | 0.13135453565131688 | No Hit |
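
The Percentage column above appears to be each sequence's count divided by the Total Sequences value from Basic Statistics, multiplied by 100. The following minimal Python sketch recomputes it under that assumption. The names `TOTAL_SEQUENCES`, `overrepresented_counts`, and `percentage_of_total` are illustrative and not part of the report or of any tool's API.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 625026

overrepresented_counts = {
    "GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT": 983,
    "GTACATGGGGTGGTATCAACGCAAAAAAAAAAAAAAAAAAAAAAAAAAAA": 821,
}


def percentage_of_total(count, total):
    """Return count as a percentage of total (assumed formula: 100 * count / total)."""
    return 100.0 * count / total


# Recompute the Percentage column for each listed sequence.
percentages = {
    sequence: percentage_of_total(count, TOTAL_SEQUENCES)
    for sequence, count in overrepresented_counts.items()
}

for sequence, pct in percentages.items():
    print(f"{sequence}\t{pct:.4f}")
```

Both recomputed values agree with the table to the digits shown, which supports the assumed formula for this report.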
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| ATAGACG | 70 | 3.213548E-5 | 21.99876 | 3 |
| ATACCGT | 105 | 7.735798E-8 | 20.9512 | 6 |
| GCGAAGC | 55 | 0.0044806176 | 20.000473 | 26 |
| TAATACC | 55 | 0.0044826926 | 19.998873 | 4 |
| CGAAAAC | 90 | 1.00048655E-5 | 19.556019 | 23 |
| CCGTCGT | 90 | 1.0011781E-5 | 19.554455 | 9 |
| CGTCGTA | 90 | 1.0011781E-5 | 19.554455 | 10 |
| ACCGTCG | 90 | 1.0011781E-5 | 19.554455 | 8 |
| GTCCTAT | 135 | 2.59206E-9 | 19.554453 | 1 |
| CGGTCCG | 80 | 8.978094E-5 | 19.250456 | 26 |
| ACTGATC | 70 | 8.120405E-4 | 18.85608 | 8 |
| ACGGACC | 200 | 0.0 | 18.698946 | 8 |
| TACCGTC | 95 | 1.595873E-5 | 18.525272 | 7 |
| CGGACCA | 215 | 0.0 | 18.417566 | 9 |
| GTATTAT | 60 | 0.0074106054 | 18.3323 | 1 |
| ATAGATC | 60 | 0.0074106054 | 18.3323 | 3 |
| TCTGTAC | 60 | 0.0074106054 | 18.3323 | 3 |
| TATAGGG | 60 | 0.0074106054 | 18.3323 | 5 |
| GTATAGA | 120 | 3.1577656E-7 | 18.3323 | 1 |
| GATTTCG | 170 | 1.2551027E-10 | 18.118074 | 41 |