Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1512067_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 1106810 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 25 |
| %GC | 45 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GTATCAACGCAGAGTACTTTTTTTT | 3969 | 0.3585981333742919 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTT | 3076 | 0.27791581210867267 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTT | 2994 | 0.27050713311227764 | No Hit |
| ACGCAGAGTACTTTTTTTTTTTTTT | 1413 | 0.1276641880720268 | No Hit |
| GTACTTTTTTTTTTTTTTTTTTTTT | 1212 | 0.10950388955647311 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CCGTGCC | 65 | 5.4653057E-5 | 13.149843 | 9 |
| GGTATCA | 835 | 0.0 | 13.008425 | 1 |
| TAGACCT | 70 | 1.08747554E-4 | 12.217743 | 4 |
| GGCTAAT | 65 | 7.823898E-4 | 11.726893 | 1 |
| TAGACCA | 155 | 2.5465852E-11 | 11.648458 | 4 |
| GTAGACC | 165 | 8.0035534E-11 | 10.942491 | 3 |
| GCTAATA | 80 | 3.7589797E-4 | 10.690525 | 2 |
| CTAATAC | 80 | 3.7589797E-4 | 10.690525 | 3 |
| TTTAGCC | 110 | 6.0185994E-6 | 10.36657 | 3 |
| TAGGACC | 185 | 6.002665E-11 | 10.273178 | 4 |
| GTATAAA | 215 | 1.8189894E-12 | 10.192852 | 1 |
| GTAGAAC | 225 | 0.0 | 10.136202 | 3 |
| TGTACCG | 75 | 0.0026403326 | 10.136202 | 5 |
| GGTATAG | 85 | 6.4131804E-4 | 10.088576 | 1 |
| GCCTCGA | 85 | 6.608215E-4 | 10.055762 | 16 |
| AATCCCG | 95 | 1.6471547E-4 | 9.997408 | 19 |
| GCACTGT | 185 | 6.493792E-10 | 9.754229 | 6 |
| AGGACGT | 510 | 0.0 | 9.689016 | 5 |
| ATAGGGG | 80 | 0.0045031873 | 9.502689 | 3 |
| TAAGGTG | 110 | 6.80877E-5 | 9.502689 | 5 |