Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR2049447_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 5132925 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 42 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 19078 | 0.3716789160176702 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 18967 | 0.3695164063375171 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 11007 | 0.21443913558059002 | No Hit |
| GTGGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5526 | 0.10765791434708281 | No Hit |
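
The Percentage column appears to be each sequence's count as a share of Total Sequences. The following is a minimal sketch that recomputes it, assuming that reading of the column. The variable and function names are illustrative and are not part of FastQC; the numbers are copied from the tables above.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 5132925
COUNTS = [19078, 18967, 11007, 5526]


def percentage(count: int, total: int) -> float:
    """Return count as a share of total, in percent."""
    return 100.0 * count / total


# Assumption: Percentage = 100 * Count / Total Sequences.
per_sequence = [percentage(c, TOTAL_SEQUENCES) for c in COUNTS]
combined = percentage(sum(COUNTS), TOTAL_SEQUENCES)

for count, pct in zip(COUNTS, per_sequence):
    print(f"{count:>6}  {pct:.4f}%")
print(f"combined: {sum(COUNTS)} reads, {combined:.2f}%")
```

Taken together, the four listed sequences account for roughly 1.06% of all reads.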
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 11130 | 0.0 | 59.208405 | 1 |
| GTATCAA | 19820 | 0.0 | 39.874706 | 1 |
| GTGGTAT | 4465 | 0.0 | 37.846306 | 1 |
| TGGTATC | 4510 | 0.0 | 36.688343 | 2 |
| ATCAACG | 24775 | 0.0 | 31.60991 | 3 |
| TCAACGC | 25140 | 0.0 | 31.300562 | 4 |
| TATCAAC | 25550 | 0.0 | 30.761484 | 2 |
| CAACGCA | 26085 | 0.0 | 30.148592 | 5 |
| AACGCAG | 27140 | 0.0 | 29.075748 | 6 |
| ACGCAGA | 30435 | 0.0 | 25.804113 | 7 |
| CGCAGAG | 30710 | 0.0 | 25.4047 | 8 |
| GCGGTAT | 335 | 0.0 | 25.29171 | 1 |
| GCAGAGT | 33180 | 0.0 | 23.201889 | 9 |
| GTACATG | 15360 | 0.0 | 22.615993 | 1 |
| TACCTGG | 3825 | 0.0 | 21.998018 | 2 |
| GAGTACT | 24840 | 0.0 | 21.54107 | 12-13 |
| CAGAGTA | 32545 | 0.0 | 21.38011 | 10-11 |
| TACATGG | 15665 | 0.0 | 21.335419 | 2 |
| GTACCTG | 4350 | 0.0 | 20.992441 | 1 |
| ACATGGG | 15600 | 0.0 | 20.791529 | 3 |
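
Many of the flagged k-mers look like overlapping windows of the same read start seen in the Overrepresented sequences table. As a rough check, the sketch below tests each k-mer for membership in the leading 21 bases of the fourth overrepresented sequence (up to the start of its poly-T run); the other three listed sequences begin 2, 3, and 4 bases into that same stretch. The names `KMERS` and `SHARED_PREFIX` are illustrative, and substring matching is a simplification of this sketch, not how FastQC computes enrichment.

```python
# The 7-mers from the Kmer Content table, in table order.
KMERS = [
    "GGTATCA", "GTATCAA", "GTGGTAT", "TGGTATC", "ATCAACG",
    "TCAACGC", "TATCAAC", "CAACGCA", "AACGCAG", "ACGCAGA",
    "CGCAGAG", "GCGGTAT", "GCAGAGT", "GTACATG", "TACCTGG",
    "GAGTACT", "CAGAGTA", "TACATGG", "GTACCTG", "ACATGGG",
]

# Leading bases of the fourth overrepresented sequence, ending at the
# first T of its poly-T run (copied from the table, not from FastQC).
SHARED_PREFIX = "GTGGTATCAACGCAGAGTACT"

# Split k-mers by whether they occur anywhere inside the shared prefix.
from_prefix = [k for k in KMERS if k in SHARED_PREFIX]
other = [k for k in KMERS if k not in SHARED_PREFIX]

print(f"{len(from_prefix)} of {len(KMERS)} k-mers fall inside the shared prefix")
print("not explained by the prefix:", ", ".join(other))
```

Under this check, 14 of the 20 k-mers are windows of the shared prefix; the remaining six (GCGGTAT, GTACATG, TACCTGG, TACATGG, GTACCTG, ACATGGG) would need a separate explanation.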