Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR2049487_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 6609529 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 43 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 19127 | 0.28938521943091555 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 19107 | 0.28908262600860063 | No Hit |
| TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 11205 | 0.16952796485195845 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 12150 | 0.0 | 56.732292 | 1 |
| GTATCAA | 21830 | 0.0 | 40.97938 | 1 |
| GTGGTAT | 5030 | 0.0 | 33.510475 | 1 |
| ATCAACG | 27555 | 0.0 | 32.141117 | 3 |
| TCAACGC | 27990 | 0.0 | 31.692043 | 4 |
| TATCAAC | 28265 | 0.0 | 31.350403 | 2 |
| CAACGCA | 28930 | 0.0 | 30.688545 | 5 |
| TGGTATC | 5380 | 0.0 | 30.002237 | 2 |
| AACGCAG | 30260 | 0.0 | 29.416698 | 6 |
| ACGCAGA | 34300 | 0.0 | 25.882582 | 7 |
| CGCAGAG | 34600 | 0.0 | 25.549501 | 8 |
| GCAGAGT | 37250 | 0.0 | 23.49217 | 9 |
| GTACATG | 16350 | 0.0 | 22.980036 | 1 |
| TACATGG | 16640 | 0.0 | 21.776054 | 2 |
| GAGTACT | 28045 | 0.0 | 21.726353 | 12-13 |
| CAGAGTA | 36785 | 0.0 | 20.831465 | 10-11 |
| ACATGGG | 16835 | 0.0 | 20.713184 | 3 |
| AGAGTAC | 34920 | 0.0 | 20.67893 | 10-11 |
| AGTACTT | 29415 | 0.0 | 19.667944 | 12-13 |
| GTACTTT | 31180 | 0.0 | 19.29317 | 14-15 |