Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1547156_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 342885 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 44 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1603 | 0.46750368199250475 | No Hit |
| CGTTTTTATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 985 | 0.2872683261151698 | No Hit |
| GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATA | 616 | 0.17965206993598437 | No Hit |
| GATACTGAAGCTACGAATATACTGACTATGAAGACCTATGCTTTGATTCA | 359 | 0.10469982647243245 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGGTAGT | 20 | 7.8487437E-4 | 44.000004 | 7 |
| ATATGCG | 20 | 7.8487437E-4 | 44.000004 | 1 |
| CGTTAGA | 20 | 7.8487437E-4 | 44.000004 | 2 |
| CGCTCCC | 20 | 7.8487437E-4 | 44.000004 | 19 |
| TATCTAG | 20 | 7.8487437E-4 | 44.000004 | 1 |
| AGCGATA | 65 | 0.0 | 44.000004 | 8 |
| GCGATAC | 40 | 8.283678E-9 | 44.000004 | 9 |
| ATGGTCG | 20 | 7.8487437E-4 | 44.000004 | 44 |
| TATACGA | 20 | 7.8487437E-4 | 44.000004 | 2 |
| ACGAGCC | 20 | 7.8487437E-4 | 44.000004 | 5 |
| CCGTTGG | 20 | 7.8487437E-4 | 44.000004 | 1 |
| ACCGCTC | 35 | 1.442113E-7 | 44.0 | 18 |
| TGAACGC | 25 | 4.4362845E-5 | 44.0 | 29 |
| AAGCGAC | 30 | 2.5225017E-6 | 44.0 | 15 |
| CGCGACC | 90 | 0.0 | 44.0 | 10 |
| TATGCGA | 25 | 4.4362845E-5 | 44.0 | 2 |
| GTAGACG | 25 | 4.4362845E-5 | 44.0 | 1 |
| CTACGGG | 30 | 2.5225017E-6 | 44.0 | 3 |
| TCGCTAT | 30 | 2.5225017E-6 | 44.0 | 23 |
| AGACCGA | 100 | 0.0 | 44.0 | 8 |