Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1547222_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 736485 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 47 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 4660 | 0.6327352220343931 | No Hit |
| GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATA | 1466 | 0.1990536127687597 | No Hit |
| AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 993 | 0.13482962993136316 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| AAATCGT | 30 | 2.5264726E-6 | 44.000004 | 12 |
| ATTAGCG | 30 | 2.5264726E-6 | 44.000004 | 1 |
| TACTGGC | 30 | 2.5264726E-6 | 44.000004 | 3 |
| ACGTTAG | 25 | 4.441277E-5 | 44.0 | 1 |
| GACGTAG | 20 | 7.854638E-4 | 44.0 | 1 |
| AAGCGGT | 20 | 7.854638E-4 | 44.0 | 3 |
| ACCCTTA | 20 | 7.854638E-4 | 44.0 | 31 |
| ACGGGTA | 75 | 0.0 | 44.0 | 5 |
| TCTAGCG | 40 | 8.305506E-9 | 44.0 | 1 |
| TGCGCAA | 25 | 4.441277E-5 | 44.0 | 1 |
| TGCGAAC | 20 | 7.854638E-4 | 44.0 | 37 |
| TCATCGG | 20 | 7.854638E-4 | 44.0 | 2 |
| GTCCATA | 25 | 4.441277E-5 | 44.0 | 33 |
| TTACCTA | 20 | 7.854638E-4 | 44.0 | 40 |
| CGCATCG | 35 | 1.4451325E-7 | 43.999996 | 21 |
| GCTTGCG | 35 | 1.4451325E-7 | 43.999996 | 1 |
| GTCTCAC | 280 | 0.0 | 43.214283 | 22 |
| TCACGAC | 275 | 0.0 | 43.2 | 25 |
| CGGTCTA | 265 | 0.0 | 43.16981 | 31 |
| ACGGTCT | 265 | 0.0 | 43.16981 | 30 |