Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1547143_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 1674029 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 44 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 6464 | 0.38613429038565045 | No Hit |
| GACCAGAAAAATGGTCCTGCCAAGCGGCTACAGTGTTCTTCTTTCAGATA | 3926 | 0.2345240136222252 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TTTAGCG | 100 | 0.0 | 44.0 | 1 |
| GATCGGA | 20 | 7.857513E-4 | 44.0 | 9 |
| GCGCGTA | 25 | 4.4437147E-5 | 44.0 | 42 |
| TAGCGCG | 25 | 4.4437147E-5 | 44.0 | 1 |
| TATATCG | 20 | 7.857513E-4 | 44.0 | 28 |
| TTGTACG | 60 | 0.0 | 44.0 | 1 |
| TCGATCT | 20 | 7.857513E-4 | 44.0 | 40 |
| CCGTCGA | 25 | 4.4437147E-5 | 44.0 | 36 |
| CGTTATC | 20 | 7.857513E-4 | 44.0 | 44 |
| CCCGTTA | 25 | 4.4437147E-5 | 44.0 | 32 |
| CCCCGAT | 30 | 2.5284116E-6 | 44.0 | 40 |
| ACGTGCG | 35 | 1.4466059E-7 | 44.0 | 1 |
| TCGTATA | 20 | 7.857513E-4 | 44.0 | 13 |
| GCGTGTC | 25 | 4.4437147E-5 | 44.0 | 36 |
| CGGTATA | 35 | 1.4466059E-7 | 44.0 | 2 |
| TGTCGGT | 20 | 7.857513E-4 | 44.0 | 40 |
| TACGTCC | 20 | 7.857513E-4 | 44.0 | 21 |
| AATCGCG | 55 | 1.8189894E-12 | 44.0 | 1 |
| TACCGAT | 25 | 4.4437147E-5 | 44.0 | 38 |
| TATAGCG | 135 | 0.0 | 42.37037 | 1 |