Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1544747_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 1201583 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 52 |
| %GC | 44 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 5572 | 0.46372160724644074 | No Hit |
| AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 3081 | 0.256411750166239 | No Hit |
| CTGTCTCTTATACACATCTGACGCGGGAATATTCGTATGCCGTCTTCTGCTT | 1669 | 0.13890010095016325 | Illumina Single End Adapter 2 (95% over 21bp) |
| CCTGTCTCTTATACACATCTGACGCGGGAATATTCGTATGCCGTCTTCTGCT | 1386 | 0.115347836978386 | No Hit |
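
The Percentage column above appears to be Count divided by Total Sequences (1201583, from Basic Statistics), times 100. The sketch below recomputes it for the four rows as a sanity check. The helper name `percent_of_total` and the `REPORTED` list are illustrative and are not part of the report or of any tool's API.

```python
import math

# "Total Sequences" value copied from the Basic Statistics table.
TOTAL_SEQUENCES = 1201583

# (Count, reported Percentage) pairs copied from the Overrepresented sequences table.
REPORTED = [
    (5572, 0.46372160724644074),
    (3081, 0.256411750166239),
    (1669, 0.13890010095016325),
    (1386, 0.115347836978386),
]


def percent_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (hypothetical helper)."""
    return 100.0 * count / total


for count, reported in REPORTED:
    computed = percent_of_total(count)
    # Compare with a relative tolerance, since the last digits depend on
    # floating-point evaluation order.
    agrees = math.isclose(computed, reported, rel_tol=1e-9)
    print(f"{count:>5}  computed={computed:.6f}  reported={reported:.6f}  agrees={agrees}")
```

Together the four sequences account for roughly 1% of all reads, so each is individually rare even though it is flagged.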
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGTTGAT | 340 | 0.0 | 46.000004 | 25 |
| AACGTAT | 40 | 5.6115823E-9 | 46.0 | 10 |
| AACCGGC | 20 | 6.3115696E-4 | 46.0 | 1 |
| TCGCATA | 20 | 6.3115696E-4 | 46.0 | 29 |
| ATCTCGA | 20 | 6.3115696E-4 | 46.0 | 10 |
| CGTATTA | 30 | 1.8614737E-6 | 46.0 | 25 |
| CATGCGT | 30 | 1.8614737E-6 | 46.0 | 36 |
| CTATCGG | 30 | 1.8614737E-6 | 46.0 | 2 |
| GCGAAGT | 20 | 6.3115696E-4 | 46.0 | 18 |
| GACCGAA | 30 | 1.8614737E-6 | 46.0 | 14 |
| GCACGTT | 30 | 1.8614737E-6 | 46.0 | 31 |
| AAGCGTA | 25 | 3.4172495E-5 | 46.0 | 40 |
| TGAGTCG | 30 | 1.8614737E-6 | 46.0 | 1 |
| CCTAACG | 20 | 6.3115696E-4 | 46.0 | 19 |
| ATCGTTA | 20 | 6.3115696E-4 | 46.0 | 43 |
| CGTTATG | 40 | 5.6115823E-9 | 46.0 | 41 |
| CCCGTCA | 30 | 1.8614737E-6 | 46.0 | 28 |
| CCCGTAT | 30 | 1.8614737E-6 | 46.0 | 23 |
| CCCCGTA | 30 | 1.8614737E-6 | 46.0 | 22 |
| CCCGTAG | 20 | 6.3115696E-4 | 46.0 | 42 |