Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR1544752_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Illumina 1.5 |
| Total Sequences | 2295423 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 52 |
| %GC | 42 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 9263 | 0.40354217937173237 | No Hit |
| AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 5493 | 0.23930229853059765 | No Hit |
| CTGTCTCTTATACACATCTGACGCTAAGTCCTTCGTATGCCGTCTTCTGCTT | 3130 | 0.13635830955775907 | Illumina Single End Adapter 2 (95% over 21bp) |
| TTCAAAGGGACCTAATCGGAGGAGCTACTCTAGTATTAATAAATATTAGCCC | 2657 | 0.11575208578113924 | No Hit |
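
The Percentage column above appears to be each sequence's Count expressed as a share of the Total Sequences value (2295423) from Basic Statistics. The following is a minimal sketch that checks this arithmetic against the reported rows. The names `TOTAL_SEQUENCES`, `REPORTED_ROWS`, and `percentage` are hypothetical and chosen only for illustration; they are not part of FastQC.

```python
# Sketch: check that Percentage = 100 * Count / Total Sequences.
# All names here are illustrative assumptions, not FastQC identifiers.

TOTAL_SEQUENCES = 2295423  # "Total Sequences" from the Basic Statistics table

# (Count, reported Percentage) pairs copied from the Overrepresented sequences table.
REPORTED_ROWS = [
    (9263, 0.40354217937173237),
    (5493, 0.23930229853059765),
    (3130, 0.13635830955775907),
    (2657, 0.11575208578113924),
]


def percentage(count: int, total: int = TOTAL_SEQUENCES) -> float:
    """Return count as a percentage of the total number of sequences."""
    return 100.0 * count / total


for count, reported in REPORTED_ROWS:
    computed = percentage(count)
    # Compare with a small tolerance rather than exact float equality.
    matches = abs(computed - reported) < 1e-9
    print(f"count={count} computed={computed:.6f} reported={reported:.6f} match={matches}")
```

Under this reading, the poly-T and poly-A rows together account for well under 1% of the reads in this file.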
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| CGATGCG | 35 | 1.020162E-7 | 46.000004 | 10 |
| TTTACGC | 25 | 3.418229E-5 | 46.0 | 23 |
| CGATTCG | 80 | 0.0 | 46.0 | 10 |
| TATGCGT | 40 | 5.6152203E-9 | 46.0 | 20 |
| CCCAACG | 30 | 1.8622213E-6 | 46.0 | 23 |
| GTGTCGA | 25 | 3.418229E-5 | 46.0 | 33 |
| CGAATGT | 50 | 1.6370905E-11 | 46.0 | 18 |
| TATCGCC | 20 | 6.3127757E-4 | 46.0 | 34 |
| TAGTCCG | 20 | 6.3127757E-4 | 46.0 | 15 |
| CGAAGTA | 30 | 1.8622213E-6 | 46.0 | 39 |
| TGCGTAG | 165 | 0.0 | 44.606064 | 1 |
| TTACCGG | 120 | 0.0 | 44.083332 | 2 |
| AATACGG | 420 | 0.0 | 43.261906 | 2 |
| ACGCGAG | 70 | 0.0 | 42.714287 | 1 |
| TACGGGT | 240 | 0.0 | 42.166668 | 4 |
| TAACGCG | 60 | 1.8189894E-12 | 42.166668 | 1 |
| CGACGGT | 290 | 0.0 | 42.034485 | 28 |
| ACGGGAT | 1285 | 0.0 | 41.88327 | 5 |
| TGTGACG | 105 | 0.0 | 41.619045 | 1 |
| TACGGGA | 1000 | 0.0 | 41.4 | 4 |