Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR4062398_1.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 836350 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 49 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| TCCATGTACTCTGCGTTGATACCACTGCTTCCATGTACTCTGCGTTGATA | 2186 | 0.26137382674717524 | No Hit |
| GTACATGGAAGCAGTGGTATCAACGCAGAGTACATGGAAGCAGTGGTATC | 1867 | 0.22323190052011718 | No Hit |
| GTCTTGCGCCGGTCCAAGAATTTCACCTCTAGCGGCGCAATACGAATGCC | 961 | 0.11490404734859808 | No Hit |
| GAGTACATGGAAGCAGTGGTATCAACGCAGAGTACATGGAAGCAGTGGTA | 951 | 0.11370837568003826 | No Hit |
| CATGTACTCTGCGTTGATACCACTGCTTCCATGTACTCTGCGTTGATACC | 849 | 0.10151252466072816 | No Hit |
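
The Percentage column above can be cross-checked against the Total Sequences value in Basic Statistics. The following is a minimal sketch, assuming the percentage is simply Count divided by Total Sequences, times 100. The function and variable names are illustrative and are not part of the FastQC output.

```python
# Assumption: Percentage = Count / Total Sequences * 100.
TOTAL_SEQUENCES = 836350  # "Total Sequences" from the Basic Statistics table


def overrepresented_percentage(count, total=TOTAL_SEQUENCES):
    """Return the share of all reads, in percent, taken up by one sequence."""
    return count / total * 100


# Counts copied from the Overrepresented sequences table, in row order.
counts = [2186, 1867, 961, 951, 849]

for count in counts:
    print(count, round(overrepresented_percentage(count), 4))
```

Under this assumption the recomputed values agree with the reported column, which suggests the percentages are relative to all 836350 reads in the file.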
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TAGGACG | 195 | 0.0 | 23.692211 | 4 |
| ATCGGGG | 50 | 0.0025798401 | 21.99991 | 24 |
| CCGTCGT | 165 | 0.0 | 21.333246 | 9 |
| GGACGTG | 200 | 0.0 | 20.899914 | 6 |
| CGTGATT | 95 | 6.8613735E-7 | 20.84202 | 28 |
| CAACGTA | 65 | 4.930552E-4 | 20.30761 | 20 |
| ATCGTTT | 230 | 0.0 | 20.086874 | 29 |
| GTGTATC | 55 | 0.0044779307 | 20.003506 | 1 |
| ATACCGT | 165 | 3.6379788E-12 | 19.99992 | 6 |
| GTCGGAA | 210 | 0.0 | 19.904682 | 39 |
| GATATAC | 260 | 0.0 | 19.46495 | 1 |
| TACCGTC | 170 | 5.456968E-12 | 19.411686 | 7 |
| CATCGTT | 230 | 0.0 | 19.130358 | 28 |
| CGTCGTA | 185 | 1.8189894E-12 | 19.026949 | 10 |
| TCGTTTA | 245 | 0.0 | 18.857065 | 30 |
| GATTTCG | 95 | 1.596207E-5 | 18.52624 | 41 |
| GTCTTAG | 145 | 6.2846084E-9 | 18.210087 | 1 |
| CGCAATA | 230 | 0.0 | 18.17384 | 36 |
| ACCGTCG | 195 | 1.8189894E-12 | 18.051208 | 8 |
| GTGTTAT | 135 | 5.5084456E-8 | 17.929068 | 1 |