FastQC Report
Thu 26 May 2016
SRR522112_1.fastq.gz

Summary

[OK] Basic Statistics

Measure                             Value
Filename                            SRR522112_1.fastq.gz
File type                           Conventional base calls
Encoding                            Sanger / Illumina 1.9
Total Sequences                     36211672
Sequences flagged as poor quality   0
Sequence length                     50
%GC                                 43

[OK] Per base sequence quality

Per base quality graph

[OK] Per tile sequence quality

Per tile quality graph

[OK] Per sequence quality scores

Per Sequence quality graph

[FAIL] Per base sequence content

Per base sequence content

[OK] Per sequence GC content

Per sequence GC content graph

[OK] Per base N content

N content graph

[OK] Sequence Length Distribution

Sequence length distribution

[FAIL] Sequence Duplication Levels

Duplication level graph

[WARN] Overrepresented sequences

Sequence                                            Count   Percentage           Possible Source
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT  199837  0.5518579755168445   No Hit
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT  160221  0.44245678575681346  No Hit
TCTCTGAGCGGGCTGGCAAGGCAGACCGATATCGTATGCCGTCTTCTGCT  143597  0.39654893593424795  Illumina PCR Primer Index 1 (95% over 24bp)
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT  98590   0.272260281160174    No Hit
TCTCTGAGCGGGCTGGCAAGGCAGACCGATCTCGTATGCCGTCTTCTGCT  73082   0.20181890524138185  Illumina Paired End PCR Primer 2 (96% over 29bp)
ACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT  41967   0.11589357155339305  No Hit
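
The Percentage column in the table above appears to be each sequence's Count divided by Total Sequences (36211672), multiplied by 100. The short Python sketch below recomputes it under that assumption; the helper name `percent_of_total` is illustrative and is not part of FastQC.

```python
# Sketch: recompute the Percentage column as count / Total Sequences * 100.
# Assumption: this is how the reported percentages were derived.

TOTAL_SEQUENCES = 36211672  # "Total Sequences" from Basic Statistics


def percent_of_total(count, total=TOTAL_SEQUENCES):
    """Hypothetical helper (not a FastQC API): count as a percentage of total."""
    return 100.0 * count / total


# (count, reported percentage) copied from the Overrepresented sequences rows
reported_rows = [
    (199837, 0.5518579755168445),
    (160221, 0.44245678575681346),
    (143597, 0.39654893593424795),
    (98590, 0.272260281160174),
    (73082, 0.20181890524138185),
    (41967, 0.11589357155339305),
]

# Print computed and reported values side by side for comparison
for count, reported in reported_rows:
    computed = percent_of_total(count)
    print(f"{count:>7}  computed={computed:.6f}  reported={reported:.6f}")
```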

[OK] Adapter Content

Adapter graph

[FAIL] Kmer Content

Kmer graph

Sequence  Count  PValue  Obs/Exp Max  Max Obs/Exp Position
GGTATCA   89750  0.0     36.408028    1
TATGCCG   31035  0.0     36.240665    35
GACCGAT   32490  0.0     35.491283    24
TCGTATG   31955  0.0     35.471863    32
CGTATGC   32500  0.0     34.789997    33
ACCGATA   21600  0.0     34.463833    25
ATATCGT   21050  0.0     34.446304    29
TATCGTA   21055  0.0     34.430817    30
GATATCG   21495  0.0     34.36412     28
GCGGGCT   36200  0.0     34.344646    8
CGATATC   21550  0.0     34.28251     27
AGACCGA   35080  0.0     34.245876    23
CCGATAT   21620  0.0     34.13749     26
TGAGCGG   36985  0.0     33.825134    5
CCGTCTT   31970  0.0     33.810604    39
ATCGTAT   21885  0.0     33.72992     31
GCCGTCT   32180  0.0     33.54235     38
CGGGCTG   37215  0.0     33.449295    9
ATGCCGT   32690  0.0     33.43967     36
AGCGGGC   37780  0.0     33.04805     7