FastQC Report
Thu 26 May 2016
SRR522072_1.fastq.gz

Summary

[OK] Basic Statistics

Measure                            Value
Filename                           SRR522072_1.fastq.gz
File type                          Conventional base calls
Encoding                           Sanger / Illumina 1.9
Total Sequences                    26586227
Sequences flagged as poor quality  0
Sequence length                    50
%GC                                45

[OK] Per base sequence quality

Per base quality graph

[FAIL] Per tile sequence quality

Per tile quality graph

[OK] Per sequence quality scores

Per Sequence quality graph

[FAIL] Per base sequence content

Per base sequence content

[OK] Per sequence GC content

Per sequence GC content graph

[OK] Per base N content

N content graph

[OK] Sequence Length Distribution

Sequence length distribution

[FAIL] Sequence Duplication Levels

Duplication level graph

[WARN] Overrepresented sequences

Sequence                                            Count   Percentage           Possible Source
TCTCTGAGCGGGCTGGCAAGGCAGACCGATATCGTATGCCGTCTTCTGCT  156829  0.5898881402013155   Illumina PCR Primer Index 1 (95% over 24bp)
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT  69757   0.2623802166437532   No Hit
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT  60380   0.22711007470146102  No Hit
TCTCTGAGCGGGCTGGCAAGGCAGACCGATCTCGTATGCCGTCTTCTGCT  46360   0.17437600303345036  Illumina Paired End PCR Primer 2 (96% over 29bp)
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT  37750   0.14199081351408005  No Hit
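
The Percentage values listed for the overrepresented sequences are consistent with each Count divided by the Total Sequences figure from Basic Statistics, expressed as a percentage. The sketch below recomputes them under that assumption; the function name percentage_of_total is hypothetical and not part of FastQC.

```python
# Total Sequences, copied from the Basic Statistics table of this report.
TOTAL_SEQUENCES = 26586227

# Counts copied from the Overrepresented sequences table, in report order.
COUNTS = [156829, 69757, 60380, 46360, 37750]


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (hypothetical helper, not a FastQC API)."""
    return 100.0 * count / total


if __name__ == "__main__":
    for count in COUNTS:
        # Print each count next to its recomputed share of all reads.
        print(count, percentage_of_total(count))
```

Taken together, the five listed sequences account for a little under 1.4% of all reads.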

[OK] Adapter Content

Adapter graph

[FAIL] Kmer Content

Kmer graph

Sequence  Count  PValue  Obs/Exp Max  Max Obs/Exp Position
TATGCCG   28365  0.0     34.39756     35
CCGTCTT   29575  0.0     33.451397    39
GCCGTCT   29695  0.0     33.360516    38
GGTATCA   30995  0.0     33.309258    1
ATCGTAT   22865  0.0     32.319542    31
TATCGTA   22725  0.0     32.281628    30
GACCGAT   30410  0.0     32.23529     24
ATATCGT   22810  0.0     32.167854    29
TCGTATG   30330  0.0     32.041813    32
CGTCTTC   30810  0.0     32.03865     40
CGTATGC   30450  0.0     32.00387     33
ACCGATA   23370  0.0     31.920319    25
ATGCCGT   30970  0.0     31.775373    36
CCGATAT   23705  0.0     31.443985    26
AGACCGA   31650  0.0     31.42412     23
TGCCGTC   31470  0.0     31.353477    37
GATATCG   23835  0.0     31.282772    28
CGATATC   23840  0.0     31.257816    27
GCGGGCT   33275  0.0     31.017756    8
TGAGCGG   33615  0.0     30.830374    5