Next-Generation Sequencing Informatics

Below is a table with informatics and IT statistics for the major next-generation/massively parallel sequencing platforms. The information in the table is approximate and should only be used for general, informational purposes.

Next-Generation Sequencing Statistics

Notes:

  • Units: B – bytes, b – bases
  • PA is primary analysis (includes image feature extraction and base calling)
  • PA CPU is calculated as the wall clock multiplied by the number of CPU cores
  • ABI SOLiD data, except rate, are representative of a single slide
  • ABI SOLiD and Illumina GA IIx primary analysis is done on instrument
  • 454 paired-end reads vary in length depending on location of internal adapter
  • SRA is the size of the files (SFF, SRF, or FASTQ) that are submitted to the NCBI Short Read Archive

9 comments on this post.

  1. :

    [...] estimations of our disk space requirements in the next couple of years. I came across this table (Next-Generation Sequencing Informatics Statistics) and thought it would be [...]

  2. Next Generation Sequencing » Overview of NGS platforms:

    [...] David Dooling over at PolITiGenomics has compiled a nice table with some important metrics of the three most prevalent NGS platforms – go check it out! [...]

  3. Next-Generation Sequencing Statistics « ScienceRoll:

    [...] trackback Have you ever wondered about the typical run time of a next-generation genome sequencer? PolITiGenomics has the [...]

  4. :

    [...] PolITiGenomics Next-Generation Sequencing Informatics – Great overview of the current technologies.tags: science genomics bioinformatics [...]

  5. Managing and Analyzing Next-Generation Sequence Data « LabLynx LiMS Journal:

    [...] Dooling D2008 Dec 4. Next Generation Sequencing Informatics Table. https://politigenomics.com/next-gener​ation-sequencing-informatics . [...]

  6. Pathogens: Genes and Genomes » Do you store your image files?:

    [...] not sure that the storage cost argument is right. Referencing David Dooling’s just-updated next-generation informatics table you are looking at 2.8 terabytes of image data per GA2 run. Phil Green suggests these can be [...]

  7. Next Generation Sequencing » NGS platforms overview:

    [...] can find the original table here. As the original compilation, this work is openly available through an Attribution-Share Alike 3.0 [...]

  8. :

    [...] somewhat dated comparison of next-generation sequencing platforms on another blog, PolITiGenomics (which also examines newer platforms when they are [...]

  9. Managing and Analyzing Next-Generation Sequence Data | LIMSblog.com:

    [...] Dooling D2008 Dec 4. Next Generation Sequencing Informatics Table. https://politigenomics.com/next-gener​ation-sequencing-informatics . [...]

Leave a comment

Vendor: Roche Illumina ABI Technology: 454 Solexa GA SOLiD Platform: GS20 FLX Ti I II IIx 1 2 3 Reads: (M) 0.5 0.5 1.25 28 100 250 40 115 320 Fragment Read length: 100 200 400 35 50 100 25 35 50 Run time: (d) 0.25 0.3 0.4 3 3 5 6 5 8 Yield: (Gb) 0.05 0.1 0.5 1 5 25 1 4 16 Rate: (Gb/d) 0.2 0.33 1.25 0.33 1.67 5 0.34 1.6 4 Images: (TB) 0.01 0.01 0.03 0.5 1.1 2.8 1.8 2.5 1.9 PA Disk: (GB) 3 3 15 175 300 300 300 750 1200 PA CPU: (hr) 10 140 220 100 70 NA NA NA NA SRA: (GB) 0.5 1 4 30 50 2.5 100 140 600 Paired-end Read length: 200 400 2×35 2×50 2×100 2×25 2×35 2×50 Insert: (kb) 3.5 3.5 0.2 0.2 0.2 3 3 2 Run time: (d) 0.3 0.4 6 10 10 12 10 16 Yield: (Gb) 0.1 0.5 2 9 50 2 8 32 Rate: (Gb/d) 0.33 1.25 0.33 1.67 5 0.34 1.6 4 Images: (TB) 0.01 0.03 1 2.2 5.6 3.6 5 3.8 PA Disk: (GB) 3 15 350 500 550 600 1500 2400 PA CPU: (hr) 140 220 160 120 NA NA NA NA SRA: (GB) 1 4 60 100 3.5 200 280 1200