MaSuRCA assembler

MaSuRCA is whole genome assembly software. It combines the efficiency of the de Bruijn graph and Overlap-Layout-Consensus (OLC) approaches. MaSuRCA can assemble data sets containing only short reads from Illumina sequencing or a mixture of short reads and long reads (Sanger, 454).
If you use MaSuRCA in your work, please cite our paper:

Zimin, A. et al. The MaSuRCA genome Assembler. Bioinformatics (2013). doi:10.1093/bioinformatics/btt476
To obtain the source code of MaSuRCA and compilation instructions, see this page.

Genomes assembled with MaSuRCA

  • Loblolly pine, Pinus taeda, 22Gb genome, joint with David Neale and Chuck Langley, UC-Davis, and Pinerefseq consortium. Illumina data.
  • Indian cow, Bos indicus, joint with USDA-ARS, 454/Illumina mixed data.
  • Rhesus macaque, Macaca mulatta, joint with Robert Norgren, University of Nebraska. Sanger/Illumina mixed data.
  • Water Buffalo, Bubalus bubalus, joint with USDA-ARS and CASPUR, Italy. 454/Illumina mixed data.
  • Domestic cat, Felis felis, joint with Wes Warren, Washington University, St. Louis. Sanger/454/Illumina mixed data.
  • American bison, Bison bison. 454/Illumina mixed data.
  • Philippine tarsier, Tarsius syrichta, joint with Wes Warren, Washington University, St. Louis. Sanger/Illumina mixed data.
  • Cardiocondyla ant, 454/Illumina mixed data.
  • Fire ant, joint with Sasha Mikheleev, OIST, Japan. 454/Illumina mixed data.
  • Stalk-eyed fly, Teleopsis dalmanni, joint with Gerry Wilkinson, University of Maryland. 454/Illumina mixed data.