Please use this identifier to cite or link to this item:
Scopus Web of Science® Altmetric
Type: Journal article
Title: Genome assembly and transcriptome resource for river buffalo, Bubalus bubalis (2n = 50)
Author: Williams, J.
Iamartino, D.
Pruitt, K.
Sonstegard, T.
Smith, T.
Low, W.
Biagini, T.
Bomba, L.
Capomaccio, S.
Castiglioni, B.
Coletta, A.
Corrado, F.
Ferré, F.
Iannuzzi, L.
Lawley, C.
Macciotta, N.
McClure, M.
Mancini, G.
Matassino, D.
Mazza, R.
et al.
Citation: GigaScience, 2017; 6(10):1-6
Publisher: Oxford University Press
Issue Date: 2017
ISSN: 2047-217X
Statement of
John L. Williams, Daniela Iamartino, Kim D. Pruitt, Tad Sonstegard, Timothy P.L. Smith, Wai Yee Low, Tommaso Biagini, Lorenzo Bomba, Stefano Capomaccio, Bianca Castiglioni, Angelo Coletta, Federica Corrado, Fabrizio Ferré, Leopoldo Iannuzzi, Cynthia Lawley, Nicolò Macciotta, Matthew McClure, Giordano Mancini, Donato Matassino, Raffaele Mazza, Marco Milanesi, Bianca Moioli, Nicola Morandi, Luigi Ramunno, Vincenzo Peretti, Fabio Pilla, Paola Ramelli, Steven Schroeder, Francesco Strozzi, Francoise Thibaud-Nissen, Luigi Zicarelli, Paolo Ajmone-Marsan, Alessio Valentini, Giovanni Chillemi, and Aleksey Zimin
Abstract: Water buffalo is a globally important species for agriculture and local economies. A de novo assembled, well annotated, reference sequence for the water buffalo is an important prerequisite for studying the biology of this species, and necessary to manage genetic diversity and to use modern breeding and genomic selection techniques. However, no such genome assembly has been previously reported. There are two species of domestic water buffalo, the river (2n = 50) and the swamp (2n = 48) buffalo. Here we describe a draft quality reference sequence for the river buffalo created from Illumina GA and Roche 454 short read sequences using the MaSuRCA assembler. The assembled sequence is 2.83 Gb, consisting of 366,983 scaffolds with a scaffold N50 of 1.41 Mb and contig N50 of 21,398 bp. Annotation of the genome was supported by transcriptome data from 30 tissues, and identified 21,711 predicted protein coding genes. Searches for complete mammalian BUSCO gene groups found 98.6% of curated single copy orthologs present among predicted genes, which suggests a high level of completeness of the genome. The annotated sequence is available from NCBI at accession GCA_000471725.1.
Keywords: Water buffalo; genome assembly; transcriptome; annotation
Rights: © The Author 2017. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
RMID: 0030075951
DOI: 10.1093/gigascience/gix088
Appears in Collections:Animal and Veterinary Sciences publications

Files in This Item:
File Description SizeFormat 
hdl_112431.pdfPublished version196.43 kBAdobe PDFView/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.