Skip to main content

A high utility integrated map of the pig genome

Abstract

Background

The domestic pig is being increasingly exploited as a system for modeling human disease. It also has substantial economic importance for meat-based protein production. Physical clone maps have underpinned large-scale genomic sequencing and enabled focused cloning efforts for many genomes. Comparative genetic maps indicate that there is more structural similarity between pig and human than, for example, mouse and human, and we have used this close relationship between human and pig as a way of facilitating map construction.

Results

Here we report the construction of the most highly continuous bacterial artificial chromosome (BAC) map of any mammalian genome, for the pig (Sus scrofa domestica) genome. The map provides a template for the generation and assembly of high-quality anchored sequence across the genome. The physical map integrates previous landmark maps with restriction fingerprints and BAC end sequences from over 260,000 BACs derived from 4 BAC libraries and takes advantage of alignments to the human genome to improve the continuity and local ordering of the clone contigs. We estimate that over 98% of the euchromatin of the 18 pig autosomes and the X chromosome along with localized coverage on Y is represented in 172 contigs, with chromosome 13 (218 Mb) represented by a single contig. The map is accessible through pre-Ensembl, where links to marker and sequence data can be found.

Conclusion

The map will enable immediate electronic positional cloning of genes, benefiting the pig research community and further facilitating use of the pig as an alternative animal model for human disease. The clone map and BAC end sequence data can also help to support the assembly of maps and genome sequences of other artiodactyls.

Background

The pig is a domesticated eutherian mammal and a member of the Cetartiodactyla order, a clade distinct from rodent and primates that last shared a common ancestor with man between 79 and 87 million years ago [1]. It has co-evolved with humans for several thousand years and today has considerable economic importance as a source of meat-based protein. The pig is also being used increasingly in biomedical research for studies of a spectrum of human diseases that may be modeled less well in rodents, including obesity, arthritis, cardiovascular disease, and skin and eye conditions [2, 3]. An area of particular interest has been its potential to supply organs, tissues and cells for transplant through so called xenotransplantation (that is, transplantation of pig organs to humans), providing that issues surrounding porcine endogenous retroviruses, cytomegaloviruses and lymphotropic herpesvirus can be resolved [4, 5].

To promote the identification of genes associated with single gene disorders and the mapping of quantitative trait loci (QTL), a variety of genomic resources have been developed in recent years [6]. Microsatellite (ms) linkage maps and whole-genome radiation hybrid (RH) maps have been constructed that contain over 3,000 markers and provide resolution down to approximately 1 Mb [7–10].

Several large insert bacterial artificial chromosome (BAC) clone libraries have been generated [11, 12]. The libraries have been used successfully to isolate and characterize pig genes and to map and sequence small regions of the genome of particular interest, for example the major histocompatibility complex [13], and the killer immunoglobulin-like receptor gene [14]. The characterization of specific genes has been facilitated by expressed sequence tag sequencing projects that have generated approximately one million sequences in the public domain. A low genome sequence coverage of 0.66× generated from multiple pigs [15] has also facilitated the identification of specific genes and should serve as an abundant source of polymorphism data, such as single nucleotide polymorphisms, once a reference sequence is obtained.

For many species, physical maps have played an integral part in the construction of their genome sequence [16–18]. The capacity to select a minimum tile path (MTP) of overlapping clones covering the genome provides not only a substrate for genome sequencing [19, 20] but also a resource for the generation of mapped clone arrays for chromosome structure comparison [21, 22]. A physical map also has great utility for targeted positional cloning and QTL mapping [23, 24]. Porcine genomic resources have been well exploited to provide comprehensive characterization of the genome. Blocks of conserved segments have been established in relation to the human genome. Cytogenetic evidence indicates that the mean lengths of conserved segments are twice as long between the human and pig as between the mouse and human genomes [25, 26] and the rate of recombination in the porcine genome has been shown to be lower than in humans [27, 28]. These characteristics suggested that the finished human genome would provide a good framework for the construction of a physical clone map of the pig genome through alignment of pig BAC end sequences (BES), using an approach that proved successful in the assembly of the mouse physical map [17].

In this manuscript we address the development of a highly contiguous physical map that integrates radiation hybrid maps and four BAC clone libraries. The BAC map represents the first phase of the international pig genome sequencing project [3]. It provides an immediate resource to the porcine community for rapid electronic positional cloning and a substrate for efficient and economic selection of a MTP of BACs for the determination of the genome sequence. The clone map and BAC end sequence data can also help to support the assembly of maps and genome sequences of other artiodactyls.

Results

Fingerprint and BAC end sequence data

The physical map of the pig genome incorporates BAC clones from four different libraries that have been used for positional cloning and to map QTLs in the pig genome. HindIII restriction digest fingerprints (see Materials and methods) were generated from 264,987 BAC clones, corresponding to approximately 16-fold coverage of the genome (Table 1) based on an estimated genome size of 2.6 to 2.7 Gb [29, 30]. The fingerprint coverage comprises 6.7 genome equivalents of the CHORI-242 library [31] generated from a single Duroc sow, approximately four genome equivalents from each of both the PigEBAC (male Large White × Meishan F1) [11] and the RPCI-44 BAC libraries (constructed from pooled material from four male pigs, one-quarter Meishan, three-eighths Yorkshire, and three-eighths Landrace) [32], and 1.5-fold clone coverage of the INRA BAC library (made from a single Large White male) [12].

Table 1 Fingerprint and BAC end sequence summary

BES were generated for all clones (see Materials and methods) from the CHORI-242 and PigEBAC libraries, and for fingerprinted clones from the RPCI-44 and INRA libraries (Table 1). The sequences have been deposited in the Ensembl and NCBI trace repositories [33, 34] (EMBL:CL320165-CL440255; EMBL:CT033849; EMBL:CT033851-CT476799; EMBL:CR861517-CR925665).

A total of 620,089 high quality sequence reads were produced from 335,463 BACs. After trimming vector sequences, the reads had an average Phred Q20 length of 635 bases and 92% were mated pairs. Taken together, the BES provide a total of 393,756,515 bases representing 15% of the pig genome.

Porcine BAC end sequence alignment to the human genome: a guide for pig map assembly

We took advantage of the conservation of synteny between the porcine and the human genomes to inform the assembly of the fingerprint map, by aligning the pig BES to Build 35 of the human genome [20] (see Materials and methods). After removal of 19% of BES that contained fewer than 100 bases of continuous, non-repetitive DNA, alignment to the human genome sequence using BLASTN (< 1e-05, ≥ 100 bp match, ≥ 66% sequence identity) yielded 297,742 (48% of the total) BES with a match to human sequences (Table 2). The representation of each of the BAC libraries in the aligned set was proportional to the number of clones in the combined BES, indicating that none of the libraries showed any obvious bias (Table 1 in Additional data file 1). Using the requirement that BES alignments to the human genome were consistent with BAC positioning within fingerprint contigs, we established 172,169 high confidence so-called homologous crosslinks [17] between the pig and human genomes (Table 2). The average distance between crosslinks was 17 kb. As observed previously for the mouse genome, the incidence of BES alignments to human chromosomes 19 and Y was lower than observed on other chromosomes. This is most likely due to the large regions of repetitive sequences on these chromosomes that adversely affect the number of unique BES placements. The matching frequency between pig and human genomes is approximately three-fold greater than that observed between mouse and human [17], consistent with the higher level of structural and sequence similarity observed between the pig and human genomes [10, 26, 35, 36].

Table 2 Summary of pig BAC end sequence alignments to NCBI build 35 of the human genome

Of the 297,742 BES that were aligned to the human genome, 132,675 (45%) mapped to regions containing annotated genes. Using the 28,597 sequences that overlap at least 1 human exon, it was possible to identify pig BACs that mapped, at least partially, to 11,180 of 21,999 annotated human genes. The distribution of pig BES matches to human exons is shown in Table 2 in Additional data file 1. It is striking that although human chromosome (HAS)19 has the lowest density of pig BES matches, with only 53 per Mb, it has the highest density of exon matches, with 19.6 per Mb. This presumably reflects the fact that HSA19 is the most gene rich of all human chromosomes [37].

The mapping of pig clones to the human genome through BES alignments can be viewed in human Ensembl [38], go to ContigView and turn on Pig BAC ends from the DAS source menu.

Assembly of the physical BAC map

The assembly of the BAC clone map was affected in two phases. In the first phase the HindIII digest fingerprint data representing 16 genome equivalents were analyzed and assembled using the FPC database [39]. The initial automated assembly was performed using a cutoff of 10-10 and generated > 12,000 contigs containing binned and ordered overlapping sets of fingerprints. Following preliminary assembly, contigs were merged by identifying fingerprint overlaps using a lower stringency supported by ordering information derived from BES placement on the human genome. In order to avoid 'humanizing' the map assembly, contigs were not joined if the alignment to the human genome was not supported by fingerprint overlap data. In the second phase of the map assembly the consistency of contig joining was assessed and further merges were made. In this phase, human/pig homology breakpoints [36] were targeted using information about the relative positions of markers from the UIUC RH map of the pig genome [10]. The RH map contains 2,413 markers distributed at an average distance of 1 per Mb across the genome and was integrated with the FPC database by fingerprinting 2,083 RPCI-44 clones with BES containing markers positioned on the RH map.

The incorporation of human-pig homology data combined with fingerprint and RH marker data across a region of HSA6 42.6-43.7 Mb and pig chromosome (SSC)7 39.3-40.2 Mb is illustrated in Figure 1. Using the combined fingerprint, RH map and human/pig conservation of synteny data, we have reduced the number of contigs from 8,718 to 172 with an increase in average contig length from 367 to 15,000 Kb. These data have also been used to construct a detailed map of conserved syntenic blocks between pig and human (Figure 2a, b). The map is accessible through Pre-Ensembl [40].

Figure 1
figure 1

Alignment between human chromosome 6 tilepath and pig chromosome 7 physical map via porcine BES matches. (a) Mb scale; (b) human sequence tilepath; (c) BES matches to human; (d) sequenced clone EMBL:CR956379 (CH242-196P11); (e) pig clone map - green indicate clones with BAC end sequence match to human; (f) UIUC RH map; (g) estimated Mb.

Figure 2
figure 2

Blocks of conserved synteny between pig and human. (a) Pig SSC7 to human chromosomes 6, 14 and 15. (b) HSA13 compared to pig chromosome 11. Block inversions between pig and human are denoted with broken lines. Contig coverage is depicted by bars in the center of SSC7 and HSA13.

In the current integrated map, 96% of the RH markers show concordance (2,001/2,083 markers) between their assignment on the RH map and their physical map location, based on positioning of the fingerprint contigs containing BES matches (Figure 1 in Additional data file 1). There are 45 markers that appear to have a different chromosome assignment in the two maps and 37 markers have local mis-ordering on the fingerprint map. We are currently investigating the discrepancies to assess whether the anomalies have arisen as a result of mis-labeling of BES or fingerprints, or whether there are regions of the RH map in which markers are incompletely resolved.

Genome coverage

The physical map comprises 172 contigs distributed over 18 autosomes and 2 sex chromosomes X and Y (Table 3). Over 98% of the contigs have been placed and ordered on the UIUC RH map and markers designed from BES in the remaining unordered contigs are being mapped on the RH panel to position them in the genome. The level of continuity of the current map is higher than has been achieved for other mammalian genomes using similar approaches, probably reflecting the high level of conserved synteny between the pig and human genomes and the benefit of the availability of the near complete human genome sequence, along with a map of RH markers distributed evenly across the genome. The most contiguous chromosome is the acrocentric SSC13, which shares homology to all of HSA3 and HSA21, and is represented in the current assembly by a single contig of 218 Mb. To our knowledge, this is the longest map contig that has been constructed prior to genome sequencing. Cytogenetic data from the most centromeric and telomeric clones on SSC13 indicate that they coincide with the ends of the chromosome showing good evidence of map completion (Additional data file 2).

Table 3 Distribution of contigs across pig chromosomes

SSCX is currently the most fragmented chromosome, with 27 contigs. There are several possible explanations for the contig number being three- to four-fold higher than might be expected on the basis of size comparison with the autosomes. Firstly, the resolution of the RH map is lower on the X, as it is hindered by the presence of the selectable marker for the panel (HPRT) on this chromosome [8, 41]. The clone coverage of SSCX is also slightly lower compared to the autosomes as only one of the BAC libraries, the CHORI-242, is derived from a female animal. Given its homology to other mammalian X chromosomes, it is also likely that continuity of the physical map is affected by the abundance of repetitive elements such as LINE1s [42].

Using the alignment of the map to the human genome sequence and fiber FISH experiments, we estimate that the average size of the map gaps is 200-250 kb, giving an approximate total of 40 Mb not covered in the current map assembly. We are attempting to close the remaining gaps through further screening of the CHORI-242 BAC library and through the construction and end sequencing of a sheared fosmid library generated from DNA used to make the CHORI-242 library.

From karyotyping we have performed there are regions of ribosomal DNA (rDNA) near the centromere of SSC10p and on acrocentric SSC16. Given the repetitive nature of these sequences it is probable that the map lacks complete coverage across these regions.

Comparison of chromosome size

Our estimates of chromosome size from the fingerprint map are consistent with previous estimates of the relative sizes of the chromosomes, with the exception of chromosomes 14 and 15. It appears from the map that SSC15 (173 Mb) is larger than SSC14 (139 Mb) (Table 3). It is difficult at this stage to determine whether the smaller size estimate of chromosome 14 relative to 15 is due to a higher GC content and gene density reducing the number of HindIII fragments produced from the BACs and thus resulting in artificial suppression of chromosome size calculated from the fragment number (see Materials and methods). The relative size estimates are consistent, however, with a previous report by Mikawa et al. [43]. In addition, the potential presence of un-clonable regions, such as sequences unstable in current BAC vector systems, especially on SSC14, will diminish map coverage. We will aim to cover such regions using alternative vectors, such as fosmids.

Chromosome Y

Although 50% of the clones fingerprinted came from BAC libraries generated from male pigs, only around 0.25% of BES (1,350) and BAC fingerprints (675) are estimated to derive from the Y chromosome (Table 1). In addition to the small number of sequences, potential BES matches to HSAY may have been excluded due to the repetitive nature of the sequence. However, some coverage across Y has been established. Publicly available gene markers already assigned to clones from the PigEBAC library [44] have been incorporated into the FPC database and have facilitated the detection of contigs for each gene. We anticipate extending the map coverage of the Y chromosome by further integration of Y specific markers or through the incorporation of flow-sorted chromosome Y specific libraries.

Utility of the physical map for rapid positional cloning of pig genes

Positional cloning of regions implicated in disease or traits of agronomic importance has typically involved the identification of regions of interest through large scale quantitative trait locus studies [6], followed by the establishment of a physical map for the region using either homologous coding sequences for the region of interest from human, or localized pig markers to isolate BAC clones experimentally through hydridization or PCR [23]. By systematically generating a whole genome BAC physical map and aligning the BES to the human genome it is now possible to 'electronically clone' regions of importance in the pig genome more rapidly and cost effectively than before. For example, based on the BES alignment between pig and human shown in Figure 1, we were able to identify BAC clone CH242-196P11 spanning 42.9-43.1 Mb on HSA6. Subsequent sequencing and annotation of CH242-196P11 (accession number CR956379) revealed the presence of the same set of genes as found in the homologous region in human (Figure 3).

Figure 3
figure 3

Comparison of homologous annotation between pig and human. Pig clone CH242-196P11 was aligned to HSA6 by BES matches, and the clone was then sequenced [EMBL:CR956379]. The resulting annotation showed the expected homologues to known human genes present in the pig sequence as in human sequences [EMBL:AL035587, EMBL:AL158815, EMBL:AL136304]. Full gene annotation for CH242-196P11 is available in the EMBL entry. Red lines indicate regions of sequence homology with a 75% sequence similarity over the whole region. Figure generated using Artemis Comparison Tool [57].

Discussion

The publicly accessible integrated physical map described here is being used to select a highly efficient tilepath through the pig genome to provide a substrate for the swine genome sequencing project [3, 45]. This follows the strategy used for other large scale mammalian sequencing projects, such as human, mouse and rat [19, 20, 46]. By using information about the extent of clone overlaps derived from the fingerprint data and re-assessing the relative positions of paired BES alignments to the human genome, we have been able to optimize the selection of an initial tilepath of minimally redundant clones through assembled clone contigs across the pig genome. Subsequent rounds of clone selection will be based on fingerprint data and BES alignments to pig clones. The map may also be improved in an iterative manner as the chromosomes are sequenced and the sequence is used to identify further clones to contiguate the map.

Any collection of clones spanning the genome can act as not only the template for clone based sequencing but also as a resource for functional genomic studies, for example, for gene targeting or to produce mapped BAC arrays. It is our intention to maintain the collection of sequenced clones and to make either individual clones or the complete set available to investigators.

Although a whole genome shotgun strategy for genome sequencing has become very popular and can be used to generate sequence to varying depths for a wide range of animal genomes very rapidly [47, 48], physical clone maps continue to provide important orthologous data for the correct assembly of genomes [49]. BAC maps can be used to de-compress regions where highly similar, but non-contiguous repetitive sequences (> 97% identical) are assembled coincidentally [50]. By inspecting BAC contigs demonstrating elevated clone depth (> 3 times expected) and those showing BES alignment to regions of known duplications in the human genome, we have been able to identify approximately 30 regions that are potentially segmentally duplicated in the pig genome. By increasing the map assembly stringency we have begun to resolve these regions prior to tilepath selection and thus improve the coverage of the initial assembly of the pig genome sequence. BAC clones also continue to prove the most effective substrates for finishing genome sequences and thus providing researchers with reference genomes that will have long lasting utility.

Materials and methods

Map construction

A bacterial clone physical map of the genome was constructed using HindIII restriction enzyme fingerprinting [51, 52]. BAC library plates were directly cultured into 170 μl 2× TY media in 384-well plates. After overnight growth the BAC DNA was extracted by alkaline lysis on a Packard MiniTrack, and digested with HindIII in the 384-well plates. Following electrophoresis on 121 lane 1% agarose gels, the data are collected using a Typhoon 8600 fluorimager, raw images were entered into the fingerprint database using the software IMAGE [53]. The output of normalized band values, sizes and gel traces were analyzed in FPC [39], which bins and orders clones on the basis of shared bands, taking marker data into account when available.

The first fingerprint assembly was performed using a cutoff of 10-10 and a fixed match tolerance of 7. The initial set of contigs were merged by ordering them along the human genome through BES matches and, crucially, scrutinizing potential joins by reducing the stringency at which fingerprint overlaps were accepted, where the cutoff was lower than that of the initial build (10-09 to 10-05). A further round of merging along each pig chromosome was enabled by incorporation of the UIUC RH map.

BAC end sequencing

Clones from the CHORI-242 and PigEBAC [8] libraries were end sequenced at the WTSI using the following protocol. BAC clones were inoculated into 1.5 ml of 2× TY media containing 12.5 μg/ml chloramphenicol in 2 ml 96-well growth boxes (Corning Life Sciences, New York, New York, USA) and incubated for 22 hours at 37°C, shaking at 320 rpm. Plates were centrifuged at 4,000 rpm for 3 minutes to obtain pellets, the supernatant was discarded and the cells were re-suspended in100 μl of GTE+RNaseA, and 100 μl of NaOH/SDS was added and mixed before adding 100 μl of 3 M KOAc and further mixing [54]. After filtration through two Millipore plates (MADVN6550/MANUBA50) on a vacuum manifold, the resultant material was dried and re-suspended in 35 μl of 10 mM Tris pH 8.0. The resulting 'prepped' DNA was sequenced with BigDye Terminator Ready Mix v3.0 and BigDye Bufferx5 (Applied Biosystems Foster City, California, USA) using T7 and SP6 primers. The sequenced products were cleaned up by washing with 5 μl 3 M NaOAc and 125 μl 96% ethanol.

The DNA was precipitated by centrifugation at 4,000 rpm for 15 minutes, then washed with a further 100 μl 70% ethanol and the centrifugation repeated. The products were re-suspended in 10 μl of 0.1 M EDTA, pH 7.4 and loaded on to ABI3700 capillary sequencers. Clones from Segment 2 of RPCI-44 porcine BAC library [32] and from Segment 1 of CHORI-242 porcine BAC library were end-sequenced at UIUC using the method described [30]. INRA library BAC clones were end-sequenced at Genoscope (Evry, France) using a method similar to that described above.

BAC end sequence analysis and placement

BES reads were repeat masked using RepeatMasker (version open-3.0.8, RepBase Update 9.11) [55] and then aligned by BLASTN version 2.0MP-WashU (22-Sep-2003) [56] to NCBI Build35 of the human genome. BLASTN hits (< 1e-05, ≥ 100 bp match, ≥ 66% sequence identity) were clustered using the FPC fingerprint assembly. Matches between the pig BES and the human genome had an average score of 866 and an average identity of 74.8%.

Coverage estimates

The sizes of map contigs were estimated from the number of fingerprint bands using a value of 6.047 kb per band. This value was estimated by dividing the entire sequenced insert of 33 finished pig clones (4,964 Kb) by the number of fingerprint bands detected in each clone (Table 3).

Additional data files

The following additional data are available with the online version of this paper. Additional data file 1 includes: Table 1, a breakdown of BES matches to human by library; Table 2, listing pig BES matches to human genes; and Figure 1, a plot of RH map (cR) versus physical map (Mb) position for markers on SSC1. Additional data file 2 shows fluorescent in situ hybridization (FISH) results for the most centromeric CH242-166N14 (red) and most telomeric clone CH242-248H14 (green) on SSC13.

References

  1. Kumar S, Hedges SB: A molecular timescale for vertebrate evolution. Nature. 1998, 392: 917-920. 10.1038/31927.

    Article  PubMed  CAS  Google Scholar 

  2. Tumbleson ME, Schook LB: Advances in Swine in Biomedical Research. 1996, New York: Plenum Press, 905-

    Book  Google Scholar 

  3. Schook LB, Beever JE, Roger J, Humphray S, Archibald A, Chardon P, Milan D, Rohrer G, Eversole K: Swine Genome Sequencing Consortium (SGSC): a strategic roadmap for sequencing the pig genome. Comparative Functional Genomics. 2005, 6: 251-255. 10.1002/cfg.479.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  4. Fishman JA, Patience C: Xenotransplantation: infectious risk revisited. Am J Transplant. 2004, 4: 1383-1390. 10.1111/j.1600-6143.2004.00542.x.

    Article  PubMed  Google Scholar 

  5. Nishitai R, Ikai I, Shiotani T, Katsura N, Matsushita T, Yamanokuchi S, Matsuo K, Sugimoto S, Yamaoka Y: Absence of PERV infection in baboons after transgenic porcine liver perfusion. J Surg Res. 2005, 124: 45-51. 10.1016/j.jss.2004.09.006.

    Article  PubMed  Google Scholar 

  6. Bidanel JP, Rothschild MF: Current status of quantitative trait loci mapping in pigs. Pig News Information. 2002, 23: N39-N54.

    Google Scholar 

  7. Archibald AL, Haley CS, Brown JF, Couperwhite S, McQueen HA, Nicholson D, Coppieters W, Van de Weghe A, Stratil A, Wintero AK, et al: The PiGMaP consortium linkage map of the pig (Sus scrofa). Mamm Genome. 1995, 6: 157-175. 10.1007/BF00293008.

    Article  PubMed  CAS  Google Scholar 

  8. Hawken RJ, Murtaugh J, Flickinger GH, Yerle M, Robic A, Milan D, Gellin J, Beattie CW, Schook LB, Alexander LJ: A first-generation porcine whole-genome radiation hybrid map. Mamm Genome. 1999, 10: 824-830. 10.1007/s003359901097.

    Article  PubMed  CAS  Google Scholar 

  9. Rohrer GA, Alexander LJ, Hu Z, Smith TP, Keele JW, Beattie CW: A comprehensive map of the porcine genome. Genome Res. 1996, 6: 371-391. 10.1101/gr.6.5.371.

    Article  PubMed  CAS  Google Scholar 

  10. Meyers SN, Rogatcheva MB, Larkin DM, Yerle M, Milan D, Hawken RJ, Schook LB, Beever JE: Piggy-BACing the human genome II. A high-resolution, physically anchored, comparative map of the porcine autosomes. Genomics. 2005, 86: 739-752. 10.1016/j.ygeno.2005.04.010.

    Article  PubMed  Google Scholar 

  11. Anderson SI, Lopez-Corrales NL, Gorick B, Archibald AL: A large-fragment porcine genomic library resource in a BAC vector. Mamm Genome. 2000, 11: 811-814. 10.1007/s003350010155.

    Article  PubMed  CAS  Google Scholar 

  12. Rogel-Gaillard C, Bourgeaux N, Billault A, Vaiman M, Chardon P: Construction of a swine BAC library: application to the characterization and mapping of porcine type C endoviral elements. Cytogenet Cell Genet. 1999, 85: 205-211. 10.1159/000015294.

    Article  PubMed  CAS  Google Scholar 

  13. Renard C, Hart E, Sehra H, Beasley H, Coggill P, Howe K, Harrow J, Gilbert J, Sims S, Rogers J, et al: The genomic sequence and analysis of the swine major histocompatibility complex. Genomics. 2006, 88: 96-110. 10.1016/j.ygeno.2006.01.004.

    Article  PubMed  CAS  Google Scholar 

  14. Sambrook JG, Sehra H, Coggill P, Humphray S, Palmer S, Sims S, Takamatsu HH, Wileman T, Archibald AL, Beck S: Identification of a single killer immunoglobulin-like receptor (KIR) gene in the porcine leukocyte receptor complex on chromosome 6q. Immunogenetics. 2006, 58: 481-486. 10.1007/s00251-006-0110-9.

    Article  PubMed  CAS  Google Scholar 

  15. Wernersson R, Schierup MH, Jorgensen FG, Gorodkin J, Panitz F, Staerfeldt HH, Christensen OF, Mailund T, Hornshoj H, Klein A, et al: Pigs in sequence space: a 0.66X coverage pig genome survey based on shotgun sequencing. BMC Genomics. 2005, 6: 70-10.1186/1471-2164-6-70.

    Article  PubMed  PubMed Central  Google Scholar 

  16. McPherson JD, Marra M, Hillier L, Waterston RH, Chinwalla A, Wallis J, Sekhon M, Wylie K, Mardis ER, Wilson RK, et al: A physical map of the human genome. Nature. 2001, 409: 934-941. 10.1038/35057157.

    Article  PubMed  CAS  Google Scholar 

  17. Gregory SG, Sekhon M, Schein J, Zhao S, Osoegawa K, Scott CE, Evans RS, Burridge PW, Cox TV, Fox CA, et al: A physical map of the mouse genome. Nature. 2002, 418: 743-750. 10.1038/nature00957.

    Article  PubMed  CAS  Google Scholar 

  18. Krzywinski M, Wallis J, Gosele C, Bosdet I, Chiu R, Graves T, Hummel O, Layman D, Mathewson C, Wye N, et al: Integrated and sequence-ordered BAC- and YAC-based physical maps for the rat genome. Genome Res. 2004, 14: 766-779. 10.1101/gr.2336604.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  19. Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, Agarwal P, Agarwala R, Ainscough R, Alexandersson M, An P, et al: Initial sequencing and comparative analysis of the mouse genome. Nature. 2002, 420: 520-562. 10.1038/nature01262.

    Article  PubMed  CAS  Google Scholar 

  20. Finishing the euchromatic sequence of the human genome. Nature. 2004, 431: 931-945. 10.1038/nature03001.

  21. Ishkanian AS, Malloff CA, Watson SK, DeLeeuw RJ, Chi B, Coe BP, Snijders A, Albertson DG, Pinkel D, Marra MA, et al: A tiling resolution DNA microarray with complete coverage of the human genome. Nat Genet. 2004, 36: 299-303. 10.1038/ng1307.

    Article  PubMed  CAS  Google Scholar 

  22. Chung YJ, Jonkers J, Kitson H, Fiegler H, Humphray S, Scott C, Hunt S, Yu Y, Nishijima I, Velds A, et al: A whole-genome mouse BAC microarray with 1-Mb resolution for analysis of DNA copy number changes by array comparative genomic hybridization. Genome Res. 2004, 14: 188-196. 10.1101/gr.1878804.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  23. Jeon JT, Park EW, Jeon HJ, Kim TH, Lee KT, Cheong IC: A large-insert porcine library with sevenfold genome coverage: a tool for positional cloning of candidate genes for major quantitative traits. Mol Cells. 2003, 16: 113-116.

    PubMed  CAS  Google Scholar 

  24. Haase B, Humphray SJ, Lyer S, Renner M, Poustka A, Mollenhauer J, Leeb T: Molecular characterization of the porcine deleted in malignant brain tumors 1 gene (DMBT1). Gene. 2006, 376: 184-191. 10.1016/j.gene.2006.03.002.

    Article  PubMed  CAS  Google Scholar 

  25. Biltueva LS, Yang F, Vorobieva NV, Graphodatsky AS: Comparative map between the domestic pig and dog. Mamm Genome. 2004, 15: 809-818. 10.1007/s00335-004-2391-3.

    Article  PubMed  CAS  Google Scholar 

  26. Rettenberger G, Klett C, Zechner U, Kunz J, Vogel W, Hameister H: Visualization of the conservation of synteny between humans and pigs by heterologous chromosomal painting. Genomics. 1995, 26: 372-378. 10.1016/0888-7543(95)80222-8.

    Article  PubMed  CAS  Google Scholar 

  27. Ellegren H, Chowdhary BP, Johansson M, Marklund L, Fredholm M, Gustavsson I, Andersson L: A primary linkage map of the porcine genome reveals a low rate of genetic recombination. Genetics. 1994, 137: 1089-1100.

    PubMed  CAS  PubMed Central  Google Scholar 

  28. Marklund L, Johansson Moller M, Hoyheim B, Davies W, Fredholm M, Juneja RK, Mariani P, Coppieters W, Ellegren H, Andersson L: A comprehensive linkage map of the pig based on a wild pig-Large White intercross. Anim Genet. 1996, 27: 255-269.

    Article  PubMed  CAS  Google Scholar 

  29. Schmitz A, Chaput B, Fouchet P, Guilly MN, Frelat G, Vaiman M: Swine chromosomal DNA quantification by bivariate flow karyotyping and karyotype interpretation. Cytometry. 1992, 13: 703-710. 10.1002/cyto.990130706.

    Article  PubMed  CAS  Google Scholar 

  30. Rogatcheva MB, He W, Larkin DM, Marron BM, Ehrhardt MJ, Beever JE, Schook LB: Survey sequencing of the porcine genome using BAC end sequences. Plant and Animal Genome XII final abstract guide: 10-14 January 2004. 2004, San Diego, CA, 59-

    Google Scholar 

  31. CHORI-242 BAC library. [http://bacpac.chori.org/porcine242.htm]

  32. Fahrenkrug SC, Rohrer GA, Freking BA, Smith TP, Osoegawa K, Shu CL, Catanese JJ, de Jong PJ: A porcine BAC library with tenfold genome coverage: a resource for physical and genetic map integration. Mamm Genome. 2001, 12: 472-474. 10.1007/s003350020015.

    Article  PubMed  CAS  Google Scholar 

  33. Ensembl Trace Server. [http://trace.ensembl.org/]

  34. NCBI Trace Archive. [http://www.ncbi.nlm.nih.gov/Traces/trace.cgi]

  35. Thomas JW, Touchman JW, Blakesley RW, Bouffard GG, Beckstrom-Sternberg SM, Margulies EH, Blanchette M, Siepel AC, Thomas PJ, McDowell JC, et al: Comparative analyses of multi-species sequences from targeted genomic regions. Nature. 2003, 424: 788-793. 10.1038/nature01858.

    Article  PubMed  CAS  Google Scholar 

  36. Murphy WJ, Larkin DM, Everts-van der Wind A, Bourque G, Tesler G, Auvil L, Beever JE, Chowdhary BP, Galibert F, Gatzke L, et al: Dynamics of mammalian chromosome evolution inferred from multispecies comparative maps. Science. 2005, 309: 613-617. 10.1126/science.1111387.

    Article  PubMed  CAS  Google Scholar 

  37. Grimwood J, Gordon LA, Olsen A, Terry A, Schmutz J, Lamerdin J, Hellsten U, Goodstein D, Couronne O, Tran-Gyamfi M, et al: The DNA sequence and biology of human chromosome 19. Nature. 2004, 428: 529-535. 10.1038/nature02399.

    Article  PubMed  CAS  Google Scholar 

  38. Human Ensembl Site. [http://www.ensembl.org/Homo_sapiens/]

  39. Soderlund C, Humphray S, Dunham A, French L: Contigs built with fingerprints, markers, and FPC V4.7. Genome Res. 2000, 10: 1772-1787. 10.1101/gr.GR-1375R.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  40. Pig Map in Pre-ensembl. [http://pre.ensembl.org/Sus_scrofa/index.html]

  41. Yerle M, Echard G, Robic A, Mairal A, Dubut-Fontana C, Riquet J, Pinton P, Milan D, Lahbib-Mansais Y, Gellin J: A somatic cell hybrid panel for pig regional gene mapping characterized by molecular cytogenetics. Cytogenet Cell Genet. 1996, 73: 194-202.

    Article  PubMed  CAS  Google Scholar 

  42. Ross MT, Grafham DV, Coffey AJ, Scherer S, McLay K, Muzny D, Platzer M, Howell GR, Burrows C, Bird CP, et al: The DNA sequence of the human X chromosome. Nature. 2005, 434: 325-337. 10.1038/nature03440.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  43. Mikawa A, Suzuki H, Suzuki K, Toki D, Uenishi H, Awata T, Hamasima N: Characterization of 298 ESTs from porcine back fat tissue and their assignment to the SSRH radiation hybrid map. Mamm Genome. 2004, 15: 315-322. 10.1007/s00335-003-2323-7.

    Article  PubMed  CAS  Google Scholar 

  44. Quilter CR, Blott SC, Mileham AJ, Affara NA, Sargent CA, Griffin DK: A mapping and evolutionary study of porcine sex chromosome genes. Mamm Genome. 2002, 13: 588-594. 10.1007/s00335-002-3026-1.

    Article  PubMed  CAS  Google Scholar 

  45. International Swine Genome Sequencing Consortium. [http://piggenome.org/]

  46. Gibbs RA, Weinstock GM, Metzker ML, Muzny DM, Sodergren EJ, Scherer S, Scott G, Steffen D, Worley KC, Burch PE, et al: Genome sequence of the Brown Norway rat yields insights into mammalian evolution. Nature. 2004, 428: 493-521. 10.1038/nature02426.

    Article  PubMed  CAS  Google Scholar 

  47. Margulies EH, Vinson JP, Miller W, Jaffe DB, Lindblad-Toh K, Chang JL, Green ED, Lander ES, Mullikin JC, Clamp M: An initial strategy for the systematic identification of functional elements in the human genome by low-redundancy comparative sequencing. Proc Natl Acad Sci USA. 2005, 102: 4795-4800. 10.1073/pnas.0409882102.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  48. Lindblad-Toh K, Wade CM, Mikkelsen TS, Karlsson EK, Jaffe DB, Kamal M, Clamp M, Chang JL, Kulbokas EJ, Zody MC, et al: Genome sequence, comparative analysis and haplotype structure of the domestic dog. Nature. 2005, 438: 803-819. 10.1038/nature04338.

    Article  PubMed  CAS  Google Scholar 

  49. Warren RL, Varabei D, Platt D, Huang X, Messina D, Yang SP, Kronstad JW, Krzywinski M, Warren WC, Wallis JW, et al: Physical map-assisted whole-genome shotgun sequence assemblies. Genome Res. 2006, 16: 768-775. 10.1101/gr.5090606.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  50. She X, Jiang Z, Clark RA, Liu G, Cheng Z, Tuzun E, Church DM, Sutton G, Halpern AL, Eichler EE: Shotgun sequence assembly and recent segmental duplications within the human genome. Nature. 2004, 431: 927-930. 10.1038/nature03062.

    Article  PubMed  CAS  Google Scholar 

  51. Humphray SJ, Knaggs SJ, Ragoussis I: Contiguation of bacterial clones. Methods Mol Biol. 2001, 175: 69-108.

    PubMed  CAS  Google Scholar 

  52. Schein J, Kucaba T, Sekhon M, Smailus D, Waterston R, Marra M: High-throughput BAC fingerprinting. Methods Mol Biol. 2004, 255: 143-156.

    PubMed  CAS  Google Scholar 

  53. Image Software Site. [http://www.sanger.ac.uk/Software/Image]

  54. End Sequencing Protocol. [http://www.sanger.ac.uk/Teams/Team51/PACBACPrep.shtml]

  55. RepeatMasker. [http://www.repeatmasker.org/faq.html#faq3]

  56. WU-BLASTN. [http://blast.wustl.edu/]

  57. Carver TJ, Rutherford KM, Berriman M, Rajandream MA, Barrell BG, Parkhill J: ACT: the Artemis Comparison Tool. Bioinformatics. 2005, 21: 3422-3423. 10.1093/bioinformatics/bti553.

    Article  PubMed  CAS  Google Scholar 

Download references

Acknowledgements

The work undertaken at WTSI for this project was funded by the UK Biotechnology and Biological Sciences Research Council, the UK Department of the Environment, Food and Rural Affairs (grant: LS3105), Roslin Institute and Sygen-PIC. The work was also supported by USDA/NRI-CREES grant AG2001-35205-11698 and USDA-ARS AG58-5438-2-313. We would like to acknowledge the contributions of the Mapping Core Group, the WebTeam, and BAC End Sequencing Group The Wellcome Trust Sanger Institute, and The Keck Center for Comparative and Functional Genomics, University of Illinois at Urbana-Champaign for their contributions to the project. We also thank Genoscope for BAC-end sequencing the INRA BAC library.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sean J Humphray.

Electronic supplementary material

13059_2007_1620_MOESM1_ESM.doc

Additional data file 1: Table 1: breakdown of BES matches to human by library. Table 2: pig BES matches to human genes. Figure 1: plot of RH map (cR) versus physical map (Mb) position for markers on SSC1. (DOC 64 KB)

13059_2007_1620_MOESM2_ESM.tiff

Additional data file 2: Fluorescent in situ hybridization (FISH) results for the most centromeric CH242-166N14 (red) and most telomeric clone CH242-248H14 (green) on SSC13. (TIFF 248 KB)

Authors’ original submitted files for images

Rights and permissions

Reprints and permissions

About this article

Cite this article

Humphray, S.J., Scott, C.E., Clark, R. et al. A high utility integrated map of the pig genome. Genome Biol 8, R139 (2007). https://doi.org/10.1186/gb-2007-8-7-r139

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/gb-2007-8-7-r139

Keywords