Transcription factors of the T-box family are required both for early cell-fate decisions, such as those necessary for formation of the basic vertebrate body plan, and for differentiation and organogenesis. When mutated, T-box genes give dramatic phenotypes in mouse and zebrafish, and they have been implicated both in fundamentals of limb patterning and in a number of human congenital malformations such as Holt-Oram, ulnar-mammary and DiGeorge syndromes, as well as being amplified in a subset of cancers. Genes encoding members of the T-box family have recently been shown to comprise approximately 0.1% of genomes as diverse as those of nematodes and humans and have been identified in a wide variety of animals from ctenophores (comb jellies) to mammals; they are, however, completely absent from genomes from other organisms (such as the model plant Arabidopsis thaliana).
Gene organization and evolutionary history
Positional cloning and sequencing of the genes defective in the mouse gastrulation mutant Brachyury, also known as T, and of the Drosophila behavior mutant optomotor-blind (omb), show extensive sequence similarity between the amino-terminal regions of the two proteins [1,2]; the region of similarity contains a unique sequence-specific DNA-binding domain. Since these initial observations, over 50 proteins have been identified with sequence similarity to the DNA-binding domain of Brachyury and Omb. This domain is now referred to as the T-box and the genes are collectively referred to as the T-box gene family. Members of this family are expressed in, and are required for, the development of multiple cell types in diverse organisms, as demonstrated by genetic studies in flies, worms, fish, mice, dogs, and humans [3,4,5,6,7]. For many of these genes, such as Brachyury, there are clear orthologs (direct homologs), with a high degree of sequence similarity, expression pattern, and function between a variety of vertebrates, including fish, frogs, dogs, and mice [1,2,3,4,5,6,7]. Other T-box genes appear to be unique to a particular species; for instance, VegT, a T-box gene thought to be required for endoderm formation in Xenopus, has no apparent ortholog in mice or humans. The family has 18 members in mammals; representatives have been identified from a wide range of animals, including various chordates, Drosophila melanogaster (11 members), Caenorhabditis elegans (14 members), annelids, and cnidarians.
Analysis of T-box genes shows most of their loci to be dispersed randomly throughout chordate genomes (see Table 1 for their locations in the human genome), although several examples of clustering have been reported. One instance occurs in C. elegans, for which genomic sequencing has shown a tight linkage between Tbx8 and Tbx9 ; a second is in mouse, where Tbx2 and Tbx4 are tightly linked on chromosome 11 and Tbx3 and Tbx5 are linked on chromosome 5. The association between these latter T-box genes appears to be conserved in other mammals, as human Tbx2 and Tbx4, and Tbx3 and Tbx5, have a similar arrangement on chromosomes 17 and 12, respectively [9,10]. Phylogenetic analysis of these cognate pairs suggests they arose through initial duplication of an ancestral gene by an unequal crossover event between two alleles; in the case of Tbx2 and Tbx4, and Tbx3 and Tbx5, these events occurred at least 600 million years ago [8,11]. Although possible mechanisms for the duplication have been put forward, such as duplication of entire chromosomes or genomes, the functional consequences of the pairs of linked Tbx genes remains to be established.
Table 1. Mouse and human T-box-containing genes
T-box genes contain multiple exons, and the T-box is generally encoded by at least five exons dispersed over a relatively large distance. For example, the human Tbx5 gene contains eight exons distributed over 53 kilobases (kb) of chromosome 12 . As has been found for other gene families, the intron-exon boundaries of T-box homologs are conserved throughout evolution, but the lengths of the introns vary between species [10,12]. Most T-box family members encode a single transcript and there are few direct demonstrations of alternative exon splicing. One exception is Xenopus VegT/Antipodean protein, which is found in two different isoforms resulting from alternative splicing of the 3' end of the VegT gene. Interestingly, the isoforms appear to be tissue-specific: one is present maternally in the endodermal layer of the embryo and the other is expressed zygotically in the mesodermal layer .
Characteristic structural features
T-box proteins generally range in size from 50 kDa to 78 kDa. Brachyury, the founding member of the T-box gene family, has been shown to encode a sequence-specific DNA-binding protein that functions as a transcriptional activator [1,14,15,16,17]. Although crystallographic analysis of T-box proteins has been achieved only for a truncated version of the Xenopus homolog of Brachyury, Xbra, and a truncated version of Tbx3, the results clearly demonstrate that the T-box is unlike any other DNA-binding domain  (Figure 1). Studies with a number of T-box proteins have shown that they comprise at least two structural and functional domains: a sequence-specific DNA-binding domain (the T-box) and a transcriptional activator or repressor domain [3,4]. The relative position of the domains varies between different members of the family, but the order is conserved for any one member of the T-box family and its orthologs [10,12].
Figure 1. Ribbon diagram of crystal structures of (a,b) Xenopus Xbra and (c) human TBX3 bound to DNA. Beta strands are depicted in red and alpha helices in (a,b) orange or (c) turquoise. Reproduced with permission from [18,61].
The T-box is defined as the minimal region within the T-box protein that is both necessary and sufficient for sequence-specific DNA binding [3,4,5,14,17]. Despite the sequence variations within the T-box between family members, examination of downstream targets and binding-site selection experiments for a number of T-box proteins show that all members of the family so far examined bind to the DNA consensus sequence TCACACCT. In several binding-site selection studies, members of the T-box family preferentially bound sequences that contain two or more core motifs arranged in various orientations. The ability to bind the sequence is protein-specific; for example, Xbra can bind to two core motifs arranged head-to-head, whereas VegT cannot; conversely, VegT can bind to two core motifs arranged tail-to-tail whereas Xbra cannot. The biological relevance of these findings remains unknown, as no downstream target of any T-box gene has been found to contain a double site .
The T-box is a relatively large DNA-binding domain, generally comprising about a third of the entire protein (17-26 kDa), and individual T-box gene family members show varying degrees of homology across the domain. Specific residues within the T-box are 100% conserved in all members of the family, however. This observation has provided the basis for subdivision of the family (see Figure 2) . It has recently been demonstrated that the specificity of several T-box proteins for their target sites lies mainly within the T-box. But specificity does not appear to reflect binding affinity , suggesting that other functions may lie in the T-box, such as regions required for protein-protein interactions. Consistent with this proposal, our mutational analysis has identified a single amino-acid residue within the T-box of Xenopus Xbra, Eomesodermin, and VegT (Lys149, Asn155 or Asn353, respectively) that is required for the correct target specificity of the respective proteins . In addition, one T-box protein, Mga, contains both a T-box and a basic helix-loop-helix leucine zipper (bHLH-zip) domain . When heterodimerized with the bHLH protein Max, Mga is converted to a transcriptional activator with apparent dual specificity, regulating genes containing either a Max-binding or a T-box-domain-binding site . Conversely, human Tbx22 has been found to contain a truncated T-box lacking residues found in the amino-terminal portion of all other family members and would be predicted not to bind DNA . Tbx22 may therefore represent a case in which the T-box has functions other than DNA binding.
Figure 2. Conservation of selected T-box residues and the presence of diagnostic residues for different members of the family. Position 149 is always a lysine in Xbra proteins from different species (blue) but not in other T-box proteins (red). A diagram of Xenopus Xbra is above, showing the relative positions of the DNA-binding domain, the nuclear localization signal, and the transcriptional activation domain.
Transcriptional regulatory domains
T-box proteins have been demonstrated to function both as transcriptional activators and as repressors. In all cases studied, the transcriptional regulation activity has been shown to require sequences located in the carboxy-terminal portion of the protein. Only in the case of Brachyury and its frog and zebrafish orthologs, however, has the region both necessary and sufficient for transcriptional regulation been accurately mapped [16,22]. Interestingly, there are only a few small blocks of conservation between the Brachyury orthologs in this region, and the overall level of similarity is low.
Localization and function
The T-box genes share two characteristics of interest to researchers studying cell specification and differentiation: they tend to be expressed in specific organs or cell types, especially during development, and they are generally required for the development of those tissues (Table 1). In the few cases for which intracellular localization has been analyzed, T-box proteins have shown to be localized exclusively in the nucleus. These considerations, together with their DNA-binding and transcriptional activation/repression capacity, mean that T-box proteins are well placed to fulfill a wide array of important regulatory roles in development. This is supported by the observation that mutant alleles commonly give a phenotype even in heterozygotes (that is, they show haploinsufficiency), indicating that the level of a T-box protein is important for determining its function. In addition, mutational studies have demonstrated that T-box genes are required cell-autonomously (active in the cell in which they are expressed). For example, Brachyury is expressed in posterior mesoderm and in the developing notochord, and it is required for the formation of these cells in mice [1,23,24,25,26,27,28].
T-box genes are also required in specific tissue types at later stages of development: for example, Tbx2, Tbx3, Tbx4 and Tbx5 in the developing limb (reviewed in ). Tbx2 and Tbx3 are expressed in the anterior and posterior margins of both forelimb and hindlimb buds . The posterior expression of Tbx3 is crucial for the development of the more distal limb elements, as shown in human patients lacking TBX3, who have ulnar-mammary syndrome and lack the ulna (a forearm bone) and digits [31,32].
In contrast to the overlap in expression of Tbx2 and Tbx3, their close homologs, Tbx4 and Tbx5, are expressed exclusively in the hindlimb bud and the forelimb bud, respectively . Although the signals that direct expression of these genes to the forelimb or hindlimb are unknown, it is likely that at least part of this involves an interpretation of the 'Hox code', the set of Hox genes expressed in the mesoderm that eventually produces the mesenchyme cells that migrate into the limb buds [33,34]. The expression of Tbx4 lies downstream of Ptx1, a homeobox-containing gene expressed in posterior lateral plate mesoderm. Expression is, however, independent of the signals that direct limb-bud outgrowth. Significantly, these genes not only delineate forelimb and hindlimb territories but also specify forelimb or hindlimb type [33,34]. Retroviral overexpression of Tbx4 inappropriately in forelimb mesenchyme of chick embryos, or of Tbx5 in the hindlimb, can transform the tissue into hindlimb or forelimb type, respectively . The transformation includes both the mesodermally derived skeletal elements and the overlying ectoderm, which develops feathers or scales depending on which gene is expressed . Tbx4 and Tbx5 are thus thought to act as 'selector' genes for the limb bud, defining what type of limb develops; this is corroborated by the correlation of Tbx4 or Tbx5 expression with hindlimb or forelimb identity in the limb buds induced by ectopic expression of fibroblast growth factors in the flank . Finally, mutations in human TBX5 affect forelimb growth and heart development [35,36]. Interestingly, missense mutations within the T-box of human TBX5 that contact the minor groove of DNA (such as Arg237Gln) result primarily in limb abnormalities, whereas the other aspect of Holt-Oram syndrome, aberrant heart development, is predominantly seen as a result of a missense mutation that alters a residue that contacts the major groove (Gly80Arg) . This suggests that tissue-specific target genes are affected by mutations in different residues within the T-box of TBX5.
Mutations in T-box proteins have also been implicated in DiGeorge syndrome, a complex disease that includes abnormalities of the heart's outflow tract [38,39,40], and Tbx2 is amplified in some types of breast cancer .
Despite the essential role for individual members of the T-box gene family in a wide variety of developmental processes, relatively little is known about the genetic and biochemical pathways in which T-box genes act [42,43]. Thus, one of the critical areas for future research is to identify the factors that act directly upstream and downstream of individual T-box genes.
At the cellular level, genetic mutations have provided clues about the requirement for T-box genes in a variety of developmental processes, but the exact function of T-box genes, including questions of genetic redundancy, still needs to be established. For example, in ulnar-mammary syndrome patients it is not clear why there is no corresponding defect in the legs, a region that normally expresses Tbx3 [30,31]. This may perhaps be due to a redundant function with other T-box genes expressed in the hindlimb, such as Tbx2 or Tbx4.
In order to dissect the function of individual T-box genes, it will also be necessary to generate allelic series (as has been useful for the study of Holt-Oram syndrome) and conditional mutations. A case for the latter has already been demonstrated for Eomesodermin, a gene expressed in all vertebrates just prior to gastrulation in the prospective mesoderm and, in the mouse, in the trophectoderm, an extraembryonic tissue that is required for placenta formation and is thus unique to mammals . Mice lacking Eomesodermin fail at, or shortly after, implantation, because of a defect in the trophectoderm. This phenotype can be rescued by wild-type trophectoderm, even if the embryo itself is mutant, but when embryonic tissues lack Eomesodermin, mesoderm differentiation and migration fails completely .
Finally, relatively little is known about post-translational processing, protein turnover, or protein stability for any T-box protein. In addition, only in the case of Tbx5 [46,47], Tbr1 , and Mga  has a protein-protein interaction domain been reported and, with the exception of Mga, the biological significance of these interactions has not yet been determined.
Nature 1990, 343:617-622.
The seminal piece of work identifying Brachyury as the gene disrupted in a classical mouse mutation.PubMed Abstract | Publisher Full Text
Biochem Biophys Res Commun 1992, 186:918-925.
First clues that Brachyury and Omb may be members of a protein family.PubMed Abstract
Int Rev Cytol 2001, 207:1-70.
A comprehensive review of the T-box gene family.PubMed Abstract
Trends Genet 1999, 15:154-158.
A review of the molecular and cellular aspects of the T-box gene family.PubMed Abstract | Publisher Full Text
BioEssays 1998, 20:9-19.
A review of T-box family members.PubMed Abstract | Publisher Full Text
Haworth K, Putt W, Cattanach B, Breen M, Binns M, Lingaas F, Edwards YH: Canine homolog of the T-box transcription factor T; failure of the protein to bind to its DNA target leads to a short-tail phenotype.
Mamm Genome 2001, 12:212-218.
This paper demonstrates the high degree of conservation in the function of Brachyury and explains why Corgis have short tails.PubMed Abstract | Publisher Full Text
Trends Genet 1994, 10:280-286.
A review of T-box family members and their role in early development.PubMed Abstract | Publisher Full Text
Genomics 1995, 25:214-219.
Shows the conservation of the T-box family from worms to mice.PubMed Abstract | Publisher Full Text
Genomics 1997, 40:262-266.
Work on the evolutionary history of the T-box family.PubMed Abstract | Publisher Full Text
Entrez Nucleotide [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Nucleotide] webcite
One of the main sequence databases.
Genetics 1996, 144:249-254.
A proposal for how an ancestral locus gave rise to two T-box gene clusters.PubMed Abstract
Genomics 1998, 48:24-33.
A proposal for classifying the T-box family based on genomic structure of the DNA-binding domain.PubMed Abstract | Publisher Full Text
Mech Dev 1999, 86:87-98.
The first direct demonstration of alternative splicing of a T-box gene.PubMed Abstract | Publisher Full Text
EMBO J 1993, 12:3211-3220.
Brachyury can bind to DNA in a sequence-specific fashion.PubMed Abstract
Dev Biol 1995, 168:406-415.
Further evidence that the sequence and expression of Brachyury is evolutionarily conserved.PubMed Abstract | Publisher Full Text
Development 1996, 122:2427-2435.
Xbra and Ntl function as transcriptional activators in vivo.PubMed Abstract | Publisher Full Text
Development 2001, 128:3749-3758.
Analysis of the binding site specificity and protein specificity of Xbra, Eomesodermin, and VegT and how it relates to early mesodermal patterning.PubMed Abstract | Publisher Full Text
Nature 1997, 389:884-888.
The first crystal structure of a T-box protein.PubMed Abstract | Publisher Full Text
Genome 1997, 40:458-464.
The emergence of the gene family in the worm.PubMed Abstract
EMBO J 1999, 18:7019-7028.
Shows that one T-box gene family member may have more than one type of DNA-binding domain, and hence more than one function.PubMed Abstract | Publisher Full Text
Gene 2000, 255:289-296.
Cloning of a T-box gene family member with a truncated version of the DNA-binding domain.PubMed Abstract | Publisher Full Text
EMBO J 1995, 14:4763-4772.
Brachyury can function as a transcriptional activator.PubMed Abstract
J Exp Zool 1935, 70:429-459.
This paper and 24,25] are classic studies on the mouse mutation Brachyury.
Genetics 1938, 23:573-584.
Proc Natl Acad Sci USA 1944, 30:134-140.
Nature 1990, 343:657-659.
The first demonstration that a T-box gene family member is highly restricted in its temporal and spatial expression.PubMed Abstract | Publisher Full Text
Nature 1991, 353:348-351.
Brachyury functions in a cell-autonomous fashion.PubMed Abstract | Publisher Full Text
Development 1993, 117:1321-1331.
Brachyury regulates cell migration.PubMed Abstract | Publisher Full Text
Cell Tissue Res 1999, 296:57-66.
A review of the role of T-box genes in limb development.PubMed Abstract | Publisher Full Text
Chapman DL, Garvey N, Hancock S, Alexiou M, Agulnik SI, Gibson-Brown JJ, Cebra-Thomas J, Bollag RJ, Silver LM, Papaioannou VE: Expression of the T-box family genes, Tbx1-Tbx5, during early mouse development.
Dev Dynam 1996, 206:379-390.
Demonstrates that each member of the T-box gene family has a highly restricted pattern of expression during embryogenesis.PubMed Abstract | Publisher Full Text
Bamshad M, Lin RC, Law DJ, Watkins WC, Krakowiak PA, Moore ME, Franceschini P, Lala R, Holmes LB, Gebuhr TC, et al.: Mutations in human TBX3 alter limb, apocrine and genital development in ulnar-mammary syndrome.
Nat Genet 1997, 16:311-315.
Studies linking ulnar-mammary syndrome with mutations in TBX3.PubMed Abstract
Proc Natl Acad Sci USA 1999, 96:10212-10217.
TBX3 can function as a transcriptional repressor.PubMed Abstract | Publisher Full Text | PubMed Central Full Text
Nature 1999, 398:814-818.
This paper and  show roles for T-box gene family members in limb identity.PubMed Abstract | Publisher Full Text
Development 1998, 125:1867-1875.
See .PubMed Abstract | Publisher Full Text
Basson CT, Bachinsky DR, Lin RC, Levi T, Elkins JA, Soults J, Grayzel D, Kroumpouzou E, Traill TA, Leblanc-Straceski J, et al.: Mutations in human TBX5 cause limb and cardiac malformation in Holt-Oram syndrome.
Nat Genet 1997, 15:30-35.
This study and  link Holt-Oram syndrome to mutations in TBX5.PubMed Abstract
Li QY, Newbury-Ecob RA, Terrett JA, Wilson DI, Curtis AR, Yi CH, Gebuhr T, Bullen PJ, Robson SC, Strachan T, et al.: Holt-Oram syndrome is caused by mutations in TBX5, a member of the Brachyury (T) gene family.
Nat Genet 1997, 15:21-29.
See .PubMed Abstract
Basson CT, Huang T, Lin RC, Bachinsky DR, Weremowicz S, Vaglio A, Bruzzone R, Quadrelli R, Lerone M, Romeo G, et al.: Different TBX5 interactions in heart and limb defined by Holt-Oram syndrome mutations.
Proc Natl Acad Sci USA 1999, 96:2919-2924.
This paper suggests that missense mutation may disrupt different functions of Tbx5.PubMed Abstract | Publisher Full Text | PubMed Central Full Text
Cell 2001, 104:619-629.
This paper and [39,40] demonstrate that the phenotype of the velocadio-facial/DiGeorge syndrome is due, in part, to a deletion of TBX1.PubMed Abstract | Publisher Full Text
Nat Genet 2001, 27:286-291.
See .PubMed Abstract | Publisher Full Text
Lindsay EA, Vitelli F, Su H, Morishima M, Huynh T, Pramparo T, Jurecic V, Ogunrinu G, Sutherland HF, Scambler PJ, et al.: Tbx1 haploinsufficiency in the DiGeorge syndrome region causes aortic arch defects in mice.
Nature 2001, 410:97-101.
See .PubMed Abstract | Publisher Full Text
Jacobs JJ, Keblusek P, Robanus-Maandag E, Kristel P, Lingbeek M, Nederlof PM, van Welsem T, van de Vijver MJ, Koh EY, Daley GQ, van Lohuizen M: Senescence bypass screen identifies TBX2, which represses Cdkn2a (p19(ARF)) and is amplified in a subset of human breast cancers.
Nat Genet 2000, 26:291-299.
TBX2 is amplified in a subset of breast cancers.PubMed Abstract | Publisher Full Text
Cold Spring Harb Symp Quant Biol 1997, 62:337-346.
Delineation of the Brachyury/Xbra molecular pathway.PubMed Abstract
Dev Growth Differ 2001, 43:1-11.
Comprehensive review of potential targets of the T-box gene family.PubMed Abstract | Publisher Full Text
Mech Dev 1999, 81:199-203.
Cloning and expression of Eomesodermin from mouse.PubMed Abstract | Publisher Full Text
Nature 2000, 404:95-99.
The requirement of Eomesodermin in separate lineages during early mouse embryogenesis.PubMed Abstract | Publisher Full Text
Bruneau BG, Nemer G, Schmitt JP, Charron F, Robitaille L, Caron S, Conner DA, Gessler M, Nemer M, Seidman CE, Seidman JG: A murine model of Holt-Oram syndrome defines roles of the T-box transcription factor Tbx5 in cardiogenesis and disease.
Cell 2001, 106:709-721.
Functional conservation of Tbx5 between mouse and human.PubMed Abstract | Publisher Full Text
Nat Genet 2001, 28:276-280.
Protein-protein interactions with a T-box gene.PubMed Abstract | Publisher Full Text
Nature 2000, 404:298-302.
Interactions of a T-box protein with other proteins.PubMed Abstract | Publisher Full Text
Yi CH, Terrett JA, Li QY, Ellington K, Packham EA, Armstrong-Buisseret L, McClure P, Slingsby T, Brook JD: Identification, mapping, and phylogenomic analysis of four new human members of the T-box gene family: EOMES, TBX6, TBX18, and TBX19.
Genomics 1999, 55:10-20.
Initial identification of TBX19.PubMed Abstract | Publisher Full Text
Proc Natl Acad Sci USA 2001, 98:8674-8679.
Clues to the function of TBX19.PubMed Abstract | Publisher Full Text | PubMed Central Full Text
J Exp Zool 2000, 288:23-31.
Cloning of T-brain and its evolutionary implications.PubMed Abstract | Publisher Full Text
Cell 2000, 100:655-69.
The role of TBX21 in differentiation.PubMed Abstract | Publisher Full Text
Law DJ, Garvey N, Agulnik SI, Perlroth V, Hahn OM, Rhinehart RE, Gebuhr TC, Silver LM: TBX10, a member of the Tbx1-subfamily of conserved developmental genes, is located at human chromosome 11q13 and proximal mouse chromosome 19.
Mamm Genome 1998, 9:397-399.
Cloning of TBX10.PubMed Abstract | Publisher Full Text
Genomics 1998, 51:68-75.
First description of TBX15.PubMed Abstract | Publisher Full Text
Mech Dev 2000, 95:253-258.
Cloning of the zebrafish homolog of Drosophila H15, TBX20/12.PubMed Abstract | Publisher Full Text
Genomics 2000, 67:317-332.
Cloning of the human homolog of Drosophila H15, TBX20/12.PubMed Abstract | Publisher Full Text
Braybrook C, Doudney K, Marcano AC, Arnason A, Bjornsson A, Patton MA, Goodfellow PJ, Moore GE, Stanier P: The T-box transcription factor gene TBX22 is mutated in X-linked cleft palate and ankyloglossia.
Nat Genet 2001, 29:179-183.
Demonstration that in a subset of individuals with cleft palate, the defect is associated with mutations in TBX22.PubMed Abstract | Publisher Full Text
Genomics 1995, 28:255-260.
The initial cloning and characterization of TBX2.PubMed Abstract | Publisher Full Text
Gibson-Brown JJ, Agulnik SI, Chapman DL, Alexiou M, Garvey N, Silver LM, Papaioannou VE: Evidence of a role for T-box genes in the evolution of limb morphogenesis and the specification of forelimb/hindlimb identity.
Mech Dev 1996, 56:93-101.
The suggestion of a role for TBX4 and TBX5 in limb development.PubMed Abstract | Publisher Full Text
Nature 1998, 391:695-697.
The finding that mutations in TBX6 convert mesodermal tissue to neural tissue.PubMed Abstract | Publisher Full Text
Structure 2002, 10:343-356.
The second structure of a T-box protein.PubMed Abstract | Publisher Full Text