The F-box is a protein motif of approximately 50 amino acids that functions as a site of protein-protein interaction. F-box proteins were first characterized as components of SCF ubiquitin-ligase complexes (named after their main components, Skp I, Cullin, and an F-box protein), in which they bind substrates for ubiquitin-mediated proteolysis. The F-box motif links the F-box protein to other components of the SCF complex by binding the core SCF component Skp I. F-box proteins have more recently been discovered to function in non-SCF protein complexes in a variety of cellular functions. There are 11 F-box proteins in budding yeast, 326 predicted in Caenorhabditis elegans, 22 in Drosophila, and at least 38 in humans. F-box proteins often include additional carboxy-terminal motifs capable of protein-protein interaction; the most common secondary motifs in yeast and human F-box proteins are WD repeats and leucine-rich repeats, both of which have been found to bind phosphorylated substrates to the SCF complex. The majority of F-box proteins have other associated motifs, and the functions of most of these proteins have not yet been defined.
Gene organization and evolutionary history
There is little known at present about the organization of the genes encoding F-box proteins, with most studies focusing on the protein products. This article provides an overview of those studies, summarizing the current knowledge about the structure, function and regulation of the F-box proteins.
The F-box was initially observed as a region of homology among the proteins Cdc4, ß-TrCP, Met30, Scon2, and MD6, all of which contain WD (Trp-Asp) repeats, by Kumar and Paietta in 1995 . The implications of the homology were not appreciated until Bai et al.  recognized in 1996 that the F-box was a widespread motif that was required for protein-protein interaction. The name F-box was given by Bai et al. on the basis of the presence of the motif in cyclin F.
The F-box motif itself is generally found in the amino-terminal half of proteins and is often coupled with other motifs in the carboxy-terminal part of the protein, the two most common of which in humans are leucine-rich repeats (LRRs) and WD repeats. The nomenclature for human F-box proteins proposed by the Human Genome Organization follows the pattern proposed by Cenciarelli et al.  and Winston et al. : FBXL denotes a protein containing an F-box and LRRs; FBXW denotes a protein with an F-box and WD repeats; and FBXO denotes a protein with an F-box and either another or no other motif. A similar nomenclature is followed in mice, but in other organisms, proteins are not at present named according to the presence of an F-box.
There are 11 F-box proteins in the completed Saccharomyces cerevisiae genome, 326 predicted in Caenorhabditis elegans, 22 in Drosophila, and at least 38 in humans (see Table 1 and Additional data file 1), but no known examples of F-box proteins in prokaryotes. F-box proteins contain a wide range of secondary motifs including zinc fingers, cyclin domains, leucine zippers, ring fingers, tetratricopeptide (TPR) repeats, and proline-rich regions. The diversity of associated protein domains suggests that F-box motifs have been transferred into existing proteins multiple times during eukaryotic evolution. Evolutionary constraints are higher for certain classes of F-box proteins: all of the human FBXW or FBXL proteins have counterparts in C. elegans with most also conserved in yeast, but only about half of the human FBXO class of proteins is conserved in nematodes or yeast.
Table 1. F-box proteins in the yeast, nematode, and human genomes
An interesting observation is the huge number of F-box proteins in C. elegans. The F-box motif is the fourth most common protein domain in C. elegans, with their number dwarfing the F-boxes found in other species by a factor often. Over half of the predicted C. elegans F-box proteins (135) are found with another motif known as DUF38 (domain of unknown function 38) or FTH (FOG-2 homology) . The FTH/DUF38 domain is found mostly in nematodes, with none in humans or yeast. A second domain, PfamB-45, is found in another 56 C. elegans F-box proteins. Both of these cases suggest the expansion of single progenitor genes within nematodes.
Characteristic structural features
The F-box motif has approximately 50 residues. As can be seen from the consensus sequence (Figure 1), there are very few invariant positions; the least variable are positions 8 (92% of the 234 F-box proteins used for the consensus have leucine or methionine), 9 (92% proline), 16 (86% isoleucine or valine), 20 (81% leucine or methionine), and 32 (92% serine or cysteine). This lack of a strict consensus makes identification by eye difficult; it is therefore prudent to use search algorithms to detect F-boxes. Currently, the two best search algorithms are found in the Prosite and Pfam databases . Occasionally, one database will give a significant score to an F-box in a given protein when the other does not detect it, so both databases should be searched.
Figure 1. The F-box consensus sequence. The consensus was derived from the alignment of 234 sequences used to create the Pfam F-box profile ; the single-letter amino-acid code is used. Bold and underlined capital letters signify residues found in over 40% of the F-box sequences; bold, non-underlined, capital letters signify residues found in 20-40% of the F-boxes; bold lower case letters indicate residues found in 15-19% of the F-boxes; and non-bold lower case letters indicate residues found in 10-14% of the F-boxes. A minority of F-boxes contain small insertions in the alignment after positions 11 or 24, or small (1-3 residue gaps) at various locations.
Localization and function
There have been a limited number of studies analyzing the subcellular localization of F-box proteins, and in all but a couple of cases this analysis was performed with overexpressed tagged proteins (see for example the supplementary material in [3,4]). Some F-box proteins were found to be distributed both in the cytoplasm and in the nucleus. The identical localization of wild-type and mutant F-box proteins demonstrates that the presence of the F-box and the F-box-dependent binding to Skp1 does not determine the subcellular localization of these proteins. While the expression of mRNAs encoding some F-box proteins have been found in all tissues tested, others are clearly tissue-specific. Because of the large number of F-box proteins, this information is too complex to be summarized here.
The F-box motif functions to mediate protein-protein interaction. F-box proteins were first described as components of SCF ubiqutin-ligase (E3) complexes [7,8]. SCF complexes contain four components: Skp1, a cullin, Rbx1/Roc1/Hrt1, and an F-box protein (Figure 2a) [9,10,11]. SCF complexes facilitate interaction between substrates and ubiquitin-conjugating enzymes, which then covalently transfer ubiquitin onto substrates. Poly-ubiquitinated substrates are subsequently degraded by the 26S proteasome . The F-box protein is the subunit of the SCF complex that binds specific substrates, and it links to the complex by binding Skp1 through the F-box itself.
Figure 2. F-box protein functions. (a) The SCF complex. The F-box protein is linked to the SCF complex via interaction between the F-box and Skp1. A ubiquitin-conjugating enzyme (Ubc) binds to the SCF complex and transfers ubiquitin (Ub) onto substrates bound by the F-box protein. When the substrate becomes poly-ubiquitinated, it is degraded by the 26S proteasome. (b) Skp1 binds to the F-box of Ctf13, facilitating Ctf13 phosphorylation, which allows Ctf13 to form the structural core of the CBF3 centromere-binding complex. (c) The F-box of Elongin A binds Elongin C (El C). The association of Elongins B and C with A increases Elongin A transcriptional activity. (d) The FOG-2/GLD-1 complex binds the 3' UTR of tra-2 mRNA to translationally repress it. The function of the F-box of FOG-2 is currently unknown. (e) Cyclin F binds to cyclin B1-cdc2 through a direct association of the cyclin F 'cyclin box' with the CRS domain of cyclin B1, and may be required for cyclin B1 nuclear localization. The function of the F-box of cyclin F is unknown. NLS, nuclear localization signal.
In both yeast and human cells, multiple SCF complexes are present that differ only in the F-box protein component. In yeast, there are three characterized SCF complexes: SCFCdc4, SCFMet30, and SCFGrr1, designated according to their F-box-containing component. The ability of the SCF backbone to bind multiple F-box proteins, each with specific substrate specificity, substantially increases the substrate repertoire. The F-box proteins found to function in SCF complexes have so far been those that have WD repeats or LRRs in their carboxyl termini, with substrate binding occurring via those motifs. Interestingly, human FBX04 and FBXO7 have been found to co-immunoprecipitate both with the cullin Cull and with Skp1, and the immunoprecipitates are associated with ubiquitin-ligase activity, suggesting that classes of F-box proteins other than the FBXW and FBXL classes can function in SCF complexes .
SCF complexes generally recognize substrates after they are phosphorylated on specific epitopes . Phosphorylation is one of the major mechanisms used by cells to rapidly transduce signals. SCF complexes are therefore ideal for dynamic processes that require an abrupt change to be made irreversible (at least in the short term) via the degradation of key proteins. Examples of such processes are cell-cycle phase transitions - during which the cell-cycle regulators that were required for the previous phase are degraded as the cell enters the new phase - and shifts in transcription that last for a longer time period than otherwise because a transcriptional inhibitor is degraded. There is a wide variety of SCF targets that include cell-cycle regulators, for example, G1-phase cyclins, cyclin-dependent kinase inhibitors, DNA replication factors, and transcription factors that promote cell-cycle progression, as well as non-cell-cycle functions, such as a cytoskeletal regulator, cell-surface receptors, transcription-factor inhibitors, and non-cell-cycle transcription factors (Table 2).
Table 2. F-box proteins that function in SCF complexes
F-box proteins have also been found to function in four other biochemical contexts. First, in yeast, the Ctf13 protein contains a diverged F-box motif that is not picked up by Prosite or Pfam search algorithms, but which has been demonstrated to be required for binding to Skp1 . Ctf13 is an integral component of the CBF3 kinetochore complex, which binds microtubules to the condensed mitotic chromosomes (Figure 2b). Binding of Skp1 facilitates Ctf13 phosphorylation by an unknown kinase, which allows Ctf13 to assemble into the CBF3 complex [13,14]. Complete loss of CTF13 is lethal, while at permissive temperatures, Ctf13 temperature-sensitive mutants missegregate chromosomes .
Second, Elongin A, the transcriptionally active subunit of the Elongin (SIII) complex - which facilitates transcription elongation by RNA polymerase II  - is an F-box protein (Figure 2c). Elongin A was isolated by virtue of its ability to increase the catalytic rate of transcript elongation by RNA polymerase II in vitro . Binding of the other components of the complex, Elongin B and C, increases the specific activity of Elongin A. The F-box motif of Elongin A is in the smallest region shown to be sufficient for Elongin A to bind Elongin C in both yeast and humans [17,18]. Elongin C has homology to Skp1; the F-box-Elongin C interaction may therefore be evolutionarily conserved.
The third additional biochemical context in which F-boxes are implicated is in C. elegans, where the F-box protein FOG-2, which also contains an FTH/DUF38 motif, forms a complex with the RNA-binding protein GLD-1 through an interaction with the FTH/DUF38 domain and/or sequences carboxy-terminal to it (Figure 2d) . FOG-2 is required for spermatogenesis in C. elegans hermaphrodites. The complex binds the 3' untranslated region of tra-2 mRNA in the germline and inhibits its translation, thereby allowing spermatogenesis to occur [5,19]; the function of the F-box motif in FOG-2 has not been determined.
Finally, in both Xenopus and human, cyclin F has been found to bind cyclin B1 through a direct protein interaction between the cyclin box of cyclin F and the cytoplasmic retention signal (CRS) domain of cyclin B (Figure 2e) . Subcellular mislocalization of cyclin F or cyclin B causes a co-mislocalization of the other cyclin, indicating a strong interaction. The distribution of cyclin B1 changes from cytoplasmic to nuclear during the transition from G2 to M phase, yet neither cyclin B1 nor the associated cdc2 kinase has a nuclear localization signal (NLS). The interaction of cyclin B1 with cyclin F, which has two NLSs, may be important in mediating its nuclear entry. The function of the F-box motif of cyclin F is currently unknown.
F-box proteins have been observed to be regulated by several mechanisms and at different levels: for example, synthesis, degradation, and association with SCF components. The three yeast F-box proteins Cdc4, Grr1, and Met30 are intrinsically unstable proteins whose levels do not oscillate during the cell cycle. It appears that they are subjected to ubiquitin-proteasome mediated degradation by an autocatalytic mechanism. Whereas the degradation of Cdc4 and Grr1 is dependent on their abilities to bind Skp1 through their F-boxes [21,22], Met30 seems to be ubiquitinated in a cullin-dependent manner but in an F-box-independent manner .
Mammalian Skp2 is degraded by the ubiquitin-proteasome pathway but its expression is mostly regulated at a transcriptional level (A.C. Carrano and M.P., unpublished observations; ). The expression of both Skp2 mRNA  and Skp2 protein  are cell-cycle-regulated, peaking in S phase and declining as cells progress through M phase. In contrast, the expression of the other subunits of the SCFSkp2 ligase complex (Cull, Skp1, and Roc1), as well as its ubiquitin-conjugating enzyme (Ubc3), do not fluctuate through the cell cycle. Thus, although the ubiquitination of Skp2 substrates is regulated by their own phosphorylation, which allows their recognition by Skp2, a second level of control is ensured by the cell-cycle oscillations in Skp2 levels. The only characterized post-translational modification of an F-box protein is phosphorylation of Skp2 on Ser76 by the cyclin A-cdk2 complex , but the significance of this modification is currently unknown.
Enforced expression of ß-catenin induces the expression of the F-box protein ß-TrCP . Although ß-catenin can act as a transcriptional regulator, induction of ß-TrCP by ß-catenin is due to a stabilization of ß-TrCP mRNA. As ß-catenin is an SCFß-Trcp substrate, stimulation of ß-TrCP expression by ß-catenin results in an accelerated degradation of ß-catenin itself, suggesting that a negative feedback loop may control the ß-catenin pathway. Finally, the association of Grr1 with Skp1 is regulated by glucose levels . Grr1 is required to transduce the glucose signal to transcriptional regulatory proteins. When glucose levels are high, the post-translational association of Grr1 with Skp1 is markedly increased, and this effect is dependent on the carboxy-terminal region of Grr1.
Currently, the dominant paradigm for F-box proteins is the SCF complex, in which the F-box motif is required to tether the substrate-binding protein to the complex. Much current research is focused on identifying the F-box proteins that function in SCF complexes and the substrates that are bound by each F-box protein. The functions of the majority of F-box proteins are still unknown. Given the structural diversity of the family, it is likely that they will be involved in diverse cellular activities. Determining the enzymatic functions of these uncharacterized proteins will prove to be an important area of future research.
An open question is whether the F-box motif is specific for binding to Skp1 or Skp1-like proteins (for example, Elongin C). There are currently no examples of F-boxes binding other types of proteins. Interestingly, in C. elegans, where there is such a large number of F-box proteins, the ancestral Skp1 gene has also undergone amplification to produce 17 paralogs , potentially increasing the number of F-box-binding proteins.
In the four years since the discovery of the F-box, intensive research has illuminated the function of F-box proteins in several cellular settings. They are the critical determinant for controlling SCF substrate selection and are positioned as key regulators in many pathways of cell signaling, transcription, and the cell cycle. It is likely that the currently discovered functions are just the tip of the iceberg and that the range of F-box-dependent process will continue to expand.
E.T.K. is supported by NIH grant R01-GM55297 and HFSPO grant RG-229; M.P. is supported by HFSPO grant RG0229, an Irma T. Hirschl scholarship, and by the NIH grants R01-CA76584, R01-GM57587, P30-CA16087, and R21-CA66229.
Proc Natl Acad Sci USA 1995, 92:3343-3347.
The first description of the protein region that became known as the F box.PubMed Abstract | Publisher Full Text | PubMed Central Full Text
Cell 1996, 86:263-274.
This work was the first to recognize the F-box as a widespread protein motif,and demonstrated that the F-box bound to the Skp1 protein.PubMed Abstract | Publisher Full Text
Curr Biol 1999, 9:1177-1179.
This paper extended the number of known F-box proteins in humans and demonstrated that F-box proteins without WD or LRR motifs can bind to SCF components in vivo.PubMed Abstract | Publisher Full Text
Curr Biol 1999, 9:1180-1182.
Extended the number of known F-box proteins in humans and mice.PubMed Abstract | Publisher Full Text
Clifford R, Lee M-H, Nayak S, Ohmachi M, Giorgini F, Schedl T: FOG-2, a novel F-box containing protein, associates with the GLD-1 RNA binding protein to direct male sex determination in the C. elegans hermaphrodite germline.
Development 2000, in press.
Demonstrates that the F-box protein FOG-2 functions in a protein complex with the RNA binding protein GLD-1 to repress tra-2 mRNA translation.
ISREC ProfileScan Server [http://www.isrec.isb-sib.ch/software/PFSCAN_form.html] webcite
This site can be used to search both the Pfam and Prosite databases for F-box motifs.
Cell 1997, 91:209-219.
In this seminal paper and the following one, the authors reconstituted active SCF E3 complexes with purified components thereby demonstrating the biochemical action of the complex and the subunit interaction.PubMed Abstract | Publisher Full Text
Cell 1997, 91:221-230.
See .PubMed Abstract | Publisher Full Text
Curr Opin Genet Dev 2000, 10:54-64.
Provides an overview of SCF and APC complexes.PubMed Abstract | Publisher Full Text
Annu Rev Cell Dev Biol 1999, 15:435-467.
A detailed review of SCF ubiquitin-ligase complexes.PubMed Abstract | Publisher Full Text
Prog Biophys Mol Biol 1999, 72:299-328.
A second detailed review of SCF ubiquitin-ligase complexes.PubMed Abstract | Publisher Full Text
Annu Rev Biochem 1998, 67:425-479.
A detailed review of the ubiquitin proteolytic pathway.PubMed Abstract | Publisher Full Text
J Cell Biol 1999, 145:933-950.
This work provides evidence that the F-box motif of Ctf13 is required for binding Skp1 and that the CBF3 complex assembles around Ctf13.PubMed Abstract | Publisher Full Text
Cell 1997, 91:491-500.
Describes Skp1 binding to the kinetochore protein Ctf13 and promoting the activating-phosphorylation of Ctf13.PubMed Abstract | Publisher Full Text
Cell 1993, 73:761-774.
This paper describes the Ctf13 loss-of-function phenotype.PubMed Abstract | Publisher Full Text
FASEB J 1998, 12:1437-1446.
A review of RNA polymerase II transcriptional elongation factors, including the F-box protein Elongin A.PubMed Abstract | Publisher Full Text
EMBO J 1996, 15:5557-5566.
This work and  found that the smallest human or yeast  Elongin A region sufficient for binding Elongin C contained the F-box motif.PubMed Abstract | Publisher Full Text
J Biol Chem 2000, 275:11174-11180.
See PubMed Abstract | Publisher Full Text
EMBO J 1999, 18:258-269.
Shows that GLD-1 translationally represses tra-2 by binding the tra-2 3' UTR.PubMed Abstract | Publisher Full Text
EMBO J 2000, 19:1378-1388.
Demonstrates that cyclin F binds cyclin B through a cyclin box-CRS interaction.PubMed Abstract | Publisher Full Text
Mol Cell 1998, 2:571-580.
This study represents the first demonstration that F-box proteins are unstable proteins.PubMed Abstract | Publisher Full Text
Proc Natl Acad Sci USA 1999, 96:9124-9129.
This paper shows that ubiquitination of the F-box protein Cdc4 and Grr1 requires all the core components of the SCF and an intact F box.PubMed Abstract | Publisher Full Text | PubMed Central Full Text
EMBO J 2000, 19:282-294.
Demonstrates that Met30 can be degraded in an F-box-independent manner.PubMed Abstract | Publisher Full Text
Cell 1995, 82:915-925.
Describes the cloning of Skp1 and Skp2 as cyclin-A-interacting proteins.PubMed Abstract | Publisher Full Text
Nat Cell Biol 1999, 1:193-199.
This work and [39,40] demonstrate that the SCF Skp2 complex targets the degradation of p27Kip1in vitro and in vivo.PubMed Abstract | Publisher Full Text
Mol Cell Biol 1999, 19:635-645.
Identifies the phosphorylation site in Skp2 targeted by cyclin A.PubMed Abstract | Publisher Full Text | PubMed Central Full Text
Mol Cell 2000, 5:877-882.
Shows that ß-catenin induces a stabilization of ß-Trcp mRNA,suggesting that a negative feedback loop regulation may control the ß-catenin pathway.PubMed Abstract | Publisher Full Text
EMBO J 1997, 16:5629-5638.
This work demonstrates that the ability of Grr1 to bind the SCF component Skp1 is upregulated in cells grown in the presence of glucose.PubMed Abstract | Publisher Full Text
The InterPro project catalogs proteomes by analyzing protein motifs. Links from this page can produce lists of all of the cataloged F-box proteins in an organism, with accompanying figures of the motifs in each protein.
This page provides links to the alignments used to generate the Pfam F-box search algorithm.
Database of DNA and protein sequences.
Samach A, Klenz JE, Kohalmi SE, Risseeuw E, Haughn GW, Crosby WL: The UNUSUAL FLORAL ORGANS gene of Arabidopsis thaliana is an F-box protein required for normal patterning and growth in the floral meristem.
Plant J 1999, 20:433-445.
Demonstrates that the F-box protein UFO, which functions in flower development, interacts with Skp1-related proteins.PubMed Abstract | Publisher Full Text
EMBO J 1997, 16:6521-6534.
Demonstrates that the F-box protein FIM, an ortholog of UFO, interacts with Skp1-related proteins.PubMed Abstract | Publisher Full Text
Development 2000, 127:5071-5082.
Shows that the F-box protein LIN-23 functions cell autonomously to limit cell division in C.elegans.PubMed Abstract | Publisher Full Text
Curr Biol 2000, 10:1131-1134.
Found that mutants of the slimb gene,encoding an F-box protein, have centrosome overduplication.PubMed Abstract | Publisher Full Text
Cell 2000, 102:303-314.
Demonstrates that SCFMet30-mediated ubiqutination of Met4 inhibits Met4 transcriptional activity but does not lead to Met4 degradation.PubMed Abstract | Publisher Full Text
Mol Biol Cell 2000, 11:915-927.
This paper demonstrates that the transcription factor Gcn4 is targetedfor degradation by SCFCdc4.PubMed Abstract | Publisher Full Text | PubMed Central Full Text
Nakayama K, Nagahama H, Minamishima YA, Matsumoto M, Nakamichi I, Kitagawa K, Shirane M, Tsunematsu R, Tsukiyama T, Ishida N, et al.: Targeted disruption of Skp2 results in accumulation of cyclin E and p27(Kip1), polyploidy and centrosome overduplication.
EMBO J 2000, 19:2069-2081.
Demonstration that mice lacking the F-box protein Skp2 are viable but in certain tissues have an accumulation of cyclin E and p27Kip1 as well as centrosome overduplication.PubMed Abstract | Publisher Full Text
Nat Cell Biol 1999, 1:207-214.
See PubMed Abstract | Publisher Full Text
Curr Biol 1999, 9:661-664.
See PubMed Abstract | Publisher Full Text