<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>2041-9139-1-3</ui><ji>2041-9139</ji><fm>
<dochead>Research</dochead>
<bibl>
<title>
<p>A conserved cluster of three PRD-class homeobox genes (<it>homeobrain</it>, <it>rx </it>and <it>orthopedia</it>) in the Cnidaria and Protostomia</p>
</title>
<aug>
<au ce="yes" id="A1"><snm>Mazza</snm><mi>E</mi><fnm>Maureen</fnm><insr iid="I1"/><email>m_mazza@comcast.net</email></au>
<au ce="yes" id="A2"><snm>Pang</snm><fnm>Kevin</fnm><insr iid="I2"/><email>kpang@hawaii.edu</email></au>
<au ce="yes" id="A3"><snm>Reitzel</snm><mi>M</mi><fnm>Adam</fnm><insr iid="I1"/><insr iid="I3"/><email>areitzel@whoi.edu</email></au>
<au id="A4"><snm>Martindale</snm><mi>Q</mi><fnm>Mark</fnm><insr iid="I2"/><email>mqmartin@hawaii.edu</email></au>
<au ca="yes" id="A5"><snm>Finnerty</snm><mi>R</mi><fnm>John</fnm><insr iid="I1"/><email>jrf3@bu.edu</email></au>
</aug>
<insg>
<ins id="I1"><p>Department of Biology, Boston University, 5 Cummington Street, Boston, MA 02215, USA</p></ins>
<ins id="I2"><p>Kewalo Marine Lab, Pacific Biosciences Research Center, University of Hawaii, 41 Ahui St., Honolulu, HI 96813, USA</p></ins>
<ins id="I3"><p>Woods Hole Oceanographic Institution, Woods Hole, MA 02543, USA</p></ins>
</insg>
<source>EvoDevo</source>
<issn>2041-9139</issn>
<pubdate>2010</pubdate>
<volume>1</volume>
<issue>1</issue>
<fpage>3</fpage>
<url>http://www.evodevojournal.com/content/1/1/3</url>
<xrefbib><pubidlist><pubid idtype="pmpid">20849646</pubid><pubid idtype="doi">10.1186/2041-9139-1-3</pubid></pubidlist></xrefbib>
</bibl>
<history><rec><date><day>1</day><month>12</month><year>2009</year></date></rec><acc><date><day>5</day><month>7</month><year>2010</year></date></acc><pub><date><day>5</day><month>7</month><year>2010</year></date></pub></history>
<cpyrt><year>2010</year><collab>Mazza et al; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>Homeobox genes are a superclass of transcription factors with diverse developmental regulatory functions, which are found in plants, fungi and animals. In animals, several Antennapedia (ANTP)-class homeobox genes reside in extremely ancient gene clusters (for example, the Hox, ParaHox, and NKL clusters) and the evolution of these clusters has been implicated in the morphological diversification of animal bodyplans. By contrast, similarly ancient gene clusters have not been reported among the other classes of homeobox genes (that is, the LIM, POU, PRD and SIX classes).</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>Using a combination of <it>in silico </it>queries and phylogenetic analyses, we found that a cluster of three PRD-class homeobox genes (<it>Homeobrain (hbn)</it>, <it>Rax (rx) </it>and <it>Orthopedia (otp)</it>) is present in cnidarians, insects and mollusks (a partial cluster comprising hbn and rx is present in the placozoan <it>Trichoplax adhaerens</it>). We failed to identify this 'HRO' cluster in deuterostomes; in fact, the <it>Homeobrain </it>gene appears to be missing from the chordate genomes we examined, although it is present in hemichordates and echinoderms. To illuminate the ancestral organization and function of this ancient cluster, we mapped the constituent genes against the assembled genome of a model cnidarian, the sea anemone <it>Nematostella vectensis</it>, and characterized their spatiotemporal expression using <it>in situ </it>hybridization. In <it>N. vectensis</it>, these genes reside in a span of 33 kb with the same gene order as previously reported in insects. Comparisons of genomic sequences and expressed sequence tags revealed the presence of alternative transcripts of Nv-otp and two highly unusual protein-coding polymorphisms in the terminal helix of the Nv-rx homeodomain. A population genetic survey revealed the Rx polymorphisms to be widespread in natural populations. During larval development, all three genes are expressed in the ectoderm, in non-overlapping territories along the oral-aboral axis, with distinct temporal expression.</p>
</sec>
<sec>
<st>
<p>Conclusion</p>
</st>
<p>We report the first evidence for a PRD-class homeobox cluster that appears to have been conserved since the time of the cnidarian-bilaterian ancestor, and possibly even earlier, given the presence of a partial cluster in the placozoan <it>Trichoplax</it>. Very similar clusters comprising these three genes exist in <it>Nematostella </it>and diverse protostomes. Interestingly, in chordates, one member of the ancestral cluster (<it>homeobrain</it>) has apparently been lost, and there is no linkage between <it>rx </it>and <it>orthopedia </it>in any of the vertebrates. In <it>Nematostella</it>, the spatial expression of these three genes along the body column is not colinear with their physical order in the cluster but the temporal expression is, therefore, using the terminology that has been applied to the Hox cluster genes, the HRO cluster would appear to exhibit temporal but not spatial colinearity. It remains to be seen whether the mechanisms responsible for the evolutionary conservation of the HRO cluster are the same mechanisms responsible for cohesion of the Hox cluster and other ANTP-class homeobox clusters that have been widely conserved throughout animal evolution.</p>
</sec>
</sec>
</abs>
</fm><bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>Gene clusters have been crucially important in the evolution of animals because close physical linkage affects genetic recombination, molecular evolution and gene regulation <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B2">2</abbr>
<abbr bid="B3">3</abbr>
<abbr bid="B4">4</abbr>
<abbr bid="B5">5</abbr>
<abbr bid="B6">6</abbr>
<abbr bid="B7">7</abbr>
</abbrgrp>. For example, clustering can contribute to coordinated transcriptional regulation of linked genes if the local chromatin structure affects several genes in the same chromosomal neighborhood or if shared regulatory elements drive the expression of neighboring genes, such as the gene clusters of the human cardiac transcriptome <abbrgrp>
<abbr bid="B8">8</abbr>
</abbrgrp>.</p>
<p>Gene clusters are also important tools for reconstructing genome evolution. The clustering of related genes can reveal the mechanism underlying the expansion of a gene family, and conserved gene clusters define homologous chromosomal segments that may be compared across species <abbrgrp>
<abbr bid="B9">9</abbr>
<abbr bid="B10">10</abbr>
<abbr bid="B11">11</abbr>
<abbr bid="B12">12</abbr>
<abbr bid="B13">13</abbr>
<abbr bid="B14">14</abbr>
<abbr bid="B15">15</abbr>
<abbr bid="B16">16</abbr>
<abbr bid="B17">17</abbr>
<abbr bid="B18">18</abbr>
<abbr bid="B19">19</abbr>
<abbr bid="B20">20</abbr>
<abbr bid="B21">21</abbr>
</abbrgrp>.</p>
<p>Homeobox gene clusters, in particular, have received a great deal of study. Since the pioneering studies by Ed Lewis and co-workers on the Antennapedia (ANTP) and Bithorax complexes of <it>Drosophila </it>
<abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>, comparative genomic studies have revealed (i) that certain homeobox clusters are widely conserved throughout the animal kingdom, (ii) that clustering may influence gene expression and (iii) that clustering may be conserved by stabilizing selection if disrupting the cluster has deleterious effects on the spatiotemporal expression of linked genes <abbrgrp>
<abbr bid="B23">23</abbr>
<abbr bid="B24">24</abbr>
<abbr bid="B25">25</abbr>
<abbr bid="B26">26</abbr>
<abbr bid="B27">27</abbr>
<abbr bid="B28">28</abbr>
</abbrgrp>.</p>
<p>All of the aforementioned homeobox clusters consist of <it>ANTP</it>-class genes <abbrgrp>
<abbr bid="B29">29</abbr>
</abbrgrp>. Based on published studies, conserved clusters of non-<it>ANTP </it>class homeobox genes that date from the time of the bilaterian common ancestor do not appear to be as common, although less ancient clusters of PRD (PRD) class genes have been identified. A cluster of several <it>Rhox </it>genes that function during gametogenesis appears to be restricted to rodents <abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp>. A cluster of three <it>Otx </it>genes has been identified in the sea anemone <it>Nematostella</it>, but this cluster appears to be restricted to anthozoan cnidarians <abbrgrp>
<abbr bid="B30">30</abbr>
</abbrgrp>.</p>
<p>The widespread occurrence of conserved ANTP-class clusters could reflect the unique early history of this class; recent studies suggest that the various ANTP-class gene clusters can all be traced to four ancient arrays (extended Hox, ParaHox, NKL and EHG box <abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp>) or even to a single ancestral 'metaHox' cluster <abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp>. Different elements of the ancestral linkage arrangements could have been conserved in different animal lineages. The non-ANTP homeobox genes may never have been organized in metaclusters, so the retention of subclusters would not have been possible. Alternatively, the apparent paucity of evolutionarily conserved non-ANTP homeobox clusters may simply reflect a search bias; the effort to identify clusters of Hox-related genes has spanned about 20 years and has involved dozens of laboratories. A comparable effort to identify clusters of non-ANTP homeobox genes has not yet been undertaken; however, conserved clusters of Iroquois (<it>irx</it>) genes have been identified in arthropods and vertebrates <abbrgrp>
<abbr bid="B21">21</abbr>
</abbrgrp>. The insect-crustacean ancestor is thought to have possessed a cluster of two <it>irx </it>genes, whereas a cluster of three <it>irx </it>genes is thought to have been present in the common ancestor of vertebrates <abbrgrp>
<abbr bid="B21">21</abbr>
</abbrgrp>. It is not yet clear whether a cluster of <it>irx </it>genes was already present in the protostome-deuterostome ancestor, or whether clusters arose independently in arthropods and vertebrates.</p>
<p>A cluster of three PRD-class homeobox genes (<it>Homeobrain (hbn)</it>, <it>Rax (rx) </it>and <it>Orthopedia (otp)</it>) has been reported in <it>Drosophila </it>
<abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp>. These three genes are clustered within a span of ~38 kb on chromosome 2, and they exhibit very similar spatiotemporal expression. All three genes are involved in patterning specific regions of the embryonic brain. To date, a comparable cluster has not yet been described in other species.</p>
<p>In this report, we describe the <it>hbn</it>-<it>rx</it>-<it>otp </it>cluster in the sea anemone, <it>Nematostella vectensis</it>. The <it>Nematostella </it>cluster spans only ~33.5 kb and the relative order of the genes is the same as in <it>Drosophila</it>, although the transcriptional orientation of <it>orthopedia </it>is reversed. We also report the conservation of all or part of this cluster in other protostomes including a mollusk and a number of insects (for example, mosquito, honeybee and flour beetle), suggesting that (i) this cluster was present in the cnidarian-bilaterian ancestor and (ii) linkage between these genes may be under strong stabilizing selection in some lineages. In the sea anemone, <it>in situ </it>hybridization reveals that all three genes are expressed in the ectoderm in non-overlapping territories along the oral-aboral axis. <it>NvHbn </it>is expressed during early gastrula stages around the blastopore (future mouth), and expression persists in the ectoderm around the base of the tentacles into the juvenile polyp stage. <it>NvRx </it>is expressed in individual cells in the aboral ectoderm, and is first expressed in mid gastrula stages. <it>NvOtp </it>expression first becomes apparent in the larva, in the ectodermal layer of the pharynx. Thus, in <it>Nematostella</it>, the expression of these clustered genes is consistent with temporal but not spatial colinearity.</p>
</sec>
<sec>
<st>
<p>Materials and methods</p>
</st>
<sec>
<st>
<p>Retrieval of <it>Nematostella </it>PRD-class homeodomain genes</p>
</st>
<p>In total, 33 PRD-class homeobox genes were previously identified in the <it>Nematostella </it>genome <abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp> by conducting BLAST searches of a draft genome assembly (available at StellaBase; <url>http://www.stellabase.org/</url>
<abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp>). Phylogenetic analysis of homeodomain sequences by Bayesian and neighbor-joining methods identified single representatives of the <it>homeobrain </it>(HBN_Nv079), <it>orthopedia </it>(OTP_Nv047) and <it>rx </it>(RX_Nv129) families in <it>Nematostella </it>
<abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp>. Subsequently, we designed gene-specific primers to amplify the 3' and 5' ends of the <it>hbn</it>, <it>otp </it>and <it>rx </it>transcripts from cDNA (Table <tblr tid="T1">1</tblr>). Overlapping 5' and 3' RACE (rapid amplification of cDNA ends) fragments were conceptually spliced to reconstruct complete transcripts. In addition, <it>hbn</it>, <it>otp </it>and <it>rx </it>transcripts were also identified among 150,000 <it>Nematostella </it>expressed sequence tags (ESTs) sequenced as part of the genome-sequencing project recently completed by the Joint Genome Institute <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp>. Once the full-length transcripts were assembled, the longest open reading frames (ORFs) in frame with the highly conserved homeodomain were inferred using MacVector V.7.2.3 (Accelrys Inc., San Diego, CA, USA).</p>
<tbl id="T1"><title><p>Table 1</p></title><caption><p>Primers used to amplify RACE products and characterize Rx polymorphism.</p></caption><tblbdy cols="4">
      <r>
         <c ca="left">
            <p>
               <b>Gene</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Procedure</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Name</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Sequence 5' &#8594; 3'</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>hb</it>
            </p>
         </c>
         <c ca="left">
            <p>5'-RACE</p>
         </c>
         <c ca="left">
            <p>F1</p>
         </c>
         <c ca="left">
            <p>CAGGCGAAGTCGAACTACGTTCACGACATACC</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>F2</p>
         </c>
         <c ca="left">
            <p>CGCAA-TACCCCGATGTGTTCACGAGAGAGG</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>3'-RACE</p>
         </c>
         <c ca="left">
            <p>R1</p>
         </c>
         <c ca="left">
            <p>TCTTGCCTCGCT-CAAGTCCAGCCGAAG</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>R2</p>
         </c>
         <c ca="left">
            <p>GGGCAAGTTCCTCTCTCGTGAACACATCGG</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>otp</it>
            </p>
         </c>
         <c ca="left">
            <p>5'-RACE</p>
         </c>
         <c ca="left">
            <p>R1</p>
         </c>
         <c ca="left">
            <p>CGCTGCCAGCTCTTCTCGCATGAACAC</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>R2</p>
         </c>
         <c ca="left">
            <p>CGGA-TAGTGAGTCCTGGCGAAACATCGC</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>3'-RACE</p>
         </c>
         <c ca="left">
            <p>F1</p>
         </c>
         <c ca="left">
            <p>CGAGCGATGTTT-CGCCAGGACTCACTATC</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>F2</p>
         </c>
         <c ca="left">
            <p>CGGATGTGTTCATGCGAGAAGAGCTGG</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>rx</it>
            </p>
         </c>
         <c ca="left">
            <p>5'-RACE</p>
         </c>
         <c ca="left">
            <p>R1</p>
         </c>
         <c ca="left">
            <p>TGTACCCTAACTTCAGGGAGGCTTATTTTTAGCG</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>R2</p>
         </c>
         <c ca="left">
            <p>TGGATAGTGGGATTTCTCGAAGGCTCGC</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>3'-RACE</p>
         </c>
         <c ca="left">
            <p>F1</p>
         </c>
         <c ca="left">
            <p>TCAGAAGA-AACCGTACCACCTTCACAACG</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>F2</p>
         </c>
         <c ca="left">
            <p>TCGAGCGAGCCTTCGAGAAATCCC</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>polymorphism</p>
         </c>
         <c ca="left">
            <p>RX_for</p>
         </c>
         <c ca="left">
            <p>GATCCTTAATTCAAACCCTGGGAC</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>RX_rev</p>
         </c>
         <c ca="left">
            <p>TCGGGCAAATCTGA TACGAGTAG</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>RACE = rapid amplification of cDNA ends.</p>
   </tblfn></tbl>
</sec>
<sec>
<st>
<p>Retrieval of PRD-class homeodomains from other taxa</p>
</st>
<p>Human and fruit fly (<it>Drosophila melanogaster</it>) homeodomains from the PRD class were taken from a previous study <abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp>. We searched for <it>hbn</it>, <it>rx </it>and <it>otp </it>homeodomains in the partially or wholly sequenced genomes of four additional vertebrates (<it>Mus musculus</it>, <it>Gallus gallus</it>, <it>Xenopus tropicalis </it>and <it>Danio rerio</it>), four non-vertebrate deuterostomes (<it>Ciona intestinalis</it>, <it>Branchiostoma floridae</it>, <it>Saccoglossus kowalevskii </it>and <it>Strongylocentrotus purpuratus</it>), three additional insects (<it>Anopheles gambiae</it>, <it>Apis mellifera </it>and <it>Tribolium castaneum</it>), one nematode (<it>Caenorhabditis elegans</it>), one mollusk (<it>Lottia gigantea</it>), one annelid (<it>Capitella teleta</it>) and a placozoan (<it>Trichoplax adhaerens</it>). Database searches were conducted via Entrez <url>http://www.ncbi.nlm.nih.gov/gquery/gquery.fcgi</url> using gene names (<it>orthopedia</it>, <it>rx</it>/<it>rax</it>, <it>homeobrain</it>), and similarity searches were performed using the tBLASTn and BLASTp search algorithms. The homeodomains of <it>hbn</it>, <it>otp </it>and <it>rx </it>from <it>Nematostella</it>, human and fruit fly were used as the query sequences. Sequences that matched the query sequence were then subjected to reciprocal tBLASTn and/or reciprocal BLASTp similarity searches against the genome from which the query sequence was derived. We formally tested for orthology with phylogenetic analyses (see below).</p>
</sec>
<sec>
<st>
<p>Determination of gene linkage and gene structure</p>
</st>
<p>Predicted homeobox sequences for <it>Nematostella hbn</it>, <it>otp </it>and <it>rx </it>were mapped to the publicly available assembly of the <it>Nematostella </it>genome (DOE-Joint Genome Institute (JGI) <it>Nematostella vectensis </it>genome assembly 1.0; <url>http://genome.jgi-psf.org/Nemve1/Nemve1.home.html</url>) using BLASTn to determine whether the <it>hbn</it>, <it>rx </it>and <it>otp </it>loci of <it>Nematostella </it>are linked as in <it>Drosophila </it>
<abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp>. Gene structures were determined by mapping transcripts for <it>hbn</it>, <it>rx </it>and <it>otp </it>against the genome. To identify possible non-homeodomain genes that might be closely linked to <it>hbn</it>, <it>rx </it>and <it>otp</it>, the regions flanking these genes were compared against the RefSeq database using BLASTx (V.2.2.15).</p>
</sec>
<sec>
<st>
<p>Identification of homeobrain-rx-orthopedia clusters in other taxa</p>
</st>
<p>The genomic organization of the <it>hbn</it>, <it>rx </it>and <it>otp </it>cluster in <it>Drosophila </it>has been described previously <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp>. In other species in which two or three of these PRD-class genes were identified, the genes were mapped to the corresponding assembled genome to determine if they might be linked. The level of coverage for different assemblies varies widely, from completed genome assemblies for human, mouse, fruit fly and nematode, to assemblies at the level of scaffolds, contigs and linkage groups for numerous other animals. Therefore, we have uncertainty for linkage of some genes that are present in the genome because current assemblies limit absolute characterization of location. For species in which at least two of the three genes matched the same scaffold, linkage group or chromosome, we determined the relative position and transcriptional direction of each gene from the cluster.</p>
</sec>
<sec>
<st>
<p>Phylogenetic analysis</p>
</st>
<p>
<it>Hbn</it>, <it>otp</it>, <it>rx </it>and other PRD-class homeodomain sequences were aligned using a web implementation of the computer program Muscle <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp>. The default parameters were used, and the alignment algorithm did not introduce any alignment gaps. Phylogenetic analysis was performed using the neighbor-joining method as implemented in the computer package Phylip (V.3.61; <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp>). In total, 111 PRD-class homeodomains were used in the phylogenetic analysis, including 33 from <it>Nematostella</it>, 39 from human and 24 from fruit fly, which had been identified in a previous study <abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp>. Human and fruit fly homeodomain families comprising multiple invariant or nearly invariant homeodomains were pared to a single representative. Select homeodomains were added to provide broader representation from deuterostomes and protostomes (Deuterostomia: <it>Branchiostoma floridae </it>(<it>otp</it>, <it>rx</it>), <it>Strongylocentrotus purpuratus </it>(<it>otp</it>, <it>rx</it>, <it>hbn</it>), <it>Saccoglossus kowalevskii </it>(<it>otp</it>, rx); Protostomia: <it>Caenorhabditis elegans </it>(<it>rx</it>), <it>Patella vulgata </it>(<it>otp</it>), <it>Platynereis dumerilii </it>(<it>rx</it>) and <it>Capitella teleta </it>(<it>hbn</it>). <it>Distalless </it>and <it>hox1 </it>homeodomains from <it>Drosophila melanogaster </it>and <it>Branchiostoma floridae </it>were included as outgroups. Homeodomain sequences and accession numbers for <it>homeobrain</it>, <it>otp </it>and <it>rx </it>are provided in Figure <figr fid="F1">1</figr>. Sequences and accession numbers for the other PRD-class homeodomains used in the phylogenetic analysis are available from Ryan <it>et al. </it>
<abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp>. Pairwise distances between homeodomains were calculated using the PAM distance matrix. Support for individual nodes was assessed using 1,000 replicates of the bootstrap <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp>. Amino acid substitutions were localized to particular branches on the phylogeny using MacClade V.4 <abbrgrp>
<abbr bid="B37">37</abbr>
</abbrgrp>.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Alignment of octapeptide regions, homeodomains and OAR domains from representative <it>hbn</it>, <it>otp </it>and <it>rx </it>genes of cnidarians, protostomes and deuterostomes</p></caption><text>
   <p><b>Alignment of octapeptide regions, homeodomains and OAR domains from representative <it>hbn</it>, <it>otp </it>and <it>rx </it>genes of cnidarians, protostomes and deuterostomes</b>. Homeodomain sequences are aligned against the <it>Drosophila antennapedia </it>homeodomain. <it>Nematostella</it>, human and fruit fly representatives were taken from Ryan <it>et al. </it><abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. Homeodomains from additional taxa were identified by similarity searches (tBLASTn and BLASTp) conducted through the BLAST interface at National Center for Biotechnology Information (see Methods) and at respective genome assemblies available through the Joint Genome Institute. Abbreviations for taxa are as follows: (cnidarians: <it>Acropora millepora </it>(Acm), <it>Hydra magnipapillata </it>(Hym), <it>Nematostella vectensis </it>(Nev); placozoan: <it>Trichoplax adhaerens </it>(Tra); protostome bilaterians: <it>Aedes aegypti </it>(Aea), <it>Anopheles gambiae </it>(Ang), <it>Apis mellifera </it>(Apm), <it>Caenorhabditis elegans </it>(Cae), <it>Capitella teleta </it>(Cap), <it>Lottia gigantea </it>(Log), <it>Tribolium castaneum </it>(Trc); deuterostome bilaterians: <it>Branchiostoma floridae </it>(Brf), <it>Ciona intestinalis </it>(Cii), <it>Danio rerio </it>(Dar), <it>Gallus gallus </it>(Gag), <it>Homo sapiens </it>(Hos), <it>Mus musculus </it>(Mum), <it>Saccoglossus kowalevskii </it>(Sak), <it>Strongylocentrotus purpuratus </it>(Stp), <it>Xenopus laevis </it>(Xel)). The octapeptide <abbrgrp><abbr bid="B72">72</abbr><abbr bid="B73">73</abbr></abbrgrp> and/or the OAR domain <abbrgrp><abbr bid="B51">51</abbr><abbr bid="B75">75</abbr></abbrgrp> are indicated when identified. Two different versions of the <it>Rx </it>homeodomain of <it>Nematostella </it>are shown, and the polymorphic residues are highlighted.</p>
</text><graphic file="2041-9139-1-3-1"/></fig>
</sec>
<sec>
<st>
<p>Population genetic analysis of polymorphisms in the homeodomain of Rx</p>
</st>
<p>To confirm the existence and determine the geographic distribution of two unusual polymorphisms at positions 52 and 59 within the homeodomain of <it>Nematostella Rx</it>, we designed primers to amplify a 585 bp fragment of the gene (Table <tblr tid="T1">1</tblr>). PCR was carried out using 0.5 U Taq DNA polymerase, 1 &#215; Mg-free PCR buffer, 1.7 mM MgCl<sub>2</sub>, 0.67 mM dNTPs, 1 &#956;m of each primer and approximately 5 ng genomic DNA in 20 &#956;l reactions. Thermal cycling consisted of denaturation for 5 minutes at 94LC, followed by 30 cycles of 30 seconds at 94&#176;C, 30 seconds at 60&#176;C and 1 minute at 72&#176;C, with a final extension of 15 minutes at 72&#176;C. PCR products were directly sequenced (RX_for; Macrogen, Seoul, South Korea). All polymorphisms were verified by eye.</p>
</sec>
<sec>
<st>
<p>
<it>In situ </it>hybridization</p>
</st>
<p>Digoxygenin-labeled riboprobes were generated from 3' RACE products for each gene (<it>homeobrain</it>, 749 nt; <it>orthopedia</it>, 1556 nt; <it>rx</it>, 1419 nt). Each probe included a portion of the homeobox, plus a region of highly divergent coding sequence downstream of the homeobox and the 3' untranslated region (UTR). Whole-mount <it>in situ </it>hybridization was performed on embryos and larvae using published protocols <abbrgrp>
<abbr bid="B38">38</abbr>
</abbrgrp>. Stringent hybridization conditions were employed (65&#176;C for 20-44 hours using probe concentrations of 1.0 ng/&#956;l).</p>
</sec>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<sec>
<st>
<p>
<it>Nematostella </it>Otp, Rx and Hbn transcripts and gene structure</p>
</st>
<p>Using RACE PCR, we cloned and annotated full transcripts for <it>otp</it>, <it>rx </it>and <it>hbn </it>from <it>Nematostella </it>(accession numbers HM004556-8). In addition, we queried the EST databases for <it>Nematostella </it>to further annotate these genes and to identify non-synonymous polymorphisms in each gene.</p>
</sec>
<sec>
<st>
<p>Otp</p>
</st>
<p>The <it>otp </it>transcript is 1901 nucleotides long, and encodes a predicted protein of 291 amino acids. When mapped to the genomic sequence, this transcript spans 9587 nucleotides, including four exons (see Additional file <supplr sid="S1">1</supplr>). We identified two putative full-length <it>otp </it>transcripts that encode different splice variants in <it>Nematostella</it>. Comparing the two splice variants of the <it>Nematostella otp </it>gene, one lacked a highly conserved transcriptional repression domain that was present in the other and that is typical of <it>otp </it>orthologs from other taxa. Neither splice variant that we have identified encodes an OAR (<it>otp</it>, <it>aristaless </it>and <it>rax</it>) domain. However, a highly conserved version of this domain is predicted to be encoded by a stretch of nucleotides that is located immediately downstream and in frame with exon 3 (see Additional file <supplr sid="S1">1</supplr>, boxed sequence). The predicted OAR motif of <it>Nematostella Otp </it>is highly conserved relative to deuterostome sequences; in fact, it is identical to both the human and sea urchin sequences at 15 of 16 residues. By contrast, in protostome <it>orthopedia </it>proteins, only the central core of the OAR domain appears to be conserved (for example, SIAALRRRA in <it>Drosophila</it>), or the OAR domain appears to have been lost entirely (for example, in <it>Aedes aegypti </it>or <it>Anopheles gambiae</it>). Given this degree of sequence conservation, we predict that <it>Nematostella </it>expresses at least one additional <it>Otp </it>splice variant that includes this OAR domain. Whereas other reported <it>orthopedia </it>proteins lack an octapeptide domain upstream of the homeodomain, the <it>Nematostella </it>gene encodes a stretch of eight residues upstream of the homeodomain that shares five out of eight residues with the octapeptide of deuterostome Rx proteins (HSIxxILx; Figure <figr fid="F1">1</figr>).</p>
<suppl id="S1">
<title>
<p>Additional file 1</p>
</title>
<text>
<p>
<b>OTP annotation</b>. Alignment of <it>Orthopedia </it>transcripts against the assembled genome. Three <it>otp </it>transcripts were mapped against scaffold_62 of the publicly available <it>Nematostella </it>genome assembly. The position relative to the scaffold is indicated to the right of the nucleotide sequence. For transcripts 1-3, identity to the genomic sequence is indicated with a full stop (.). Long introns have been truncated for clarity. Polymorphic positions are highlighted in black. We reconstructed one <it>otp </it>transcript (1) by conceptually splicing overlapping 3' and 5' RACE fragments. This transcript is 1045 nucleotides long and it maps between positions 812065 and 802467 of the scaffold. Another <it>otp </it>transcript (2) was identified among the ESTs sequenced as part of the <it>Nematostella </it>genome project (jgi|Nemev1|205678|fgenesh1_pg.scaffold_62000087). This transcript maps between positions 790736 and 812044 of the scaffold. A third <it>Otp </it>transcript (3) had been previously deposited in the EST database at NCBI <abbrgrp>
<abbr bid="B86">86</abbr>
</abbrgrp>; GenBank accession <ext-link ext-link-id="DV090169" ext-link-type="gen">DV090169</ext-link>). This transcript is only 616 nucleotides in length and it appears to be truncated at both ends. The predicted amino acid sequences are shown beneath the nucleotide sequences. Three conserved domains are indicated in bold type; the octapeptide (HSIVGILN), the 60 amino acid homeodomain and the 16 amino acid OAR domain. The OAR domain is downstream and in frame with the homeodomain, but the boxed amino acids are not encoded by any of the three <it>otp </it>transcripts we recovered.</p>
</text>
<file name="2041-9139-1-3-S1.PDF">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Rx</p>
</st>
<p>The <it>Nematostella rx </it>transcript we assembled using RACE is 1792 nucleotides long and it encodes a predicted protein 266 amino acids long (see Additional file <supplr sid="S2">2</supplr>). It extends over a 4972-nucleotide region of the genome, comprising three exons (see Additional file <supplr sid="S2">2</supplr>). There are three noteworthy regions of similarity between the <it>rx </it>proteins of <it>Nematostella </it>and bilaterians: the homeodomain, the octapeptide and the OAR domain (Figure <figr fid="F1">1</figr>). In the homeodomain, <it>Nematostella </it>is identical to both human and fruit fly at 90% of residues (54/60). Among PRD-class homeodomains, the presence of a valine at residue 43 appears to be a synapomorphy of the <it>rx </it>family, and this trait is shared by all the <it>rx </it>sequences in our dataset. The <it>rx </it>family is also characterized by arginine-alanine at positions 18-19 of the homeodomain. This is seen in all taxa except the coral <it>Acropora millepora</it>, which has a glutamine at position 18 <abbrgrp>
<abbr bid="B29">29</abbr>
<abbr bid="B32">32</abbr>
<abbr bid="B39">39</abbr>
<abbr bid="B40">40</abbr>
</abbrgrp>. The sea anemone is also identical to both human and fruit fly at six of seven residues in the octapeptide. Within the OAR domain, <it>Nematostella </it>and human are identical at 13 of 16 residues, and <it>Nematostella </it>and <it>Drosophila </it>are identical at 14 of 16 residues (Figure <figr fid="F1">1</figr>).</p>
<suppl id="S2">
<title>
<p>Additional file 2</p>
</title>
<text>
<p>
<b>RX annotation</b>. Alignment of <it>rx </it>transcripts against the assembled genome. We reconstructed one <it>rx </it>transcript (1) by conceptually splicing overlapping 3' and 5' RACE fragments (RACE). We also identified two <it>rx </it>sequences among the 150,000 ESTs generated by the Joint Genome Institute <it>Nematostella </it>sequencing project (2: 2664141-1, 3: 2664141-2) and two <it>rx </it>ESTs that were previously deposited at NCBI (4: CAGN10625, 5:CV088198). The RACE product spans nucleotides 785,552 to 790,345 of scaffold_62 in the Joint Genome Institute <it>Nematostella </it>genome assembly. Location relative to the scaffold is indicated to the right of the nucleotide sequence. The long second intron (3713 nucleotides in length) has been truncated for clarity. Polymorphic nucleotides are highlighted in black. Corresponding polymorphic amino acids are boxed. The predicted amino sequence is shown below the nucleotide sequence. Three conserved motifs are shown in bold type: the octapeptide (HSIDAILG), the 60-amino acid homeodomain and the 16-amino acid OAR motif. There are two non-silent polymorphisms within the homeodomain (K/R at position 52 and E/Q at position 59 (see Figure 5 for geographic distribution). The EST CV088198 (5) does not encode the complete OAR motif. It encodes a predicted protein (ending in a phenylalanine) that is 24 residues shorter than the predicted protein encoded by the other transcripts.</p>
</text>
<file name="2041-9139-1-3-S2.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<p>We identified four additional EST sequences for <it>Nematostella rx</it>, two that were previously deposited at the National Center for Biotechnology Information (NCBI) (CAGN10625 and DV088198) and two that were generated by the Joint Genome Institute (JGI) as part of the <it>Nematostella </it>genome sequencing initiative (2664141-1 and 2664141-2). One of the EST sequences (DV088198) does not encode the complete OAR motif, instead producing a predicted protein 24 residues shorter than the other transcripts (see Additional file <supplr sid="S2">2</supplr>). Comparison of the genome assembly, our RACE product and these four ESTs revealed 28 single-nucleotide polymorphisms, including six that result in a change to the protein sequence (Additional file <supplr sid="S2">2</supplr>). Two of the polymorphic amino acid positions reside within helix 3 of the homeodomain, the so-called 'DNA-binding helix' (Figure <figr fid="F1">1</figr>; see Additional file <supplr sid="S2">2</supplr>): an A/G polymorphism that results in an amino acid change from arginine (R) to lysine (K) at position 52, and a C/G polymorphism results in an amino acid substitution from glutamine (Q) to glutamic acid (E) at position 59, an amide to acidic amino acid. Both the possession of lysine at position 52 and glutamic acid at position 59 are unique within the <it>Rx </it>family (Figure <figr fid="F1">1</figr>). In fact, the possession of a lysine at position 52 is very unusual for PRD-class homeodomains in general; of the 111 PRD-class homeodomains included in our phylogenetic analysis, the only other sequence with a lysine at this position is the <it>Nematostella DMBXd </it>homeodomain. Over the entire PRD class, the possession of glutamic acid at position 59 is not as uncommon. The Pax4/6 family exhibits glutamic acid at this position, as do three other <it>Nematostella </it>PRD-class homeodomains (DMBXb, NVHD_101 and NVHD_148).</p>
<p>To verify the existence of the two nonsynonymous polymorphisms identified in EST sequences (see above) in natural populations and to begin characterizing their geographic distribution, we sequenced a fragment of the <it>rx </it>gene from 95 individual animals collected throughout the range of the species (see Additional file <supplr sid="S3">3</supplr>). At position 52, the phylogenetically unusual lysine variant is relatively rare, accounting for only 15.16% of all alleles, but it does exhibit a broad geographic distribution, being found in seven different estuaries from both the Atlantic and Pacific coasts of North America. However, despite the wide distribution of the K allele, KK homozygotes were only recovered in a single location, Kingsport in Nova Scotia. The apparently limited geographic distribution of this genotype is interesting considering that KR heterozygotes accounted for more than a quarter of all individual animals assayed (RR = 71.58%, RK = 26.16%, KK = 2.11%). At position 59, the glutamine variant accounted for 96.28% of all alleles. Despite its overall rarity (3.72%), the glutamic acid variant was recovered in four widely separated estuaries in New Jersey, Maryland, California and Washington, although no EE homozygotes were recovered in any individual (QQ = 92.55%, QE = 7.45%, EE = 0.00%).</p>
<suppl id="S3">
<title>
<p>Additional file 3</p>
</title>
<text>
<p>
<b>RX polymorphism map</b>. Geographic distribution of <it>rx </it>polymorphisms. In total, 95 individual animals were successfully genotyped at each of the two polymorphic positions in the <it>Rx </it>homeodomain from 24 estuaries. Collection sites were as follows: 1, Spurwink River, ME; 2, Odiorne Point, NH; 3, Rye Harbor, NH; 4, Wallis Sands, NH; 5, Old Town Hill, MA; 6, Crane Reserve, MA; 7, Neponset River, MA; 8, Pocasset River, MA; 9, Sippewissett Marsh, MA; 10, Clinton, CT; 11, Kingsport, Nova Scotia; 12, Halifax, NS; 13, Meadowlands, NJ; 14, Rhodes River, MD: 15, Baruch, SC; 16, San Juan Island, WA; 17, Willapa Bay, WA; 18, Coos Bay, OR; 19, Humboldt, CA; 20, Bodega Bay, CA; 21, Tomales Bay, CA; 22, Fort Gillkicker Lagoon, UK; 23, Salterns, UK; 24, Half Moon Lagoon, UK. The overall genotypic frequencies are: position 52: KK = 71.58%, KR = 26.16%, RR = 2.11%; position 59: QQ = 92.55%, QE = 7.45%, EE = 0.00%.</p>
</text>
<file name="2041-9139-1-3-S3.TIFF">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Hbn</p>
</st>
<p>The <it>Nematostella homeobrain </it>transcript that we assembled using RACE encompasses three exons (see Additional file <supplr sid="S4">4</supplr>). Similar to <it>otp </it>and <it>rx</it>, an intron interrupts the homeodomain between positions 46 and 47. The third exon, 647 nucleotides in length, encodes the final 14 amino acids of the homeodomain plus an OAR motif, which is located near the carboxy terminus of the predicted protein. The <it>Nematostella homeobrain </it>protein has high similarity to insect orthologs within the homeodomain region but very little similarity could be detected elsewhere. For example, the homeodomain is identical to its <it>Drosophila </it>ortholog at 95% of residues (57/60). Interestingly, the sequence identity between <it>Nematostella </it>and <it>Drosophila </it>exceeds the sequence identity between <it>Nematostella </it>and its fellow cnidarian, <it>Hydra magnipapillata </it>(83%; 53 of 60 identical residues). This suggests that the <it>Hydra </it>homeodomain has evolved relatively quickly.</p>
<suppl id="S4">
<title>
<p>Additional file 4</p>
</title>
<text>
<p>
<b>HBN annotation</b>. Annotated <it>Nematostella Homeobrain </it>locus. We reconstructed one <it>hbn </it>transcript (1) by conceptually splicing overlapping 3' and 5' RACE fragments (RACE). We also identified three <it>Hb </it>ESTs that were previously deposited at NCBI (2: DV0879878, 3: DV084683; 4: DV086666). The transcript obtained by RACE is 1139 nucleotides long and comprises three exons, which collectively span nucleotide positions 777,772 to 782,115 of scaffold_62 in the Joint Genome Institute <it>Nematostella </it>genome assembly. The position relative to the scaffold is indicated to the right of the nucleotide sequence. The predicted amino acids are shown below the nucleotides that encode them. Polymorphic nucleotides are highlighted in black. Corresponding polymorphic amino acids are boxed. Long introns have been truncated for clarity. Three conserved protein motifs are shown in bold type; the octapeptide (YTIDMILG), the 60-amino acid homeodomain and the 16-amino acid OAR domain.</p>
</text>
<file name="2041-9139-1-3-S4.PDF">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Phylogenetic analysis</p>
</st>
<p>The maximum-likelihood (ML) and neighbor-joining (NJ) analyses performed here reinforce the conclusions of Ryan <it>et al. </it>
<abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp>: <it>Nematostella hbn</it>, <it>otp </it>and <it>rx </it>sequences group with putative orthologs from bilaterian taxa with a moderate to high level of bootstrap support (Figure <figr fid="F2">2</figr>). The <it>Nematostella hbn </it>homeodomain is nested within a clade that also includes <it>hbn </it>sequences from multiple animals including protostomes (<it>for example</it>, <it>Capitella </it>and <it>Drosophila</it>), <it>Trichoplax </it>and two deuterostomes (<it>Strongylocentrotus </it>and <it>Saccoglossus</it>). Bootstrap support for this clade is moderate (ML = 77, NJ = 64). We were unable to recover a clear <it>hbn </it>ortholog from any chordate model system <abbrgrp>
<abbr bid="B41">41</abbr>
</abbrgrp>
. Our phylogenetic analyses recovered a monophyletic grouping for predicted <it>rx </it>orthologs from all species sampled except the most basal member queried, the placozoan <it>Trichoplax</it>. Similar to the <it>hbn </it>clade, bootstrap support was moderate for <it>rx </it>orthologs (ML = 43, NJ = 80). Finally, we identified putative <it>otp </it>orthologs from all species expect the nematode <it>C. elegans </it>(ML = 66, NJ = 98).</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Neighbor-joining phylogeny of <it>PRD </it>class genes based on homeodomain sequences</p></caption><text>
   <p><b>Neighbor-joining phylogeny of <it>PRD </it>class genes based on homeodomain sequences</b>. Pairwise distances between homeodomains were calculated using the PAM matrix in the computer program protdist in the Phylip package (Phylogenetic Inference Package, V.3.61). Abbreviations for taxa are as in Figure 1. Numbers above nodes indicate the percentage of 1000 bootstrap replicates in which the given node was recovered in a neighbor-joining analysis. The tree is rooted using <it>Hox1 </it>and <it>distalless </it>homeodomains from the lancelet and the fruit fly.</p>
</text><graphic file="2041-9139-1-3-2"/></fig>
</sec>
<sec>
<st>
<p>The Homeobrain-Rax-Orthopedia cluster</p>
</st>
<p>The <it>Nematostella hbn</it>, <it>otp </it>and <it>rx </it>transcripts map to a single scaffold in the publicly available assembly of the <it>Nematostella </it>genome (jgi scaffold 62; DOE-JGI <it>Nematostella vectensis </it>genome assembly V.1.0; <url>http://genome.jgi-psf.org/Nemve1/Nemve1.home.html</url>). The entire scaffold is 1,036,593 nucleotides long, and the cluster is located roughly 225,000 nucleotides from the nearest end of the scaffold. The <it>rx </it>locus intervenes between the <it>otp </it>and <it>hbn </it>loci. <it>Otp </it>and <it>rx </it>are encoded on one strand, whereas <it>hbn </it>is encoded on the opposite strand (Figure <figr fid="F3">3</figr>). The entire three-gene cluster spans 34,246 nucleotides (from the first nucleotide in the predicted 5' exon of <it>orthopedia </it>to the first nucleotide in the predicted 5' exon of <it>homeobrain</it>). The distance between the predicted stop codon of <it>hbn </it>and the predicted stop codon of <it>rx </it>is only 3,437 nucleotides (Figure <figr fid="F3">3</figr>). The intergenic distance between <it>otp </it>and the predicted transcription start site of <it>rx </it>is 12,158 nucleotides. Using BLASTx we searched for predicted genes positioned in the intergenic regions within the <it>otp-rx-hbn </it>cluster. No significant hits were recovered in the smaller intergenic region between <it>rx </it>and <it>hbn </it>(E &lt; 10). We identified one position of moderate similarity to other proteins in the region between <it>otp </it>and <it>rx </it>(~ 1 &#215; 10<sup>-6</sup>). Upon closer inspection there were no consistent proteins, regions or domains identified from these results, and the hits appeared as spurious matches. Based on the current assembly of the <it>Nematostella </it>genome, the nearest annotated genes bracketing the cluster are a predicted ferredoxin (JGI: 101894) located 16.6 kb from <it>otp </it>and a predicted gene similar to RIKEN (JGI: 242384) located 5.8 kb from <it>hbn</it>.</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Organization of the <it>homeobrain-rx-orthopedia </it>cluster of <it>Nematostella</it></p></caption><text>
   <p><b>Organization of the <it>homeobrain-rx-orthopedia </it>cluster of <it>Nematostella</it></b>. <it>Homeobrain </it>or<it>thopedia </it>and <it>rx </it>ESTs map to a ~35 kb region within a single scaffold 1,036,593 nucleotides long in the Joint Genome Institute's assembly of the <it>Nematostella </it>genome (scaffold 062, Joint Genome Institute <it>Nematostella vectensis </it>genome portal, V.1.0). Arrows above the cluster indicate the transcriptional orientation of each gene. The nucleotide length of each intron, exon and intergenic region is indicated below the cluster.</p>
</text><graphic file="2041-9139-1-3-3"/></fig>
<p>The genomic organization of the <it>Drosophila homeobrain </it>cluster has been described previously <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp>. The fruit fly cluster exhibits the same relative gene order as the <it>Nematostella </it>cluster, but the intergenic distances are greater and the transcriptional orientation of <it>otp </it>is reversed. The intergenic distance between <it>hbn </it>and <it>rx </it>is approximately 17.0 kb in the <it>Drosophila </it>cluster, whereas the intergenic distance between <it>rx </it>and <it>otp </it>is approximately 21.0 kb (Figure <figr fid="F4">4</figr>).</p>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>Phylogenetic distribution of the <it>Otp</it>-<it>Rx</it>-<it>Hbn </it>cluster</p></caption><text>
   <p><b>Phylogenetic distribution of the <it>Otp</it>-<it>Rx</it>-<it>Hbn </it>cluster</b>. Several sequenced metazoan genomes were searched for evidence of linkage between <it>orthopedia</it>, <it>rx </it>and <it>homeobrain</it>. The genomic organization of the fruit fly homeobrain related gene was previously reported identified by Walldorf <it>et al. </it><abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Predicted linkage of these three genes was determined for insects (mosquito, honeybee, flour beetle), chordates (vertebrates, amphioxus, ascidian), sea urchin, two lophotrochozoans (limpet, polychaete annelid), sea anemone and placozoan by querying the available genome sequences using the annotated/predicted gene sequences for each gene (this study). Solid lines between genes represent linkage on a single chromosome, scaffold or contig. Approximate distance (kb between genes) is noted above each line, and the relative transcriptional orientation of each gene is indicated by the direction of the arrow (the arrow points from 5' to 3'). Question marks indicate uncertain linkage. Genes confirmed to be on different chromosomes or genomic scaffolds are represented by squares with no interconnecting lines.</p>
</text><graphic file="2041-9139-1-3-4"/></fig>
<p>Other animal genomes in which <it>otp</it>, <it>rx </it>and <it>hbn </it>genes have been identified or predicted were analyzed for evidence of gene linkage. A <it>homeobrain </it>cluster was identified in the sequenced genomes of three additional insects: <it>Anopheles gambiae </it>(mosquito), <it>Apis mellifera </it>(honeybee) and <it>Tribolium castaneum </it>(flour beetle). In honeybee and flour beetle, the cluster organization is similar to that of <it>Drosophila</it>: the clusters have the same gene order (<it>otp</it>-<it>rx</it>-<it>hbn</it>), the genes exhibit the same transcriptional orientations and the intergenic distances are comparable (Figure <figr fid="F4">4</figr>). In the mosquito cluster, the gene order and scale are different, with <it>rx </it>linked to <it>otp </it>across a span of approximately 109 kb, and <it>otp </it>linked to <it>hbn </it>across a span of approximately 386 kb. In addition, there are a number of genes located in the intergenic spaces between these Paired class genes. For example, a predicted ATP-synthase-like gene is located between <it>rx </it>and <it>otp</it>, and a calcium/calmodulin protein kinase is located between <it>otp </it>and <it>hbn</it>. We also identified the three-member <it>homeobrain </it>cluster in the limpet <it>Lottia</it>. Interestingly, the gene order (<it>otp</it>-<it>hbn</it>-<it>rx</it>) differs from the insects and <it>Nematostella</it>, suggesting that an inversion occurred in the lophotrochozoan lineage. For the annelid <it>Capitella </it>we were only able to confirm linkage of <it>otp</it>-<it>hbn</it>, which has a comparable intergenic distance and the same transcriptional orientation as <it>Lottia</it>, but <it>rx </it>was located on a different scaffold in the current assembly, precluding a confident assessment of linkage. <it>Rx </it>is located near the end of the scaffold; however, there is one annotated gene (globin precursor, JGI 21023) between <it>rx </it>and the end of scaffold. Furthermore, if the orientation of the <it>Capitella </it>cluster is the same as that for <it>Lottia </it>(the most closely related species in our analysis), there would be &gt; 400 kb of sequence separating these genes. We identified a number of genes within 40 kb of <it>hbn</it>, none of which were <it>rx </it>or any homeobox gene.</p>
<p>The sea urchin <it>Strongylocentrotus purpuratus </it>and the hemichordate <it>Saccoglossus kowalevskii </it>are the only deuterostomes in which we could identify a <it>homeobrain </it>gene. Because a public assembly of the <it>Saccoglossus </it>genome is not yet available, we were unable to assess linkage and can only report that orthologs for all three members of the cluster are present. In the sea urchin, <it>hbn </it>and <it>rx </it>are closely linked on the same genomic scaffold (Scaffold_v2_14510, length 454945 nt). The two genes are separated by a distance of approximately 22 kb. This intergenic distance is similar to that observed in <it>Drosophila</it>, <it>Apis</it>, <it>Tribolium </it>and <it>Lottia</it>. However, unlike these protostomes, in the urchin, both genes are encoded on the same strand of DNA. <it>Otp </it>maps to a different scaffold (Scaffold_v2_10421, length 387609 nt). Because <it>otp </it>resides on a different scaffold, evidence of linkage to <it>hbn </it>and <it>rx </it>could not be established. However, by taking into account the distances flanking <it>otp </it>on its scaffold and <it>hbn </it>and <it>rx </it>on their scaffold, <it>otp </it>can be no closer than 437,964 nucleotides from <it>hbn </it>and 193,112 nucleotides from <it>rx</it>.</p>
<p>We could not identify a <it>homeobrain </it>locus in the genomes of the urochordate <it>Ciona</it>, the cephalochordate <it>Branchiostoma </it>or any of the vertebrate taxa we queried (<it>Homo</it>, <it>Mus</it>, <it>Rattus</it>, <it>Gallus</it>, <it>Xenopus </it>and <it>Danio</it>). In all of the vertebrates and <it>Ciona</it>, <it>otp </it>and <it>rx </it>are located on different chromosomes, so no remnant of the <it>hbn</it>-<it>rx</it>-<it>otp </it>cluster remains. In the current <it>Branchiostoma </it>assembly, the <it>otp </it>and <it>rx </it>homeodomains are separated by ~162 kb on the same genomic scaffold. There are seven annotated genes within the intergenic region (JGI: 102897, 136227, 286088, 102900, 130204, 102902 and 270692) with diverse functions (for example, SCAMP (secretory carrier membrane protein), N-acetyltransferase, tubulin chaperone).</p>
<p>We identified orthologs of <it>otp </it>and <it>hbn</it>, but not <it>rx</it>, in the placozoan <it>Trichoplax</it>. Both of these genes are located on the same scaffold, are separated by approximately 15 kb and have the same transcriptional orientation. Because <it>Trichoplax </it>represents the earliest diverging taxa in our study, the close proximity of <it>otp </it>and <it>hbn </it>strongly suggests that these genes have been clustered since very early in animal evolution.</p>
</sec>
<sec>
<st>
<p>Developmental expression of otp, rx and homeobrain <it>Orthopedia</it>
</p>
</st>
<p>Whole-mount <it>in situ </it>hybridization using an antisense <it>NvOtp </it>riboprobe of 1,556 bp revealed that expression begins in mid planula stages (Figure <figr fid="F5">5e</figr>), well after gastrulation is complete. Expression is confined to the oral pole and the oral end of the pharyngeal ectoderm. Expression appears strongest at the oral opening but extends into the pharyngeal ectoderm. Expression is also apparent in scattered cells in the body wall ectoderm, which appear to be neurons based on morphological criteria, namely their position deep within the ectoderm and their basally located nuclei (Figure <figr fid="F5">5f</figr>, Figure <figr fid="F5">5g</figr>) <abbrgrp>
<abbr bid="B42">42</abbr>
</abbrgrp>. Later, <it>Otp </it>expression is also seen in the tentacle ectoderm (Figure <figr fid="F5">5h</figr>, Figure <figr fid="F5">5i</figr>). Oral expression persists well into polyp stages (Figure <figr fid="F5">5j</figr>, Figure <figr fid="F5">5k</figr>). The probe used for <it>in situ </it>hybridization overlaps a substantial portion (&lt; 300 bp) of all three predicted splice variants, so this probe would presumably bind to all three.</p>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>Expression of <it>NvOtp</it></p></caption><text>
   <p><b>Expression of <it>NvOtp</it></b>. <it>In situ </it>hybridization using a 1556 bp probe of <it>otp</it>. In all panels, the asterisk denotes the blastopore or oral end. <b>(a) </b>Expression is not detected in the egg, <b>(b) </b>during early cleavage or <b>(c,d) </b>during gastrulation. <b>(e,f) </b>Expression is detected in the planula stage in the ectoderm around the pharynx (ph) and in single ectodermal cells (arrows). <b>(g) </b>Oral view of the same planula from (f). <b>(h,i) </b>Similar expression in later stages as tentacles (t) begin to grow. <b>(j,k) </b>Polyp stages showing high expression in an oral ectodermal ring.</p>
</text><graphic file="2041-9139-1-3-5"/></fig>
</sec>
<sec>
<st>
<p>
<it>Rx</it>
</p>
</st>
<p>Expression of <it>Rx </it>begins before <it>otp </it>expression, during mid gastrulation (Figure <figr fid="F6">6c</figr>) <it>Rx </it>is initially expressed in the aboral hemisphere (Figure <figr fid="F6">6d</figr>), but by the end of gastrulation, expression becomes refined to a band that encircles the aboral side of the midbody but is excluded from the aboral pole, including the position of the presumptive apical tuft. Within this band, expression is spotty; only a fraction of the cells within the boundaries of the expression domain actually express <it>Rx</it>. As with <it>otp </it>and <it>hbn</it>, these <it>rx</it>-expressing cells in the ectoderm have the morphological appearance of neurons (Figure <figr fid="F6">6f</figr>, Figure <figr fid="F6">6i</figr>). <it>Rx </it>expression persists in this aboral band throughout planula and polyp stages (Figure <figr fid="F6">6g-j</figr>). During adult polyp stages, a second domain of expression is seen in individual ectodermal cells in the middle of each tentacle (Figure <figr fid="F6">6</figr>).</p>
<fig id="F6"><title><p>Figure 6</p></title><caption><p>Expression of <it>NvRx</it></p></caption><text>
   <p><b>Expression of <it>NvRx</it></b>. <it>In situ </it>hybridization using a 1419 bp probe of <it>rx</it>. The asterisks denote the blastopore or oral end. <b>(a,b) </b>No expression is detected in the egg or early cleavage stages. <b>(c,d) </b>Mid gastrula stages showing expression around the aboral ectoderm. <b>(e</b>) Late gastrula stage showing a band of expression towards the aboral pole. <b>(f) </b>The same embryo in (e) as viewed from the aboral pole, showing a lack of expression where the future apical tuft (at) will form. <b>(g-j) </b>Planula stages, with (i) showing a surface view of the patchy expression. <b>(k) </b>Polyp stage, showing a close-up view of expression in the ectoderm of the tentacles (t).</p>
</text><graphic file="2041-9139-1-3-6"/></fig>
</sec>
<sec>
<st>
<p>
<it>Homeobrain</it>
</p>
</st>
<p>
<it>Homeobrain </it>is first detected in the late blastula, when it is expressed throughout most of the blastoderm except in the region surrounding the blastopore (the presumptive gastrodermis; Figure <figr fid="F7">7c,d</figr>). With the onset of gastrulation, expression becomes excluded from the aboral pole, where the apical tuft of sensory cilia will form (Figure <figr fid="F7">7d,e</figr>). As gastrulation proceeds, expression persists in populations of ectodermal cells around the oral pole and later in those that encircle the base of each developing tentacle (Figure <figr fid="F7">7g,h</figr>). Expression is also seen in individual cells that are scattered throughout the mid body (Figure <figr fid="F7">7g</figr>). These cells have the morphological appearance of neurons. During polyp stages, expression is confined primarily to the ectoderm at the base of the tentacles (Figure <figr fid="F7">7i-k</figr>).</p>
<fig id="F7"><title><p>Figure 7</p></title><caption><p>Expression of <it>NvHbn</it></p></caption><text>
   <p><b>Expression of <it>NvHbn</it></b>. <it>In situ </it>hybridization using a 749 bp probe of <it>hbn</it>. The asterisks mark the blastopore or oral end. <b>(a,b) </b>Egg and early cleavage stages showing a lack of expression. <b>(c) </b>Blastula stage embryo showing expression everywhere except in the blastopore. <b>(d) </b>Mid gastrula showing lack of expression in the invaginating endoderm (en). <b>(e,f) </b>Lateral and oral views of a late gastrula, stage embryo. <b>(g) </b>Planula stage, showing strong expression around the future tentacles (arrow) and in the putative neurons (n) of the mid body. <b>(h,i) </b>Early polyp stage showing expression around the base of the tentacles (t). <b>(j,k) </b>Older polyp stages, where expression is confined to the base of the tentacles.</p>
</text><graphic file="2041-9139-1-3-7"/></fig>
</sec>
</sec>
<sec>
<st>
<p>Discussion</p>
</st>
<sec>
<st>
<p>The functional evolution of hbn, rx and otp</p>
</st>
<p>
<it>Homeobrain </it>was originally identified in <it>Drosophila </it>
<abbrgrp>
<abbr bid="B43">43</abbr>
</abbrgrp>, and mapped to a region of chromosome 2 that contained two additional PRD-class homeobox genes, <it>Orthopedia </it>and <it>rx </it>
<abbrgrp>
<abbr bid="B44">44</abbr>
<abbr bid="B45">45</abbr>
</abbrgrp>. <it>Homeobrain </it>is expressed in the fly embryonic brain and ventral nerve cord <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp>. Recently the first lophotrochozan homeobrain-like gene was reported <abbrgrp>
<abbr bid="B41">41</abbr>
</abbrgrp>, and found to be expressed in a restricted region of the anterior brain. <it>Hbn </it>expression has also been reported in the sea urchin, where it is expressed in oral ganglia of the animal pole <abbrgrp>
<abbr bid="B46">46</abbr>
</abbrgrp>.</p>
<p>
<it>Otp </it>is associated with neural development in a phylogenetically diverse collection of animals, and this may represent an ancestral role for this homeodomain family. <it>Orthopedia</it>-related genes in mouse are involved in brain patterning and development <abbrgrp>
<abbr bid="B47">47</abbr>
<abbr bid="B48">48</abbr>
<abbr bid="B49">49</abbr>
<abbr bid="B50">50</abbr>
<abbr bid="B51">51</abbr>
<abbr bid="B52">52</abbr>
</abbrgrp>. In the hemichordate <it>Saccoglossus</it>, <it>otp </it>has punctate, ectodermal expression in neural domains of the prosome (proboscis) and mesocome (collar) <abbrgrp>
<abbr bid="B53">53</abbr>
</abbrgrp>. In planaria, <it>orthopedia </it>is implicated in patterning the branch structure of the brain <abbrgrp>
<abbr bid="B54">54</abbr>
<abbr bid="B55">55</abbr>
</abbrgrp>. In the limpet, <it>otp </it>is involved in the development of the larval apical sensory organ <abbrgrp>
<abbr bid="B56">56</abbr>
</abbrgrp>, and in flies it is involved in the developing CNS and hindgut and anal pads <abbrgrp>
<abbr bid="B51">51</abbr>
</abbrgrp>. However, a connection to neural development is not obvious in the sea urchin, where <it>otp </it>appears to be involved in larval skeletal morphogenesis and the establishment of oral ectodermal cell fate <abbrgrp>
<abbr bid="B57">57</abbr>
<abbr bid="B58">58</abbr>
<abbr bid="B59">59</abbr>
<abbr bid="B60">60</abbr>
<abbr bid="B61">61</abbr>
</abbrgrp>.</p>
<p>
<it>Rx </it>(retinal associated homeobox) genes have been extensively studied in many deuterostome taxa including human, mouse, <it>Xenopus</it>, chicken, zebrafish, medaka and tunicate <abbrgrp>
<abbr bid="B45">45</abbr>
<abbr bid="B62">62</abbr>
<abbr bid="B63">63</abbr>
<abbr bid="B64">64</abbr>
<abbr bid="B65">65</abbr>
<abbr bid="B66">66</abbr>
<abbr bid="B67">67</abbr>
<abbr bid="B68">68</abbr>
<abbr bid="B69">69</abbr>
<abbr bid="B70">70</abbr>
</abbrgrp>. In all of these organisms, <it>rx </it>is involved in brain development and the formation of retinal territories and associated neural structures. In the hemichordate, <it>rx </it>is expressed throughout the prosome ectoderm, the most anterior region of this organism, but absent from the most apical pole <abbrgrp>
<abbr bid="B53">53</abbr>
</abbrgrp>. <it>Rx </it>is expressed more broadly during sea urchin development, with punctate expression in the animal pole and developing gut <abbrgrp>
<abbr bid="B46">46</abbr>
</abbrgrp>. A fruit fly <it>rx </it>gene has also been identified, and studies suggest that it is necessary for proper brain development, but it is not required for eye development <abbrgrp>
<abbr bid="B45">45</abbr>
<abbr bid="B71">71</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Alternative splice variants in orthopedia and rx</p>
</st>
<p>In addition to a homeodomain and a PRD domain, two additional motifs have been identified in some PRD-like genes. The octapeptide motif <abbrgrp>
<abbr bid="B72">72</abbr>
<abbr bid="B73">73</abbr>
</abbrgrp>, located towards the N-terminus, is involved in transcriptional repression <abbrgrp>
<abbr bid="B74">74</abbr>
</abbrgrp>. The OAR domain, found in <it>otp</it>, <it>aristaless </it>and <it>rx </it>
<abbrgrp>
<abbr bid="B40">40</abbr>
<abbr bid="B51">51</abbr>
<abbr bid="B67">67</abbr>
<abbr bid="B75">75</abbr>
</abbrgrp> is typically located at the C-terminus and is known to function as a transactivator in <it>Otp </it>
<abbrgrp>
<abbr bid="B40">40</abbr>
<abbr bid="B51">51</abbr>
</abbrgrp>.</p>
<p>We have identified alternative transcripts for <it>Nematostella orthopedia </it>and <it>rx</it>. For rx, some transcripts encode a highly conserved OAR domain and one transcript does not. For <it>orthopedia</it>, none of our RACE products nor any ESTs in publicly available databases included the OAR domain. However, a well-conserved OAR domain was identified downstream and in-frame of these sequences in predicted genomic gene models, suggesting that this domain is probably expressed in an as yet unidentified splice variant. The OAR domain has been described as an intramolecular switch, which acts to reduce the affinity of the homeodomain transcription factor for its binding site. In the mouse, ectopically expressed mutant forms of the <it>Alx3 </it>and <it>Cart1 </it>proteins lacking the OAR domain exhibit increased binding to their DNA targets <abbrgrp>
<abbr bid="B76">76</abbr>
</abbrgrp>.</p>
<p>Alternative splice variants involving the presence or absence of an OAR domain have also been identified the mouse <it>prx1 </it>gene, a member of the PMX family of PRD-class homeobox genes <abbrgrp>
<abbr bid="B77">77</abbr>
</abbrgrp>. The carboxy terminus of the <it>Prx1a </it>protein includes an activation domain and an OAR domain, whereas the carboxy terminus of <it>Prx1b </it>encodes a repression domain and lacks an OAR domain <abbrgrp>
<abbr bid="B77">77</abbr>
<abbr bid="B78">78</abbr>
</abbrgrp>. The tissue distribution of both transcripts appears to be similar in mice and humans, but different tissues exhibit pronounced differences in the relative ratios of <it>prx1a </it>and <it>prx1b </it>
<abbrgrp>
<abbr bid="B79">79</abbr>
<abbr bid="B80">80</abbr>
</abbrgrp>. It has been hypothesized that the presence of the OAR domain in <it>prx1a </it>could render it sensitive to modulation via an unidentified partner protein that interacts with the OAR domain itself. In the absence of this cofactor, the OAR domain masks the activation domain and reduces the affinity of <it>prx1a </it>for DNA binding sites. When bound by its co-factor, the activation domain becomes unmasked and the DNA binding affinity increases <abbrgrp>
<abbr bid="B76">76</abbr>
<abbr bid="B77">77</abbr>
<abbr bid="B81">81</abbr>
</abbrgrp>. In the case of <it>Nematostella rx</it>, the situation may be somewhat simpler than in the mouse <it>prx1 </it>gene, because except for the presence or absence of the OAR domain itself, the alternative splice variants encode essentially identical proteins.</p>
</sec>
<sec>
<st>
<p>Functional inferences about <it>Nematostella </it>otp, rx and hbn based on expression data</p>
</st>
<p>In the most famous example of a conserved homeobox cluster, the Hox genes, the spatial ordering of Hox expression territories along the body's main axis and the timing of their onset mirrors the physical ordering of linked Hox genes (although not in all taxa). This correspondence is termed colinearity. However, in animals with dispersed Hox clusters, such as the urochordate <it>Oikopleura dioica</it>, some spatial colinearity remains whereas temporal colinearity is absent <abbrgrp>
<abbr bid="B82">82</abbr>
</abbrgrp>. This suggests that it is temporal rather than spatial colinearity that is driving the maintenance of these clusters. Although work has been performed in studying temporal colinearity in Hox, ParaHox and NK clusters, the homeobrain cluster could prove to be another supporting example.</p>
<p>In <it>Nematostella</it>, <it>hbn, rx and otp </it>appear to be expressed in a temporally colinear pattern. <it>Hbn </it>is expressed first, in the blastula stages, followed by <it>rx </it>at mid gastrulation and finally <it>otp </it>in the planula. However, in the case of the <it>Drosophila homeobrain </it>cluster, there is no clear evidence of temporal colinearity. Homeobrain is expressed first, in the syncytial blastoderm, then <it>otp </it>is expressed slightly before <it>rx </it>
<abbrgrp>
<abbr bid="B31">31</abbr>
<abbr bid="B51">51</abbr>
<abbr bid="B65">65</abbr>
</abbrgrp>. Future studies in other animals will help determine whether temporal colinearity is widely conserved among homeobrain clusters.</p>
<p>Although the expression of <it>Nematostella hbn</it>, <it>rx </it>and <it>otp </it>is consistent with temporal colinearity, it is not consistent with spatial colinearity. The three genes are expressed in non-overlapping domains along the oral-aboral axis; however, these domains do not appear to be related to their position in the cluster. <it>NvRx </it>is expressed at the most aboral domain, although expression is not present at the aboral pole, where the apical tuft will form. <it>NvHbn </it>is more broadly expressed in oral ectoderm during early embryogenesis and becomes confined to the most oral ectoderm, mainly around the base of the tentacles. Finally, <it>NvOtp </it>is also expressed in the oral ectoderm, in the domain that will invaginate and form the pharynx. <it>NvOtp </it>and <it>NvHbn </it>are both expressed in close proximity to three paralogs of <it>Otx </it>(<it>NvOtxA, B, C</it>) in the pharyngeal ectoderm and tentacles surrounding the oral pole <abbrgrp>
<abbr bid="B30">30</abbr>
</abbrgrp>.</p>
<p>In addition to the broad non-overlapping domains, all these genes are also expressed in individual cells throughout the body column. Based on their cell morphology, it appears that these cells may be neurons. Additionally, the expression of <it>NvOtp </it>and <it>NvOtx(A,B,C) </it>in the oral ectoderm coincides with formation of the oral nerve ring <abbrgrp>
<abbr bid="B30">30</abbr>
<abbr bid="B42">42</abbr>
</abbrgrp>. Considering the function of these genes in other animals and the expression patterns seen here, it is likely that <it>hbn, rx and otp </it>play some role in neural development in <it>Nematostella</it>. However, it is interesting that there is no expression in the apical tuft, another strongly neurogenic region in <it>Nematostella</it>.</p>
</sec>
<sec>
<st>
<p>The evolutionary history of the homeobrain cluster</p>
</st>
<p>The literature on homeobox clusters would suggest that the history of the ANTP class has been qualitatively different from that of the histories of other classes of homeobox genes. The most intensively studied and widely conserved homeobox clusters are all composed of ANTP-class homeobox genes (the Antennapedia complex, the Bithorax complex, the Hox cluster, the ParaHox cluster, the NK cluster and the EGH box cluster). Ultimately, all of these ANTP-class clusters may derive from a single ancestral cluster. For example, the ANTP-C and BX-C of <it>Drosophila </it>are clearly derived from a single ancestral Hox cluster that is widely shared by protostomes and deuterostomes, and in simpler form, by cnidarians. The Hox cluster, in turn, appears to have broken off from an ancestral 'mega-Hox' or 'extended Hox' cluster that at one time may have encompassed the Hox cluster, the ParaHox cluster, the EGH cluster and the NK cluster. Over time, the hypothetical ancestral cluster appears to have fragmented, and different remnants of this ancestral cluster may be more highly conserved in different animal lineages <abbrgrp>
<abbr bid="B83">83</abbr>
</abbrgrp>.</p>
<p>This study is the first to provide evidence that a non-ANTP-class homeobox cluster was conserved over hundreds of millions of years of animal evolution. Clearly, the <it>hbn</it>-<it>rx</it>-<it>otp </it>cluster has been fairly well conserved over the evolutionary history of holometabolous insects. A cluster with the same constituent genes, in the same orientation, spanning a comparable distance is found in representatives of three different orders of insects (Coleoptera, Diptera and Hymenoptera; Figure <figr fid="F4">4</figr>). The cluster also appears to date to the ancestral Protostome, as evidenced by conservation of the three-gene cluster in the limpet <it>Lottia</it>, although the order of the genes has changed. In addition, the cluster was also likely to have been present in the cnidarian-bilaterian common ancestor, some 600 million years ago. The cluster in <it>Nematostella </it>involves the same closely linked genes in the same order as the inferred ancestral cluster of holometabolous insects, but the <it>otp </it>locus has been inverted. Similarly, in four other key taxa (cephalochordate, sea urchin, annelid and placozoan) we find evidence of a partial cluster, in which two of the three genes are linked. Further sequencing and assemblies of the echinoderm and hemichordate genomes will reveal the extent of the <it>hbn</it>-<it>rx</it>-<it>otp </it>cluster in the deuterostome ancestor. The placozoan genomic data suggests that a portion of this cluster (<it>hbn </it>and <it>otp</it>) dates back even further to the ancestral eumetazoan. However, it remains to be seen whether <it>rx </it>was present in this ancestor and lost in the placozoan lineage, or whether <it>rx </it>evolved after the split.</p>
<p>Mechanistically, it is easier to envision how a cluster of three closely related genes might remain linked than to envision how three closely related genes, if already dispersed, could independently become so closely juxtaposed in multiple taxa. If the genes arose by tandem duplication, the cluster would have originated as a result of the gene duplication; that is, the starting point would have been a cluster. Subsequently, the cluster could have been maintained over hundreds of millions of years of evolution in multiple animal lineages by stabilizing selection.</p>
<p>Closely linked genes will tend to reside in the same chromosomal territories, and they may come under the influence of shared regulatory elements. For this reason, the proper regulation of linked genes may be related to their physical proximity in the genome. This is the general explanation for why <it>Hox </it>genes have remained clustered for hundreds of millions of years in many animals that have been examined. In the case of <it>Hox </it>genes, the spatial ordering of <it>Hox </it>expression territories along the body's main axis mirrors the physical ordering of linked <it>Hox </it>genes along the chromosome. This correspondence is termed colinearity. In the anterior CNS, <it>rx</it>, <it>hbn </it>and <it>otp </it>are expressed in nested territories, which is somewhat reminiscent of <it>Hox </it>genes. Future bioinformatics studies could test for conserved regulatory elements within the <it>hbn</it>-<it>rx</it>-<it>otp </it>clusters of fly, honeybee and flour beetle. The functionality of these putative enhancer-binding sites could then be studied experimentally. In addition, the effect of cluster disruption can be examined experimentally in <it>Drosophila </it>and in mice, as it has been with Hox genes <abbrgrp>
<abbr bid="B84">84</abbr>
<abbr bid="B85">85</abbr>
</abbrgrp>. It may also prove very informative to use evolutionary comparisons of taxa with intact and disrupted clusters to investigate the consequences of cluster disruption, as has recently been carried out for the ParaHox cluster <abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>The loss of homeobrain in chordates</p>
</st>
<p>The <it>homeobrain </it>gene appears to have been lost in chordates. We could not identify it in the sequenced genomes of human, mouse, chicken, clawed frog, zebrafish, lancelet or tunicate (Figure <figr fid="F4">4</figr>). However, we identified clear orthologs in two other deuterostomes (sea urchin and hemichordate) and throughout the protostomes and 'basal' metazoans. Thus, this gene may have been truly lost from the genome or it may have become so highly modified that it is no longer recognizable as a <it>homeobrain </it>ortholog. The phylogenetic analysis performed here, based on homeodomain sequences, does not strongly suggest another PRD-class gene as a possible ortholog. Unfortunately, we cannot rely on regions outside the 60-amino acid homeodomain to obtain additional phylogenetic signal because there was not sufficient sequence conservation to permit alignments across all genes represented in the phylogeny. The highly conserved 128-amino acid <it>Paired </it>domain, present in many PRD-class homeobox genes, is absent from the <it>homeobrain</it>, or<it>thopedia </it>and <it>rx </it>families, among others <abbrgrp>
<abbr bid="B29">29</abbr>
<abbr bid="B39">39</abbr>
<abbr bid="B40">40</abbr>
</abbrgrp>. With the present data, our analyses support the hypothesis that <it>homeobrain </it>was lost early in the chordate lineage.</p>
</sec>
<sec>
<st>
<p>Coding polymorphisms in <it>Nematostella</it>
</p>
</st>
<p>Despite potential stabilizing selection to maintain the cluster, we observed many polymorphisms in the <it>Nematostella </it>genes comprising the cluster, including nonsynonymous substitutions in the homeodomain of <it>rx </it>that have a patchy distribution in natural populations. The functional role of these polymorphisms awaits future experimental characterization. The presence of few homozygotes for the rare amino acid for each position in our sampling of natural populations is potential evidence for a functional difference between alleles.</p>
</sec>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>A PRD-class homeobox cluster comprising <it>homeobrain</it>, or<it>thopedia </it>and <it>rax </it>is widely conserved in animals and thus represents an ancient gene cluster dating to early in metazoan evolution. Very similar clusters comprising these three genes are present in <it>Nematostella</it>, in diverse protostomes, and potentially in non-chordate deuterostomes. Chordates appear to have lost one member (<it>homeobrain</it>) and the linkage between the other two genes, <it>rx </it>and <it>orthopedia</it>, has dissolved in the diverse vertebrates we examined.</p>
</sec>
<sec>
<st>
<p>Competing interests</p>
</st>
<p>The authors declare that they have no competing interests.</p>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>MEM performed the majority of the gene mapping analysis, participated in the phylogenetic analysis and contributed to drafting the manuscript. KP performed the <it>in situ </it>hybridization studies. AMR performed the population genetic studies of rx and participated in the gene mapping studies and phylogenetic analysis. MQM participated in the design of the study and oversaw the analysis of the gene expression data. JRF participated in the design of the study and the phylogenetic analysis, and contributed to drafting the manuscript. All authors read and approved the final manuscript.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>We are extremely grateful for the insightful suggestions of two anonymous reviewers. This research was supported by NSF grant IOS-0818831 to JRF.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Conservation of gene linkage in dispersed vertebrate NK homeobox clusters</p></title><aug><au><snm>Wotton</snm><fnm>KR</fnm></au><au><snm>Weierud</snm><fnm>FK</fnm></au><au><snm>Juarez-Morales</snm><fnm>JL</fnm></au><au><snm>Alvares</snm><fnm>LE</fnm></au><au><snm>Dietrich</snm><fnm>S</fnm></au><au><snm>Lewis</snm><fnm>KE</fnm></au></aug><source>Dev Genes Evol</source><pubdate>2009</pubdate><volume>219</volume><fpage>481</fpage><lpage>496</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00427-009-0311-y</pubid><pubid idtype="pmpid">20112453</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Genomic evolution of Hox gene clusters</p></title><aug><au><snm>Lemons</snm><fnm>D</fnm></au><au><snm>McGinnis</snm><fnm>W</fnm></au></aug><source>Science</source><pubdate>2006</pubdate><volume>313</volume><fpage>1918</fpage><lpage>1922</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1132040</pubid><pubid idtype="pmpid" link="fulltext">17008523</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>The NK homeobox gene cluster predates the origin of hox genes</p></title><aug><au><snm>Larroux</snm><fnm>C</fnm></au><au><snm>Fahey</snm><fnm>B</fnm></au><au><snm>Degnan</snm><fnm>SM</fnm></au><au><snm>Adamski</snm><fnm>M</fnm></au><au><snm>Rokhsar</snm><fnm>DS</fnm></au><au><snm>Degnan</snm><fnm>BM</fnm></au></aug><source>Current Biology</source><pubdate>2007</pubdate><volume>17</volume><fpage>706</fpage><lpage>710</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cub.2007.03.008</pubid><pubid idtype="pmpid" link="fulltext">17379523</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Clustered Fox genes in lophotrochozoans and the evolution of the bilaterian Fox gene cluster</p></title><aug><au><snm>Shimeld</snm><fnm>SM</fnm></au><au><snm>Boyle</snm><fnm>MJ</fnm></au><au><snm>Brunet</snm><fnm>T</fnm></au><au><snm>Luke</snm><fnm>GN</fnm></au><au><snm>Seaver</snm><fnm>EC</fnm></au></aug><source>Dev Biol</source><pubdate>2010</pubdate><volume>340</volume><fpage>234</fpage><lpage>48</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ydbio.2010.01.015</pubid><pubid idtype="pmpid" link="fulltext">20096280</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Identification and comparative analysis of the protocadherin cluster in a reptile, the green anole lizard</p></title><aug><au><snm>Jiang</snm><fnm>XJ</fnm></au><au><snm>Li</snm><fnm>S</fnm></au><au><snm>Ravi</snm><fnm>V</fnm></au><au><snm>Venkatesh</snm><fnm>B</fnm></au><au><snm>Yu</snm><fnm>WP</fnm></au></aug><source>PLoS One</source><pubdate>2009</pubdate><volume>4</volume><fpage>e7614</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0007614</pubid><pubid idtype="pmcid">2764143</pubid><pubid idtype="pmpid">19898614</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Platypus globin genes and flanking loci suggest a new insertional model for beta-globin evolution in birds and mammals</p></title><aug><au><snm>Patel</snm><fnm>VS</fnm></au><au><snm>Cooper</snm><fnm>SJ</fnm></au><au><snm>Deakin</snm><fnm>JE</fnm></au><au><snm>Fulton</snm><fnm>B</fnm></au><au><snm>Graves</snm><fnm>T</fnm></au><au><snm>Warren</snm><fnm>WC</fnm></au><au><snm>Wilson</snm><fnm>RK</fnm></au><au><snm>Graves</snm><fnm>JA</fnm></au></aug><source>BMC Biol</source><pubdate>2008</pubdate><volume>6</volume><fpage>34</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1741-7007-6-34</pubid><pubid idtype="pmcid">2529266</pubid><pubid idtype="pmpid">18657265</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>The Dlx gene complement of the leopard shark, Triakis semifasciata, resembles that of mammals: implications for genomic and morphological evolution of jawed vertebrates</p></title><aug><au><snm>Stock</snm><fnm>DW</fnm></au></aug><source>Genetics</source><pubdate>2005</pubdate><volume>169</volume><fpage>807</fpage><lpage>817</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1534/genetics.104.031831</pubid><pubid idtype="pmcid">1449088</pubid><pubid idtype="pmpid">15489533</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Chromosomal clustering of a human transcriptome reveals regulatory background</p></title><aug><au><snm>Vogel</snm><fnm>JH</fnm></au><au><snm>von Heydebreck</snm><fnm>A</fnm></au><au><snm>Purmann</snm><fnm>A</fnm></au><au><snm>Sperling</snm><fnm>S</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2005</pubdate><volume>6</volume><fpage>230</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-6-230</pubid><pubid idtype="pmcid">1261156</pubid><pubid idtype="pmpid">16171528</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>MetaHox gene clusters</p></title><aug><au><snm>Coulier</snm><fnm>F</fnm></au><au><snm>Popovici</snm><fnm>C</fnm></au><au><snm>Villet</snm><fnm>R</fnm></au><au><snm>Birnbaum</snm><fnm>D</fnm></au></aug><source>J Exp Zool</source><pubdate>2000</pubdate><volume>288</volume><fpage>345</fpage><lpage>351</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/1097-010X(20001215)288:4&lt;345::AID-JEZ7&gt;3.0.CO;2-Y</pubid><pubid idtype="pmpid">11144283</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Evidence for 14 homeobox gene clusters in human genome ancestry</p></title><aug><au><snm>Pollard</snm><fnm>SL</fnm></au><au><snm>Holland</snm><fnm>PW</fnm></au></aug><source>Curr Biol</source><pubdate>2000</pubdate><volume>10</volume><fpage>1059</fpage><lpage>1062</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0960-9822(00)00676-X</pubid><pubid idtype="pmpid" link="fulltext">10996074</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Evolution of the integrin alpha and beta protein families</p></title><aug><au><snm>Hughes</snm><fnm>AL</fnm></au></aug><source>J Mol Evol</source><pubdate>2001</pubdate><volume>52</volume><fpage>63</fpage><lpage>72</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11139295</pubid></xrefbib></bibl><bibl id="B12"><title><p>Evolution and divergence of sodium channel genes in vertebrates</p></title><aug><au><snm>Lopreato</snm><fnm>GF</fnm></au><au><snm>Lu</snm><fnm>Y</fnm></au><au><snm>Southwell</snm><fnm>A</fnm></au><au><snm>Atkinson</snm><fnm>NS</fnm></au><au><snm>Hillis</snm><fnm>DM</fnm></au><au><snm>Wilcox</snm><fnm>TP</fnm></au><au><snm>Zakon</snm><fnm>HH</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2001</pubdate><volume>98</volume><fpage>7588</fpage><lpage>7592</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.131171798</pubid><pubid idtype="pmcid">34712</pubid><pubid idtype="pmpid">11416226</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>EVG, the remnants of a primordial bilaterian's synteny of functionally unrelated genes</p></title><aug><au><snm>Granadino</snm><fnm>B</fnm></au><au><snm>Rey-Campos</snm><fnm>J</fnm></au></aug><source>J Mol Evol</source><pubdate>2003</pubdate><volume>57</volume><fpage>515</fpage><lpage>519</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00239-003-2503-9</pubid><pubid idtype="pmpid" link="fulltext">14738309</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>Comparative genomics of the keratin-associated protein (KAP) gene clusters in human, chimpanzee, and baboon</p></title><aug><au><snm>Shibuya</snm><fnm>K</fnm></au><au><snm>Kudoh</snm><fnm>J</fnm></au><au><snm>Obayashi</snm><fnm>I</fnm></au><au><snm>Shimizu</snm><fnm>A</fnm></au><au><snm>Sasaki</snm><fnm>T</fnm></au><au><snm>Minoshima</snm><fnm>S</fnm></au><au><snm>Shimizu</snm><fnm>N</fnm></au></aug><source>Mamm Genome</source><pubdate>2004</pubdate><volume>15</volume><fpage>179</fpage><lpage>192</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00335-003-2313-9</pubid><pubid idtype="pmpid" link="fulltext">15014967</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Rhox: a new homeobox gene cluster</p></title><aug><au><snm>Maclean</snm><fnm>JA</fnm></au><au><snm>Chen</snm><fnm>MA</fnm></au><au><snm>Wayne</snm><fnm>CM</fnm></au><au><snm>Bruce</snm><fnm>SR</fnm></au><au><snm>Rao</snm><fnm>M</fnm></au><au><snm>Meistrich</snm><fnm>ML</fnm></au><au><snm>Macleod</snm><fnm>C</fnm></au><au><snm>Wilkinson</snm><fnm>MF</fnm></au></aug><source>Cell</source><pubdate>2005</pubdate><volume>120</volume><fpage>369</fpage><lpage>382</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2004.12.022</pubid><pubid idtype="pmpid" link="fulltext">15707895</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Minimal ProtoHox cluster inferred from bilaterian and cnidarian Hox complements</p></title><aug><au><snm>Chourrout</snm><fnm>D</fnm></au><au><snm>Delsuc</snm><fnm>F</fnm></au><au><snm>Chourrout</snm><fnm>P</fnm></au><au><snm>Edvardsen</snm><fnm>RB</fnm></au><au><snm>Rentzsch</snm><fnm>F</fnm></au><au><snm>Renfer</snm><fnm>E</fnm></au><au><snm>Jensen</snm><fnm>MF</fnm></au><au><snm>Zhu</snm><fnm>B</fnm></au><au><snm>de Jong</snm><fnm>P</fnm></au><au><snm>Steele</snm><fnm>RE</fnm></au><au><snm>Technau</snm><fnm>U</fnm></au></aug><source>Nature</source><pubdate>2006</pubdate><volume>442</volume><fpage>684</fpage><lpage>687</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature04863</pubid><pubid idtype="pmpid" link="fulltext">16900199</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>An ancient Fox gene cluster in bilaterian animals</p></title><aug><au><snm>Mazet</snm><fnm>F</fnm></au><au><snm>Amemiya</snm><fnm>CT</fnm></au><au><snm>Shimeld</snm><fnm>SM</fnm></au></aug><source>Curr Biol</source><pubdate>2006</pubdate><volume>16</volume><fpage>R314</fpage><lpage>316</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cub.2006.03.088</pubid><pubid idtype="pmpid" link="fulltext">16682334</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>Pre-bilaterian origins of the hox cluster and the hox code: evidence from the sea anemone, <it>Nematostella vectensis</it></p></title><aug><au><snm>Ryan</snm><fnm>JF</fnm></au><au><snm>Mazza</snm><fnm>ME</fnm></au><au><snm>Pang</snm><fnm>K</fnm></au><au><snm>Matus</snm><fnm>DQ</fnm></au><au><snm>Baxevanis</snm><fnm>AD</fnm></au><au><snm>Martindale</snm><fnm>MQ</fnm></au><au><snm>Finnerty</snm><fnm>JR</fnm></au></aug><source>PLoS ONE</source><pubdate>2007</pubdate><volume>2</volume><fpage>e153</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0000153</pubid><pubid idtype="pmcid">1779807</pubid><pubid idtype="pmpid">17252055</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>Conserved and novel Wnt clusters in the basal eumetazoan <it>Nematostella vectensis</it></p></title><aug><au><snm>Sullivan</snm><fnm>JC</fnm></au><au><snm>Ryan</snm><fnm>JF</fnm></au><au><snm>Mullikin</snm><fnm>JC</fnm></au><au><snm>Finnerty</snm><fnm>JR</fnm></au></aug><source>Dev Genes Evol</source><pubdate>2007</pubdate><volume>217</volume><fpage>235</fpage><lpage>239</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00427-007-0136-5</pubid><pubid idtype="pmpid" link="fulltext">17310352</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Features of the ancestral bilaterian inferred from Platynereis dumerilii ParaHox genes</p></title><aug><au><snm>Hui</snm><fnm>J</fnm></au><au><snm>Raible</snm><fnm>F</fnm></au><au><snm>Korchagina</snm><fnm>N</fnm></au><au><snm>Dray</snm><fnm>N</fnm></au><au><snm>Samain</snm><fnm>S</fnm></au><au><snm>Magdelenat</snm><fnm>G</fnm></au><au><snm>Jubin</snm><fnm>C</fnm></au><au><snm>Segurens</snm><fnm>B</fnm></au><au><snm>Balavoine</snm><fnm>G</fnm></au><au><snm>Arendt</snm><fnm>D</fnm></au><au><snm>Ferrier</snm><fnm>D</fnm></au></aug><source>BMC Biology</source><pubdate>2009</pubdate><volume>7</volume><fpage>43</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1741-7007-7-43</pubid><pubid idtype="pmcid">2723086</pubid><pubid idtype="pmpid">19627570</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Evolutionary history of the iroquois/Irx genes in metazoans</p></title><aug><au><snm>Kerner</snm><fnm>P</fnm></au><au><snm>Ikmi</snm><fnm>A</fnm></au><au><snm>Coen</snm><fnm>D</fnm></au><au><snm>Vervoort</snm><fnm>M</fnm></au></aug><source>BMC Evolutionary Biology</source><pubdate>2009</pubdate><volume>9</volume><fpage>74</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2148-9-74</pubid><pubid idtype="pmcid">2674049</pubid><pubid idtype="pmpid">19368711</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>A gene complex controlling segmentation in Drosophila</p></title><aug><au><snm>Lewis</snm><fnm>EB</fnm></au></aug><source>Nature</source><pubdate>1978</pubdate><volume>276</volume><fpage>565</fpage><lpage>570</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/276565a0</pubid><pubid idtype="pmpid">103000</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Why are Hox genes clustered?</p></title><aug><au><snm>Mann</snm><fnm>RS</fnm></au></aug><source>Bioessays</source><pubdate>1997</pubdate><volume>19</volume><fpage>661</fpage><lpage>664</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/bies.950190804</pubid><pubid idtype="pmpid">9264246</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>Hox clusters and bilaterian phylogeny</p></title><aug><au><snm>Balavoine</snm><fnm>G</fnm></au><au><snm>de Rosa</snm><fnm>R</fnm></au><au><snm>Adoutte</snm><fnm>A</fnm></au></aug><source>Mol Phylogenet Evol</source><pubdate>2002</pubdate><volume>24</volume><fpage>366</fpage><lpage>373</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S1055-7903(02)00237-3</pubid><pubid idtype="pmpid" link="fulltext">12220978</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Genomics of the HOX gene cluster</p></title><aug><au><snm>Martinez</snm><fnm>P</fnm></au><au><snm>Amemiya</snm><fnm>CT</fnm></au></aug><source>Comp Biochem Physiol B Biochem Mol Biol</source><pubdate>2002</pubdate><volume>133</volume><fpage>571</fpage><lpage>580</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S1096-4959(02)00121-5</pubid><pubid idtype="pmpid" link="fulltext">12470820</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Genetic organization and embryonic expression of the ParaHox genes in the sea urchin S. purpuratus: insights into the relationship between clustering and colinearity</p></title><aug><au><snm>Arnone</snm><fnm>MI</fnm></au><au><snm>Rizzo</snm><fnm>F</fnm></au><au><snm>Annunciata</snm><fnm>R</fnm></au><au><snm>Cameron</snm><fnm>RA</fnm></au><au><snm>Peterson</snm><fnm>KJ</fnm></au><au><snm>Martinez</snm><fnm>P</fnm></au></aug><source>Dev Biol</source><pubdate>2006</pubdate><volume>300</volume><fpage>63</fpage><lpage>73</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ydbio.2006.07.037</pubid><pubid idtype="pmpid" link="fulltext">16959236</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>The rise and fall of Hox gene clusters</p></title><aug><au><snm>Duboule</snm><fnm>D</fnm></au></aug><source>Development</source><pubdate>2007</pubdate><volume>134</volume><fpage>2549</fpage><lpage>2560</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1242/dev.001065</pubid><pubid idtype="pmpid" link="fulltext">17553908</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>Organization and structure of hox gene loci in medaka genome and comparison with those of pufferfish and zebrafish genomes</p></title><aug><au><snm>Kurosawa</snm><fnm>G</fnm></au><au><snm>Takamatsu</snm><fnm>N</fnm></au><au><snm>Takahashi</snm><fnm>M</fnm></au><au><snm>Sumitomo</snm><fnm>M</fnm></au><au><snm>Sanaka</snm><fnm>E</fnm></au><au><snm>Yamada</snm><fnm>K</fnm></au><au><snm>Nishii</snm><fnm>K</fnm></au><au><snm>Matsuda</snm><fnm>M</fnm></au><au><snm>Asakawa</snm><fnm>S</fnm></au><au><snm>Ishiguro</snm><fnm>H</fnm></au><au><snm>Miura</snm><fnm>K</fnm></au><au><snm>Kurosawa</snm><fnm>Y</fnm></au><au><snm>Shimizu</snm><fnm>N</fnm></au><au><snm>Kohara</snm><fnm>Y</fnm></au><au><snm>Hori</snm><fnm>H</fnm></au></aug><source>Gene</source><pubdate>2006</pubdate><volume>370</volume><fpage>75</fpage><lpage>82</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.gene.2005.11.015</pubid><pubid idtype="pmpid" link="fulltext">16472944</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Homeodomain proteins</p></title><aug><au><snm>B&#252;rglin</snm><fnm>TR</fnm></au></aug><source>Encyclopedia or Molecular Cell Biology and Molecular Medicine</source><publisher>Weinheim: Wiley-VCH Verlag</publisher><editor>Meyers RA</editor><edition>2</edition><pubdate>2005</pubdate></bibl><bibl id="B30"><title><p>Genomic organization, gene structure, and developmental expression of three Clustered otx genes in the sea anemone Nematostella vectensis</p></title><aug><au><snm>Mazza</snm><fnm>ME</fnm></au><au><snm>Pang</snm><fnm>K</fnm></au><au><snm>Martindale</snm><fnm>MQ</fnm></au><au><snm>Finnerty</snm><fnm>JR</fnm></au></aug><source>J Exp Zoolog B Mol Dev Evol</source><pubdate>2007</pubdate><volume>308</volume><fpage>494</fpage><lpage>506</lpage><xrefbib><pubid idtype="doi">10.1002/jez.b.21158</pubid></xrefbib></bibl><bibl id="B31"><title><p>Homeobrain, a novel paired-like homeobox gene is expressed in the <it>Drosophila </it>brain</p></title><aug><au><snm>Walldorf</snm><fnm>U</fnm></au><au><snm>Kiewe</snm><fnm>A</fnm></au><au><snm>Wickert</snm><fnm>M</fnm></au><au><snm>Ronshaugen</snm><fnm>M</fnm></au><au><snm>McGinnis</snm><fnm>W</fnm></au></aug><source>Mech Dev</source><pubdate>2000</pubdate><volume>96</volume><fpage>141</fpage><lpage>144</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0925-4773(00)00380-4</pubid><pubid idtype="pmpid" link="fulltext">10940637</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>The cnidarian-bilaterian ancestor possessed at least 56 homeoboxes. Evidence from the starlet sea anemone, Nematostella vectensis</p></title><aug><au><snm>Ryan</snm><fnm>JF</fnm></au><au><snm>Burton</snm><fnm>PM</fnm></au><au><snm>Mazza</snm><fnm>ME</fnm></au><au><snm>Kwong</snm><fnm>GK</fnm></au><au><snm>Mullikin</snm><fnm>JC</fnm></au><au><snm>Finnerty</snm><fnm>JR</fnm></au></aug><source>Genome Biol</source><pubdate>2006</pubdate><volume>7</volume><fpage>R64</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2006-7-7-r64</pubid><pubid idtype="pmcid">1779571</pubid><pubid idtype="pmpid">16867185</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>StellaBase: The Nematostella vectensis genomics database</p></title><aug><au><snm>Sullivan</snm><fnm>JC</fnm></au><au><snm>Ryan</snm><fnm>JF</fnm></au><au><snm>Watson</snm><fnm>JA</fnm></au><au><snm>Webb</snm><fnm>J</fnm></au><au><snm>Mullikin</snm><fnm>JC</fnm></au><au><snm>Rokhsar</snm><fnm>D</fnm></au><au><snm>Finnerty</snm><fnm>JR</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2006</pubdate><volume>34</volume><fpage>D495</fpage><lpage>499</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkj020</pubid><pubid idtype="pmcid">1347383</pubid><pubid idtype="pmpid">16381919</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Sea anemone genome reveals ancestral eumetazoan gene repertoire and genomic organization</p></title><aug><au><snm>Putnam</snm><fnm>NH</fnm></au><au><snm>Srivastava</snm><fnm>M</fnm></au><au><snm>Hellsten</snm><fnm>U</fnm></au><au><snm>Dirks</snm><fnm>B</fnm></au><au><snm>Chapman</snm><fnm>J</fnm></au><au><snm>Salamov</snm><fnm>A</fnm></au><au><snm>Terry</snm><fnm>A</fnm></au><au><snm>Shapiro</snm><fnm>H</fnm></au><au><snm>Lindquist</snm><fnm>E</fnm></au><au><snm>Kapitonov</snm><fnm>VV</fnm></au><au><snm>Jurka</snm><fnm>J</fnm></au><au><snm>Genikhovich</snm><fnm>G</fnm></au><au><snm>Grigoriev</snm><fnm>IV</fnm></au><au><snm>Lucas</snm><fnm>SM</fnm></au><au><snm>Steele</snm><fnm>RE</fnm></au><au><snm>Finnerty</snm><fnm>JR</fnm></au><au><snm>Technau</snm><fnm>U</fnm></au><au><snm>Martindale</snm><fnm>MQ</fnm></au><au><snm>Rokhsar</snm><fnm>DS</fnm></au></aug><source>Science</source><pubdate>2007</pubdate><volume>317</volume><fpage>86</fpage><lpage>94</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1139158</pubid><pubid idtype="pmpid" link="fulltext">17615350</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>MUSCLE: multiple sequence alignment with high accuracy and high throughput</p></title><aug><au><snm>Edgar</snm><fnm>R</fnm></au></aug><source>Nucleic Acids Research</source><pubdate>2004</pubdate><volume>32</volume><fpage>1792</fpage><lpage>1797</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkh340</pubid><pubid idtype="pmcid">390337</pubid><pubid idtype="pmpid">15034147</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>PHYLIP (Phylogeny Inference Package) version 3.6</p></title><aug><au><snm>Felsenstein</snm><fnm>J</fnm></au></aug><publisher>Department of Genome Sciences, University of Washington, Seattle</publisher><pubdate>2005</pubdate></bibl><bibl id="B37"><title><p>MacClade, version 4</p></title><aug><au><snm>Maddison</snm><fnm>DR</fnm></au><au><snm>Maddison</snm><fnm>WP</fnm></au></aug><pubdate>2005</pubdate><xrefbib><pubid idtype="pmpid" link="fulltext">15732050</pubid></xrefbib></bibl><bibl id="B38"><title><p>Early evolution of a homeobox gene: the parahox gene Gsx in the Cnidaria and the Bilateria</p></title><aug><au><snm>Finnerty</snm><fnm>JR</fnm></au><au><snm>Paulson</snm><fnm>D</fnm></au><au><snm>Burton</snm><fnm>P</fnm></au><au><snm>Pang</snm><fnm>K</fnm></au><au><snm>Martindale</snm><fnm>MQ</fnm></au></aug><source>Evol Dev</source><pubdate>2003</pubdate><volume>5</volume><fpage>331</fpage><lpage>345</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1046/j.1525-142X.2003.03041.x</pubid><pubid idtype="pmpid" link="fulltext">12823450</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>A comprehensive classification of homeobox genes</p></title><aug><au><snm>B&#252;rglin</snm><fnm>TR</fnm></au></aug><source>Guidebook to the Homeobox Genes</source><publisher>New York: Oxford University Press</publisher><editor>Duboule D</editor><pubdate>1994</pubdate><fpage>25</fpage><lpage>72</lpage></bibl><bibl id="B40"><title><p>Evolution of homeobox genes: Q50 Paired-like genes founded the Paired class</p></title><aug><au><snm>Galliot</snm><fnm>B</fnm></au><au><snm>de Vargas</snm><fnm>C</fnm></au><au><snm>Miller</snm><fnm>DJ</fnm></au></aug><source>Dev Genes Evol</source><pubdate>1999</pubdate><volume>209</volume><fpage>186</fpage><lpage>197</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s004270050243</pubid><pubid idtype="pmpid" link="fulltext">10079362</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>Capitella sp. I homeobrain-like, the first lophotrochozoan member of a novel paired-like homeobox gene family</p></title><aug><au><snm>Fr&#246;bius</snm><fnm>AC</fnm></au><au><snm>Seaver</snm><fnm>EC</fnm></au></aug><source>Gene Expr Patterns</source><pubdate>2006</pubdate><volume>6</volume><fpage>958</fpage><lpage>91</lpage><xrefbib><pubid idtype="doi">10.1016/j.modgep.2006.04.002</pubid></xrefbib></bibl><bibl id="B42"><title><p>Anatomy and development of the nervous system of <it>Nematostella vectensis</it>, an anthozoan cnidarian</p></title><aug><au><snm>Marlow</snm><fnm>HQ</fnm></au><au><snm>Srivastava</snm><fnm>M</fnm></au><au><snm>Matus</snm><fnm>DQ</fnm></au><au><snm>Rokhsar</snm><fnm>D</fnm></au><au><snm>Martindale</snm><fnm>MQ</fnm></au></aug><source>Dev Neurobiol</source><pubdate>2009</pubdate><volume>69</volume><fpage>235</fpage><lpage>254</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/dneu.20698</pubid><pubid idtype="pmpid" link="fulltext">19170043</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>A novel, tissue-specific, Drosophila homeobox gene</p></title><aug><au><snm>Barad</snm><fnm>M</fnm></au><au><snm>Jack</snm><fnm>T</fnm></au><au><snm>Chadwick</snm><fnm>R</fnm></au><au><snm>McGinnis</snm><fnm>W</fnm></au></aug><source>Embo J</source><pubdate>1988</pubdate><volume>7</volume><fpage>2151</fpage><lpage>2161</lpage><xrefbib><pubidlist><pubid idtype="pmcid">454523</pubid><pubid idtype="pmpid">2901348</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Drosophila homeobox genes</p></title><aug><au><snm>Dessain</snm><fnm>S</fnm></au><au><snm>McGinnis</snm><fnm>W</fnm></au></aug><source>Adv Dev Biochem</source><pubdate>1993</pubdate><volume>2</volume><fpage>1</fpage><lpage>55</lpage><xrefbib><pubid idtype="doi">full_text</pubid></xrefbib></bibl><bibl id="B45"><title><p>Isolation of a <it>Drosophila </it>homolog of the vertebrate homeobox gene <it>Rx </it>and its possible role in brain and eye development</p></title><aug><au><snm>Eggert</snm><fnm>T</fnm></au><au><snm>Hauck</snm><fnm>B</fnm></au><au><snm>Hildebrandt</snm><fnm>N</fnm></au><au><snm>Gehring</snm><fnm>WJ</fnm></au><au><snm>Walldorf</snm><fnm>U</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1998</pubdate><volume>95</volume><fpage>2343</fpage><lpage>2348</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.95.5.2343</pubid><pubid idtype="pmcid">19340</pubid><pubid idtype="pmpid">9482887</pubid></pubidlist></xrefbib></bibl><bibl id="B46"><title><p>A genomic view of the sea urchin nervous system</p></title><aug><au><snm>Burke</snm><fnm>RD</fnm></au><au><snm>Angerer</snm><fnm>LM</fnm></au><au><snm>Elphick</snm><fnm>MR</fnm></au><au><snm>Humphrey</snm><fnm>GW</fnm></au><au><snm>Yaguchi</snm><fnm>S</fnm></au><au><snm>Kiyama</snm><fnm>T</fnm></au><au><snm>Liang</snm><fnm>S</fnm></au><au><snm>Mu</snm><fnm>X</fnm></au><au><snm>Agca</snm><fnm>C</fnm></au><au><snm>Klein</snm><fnm>WH</fnm></au><au><snm>Brandhorst</snm><fnm>BP</fnm></au><au><snm>Rowe</snm><fnm>M</fnm></au><au><snm>Wilson</snm><fnm>K</fnm></au><au><snm>Churcher</snm><fnm>AM</fnm></au><au><snm>Taylor</snm><fnm>JS</fnm></au><au><snm>Chen</snm><fnm>N</fnm></au><au><snm>Murray</snm><fnm>G</fnm></au><au><snm>Wang</snm><fnm>D</fnm></au><au><snm>Mellott</snm><fnm>D</fnm></au><au><snm>Olinski</snm><fnm>R</fnm></au><au><snm>Hallb&#246;&#246;k</snm><fnm>F</fnm></au><au><snm>Thorndyke</snm><fnm>MC</fnm></au></aug><source>Dev Biol</source><pubdate>2006</pubdate><volume>300</volume><fpage>434</fpage><lpage>460</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ydbio.2006.08.007</pubid><pubid idtype="pmcid">1950334</pubid><pubid idtype="pmpid">16965768</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>The role of Otx and Otp genes in brain development</p></title><aug><au><snm>Acampora</snm><fnm>D</fnm></au><au><snm>Postiglione</snm><fnm>MP</fnm></au><au><snm>Avantaggiato</snm><fnm>V</fnm></au><au><snm>Di Bonito</snm><fnm>M</fnm></au><au><snm>Simeone</snm><fnm>A</fnm></au></aug><source>Int J Dev Biol</source><pubdate>2000</pubdate><volume>44</volume><fpage>669</fpage><lpage>677</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11061431</pubid></xrefbib></bibl><bibl id="B48"><title><p>Progressive impairment of developing neuroendocrine cell lineages in the hypothalamus of mice lacking the Orthopedia gene</p></title><aug><au><snm>Acampora</snm><fnm>D</fnm></au><au><snm>Postiglione</snm><fnm>MP</fnm></au><au><snm>Avantaggiato</snm><fnm>V</fnm></au><au><snm>Di Bonito</snm><fnm>M</fnm></au><au><snm>Vaccarino</snm><fnm>FM</fnm></au><au><snm>Michaud</snm><fnm>J</fnm></au><au><snm>Simeone</snm><fnm>A</fnm></au></aug><source>Genes Dev</source><pubdate>1999</pubdate><volume>13</volume><fpage>2787</fpage><lpage>2800</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.13.21.2787</pubid><pubid idtype="pmcid">317121</pubid><pubid idtype="pmpid">10557207</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>Differential regulation of the zebrafish orthopedia 1 gene during fate determination of diencephalic neurons</p></title><aug><au><snm>Del Giacco</snm><fnm>L</fnm></au><au><snm>Sordino</snm><fnm>P</fnm></au><au><snm>Pistocchi</snm><fnm>A</fnm></au><au><snm>Andreakis</snm><fnm>N</fnm></au><au><snm>Tarallo</snm><fnm>R</fnm></au><au><snm>Di Benedetto</snm><fnm>B</fnm></au><au><snm>Cotelli</snm><fnm>F</fnm></au></aug><source>BMC Dev Biol</source><pubdate>2006</pubdate><volume>6</volume><fpage>50</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-213X-6-50</pubid><pubid idtype="pmcid">1635040</pubid><pubid idtype="pmpid">17074092</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>Identification, chromosomal assignment, and expression analysis of the human homeodomain-containing gene Orthopedia (OTP)</p></title><aug><au><snm>Lin</snm><fnm>X</fnm></au><au><snm>State</snm><fnm>MW</fnm></au><au><snm>Vaccarino</snm><fnm>FM</fnm></au><au><snm>Greally</snm><fnm>J</fnm></au><au><snm>Hass</snm><fnm>M</fnm></au><au><snm>Leckman</snm><fnm>JF</fnm></au></aug><source>Genomics</source><pubdate>1999</pubdate><volume>60</volume><fpage>96</fpage><lpage>104</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/geno.1999.5882</pubid><pubid idtype="pmpid" link="fulltext">10458915</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p><it>Orthopedia</it>, a novel homeobox-containing gene expressed in the developing CNS of both mouse and <it>Drosophila</it></p></title><aug><au><snm>Simeone</snm><fnm>A</fnm></au><au><snm>D&apos;Apice</snm><fnm>MR</fnm></au><au><snm>Nigro</snm><fnm>V</fnm></au><au><snm>Casanova</snm><fnm>J</fnm></au><au><snm>Graziani</snm><fnm>F</fnm></au><au><snm>Acampora</snm><fnm>D</fnm></au><au><snm>Avantaggiato</snm><fnm>V</fnm></au></aug><source>Neuron</source><pubdate>1994</pubdate><volume>13</volume><fpage>83</fpage><lpage>101</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0896-6273(94)90461-8</pubid><pubid idtype="pmpid" link="fulltext">7913821</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>The murine Otp homeobox gene plays an essential role in the specification of neuronal cell lineages in the developing hypothalamus</p></title><aug><au><snm>Wang</snm><fnm>W</fnm></au><au><snm>Lufkin</snm><fnm>T</fnm></au></aug><source>Dev Biol</source><pubdate>2000</pubdate><volume>227</volume><fpage>432</fpage><lpage>449</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/dbio.2000.9902</pubid><pubid idtype="pmpid" link="fulltext">11071765</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>Anteroposterior patterning in hemichordates and the origins of the chordate nervous system</p></title><aug><au><snm>Lowe</snm><fnm>CJ</fnm></au><au><snm>Wu</snm><fnm>M</fnm></au><au><snm>Salic</snm><fnm>A</fnm></au><au><snm>Evans</snm><fnm>L</fnm></au><au><snm>Lander</snm><fnm>E</fnm></au><au><snm>Stange-Thomann</snm><fnm>N</fnm></au><au><snm>Gruber</snm><fnm>CE</fnm></au><au><snm>Gerhart</snm><fnm>J</fnm></au><au><snm>Kirschner</snm><fnm>M</fnm></au></aug><source>Cell</source><pubdate>2003</pubdate><volume>113</volume><fpage>853</fpage><lpage>865</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0092-8674(03)00469-0</pubid><pubid idtype="pmpid" link="fulltext">12837244</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>A planarian orthopedia homolog is specifically expressed in the branch region of both the mature and regenerating brain</p></title><aug><au><snm>Umesono</snm><fnm>Y</fnm></au><au><snm>Watanabe</snm><fnm>K</fnm></au><au><snm>Agata</snm><fnm>K</fnm></au></aug><source>Dev Growth Differ</source><pubdate>1997</pubdate><volume>39</volume><fpage>723</fpage><lpage>727</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1046/j.1440-169X.1997.t01-5-00008.x</pubid><pubid idtype="pmpid">9493832</pubid></pubidlist></xrefbib></bibl><bibl id="B55"><title><p>Distinct structural domains in the planarian brain defined by the expression of evolutionarily conserved homeobox genes</p></title><aug><au><snm>Umesono</snm><fnm>Y</fnm></au><au><snm>Watanabe</snm><fnm>K</fnm></au><au><snm>Agata</snm><fnm>K</fnm></au></aug><source>Dev Genes Evol</source><pubdate>1999</pubdate><volume>209</volume><fpage>31</fpage><lpage>39</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s004270050224</pubid><pubid idtype="pmpid" link="fulltext">9914416</pubid></pubidlist></xrefbib></bibl><bibl id="B56"><title><p>Novel and conserved roles for orthodenticle/otx and orthopedia/otp orthologs in the gastropod mollusc Patella vulgata</p></title><aug><au><snm>Nederbragt</snm><fnm>AJ</fnm></au><au><snm>te Welscher</snm><fnm>P</fnm></au><au><snm>van den Driesche</snm><fnm>S</fnm></au><au><snm>van Loon</snm><fnm>AE</fnm></au><au><snm>Dictus</snm><fnm>WJ</fnm></au></aug><source>Dev Genes Evol</source><pubdate>2002</pubdate><volume>212</volume><fpage>330</fpage><lpage>337</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00427-002-0246-z</pubid><pubid idtype="pmpid" link="fulltext">12185486</pubid></pubidlist></xrefbib></bibl><bibl id="B57"><title><p>Homeobox genes and sea urchin development</p></title><aug><au><snm>Di Bernardo</snm><fnm>M</fnm></au><au><snm>Bellomonte</snm><fnm>D</fnm></au><au><snm>Castagnetti</snm><fnm>S</fnm></au><au><snm>Melfi</snm><fnm>R</fnm></au><au><snm>Oliveri</snm><fnm>P</fnm></au><au><snm>Spinelli</snm><fnm>G</fnm></au></aug><source>Int J Dev Biol</source><pubdate>2000</pubdate><volume>44</volume><fpage>637</fpage><lpage>643</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11061427</pubid></xrefbib></bibl><bibl id="B58"><title><p>Spatially restricted expression of PlOtp, a Paracentrotus lividus orthopedia-related homeobox gene, is correlated with oral ectodermal patterning and skeletal morphogenesis in late-cleavage sea urchin embryos</p></title><aug><au><snm>Di Bernardo</snm><fnm>M</fnm></au><au><snm>Castagnetti</snm><fnm>S</fnm></au><au><snm>Bellomonte</snm><fnm>D</fnm></au><au><snm>Oliveri</snm><fnm>P</fnm></au><au><snm>Melfi</snm><fnm>R</fnm></au><au><snm>Palla</snm><fnm>F</fnm></au><au><snm>Spinelli</snm><fnm>G</fnm></au></aug><source>Development</source><pubdate>1999</pubdate><volume>126</volume><fpage>2171</fpage><lpage>2179</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">10207142</pubid></xrefbib></bibl><bibl id="B59"><title><p>Evolution of OTP-independent larval skeleton patterning in the direct-developing sea urchin, Heliocidaris erythrogramma</p></title><aug><au><snm>Zhou</snm><fnm>N</fnm></au><au><snm>Wilson</snm><fnm>KA</fnm></au><au><snm>Andrews</snm><fnm>ME</fnm></au><au><snm>Kauffman</snm><fnm>JS</fnm></au><au><snm>Raff</snm><fnm>RA</fnm></au></aug><source>J Exp Zoolog B Mol Dev Evol</source><pubdate>2003</pubdate><volume>300</volume><fpage>58</fpage><lpage>71</lpage></bibl><bibl id="B60"><title><p>Regulatory sequences driving expression of the sea urchin Otp homeobox gene in oral ectoderm cells</p></title><aug><au><snm>Cavalieri</snm><fnm>V</fnm></au><au><snm>Bernardo</snm><fnm>MD</fnm></au><au><snm>Spinelli</snm><fnm>G</fnm></au></aug><source>Gene Expr Patterns</source><pubdate>2007</pubdate><volume>7</volume><fpage>124</fpage><lpage>30</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.modgep.2006.06.001</pubid><pubid idtype="pmpid" link="fulltext">16843737</pubid></pubidlist></xrefbib></bibl><bibl id="B61"><title><p>Impairing Otp homeodomain function in oral ectoderm cells affects skeletogenesis in sea urchin embryos</p></title><aug><au><snm>Cavalieri</snm><fnm>V</fnm></au><au><snm>Spinelli</snm><fnm>G</fnm></au><au><snm>Di Bernardo</snm><fnm>M</fnm></au></aug><source>Dev Biol</source><pubdate>2003</pubdate><volume>262</volume><fpage>107</fpage><lpage>118</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0012-1606(03)00317-8</pubid><pubid idtype="pmpid" link="fulltext">14512022</pubid></pubidlist></xrefbib></bibl><bibl id="B62"><title><p>Regulation of vertebrate eye development by Rx genes</p></title><aug><au><snm>Bailey</snm><fnm>TJ</fnm></au><au><snm>El-Hodiri</snm><fnm>H</fnm></au><au><snm>Zhang</snm><fnm>L</fnm></au><au><snm>Shah</snm><fnm>R</fnm></au><au><snm>Mathers</snm><fnm>PH</fnm></au><au><snm>Jamrich</snm><fnm>M</fnm></au></aug><source>Int J Dev Biol</source><pubdate>2004</pubdate><volume>48</volume><fpage>761</fpage><lpage>770</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1387/ijdb.041878tb</pubid><pubid idtype="pmpid" link="fulltext">15558469</pubid></pubidlist></xrefbib></bibl><bibl id="B63"><title><p>Xrx1, a novel Xenopus homeobox gene expressed during eye and pineal gland development</p></title><aug><au><snm>Casarosa</snm><fnm>S</fnm></au><au><snm>Andreazzoli</snm><fnm>M</fnm></au><au><snm>Simeone</snm><fnm>A</fnm></au><au><snm>Barsacchi</snm><fnm>G</fnm></au></aug><source>Mech Dev</source><pubdate>1997</pubdate><volume>61</volume><fpage>187</fpage><lpage>198</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0925-4773(96)00640-5</pubid><pubid idtype="pmpid">9076688</pubid></pubidlist></xrefbib></bibl><bibl id="B64"><title><p>Expression of three Rx homeobox genes in embryonic and adult zebrafish</p></title><aug><au><snm>Chuang</snm><fnm>JC</fnm></au><au><snm>Mathers</snm><fnm>PH</fnm></au><au><snm>Raymond</snm><fnm>PA</fnm></au></aug><source>Mech Dev</source><pubdate>1999</pubdate><volume>84</volume><fpage>195</fpage><lpage>198</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0925-4773(99)00077-5</pubid><pubid idtype="pmpid" link="fulltext">10473141</pubid></pubidlist></xrefbib></bibl><bibl id="B65"><title><p>Zebrafish genes rx1 and rx2 help define the region of forebrain that gives rise to retina</p></title><aug><au><snm>Chuang</snm><fnm>JC</fnm></au><au><snm>Raymond</snm><fnm>PA</fnm></au></aug><source>Dev Biol</source><pubdate>2001</pubdate><volume>231</volume><fpage>13</fpage><lpage>30</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/dbio.2000.0125</pubid><pubid idtype="pmpid" link="fulltext">11180949</pubid></pubidlist></xrefbib></bibl><bibl id="B66"><title><p>The ascidian homolog of the vertebrate homeobox gene Rx is essential for ocellus development and function</p></title><aug><au><snm>D&apos;Aniello</snm><fnm>S</fnm></au><au><snm>D&apos;Aniello</snm><fnm>E</fnm></au><au><snm>Locascio</snm><fnm>A</fnm></au><au><snm>Memoli</snm><fnm>A</fnm></au><au><snm>Corrado</snm><fnm>M</fnm></au><au><snm>Russo</snm><fnm>MT</fnm></au><au><snm>Aniello</snm><fnm>F</fnm></au><au><snm>Fucci</snm><fnm>L</fnm></au><au><snm>Brown</snm><fnm>ER</fnm></au><au><snm>Branno</snm><fnm>M</fnm></au></aug><source>Differentiation</source><pubdate>2006</pubdate><volume>74</volume><fpage>222</fpage><lpage>234</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1432-0436.2006.00071.x</pubid><pubid idtype="pmpid" link="fulltext">16759288</pubid></pubidlist></xrefbib></bibl><bibl id="B67"><title><p>rax, a novel paired-type homeobox gene, shows expression in the anterior neural fold and developing retina</p></title><aug><au><snm>Furukawa</snm><fnm>T</fnm></au><au><snm>Kozak</snm><fnm>CA</fnm></au><au><snm>Cepko</snm><fnm>CL</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1997</pubdate><volume>94</volume><fpage>3088</fpage><lpage>3093</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.94.7.3088</pubid><pubid idtype="pmcid">20326</pubid><pubid idtype="pmpid">9096350</pubid></pubidlist></xrefbib></bibl><bibl id="B68"><title><p>Medaka eyeless is the key factor linking retinal determination and eye growth</p></title><aug><au><snm>Loosli</snm><fnm>F</fnm></au><au><snm>Winkler</snm><fnm>S</fnm></au><au><snm>Burgtorf</snm><fnm>C</fnm></au><au><snm>Wurmbach</snm><fnm>E</fnm></au><au><snm>Ansorge</snm><fnm>W</fnm></au><au><snm>Henrich</snm><fnm>T</fnm></au><au><snm>Grabher</snm><fnm>C</fnm></au><au><snm>Arendt</snm><fnm>D</fnm></au><au><snm>Carl</snm><fnm>M</fnm></au><au><snm>Krone</snm><fnm>A</fnm></au><au><snm>Grzebisz</snm><fnm>E</fnm></au><au><snm>Wittbrodt</snm><fnm>J</fnm></au></aug><source>Development</source><pubdate>2001</pubdate><volume>128</volume><fpage>4035</fpage><lpage>4044</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11641226</pubid></xrefbib></bibl><bibl id="B69"><title><p>Regulation of eye formation by the Rx and pax6 homeobox genes</p></title><aug><au><snm>Mathers</snm><fnm>PH</fnm></au><au><snm>Jamrich</snm><fnm>M</fnm></au></aug><source>Cell Mol Life Sci</source><pubdate>2000</pubdate><volume>57</volume><fpage>186</fpage><lpage>194</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/PL00000683</pubid><pubid idtype="pmpid" link="fulltext">10766016</pubid></pubidlist></xrefbib></bibl><bibl id="B70"><title><p>Identification of chick rax/rx genes with overlapping patterns of expression during early eye and brain development</p></title><aug><au><snm>Ohuchi</snm><fnm>H</fnm></au><au><snm>Tomonari</snm><fnm>S</fnm></au><au><snm>Itoh</snm><fnm>H</fnm></au><au><snm>Mikawa</snm><fnm>T</fnm></au><au><snm>Noji</snm><fnm>S</fnm></au></aug><source>Mech Dev</source><pubdate>1999</pubdate><volume>85</volume><fpage>193</fpage><lpage>195</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0925-4773(99)00094-5</pubid><pubid idtype="pmpid" link="fulltext">10415362</pubid></pubidlist></xrefbib></bibl><bibl id="B71"><title><p>Drosophila retinal homeobox (drx) is not required for establishment of the visual system, but is required for brain and clypeus development</p></title><aug><au><snm>Davis</snm><fnm>RJ</fnm></au><au><snm>Tavsanli</snm><fnm>BC</fnm></au><au><snm>Dittrich</snm><fnm>C</fnm></au><au><snm>Walldorf</snm><fnm>U</fnm></au><au><snm>Mardon</snm><fnm>G</fnm></au></aug><source>Dev Biol</source><pubdate>2003</pubdate><volume>259</volume><fpage>272</fpage><lpage>287</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0012-1606(03)00201-X</pubid><pubid idtype="pmpid" link="fulltext">12871701</pubid></pubidlist></xrefbib></bibl><bibl id="B72"><title><p>Conservation of the paired domain in metazoans and its structure in three isolated human genes</p></title><aug><au><snm>Burri</snm><fnm>M</fnm></au><au><snm>Tromvoukis</snm><fnm>Y</fnm></au><au><snm>Bopp</snm><fnm>D</fnm></au><au><snm>Frigerio</snm><fnm>G</fnm></au><au><snm>Noll</snm><fnm>M</fnm></au></aug><source>EMBO J</source><pubdate>1989</pubdate><volume>8</volume><fpage>1183</fpage><lpage>1190</lpage><xrefbib><pubidlist><pubid idtype="pmcid">400932</pubid><pubid idtype="pmpid">2501086</pubid></pubidlist></xrefbib></bibl><bibl id="B73"><title><p>Molecular genetics of aristaless, a prd-type homeo box gene involved in the morphogenesis of proximal and distal pattern elements in a subset of appendages in Drosophila</p></title><aug><au><snm>Schneitz</snm><fnm>K</fnm></au><au><snm>Spielmann</snm><fnm>P</fnm></au><au><snm>Noll</snm><fnm>M</fnm></au></aug><source>Genes Dev</source><pubdate>1993</pubdate><volume>7</volume><fpage>114</fpage><lpage>129</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.7.1.114</pubid><pubid idtype="pmpid" link="fulltext">8093690</pubid></pubidlist></xrefbib></bibl><bibl id="B74"><title><p>A conserved region of engrailed, shared among all en-, gsc-, Nk1-, Nk2- and msh-class homeoproteins, mediates active transcriptional repression <it>in vivo</it></p></title><aug><au><snm>Smith</snm><fnm>ST</fnm></au><au><snm>Jaynes</snm><fnm>JB</fnm></au></aug><source>Development</source><pubdate>1996</pubdate><volume>122</volume><fpage>3141</fpage><lpage>3150</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2729110</pubid><pubid idtype="pmpid">8898227</pubid></pubidlist></xrefbib></bibl><bibl id="B75"><title><p>Expression of a novel aristaless related homeobox gene 'Arx' in the vertebrate telencephalon, diencephalon and floor plate</p></title><aug><au><snm>Miura</snm><fnm>H</fnm></au><au><snm>Yanazawa</snm><fnm>M</fnm></au><au><snm>Kato</snm><fnm>K</fnm></au><au><snm>Kitamura</snm><fnm>K</fnm></au></aug><source>Mech Dev</source><pubdate>1997</pubdate><volume>65</volume><fpage>99</fpage><lpage>109</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0925-4773(97)00062-2</pubid><pubid idtype="pmpid">9256348</pubid></pubidlist></xrefbib></bibl><bibl id="B76"><title><p>The OAR/aristaless domain of the homeodomain protein Cart1 has an attenuating role <it>in vivo</it></p></title><aug><au><snm>Brouwer</snm><fnm>A</fnm></au><au><snm>ten Berge</snm><fnm>D</fnm></au><au><snm>Wiegerinck</snm><fnm>R</fnm></au><au><snm>Meijlink</snm><fnm>F</fnm></au></aug><source>Mech Dev</source><pubdate>2003</pubdate><volume>120</volume><fpage>241</fpage><lpage>252</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0925-4773(02)00416-1</pubid><pubid idtype="pmpid" link="fulltext">12559496</pubid></pubidlist></xrefbib></bibl><bibl id="B77"><title><p>The identification of Prx1 transcription regulatory domains provides a mechanism for unequal compensation by the Prx1 and Prx2 loci</p></title><aug><au><snm>Norris</snm><fnm>RA</fnm></au><au><snm>Kern</snm><fnm>MJ</fnm></au></aug><source>J Biol Chem</source><pubdate>2001</pubdate><volume>276</volume><fpage>26829</fpage><lpage>26837</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.M100239200</pubid><pubid idtype="pmpid" link="fulltext">11373278</pubid></pubidlist></xrefbib></bibl><bibl id="B78"><title><p>Identification of domains mediating transcription activation, repression, and inhibition in the paired-related homeobox protein, Prx2 (S8)</p></title><aug><au><snm>Norris</snm><fnm>RA</fnm></au><au><snm>Kern</snm><fnm>MJ</fnm></au></aug><source>DNA Cell Biol</source><pubdate>2001</pubdate><volume>20</volume><fpage>89</fpage><lpage>99</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1089/104454901750070292</pubid><pubid idtype="pmpid" link="fulltext">11244566</pubid></pubidlist></xrefbib></bibl><bibl id="B79"><title><p>A novel murine homeobox gene isolated by a tissue specific PCR cloning strategy</p></title><aug><au><snm>Kern</snm><fnm>MJ</fnm></au><au><snm>Witte</snm><fnm>DP</fnm></au><au><snm>Valerius</snm><fnm>MT</fnm></au><au><snm>Aronow</snm><fnm>BJ</fnm></au><au><snm>Potter</snm><fnm>SS</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1992</pubdate><volume>20</volume><fpage>5189</fpage><lpage>5195</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/20.19.5189</pubid><pubid idtype="pmcid">334304</pubid><pubid idtype="pmpid">1383943</pubid></pubidlist></xrefbib></bibl><bibl id="B80"><title><p>Human PRRX1 and PRRX2 genes: cloning, expression, genomic localization, and exclusion as disease genes for Nager syndrome</p></title><aug><au><snm>Norris</snm><fnm>RA</fnm></au><au><snm>Scott</snm><fnm>KK</fnm></au><au><snm>Moore</snm><fnm>CS</fnm></au><au><snm>Stetten</snm><fnm>G</fnm></au><au><snm>Brown</snm><fnm>CR</fnm></au><au><snm>Jabs</snm><fnm>EW</fnm></au><au><snm>Wulfsberg</snm><fnm>EA</fnm></au><au><snm>Yu</snm><fnm>J</fnm></au><au><snm>Kern</snm><fnm>MJ</fnm></au></aug><source>Mamm Genome</source><pubdate>2000</pubdate><volume>11</volume><fpage>1000</fpage><lpage>1005</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s003350010193</pubid><pubid idtype="pmpid" link="fulltext">11063257</pubid></pubidlist></xrefbib></bibl><bibl id="B81"><title><p>Comparative analysis of Prx1 and Prx2 expression in mice provides evidence for incomplete compensation</p></title><aug><au><snm>Chesterman</snm><fnm>ES</fnm></au><au><snm>Kern</snm><fnm>MJ</fnm></au></aug><source>Anat Rec</source><pubdate>2002</pubdate><volume>266</volume><fpage>1</fpage><lpage>4</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/ar.10028</pubid><pubid idtype="pmpid" link="fulltext">11748565</pubid></pubidlist></xrefbib></bibl><bibl id="B82"><title><p>Hox cluster disintegration with persistent anteroposterior order of expression in Oikopleura dioica</p></title><aug><au><snm>Seo</snm><fnm>HC</fnm></au><au><snm>Edvardsen</snm><fnm>RB</fnm></au><au><snm>Maeland</snm><fnm>AD</fnm></au><au><snm>Bjordal</snm><fnm>M</fnm></au><au><snm>Jensen</snm><fnm>MF</fnm></au><au><snm>Hansen</snm><fnm>A</fnm></au><au><snm>Flaat</snm><fnm>M</fnm></au><au><snm>Weissenbach</snm><fnm>J</fnm></au><au><snm>Lehrach</snm><fnm>H</fnm></au><au><snm>Wincker</snm><fnm>P</fnm></au><au><snm>Reinhardt</snm><fnm>R</fnm></au><au><snm>Chourrout</snm><fnm>D</fnm></au></aug><source>Nature</source><pubdate>2004</pubdate><volume>431</volume><fpage>67</fpage><lpage>71</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature02709</pubid><pubid idtype="pmpid" link="fulltext">15343333</pubid></pubidlist></xrefbib></bibl><bibl id="B83"><title><p>The genesis and evolution of homeobox gene clusters</p></title><aug><au><snm>Garcia-Fernandez</snm><fnm>J</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2005</pubdate><volume>6</volume><fpage>881</fpage><lpage>892</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg1723</pubid><pubid idtype="pmpid" link="fulltext">16341069</pubid></pubidlist></xrefbib></bibl><bibl id="B84"><title><p>Inversion-induced disruption of the Hoxd cluster leads to the partition of regulatory landscapes</p></title><aug><au><snm>Spitz</snm><fnm>F</fnm></au><au><snm>Herkenne</snm><fnm>C</fnm></au><au><snm>Morris</snm><fnm>MA</fnm></au><au><snm>Duboule</snm><fnm>D</fnm></au></aug><source>Nat Genet</source><pubdate>2005</pubdate><volume>37</volume><fpage>889</fpage><lpage>893</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng1597</pubid><pubid idtype="pmpid" link="fulltext">15995706</pubid></pubidlist></xrefbib></bibl><bibl id="B85"><title><p>Chromosomal continuity in the abdominal region of the bithorax complex of Drosophila is not essential for its contribution to metameric identity</p></title><aug><au><snm>Tiong</snm><fnm>SYK</fnm></au><au><snm>Whittle</snm><fnm>JRS</fnm></au><au><snm>Gribbin</snm><fnm>MC</fnm></au></aug><source>Development</source><pubdate>1987</pubdate><volume>101</volume><fpage>135</fpage><lpage>142</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">3449364</pubid></xrefbib></bibl><bibl id="B86"><title><p>Maintenance of ancestral complexity and non-metazoan genes in two basal cnidarians</p></title><aug><au><snm>Technau</snm><fnm>U</fnm></au><au><snm>Rudd</snm><fnm>S</fnm></au><au><snm>Maxwell</snm><fnm>P</fnm></au><au><snm>Gordon</snm><fnm>PM</fnm></au><au><snm>Saina</snm><fnm>M</fnm></au><au><snm>Grasso</snm><fnm>LC</fnm></au><au><snm>Hayward</snm><fnm>DC</fnm></au><au><snm>Sensen</snm><fnm>CW</fnm></au><au><snm>Saint</snm><fnm>R</fnm></au><au><snm>Holstein</snm><fnm>TW</fnm></au><au><snm>Ball</snm><fnm>EE</fnm></au><au><snm>Miller</snm><fnm>DJ</fnm></au></aug><source>Trends Genet</source><pubdate>2005</pubdate><volume>21</volume><fpage>633</fpage><lpage>639</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tig.2005.09.007</pubid><pubid idtype="pmpid" link="fulltext">16226338</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>
