FIGURE 2 - Amino acid sequence alignments of aromatic amino acid hydroxylase (AAAH) proteins.
The top sequence, that of the C. elegans AAAH gene described here, is written in full. Beneath this sequence, identities are indicated with a dash (-), gaps with a period (.), and differing amino acids with the appropriate letter. The entire sequence of each protein is shown, except for ZK1290.2 (Ce TrpH). Protein sequences for ZK1290.2 and B0432.5 are predicted from GSC genomic sequence (as found in GenBank); no cDNA sequences are available. Predictions for the 5' end of ZK1290.2 in particular suggest a much longer N-terminal domain than is typical of TrpH proteins. Some of the proteins have multiple isoforms derived from alternative splicing; in those cases, the most easily aligned and most highly conserved form of the protein was used. Alignments were initially done with the GCG pileup program; much of the 5' sequence was aligned "manually," with the aid of various pairwise alignments made using SIM (http://www.expasy.ch/sprot/sim-prot.html; Huang and Miller, 1991).
 
K08F8.4                                                                            MPPAGQDDLDFLKYAMESYVADVNA.....DIGKTTIV
Rat PheH                                                                      MAAVVLENGVLSRKLSDFGQET--IE-NSN.....QN-AISLI
Fly PheH/TrpH                                                                      -YQRQVSFDKPTRVEDSA-IVEGVDIKEARNTCLLFSP
ZK1290.2 (Ce TrpH)    .EFQKHHAIGYLKEEMASAMKFQYYSKKAAGKTMSNSVSMSSDNRMEDFKRRFRRSGSLGIPFVPEE-VKQLFTPTRTVRREASIREGDEEEGVQILTI
Rat TrpH                                                                                            MIEDNKENKDHSSER-RV-LI
Fly TH                        MMAVAAAQKNREMFAIKKSYSIENGYPSRRRSLVDDARFETLVVKQTKQTVLEEARSKAN-YGLTEDEILLAN-ASESSDAEAAMQSAAL-
Rat TH                MPTPSAPSPQPKGFRRAVSEQDAKQAEAVTSPRFIGRRQSLIEDARKE............REAA-AAAAAAVASSEPGNPLEA-VFEERDGNAVLNLLF


K08F8.4              FTLREKAGALAETLKLFQAHDVNLSHIESRPSRLMKDAMRCSLNLLKLKTI..........VRLKELLSISNKKLKRRFLFKTGTPKTKQNKDSVPWFPQ
Rat PheH             -S-K-EV----KV-R--EEN-I--T---------N--EYEFFTY-D-...............-T-PV-GSII-S-RNDIGATVHELSRDKE-NT-----R
Fly PheH/TrpH        KDSSLSS----NI-AI-KK--I--V-----S-.-RVPGYEFFVEADG...............KSGA-GKAIEDVKEQCSY-NIISRDY-D-ATA-----R
ZK1290.2 (Ce TrpH)   IVKSSRVSEDISKMIANLPDHTRIK-L-T-D-QDGSSKTMDV-LEIE-FHYGKQEAMDLMRLNGLDVHEV-STIRPTAIKEQYTE-GSDDATTGSE---K
Rat TrpH             -S-KNEV-G-IKA--I--ENH---L-----K-KRRNSEFEIFVDCDI................NR-Q-NDIFPL--SHTTVLSLP.EKEDVMET-----K
B0432.5  (Ce TH)                     MSSAK-QIC-V-T-GNEASH-VLLACKATKN..............QLIHSAELLTQNHVALTKFSIFAKKLSDEKNQ-QI---R
Fly TH               VR-K-GISS-GRI--AIETFHGTVQ-V---Q--VEGVDHDVLIK-DM...............TRGN--QLIRSLRQSGSFSSMNLMADNNLNVKA----K
Rat TH               SLRGT-PSS-SRAV-V-ETFEAKIH-L-T--AQRPLAGSPHLEY..................FVRFEVPSGDLAALLSSVRRVSDDVRSARE-K-----R


K08F8.4              KINDIDQFANRILSYGAELDADHPGFKDMTYRERRKFFADIAFNFKHGDKIPTITYTDEEIATWRTVYNELTVMYPKNACQEFNYIFPLLQQNCGFGPDR
Rat PheH             T-QEL-R---Q-----------------PV--A---Q-----Y-YR--QP--RVE--E--KQ--G--FRT-KAL-KTH--Y-H-H-----EKY---RE-N
Fly PheH/TrpH        R-R-L-R---Q-----S--------RT-PE--K---Y-----Y-Y---EPL-HVD--K---E--GIIFRN--KL-KTH--R-Y-HV----VD----RE-N
ZK1290.2 (Ce TrpH)   S-YEL-IC-K-VIM---G----------TE--Q--MM--EL-L-Y-Q--P--RTE--SS-RK--GII-RK-RELHK-HA-KQ-LDN-E--ERH--YSENN
Rat TrpH             --S-L-FC---V-L--S-----------NV--R---Y--EL-M-Y----P--K-EF-E---K--G-IFR--NKL--TH--R-YLRNL---SKY--YRE-N
B0432.5  (Ce TH)     H-SEL-QCSKC-TK-EPTT-PR---HG-VA-IA----LN-Q-LE--F--E-GYVD--E--H---KA--EK-GDLHLSHT-AVYRQNLKI--EEKVLTA--
Fly TH               HASEL-NCNHLMTK-EPD--MN----A-KV--Q----I-E---AY-Y--P--F-D-S-V-VK---S-FKTVQDLA--H--A-YRAA-QK--DEQI-VET-
Rat TH               -VSEL-KCHHLVTK-DPD--L-----S-QV--Q---LI-E---QY---EP--HVE--A------KE--VT-KGL-ATH--R-HLEG-Q--ERY--YRE-S


K08F8.4              IPQLQDVSDFLKDCTGYTIRPVAGLLSSRDFLAGLAFRVFHSTQYIRHHSAPKYTPEPDICHELLGHVPLFADVEFAQFSQEIGLASLGAPDDVIEKLAT
Rat PheH             ----E---Q--QT---FRL-------------GS-------C------G-K-M------------------S-RS-----------------EY------
Fly PheH/TrpH        ----E-L-N--R----F-L-----------------------------P-K-M------V----M--------PA------------------Y----S-
ZK1290.2 (Ce TrpH)   ----E-ICK---GK--FRV-----Y--A--------Y---FC---V---AE-F------TV---M--MA----PD---------------SEEDLK----
Rat TrpH             V---E---N---ER--FS------Y--P----SG-------C---V--S-D-L------T----------L-EPS---------------SEETVQ----
B0432.5  (Ce TH)     ---IR--NK--Q-K--FEL--CS----A-----S------QT-T-L---KS-HHS----LI--------M-S-PLL--M--D---M----S-EH----S-
Fly TH               L----EM----R-N--FSL--A----TA-----S----I-QS---V--VNS-YH-----SI------M--L--PS---------------S-EE----S-
Rat TH               ----E---R---ER--FQL--------A-----S------QC------A-S-MHS----C---------ML--RT------D--------S-EE----S-


K08F8.4              LYWFTIEFGICQQDGE.............KKAYGAGLLSSFGELQYALSDKPEVVDFDPAVCCVTKYPITEYQPKYFLAESFASAKNKLKSWAATINRPF
Rat PheH             I----V---L-KEGDS.............L----------------C-----KLLPLELEKTACQE-SV--F--L-YV----SD--E-VRTF----P---
Fly PheH/TrpH        IFS--V-Y-L-R-E--.............L----------Y---E-C-T---QLK--E-E-TG-------QF--L-YV-D--ET--E-TIKF-NS-P---
ZK1290.2 (Ce TrpH)   --F-S----LSSD-AADSPVKENGSNHERF-V--------A----H-VEGSATIIR---DRVVEQECL--TF-SA--YTRN-EE-QQ--RMFTNNMK---
Rat TrpH             C-F--V---L-K---Q.............LRVF-------IS--RH---GHAK-KP---K-A-KQECL--SF-DV--VS---ED--E-MREF-K-VK---
B0432.5  (Ce TH)     V---IV---L-KE--K.............L--I------AY---MH-C--A--HK------TA-Q--EDDD---L--V-D-IHD-LA--RKY-SSMD---
Fly TH               V----V---L-KEH-Q.............I----------Y---LH-I---C-HRA-E--STA-QP-QDQ----I-YV----ED--D-FRR-VS-MS---
Rat TH               V----V---L-K-N--.............L----------Y---LHS--EE---RA---DTAA-QP-QDQT---V--VS---ND--D--RNY-SR-Q---

K08F8.4              QIRYNAYTQRVEILDKVAALQRLARDIRSDISTLEEALGKVNNLKMK@
Rat PheH             SV--DP------V--NTQQ-KI--DS-N-EVGI-CANLQ-IKS@
Fly PheH/TrpH        GV-------S--V--SKPQISN-MDN-N-EFQI-QN-VANCASE@
ZK1290.2 (Ce TrpH)   IV---P--ES--V-NNSRSIMLAVNSL----NL-AG-ALHYIL@
Rat TrpH             GVK--P---SIQV-RDSKSITSAMNEL-H-LDVVND--AR-SRWPSV@
B0432.5  (Ce TH)     SVV-DPF-KSI-AIESS-D-EKAFSRLSN-L-AITH-ADRMKISITM@
Fly TH               EV-FNPH-E---V--S-DK-ET-VHQMNTE-LH-TN-IS-LRRPF@
Rat TH               SVKFDP--LAIDV--SPHTI--SLEGVQ--ELHTLAHALSAIS@
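
The identity and gap convention described in the legend can be decoded mechanically. The following Python sketch is illustrative only and is not part of the original analysis; the function names expand_row and ungap are hypothetical. It assumes that the two rows passed in cover exactly the same alignment columns, and that a dash over a gap in the top sequence simply yields a gap.

```python
def expand_row(reference: str, row: str) -> str:
    """Expand a dash-encoded alignment row against the fully written top row.

    A dash (-) means "same residue as the top sequence", so it is replaced
    by the reference character in that column. Periods (gaps) and explicit
    residue letters are kept as they are.
    """
    if len(reference) != len(row):
        raise ValueError("both rows must cover the same alignment columns")
    expanded = []
    for ref_char, char in zip(reference, row):
        # Assumption: a dash over a reference gap is copied through as a gap.
        expanded.append(ref_char if char == "-" else char)
    return "".join(expanded)


def ungap(aligned: str) -> str:
    """Remove gap periods to recover the plain protein sequence."""
    return aligned.replace(".", "")


if __name__ == "__main__":
    # First 12 columns of the last alignment block: K08F8.4 and Rat PheH.
    k08f8_4 = "QIRYNAYTQRVE"
    rat_pheh = "SV--DP------"
    print(expand_row(k08f8_4, rat_pheh))  # SVRYDPYTQRVE
```

Applied block by block and followed by ungap, this recovers each full protein sequence from the compact notation, provided the column positions of the original fixed-width layout are preserved.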