CmoCh19G000540 (gene) Cucurbita moschata (Rifu)

Name: CmoCh19G000540
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: TRYPTOPHAN SYNTHASE ALPHA CHAIN family protein
Location: Cmo_Chr19 : 304735 .. 308708 (+)
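As a small illustration of how the Location field can be read, the sketch below parses the coordinate string and computes the span of the feature. It assumes the coordinates are 1-based and inclusive (a common convention for genome annotation, not something this page states), and the helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse 'chrom : start .. end (strand)' into its four parts."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Cmo_Chr19 : 304735 .. 308708 (+)")
# Assumption: both endpoints are included in the feature.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature spans 3974 positions on the plus strand of Cmo_Chr19.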
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, CDS, three_prime_UTR
ATGGACATTGCTTTCAATTCTTCGTCGTTACTTCGATTGAACAAGATTCCCACTGCCCTTGTCTTTCCCACTCATCCTCGTAAGATTTCAGTCGTCCAATCCAAGCGGTTTGTTCCAATGGCGGCCCTTACAGTTTCCAATGCTATAGGACTGTCGGAAACATTCAGCAAATTGAAGGAACAGGGCAAGGTAAATTTGTTCAACCGTTTATTATCTCCAACTTTAATTTTTTGACACTTCGTTTCTGGGTTGTTTGTTTATATTTGCTCTTGTTTTGTTGAATTATACTTCTTCTGCATTTTGGTGTTCACTGTTTGATACTCTTGAGAGTCCTCATTTCAATTCAATCTAATCAATCCAAGTCTTTACTGGGTTAAATTCTTTCACCAATGCTTAATTGGGATTGATCATGAGATGGGTTAGAGTTCTTCATTCATCAATGTTGTTCTAAATTCATACATGTTTTAAAGGTCGCATTTATCCCCTACATAACTGCTGGTGATCCTGATCTTTCCACCACGGCTGAGGCATTGAAGGTTCTTTCCAACCATGGATCAGATATAATTGAACTTGGTGTTCCATATTCAGATCCCTTGGCAGATGGCCCTGTTATACAGGTTGAATATCTCTCTATCTCTCTTCATTTTTTTCATTCTTAATGTATTGTTCCTTATTCAATTGTCTTTGAATGCGTTCTGTAGGCTGCAGCGACTCGGTCCTTGGCTAGGGGGACAAACTTTAGTGCAATCATTTCTATGCTGAAGGAGGTACTCTCGTTGTACCATGAAACGTGCTGTAGATAATATTAATAGTTTTTCCTCTTCCAATTTTGATCTCAAATTATTTTAGGTAATAGAAAGAATAACTGTCACTTATATAACGACTTAACGCACACCAAATGTGTTATTTTGCGAACTAGGTATAATGAGGTATAGTAACCTAGTCATTAACATTTAAGGTTGTCGTTTGAGAATTTGTTTAATTAGATAATACTGGAACGGAGTCACACTCGAAGCTGCATTTCACCAGGCTGTAAAATTTGTCTTTCCTTTAACGAAGAAATATGTACGTGTATACTTAAATTAGCTCTTGCACTTCTTTTTCGTTAGGAAACTTGAGTTCTCGTGCACGGCATCATCAAATTAAATGTCTTTTGGAATAAGCTTCTGTCTCTATATGCTTCACTGCCATTTAGTTTCTCATTAATGGAATTGAAAGTTGACATATTTTTGTTTTGTTTCCGGCTGCTTAGGTTATTCCTGAACTGTCTTGCCCAATTGCCCTTTTTTCATACTATAACCCAATTCTTAAGCGTGGCGTCGAGAACTTCATGATGACAATAAGAGATGCTGGTGTACATGGTAATTAATGAATTAACCCAATCTTTTATAATGAATTATGTTCAAACTTTTTATCAATTTTTTTTGCCCTACTTTCAAGATAACATGACAATTAGGTCAATTGTCTGGTACTAAATGTTATGGGTTGGCATTTGCTCTAACTTGCTAGAAATGTTGGTTTATTTAGTAGAACATCATTAAATTGTCATTTTCTTTAACCCAGAGAACGGTACAAAACTGAAGTATGTATTAAATTGTCATTATATCATTGCTTAATGAACATGATATTCCAAAATCCTTGTTTCAAATTTATATGTGCACTTGGTGTGAGGTTCTAATGAGCTTGTTTCTGAAATCATGAACTTTGCGAACAACTTCTTTTCTCTTAATGAGAAAAGAGAGGACAGAGGCATATAGGAAGATTATCTGTCTGCTTCACTATTATGTGATTATCTAACTTAATATCATCAACTTCGTGATGAGTCAATGACTATTTGAAAATGAATAATGTTACAACTCTAAATGTAATTTTTCTTGCTTTACAAACTTCCATGGAAAGAAACCTTTCTGAATCTTCGGTTCGTGAATATGTTTGTAATTACATTGCACTTTCTTGCAGGGCTTGTTGTTCCGGACGTTCCTCTGGAAGAGACTGAAATT
TTGAGAAAAGAAGCTGTAAAGTACAACATCGAGCTGGTGAATTCCTTTTCTCATAATGGTTGTATTATCTAAATTGATTATTTGTTTTAGGTTTTTGATATTTATTTATGATTTTTACTTAAAGATATTGCACATAGTGGCAAAAACAACCTCTGAAATGGACCTAGCAAATGGAACAGTCCAAAGATCTTGATAACTTCCCCGTTGATTCCACACGTAAATTAGGGACATAATCTGCAGTGCTGAAGATTAGTATATTATTACTTGGTTGAACCTGCAGAGTATCCAGTTTTCTTGATCACGATTGTTAAGGCGGAAAACAGTTGCCTCGAGTACTGAAAATTATTTGATTGTTGAACTGTACTTCCCTGCATTCTTTTCATATCTGTAGAATGAAATTATTAAGGTAAAAAGTAGCCAATTCGACTCGTTTGTCTAAAGATAGGATGATTAGTTGTTATACATCATCCATTTGTTTGGTAATGCTTTGTTCACGAAATCAGAAAAACGGGCGAAATCTGAAGAACAGGTGGAGTTGTTATACATCATCATTAGTTGTTATACATCATCATTAGTTGTTATACATCATCCATTTGTGTGGTAATGCTTTGTTCACAAATTCAGAAGAACAGGCAGAGTGTATTGTTTCTAGATATTTGCTGGTTCTTATGAATTTTGTACTCTGCTAAAGGTACTCTTAACTACACCGACCACGCCTAAAGATAGGATGAAAGCGATCGTTGAAGCTTCAGAGGGATTTGTGTATCTTGTAAGACCTTTTTCTGTTTTTGTCTCTTTATATGCAAATAAGTTGTAGCTTTTTCTTGTTGTTCATGGTGTCATTGAACTTGAAATTGTCTTGCTTGTAACATTACCATGACGGCACATCTAGAGTGTTCCAAAATTATTTTTCAGAATGGTATTTTAACAATGTGATTATTTTCAGAACCCTCTTTTTGATGGAGTTAAGTTATAGAACAACAACAGAATTGTGTAGGAAGAAGAGAGATCGTTTTCGGTTTTGTTGGAGCATATTGCTTTAATCGCTTTGTTTTCGTGCACAAACGGAATGATAGATTGCATTGAAAGAACAAGGAGTTTTGGTGAAGATAGTAAATTACAGTTATTCTCTATGTTACTTCCTCTGTATTATACCATTGAGATCGAGTCGAAATTACGATGTCCTTCTCTGTCAGACTTGCAATATGAAATTTGTTTTCGTTCGATTATAGGTGAGCTCCGTAGGAGTTACCGGTGCCCGTACATCAGTGAGTGATCGAGTTCAAACTCTCCTTGAGGAAATCAAAGAGGTGCATCTTCTTCAGTCTTAATAACTATCTAGGATGTACTTGTAATAGCTACAAATTTGAAAGCTGATGACTTTGAAACCAATCAATAAAATAAGCCTTACACTCTCTACTTTGTTTGGCAGGCAACAGAAAAACCAGTGGCAGTAGGTTTCGGTATATCAAAACCCGAGCACGTGAAACAGGTAAAAGAATTTCGTCATACACTGAAACAGTTTACTTCTCCTGCATTTTTTCTGGATAATATGTTGTTGTGTTCGTCCATAGGTGGCCAACTGGGGGGCTGATGGGATCATAGTTGGTAGTGCTATGGTGAAGCTGTTGGGAGAAGCTCAGACTCCTGAAGAAGGATTGAAGGCACTAGAAAGCTTCACCAAATCCTTAAAATCTGCTCTCTCCTAATCAAATTTTGATGGAATAAGAGGTCAGCTAGCTCTATCACTGAAGCCTGCTAGTGTTTTACAGTCATCTTTAAATGATACTTAAAACATTTCTCTTTCTTTTTAGCTGAATAAGACTGTCAGTTTAGTTCTGTTTAAGGATATTGCTTGGTGTTTTAGCTAATATAAGTTTTGTTCTTTCTGGTTCAAAAGGTTCCACTCTCATGCCTGCATTTTTGTTAAGGTTTTGAGTATAGGAAGCTCTTTAGTTGAAGGACGTTTCAAAC

mRNA sequence

ATGGACATTGCTTTCAATTCTTCGTCGTTACTTCGATTGAACAAGATTCCCACTGCCCTTGTCTTTCCCACTCATCCTCGTAAGATTTCAGTCGTCCAATCCAAGCGGTTTGTTCCAATGGCGGCCCTTACAGTTTCCAATGCTATAGGACTGTCGGAAACATTCAGCAAATTGAAGGAACAGGGCAAGGTCGCATTTATCCCCTACATAACTGCTGGTGATCCTGATCTTTCCACCACGGCTGAGGCATTGAAGGTTCTTTCCAACCATGGATCAGATATAATTGAACTTGGTGTTCCATATTCAGATCCCTTGGCAGATGGCCCTGTTATACAGGCTGCAGCGACTCGGTCCTTGGCTAGGGGGACAAACTTTAGTGCAATCATTTCTATGCTGAAGGAGGTTATTCCTGAACTGTCTTGCCCAATTGCCCTTTTTTCATACTATAACCCAATTCTTAAGCGTGGCGTCGAGAACTTCATGATGACAATAAGAGATGCTGGTGTACATGGGCTTGTTGTTCCGGACGTTCCTCTGGAAGAGACTGAAATTTTGAGAAAAGAAGCTGTAAAGTACAACATCGAGCTGGTACTCTTAACTACACCGACCACGCCTAAAGATAGGATGAAAGCGATCGTTGAAGCTTCAGAGGGATTTGTGTATCTTGTGAGCTCCGTAGGAGTTACCGGTGCCCGTACATCAGTGAGTGATCGAGTTCAAACTCTCCTTGAGGAAATCAAAGAGGCAACAGAAAAACCAGTGGCAGTAGGTTTCGGTATATCAAAACCCGAGCACGTGAAACAGGTGGCCAACTGGGGGGCTGATGGGATCATAGTTGGTAGTGCTATGGTGAAGCTGTTGGGAGAAGCTCAGACTCCTGAAGAAGGATTGAAGGCACTAGAAAGCTTCACCAAATCCTTAAAATCTGCTCTCTCCTAATCAAATTTTGATGGAATAAGAGGTCAGCTAGCTCTATCACTGAAGCCTGCTAGTGTTTTACAGTCATCTTTAAATGATACTTAAAACATTTCTCTTTCTTTTTAGCTGAATAAGACTGTCAGTTTAGTTCTGTTTAAGGATATTGCTTGGTGTTTTAGCTAATATAAGTTTTGTTCTTTCTGGTTCAAAAGGTTCCACTCTCATGCCTGCATTTTTGTTAAGGTTTTGAGTATAGGAAGCTCTTTAGTTGAAGGACGTTTCAAAC

Coding sequence (CDS)

ATGGACATTGCTTTCAATTCTTCGTCGTTACTTCGATTGAACAAGATTCCCACTGCCCTTGTCTTTCCCACTCATCCTCGTAAGATTTCAGTCGTCCAATCCAAGCGGTTTGTTCCAATGGCGGCCCTTACAGTTTCCAATGCTATAGGACTGTCGGAAACATTCAGCAAATTGAAGGAACAGGGCAAGGTCGCATTTATCCCCTACATAACTGCTGGTGATCCTGATCTTTCCACCACGGCTGAGGCATTGAAGGTTCTTTCCAACCATGGATCAGATATAATTGAACTTGGTGTTCCATATTCAGATCCCTTGGCAGATGGCCCTGTTATACAGGCTGCAGCGACTCGGTCCTTGGCTAGGGGGACAAACTTTAGTGCAATCATTTCTATGCTGAAGGAGGTTATTCCTGAACTGTCTTGCCCAATTGCCCTTTTTTCATACTATAACCCAATTCTTAAGCGTGGCGTCGAGAACTTCATGATGACAATAAGAGATGCTGGTGTACATGGGCTTGTTGTTCCGGACGTTCCTCTGGAAGAGACTGAAATTTTGAGAAAAGAAGCTGTAAAGTACAACATCGAGCTGGTACTCTTAACTACACCGACCACGCCTAAAGATAGGATGAAAGCGATCGTTGAAGCTTCAGAGGGATTTGTGTATCTTGTGAGCTCCGTAGGAGTTACCGGTGCCCGTACATCAGTGAGTGATCGAGTTCAAACTCTCCTTGAGGAAATCAAAGAGGCAACAGAAAAACCAGTGGCAGTAGGTTTCGGTATATCAAAACCCGAGCACGTGAAACAGGTGGCCAACTGGGGGGCTGATGGGATCATAGTTGGTAGTGCTATGGTGAAGCTGTTGGGAGAAGCTCAGACTCCTGAAGAAGGATTGAAGGCACTAGAAAGCTTCACCAAATCCTTAAAATCTGCTCTCTCCTAA
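To show how the CDS relates to the protein used as the BLAST query, here is a minimal translation sketch. It assumes the standard nuclear genetic code and uses only the first 54 nucleotides of the CDS above; the names `CODON_TABLE` and `translate` are illustrative and not part of any database tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a nucleotide string codon by codon; unknown codons give 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First 54 nt of the CDS listed above.
cds_start = "ATGGACATTGCTTTCAATTCTTCGTCGTTACTTCGATTGAACAAGATTCCCACT"
print(translate(cds_start))  # MDIAFNSSSLLRLNKIPT
```

The result agrees with residues 1 to 18 of the query lines in the alignments that start at position 1.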
BLAST of CmoCh19G000540 vs. Swiss-Prot
Match: TRPA1_ARATH (Tryptophan synthase alpha chain OS=Arabidopsis thaliana GN=TRPA1 PE=1 SV=2)

HSP 1 Score: 430.3 bits (1105), Expect = 1.9e-119
Identity = 215/268 (80.22%), Positives = 246/268 (91.79%), Query Frame = 1

Query: 44  TVSNAIGLSETFSKLKEQGKVAFIPYITAGDPDLSTTAEALKVLSNHGSDIIELGVPYSD 103
           T S+ +GLSETF++LK QGKVA IPYITAGDPDLSTTA+ALKVL + GSDIIELGVPYSD
Sbjct: 6   TPSSTVGLSETFARLKSQGKVALIPYITAGDPDLSTTAKALKVLDSCGSDIIELGVPYSD 65

Query: 104 PLADGPVIQAAATRSLARGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRGVENFMMT 163
           PLADGP IQAAA RSL +GTNF++IISMLKEVIP+LSCPIALF+YYNPIL+RGVEN+M  
Sbjct: 66  PLADGPAIQAAARRSLLKGTNFNSIISMLKEVIPQLSCPIALFTYYNPILRRGVENYMTV 125

Query: 164 IRDAGVHGLVVPDVPLEETEILRKEAVKYNIELVLLTTPTTPKDRMKAIVEASEGFVYLV 223
           I++AGVHGL+VPDVPLEETE LR EA K+ IELVLLTTPTTPK+RM AIVEASEGF+YLV
Sbjct: 126 IKNAGVHGLLVPDVPLEETETLRNEARKHQIELVLLTTPTTPKERMNAIVEASEGFIYLV 185

Query: 224 SSVGVTGARTSVSDRVQTLLEEIKEATEKPVAVGFGISKPEHVKQVANWGADGIIVGSAM 283
           SSVGVTG R SV+++VQ+LL++IKEAT KPVAVGFGISKPEHVKQVA WGADG+IVGSAM
Sbjct: 186 SSVGVTGTRESVNEKVQSLLQQIKEATSKPVAVGFGISKPEHVKQVAEWGADGVIVGSAM 245

Query: 284 VKLLGEAQTPEEGLKALESFTKSLKSAL 312
           VK+LGE+++PE+GLK LE FTKSLKSAL
Sbjct: 246 VKILGESESPEQGLKELEFFTKSLKSAL 273
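The percentages in an HSP header are simple ratios over the alignment length. The sketch below recomputes them from the header line of the alignment above; the function name `hsp_fractions` is hypothetical, and the regular expression is an assumption about this page's text layout rather than a general BLAST parser.

```python
import re

def hsp_fractions(line):
    """Return {label: (count, length, percent)} for 'Label = a/b (p%)' fields."""
    stats = {}
    for label, count, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        count, length = int(count), int(length)
        stats[label] = (count, length, round(100 * count / length, 2))
    return stats

header = "Identity = 215/268 (80.22%), Positives = 246/268 (91.79%), Query Frame = 1"
stats = hsp_fractions(header)
print(stats["Identity"])   # (215, 268, 80.22)
print(stats["Positives"])  # (246, 268, 91.79)
```

Identity counts exact residue matches, while Positives also counts the conservative substitutions shown as `+` in the middle line of the alignment.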

BLAST of CmoCh19G000540 vs. Swiss-Prot
Match: TRPA2_ARATH (Tryptophan synthase alpha chain, chloroplastic OS=Arabidopsis thaliana GN=TSA1 PE=1 SV=1)

HSP 1 Score: 423.7 bits (1088), Expect = 1.7e-117
Identity = 214/283 (75.62%), Positives = 252/283 (89.05%), Query Frame = 1

Query: 30  SVVQSKRFVPMAALTVSN-AIGLSETFSKLKEQGKVAFIPYITAGDPDLSTTAEALKVLS 89
           S +  KRF PMA+L+ S+  +GL++TF++LK+QGKVAFIPYITAGDPDLSTTAEALKVL 
Sbjct: 29  SSLSFKRFTPMASLSTSSPTLGLADTFTQLKKQGKVAFIPYITAGDPDLSTTAEALKVLD 88

Query: 90  NHGSDIIELGVPYSDPLADGPVIQAAATRSLARGTNFSAIISMLKEVIPELSCPIALFSY 149
             GSDIIELGVPYSDPLADGPVIQAAATRSL RGTN  +I+ ML +V+P++SCPI+LF+Y
Sbjct: 89  ACGSDIIELGVPYSDPLADGPVIQAAATRSLERGTNLDSILEMLDKVVPQISCPISLFTY 148

Query: 150 YNPILKRGVENFMMTIRDAGVHGLVVPDVPLEETEILRKEAVKYNIELVLLTTPTTPKDR 209
           YNPILKRG+  FM +IR  GV GLVVPDVPLEETE+LRKEA+  +IELVLLTTPTTP +R
Sbjct: 149 YNPILKRGLGKFMSSIRAVGVQGLVVPDVPLEETEMLRKEALNNDIELVLLTTPTTPTER 208

Query: 210 MKAIVEASEGFVYLVSSVGVTGARTSVSDRVQTLLEEIKEATEKPVAVGFGISKPEHVKQ 269
           MK IV+ASEGF+YLVSS+GVTGAR+SVS +VQ+LL++IKEAT+KPVAVGFGISKPEHVKQ
Sbjct: 209 MKLIVDASEGFIYLVSSIGVTGARSSVSGKVQSLLKDIKEATDKPVAVGFGISKPEHVKQ 268

Query: 270 VANWGADGIIVGSAMVKLLGEAQTPEEGLKALESFTKSLKSAL 312
           +A WGADG+IVGSAMVKLLG+A++P EGLK LE  TKSLKSAL
Sbjct: 269 IAGWGADGVIVGSAMVKLLGDAKSPTEGLKELEKLTKSLKSAL 311

BLAST of CmoCh19G000540 vs. Swiss-Prot
Match: TRPA_MAIZE (Indole-3-glycerol phosphate lyase, chloroplastic OS=Zea mays GN=BX1 PE=1 SV=2)

HSP 1 Score: 322.8 bits (826), Expect = 4.2e-87
Identity = 156/261 (59.77%), Positives = 202/261 (77.39%), Query Frame = 1

Query: 51  LSETFSKLKEQGKVAFIPYITAGDPDLSTTAEALKVLSNHGSDIIELGVPYSDPLADGPV 110
           +S+T + L  +GK AFIPYITAGDPDL+TTAEAL++L   G+D+IELGVP SDP  DGP+
Sbjct: 90  VSDTMAALMAKGKTAFIPYITAGDPDLATTAEALRLLDGCGADVIELGVPCSDPYIDGPI 149

Query: 111 IQAAATRSLARGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRGVENFMMTIRDAGVH 170
           IQA+  R+LA GT   A++ ML+EV PELSCP+ L SYY PI+ R +      +++AGVH
Sbjct: 150 IQASVARALASGTTMDAVLEMLREVTPELSCPVVLLSYYKPIMSRSLAE----MKEAGVH 209

Query: 171 GLVVPDVPLEETEILRKEAVKYNIELVLLTTPTTPKDRMKAIVEASEGFVYLVSSVGVTG 230
           GL+VPD+P      L  EA   N+ELVLLTTP  P+DRMK I +ASEGFVYLVS  GVTG
Sbjct: 210 GLIVPDLPYVAAHSLWSEAKNNNLELVLLTTPAIPEDRMKEITKASEGFVYLVSVNGVTG 269

Query: 231 ARTSVSDRVQTLLEEIKEATEKPVAVGFGISKPEHVKQVANWGADGIIVGSAMVKLLGEA 290
            R +V+ RV++L++E+K+ T KPVAVGFGISKPEHVKQ+A WGADG+I+GSAMV+ LGEA
Sbjct: 270 PRANVNPRVESLIQEVKKVTNKPVAVGFGISKPEHVKQIAQWGADGVIIGSAMVRQLGEA 329

Query: 291 QTPEEGLKALESFTKSLKSAL 312
            +P++GL+ LE + + +K+AL
Sbjct: 330 ASPKQGLRRLEEYARGMKNAL 346

BLAST of CmoCh19G000540 vs. Swiss-Prot
Match: TRPA_CYAP8 (Tryptophan synthase alpha chain OS=Cyanothece sp. (strain PCC 8801) GN=trpA PE=3 SV=1)

HSP 1 Score: 309.3 bits (791), Expect = 4.8e-83
Identity = 154/261 (59.00%), Positives = 196/261 (75.10%), Query Frame = 1

Query: 51  LSETFSKLKEQGKVAFIPYITAGDPDLSTTAEALKVLSNHGSDIIELGVPYSDPLADGPV 110
           +S+ F  L+++ + A IP+ITAGDPDL TTA+AL++L   G+D+IELGVPYSDPLADGPV
Sbjct: 4   VSDCFQSLRDRRQCALIPFITAGDPDLETTAKALRLLDASGADLIELGVPYSDPLADGPV 63

Query: 111 IQAAATRSLARGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRGVENFMMTIRDAGVH 170
           IQAAATR+L RG     ++ ++KEV PE+  PI LF+YYNPI  RGVE F+  ++ AGV 
Sbjct: 64  IQAAATRALGRGVKLEDVLGVVKEVSPEIKAPIILFTYYNPIFYRGVEAFLQQVKAAGVQ 123

Query: 171 GLVVPDVPLEETEILRKEAVKYNIELVLLTTPTTPKDRMKAIVEASEGFVYLVSSVGVTG 230
           GLVVPD+PLEE E L K A +  I + LL  PT+P +R++AI   S+GF+YLVS  GVTG
Sbjct: 124 GLVVPDLPLEEAESLLKPAHEVGIAVTLLVAPTSPIERIEAIARQSQGFIYLVSVTGVTG 183

Query: 231 ARTSVSDRVQTLLEEIKEATEKPVAVGFGISKPEHVKQVANWGADGIIVGSAMVKLLGEA 290
            R+ V+ RV+ LL  ++ AT+KP+ VGFGISKPEH  QV NWGAD +IVGSAMVK L E 
Sbjct: 184 MRSQVTSRVKELLTSLRSATDKPIGVGFGISKPEHALQVKNWGADAVIVGSAMVKRLAEG 243

Query: 291 QTPEEGLKALESFTKSLKSAL 312
            TPEEGLKA+ +F + LK AL
Sbjct: 244 -TPEEGLKAIGAFCQDLKQAL 263

BLAST of CmoCh19G000540 vs. Swiss-Prot
Match: TRPA_CYAP4 (Tryptophan synthase alpha chain OS=Cyanothece sp. (strain PCC 7425 / ATCC 29141) GN=trpA PE=3 SV=1)

HSP 1 Score: 303.5 bits (776), Expect = 2.6e-81
Identity = 149/264 (56.44%), Positives = 191/264 (72.35%), Query Frame = 1

Query: 49  IGLSETFSKLKEQGKVAFIPYITAGDPDLSTTAEALKVLSNHGSDIIELGVPYSDPLADG 108
           + +S  FS L+++ + A IP+ITAGDP L  TA+AL+VL   G+D+IELGVPYSDPLADG
Sbjct: 2   VSVSTCFSALRDRAQCALIPFITAGDPSLEITAKALQVLDQQGADLIELGVPYSDPLADG 61

Query: 109 PVIQAAATRSLARGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRGVENFMMTIRDAG 168
           P IQAAATR+L +GT   A++ M+  V P L  P+ LF+YYNPI  RGVE F+  +  AG
Sbjct: 62  PTIQAAATRALQKGTRLDAVLEMISHVAPNLRSPLILFTYYNPIFHRGVEPFLQQVAQAG 121

Query: 169 VHGLVVPDVPLEETEILRKEAVKYNIELVLLTTPTTPKDRMKAIVEASEGFVYLVSSVGV 228
           V GLVVPD+PLEE + +  +A    IEL LL  PTTP+ R+ AI E S+GF+YLVS+ GV
Sbjct: 122 VQGLVVPDLPLEEADTVLTQAAAVGIELTLLVAPTTPRSRIAAIAERSQGFIYLVSTTGV 181

Query: 229 TGARTSVSDRVQTLLEEIKEATEKPVAVGFGISKPEHVKQVANWGADGIIVGSAMVKLLG 288
           TG R+ V  RV  LL E+++ T+KP+ VGFGIS+PEH +QV  WGAD  IVGSA VK L 
Sbjct: 182 TGMRSKVEGRVHELLLELQQVTDKPIGVGFGISQPEHARQVMEWGADAAIVGSAFVKRLA 241

Query: 289 EAQTPEEGLKALESFTKSLKSALS 313
           E  TPE+GL A+  F +SLK+AL+
Sbjct: 242 EG-TPEQGLAAIADFCRSLKTALT 264

BLAST of CmoCh19G000540 vs. TrEMBL
Match: A0A0A0LKV4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G225330 PE=3 SV=1)

HSP 1 Score: 535.0 bits (1377), Expect = 6.0e-149
Identity = 276/311 (88.75%), Positives = 293/311 (94.21%), Query Frame = 1

Query: 1   MDIAFNSSSLLRLNKIPTALVFPTHPRKISVVQSKRFVPMAALTVSNAIGLSETFSKLKE 60
           MDI  NSS L + NKIPT L+FP HP KISV QSKR VPMA+LT S+A+GLSETFSKLKE
Sbjct: 1   MDIVLNSSRLFQFNKIPTTLIFPPHPCKISVFQSKRVVPMASLTASSAVGLSETFSKLKE 60

Query: 61  QGKVAFIPYITAGDPDLSTTAEALKVLSNHGSDIIELGVPYSDPLADGPVIQAAATRSLA 120
           QGKVAFIPYITAGDPDLSTTAEALKVLS  GSDIIELGVPYSDPLADGPVIQAAATRSLA
Sbjct: 61  QGKVAFIPYITAGDPDLSTTAEALKVLSTSGSDIIELGVPYSDPLADGPVIQAAATRSLA 120

Query: 121 RGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRGVENFMMTIRDAGVHGLVVPDVPLE 180
           RGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRG+ NFM+TI+DAGV GLVVPDVPLE
Sbjct: 121 RGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRGIGNFMLTIKDAGVRGLVVPDVPLE 180

Query: 181 ETEILRKEAVKYNIELVLLTTPTTPKDRMKAIVEASEGFVYLVSSVGVTGARTSVSDRVQ 240
           ETEILRKEAVK++IELVLLTTPTTP+DRMKAIVEASEGFVYLVSSVGVTGAR SVS++VQ
Sbjct: 181 ETEILRKEAVKHSIELVLLTTPTTPRDRMKAIVEASEGFVYLVSSVGVTGARASVSNKVQ 240

Query: 241 TLLEEIKEATEKPVAVGFGISKPEHVKQVANWGADGIIVGSAMVKLLGEAQTPEEGLKAL 300
           TLLEEIKE TEKPVAVGFGISKPEHVKQV++WGADGIIVGSAMVKLLGEAQ+PEEGLKAL
Sbjct: 241 TLLEEIKEVTEKPVAVGFGISKPEHVKQVSSWGADGIIVGSAMVKLLGEAQSPEEGLKAL 300

Query: 301 ESFTKSLKSAL 312
           E+FTKSL SAL
Sbjct: 301 ENFTKSLTSAL 311

BLAST of CmoCh19G000540 vs. TrEMBL
Match: A0A0A0LD81_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G843770 PE=3 SV=1)

HSP 1 Score: 503.4 bits (1295), Expect = 1.9e-139
Identity = 262/312 (83.97%), Positives = 279/312 (89.42%), Query Frame = 1

Query: 1   MDIAFNSSSLLRLNKIPTALVFPTHPRKISVVQSKRFVPMAALTVSNAIGLSETFSKLKE 60
           MDIAF SS +L LNKIP  L+F   P KISV QSKRF PMAAL     +GLSETF  L+E
Sbjct: 1   MDIAFKSSRILPLNKIPNTLIFSPRPCKISVSQSKRFAPMAALAAYPVVGLSETFKNLRE 60

Query: 61  QGKVAFIPYITAGDPDLSTTAEALKVLSNHGSDIIELGVPYSDPLADGPVIQAAATRSLA 120
           QGKVA IPYITAGDPDLSTTAEALKVLS  GSDIIELGVPYSDPLADGPVIQAAATRSLA
Sbjct: 61  QGKVALIPYITAGDPDLSTTAEALKVLSKCGSDIIELGVPYSDPLADGPVIQAAATRSLA 120

Query: 121 RGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRGVENFMMTIRDAGVHGLVVPDVPLE 180
           R TNF+AIISMLK VIPELS PI+LF+YYNPILKRGVENFMM I+D GV GLVVPDVPLE
Sbjct: 121 RETNFNAIISMLKGVIPELSRPISLFTYYNPILKRGVENFMMIIKDTGVRGLVVPDVPLE 180

Query: 181 ETEILRKEAVKYNIELVLLTTPTTPKDRMKAIVEASEGFVYLVSSVGVTGARTSVSDRVQ 240
           ETE+LRKEAVK+NIELVLLTTPTTPK+RMK IVEASEGFVYLVSS+GVTG RTSVS RVQ
Sbjct: 181 ETEVLRKEAVKHNIELVLLTTPTTPKERMKNIVEASEGFVYLVSSIGVTGTRTSVSSRVQ 240

Query: 241 TLLEEIKEATEKPVAVGFGISKPEHVKQVANWGADGIIVGSAMVKLLGEAQTPEEGLKAL 300
           TLLEE+KE TEKPVAVGFGISKPEHVKQVA WGADGII+GSAMVKLLGEAQ+PEEGLK L
Sbjct: 241 TLLEEVKEVTEKPVAVGFGISKPEHVKQVAEWGADGIIIGSAMVKLLGEAQSPEEGLKEL 300

Query: 301 ESFTKSLKSALS 313
           E+FT+SLKSALS
Sbjct: 301 ENFTRSLKSALS 312

BLAST of CmoCh19G000540 vs. TrEMBL
Match: A0A067JC03_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21847 PE=3 SV=1)

HSP 1 Score: 468.0 bits (1203), Expect = 9.0e-129
Identity = 241/305 (79.02%), Positives = 271/305 (88.85%), Query Frame = 1

Query: 7   SSSLLRLNKIPTALVFPTHPRKISVVQSKRFVPMAALTVSNAIGLSETFSKLKEQGKVAF 66
           ++S L L K  T L     P K +VV ++R   MAALTV+ ++GL+ETFS LK++GKVAF
Sbjct: 8   TASFLHLRKPETHLFIRFPPYKSAVVSTRRIAEMAALTVTPSLGLAETFSNLKKEGKVAF 67

Query: 67  IPYITAGDPDLSTTAEALKVLSNHGSDIIELGVPYSDPLADGPVIQAAATRSLARGTNFS 126
           IPYITAGDPDLSTTAEALKVL + GSDIIELGVPYSDPLADGPVIQAAATRSLA+GTNF 
Sbjct: 68  IPYITAGDPDLSTTAEALKVLDSCGSDIIELGVPYSDPLADGPVIQAAATRSLAKGTNFD 127

Query: 127 AIISMLKEVIPELSCPIALFSYYNPILKRGVENFMMTIRDAGVHGLVVPDVPLEETEILR 186
           A+ISMLKEVIP+LS P+ALF+YYNPILKRG+E FM T++D GVHGLVVPDVPLEETE+LR
Sbjct: 128 AVISMLKEVIPQLSSPVALFTYYNPILKRGIEKFMSTVKDVGVHGLVVPDVPLEETELLR 187

Query: 187 KEAVKYNIELVLLTTPTTPKDRMKAIVEASEGFVYLVSSVGVTGARTSVSDRVQTLLEEI 246
           KEAVK NIELVLLTTPTTPK RMKAIV+ASEGFVYLVSSVGVTGAR SVSDRVQTLL+EI
Sbjct: 188 KEAVKNNIELVLLTTPTTPKGRMKAIVKASEGFVYLVSSVGVTGARASVSDRVQTLLQEI 247

Query: 247 KEATEKPVAVGFGISKPEHVKQVANWGADGIIVGSAMVKLLGEAQTPEEGLKALESFTKS 306
           KE T KPVAVGFGISKPEHVKQVA WGADG+IVGSA+VK+LG+A++PEEGLK LE+ TKS
Sbjct: 248 KEETTKPVAVGFGISKPEHVKQVAGWGADGVIVGSAIVKVLGDAKSPEEGLKELENLTKS 307

Query: 307 LKSAL 312
           LKSAL
Sbjct: 308 LKSAL 312

BLAST of CmoCh19G000540 vs. TrEMBL
Match: A0A061DQA0_THECC (Aldolase-type TIM barrel family protein OS=Theobroma cacao GN=TCM_004218 PE=3 SV=1)

HSP 1 Score: 464.9 bits (1195), Expect = 7.6e-128
Identity = 234/281 (83.27%), Positives = 259/281 (92.17%), Query Frame = 1

Query: 32  VQSKRFVPMAALTVSNAIGLSETFSKLKEQGKVAFIPYITAGDPDLSTTAEALKVLSNHG 91
           V +KRF PMA +T +  +GL++TFSKLK QGKVA IPYITAGDPDLSTT EALKVL + G
Sbjct: 32  VSTKRFTPMATVTTATTLGLADTFSKLKTQGKVALIPYITAGDPDLSTTTEALKVLDSCG 91

Query: 92  SDIIELGVPYSDPLADGPVIQAAATRSLARGTNFSAIISMLKEVIPELSCPIALFSYYNP 151
           +DIIELGVPYSDPLADGPVIQAAATRSLARGTNF+AI+SMLKEV+PELSCPIALF+YYNP
Sbjct: 92  ADIIELGVPYSDPLADGPVIQAAATRSLARGTNFNAILSMLKEVVPELSCPIALFTYYNP 151

Query: 152 ILKRGVENFMMTIRDAGVHGLVVPDVPLEETEILRKEAVKYNIELVLLTTPTTPKDRMKA 211
           ILKRGVE F+ T++D G+HGLVVPDVPLEETEILR+EA+K  IELVLLTTPTTP DRMKA
Sbjct: 152 ILKRGVEKFLSTVKDVGIHGLVVPDVPLEETEILRREALKNKIELVLLTTPTTPIDRMKA 211

Query: 212 IVEASEGFVYLVSSVGVTGARTSVSDRVQTLLEEIKEATEKPVAVGFGISKPEHVKQVAN 271
           IVEASEGFVYLVSS+GVTGAR SVSDRVQTL+ EIKEAT KPVAVGFGISKPEHVKQVA 
Sbjct: 212 IVEASEGFVYLVSSIGVTGARASVSDRVQTLIGEIKEATTKPVAVGFGISKPEHVKQVAG 271

Query: 272 WGADGIIVGSAMVKLLGEAQTPEEGLKALESFTKSLKSALS 313
           WGADG+IVGSAMVKLLGEA++PE+GLKALE FTKSLKSAL+
Sbjct: 272 WGADGVIVGSAMVKLLGEAESPEDGLKALEIFTKSLKSALA 312

BLAST of CmoCh19G000540 vs. TrEMBL
Match: B9S4R8_RICCO (Trytophan synthase alpha subunit, putative OS=Ricinus communis GN=RCOM_0991580 PE=3 SV=1)

HSP 1 Score: 462.6 bits (1189), Expect = 3.8e-127
Identity = 237/306 (77.45%), Positives = 268/306 (87.58%), Query Frame = 1

Query: 6   NSSSLLRLNKIPTALVFPTHPRKISVVQSKRFVPMAALTVSNAIGLSETFSKLKEQGKVA 65
           +++S L   K  T L+  +   K +V+ ++RF PMA LT +  + LSETFS LK++GKVA
Sbjct: 7   STTSFLHDRKPETHLLIRSPSYKPTVISTRRFAPMATLTAAKNLSLSETFSNLKKRGKVA 66

Query: 66  FIPYITAGDPDLSTTAEALKVLSNHGSDIIELGVPYSDPLADGPVIQAAATRSLARGTNF 125
           FIPYITAGDPDLSTTAEALK+L + GSDIIELGVPYSDPLADGPVIQAAATRSLARGTNF
Sbjct: 67  FIPYITAGDPDLSTTAEALKLLDSCGSDIIELGVPYSDPLADGPVIQAAATRSLARGTNF 126

Query: 126 SAIISMLKEVIPELSCPIALFSYYNPILKRGVENFMMTIRDAGVHGLVVPDVPLEETEIL 185
            AI SMLKEV+P+LSCPIALF+YYNPILKRG+E FM T++D GVHGLVVPDVPLEETE+L
Sbjct: 127 DAITSMLKEVVPQLSCPIALFTYYNPILKRGIEKFMSTVKDIGVHGLVVPDVPLEETELL 186

Query: 186 RKEAVKYNIELVLLTTPTTPKDRMKAIVEASEGFVYLVSSVGVTGARTSVSDRVQTLLEE 245
           R EA K NIELVLLTTPTTP +RMKAIVEA+EGFVYLVSSVGVTGAR SVSDRVQTLL+E
Sbjct: 187 RNEAAKKNIELVLLTTPTTPTERMKAIVEAAEGFVYLVSSVGVTGARASVSDRVQTLLQE 246

Query: 246 IKEATEKPVAVGFGISKPEHVKQVANWGADGIIVGSAMVKLLGEAQTPEEGLKALESFTK 305
           IKEAT KPVAVGFGISKPEHVKQVA WGADG+IVGSAMVK+LGEA++PEEGL+ L + TK
Sbjct: 247 IKEATAKPVAVGFGISKPEHVKQVAGWGADGVIVGSAMVKVLGEAKSPEEGLEELATLTK 306

Query: 306 SLKSAL 312
           SLKSAL
Sbjct: 307 SLKSAL 312

BLAST of CmoCh19G000540 vs. TAIR10
Match: AT4G02610.1 (AT4G02610.1 Aldolase-type TIM barrel family protein)

HSP 1 Score: 430.3 bits (1105), Expect = 1.0e-120
Identity = 215/268 (80.22%), Positives = 246/268 (91.79%), Query Frame = 1

Query: 44  TVSNAIGLSETFSKLKEQGKVAFIPYITAGDPDLSTTAEALKVLSNHGSDIIELGVPYSD 103
           T S+ +GLSETF++LK QGKVA IPYITAGDPDLSTTA+ALKVL + GSDIIELGVPYSD
Sbjct: 6   TPSSTVGLSETFARLKSQGKVALIPYITAGDPDLSTTAKALKVLDSCGSDIIELGVPYSD 65

Query: 104 PLADGPVIQAAATRSLARGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRGVENFMMT 163
           PLADGP IQAAA RSL +GTNF++IISMLKEVIP+LSCPIALF+YYNPIL+RGVEN+M  
Sbjct: 66  PLADGPAIQAAARRSLLKGTNFNSIISMLKEVIPQLSCPIALFTYYNPILRRGVENYMTV 125

Query: 164 IRDAGVHGLVVPDVPLEETEILRKEAVKYNIELVLLTTPTTPKDRMKAIVEASEGFVYLV 223
           I++AGVHGL+VPDVPLEETE LR EA K+ IELVLLTTPTTPK+RM AIVEASEGF+YLV
Sbjct: 126 IKNAGVHGLLVPDVPLEETETLRNEARKHQIELVLLTTPTTPKERMNAIVEASEGFIYLV 185

Query: 224 SSVGVTGARTSVSDRVQTLLEEIKEATEKPVAVGFGISKPEHVKQVANWGADGIIVGSAM 283
           SSVGVTG R SV+++VQ+LL++IKEAT KPVAVGFGISKPEHVKQVA WGADG+IVGSAM
Sbjct: 186 SSVGVTGTRESVNEKVQSLLQQIKEATSKPVAVGFGISKPEHVKQVAEWGADGVIVGSAM 245

Query: 284 VKLLGEAQTPEEGLKALESFTKSLKSAL 312
           VK+LGE+++PE+GLK LE FTKSLKSAL
Sbjct: 246 VKILGESESPEQGLKELEFFTKSLKSAL 273

BLAST of CmoCh19G000540 vs. TAIR10
Match: AT3G54640.1 (AT3G54640.1 tryptophan synthase alpha chain)

HSP 1 Score: 423.7 bits (1088), Expect = 9.8e-119
Identity = 214/283 (75.62%), Positives = 252/283 (89.05%), Query Frame = 1

Query: 30  SVVQSKRFVPMAALTVSN-AIGLSETFSKLKEQGKVAFIPYITAGDPDLSTTAEALKVLS 89
           S +  KRF PMA+L+ S+  +GL++TF++LK+QGKVAFIPYITAGDPDLSTTAEALKVL 
Sbjct: 29  SSLSFKRFTPMASLSTSSPTLGLADTFTQLKKQGKVAFIPYITAGDPDLSTTAEALKVLD 88

Query: 90  NHGSDIIELGVPYSDPLADGPVIQAAATRSLARGTNFSAIISMLKEVIPELSCPIALFSY 149
             GSDIIELGVPYSDPLADGPVIQAAATRSL RGTN  +I+ ML +V+P++SCPI+LF+Y
Sbjct: 89  ACGSDIIELGVPYSDPLADGPVIQAAATRSLERGTNLDSILEMLDKVVPQISCPISLFTY 148

Query: 150 YNPILKRGVENFMMTIRDAGVHGLVVPDVPLEETEILRKEAVKYNIELVLLTTPTTPKDR 209
           YNPILKRG+  FM +IR  GV GLVVPDVPLEETE+LRKEA+  +IELVLLTTPTTP +R
Sbjct: 149 YNPILKRGLGKFMSSIRAVGVQGLVVPDVPLEETEMLRKEALNNDIELVLLTTPTTPTER 208

Query: 210 MKAIVEASEGFVYLVSSVGVTGARTSVSDRVQTLLEEIKEATEKPVAVGFGISKPEHVKQ 269
           MK IV+ASEGF+YLVSS+GVTGAR+SVS +VQ+LL++IKEAT+KPVAVGFGISKPEHVKQ
Sbjct: 209 MKLIVDASEGFIYLVSSIGVTGARSSVSGKVQSLLKDIKEATDKPVAVGFGISKPEHVKQ 268

Query: 270 VANWGADGIIVGSAMVKLLGEAQTPEEGLKALESFTKSLKSAL 312
           +A WGADG+IVGSAMVKLLG+A++P EGLK LE  TKSLKSAL
Sbjct: 269 IAGWGADGVIVGSAMVKLLGDAKSPTEGLKELEKLTKSLKSAL 311

BLAST of CmoCh19G000540 vs. NCBI nr
Match: gi|659096912|ref|XP_008449352.1| (PREDICTED: tryptophan synthase alpha chain-like isoform X1 [Cucumis melo])

HSP 1 Score: 538.5 bits (1386), Expect = 7.7e-150
Identity = 278/312 (89.10%), Positives = 294/312 (94.23%), Query Frame = 1

Query: 1   MDIAFNSSSLLRLNKIPTALVFPTHPRKISVVQSKRFVPMAALTVSNAIGLSETFSKLKE 60
           MD   NSS L + NKIPT L+FP HP KISV QSKR VPMA+LT S+A+GLSETFSKLKE
Sbjct: 1   MDTVLNSSRLFQFNKIPTTLIFPPHPCKISVFQSKRVVPMASLTASSAVGLSETFSKLKE 60

Query: 61  QGKVAFIPYITAGDPDLSTTAEALKVLSNHGSDIIELGVPYSDPLADGPVIQAAATRSLA 120
           QGKVAFIPYITAGDPDLSTTAEALKVLS  GSDIIELGVPYSDPLADGPVIQAAATRSLA
Sbjct: 61  QGKVAFIPYITAGDPDLSTTAEALKVLSTSGSDIIELGVPYSDPLADGPVIQAAATRSLA 120

Query: 121 RGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRGVENFMMTIRDAGVHGLVVPDVPLE 180
           RGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRG+ NFM+TI+DAGV GLVVPDVPLE
Sbjct: 121 RGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRGIGNFMLTIKDAGVQGLVVPDVPLE 180

Query: 181 ETEILRKEAVKYNIELVLLTTPTTPKDRMKAIVEASEGFVYLVSSVGVTGARTSVSDRVQ 240
           ETEILRKEAVK+NIELVLLTTPTTP+DRMKAIVEASEGFVYLVSSVGVTGAR SVS++VQ
Sbjct: 181 ETEILRKEAVKHNIELVLLTTPTTPRDRMKAIVEASEGFVYLVSSVGVTGARASVSNKVQ 240

Query: 241 TLLEEIKEATEKPVAVGFGISKPEHVKQVANWGADGIIVGSAMVKLLGEAQTPEEGLKAL 300
           TLLEEIKE TEKPVAVGFGISKPEHVKQV++WGADGIIVGSAMVKLLGEAQ+PEEGLKAL
Sbjct: 241 TLLEEIKEVTEKPVAVGFGISKPEHVKQVSSWGADGIIVGSAMVKLLGEAQSPEEGLKAL 300

Query: 301 ESFTKSLKSALS 313
           E+FTKSLKSALS
Sbjct: 301 ENFTKSLKSALS 312

BLAST of CmoCh19G000540 vs. NCBI nr
Match: gi|449455268|ref|XP_004145375.1| (PREDICTED: tryptophan synthase alpha chain-like isoform X1 [Cucumis sativus])

HSP 1 Score: 535.0 bits (1377), Expect = 8.6e-149
Identity = 276/311 (88.75%), Positives = 293/311 (94.21%), Query Frame = 1

Query: 1   MDIAFNSSSLLRLNKIPTALVFPTHPRKISVVQSKRFVPMAALTVSNAIGLSETFSKLKE 60
           MDI  NSS L + NKIPT L+FP HP KISV QSKR VPMA+LT S+A+GLSETFSKLKE
Sbjct: 1   MDIVLNSSRLFQFNKIPTTLIFPPHPCKISVFQSKRVVPMASLTASSAVGLSETFSKLKE 60

Query: 61  QGKVAFIPYITAGDPDLSTTAEALKVLSNHGSDIIELGVPYSDPLADGPVIQAAATRSLA 120
           QGKVAFIPYITAGDPDLSTTAEALKVLS  GSDIIELGVPYSDPLADGPVIQAAATRSLA
Sbjct: 61  QGKVAFIPYITAGDPDLSTTAEALKVLSTSGSDIIELGVPYSDPLADGPVIQAAATRSLA 120

Query: 121 RGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRGVENFMMTIRDAGVHGLVVPDVPLE 180
           RGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRG+ NFM+TI+DAGV GLVVPDVPLE
Sbjct: 121 RGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRGIGNFMLTIKDAGVRGLVVPDVPLE 180

Query: 181 ETEILRKEAVKYNIELVLLTTPTTPKDRMKAIVEASEGFVYLVSSVGVTGARTSVSDRVQ 240
           ETEILRKEAVK++IELVLLTTPTTP+DRMKAIVEASEGFVYLVSSVGVTGAR SVS++VQ
Sbjct: 181 ETEILRKEAVKHSIELVLLTTPTTPRDRMKAIVEASEGFVYLVSSVGVTGARASVSNKVQ 240

Query: 241 TLLEEIKEATEKPVAVGFGISKPEHVKQVANWGADGIIVGSAMVKLLGEAQTPEEGLKAL 300
           TLLEEIKE TEKPVAVGFGISKPEHVKQV++WGADGIIVGSAMVKLLGEAQ+PEEGLKAL
Sbjct: 241 TLLEEIKEVTEKPVAVGFGISKPEHVKQVSSWGADGIIVGSAMVKLLGEAQSPEEGLKAL 300

Query: 301 ESFTKSLKSAL 312
           E+FTKSL SAL
Sbjct: 301 ENFTKSLTSAL 311

BLAST of CmoCh19G000540 vs. NCBI nr
Match: gi|659130730|ref|XP_008465320.1| (PREDICTED: tryptophan synthase alpha chain-like [Cucumis melo])

HSP 1 Score: 508.1 bits (1307), Expect = 1.1e-140
Identity = 263/312 (84.29%), Positives = 280/312 (89.74%), Query Frame = 1

Query: 1   MDIAFNSSSLLRLNKIPTALVFPTHPRKISVVQSKRFVPMAALTVSNAIGLSETFSKLKE 60
           MDIAF  S +L LNKIP  L+F   P KISV Q+KRF PMAAL     +GLSETF KL+E
Sbjct: 1   MDIAFKPSRILPLNKIPNTLIFSPRPCKISVSQTKRFAPMAALAAYPVVGLSETFKKLRE 60

Query: 61  QGKVAFIPYITAGDPDLSTTAEALKVLSNHGSDIIELGVPYSDPLADGPVIQAAATRSLA 120
           QGKVA IPYITAGDPDLSTTAEALKVLS  GSDIIELGVPYSDPLADGPVIQAAATRSLA
Sbjct: 61  QGKVALIPYITAGDPDLSTTAEALKVLSTCGSDIIELGVPYSDPLADGPVIQAAATRSLA 120

Query: 121 RGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRGVENFMMTIRDAGVHGLVVPDVPLE 180
           R TNF+AIISMLK VIPELSCPI+LF+YYNPILKRGVENFMM I+D GV GLVVPDVPLE
Sbjct: 121 RETNFNAIISMLKGVIPELSCPISLFTYYNPILKRGVENFMMIIKDTGVRGLVVPDVPLE 180

Query: 181 ETEILRKEAVKYNIELVLLTTPTTPKDRMKAIVEASEGFVYLVSSVGVTGARTSVSDRVQ 240
           ETE+LRKEAVK+NIELVLLTTPTTPK+RMK IVEASEGFVYLVSS+GVTG RTSVS RVQ
Sbjct: 181 ETEVLRKEAVKHNIELVLLTTPTTPKERMKKIVEASEGFVYLVSSIGVTGTRTSVSGRVQ 240

Query: 241 TLLEEIKEATEKPVAVGFGISKPEHVKQVANWGADGIIVGSAMVKLLGEAQTPEEGLKAL 300
           TLLEE+KE TEKPVAVGFGISKPEHVKQVA WGADGII+GSAMVKLLGEAQ+PEEGLK L
Sbjct: 241 TLLEEVKEVTEKPVAVGFGISKPEHVKQVAEWGADGIIIGSAMVKLLGEAQSPEEGLKEL 300

Query: 301 ESFTKSLKSALS 313
           E+FTKSLKSALS
Sbjct: 301 ENFTKSLKSALS 312

BLAST of CmoCh19G000540 vs. NCBI nr
Match: gi|449446897|ref|XP_004141207.1| (PREDICTED: tryptophan synthase alpha chain [Cucumis sativus])

HSP 1 Score: 503.4 bits (1295), Expect = 2.8e-139
Identity = 262/312 (83.97%), Positives = 279/312 (89.42%), Query Frame = 1

Query: 1   MDIAFNSSSLLRLNKIPTALVFPTHPRKISVVQSKRFVPMAALTVSNAIGLSETFSKLKE 60
           MDIAF SS +L LNKIP  L+F   P KISV QSKRF PMAAL     +GLSETF  L+E
Sbjct: 1   MDIAFKSSRILPLNKIPNTLIFSPRPCKISVSQSKRFAPMAALAAYPVVGLSETFKNLRE 60

Query: 61  QGKVAFIPYITAGDPDLSTTAEALKVLSNHGSDIIELGVPYSDPLADGPVIQAAATRSLA 120
           QGKVA IPYITAGDPDLSTTAEALKVLS  GSDIIELGVPYSDPLADGPVIQAAATRSLA
Sbjct: 61  QGKVALIPYITAGDPDLSTTAEALKVLSKCGSDIIELGVPYSDPLADGPVIQAAATRSLA 120

Query: 121 RGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRGVENFMMTIRDAGVHGLVVPDVPLE 180
           R TNF+AIISMLK VIPELS PI+LF+YYNPILKRGVENFMM I+D GV GLVVPDVPLE
Sbjct: 121 RETNFNAIISMLKGVIPELSRPISLFTYYNPILKRGVENFMMIIKDTGVRGLVVPDVPLE 180

Query: 181 ETEILRKEAVKYNIELVLLTTPTTPKDRMKAIVEASEGFVYLVSSVGVTGARTSVSDRVQ 240
           ETE+LRKEAVK+NIELVLLTTPTTPK+RMK IVEASEGFVYLVSS+GVTG RTSVS RVQ
Sbjct: 181 ETEVLRKEAVKHNIELVLLTTPTTPKERMKNIVEASEGFVYLVSSIGVTGTRTSVSSRVQ 240

Query: 241 TLLEEIKEATEKPVAVGFGISKPEHVKQVANWGADGIIVGSAMVKLLGEAQTPEEGLKAL 300
           TLLEE+KE TEKPVAVGFGISKPEHVKQVA WGADGII+GSAMVKLLGEAQ+PEEGLK L
Sbjct: 241 TLLEEVKEVTEKPVAVGFGISKPEHVKQVAEWGADGIIIGSAMVKLLGEAQSPEEGLKEL 300

Query: 301 ESFTKSLKSALS 313
           E+FT+SLKSALS
Sbjct: 301 ENFTRSLKSALS 312

BLAST of CmoCh19G000540 vs. NCBI nr
Match: gi|659096914|ref|XP_008449353.1| (PREDICTED: tryptophan synthase alpha chain-like isoform X2 [Cucumis melo])

HSP 1 Score: 488.8 bits (1257), Expect = 7.0e-135
Identity = 252/273 (92.31%), Positives = 266/273 (97.44%), Query Frame = 1

Query: 40  MAALTVSNAIGLSETFSKLKEQGKVAFIPYITAGDPDLSTTAEALKVLSNHGSDIIELGV 99
           MA+LT S+A+GLSETFSKLKEQGKVAFIPYITAGDPDLSTTAEALKVLS  GSDIIELGV
Sbjct: 1   MASLTASSAVGLSETFSKLKEQGKVAFIPYITAGDPDLSTTAEALKVLSTSGSDIIELGV 60

Query: 100 PYSDPLADGPVIQAAATRSLARGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRGVEN 159
           PYSDPLADGPVIQAAATRSLARGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRG+ N
Sbjct: 61  PYSDPLADGPVIQAAATRSLARGTNFSAIISMLKEVIPELSCPIALFSYYNPILKRGIGN 120

Query: 160 FMMTIRDAGVHGLVVPDVPLEETEILRKEAVKYNIELVLLTTPTTPKDRMKAIVEASEGF 219
           FM+TI+DAGV GLVVPDVPLEETEILRKEAVK+NIELVLLTTPTTP+DRMKAIVEASEGF
Sbjct: 121 FMLTIKDAGVQGLVVPDVPLEETEILRKEAVKHNIELVLLTTPTTPRDRMKAIVEASEGF 180

Query: 220 VYLVSSVGVTGARTSVSDRVQTLLEEIKEATEKPVAVGFGISKPEHVKQVANWGADGIIV 279
           VYLVSSVGVTGAR SVS++VQTLLEEIKE TEKPVAVGFGISKPEHVKQV++WGADGIIV
Sbjct: 181 VYLVSSVGVTGARASVSNKVQTLLEEIKEVTEKPVAVGFGISKPEHVKQVSSWGADGIIV 240

Query: 280 GSAMVKLLGEAQTPEEGLKALESFTKSLKSALS 313
           GSAMVKLLGEAQ+PEEGLKALE+FTKSLKSALS
Sbjct: 241 GSAMVKLLGEAQSPEEGLKALENFTKSLKSALS 273

The following BLAST results are available for this feature:

BLAST vs. Swiss-Prot
Match Name   E-value   Identity (%)  Description
TRPA1_ARATH  1.9e-119  80.22         Tryptophan synthase alpha chain OS=Arabidopsis thaliana GN=TRPA1 PE=1 SV=2
TRPA2_ARATH  1.7e-117  75.62         Tryptophan synthase alpha chain, chloroplastic OS=Arabidopsis thaliana GN=TSA1 P...
TRPA_MAIZE   4.2e-87   59.77         Indole-3-glycerol phosphate lyase, chloroplastic OS=Zea mays GN=BX1 PE=1 SV=2
TRPA_CYAP8   4.8e-83   59.00         Tryptophan synthase alpha chain OS=Cyanothece sp. (strain PCC 8801) GN=trpA PE=3...
TRPA_CYAP4   2.6e-81   56.44         Tryptophan synthase alpha chain OS=Cyanothece sp. (strain PCC 7425 / ATCC 29141)...

BLAST vs. TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0LKV4_CUCSA  6.0e-149  88.75         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G225330 PE=3 SV=1
A0A0A0LD81_CUCSA  1.9e-139  83.97         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G843770 PE=3 SV=1
A0A067JC03_JATCU  9.0e-129  79.02         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21847 PE=3 SV=1
A0A061DQA0_THECC  7.6e-128  83.27         Aldolase-type TIM barrel family protein OS=Theobroma cacao GN=TCM_004218 PE=3 SV...
B9S4R8_RICCO      3.8e-127  77.45         Trytophan synthase alpha subunit, putative OS=Ricinus communis GN=RCOM_0991580 P...

BLAST vs. TAIR10
Match Name   E-value   Identity (%)  Description
AT4G02610.1  1.0e-120  80.22         Aldolase-type TIM barrel family protein
AT3G54640.1  9.8e-119  75.62         tryptophan synthase alpha chain

BLAST vs. NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|659096912|ref|XP_008449352.1|  7.7e-150  89.10         PREDICTED: tryptophan synthase alpha chain-like isoform X1 [Cucumis melo]
gi|449455268|ref|XP_004145375.1|  8.6e-149  88.75         PREDICTED: tryptophan synthase alpha chain-like isoform X1 [Cucumis sativus]
gi|659130730|ref|XP_008465320.1|  1.1e-140  84.29         PREDICTED: tryptophan synthase alpha chain-like [Cucumis melo]
gi|449446897|ref|XP_004141207.1|  2.8e-139  83.97         PREDICTED: tryptophan synthase alpha chain [Cucumis sativus]
gi|659096914|ref|XP_008449353.1|  7.0e-135  92.31         PREDICTED: tryptophan synthase alpha chain-like isoform X2 [Cucumis melo]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002028   Trp_synthase_suA
IPR011060   RibuloseP-bd_barrel
IPR013785   Aldolase_TIM
IPR018204   Trp_synthase_alpha_AS
Vocabulary: Molecular Function
Term        Definition
GO:0004834  tryptophan synthase activity
GO:0003824  catalytic activity
Vocabulary: Biological Process
Term        Definition
GO:0006568  tryptophan metabolic process
GO:0008152  metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009094 L-phenylalanine biosynthetic process
biological_process GO:0000162 tryptophan biosynthetic process
biological_process GO:0006571 tyrosine biosynthetic process
biological_process GO:0008152 metabolic process
biological_process GO:0006568 tryptophan metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004834 tryptophan synthase activity
molecular_function GO:0003824 catalytic activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh19G000540.1  CmoCh19G000540.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                                Source    Source Term       Source Description                       Alignment
IPR002028  Tryptophan synthase, alpha chain               HAMAP     MF_00131          Trp_synth_alpha                          coord: 61..310, score: 37
IPR002028  Tryptophan synthase, alpha chain               PFAM      PF00290           Trp_syntA                                coord: 55..310, score: 7.6
IPR002028  Tryptophan synthase, alpha chain               TIGRFAMs  TIGR00262         TIGR00262                                coord: 55..308, score: 1.4
IPR011060  Ribulose-phosphate binding barrel              unknown   SSF51366          Ribulose-phoshate binding barrel         coord: 51..292, score: 1.22
IPR013785  Aldolase-type TIM barrel                       GENE3D    G3DSA:3.20.20.70  -                                        coord: 51..311, score: 1.5E
IPR018204  Tryptophan synthase, alpha chain, active site  PROSITE   PS00167           TRP_SYNTHASE_ALPHA                       coord: 95..108, score: ...
None       No IPR available                               unknown   Coil              Coil                                     coord: 232..252, score: ...
None       No IPR available                               PANTHER   PTHR10314         SER/THR DEHYDRATASE, TRP SYNTHASE        coord: 44..98, score: 1.3E-54; coord: 121..199, score: 1.3
None       No IPR available                               PANTHER   PTHR10314:SF88    TRYPTOPHAN SYNTHASE ALPHA CHAIN-RELATED  coord: 44..98, score: 1.3E-54; coord: 121..199, score: 1.3