Tan0021600 (gene) Snake gourd v1

Overview
Name: Tan0021600
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CBS domain-containing protein CBSX1, chloroplastic-like
Location: LG01: 6519583 .. 6525148 (-)
RNA-Seq Expression: Tan0021600
Synteny: Tan0021600
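The location string above packs the linkage group, the start and end coordinates, and the strand into one field. As a minimal sketch, it could be unpacked as follows. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re

def parse_location(text):
    """Parse a location such as 'LG01: 6519583 .. 6525148 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("LG01: 6519583 .. 6525148 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

The `(-)` strand means the gene is read from the reverse complement of the LG01 reference sequence.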
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CACGCATTGTATGTCCGCTCATTGATTAAAGTCACACTAAGCATTAAATATCTCCGACCGACCAGACGGGGATTTTCCTCTCCCTTCAACCCGTTTTTTACTCGGGAAACTGTTGCTTTGCTTATACGGAACCAGAACCAACCTAATTTTCCGATTTCACTTTGATTCTGAACTTTTCCGGTTGTAATTCTCTTCCGTTCAGCAGATCAATCAGCCATTGATTCTTCCACCTATTCTCTTCCCAGATTACTGTGTATTCGTTTTTCCATCAGCCAAAGCTCAGTTTACTCAACAAACAGCAAGCAAAATGAGCTCCATCTCTCTGTCCAATTTACATGCTCTCGCGCTGGCTTATCCTCATCATCTTTCTCACTCTCAGCGTCAGCAATGGTGTCACCGGCCTCTGCCTTCGTCTAATTCTCTCTCTAGGCTACAGCGGTTTGGGATCTCTGGCCGAAATTCCGCTGGACCTCCCTTGCCTCTGGTTCTTGCTTCCGCTGGAGCTGGCGTTGTCGACTCCTTTCCGGTTTGTCGTTTCTTTCTAACATTCTTTTGGCTGCATTTGTTCTATCTCCACGGAGACGTTTTTGTCTTTAGGTTTTTACTCGAAATGAAACTCTAATTGCTGCTTTAATGTAATATAACGAATGACTGGAATTCCGTCATGTATCAATCTCTTCGCGTAATTATGGTTTGTATGTTTGTTCTGTATGAATGGATATGAAAATAATAGCAAAAACTTGCTGCCATTTTGGCTGTGAAGTAATCGAGAGATTATACTTCTCCTTTATGACGATGAATTGAAGGAAAAGAGCCAGCGCCGTCTAAGTTCATGCTGCAAAGTTTTAGAAAGGAGCATAGAACCAAAGCAGACGCTCGAAAGTACTTGTTATGATTATTTAGAATGATTTTTTTTAGAGAGGGAGAGTTTGGGGGAATTAACTTGTCAATGGCTTTTTAGACCTTTGGTTTTATACTTCTAATAGCTGCTTGACCGTCTGTATCTCGATCTTATTTTATCATGGTGATTTTCAGATCAGAGGAACATATACGGTTGGTGATTTTATGACGAGAAAGGAGAATCTTTATGTTGTTAAACCTACAACAACAGTTGATGAAGGTGCGGTTGGTTTAGGGCTTTTATTTTAAATATTTTCATTAGAGTTGCTCATGACAATTGTTTAATTCATTGGCTTTTGTGGTTGCAGCATTAGAAGCCCTTGTGGAAAAGAGAATAACTGGTTTTCCTGTTGTGGATGAAGATTGGAATTTGGTAGAGTTTTTCCTTTCATCTTTTTGTTCTTTTCGCTAGTGCTTTTATATTGCCGCTGCCAACTGACTGCTGACTGTGGAAGTGAATTATCGAAGGTCTGATGACATTGTTGTGGAGGGGACTGGGGAAAAGATCTTCTGTTTGTTGTCACGAAAAATCATGCAAAATAAATATATTATTGACATTTTTCATTTATTTTTTGAGGGGTTGGAGTGTATGTCTATCTATGGTCAAACTGTTACCGTTAAGCCATGTCAGTACGTTTAGTCCATTTACGTTTAGAGTACTTGTTGTCGAGAATCCAAACTGCTATTGAAGTTGATAACCTGCTGCACTTGGTCACAAATAATTTTAGTGCCTTTACATTAGTTGCTTCCTGTGGCAGGTTGGGGTCGTCTCAGATTATGACTTATTAGCACTTGATTCCATTTCAGGTATTGTAATGTAACTTCTGATTGCACAGATTTCAACTTAAATCACCATGACTTGAGGAATTATTCTCGCGTATTTTTCTGTTGAAATTGGGTTTGTGTCCTTTTTTAGTTTTGGTTAAATTACTAGCACTTTATTTATTCTTCTTATATTGATTCCTAAACTTTGAAATGCTTCCTTCAGATTGTTTTTGTTTTCTGACAGACCAAACATTTCCCAGTCATATCATATGTATCAAATTATTTTAGAATAAGTATTTTCAACGTTAACAGAAATAAACATCATTTAGGGGTTG
TTTGAGGCACTGAGCGAGTTATAATAATATGAGTTATAATAGTTTGTGGGTTATTATAATCTGAAATAATATAATATTATTTAAAATGCAGAGTTGTTTAGTCAAGAGTTATAATAATCTGTGTTTAGAGTACATAACATTCCACATGTTATTATTACCTGCCCCAAACATGCCCTAAATTTTTACAGCTGATTGAGTTGATTCATTGAACTTTATTTGATGCAAAGGTGGACAAGTAACAAAGTTTCGTGATTTGTGTGTATATATTAATTTTCAATCAAATTCAGAAATAGAAAAATTGTTATCAGAAAATGTTTAGTATACTCTCTCTTGCTGGACTCTCGATGCAAAACTACATGCTCCACATCTTTGTATGCATGTAGTCTCTGTATCAAGTGTCCTCAATAAATCTCATGTTGCCCTCCATCTCTTCTCTTTCTTCTTATAATTTGTTCTTCATCCTATTTTTTTTTTGCTGCACACACACACATCTTTTTATTCCGCCCATAAATGCATAAATTTTAGTCTCTCATTCCAGAAGCTCTATTTTACTTACTTTACTTTACCTATACTCATTCCCTACTTTGCAAAAATCTAGGATACAAACAACCATCCTTTGCAAAAATCTAGGGATACAAGTTTTTGGATTACTTTGGGTCCATCATCATCCTTTGGATGGAAAATTGGTTCAAATGATCTGCGGCCGCCCTTTTAAGAGGAAAGCAAAATTGCTTCGGTCAGTGCTGTTAAGTTGACACTCTGGAGTATTTTGTTGGAAAGAGGAAAAAATATATTGGGATACAATTTTTTTTTTTGGGGGCAAAACTTGAAATATGAAATGGTTACAACTATTATGTCTCAATATGGTGCATCTTATAGTTTATTTGCTCATTTGCTGATGGCTGTATGAAAAACCTGATAGTTTGAGTATTTTGTATATAATGAATATCTCTTGAGCAATAGTTGGAAACTGGGTTATGGATTCTGCAACTGTCAAGCTGATAAATTGTTCGCTGCCTCATGTTTATCAATATTTGTTCAAACATTGAGGCCAATGAAAATAGTAAGGGATTTCGAGAGAATGGGTTAAAATAATGGCCACTTACCTACAGTTTAATATCTTACGAGTTTCATTTGCAATCAAATGTAGTAAAGTCAAGTGGTTGTCTTGTAAGATTATTTGAGGCATACGTAAACTGACGTGGACTCTCACAATTTTTTTTAACAAAGTTCATTCATAATTCATCAATCATTTTCTTGTACTTTATTGGATTCCATATATTCCATTGCTTATCTTACTCAATAATGTTTCTCCCTTATTTTTTGGAAATTAGGTGGTACTCAAAGTGATACTAATCTGTTTCCTGATGTGGATAGTTCGTGGAAAGTATGTAAGCAATGCCCTCATTTATCTTCTGGTTTTTTGCTTTTGTTATTTTGTAGACATGGGGATGTTTAAAAAATGCATTGACCCAGAAAACCCAATAAATCCAACCAAACCGTTTGGGTTGGGTTTCTGCTAGTTCAACGCAAATCATCAGAAAGTTCTTTTTAAAAAAAAAAAAAAATATTTCTTAATTTTTAGAGTTTTTATACTCTATATAGATTAATCTTTGGAGTATTTTCATCTCCCCCCCACCACACACACACACAAAAAAAAAAAAAAGACATTTAACCATGACTTTTTATTCATACATTTGATTACTACTTTTTATTTATGGTGTGAAAACTGGTGGATAGGTTCTTAAATTACTTCTATTTGTTTATTGTTTTCTTACAAAATTTTGAAATAAATAACTTGAGTTTTTTTTTTTTTTTTTTTTTTTTTTGAGTGCATAGTCGTTACAATGAGATTTTCTGTATATCTGATCGTAGCACTCAACTGCTATTTTCTTTTGTAATGATCATGTAGGCAGCACCACTTCTTTTCAAAATACCGAATTTCCACAAACATGCCACATGAAAACATATATACCTGCAAAATTATCTTTAGAACGAGA
TATGTATAGTTGCTTATTTTAAATGAAAATTTATTTTGAGGAGAGTGCTTTAAAATCGAGATAGACGGGTCACAGAAATTAGCTTTGCCATTGACATCCCTTATTTAAGTAAATTTTTTTGGAGAAGAAGAGTCACAGAAATTAGCTCTGCCATTGACATCCCTTGATTAATTAAATTTCTTTGGAGAAAGAGGAGTCACGTGATCATGACCATGAATTATGTGGATTGGTGGTTCACCACCAGTTTATAATAAAATAAAAAATAAGAGTATATTTTTGCTGAATGTGTTGGCAGACATTCAATGAGATACAGAAACTGCTCAGTAAAACAAATGGTAAAGTTGTTGGCGACCTGATGACACCTGCTCCTCTAGTTGTTCGTGAAACTTCAAATTTAGAAGATGCTGCCAGGTAAACATTTTATACCTCTCTTCTGCATCTTACAATCTGAATTCATGGTTCCTTCAGTGTTTCGACCTAGATATTTAATCAATTATACTCTTTTCTTTTTTGTTTTTTGTAACTGTTAAGTCTTAAAGATTTGATCTGCGGGTGCCTCCATGTTATTCTGTGACTTTGTTCAATCATCATTTGTCACGCTGGTTTCTCACAACTTCTTACATAGGGGGAAAACATCATACAACATCTGTGCCTGTGGCTATCAAAAACAACGTTTAAATGACACATTTAAGAGAGGAGAGACTTCAGAGTCAACAATATACTCATTCTTAGAAGTTAATCTATGACTATCAAAACTTGAAACTATTGTTTTAACTTCCAATGTTCAGGTTGTTGCTTGAAACAAAATATCGACGATTGCCAGTAGTAAATGCAGATGGCAAGCTGGTAGGAATATTCCTCTTTCTTTTCTTTTCTATTCTTCTTTAAAGCGTAATAACTTTTTAAAGCTCCCTCAGTAACCGGTAACTGTCTTGTTTGCTTCTGTTTCAATGTTAAAGGTGGGGATCATTACTAGGGGAAATGTCGTTAGAGCAGCCTTGCAGATAAAACGCGCTGCTGAAAGGTCGACGTAATTCTAGGAACATTGGCTGGTGTTGCTCCGAGGTAAACATGCAACACAAAAGCCTAGCATTATTTGTTGATCATAAATATTGTCTCTTATAGTTTACACAGGGTTGGATTGTTAAAGTAACCCGAAGTTGTGGATTCCGTCCTCAAGTAAACTACATCTTTAAAGGGAAGTTTATTGATGCTCTAAGGTTCTTCTTCCCAATCTCCTAACTGAGAAATGTTGGTCTATGTCAATATTTGGGTATCCCCAGCATTTCAGGTCAGCCCACTGTTAGATTTCTCTAGCCATTATTGTTAGAACCTTATTGTTTAGATTTCTCTGAACATTTTGAACATAATCAGTGAGCATCTTTCTACTGTTTTTGGGGATGTCTTGTTCTGATGAAAGCATGAGTCTGTAATTGTGAGGTATATCCCCAATATTACGCCATGACTTCAATTAGGAGTTATGACTAACTTAATACTTAAAAAGAGAGTGATTGGTGATTGGAATAGCTCGTATAACAAAATCTATTTTTGTTCCTTTGTTTTAGC

mRNA sequence

CACGCATTGTATGTCCGCTCATTGATTAAAGTCACACTAAGCATTAAATATCTCCGACCGACCAGACGGGGATTTTCCTCTCCCTTCAACCCGTTTTTTACTCGGGAAACTGTTGCTTTGCTTATACGGAACCAGAACCAACCTAATTTTCCGATTTCACTTTGATTCTGAACTTTTCCGGTTGTAATTCTCTTCCGTTCAGCAGATCAATCAGCCATTGATTCTTCCACCTATTCTCTTCCCAGATTACTGTGTATTCGTTTTTCCATCAGCCAAAGCTCAGTTTACTCAACAAACAGCAAGCAAAATGAGCTCCATCTCTCTGTCCAATTTACATGCTCTCGCGCTGGCTTATCCTCATCATCTTTCTCACTCTCAGCGTCAGCAATGGTGTCACCGGCCTCTGCCTTCGTCTAATTCTCTCTCTAGGCTACAGCGGTTTGGGATCTCTGGCCGAAATTCCGCTGGACCTCCCTTGCCTCTGGTTCTTGCTTCCGCTGGAGCTGGCGTTGTCGACTCCTTTCCGATCAGAGGAACATATACGGTTGGTGATTTTATGACGAGAAAGGAGAATCTTTATGTTGTTAAACCTACAACAACAGTTGATGAAGCATTAGAAGCCCTTGTGGAAAAGAGAATAACTGGTTTTCCTGTTGTGGATGAAGATTGGAATTTGGTTGGGGTCGTCTCAGATTATGACTTATTAGCACTTGATTCCATTTCAGGTGGTACTCAAAGTGATACTAATCTGTTTCCTGATGTGGATAGTTCGTGGAAAACATTCAATGAGATACAGAAACTGCTCAGTAAAACAAATGGTAAAGTTGTTGGCGACCTGATGACACCTGCTCCTCTAGTTGTTCGTGAAACTTCAAATTTAGAAGATGCTGCCAGGTTGTTGCTTGAAACAAAATATCGACGATTGCCAGTAGTAAATGCAGATGGCAAGCTGGTGGGGATCATTACTAGGGGAAATGTCGTTAGAGCAGCCTTGCAGATAAAACGCGCTGCTGAAAGGTCGACGTAATTCTAGGAACATTGGCTGGTGTTGCTCCGAGGTAAACATGCAACACAAAAGCCTAGCATTATTTGTTGATCATAAATATTGTCTCTTATAGTTTACACAGGGTTGGATTGTTAAAGTAACCCGAAGTTGTGGATTCCGTCCTCAAGTAAACTACATCTTTAAAGGGAAGTTTATTGATGCTCTAAGGTTCTTCTTCCCAATCTCCTAACTGAGAAATGTTGGTCTATGTCAATATTTGGGTATCCCCAGCATTTCAGGTCAGCCCACTGTTAGATTTCTCTAGCCATTATTGTTAGAACCTTATTGTTTAGATTTCTCTGAACATTTTGAACATAATCAGTGAGCATCTTTCTACTGTTTTTGGGGATGTCTTGTTCTGATGAAAGCATGAGTCTGTAATTGTGAGGTATATCCCCAATATTACGCCATGACTTCAATTAGGAGTTATGACTAACTTAATACTTAAAAAGAGAGTGATTGGTGATTGGAATAGCTCGTATAACAAAATCTATTTTTGTTCCTTTGTTTTAGC

Coding sequence (CDS)

ATGAGCTCCATCTCTCTGTCCAATTTACATGCTCTCGCGCTGGCTTATCCTCATCATCTTTCTCACTCTCAGCGTCAGCAATGGTGTCACCGGCCTCTGCCTTCGTCTAATTCTCTCTCTAGGCTACAGCGGTTTGGGATCTCTGGCCGAAATTCCGCTGGACCTCCCTTGCCTCTGGTTCTTGCTTCCGCTGGAGCTGGCGTTGTCGACTCCTTTCCGATCAGAGGAACATATACGGTTGGTGATTTTATGACGAGAAAGGAGAATCTTTATGTTGTTAAACCTACAACAACAGTTGATGAAGCATTAGAAGCCCTTGTGGAAAAGAGAATAACTGGTTTTCCTGTTGTGGATGAAGATTGGAATTTGGTTGGGGTCGTCTCAGATTATGACTTATTAGCACTTGATTCCATTTCAGGTGGTACTCAAAGTGATACTAATCTGTTTCCTGATGTGGATAGTTCGTGGAAAACATTCAATGAGATACAGAAACTGCTCAGTAAAACAAATGGTAAAGTTGTTGGCGACCTGATGACACCTGCTCCTCTAGTTGTTCGTGAAACTTCAAATTTAGAAGATGCTGCCAGGTTGTTGCTTGAAACAAAATATCGACGATTGCCAGTAGTAAATGCAGATGGCAAGCTGGTGGGGATCATTACTAGGGGAAATGTCGTTAGAGCAGCCTTGCAGATAAAACGCGCTGCTGAAAGGTCGACGTAA

Protein sequence

MSSISLSNLHALALAYPHHLSHSQRQQWCHRPLPSSNSLSRLQRFGISGRNSAGPPLPLVLASAGAGVVDSFPIRGTYTVGDFMTRKENLYVVKPTTTVDEALEALVEKRITGFPVVDEDWNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMTPAPLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAERST
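The protein sequence is the conceptual translation of the CDS listed before it. The sketch below illustrates that relationship on the first 30 and last 15 nucleotides of the CDS, copied from this record. It assumes the standard genetic code; `CODON_TABLE` and `translate` are illustrative names, not part of the database.

```python
# Standard genetic code, built in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at a stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAGCTCCATCTCTCTGTCCAATTTACAT"  # first 30 nt of the CDS
cds_end = "GAAAGGTCGACGTAA"                   # last 15 nt of the CDS

print(translate(cds_start))  # MSSISLSNLH
print(translate(cds_end))    # ERST
```

Both fragments agree with the start (MSSISLSNLH) and end (ERST) of the protein sequence shown above.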
Homology
BLAST of Tan0021600 vs. ExPASy Swiss-Prot
Match: O23193 (CBS domain-containing protein CBSX1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CBSX1 PE=1 SV=2)

HSP 1 Score: 271.2 bits (692), Expect = 1.1e-71
Identity = 139/208 (66.83%), Positives = 171/208 (82.21%), Query Frame = 0

Query: 33  LPSSNSLSRLQRFGISGRNSAGPPLPLVLASAGAGVV--DSFPIRGTYTVGDFMTRKENL 92
           LP   S+    +F  S    +   +P   ++AG+ ++   S P  G YTVG+FMT+KE+L
Sbjct: 28  LPRFLSVQPCHKFTFSRSFPSKSRIPSASSAAGSTLMTNSSSPRSGVYTVGEFMTKKEDL 87

Query: 93  YVVKPTTTVDEALEALVEKRITGFPVVDEDWNLVGVVSDYDLLALDSISGGTQSDTNLFP 152
           +VVKPTTTVDEALE LVE RITGFPV+DEDW LVG+VSDYDLLALDSISG  +++ ++FP
Sbjct: 88  HVVKPTTTVDEALELLVENRITGFPVIDEDWKLVGLVSDYDLLALDSISGSGRTENSMFP 147

Query: 153 DVDSSWKTFNEIQKLLSKTNGKVVGDLMTPAPLVVRETSNLEDAARLLLETKYRRLPVVN 212
           +VDS+WKTFN +QKLLSKTNGK+VGDLMTPAPLVV E +NLEDAA++LLETKYRRLPVV+
Sbjct: 148 EVDSTWKTFNAVQKLLSKTNGKLVGDLMTPAPLVVEEKTNLEDAAKILLETKYRRLPVVD 207

Query: 213 ADGKLVGIITRGNVVRAALQIKRAAERS 239
           +DGKLVGIITRGNVVRAALQIKR+ +R+
Sbjct: 208 SDGKLVGIITRGNVVRAALQIKRSGDRN 235
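The percentages on the Identity and Positives line follow from the reported counts divided by the alignment length. A small sketch of that arithmetic, using the figures from the HSP above (the helper name `percent` is hypothetical):

```python
def percent(count, aligned_length):
    """Return count / aligned_length as a percentage rounded to two decimals."""
    return round(100.0 * count / aligned_length, 2)

# Figures from the O23193 HSP: 139 identical and 171 positive
# positions over an alignment of 208 columns.
identity = percent(139, 208)
positives = percent(171, 208)
print(identity, positives)  # 66.83 82.21
```

Positives count identical residues plus conservative substitutions (marked `+` on the middle line of the alignment), so the value is never lower than the identity.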

BLAST of Tan0021600 vs. ExPASy Swiss-Prot
Match: Q9C5D0 (CBS domain-containing protein CBSX2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CBSX2 PE=1 SV=1)

HSP 1 Score: 267.3 bits (682), Expect = 1.7e-70
Identity = 152/244 (62.30%), Positives = 185/244 (75.82%), Query Frame = 0

Query: 1   MSSISLSNLHALALAYPHHLSHSQRQQWCHRPLPSSNSLSRLQRFGISGRNSAGPPLPLV 60
           M SISLSN  ++ +     L+    Q +    LP S+S   L       R+S   P   V
Sbjct: 1   MGSISLSN--SMPITRLPLLTSLYHQSF----LPISSSSFSLLPLSNRRRSSTFSPSITV 60

Query: 61  ----LASAGAGVVDSFPIR-GTYTVGDFMTRKENLYVVKPTTTVDEALEALVEKRITGFP 120
                A A     +S P + G YTVGDFMT ++NL+VVKP+T+VD+ALE LVEK++TG P
Sbjct: 61  SAFFAAPASVNNNNSVPAKNGGYTVGDFMTPRQNLHVVKPSTSVDDALELLVEKKVTGLP 120

Query: 121 VVDEDWNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVG 180
           V+D++W LVGVVSDYDLLALDSISG +Q+DTNLFPDVDS+WKTFNE+QKL+SKT GKVVG
Sbjct: 121 VIDDNWTLVGVVSDYDLLALDSISGRSQNDTNLFPDVDSTWKTFNELQKLISKTYGKVVG 180

Query: 181 DLMTPAPLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAA 240
           DLMTP+PLVVR+++NLEDAARLLLETK+RRLPVV+ADGKL+GI+TRGNVVRAALQIKR  
Sbjct: 181 DLMTPSPLVVRDSTNLEDAARLLLETKFRRLPVVDADGKLIGILTRGNVVRAALQIKRET 238

BLAST of Tan0021600 vs. ExPASy Swiss-Prot
Match: P42851 (Inosine-5'-monophosphate dehydrogenase OS=Pyrococcus furiosus (strain ATCC 43587 / DSM 3638 / JCM 8422 / Vc1) OX=186497 GN=guaB PE=3 SV=1)

HSP 1 Score: 65.9 bits (159), Expect = 7.3e-10
Identity = 42/138 (30.43%), Positives = 70/138 (50.72%), Query Frame = 0

Query: 88  ENLYVVKPTTTVDEALEALVEKRITGFPVVDEDWNLVGVVSDYDLLALDSISGGTQSDTN 147
           E++  + P  T+D AL  + +  I G PVV+ED  +VG+++  D+ A +           
Sbjct: 101 EDVITIAPDETIDYALFLMEKHGIDGLPVVEED-RVVGIITKKDIAARE----------- 160

Query: 148 LFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMTPAPLVVRETSNLEDAARLLLETKYRRLP 207
                                  G+ V +LMT   + V E+ ++E+A ++++E +  RLP
Sbjct: 161 -----------------------GRTVKELMTREVITVPESVDVEEALKIMMENRIDRLP 203

Query: 208 VVNADGKLVGIITRGNVV 226
           VVN DGKLVG+IT  ++V
Sbjct: 221 VVNEDGKLVGLITMSDLV 203

BLAST of Tan0021600 vs. ExPASy Swiss-Prot
Match: Q58821 (Uncharacterized protein MJ1426 OS=Methanocaldococcus jannaschii (strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC 100440) OX=243232 GN=MJ1426 PE=4 SV=1)

HSP 1 Score: 64.3 bits (155), Expect = 2.1e-09
Identity = 39/145 (26.90%), Positives = 80/145 (55.17%), Query Frame = 0

Query: 92  VVKPTTTVDEALEALVEKRITGFPVVDEDWNLVGVVSDYDLLALDSISGGTQSDTNLFPD 151
           VV     + + +    + +I+G PV+++D  LVG++S+ D+  + +I    +    + P 
Sbjct: 26  VVYEDNDLIDVIRLFRKNKISGAPVLNKDGKLVGIISESDI--VKTIVTHNEDLNLILPS 85

Query: 152 ----VDSSWKTFNEIQKLLSKTNGKV---VGDLMTPAPLVVRETSNLEDAARLLLETKYR 211
               ++   KT  +I++ +      +   V D+MT   +V +    + DAA+L+++   +
Sbjct: 86  PLDLIELPLKTALKIEEFMEDLKNALKTKVRDVMTRKVIVAKPDMTINDAAKLMVKNNIK 145

Query: 212 RLPVVNADGKLVGIITRGNVVRAAL 230
           RLPVV+ +G L+GI+TRG+++ A +
Sbjct: 146 RLPVVDDEGNLIGIVTRGDLIEALI 168

BLAST of Tan0021600 vs. ExPASy Swiss-Prot
Match: O58045 (Inosine-5'-monophosphate dehydrogenase OS=Pyrococcus horikoshii (strain ATCC 700860 / DSM 12428 / JCM 9974 / NBRC 100139 / OT-3) OX=70601 GN=guaB PE=1 SV=1)

HSP 1 Score: 60.8 bits (146), Expect = 2.3e-08
Identity = 42/138 (30.43%), Positives = 68/138 (49.28%), Query Frame = 0

Query: 88  ENLYVVKPTTTVDEALEALVEKRITGFPVVDEDWNLVGVVSDYDLLALDSISGGTQSDTN 147
           E++  + P  TVD AL  + +  I G PVV ED  +VG+++  D+ A +           
Sbjct: 101 EDVITIAPDETVDFALFLMEKHGIDGLPVV-EDEKVVGIITKKDIAARE----------- 160

Query: 148 LFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMTPAPLVVRETSNLEDAARLLLETKYRRLP 207
                                  GK+V +LMT   + V E+  +E+A ++++E +  RLP
Sbjct: 161 -----------------------GKLVKELMTKEVITVPESIEVEEALKIMIENRIDRLP 203

Query: 208 VVNADGKLVGIITRGNVV 226
           VV+  GKLVG+IT  ++V
Sbjct: 221 VVDERGKLVGLITMSDLV 203

BLAST of Tan0021600 vs. NCBI nr
Match: XP_038894434.1 (CBS domain-containing protein CBSX2, chloroplastic-like [Benincasa hispida])

HSP 1 Score: 429.5 bits (1103), Expect = 1.9e-116
Identity = 223/239 (93.31%), Positives = 228/239 (95.40%), Query Frame = 0

Query: 1   MSSISLSNLHALALAYPHHLSHSQRQQWCHRPLPSSNSLSRLQRFGISGRNSAGPPLPLV 60
           MSSISLSN HALA AYPHHL HSQRQQWC RPL SSNSLS+LQRFGIS R SA PPLPLV
Sbjct: 1   MSSISLSNSHALARAYPHHLPHSQRQQWCPRPLLSSNSLSKLQRFGISDRYSARPPLPLV 60

Query: 61  LASAGAGVVDSFPIRGTYTVGDFMTRKENLYVVKPTTTVDEALEALVEKRITGFPVVDED 120
           LAS+GAGVVDSFPIRGTYTVGDFMTRKENLYVVKPTTTVDEALE LVEKRITGFPVVD+D
Sbjct: 61  LASSGAGVVDSFPIRGTYTVGDFMTRKENLYVVKPTTTVDEALEVLVEKRITGFPVVDDD 120

Query: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMTP 180
           WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMT 
Sbjct: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMTS 180

Query: 181 APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAERST 240
           APL VRETSNLEDAARLLLETKYRRLPVV+ADG+LVGIITRGNVVRAALQIKRAAERST
Sbjct: 181 APLAVRETSNLEDAARLLLETKYRRLPVVDADGRLVGIITRGNVVRAALQIKRAAERST 239

BLAST of Tan0021600 vs. NCBI nr
Match: XP_023535982.1 (CBS domain-containing protein CBSX1, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 422.2 bits (1084), Expect = 3.1e-114
Identity = 220/239 (92.05%), Positives = 228/239 (95.40%), Query Frame = 0

Query: 1   MSSISLSNLHALALAYPHHLSHSQRQQWCHRPLPSSNSLSRLQRFGISGRNSAGPPLPLV 60
           MSSISLS   ALA AYPHHL +SQRQQ C RPLPSSNSL+RLQRF IS RNSAGPPLPLV
Sbjct: 1   MSSISLSCALALARAYPHHLPNSQRQQLCRRPLPSSNSLTRLQRFEISDRNSAGPPLPLV 60

Query: 61  LASAGAGVVDSFPIRGTYTVGDFMTRKENLYVVKPTTTVDEALEALVEKRITGFPVVDED 120
           LASAGAG+++SFPIRGTYTVGDFMTRKE+L+VVKPTTTVDEALE LVEKRITGFPVVDED
Sbjct: 61  LASAGAGILNSFPIRGTYTVGDFMTRKEDLFVVKPTTTVDEALETLVEKRITGFPVVDED 120

Query: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMTP 180
           WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKV+GDLMTP
Sbjct: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVIGDLMTP 180

Query: 181 APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAERST 240
           APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAE ST
Sbjct: 181 APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAEGST 239

BLAST of Tan0021600 vs. NCBI nr
Match: XP_022982496.1 (CBS domain-containing protein CBSX1, chloroplastic-like [Cucurbita maxima])

HSP 1 Score: 422.2 bits (1084), Expect = 3.1e-114
Identity = 221/239 (92.47%), Positives = 228/239 (95.40%), Query Frame = 0

Query: 1   MSSISLSNLHALALAYPHHLSHSQRQQWCHRPLPSSNSLSRLQRFGISGRNSAGPPLPLV 60
           MSSISLS   ALA AYPHHL +SQRQQ C RPLPSSNSL+RLQRF IS RNSAGPPLPLV
Sbjct: 1   MSSISLSCALALAPAYPHHLPNSQRQQLCRRPLPSSNSLTRLQRFEISDRNSAGPPLPLV 60

Query: 61  LASAGAGVVDSFPIRGTYTVGDFMTRKENLYVVKPTTTVDEALEALVEKRITGFPVVDED 120
           LASAGAGV++SFPIRGTYTVGDFMTRKE+L+VVKPTTTVDEALE LVEKRITGFPVVDED
Sbjct: 61  LASAGAGVLNSFPIRGTYTVGDFMTRKEDLFVVKPTTTVDEALETLVEKRITGFPVVDED 120

Query: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMTP 180
           WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKV+GDLMTP
Sbjct: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVIGDLMTP 180

Query: 181 APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAERST 240
           APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAE ST
Sbjct: 181 APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAEGST 239

BLAST of Tan0021600 vs. NCBI nr
Match: KAG6600315.1 (CBS domain-containing protein CBSX1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 420.6 bits (1080), Expect = 8.9e-114
Identity = 220/239 (92.05%), Positives = 228/239 (95.40%), Query Frame = 0

Query: 1   MSSISLSNLHALALAYPHHLSHSQRQQWCHRPLPSSNSLSRLQRFGISGRNSAGPPLPLV 60
           MSSISLS   ALA AYPHHL +SQRQ+ C RPLPSSNSL+RLQRF IS RNSAGPPLPLV
Sbjct: 1   MSSISLSCALALAPAYPHHLPNSQRQKLCRRPLPSSNSLTRLQRFEISDRNSAGPPLPLV 60

Query: 61  LASAGAGVVDSFPIRGTYTVGDFMTRKENLYVVKPTTTVDEALEALVEKRITGFPVVDED 120
           LASAGAGV++SFPIRGTYTVGDFMTRKE+L+VVKPTTTVDEALE LVEKRITGFPVVDED
Sbjct: 61  LASAGAGVLNSFPIRGTYTVGDFMTRKEDLFVVKPTTTVDEALETLVEKRITGFPVVDED 120

Query: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMTP 180
           WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKV+GDLMTP
Sbjct: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVIGDLMTP 180

Query: 181 APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAERST 240
           APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAE ST
Sbjct: 181 APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAEGST 239

BLAST of Tan0021600 vs. NCBI nr
Match: KAG7030973.1 (CBS domain-containing protein CBSX1, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 420.2 bits (1079), Expect = 1.2e-113
Identity = 219/239 (91.63%), Positives = 228/239 (95.40%), Query Frame = 0

Query: 1   MSSISLSNLHALALAYPHHLSHSQRQQWCHRPLPSSNSLSRLQRFGISGRNSAGPPLPLV 60
           MSSISLS   ALA AYPHHL +SQRQ+ C RPLPSSNSL+RLQRF IS RNSAGPPLPLV
Sbjct: 1   MSSISLSCALALAPAYPHHLPNSQRQKLCRRPLPSSNSLTRLQRFEISDRNSAGPPLPLV 60

Query: 61  LASAGAGVVDSFPIRGTYTVGDFMTRKENLYVVKPTTTVDEALEALVEKRITGFPVVDED 120
           LASAGAG+++SFPIRGTYTVGDFMTRKE+L+VVKPTTTVDEALE LVEKRITGFPVVDED
Sbjct: 61  LASAGAGILNSFPIRGTYTVGDFMTRKEDLFVVKPTTTVDEALETLVEKRITGFPVVDED 120

Query: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMTP 180
           WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKV+GDLMTP
Sbjct: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVIGDLMTP 180

Query: 181 APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAERST 240
           APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAE ST
Sbjct: 181 APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAEGST 239

BLAST of Tan0021600 vs. ExPASy TrEMBL
Match: A0A6J1IZH4 (CBS domain-containing protein CBSX1, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111481297 PE=4 SV=1)

HSP 1 Score: 422.2 bits (1084), Expect = 1.5e-114
Identity = 221/239 (92.47%), Positives = 228/239 (95.40%), Query Frame = 0

Query: 1   MSSISLSNLHALALAYPHHLSHSQRQQWCHRPLPSSNSLSRLQRFGISGRNSAGPPLPLV 60
           MSSISLS   ALA AYPHHL +SQRQQ C RPLPSSNSL+RLQRF IS RNSAGPPLPLV
Sbjct: 1   MSSISLSCALALAPAYPHHLPNSQRQQLCRRPLPSSNSLTRLQRFEISDRNSAGPPLPLV 60

Query: 61  LASAGAGVVDSFPIRGTYTVGDFMTRKENLYVVKPTTTVDEALEALVEKRITGFPVVDED 120
           LASAGAGV++SFPIRGTYTVGDFMTRKE+L+VVKPTTTVDEALE LVEKRITGFPVVDED
Sbjct: 61  LASAGAGVLNSFPIRGTYTVGDFMTRKEDLFVVKPTTTVDEALETLVEKRITGFPVVDED 120

Query: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMTP 180
           WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKV+GDLMTP
Sbjct: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVIGDLMTP 180

Query: 181 APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAERST 240
           APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAE ST
Sbjct: 181 APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAEGST 239

BLAST of Tan0021600 vs. ExPASy TrEMBL
Match: A0A5A7T6Y9 (CBS domain-containing protein CBSX1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold832G001350 PE=4 SV=1)

HSP 1 Score: 419.9 bits (1078), Expect = 7.4e-114
Identity = 217/239 (90.79%), Positives = 224/239 (93.72%), Query Frame = 0

Query: 1   MSSISLSNLHALALAYPHHLSHSQRQQWCHRPLPSSNSLSRLQRFGISGRNSAGPPLPLV 60
           M+SISLSN HALA  YPHHL HS RQQWC RPL SSNSLS+L RFGIS R  A PPLPLV
Sbjct: 1   MTSISLSNSHALARPYPHHLPHSHRQQWCSRPLLSSNSLSKLHRFGISDRFPARPPLPLV 60

Query: 61  LASAGAGVVDSFPIRGTYTVGDFMTRKENLYVVKPTTTVDEALEALVEKRITGFPVVDED 120
           LAS+GAGVVDSFP+RGTYTVGDFMTRKENLYVVKPTTTVDEALE LVEKRITGFPVVD+D
Sbjct: 61  LASSGAGVVDSFPLRGTYTVGDFMTRKENLYVVKPTTTVDEALEVLVEKRITGFPVVDDD 120

Query: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMTP 180
           WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMT 
Sbjct: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMTS 180

Query: 181 APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAERST 240
           +PL VRETSNLEDAARLLLETKYRRLPVV+ADGKLVGIITRGNVVRAALQIKRAAERST
Sbjct: 181 SPLAVRETSNLEDAARLLLETKYRRLPVVDADGKLVGIITRGNVVRAALQIKRAAERST 239

BLAST of Tan0021600 vs. ExPASy TrEMBL
Match: A0A1S3C251 (CBS domain-containing protein CBSX1, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103496172 PE=4 SV=1)

HSP 1 Score: 419.9 bits (1078), Expect = 7.4e-114
Identity = 217/239 (90.79%), Positives = 224/239 (93.72%), Query Frame = 0

Query: 1   MSSISLSNLHALALAYPHHLSHSQRQQWCHRPLPSSNSLSRLQRFGISGRNSAGPPLPLV 60
           M+SISLSN HALA  YPHHL HS RQQWC RPL SSNSLS+L RFGIS R  A PPLPLV
Sbjct: 1   MTSISLSNSHALARPYPHHLPHSHRQQWCSRPLLSSNSLSKLHRFGISDRFPARPPLPLV 60

Query: 61  LASAGAGVVDSFPIRGTYTVGDFMTRKENLYVVKPTTTVDEALEALVEKRITGFPVVDED 120
           LAS+GAGVVDSFP+RGTYTVGDFMTRKENLYVVKPTTTVDEALE LVEKRITGFPVVD+D
Sbjct: 61  LASSGAGVVDSFPLRGTYTVGDFMTRKENLYVVKPTTTVDEALEVLVEKRITGFPVVDDD 120

Query: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMTP 180
           WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMT 
Sbjct: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMTS 180

Query: 181 APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAERST 240
           +PL VRETSNLEDAARLLLETKYRRLPVV+ADGKLVGIITRGNVVRAALQIKRAAERST
Sbjct: 181 SPLAVRETSNLEDAARLLLETKYRRLPVVDADGKLVGIITRGNVVRAALQIKRAAERST 239

BLAST of Tan0021600 vs. ExPASy TrEMBL
Match: A0A6J1FQM3 (CBS domain-containing protein CBSX1, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111447897 PE=4 SV=1)

HSP 1 Score: 416.4 bits (1069), Expect = 8.1e-113
Identity = 218/239 (91.21%), Positives = 226/239 (94.56%), Query Frame = 0

Query: 1   MSSISLSNLHALALAYPHHLSHSQRQQWCHRPLPSSNSLSRLQRFGISGRNSAGPPLPLV 60
           MSSISLS + ALA AYPHHL +SQRQQ C RPLPSSNSL+RLQRF IS RNSAGPPL LV
Sbjct: 1   MSSISLSCVLALARAYPHHLPNSQRQQLCRRPLPSSNSLTRLQRFEISDRNSAGPPLHLV 60

Query: 61  LASAGAGVVDSFPIRGTYTVGDFMTRKENLYVVKPTTTVDEALEALVEKRITGFPVVDED 120
           LASAGAGV++SFPIRGTYTVGDFMTRKE+L+VVKPTTTVDEALE LVEKRITGFPVVDED
Sbjct: 61  LASAGAGVLNSFPIRGTYTVGDFMTRKEDLFVVKPTTTVDEALETLVEKRITGFPVVDED 120

Query: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMTP 180
           WNLVGVVSDYDLLALDSISGGTQSD NLFPDVDSSWKTFNEIQKLLSKTNGKV+GDLMTP
Sbjct: 121 WNLVGVVSDYDLLALDSISGGTQSDNNLFPDVDSSWKTFNEIQKLLSKTNGKVIGDLMTP 180

Query: 181 APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAERST 240
           APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRA E ST
Sbjct: 181 APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAGEGST 239

BLAST of Tan0021600 vs. ExPASy TrEMBL
Match: A0A0A0L716 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G202220 PE=4 SV=1)

HSP 1 Score: 416.0 bits (1068), Expect = 1.1e-112
Identity = 215/239 (89.96%), Positives = 222/239 (92.89%), Query Frame = 0

Query: 1   MSSISLSNLHALALAYPHHLSHSQRQQWCHRPLPSSNSLSRLQRFGISGRNSAGPPLPLV 60
           MSSISLSN H LA  YPHHL HS RQQWC RPL S+NSLS+L RFGIS R  A PPLPLV
Sbjct: 1   MSSISLSNSHPLARPYPHHLPHSHRQQWCSRPLLSTNSLSKLHRFGISDRFPARPPLPLV 60

Query: 61  LASAGAGVVDSFPIRGTYTVGDFMTRKENLYVVKPTTTVDEALEALVEKRITGFPVVDED 120
           LAS+GAGVVDSFP+RGTYTVGDFMTRKENLYVVKPTTTVDEALE LVEKRITGFPVVD+D
Sbjct: 61  LASSGAGVVDSFPLRGTYTVGDFMTRKENLYVVKPTTTVDEALEVLVEKRITGFPVVDDD 120

Query: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVGDLMTP 180
           WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLL KTNGKVVGDLMT 
Sbjct: 121 WNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLCKTNGKVVGDLMTS 180

Query: 181 APLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAAERST 240
           +PL VRETSNLEDAARLLLETKYRRLPVV+ADGKLVGIITRGNVVRAALQIKRAAERST
Sbjct: 181 SPLAVRETSNLEDAARLLLETKYRRLPVVDADGKLVGIITRGNVVRAALQIKRAAERST 239

BLAST of Tan0021600 vs. TAIR 10
Match: AT4G36910.1 (Cystathionine beta-synthase (CBS) family protein )

HSP 1 Score: 271.2 bits (692), Expect = 8.1e-73
Identity = 139/208 (66.83%), Positives = 171/208 (82.21%), Query Frame = 0

Query: 33  LPSSNSLSRLQRFGISGRNSAGPPLPLVLASAGAGVV--DSFPIRGTYTVGDFMTRKENL 92
           LP   S+    +F  S    +   +P   ++AG+ ++   S P  G YTVG+FMT+KE+L
Sbjct: 28  LPRFLSVQPCHKFTFSRSFPSKSRIPSASSAAGSTLMTNSSSPRSGVYTVGEFMTKKEDL 87

Query: 93  YVVKPTTTVDEALEALVEKRITGFPVVDEDWNLVGVVSDYDLLALDSISGGTQSDTNLFP 152
           +VVKPTTTVDEALE LVE RITGFPV+DEDW LVG+VSDYDLLALDSISG  +++ ++FP
Sbjct: 88  HVVKPTTTVDEALELLVENRITGFPVIDEDWKLVGLVSDYDLLALDSISGSGRTENSMFP 147

Query: 153 DVDSSWKTFNEIQKLLSKTNGKVVGDLMTPAPLVVRETSNLEDAARLLLETKYRRLPVVN 212
           +VDS+WKTFN +QKLLSKTNGK+VGDLMTPAPLVV E +NLEDAA++LLETKYRRLPVV+
Sbjct: 148 EVDSTWKTFNAVQKLLSKTNGKLVGDLMTPAPLVVEEKTNLEDAAKILLETKYRRLPVVD 207

Query: 213 ADGKLVGIITRGNVVRAALQIKRAAERS 239
           +DGKLVGIITRGNVVRAALQIKR+ +R+
Sbjct: 208 SDGKLVGIITRGNVVRAALQIKRSGDRN 235

BLAST of Tan0021600 vs. TAIR 10
Match: AT4G34120.1 (Cystathionine beta-synthase (CBS) family protein )

HSP 1 Score: 267.3 bits (682), Expect = 1.2e-71
Identity = 152/244 (62.30%), Positives = 185/244 (75.82%), Query Frame = 0

Query: 1   MSSISLSNLHALALAYPHHLSHSQRQQWCHRPLPSSNSLSRLQRFGISGRNSAGPPLPLV 60
           M SISLSN  ++ +     L+    Q +    LP S+S   L       R+S   P   V
Sbjct: 1   MGSISLSN--SMPITRLPLLTSLYHQSF----LPISSSSFSLLPLSNRRRSSTFSPSITV 60

Query: 61  ----LASAGAGVVDSFPIR-GTYTVGDFMTRKENLYVVKPTTTVDEALEALVEKRITGFP 120
                A A     +S P + G YTVGDFMT ++NL+VVKP+T+VD+ALE LVEK++TG P
Sbjct: 61  SAFFAAPASVNNNNSVPAKNGGYTVGDFMTPRQNLHVVKPSTSVDDALELLVEKKVTGLP 120

Query: 121 VVDEDWNLVGVVSDYDLLALDSISGGTQSDTNLFPDVDSSWKTFNEIQKLLSKTNGKVVG 180
           V+D++W LVGVVSDYDLLALDSISG +Q+DTNLFPDVDS+WKTFNE+QKL+SKT GKVVG
Sbjct: 121 VIDDNWTLVGVVSDYDLLALDSISGRSQNDTNLFPDVDSTWKTFNELQKLISKTYGKVVG 180

Query: 181 DLMTPAPLVVRETSNLEDAARLLLETKYRRLPVVNADGKLVGIITRGNVVRAALQIKRAA 240
           DLMTP+PLVVR+++NLEDAARLLLETK+RRLPVV+ADGKL+GI+TRGNVVRAALQIKR  
Sbjct: 181 DLMTPSPLVVRDSTNLEDAARLLLETKFRRLPVVDADGKLIGILTRGNVVRAALQIKRET 238

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
O23193 | 1.1e-71 | 66.83 | CBS domain-containing protein CBSX1, chloroplastic OS=Arabidopsis thaliana OX=37...
Q9C5D0 | 1.7e-70 | 62.30 | CBS domain-containing protein CBSX2, chloroplastic OS=Arabidopsis thaliana OX=37...
P42851 | 7.3e-10 | 30.43 | Inosine-5'-monophosphate dehydrogenase OS=Pyrococcus furiosus (strain ATCC 43587...
Q58821 | 2.1e-09 | 26.90 | Uncharacterized protein MJ1426 OS=Methanocaldococcus jannaschii (strain ATCC 430...
O58045 | 2.3e-08 | 30.43 | Inosine-5'-monophosphate dehydrogenase OS=Pyrococcus horikoshii (strain ATCC 700...

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038894434.1 | 1.9e-116 | 93.31 | CBS domain-containing protein CBSX2, chloroplastic-like [Benincasa hispida]
XP_023535982.1 | 3.1e-114 | 92.05 | CBS domain-containing protein CBSX1, chloroplastic-like [Cucurbita pepo subsp. p...
XP_022982496.1 | 3.1e-114 | 92.47 | CBS domain-containing protein CBSX1, chloroplastic-like [Cucurbita maxima]
KAG6600315.1 | 8.9e-114 | 92.05 | CBS domain-containing protein CBSX1, chloroplastic, partial [Cucurbita argyrospe...
KAG7030973.1 | 1.2e-113 | 91.63 | CBS domain-containing protein CBSX1, chloroplastic [Cucurbita argyrosperma subsp...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1IZH4 | 1.5e-114 | 92.47 | CBS domain-containing protein CBSX1, chloroplastic-like OS=Cucurbita maxima OX=3...
A0A5A7T6Y9 | 7.4e-114 | 90.79 | CBS domain-containing protein CBSX1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5...
A0A1S3C251 | 7.4e-114 | 90.79 | CBS domain-containing protein CBSX1, chloroplastic-like OS=Cucumis melo OX=3656 ...
A0A6J1FQM3 | 8.1e-113 | 91.21 | CBS domain-containing protein CBSX1, chloroplastic-like OS=Cucurbita moschata OX...
A0A0A0L716 | 1.1e-112 | 89.96 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G202220 PE=4 SV=1

TAIR 10
Match Name | E-value | Identity (%) | Description
AT4G36910.1 | 8.1e-73 | 66.83 | Cystathionine beta-synthase (CBS) family protein
AT4G34120.1 | 1.2e-71 | 62.30 | Cystathionine beta-synthase (CBS) family protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR000644 | CBS domain | SMART | SM00116 | cbs_1 | coord: 89..137; e-value: 5.3E-8; score: 42.6
IPR000644 | CBS domain | PFAM | PF00571 | CBS | coord: 80..134; e-value: 9.5E-14; score: 51.6
IPR000644 | CBS domain | PROSITE | PS51371 | CBS | coord: 84..144; score: 13.724406
None | No IPR available | GENE3D | 3.10.580.10 | - | coord: 76..196; e-value: 1.3E-49; score: 169.9
None | No IPR available | PANTHER | PTHR48108 | CBS DOMAIN-CONTAINING PROTEIN CBSX2, CHLOROPLASTIC | coord: 16..196
None | No IPR available | PANTHER | PTHR48108:SF12 | CBS DOMAIN-CONTAINING PROTEIN CBSX1, CHLOROPLASTIC-LIKE | coord: 16..196
None | No IPR available | SUPERFAMILY | 54631 | CBS-domain pair | coord: 76..196

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0021600.1 | Tan0021600.1 | mRNA
Tan0021600.2 | Tan0021600.2 | mRNA
Tan0021600.3 | Tan0021600.3 | mRNA