Cp4.1LG10g09030 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG10g09030
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: CBS domain-containing protein CBSX5
Location: Cp4.1LG10 : 4763472 .. 4765684 (+)
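The span covered by the location can be worked out from its start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the GFF3 convention); the function name `feature_length` is illustrative and is not part of the database.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
print(feature_length(4763472, 4765684))  # 2213
```

Under that assumption the gene spans 2213 bp on the plus strand of Cp4.1LG10.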
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
AATAATAGTGGAATGAAGTTAGAAACGAACTCACGAGCGTCTTCCTCCTCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTAGAAGCTTCTTCAACCTCAGTTTCTCTCTCCTCTGTGTTCGTCTTCCTGCTTGTGCATGGCTATGAGGTTATTGGCTCATGAGGTCTCTGACATATGCCTTGGAAAGCCCCCACTAAGGTCCATCCCTGTCTCCGCCACACTCGCCCATGCCCTCTCTGTGCTCAAGAGGCTCCGTCAAAACTACATCAGCGTTTGGAGTTGCACCTGCCATTCCTCCAAAACTGCCTCCGATCATGATTGCCGATGTATTGGCAAGGTTTCTGTTGTTGATGTTATCTTATTCCTTTGTAGAGAAGAGAACCTCTCGCAACCTGCTGTCGCGCTTCAATCTTATGTTTCGATTCTCATTCCTGAGGTTCCTGTCGTCGTTAGGTATTTAGAGCCTCATGCTAGGTGCGTAATTCTGTTTTCCTTCACTTTATCTTGATTCGAGTATATATTGTTCTTCCGTTTTGAATTTTCTTGTTTTTGTATTCTGATTCACGTTTGCTTTTTTTTCCTGGAAAATTTGTTGTGTTCTTGGTTTCTCTGGAGTTGGAATCGAATATTGTCGTGTTTTCTTTGCTCCGTTGATGACGAAAACGTGCAATTTTCATTTACATCTCCTTCGATTTCGGCCTATTTAAACACCATCTCTGTAATTCACTCTGATCTTGAAAATCAATCTCAGAACTATGAACTGCAAAACTGTATGATCCATGATTATTGATTTTTTATTTGAATATAATATTTCCTGTTTCTCTTTTTTATCCAATTTTTTCTTAGCTAGATTTTCTTGTTGGCTTTCTGCATGACTTTTGAATTAATAATCCTCTGTGTTTCCCTTGAAACTTTTACGACATTCTCAATCGTCTCCTTTAATCTGCATTAGAAAACTGAATTTCTCTACAATATTAGAGCTTAATCGATCGATAACGATGATCCTAGAAGCTTCGAGCAAAAGGCTGATTGTGCAGAATTCTAATGTCAGGAGTCATTTCGTTTACCAATTCTTCCATGTAGCCGAAATATATATAGCTTTTAATGTATAAACTGATAGCTTTCTCCGATAACTTTCAGCTTGGTGGAAGCCATAGATCTCCTCCTCGAAGGCGCGCAAAATCTCGTAGTCCCAATTCAAACTAAAACCTCTGCAAAGTCCAGAAAGAAGGTTCTCAAGGAAGTTGCACCATTCGACTGCCCGCTTCACAATGACCTTGAATACTGTTGGCTCACCCAAGAGGACATAATCCGTTACCTCCTCAACTCTATTGGGCTTTTTTGCCCAACCTCCATCACTCCCATTAATTCCCTCAACGCCATCGACACAGCTAACATCCTTGCTGTACACTACGACGATCCTGCATTTTCTGCCCTGCCCCTCCTTTCTCAAGCCATCATCCAGCAATCCTCCGTTGCAATTGTCGATTCAGATGGGAAGTTGATCGGAGAAATCTCACCCCTCACAATGAATTCTTGTGATGTAACTGTAACGGCTGCGATTGCGACACTTTCAGCTGGTGAGCTAATGGCGTATGTTGACTATGGTAACCCACCGGAGGATTTGGTTCATTTGATCAAAGACAGACTGGAAGAAAGAAACTTAGGGGCACTGTTGGACTGGGTGGAAGAAGAGACAGCAATATTCTCATGTTCTTCATTTTGTTCATCTTCCTCAGATGATGATTCTGTCTCTAGCTGGGGAAGCAGTGGAAGGTTACGAAGGTGTTCAACCAGGCAAGTGAGGAGCTCAGAGGCTGCTGTGTGCAATCCCCGGAGTTCATTGGTGGCAGTGATGATTCAAGCTCTTGCGCGCCGTGTTGCAAGTATGTGGGTGATCGAAGAAGATGGAGCCTTGGTCGGTATCATCACGTTTGCATCGATGCTGAAGATTTTCCAGGAGCACTTGAAATCCATGCGTTAAAAGGACAACAGC
AACAGCTTTGCCTAATCTTAACCCGAAACTACATTAATAACGTTCTACATTGGGTGTGGCTTTCAACTCAGTTGTGTTATTTTGCTCACTTTATTTTGGATCCAATGTGAATCATGTAAATCCTTAACACGTACAGACTCACTTTCTATAAGTTCTGATGATAACCGTAGATTATTGCAGCTTTCTAAACTTCTTTTTTCCAAAAGTATCTAT

mRNA sequence

AATAATAGTGGAATGAAGTTAGAAACGAACTCACGAGCGTCTTCCTCCTCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTAGAAGCTTCTTCAACCTCAGTTTCTCTCTCCTCTGTGTTCGTCTTCCTGCTTGTGCATGGCTATGAGGTTATTGGCTCATGAGGTCTCTGACATATGCCTTGGAAAGCCCCCACTAAGGTCCATCCCTGTCTCCGCCACACTCGCCCATGCCCTCTCTGTGCTCAAGAGGCTCCGTCAAAACTACATCAGCGTTTGGAGTTGCACCTGCCATTCCTCCAAAACTGCCTCCGATCATGATTGCCGATGTATTGGCAAGGTTTCTGTTGTTGATGTTATCTTATTCCTTTGTAGAGAAGAGAACCTCTCGCAACCTGCTGTCGCGCTTCAATCTTATGTTTCGATTCTCATTCCTGAGGTTCCTGTCGTCGTTAGGTATTTAGAGCCTCATGCTAGCTTGGTGGAAGCCATAGATCTCCTCCTCGAAGGCGCGCAAAATCTCGTAGTCCCAATTCAAACTAAAACCTCTGCAAAGTCCAGAAAGAAGGTTCTCAAGGAAGTTGCACCATTCGACTGCCCGCTTCACAATGACCTTGAATACTGTTGGCTCACCCAAGAGGACATAATCCGTTACCTCCTCAACTCTATTGGGCTTTTTTGCCCAACCTCCATCACTCCCATTAATTCCCTCAACGCCATCGACACAGCTAACATCCTTGCTGTACACTACGACGATCCTGCATTTTCTGCCCTGCCCCTCCTTTCTCAAGCCATCATCCAGCAATCCTCCGTTGCAATTGTCGATTCAGATGGGAAGTTGATCGGAGAAATCTCACCCCTCACAATGAATTCTTGTGATGTAACTGTAACGGCTGCGATTGCGACACTTTCAGCTGGTGAGCTAATGGCGTATGTTGACTATGGTAACCCACCGGAGGATTTGGTTCATTTGATCAAAGACAGACTGGAAGAAAGAAACTTAGGGGCACTGTTGGACTGGGTGGAAGAAGAGACAGCAATATTCTCATGTTCTTCATTTTGTTCATCTTCCTCAGATGATGATTCTGTCTCTAGCTGGGGAAGCAGTGGAAGGTTACGAAGGTGTTCAACCAGGCAAGTGAGGAGCTCAGAGGCTGCTGTGTGCAATCCCCGGAGTTCATTGGTGGCAGTGATGATTCAAGCTCTTGCGCGCCGTGTTGCAAGTATGTGGGTGATCGAAGAAGATGGAGCCTTGGTCGGTATCATCACGTTTGCATCGATGCTGAAGATTTTCCAGGAGCACTTGAAATCCATGCGTTAAAAGGACAACAGCAACAGCTTTGCCTAATCTTAACCCGAAACTACATTAATAACGTTCTACATTGGGTGTGGCTTTCAACTCAGTTGTGTTATTTTGCTCACTTTATTTTGGATCCAATGTGAATCATGTAAATCCTTAACACGTACAGACTCACTTTCTATAAGTTCTGATGATAACCGTAGATTATTGCAGCTTTCTAAACTTCTTTTTTCCAAAAGTATCTAT

Coding sequence (CDS)

ATGGCTATGAGGTTATTGGCTCATGAGGTCTCTGACATATGCCTTGGAAAGCCCCCACTAAGGTCCATCCCTGTCTCCGCCACACTCGCCCATGCCCTCTCTGTGCTCAAGAGGCTCCGTCAAAACTACATCAGCGTTTGGAGTTGCACCTGCCATTCCTCCAAAACTGCCTCCGATCATGATTGCCGATGTATTGGCAAGGTTTCTGTTGTTGATGTTATCTTATTCCTTTGTAGAGAAGAGAACCTCTCGCAACCTGCTGTCGCGCTTCAATCTTATGTTTCGATTCTCATTCCTGAGGTTCCTGTCGTCGTTAGGTATTTAGAGCCTCATGCTAGCTTGGTGGAAGCCATAGATCTCCTCCTCGAAGGCGCGCAAAATCTCGTAGTCCCAATTCAAACTAAAACCTCTGCAAAGTCCAGAAAGAAGGTTCTCAAGGAAGTTGCACCATTCGACTGCCCGCTTCACAATGACCTTGAATACTGTTGGCTCACCCAAGAGGACATAATCCGTTACCTCCTCAACTCTATTGGGCTTTTTTGCCCAACCTCCATCACTCCCATTAATTCCCTCAACGCCATCGACACAGCTAACATCCTTGCTGTACACTACGACGATCCTGCATTTTCTGCCCTGCCCCTCCTTTCTCAAGCCATCATCCAGCAATCCTCCGTTGCAATTGTCGATTCAGATGGGAAGTTGATCGGAGAAATCTCACCCCTCACAATGAATTCTTGTGATGTAACTGTAACGGCTGCGATTGCGACACTTTCAGCTGGTGAGCTAATGGCGTATGTTGACTATGGTAACCCACCGGAGGATTTGGTTCATTTGATCAAAGACAGACTGGAAGAAAGAAACTTAGGGGCACTGTTGGACTGGGTGGAAGAAGAGACAGCAATATTCTCATGTTCTTCATTTTGTTCATCTTCCTCAGATGATGATTCTGTCTCTAGCTGGGGAAGCAGTGGAAGGTTACGAAGGTGTTCAACCAGGCAAGTGAGGAGCTCAGAGGCTGCTGTGTGCAATCCCCGGAGTTCATTGGTGGCAGTGATGATTCAAGCTCTTGCGCGCCGTGTTGCAAGTATGTGGGTGATCGAAGAAGATGGAGCCTTGGTCGGTATCATCACGTTTGCATCGATGCTGAAGATTTTCCAGGAGCACTTGAAATCCATGCGTTAA
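The mRNA and CDS sequences above can be used to recover the untranslated regions named in the legend. The following is a minimal sketch, assuming the CDS occurs as one contiguous substring of the mRNA; the helper `split_utrs` is hypothetical and the sequences in the example are toy data, not this record.

```python
from typing import Tuple


def split_utrs(mrna: str, cds: str) -> Tuple[str, str]:
    """Return (five_prime_UTR, three_prime_UTR) by locating the CDS in the mRNA."""
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found as a contiguous substring of the mRNA")
    return mrna[:start], mrna[start + len(cds):]


# Toy example: 4 nt of 5' UTR, a 9 nt CDS, 3 nt of 3' UTR.
five, three = split_utrs("GGGG" + "ATGAAATAA" + "CCC", "ATGAAATAA")
print(len(five), len(three))  # 4 3
```

Passing the full mRNA and CDS strings from this page to `split_utrs` would give the two UTR sequences directly.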

Protein sequence

MAMRLLAHEVSDICLGKPPLRSIPVSATLAHALSVLKRLRQNYISVWSCTCHSSKTASDHDCRCIGKVSVVDVILFLCREENLSQPAVALQSYVSILIPEVPVVVRYLEPHASLVEAIDLLLEGAQNLVVPIQTKTSAKSRKKVLKEVAPFDCPLHNDLEYCWLTQEDIIRYLLNSIGLFCPTSITPINSLNAIDTANILAVHYDDPAFSALPLLSQAIIQQSSVAIVDSDGKLIGEISPLTMNSCDVTVTAAIATLSAGELMAYVDYGNPPEDLVHLIKDRLEERNLGALLDWVEEETAIFSCSSFCSSSSDDDSVSSWGSSGRLRRCSTRQVRSSEAAVCNPRSSLVAVMIQALARRVASMWVIEEDGALVGIITFASMLKIFQEHLKSMR
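The protein sequence is the translation of the CDS. A minimal sketch of that translation is given below, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is illustrative, and only the first and last codons of the CDS are used as examples.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGCTATGAGGTTATTGGCT"))  # MAMRLLA (first seven codons of the CDS)
print(translate("ATGCGTTAA"))              # MR (last three codons, ending in TAA)
```

Both fragments agree with the start (MAMRLLA) and end (MR) of the protein sequence listed above.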
BLAST of Cp4.1LG10g09030 vs. Swiss-Prot
Match: CBSX5_ARATH (CBS domain-containing protein CBSX5 OS=Arabidopsis thaliana GN=CBSX5 PE=2 SV=2)

HSP 1 Score: 296.6 bits (758), Expect = 4.0e-79
Identity = 168/398 (42.21%), Positives = 254/398 (63.82%), Query Frame = 1

Query: 1   MAMRLLAHEVSDICLGKPPLRSIPVSAT-LAHALSVLKRLRQNYISVWSCTCHSSKTASD 60
           MA+ LL++ VSD+CLGKPPLR +  S++ ++ A++ LK     ++SVW+C          
Sbjct: 1   MALSLLSYNVSDLCLGKPPLRCLSSSSSSVSDAIAALKSSEDTFLSVWNCNHDDDNNT-- 60

Query: 61  HDCRCIGKVSVVDVILFLCREENLSQPAVALQSYVSILIPEVPVVVRYLEPHASLVEAID 120
            +C C+GK+S+ DVI  L ++ + S    AL S VS+L+P+   +V +++P  SL+EAID
Sbjct: 61  -ECECLGKISMADVICHLSKDHDHS--LCALNSSVSVLLPKTRSIVLHVQPSCSLIEAID 120

Query: 121 LLLEGAQNLVVPIQTKTSAKSRKKVLKEVAPFDCPLHNDLEYCWLTQEDIIRYLLNSIGL 180
           L+++GAQNL+VPI TK   K +K+    V+       N   +CW+TQEDII++LL  I  
Sbjct: 121 LIIKGAQNLIVPIHTKPYTK-KKQHNDNVSVTTTTHSNGQRFCWITQEDIIQFLLGFIAA 180

Query: 181 FCPTSITPINSLNAID-TANILAVHYDDPAFSALPLLSQAIIQQSSVAIVDSDG-----K 240
           F P     ++ L  I+ T  ++AV Y   A + +  +S A+  Q+SVA+VD +G      
Sbjct: 181 FSPLPAMSLSDLGVINSTHTVVAVDYHSSASAVVSAVSNALAVQTSVAVVDGEGDDPFTS 240

Query: 241 LIGEISPLTMNSCDVTVTAAIATLSAGELMAYVDYGNPPEDLVHLIKDRLEERNLGALLD 300
           LIGEISP+T+  CD T  AA+ATLSAG+LMAY+D  NPPE LV ++++RLE++ L  L+ 
Sbjct: 241 LIGEISPMTLTCCDETAAAAVATLSAGDLMAYIDGANPPESLVQIVRNRLEDKGLIGLMS 300

Query: 301 WVEEETAIFSCSSFCSSSSDDDSVSSWGSSGRLRRCSTRQVRSSEAAVCNPRSSLVAVMI 360
             +  ++  + S +   SS++++     S GR    S R  R SEA VCNP+SSL+AVMI
Sbjct: 301 LFDSLSSYSTSSGY---SSEEEAPVRTTSYGRSMSSSARMARKSEAIVCNPKSSLMAVMI 360

Query: 361 QALARRVASMWVIEEDGALVGIITFASMLKIFQEHLKS 392
           QA+A RV   WV+E+DG  VG++TF  +LK+F++ L++
Sbjct: 361 QAVAHRVNYAWVVEKDGCFVGMVTFVDILKVFRKFLEN 389
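The percentages in an HSP header are the identical (or positive-scoring) positions divided by the alignment length, which includes gap columns. The sketch below reproduces the Swiss-Prot figures for CBSX5_ARATH; the function name `percent` is illustrative only.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Percentage of alignment columns that match, rounded to two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts from the HSP header: 168/398 identities and 254/398 positives.
print(percent(168, 398))  # 42.21
print(percent(254, 398))  # 63.82
```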

BLAST of Cp4.1LG10g09030 vs. TrEMBL
Match: A0A0A0KHL3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G517230 PE=4 SV=1)

HSP 1 Score: 596.7 bits (1537), Expect = 2.1e-167
Identity = 312/396 (78.79%), Positives = 350/396 (88.38%), Query Frame = 1

Query: 1   MAMRLLAHEVSDICLGKPPLRSIPVSATLAHALSVLKRLRQNYISVWSCTCHSSKTASDH 60
           MA+RLL H++SDICLGKP L SI +SATLA ALS LK+L +NYISVW+C+ H SK++S +
Sbjct: 1   MAVRLLDHQLSDICLGKPALTSISLSATLADALSALKKLGENYISVWNCSSHYSKSSSHY 60

Query: 61  DCRCIGKVSVVDVILFLCREENLSQPAVALQSYVSILIPEVPVVVRYLEPHASLVEAIDL 120
           DCRCIGK+SV+DV+LFLC+EENLSQPA+ALQS VS+LIP VPV+VR+LEPHASL+EAIDL
Sbjct: 61  DCRCIGKISVLDVVLFLCKEENLSQPALALQSSVSVLIPPVPVLVRHLEPHASLMEAIDL 120

Query: 121 LLEGAQNLVVPIQTKTSAKSRKKVLKEVAPFDCPLHNDLEYCWLTQEDIIRYLLNSIGLF 180
           LLEGAQNLVVPIQT+TSAKSR+KVL+ VAPFDCPLHN LEYCW+TQEDIIRYLLNSIGLF
Sbjct: 121 LLEGAQNLVVPIQTRTSAKSREKVLEVVAPFDCPLHNGLEYCWITQEDIIRYLLNSIGLF 180

Query: 181 CPTSITPINSLNAIDTANILAVHYDDPAFSALPLLSQAIIQQSSVAIVDSDGKLIGEISP 240
            PTSITP+NSLNAIDT NILA+HYDDPA SALPLLSQAII QSS+AIVDSDGKLIGEISP
Sbjct: 181 SPTSITPVNSLNAIDTVNILALHYDDPALSALPLLSQAIIHQSSIAIVDSDGKLIGEISP 240

Query: 241 LTMNSCDVTVTAAIATLSAGELMAYVDYGNPPEDLVHLIKDRLEERNLGALLDWVEEE-- 300
           LT+NS D T+TAAI TLSAGELMAYV+  +PPE LV L+KDRLE RNL  LL+WVEEE  
Sbjct: 241 LTLNSFDETITAAIVTLSAGELMAYVNCNDPPEYLVQLVKDRLEGRNLRGLLEWVEEESA 300

Query: 301 -TAIFSCSSFCSSSSDDDSVSSWGSSGRLRRCSTRQV-RSSEAAVCNPRSSLVAVMIQAL 360
            +A+ SCSSFCSSSSDDDS S WG SG+LR+CSTRQV RSSE AVCNPRSSLVAVMIQAL
Sbjct: 301 MSAMSSCSSFCSSSSDDDSGSWWGRSGKLRKCSTRQVRRSSEVAVCNPRSSLVAVMIQAL 360

Query: 361 ARRVASMWVIEEDGALVGIITFASMLKIFQEHLKSM 393
           A RV  MWV EED  LVGIITF SMLK+F E LKSM
Sbjct: 361 ALRVPYMWVTEEDECLVGIITFTSMLKVFHERLKSM 396

BLAST of Cp4.1LG10g09030 vs. TrEMBL
Match: W9RD98_9ROSA (CBS domain-containing protein CBSX5 OS=Morus notabilis GN=L484_012099 PE=4 SV=1)

HSP 1 Score: 469.9 bits (1208), Expect = 3.0e-129
Identity = 250/396 (63.13%), Positives = 309/396 (78.03%), Query Frame = 1

Query: 1   MAMRLLAHEVSDICLGKPPLRSIPVSATLAHALSVLKRLRQNYISVWSCTCHSSKTA--- 60
           MA+ LLA EVSD+CLGKP LR++ V+AT+  ALS LKRL Q Y+SVWSC  HSSK     
Sbjct: 1   MAVNLLAREVSDLCLGKPALRALLVTATVGEALSALKRLGQTYLSVWSCD-HSSKIGTGG 60

Query: 61  SDHDCRCIGKVSVVDVILFLCREENLSQPAVALQSYVSILIPEVPVVVRYLEPHASLVEA 120
           S  +CRC+GKV V D I FLC+EENL  PA ALQ+ V +LIP+VP +VR+LEP+ASL+EA
Sbjct: 61  SAGNCRCVGKVCVADAICFLCKEENLKSPATALQASVLVLIPKVPGLVRHLEPNASLLEA 120

Query: 121 IDLLLEGAQNLVVPIQTKTSAKSRKKVLKEVAPFDCPLHNDLEYCWLTQEDIIRYLLNSI 180
           +DL+LEGAQNLV+PIQT++S+ + +K L +   F+   H++ EYCW+TQEDIIRYLLN I
Sbjct: 121 VDLILEGAQNLVIPIQTRSSSSNSRKNLLQKPSFNSARHDNREYCWITQEDIIRYLLNCI 180

Query: 181 GLFCPTSITPINSLNAIDTANILAVHYDDPAFSALPLLSQAIIQQSSVAIVDSDGKLIGE 240
           GLF P   +PIN+LN ID+ +ILAV+YDDPA S LPL+SQA++ Q+SVA+VD+D KLIGE
Sbjct: 181 GLFSPIPASPINALNLIDSKHILAVNYDDPAASTLPLISQALVCQTSVAVVDTDSKLIGE 240

Query: 241 ISPLTMNSCDVTVTAAIATLSAGELMAYVDYGNPPEDLVHLIKDRLEERNLGALLDWVEE 300
           ISP T+NSCD TV AAIATLSAG+LMAY+D G PPEDLV L+KDRLEERN GALLD +EE
Sbjct: 241 ISPFTLNSCDETVAAAIATLSAGDLMAYIDCGGPPEDLVQLVKDRLEERNCGALLDLMEE 300

Query: 301 E-TAIFSCSSFCSSSSDDDSVSSWGSSGRLRRCSTRQVRSSEAAVCNPRSSLVAVMIQAL 360
           + + I S SS CSSSSD++       SGR    S R VR SEA VC+P SSLVAVM+QAL
Sbjct: 301 DYSTISSSSSLCSSSSDEE-------SGRFGGNSARMVRRSEAIVCHPWSSLVAVMVQAL 360

Query: 361 ARRVASMWVIEEDGALVGIITFASMLKIFQEHLKSM 393
           A R++ MWV+E DG L+GI+TFA MLK+F+E L  M
Sbjct: 361 AHRLSYMWVVEADGTLMGIVTFAGMLKVFRERLNLM 388

BLAST of Cp4.1LG10g09030 vs. TrEMBL
Match: A0A061GEK1_THECC (CBS domain-containing protein OS=Theobroma cacao GN=TCM_029902 PE=4 SV=1)

HSP 1 Score: 461.1 bits (1185), Expect = 1.4e-126
Identity = 249/398 (62.56%), Positives = 308/398 (77.39%), Query Frame = 1

Query: 1   MAMRLLAHEVSDICLGKPPLRSIPVSATLAHALSVLKRLRQNYISVWSCTCH----SSKT 60
           MA+ LL  EVSD+CLGKP LRS+ +SAT+ HALSVLKR   NYISVW+C       + KT
Sbjct: 1   MAVSLLEREVSDLCLGKPALRSLSISATVGHALSVLKRFGDNYISVWNCDHRHLPDADKT 60

Query: 61  -ASDHDCRCIGKVSVVDVILFLCREENLSQPAVALQSYVSILIPEVPVVVRYLEPHASLV 120
            A   +CRC+GKV +VD+I FLC+EENLS P  ALQ+ VS+LIP+VP ++R+LEP+ASLV
Sbjct: 61  DAGFEECRCVGKVCMVDIICFLCKEENLSNPGTALQAPVSVLIPKVPGLIRHLEPNASLV 120

Query: 121 EAIDLLLEGAQNLVVPIQTKTSAKSRKKVLKEVAPFDCPLHNDLEYCWLTQEDIIRYLLN 180
           EA+DL+LEGAQNLV+P+++ T+  SRKK+L ++   +  LHN+ EYCWLTQEDIIRYLLN
Sbjct: 121 EAMDLILEGAQNLVIPLESGTT-NSRKKLL-QITLSNSTLHNNREYCWLTQEDIIRYLLN 180

Query: 181 SIGLFCPTSITPINSLNAIDTANILAVHYDDPAFSALPLLSQAIIQQSSVAIVDSDGKLI 240
           SIGLF PT + PINSLN IDT NILAVHYDDPA  ALP ++Q++  Q+SVAIVD+DGKLI
Sbjct: 181 SIGLFSPTPVNPINSLNIIDTQNILAVHYDDPASLALPFIAQSLEMQTSVAIVDTDGKLI 240

Query: 241 GEISPLTMNSCDVTVTAAIATLSAGELMAYVDYGNPPEDLVHLIKDRLEERNLGALLDWV 300
           GEISP T+NSC   V AAIATLSAG+LMAY+D G  PEDL+ L+K+RL+ERNL   L+ +
Sbjct: 241 GEISPFTLNSCGEDVAAAIATLSAGDLMAYIDCGGRPEDLIQLVKERLQERNLEQALELM 300

Query: 301 EEETAIFSCSSFCSS-SSDDDSVSSWGSSGRLRRCSTRQVRSSEAAVCNPRSSLVAVMIQ 360
           EE++ I S +SF SS SS  D     G  GRL   S R VR SEA VC P SSLVAVMIQ
Sbjct: 301 EEDSGISSGASFSSSYSSSSDEEFGVGRGGRLGGYSARLVRRSEAIVCYPWSSLVAVMIQ 360

Query: 361 ALARRVASMWVIEEDGALVGIITFASMLKIFQEHLKSM 393
           ALA RV+ +WV+E+DG L GI+TFA M+K+F+E L+SM
Sbjct: 361 ALAHRVSYVWVVEDDGTLAGIVTFAGMMKVFRERLRSM 396

BLAST of Cp4.1LG10g09030 vs. TrEMBL
Match: A0A0A0LV40_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G073740 PE=4 SV=1)

HSP 1 Score: 452.2 bits (1162), Expect = 6.4e-124
Identity = 252/398 (63.32%), Positives = 298/398 (74.87%), Query Frame = 1

Query: 1   MAMRLLAHEVSDICLGKPPLRSIPVSATLAHALSVLKRLRQNYISVWSCTCHSSKTA-SD 60
           MA  LLA EVSD+CLGKP LRSI +SATLA ALS+L ++ + YISVWSC  HSS  A SD
Sbjct: 1   MAASLLACEVSDLCLGKPALRSISISATLADALSILTKIDEGYISVWSCGDHSSSKADSD 60

Query: 61  HDCRCIGKVSVVDVILFLCREENLSQPAVALQSYVSILIPEVPVVVRYLEPHASLVEAID 120
             CRC+GKV +VD+I FLCR+ENL QPA+ LQS +S+LIPE   +VR+LEPHASL+EAID
Sbjct: 61  LHCRCVGKVCMVDIICFLCRQENLLQPAIGLQSPISVLIPEGFELVRHLEPHASLMEAID 120

Query: 121 LLLEGAQNLVVPIQTKTSAKSRKKVLKE-VAPFDCPLHNDLEYCWLTQEDIIRYLLNSIG 180
           L+ +G  NLV+PI  K S   RK +LK+ +A     LHND EYCWL  EDIIRYLLNSIG
Sbjct: 121 LIHDGVHNLVIPI--KMSISKRKNILKKSLANSISSLHNDQEYCWLAPEDIIRYLLNSIG 180

Query: 181 LFCPTSITPINSLNAIDTANILAVHYDDPAFSALPLLSQAIIQQSSVAIVDSDGKLIGEI 240
           LF  T+  PINS N IDT NILAV YD+ A S LPL+SQA+I QSSVAIVD D KLIGEI
Sbjct: 181 LFSTTAANPINSFNIIDTNNILAVRYDESALSILPLISQALIHQSSVAIVDLDDKLIGEI 240

Query: 241 SPLTMNSCDVTVTAAIATLSAGELMAYVDYGNPPEDLVHLIKDRLEERNLGALLDWVEEE 300
           SP T+N CD TV AAIATL+AGELM Y+D G PP+DLV L+K+RLEE+NL A+L+WVEEE
Sbjct: 241 SPFTLNFCDETVVAAIATLTAGELMGYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEE 300

Query: 301 --TAIFSCSSFCSSSSDD----DSVSSWGSSGRLRRCSTRQVRSSEAAVCNPRSSLVAVM 360
             T   S SS CSSS D+     S S  G SGR+   S R +R SEA VC P +SLVAVM
Sbjct: 301 SLTISSSSSSICSSSDDEFGCGSSSSGSGRSGRICGYSARVMRRSEAIVCYPWNSLVAVM 360

Query: 361 IQALARRVASMWVIEEDGALVGIITFASMLKIFQEHLK 391
           IQALA RV+ MWVI+EDG L G +TF S+L +F++ LK
Sbjct: 361 IQALAHRVSYMWVIQEDGTLAGTVTFPSLLAVFRDRLK 396

BLAST of Cp4.1LG10g09030 vs. TrEMBL
Match: A0A067JB04_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21418 PE=4 SV=1)

HSP 1 Score: 440.7 bits (1132), Expect = 1.9e-120
Identity = 239/398 (60.05%), Positives = 304/398 (76.38%), Query Frame = 1

Query: 1   MAMRLLAHEVSDICLGKPPLRSIPVSATLAHALSVLKRLRQNYISVWSCTC--HSSKTAS 60
           MA+ LLA EVSD+CLGKP LRS+ +SAT+  ALS LKR    Y+SVWSC    H+S++  
Sbjct: 1   MAVSLLAREVSDLCLGKPALRSLSISATVGDALSALKRSEDPYLSVWSCDHRRHASRSKI 60

Query: 61  D-HDCRCIGKVSVVDVILFLCREENLSQPAVALQSYVSILIPEVPVVVRYLEPHASLVEA 120
           D + CRC+GKV +VDVI FLC+E+NL  P+ ALQ  VS+L+P+VP +VR+L+PHASL+EA
Sbjct: 61  DVNGCRCMGKVCMVDVICFLCKEDNLKNPSSALQEPVSVLLPKVPGLVRHLDPHASLLEA 120

Query: 121 IDLLLEGAQNLVVPIQTKTSAKSRKKVLKEVAPFDCPLHNDLEYCWLTQEDIIRYLLNSI 180
           IDL+LEGAQN+V+P+    S  +RKK+  + +  +  LHN+ EYCWLTQEDI+RYLLN I
Sbjct: 121 IDLILEGAQNIVIPVH---SPYTRKKLTYKPSS-NSTLHNNREYCWLTQEDIVRYLLNCI 180

Query: 181 GLFCPTSITPINSLNAIDTANILAVHYDDPAFSALPLLSQAIIQQSSVAIVDSDGKLIGE 240
           GLFCPT    I  L  IDT +  A+ YD+PA SALPL+SQ++++Q+SVAIVD DGKLIGE
Sbjct: 181 GLFCPTPNHTIEFLKIIDTKSSSAICYDEPASSALPLISQSLVKQTSVAIVDIDGKLIGE 240

Query: 241 ISPLTMNSCDVTVTAAIATLSAGELMAYVDYGNPPEDLVHLIKDRLEERNLGALLDWVEE 300
           ISP T+NSCD +V AA+ATLSAG+LMAYVD  +PPEDLV L+K+RLEE NLGA +D +EE
Sbjct: 241 ISPYTLNSCDESVAAAVATLSAGDLMAYVDCCSPPEDLVRLVKERLEEMNLGAAVDLLEE 300

Query: 301 ETAI---FSCSSFCSSSSDDDSVSSWGSSGRLRRCSTRQVRSSEAAVCNPRSSLVAVMIQ 360
            + I    SCSS+ SSS ++  +   G SGRL   STR VR ++A VC P SSLVAVMIQ
Sbjct: 301 ASVISPSSSCSSYSSSSDEEFGLGQLGRSGRLGGHSTRVVRRTDAIVCFPWSSLVAVMIQ 360

Query: 361 ALARRVASMWVIEEDGALVGIITFASMLKIFQEHLKSM 393
           AL+ RV+  WV+EEDG LVG++TFA ML +F+E LKSM
Sbjct: 361 ALSHRVSCAWVVEEDGTLVGVVTFAGMLNVFRERLKSM 394

BLAST of Cp4.1LG10g09030 vs. TAIR10
Match: AT5G53750.1 (AT5G53750.1 CBS domain-containing protein)

HSP 1 Score: 299.3 bits (765), Expect = 3.5e-81
Identity = 178/411 (43.31%), Positives = 263/411 (63.99%), Query Frame = 1

Query: 1   MAMRLLAHEVSDICLGKPPLRSIPVS-ATLAHALSVLKRLRQNYISVWSCTCHSSKTASD 60
           MA+ LL+HE+SD+C+GKPPLR + V+ AT+A A++ LK   + +++VWSC  H  KT  +
Sbjct: 1   MALTLLSHELSDLCIGKPPLRCLSVATATVADAIAALKSSDEPFLTVWSCN-HDEKTDDN 60

Query: 61  HDCRCIGKVSVVDVILFLCR-EENLSQPAVALQSYVSILIPEVPVVVRYLEPHASLVEAI 120
             C C+GK+ + DVI +L + + N+   + A  + VS+L+P+   +V +++   SL+EAI
Sbjct: 61  DKCECLGKICMADVICYLSKFDNNVLSLSSAFDASVSVLLPKSRALVVHVQSSCSLIEAI 120

Query: 121 DLLLEGAQNLVVPIQTKTSAKSRKK-------VLKEVAPFDCPLH-NDLEYCWLTQEDII 180
           DL+++GAQNL+VPI TK+  K R++       V+  +       H N  E+CW+TQEDII
Sbjct: 121 DLIIKGAQNLIVPIHTKSITKRRQQQKLLKRNVVVSLTNATSTTHKNSREFCWITQEDII 180

Query: 181 RYLLNSIGLFCPTSITPINSLNAID-TANILAVHYDDPAFSALPLLSQAIIQQSSVAIV- 240
           R+LL+SI +F P     I+ L  I+ T  ILAV Y   A SA+  +S+AI+   SVA+V 
Sbjct: 181 RFLLDSISVFSPLPSLSISDLGVINSTHTILAVDYYSSAASAVSAISRAILDNVSVAVVG 240

Query: 241 ------DSDGKLIGEISPLTMNSCDVTVTAAIATLSAGELMAYVDYGNPPEDLVHLIKDR 300
                 D    LIGEISP+T+  CD T  AA+ATLSAG+LM+Y+D   PPE LV ++++R
Sbjct: 241 KGCDQEDPCMVLIGEISPMTLACCDETAVAAVATLSAGDLMSYIDGSGPPESLVGVVRNR 300

Query: 301 LEERNLGALLDWVEEETAIFSCSSFCSSSSDDDSVSS----WGSSGRLRRCSTRQVRSSE 360
           LE++ +  L+  ++      S S    SSSD++S +       S GR    + R  R S 
Sbjct: 301 LEDKGMVGLISLID------SLSLSSGSSSDEESPAGKTRMTSSYGRSVSSAARMARKSV 360

Query: 361 AAVCNPRSSLVAVMIQALARRVASMWVIEEDGALVGIITFASMLKIFQEHL 390
           A VCN +SSL+AVMIQA+A RV+ +WVI+EDG L+G++TF  +LK+F+E L
Sbjct: 361 AIVCNRKSSLMAVMIQAIAHRVSYVWVIDEDGCLIGMVTFVDILKLFREFL 404

BLAST of Cp4.1LG10g09030 vs. TAIR10
Match: AT4G27460.1 (AT4G27460.1 Cystathionine beta-synthase (CBS) family protein)

HSP 1 Score: 296.6 bits (758), Expect = 2.3e-80
Identity = 168/398 (42.21%), Positives = 254/398 (63.82%), Query Frame = 1

Query: 1   MAMRLLAHEVSDICLGKPPLRSIPVSAT-LAHALSVLKRLRQNYISVWSCTCHSSKTASD 60
           MA+ LL++ VSD+CLGKPPLR +  S++ ++ A++ LK     ++SVW+C          
Sbjct: 1   MALSLLSYNVSDLCLGKPPLRCLSSSSSSVSDAIAALKSSEDTFLSVWNCNHDDDNNT-- 60

Query: 61  HDCRCIGKVSVVDVILFLCREENLSQPAVALQSYVSILIPEVPVVVRYLEPHASLVEAID 120
            +C C+GK+S+ DVI  L ++ + S    AL S VS+L+P+   +V +++P  SL+EAID
Sbjct: 61  -ECECLGKISMADVICHLSKDHDHS--LCALNSSVSVLLPKTRSIVLHVQPSCSLIEAID 120

Query: 121 LLLEGAQNLVVPIQTKTSAKSRKKVLKEVAPFDCPLHNDLEYCWLTQEDIIRYLLNSIGL 180
           L+++GAQNL+VPI TK   K +K+    V+       N   +CW+TQEDII++LL  I  
Sbjct: 121 LIIKGAQNLIVPIHTKPYTK-KKQHNDNVSVTTTTHSNGQRFCWITQEDIIQFLLGFIAA 180

Query: 181 FCPTSITPINSLNAID-TANILAVHYDDPAFSALPLLSQAIIQQSSVAIVDSDG-----K 240
           F P     ++ L  I+ T  ++AV Y   A + +  +S A+  Q+SVA+VD +G      
Sbjct: 181 FSPLPAMSLSDLGVINSTHTVVAVDYHSSASAVVSAVSNALAVQTSVAVVDGEGDDPFTS 240

Query: 241 LIGEISPLTMNSCDVTVTAAIATLSAGELMAYVDYGNPPEDLVHLIKDRLEERNLGALLD 300
           LIGEISP+T+  CD T  AA+ATLSAG+LMAY+D  NPPE LV ++++RLE++ L  L+ 
Sbjct: 241 LIGEISPMTLTCCDETAAAAVATLSAGDLMAYIDGANPPESLVQIVRNRLEDKGLIGLMS 300

Query: 301 WVEEETAIFSCSSFCSSSSDDDSVSSWGSSGRLRRCSTRQVRSSEAAVCNPRSSLVAVMI 360
             +  ++  + S +   SS++++     S GR    S R  R SEA VCNP+SSL+AVMI
Sbjct: 301 LFDSLSSYSTSSGY---SSEEEAPVRTTSYGRSMSSSARMARKSEAIVCNPKSSLMAVMI 360

Query: 361 QALARRVASMWVIEEDGALVGIITFASMLKIFQEHLKS 392
           QA+A RV   WV+E+DG  VG++TF  +LK+F++ L++
Sbjct: 361 QAVAHRVNYAWVVEKDGCFVGMVTFVDILKVFRKFLEN 389

BLAST of Cp4.1LG10g09030 vs. NCBI nr
Match: gi|659078729|ref|XP_008439875.1| (PREDICTED: CBS domain-containing protein CBSX5-like [Cucumis melo])

HSP 1 Score: 601.7 bits (1550), Expect = 9.4e-169
Identity = 313/393 (79.64%), Positives = 349/393 (88.80%), Query Frame = 1

Query: 1   MAMRLLAHEVSDICLGKPPLRSIPVSATLAHALSVLKRLRQNYISVWSCTCHSSKTASDH 60
           MA+RLL H +SDICLGKP L SI +SATLA ALS LK+L +NYISVW+C+ H SK++S +
Sbjct: 1   MAVRLLDHHLSDICLGKPALTSISLSATLADALSALKKLGENYISVWNCSSHYSKSSSHY 60

Query: 61  DCRCIGKVSVVDVILFLCREENLSQPAVALQSYVSILIPEVPVVVRYLEPHASLVEAIDL 120
           DC+CIGK+SV+DV+LFLC+EENLSQPAVALQS VS+LIP VPV+V +LEPHASLVE IDL
Sbjct: 61  DCQCIGKISVLDVVLFLCKEENLSQPAVALQSSVSVLIPPVPVLVVHLEPHASLVEVIDL 120

Query: 121 LLEGAQNLVVPIQTKTSAKSRKKVLKEVAPFDCPLHNDLEYCWLTQEDIIRYLLNSIGLF 180
           LLEGAQNLVVPIQT+TSAKSR+KVL+ VAPFDCPLHN LEYCW+TQEDIIRYLLNSIGLF
Sbjct: 121 LLEGAQNLVVPIQTRTSAKSREKVLEVVAPFDCPLHNGLEYCWITQEDIIRYLLNSIGLF 180

Query: 181 CPTSITPINSLNAIDTANILAVHYDDPAFSALPLLSQAIIQQSSVAIVDSDGKLIGEISP 240
            PTSITPINSLNAIDTANILAVHYDDPA SALPL+SQAII QSSVAIV+SDGKLIGEISP
Sbjct: 181 SPTSITPINSLNAIDTANILAVHYDDPALSALPLISQAIIHQSSVAIVESDGKLIGEISP 240

Query: 241 LTMNSCDVTVTAAIATLSAGELMAYVDYGNPPEDLVHLIKDRLEERNLGALLDWVEEETA 300
           LT+NS D T+TAAI TLSAGELMAYV   +PPEDLV L+KDRLEERNL  LL+WVEEE+A
Sbjct: 241 LTLNSFDETITAAIVTLSAGELMAYVYCNDPPEDLVQLVKDRLEERNLRGLLEWVEEESA 300

Query: 301 IFSCSSFCSSSSDDDSVSSWGSSGRLRRCSTRQV-RSSEAAVCNPRSSLVAVMIQALARR 360
           + +CSSFCSSSSDDDS S WG SG+LR+CSTRQV RSSE AVCNP+SSLVAVMIQALA R
Sbjct: 301 MSTCSSFCSSSSDDDSGSWWGRSGKLRKCSTRQVRRSSEVAVCNPQSSLVAVMIQALALR 360

Query: 361 VASMWVIEEDGALVGIITFASMLKIFQEHLKSM 393
           V  MWV EEDG LVGI TF SMLK+F E LKSM
Sbjct: 361 VPYMWVTEEDGCLVGITTFTSMLKVFHERLKSM 393

BLAST of Cp4.1LG10g09030 vs. NCBI nr
Match: gi|449434344|ref|XP_004134956.1| (PREDICTED: CBS domain-containing protein CBSX5-like [Cucumis sativus])

HSP 1 Score: 596.7 bits (1537), Expect = 3.0e-167
Identity = 312/396 (78.79%), Positives = 350/396 (88.38%), Query Frame = 1

Query: 1   MAMRLLAHEVSDICLGKPPLRSIPVSATLAHALSVLKRLRQNYISVWSCTCHSSKTASDH 60
           MA+RLL H++SDICLGKP L SI +SATLA ALS LK+L +NYISVW+C+ H SK++S +
Sbjct: 1   MAVRLLDHQLSDICLGKPALTSISLSATLADALSALKKLGENYISVWNCSSHYSKSSSHY 60

Query: 61  DCRCIGKVSVVDVILFLCREENLSQPAVALQSYVSILIPEVPVVVRYLEPHASLVEAIDL 120
           DCRCIGK+SV+DV+LFLC+EENLSQPA+ALQS VS+LIP VPV+VR+LEPHASL+EAIDL
Sbjct: 61  DCRCIGKISVLDVVLFLCKEENLSQPALALQSSVSVLIPPVPVLVRHLEPHASLMEAIDL 120

Query: 121 LLEGAQNLVVPIQTKTSAKSRKKVLKEVAPFDCPLHNDLEYCWLTQEDIIRYLLNSIGLF 180
           LLEGAQNLVVPIQT+TSAKSR+KVL+ VAPFDCPLHN LEYCW+TQEDIIRYLLNSIGLF
Sbjct: 121 LLEGAQNLVVPIQTRTSAKSREKVLEVVAPFDCPLHNGLEYCWITQEDIIRYLLNSIGLF 180

Query: 181 CPTSITPINSLNAIDTANILAVHYDDPAFSALPLLSQAIIQQSSVAIVDSDGKLIGEISP 240
            PTSITP+NSLNAIDT NILA+HYDDPA SALPLLSQAII QSS+AIVDSDGKLIGEISP
Sbjct: 181 SPTSITPVNSLNAIDTVNILALHYDDPALSALPLLSQAIIHQSSIAIVDSDGKLIGEISP 240

Query: 241 LTMNSCDVTVTAAIATLSAGELMAYVDYGNPPEDLVHLIKDRLEERNLGALLDWVEEE-- 300
           LT+NS D T+TAAI TLSAGELMAYV+  +PPE LV L+KDRLE RNL  LL+WVEEE  
Sbjct: 241 LTLNSFDETITAAIVTLSAGELMAYVNCNDPPEYLVQLVKDRLEGRNLRGLLEWVEEESA 300

Query: 301 -TAIFSCSSFCSSSSDDDSVSSWGSSGRLRRCSTRQV-RSSEAAVCNPRSSLVAVMIQAL 360
            +A+ SCSSFCSSSSDDDS S WG SG+LR+CSTRQV RSSE AVCNPRSSLVAVMIQAL
Sbjct: 301 MSAMSSCSSFCSSSSDDDSGSWWGRSGKLRKCSTRQVRRSSEVAVCNPRSSLVAVMIQAL 360

Query: 361 ARRVASMWVIEEDGALVGIITFASMLKIFQEHLKSM 393
           A RV  MWV EED  LVGIITF SMLK+F E LKSM
Sbjct: 361 ALRVPYMWVTEEDECLVGIITFTSMLKVFHERLKSM 396

BLAST of Cp4.1LG10g09030 vs. NCBI nr
Match: gi|703114061|ref|XP_010100544.1| (CBS domain-containing protein CBSX5 [Morus notabilis])

HSP 1 Score: 469.9 bits (1208), Expect = 4.3e-129
Identity = 250/396 (63.13%), Positives = 309/396 (78.03%), Query Frame = 1

Query: 1   MAMRLLAHEVSDICLGKPPLRSIPVSATLAHALSVLKRLRQNYISVWSCTCHSSKTA--- 60
           MA+ LLA EVSD+CLGKP LR++ V+AT+  ALS LKRL Q Y+SVWSC  HSSK     
Sbjct: 1   MAVNLLAREVSDLCLGKPALRALLVTATVGEALSALKRLGQTYLSVWSCD-HSSKIGTGG 60

Query: 61  SDHDCRCIGKVSVVDVILFLCREENLSQPAVALQSYVSILIPEVPVVVRYLEPHASLVEA 120
           S  +CRC+GKV V D I FLC+EENL  PA ALQ+ V +LIP+VP +VR+LEP+ASL+EA
Sbjct: 61  SAGNCRCVGKVCVADAICFLCKEENLKSPATALQASVLVLIPKVPGLVRHLEPNASLLEA 120

Query: 121 IDLLLEGAQNLVVPIQTKTSAKSRKKVLKEVAPFDCPLHNDLEYCWLTQEDIIRYLLNSI 180
           +DL+LEGAQNLV+PIQT++S+ + +K L +   F+   H++ EYCW+TQEDIIRYLLN I
Sbjct: 121 VDLILEGAQNLVIPIQTRSSSSNSRKNLLQKPSFNSARHDNREYCWITQEDIIRYLLNCI 180

Query: 181 GLFCPTSITPINSLNAIDTANILAVHYDDPAFSALPLLSQAIIQQSSVAIVDSDGKLIGE 240
           GLF P   +PIN+LN ID+ +ILAV+YDDPA S LPL+SQA++ Q+SVA+VD+D KLIGE
Sbjct: 181 GLFSPIPASPINALNLIDSKHILAVNYDDPAASTLPLISQALVCQTSVAVVDTDSKLIGE 240

Query: 241 ISPLTMNSCDVTVTAAIATLSAGELMAYVDYGNPPEDLVHLIKDRLEERNLGALLDWVEE 300
           ISP T+NSCD TV AAIATLSAG+LMAY+D G PPEDLV L+KDRLEERN GALLD +EE
Sbjct: 241 ISPFTLNSCDETVAAAIATLSAGDLMAYIDCGGPPEDLVQLVKDRLEERNCGALLDLMEE 300

Query: 301 E-TAIFSCSSFCSSSSDDDSVSSWGSSGRLRRCSTRQVRSSEAAVCNPRSSLVAVMIQAL 360
           + + I S SS CSSSSD++       SGR    S R VR SEA VC+P SSLVAVM+QAL
Sbjct: 301 DYSTISSSSSLCSSSSDEE-------SGRFGGNSARMVRRSEAIVCHPWSSLVAVMVQAL 360

Query: 361 ARRVASMWVIEEDGALVGIITFASMLKIFQEHLKSM 393
           A R++ MWV+E DG L+GI+TFA MLK+F+E L  M
Sbjct: 361 AHRLSYMWVVEADGTLMGIVTFAGMLKVFRERLNLM 388

BLAST of Cp4.1LG10g09030 vs. NCBI nr
Match: gi|590624671|ref|XP_007025668.1| (CBS domain-containing protein [Theobroma cacao])

HSP 1 Score: 461.1 bits (1185), Expect = 2.0e-126
Identity = 249/398 (62.56%), Positives = 308/398 (77.39%), Query Frame = 1

Query: 1   MAMRLLAHEVSDICLGKPPLRSIPVSATLAHALSVLKRLRQNYISVWSCTCH----SSKT 60
           MA+ LL  EVSD+CLGKP LRS+ +SAT+ HALSVLKR   NYISVW+C       + KT
Sbjct: 1   MAVSLLEREVSDLCLGKPALRSLSISATVGHALSVLKRFGDNYISVWNCDHRHLPDADKT 60

Query: 61  -ASDHDCRCIGKVSVVDVILFLCREENLSQPAVALQSYVSILIPEVPVVVRYLEPHASLV 120
            A   +CRC+GKV +VD+I FLC+EENLS P  ALQ+ VS+LIP+VP ++R+LEP+ASLV
Sbjct: 61  DAGFEECRCVGKVCMVDIICFLCKEENLSNPGTALQAPVSVLIPKVPGLIRHLEPNASLV 120

Query: 121 EAIDLLLEGAQNLVVPIQTKTSAKSRKKVLKEVAPFDCPLHNDLEYCWLTQEDIIRYLLN 180
           EA+DL+LEGAQNLV+P+++ T+  SRKK+L ++   +  LHN+ EYCWLTQEDIIRYLLN
Sbjct: 121 EAMDLILEGAQNLVIPLESGTT-NSRKKLL-QITLSNSTLHNNREYCWLTQEDIIRYLLN 180

Query: 181 SIGLFCPTSITPINSLNAIDTANILAVHYDDPAFSALPLLSQAIIQQSSVAIVDSDGKLI 240
           SIGLF PT + PINSLN IDT NILAVHYDDPA  ALP ++Q++  Q+SVAIVD+DGKLI
Sbjct: 181 SIGLFSPTPVNPINSLNIIDTQNILAVHYDDPASLALPFIAQSLEMQTSVAIVDTDGKLI 240

Query: 241 GEISPLTMNSCDVTVTAAIATLSAGELMAYVDYGNPPEDLVHLIKDRLEERNLGALLDWV 300
           GEISP T+NSC   V AAIATLSAG+LMAY+D G  PEDL+ L+K+RL+ERNL   L+ +
Sbjct: 241 GEISPFTLNSCGEDVAAAIATLSAGDLMAYIDCGGRPEDLIQLVKERLQERNLEQALELM 300

Query: 301 EEETAIFSCSSFCSS-SSDDDSVSSWGSSGRLRRCSTRQVRSSEAAVCNPRSSLVAVMIQ 360
           EE++ I S +SF SS SS  D     G  GRL   S R VR SEA VC P SSLVAVMIQ
Sbjct: 301 EEDSGISSGASFSSSYSSSSDEEFGVGRGGRLGGYSARLVRRSEAIVCYPWSSLVAVMIQ 360

Query: 361 ALARRVASMWVIEEDGALVGIITFASMLKIFQEHLKSM 393
           ALA RV+ +WV+E+DG L GI+TFA M+K+F+E L+SM
Sbjct: 361 ALAHRVSYVWVVEDDGTLAGIVTFAGMMKVFRERLRSM 396

BLAST of Cp4.1LG10g09030 vs. NCBI nr
Match: gi|1009109249|ref|XP_015889089.1| (PREDICTED: CBS domain-containing protein CBSX5-like [Ziziphus jujuba])

HSP 1 Score: 459.9 bits (1182), Expect = 4.4e-126
Identity = 250/395 (63.29%), Positives = 301/395 (76.20%), Query Frame = 1

Query: 1   MAMRLLAHEVSDICLGKPPLRSIPVSATLAHALSVLKRLRQNYISVWSCTCHSSKTASDH 60
           MA+ L AHEVSD+CLGKP LRS+  +AT+  ALS LK+L  + ISVWSC    S    D 
Sbjct: 1   MAVSLFAHEVSDLCLGKPALRSLSFTATVGEALSALKKLGDSCISVWSCDHSKSSCIGDE 60

Query: 61  DC-RCIGKVSVVDVILFLCREENLSQPAVALQSYVSILIPEVPVVVRYLEPHASLVEAID 120
            C RC+GKV +VDVI FLC+E+NLS P  ALQ+ VS LIP+   +VR+LEP+ASLVEAID
Sbjct: 61  SCCRCVGKVCMVDVICFLCKEDNLSSPLTALQAPVSDLIPKDTTLVRHLEPNASLVEAID 120

Query: 121 LLLEGAQNLVVPIQTKTSAKSRKKVLKEVAPFDCPLHNDLEYCWLTQEDIIRYLLNSIGL 180
           L+LEGAQNLV+PIQTK S  +RKK L      D  LH + EYCWLTQEDIIRYLLN IGL
Sbjct: 121 LILEGAQNLVIPIQTKNS--TRKKPLINNPLSDSTLHKNREYCWLTQEDIIRYLLNCIGL 180

Query: 181 FCPTSITPINSLNAIDTANILAVHYDDPAFSALPLLSQAIIQQSSVAIVDSDGKLIGEIS 240
           FCPT+I PIN+L  ID  +I  VHYDDPA SAL L+S +++ Q+SVAIVD +GKLIGEIS
Sbjct: 181 FCPTAINPINALKVIDFDDIPVVHYDDPASSALTLISNSLVHQTSVAIVDVEGKLIGEIS 240

Query: 241 PLTMNSCDVTVTAAIATLSAGELMAYVDYGNPPEDLVHLIKDRLEERNLGALLDWVEEET 300
           P T+NSC+ T  AAIATLSAG+LMAY+D G PPEDLV L+K+RLEER+ GA ++ +EEE+
Sbjct: 241 PFTLNSCEETTAAAIATLSAGDLMAYIDGGGPPEDLVQLVKERLEERSYGAFVELMEEES 300

Query: 301 AIFSCSSFCSSSSDDDSVSSWG--SSGRLRRCSTRQVRSSEAAVCNPRSSLVAVMIQALA 360
            I S SSFCS SSDD+    +G   SG+L   S R VR SEA VC P SS+VAVMIQALA
Sbjct: 301 TISSSSSFCSCSSDDE----FGPVRSGKLGSYSARLVRRSEAIVCYPWSSMVAVMIQALA 360

Query: 361 RRVASMWVIEEDGALVGIITFASMLKIFQEHLKSM 393
            RV+ +WV+EEDG L GI+TFA MLK+F+EH+KSM
Sbjct: 361 HRVSYVWVVEEDGTLSGIVTFAGMLKVFREHVKSM 389

The following BLAST results are available for this feature:
Swiss-Prot
Match Name   E-value  Identity (%)  Description
CBSX5_ARATH  4.0e-79  42.21         CBS domain-containing protein CBSX5 OS=Arabidopsis thaliana GN=CBSX5 PE=2 SV=2

TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0KHL3_CUCSA  2.1e-167  78.79         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G517230 PE=4 SV=1
W9RD98_9ROSA      3.0e-129  63.13         CBS domain-containing protein CBSX5 OS=Morus notabilis GN=L484_012099 PE=4 SV=1
A0A061GEK1_THECC  1.4e-126  62.56         CBS domain-containing protein OS=Theobroma cacao GN=TCM_029902 PE=4 SV=1
A0A0A0LV40_CUCSA  6.4e-124  63.32         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G073740 PE=4 SV=1
A0A067JB04_JATCU  1.9e-120  60.05         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21418 PE=4 SV=1

TAIR10
Match Name   E-value  Identity (%)  Description
AT5G53750.1  3.5e-81  43.31         CBS domain-containing protein
AT4G27460.1  2.3e-80  42.21         Cystathionine beta-synthase (CBS) family protein

NCBI nr
Match Name                         E-value   Identity (%)  Description
gi|659078729|ref|XP_008439875.1|   9.4e-169  79.64         PREDICTED: CBS domain-containing protein CBSX5-like [Cucumis melo]
gi|449434344|ref|XP_004134956.1|   3.0e-167  78.79         PREDICTED: CBS domain-containing protein CBSX5-like [Cucumis sativus]
gi|703114061|ref|XP_010100544.1|   4.3e-129  63.13         CBS domain-containing protein CBSX5 [Morus notabilis]
gi|590624671|ref|XP_007025668.1|   2.0e-126  62.56         CBS domain-containing protein [Theobroma cacao]
gi|1009109249|ref|XP_015889089.1|  4.4e-126  63.29         PREDICTED: CBS domain-containing protein CBSX5-like [Ziziphus jujuba]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000644  CBS_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG10g09030.1  Cp4.1LG10g09030.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description   Source   Source Term        Source Description                                      Alignment
IPR000644  CBS domain        PFAM     PF00571            CBS                                                     coord: 335..383, score: 4.
IPR000644  CBS domain        PROFILE  PS51371            CBS                                                     coord: 16..83, score: 6.332; coord: 194..256, score: 7.03; coord: 335..391, score:
None       No IPR available  GENE3D   G3DSA:3.10.580.10                                                          coord: 194..252, score: 5.9E-9; coord: 326..387, score: 5.
None       No IPR available  PANTHER  PTHR13780          AMP-ACTIVATED PROTEIN KINASE, GAMMA REGULATORY SUBUNIT  coord: 154..284, score: 5.4E-142; coord: 339..392, score: 5.4E-142; coord: 1..131, score: 5.4E
None       No IPR available  PANTHER  PTHR13780:SF39     CBS DOMAIN-CONTAINING PROTEIN-RELATED                   coord: 1..131, score: 5.4E-142; coord: 339..392, score: 5.4E-142; coord: 154..284, score: 5.4E
None       No IPR available  unknown  SSF54631           CBS-domain pair                                         coord: 196..259, score: 9.33E-10; coord: 314..389, score: 9.33E-10; coord: 3..175, score: 2.

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG10g09030  Cp4.1LG09g08490  Cucurbita pepo (Zucchini)  cpecpeB019
Cp4.1LG10g09030  Cp4.1LG19g07270  Cucurbita pepo (Zucchini)  cpecpeB076
Cp4.1LG10g09030  Cp4.1LG20g04190  Cucurbita pepo (Zucchini)  cpecpeB086
The following block(s) are covering this gene:
Gene             Organism                    Block
Cp4.1LG10g09030  Cucumber (Gy14) v2          cgybcpeB710
Cp4.1LG10g09030  Cucumber (Chinese Long) v3  cpecucB0092