Lsi01G019840 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi01G019840
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: CBS domain-containing protein
Location: chr01 : 26389127 .. 26392542 (-)
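The span of the gene follows directly from the Location field. A minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF3; the page itself does not state the convention). The helper name `feature_length` is hypothetical and not part of the database.

```python
# Hypothetical helper: length of an interval whose start and end are both included.
# Assumption: coordinates are 1-based and inclusive, GFF3 style.
def feature_length(start: int, end: int) -> int:
    return end - start + 1

# Values copied from the Location field of this record.
chromosome, gene_start, gene_end, strand = "chr01", 26389127, 26392542, "-"

gene_length = feature_length(gene_start, gene_end)
print(f"{chromosome}:{gene_start}..{gene_end} ({strand}) spans {gene_length} bp")
```

Because the gene lies on the minus strand, the sequences listed for this feature would be the reverse complement of the reference at these coordinates.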
The following sequences are available for this feature:

Gene sequence (with intron)

GGGATATTAGAATACAAAGAGAAGACTATTTTATTTTGAACATATTCATGAGGAGCTTTCTTTGGAAGGGGAGGAGGGGGTTGGGTTAAGGTAATGGGTCTCATCTCTTGAATTGGAAGTGGTTTAGAATATTCTGTTGGCTTTTCTAATTTTTGAAATTAATTAATCTGATTTTGTTAAACTCCGAAGTTCTGAAGCATTGCTGATGCTTTGCAAATTTGAACTGTTTCATGTAACCTTTGTTATCTTTAGTAATTATGATGCTCTGCGGTTATCTGATTTTTTTTTTTTTTTTTTTAAACCATTGTGAGAATTTTGCAGTGAAATGTCTGCATTCATGGCTTCTGAAGGGGCTCTATACTATCCATCTCTAGGTCTTGGAAACTCATTTGCTTTTAAATTTGAGGATCTGAAGGGTCGGGTACATCGCGTAAATTGTGGTGAGTGTTTAGGGTTTTTTTAGAACTATGATTTTGAAGAACTTTATTACAATGTGTATGCATATATGTTTGTGTGTGTGTGTGATGTATTGTTAACAAGTAAACCAAGTGTGCTGATGGTTTAAGGCGTTTTCTTTGAGGAACCATTTCATTGAGAAAATGAAATATATAAAGAGAGGAAAGCCTCGAAACCAAGTAGTGTAAACTTCCTAGTTGGAAATAGTGAAAAAGAGAAATATTTTTTTTTCAAGAAAACCCGCTTGACCCTACAACATTTGGATATCAAGGAAACTCATAGGATATTAAATTCCAAGTAGGTGGCCTCCATGGATCCTATTCCCTCTAAGTCCTATACTAAATCCATGACCTTTTTTTAACCACTAGGTCAATCCATGATATGATGGTTTGCAAATATAATAACTTCTGCAGTAAACACCAAATTGAGGCAATAAAAACTAAATTTTTTGCTTCACTGAGAATAATTATTTTGTTATGTTCCAACCATACTTCCACGTCCGACACTAGAGCCTGACCTTGTGTCAAAGAATGCTAATGCTGCTTTTGACTGAAATTGATTCAACAAAGATTCCTGTATAAGTCCTGAAAATCTTCAGGAAAAGGGAGCTAGAGACCACAAACCTCTATTAACTACCACCAAATAGTATTAACTACCACCAATTAGTAGTAGCTTTCCTCAAAGTATATTTCTCTGGTTGTGTTTGTATTTCATCTTGAAGATTGTTGTCATACTCTTTTATTAGGAGTGTTTTGAACACAGTTCGTAATTATCCCTTTTTAGATTCCTTGAACACCCTCCTGTTCTCCATTCGGTAAGAGATAACAGGGTAATTAAACTCTTCAATCTGTCCAGAGAACCATGTTGAGAGGAGCTGTTTCTATTTTTCTCCTTCTGTTGCATTTAGGCCTATTATCCCTGCCGTCAAAAGCTAAACCAAAGCCTAGAGGATCTCGTTTTAATCACACTTTTCCTAGTCGGGTAACGCCAACCTCCCATTAGAGAATGGTCAACCGACGTTATGAGGACCTTCAACGAAGGGTGATCACTACATTTCTTGTTTCCATTGGCAAGAGTGTGGAGGCATTAACCCTGGCTGATTTGGCTTATGCCGTTAAACAAGAAGGGGTAACAAATATACTGGTAAGAAAATATATGGTGATCACTACAGATGGTGGATTTGCGAAGATCGTTTTTAATCTTCTCCTAAAGAAGCAGAAATTAGAGGGTAGATTCTATCCAGTGGAACTATTTATTAATAATTGTTTTCTTAGGCACTGAGACCCTAGATGAGTTGGTATCTGTTGTGATGCAAAGGGTTGGTGCTACTAATGGTGCTAGTCGTCCTCTGCTTTTGGTATGTTTCTTGCTACTTTTTCTGACTGCATTTTTTTCGAGTTTATTGTGTAAGTCAAGGGTTTTGTTTCTTTTACAAGTATGAAGATAATGAAGGTGATAAAGTAGTTCTTGCTACTGATGGCGATCTCTCCGGTGCTGTAAACCATGCCAAGTCCACAGGAAAAAAGGTAGTTAATTTAGTAAAA
GTATTCAACTTTCATGTTCCTTTTTTAGTATTATTCTTTTGTGACTTCATCTTTTTCTTGTTATTCTTAAACTTTATTGGTTCTCCAAATGTTGGTTGCTGTTTTATTTGTGCAACGTTACCGGTATGTCAAATTGTTCATAATATCAAAGGACATTTTTTAATACGAGACACAATTGCACCAATAAATCAAATGTACTAGAAGAGGAAAGATACCCTTAACCAAAGTCAAGGAGTTAAATAGAAGAGTATTTAGTTTTAGATAATGTACACCAAGAGGAGGAAACGAAGATTAACCTTCTAATATATTTTGAATTTCTTAAAAATCATCAAACATATCTATTTTTCCTCAGTCCAAAATTTCCAAAAGAATAGGCAAGGATTGCTTAAGAATCTCCCAGTAGCATGTATAGTTTAATGACAGCTATAATTTCCAGTCCTGTACTTCTCTGTTGCTATGTTGGTTTAAGTTTTGCATATCATTATTATTTACTTAGTTTGCTGACTTTATGTATTTTTGCATATTTTGGCAAATTTTAGTAAATTTCAGAACCCCTTTATGTTCATGCTGAAATCTAAGGAATTTTTTATATCTAGCAGACTGCTGAATTTAATATTTGATACTGTAATCCTTTTTCTTTAAAAGATTAAATTACCTTGTCATGAATTCTATTCTCAAGTCCATGGATTTCCACAGGCTCTGTATTGCTCTTCTGGATCGTAATTTAGTTGATGCCATATCTTGTGTACCAATCTAAAATTGTTATCTGGGCAACATCTTCCTTTTCAAGTATAATATTTTAAATTTGTATACAAATTGAATGTGAGCAAACTGCGCTCTCCAATTTCTTATTTTATTCTTGGGGAAACACTTGCAGGTTTTAAGGTTGCATTTGGATTTTCCCAAGTCGATCCAGCAAAGAGAGATTCAAACTGATGCAACATTGGACCAGAAACGTGGATCGTTGCATTTATATTCGGGTGCATTTGCTGCTGCCATTGTTATAACAAGCATTGGTGTATTGATCTATTTGAAGCGCTCTAAGATGTAAAGTTAAAAGACAAAATTATTGAAACAGCAAACGCATGCTCCTCATAAATTTCGGCAGTTGAGGACTAAATGGTAATTGAGCTCTAATTTTTGTGTAATTTTAAAGTCAACTTAGACATTCTTGTAAATAATGGAAAAGATGGACATAATTGAAATATAGGCCATTTATTAAGGCCTTTATGTATTTTATCCCAATACATTACAATACCCAAGTTCTCAAAGAAACTGAGATATGTTAAAAGGGAAAGAGTTGCAACACCAGAGGGAAATATGGTTAGTCTGATGTCTGGATCAGATAAAGCCTTCATTTTTTGTGAACTTGAGAATTGGTTAAGGTTCTTTCTGTCAGAAAAAACATCATTGAGG

mRNA sequence

GGGATATTAGAATACAAAGAGAAGACTATTTTATTTTGAACATATTCATGAGGAGCTTTCTTTGGAAGGGGAGGAGGGGGTTGGGTTAAGTGAAATGTCTGCATTCATGGCTTCTGAAGGGGCTCTATACTATCCATCTCTAGGTCTTGGAAACTCATTTGCTTTTAAATTTGAGGATCTGAAGGGTCGGGTACATCGCGTAAATTGTGGCACTGAGACCCTAGATGAGTTGGTATCTGTTGTGATGCAAAGGGTTGGTGCTACTAATGGTGCTAGTCGTCCTCTGCTTTTGTATGAAGATAATGAAGGTGATAAAGTAGTTCTTGCTACTGATGGCGATCTCTCCGGTGCTGTAAACCATGCCAAGTCCACAGGAAAAAAGGTTTTAAGGTTGCATTTGGATTTTCCCAAGTCGATCCAGCAAAGAGAGATTCAAACTGATGCAACATTGGACCAGAAACGTGGATCGTTGCATTTATATTCGGGTGCATTTGCTGCTGCCATTGTTATAACAAGCATTGGTGTATTGATCTATTTGAAGCGCTCTAAGATGTAAAGTTAAAAGACAAAATTATTGAAACAGCAAACGCATGCTCCTCATAAATTTCGGCAGTTGAGGACTAAATGGTAATTGAGCTCTAATTTTTGTGTAATTTTAAAGTCAACTTAGACATTCTTGTAAATAATGGAAAAGATGGACATAATTGAAATATAGGCCATTTATTAAGGCCTTTATGTATTTTATCCCAATACATTACAATACCCAAGTTCTCAAAGAAACTGAGATATGTTAAAAGGGAAAGAGTTGCAACACCAGAGGGAAATATGGTTAGTCTGATGTCTGGATCAGATAAAGCCTTCATTTTTTGTGAACTTGAGAATTGGTTAAGGTTCTTTCTGTCAGAAAAAACATCATTGAGG

Coding sequence (CDS)

ATGTCTGCATTCATGGCTTCTGAAGGGGCTCTATACTATCCATCTCTAGGTCTTGGAAACTCATTTGCTTTTAAATTTGAGGATCTGAAGGGTCGGGTACATCGCGTAAATTGTGGCACTGAGACCCTAGATGAGTTGGTATCTGTTGTGATGCAAAGGGTTGGTGCTACTAATGGTGCTAGTCGTCCTCTGCTTTTGTATGAAGATAATGAAGGTGATAAAGTAGTTCTTGCTACTGATGGCGATCTCTCCGGTGCTGTAAACCATGCCAAGTCCACAGGAAAAAAGGTTTTAAGGTTGCATTTGGATTTTCCCAAGTCGATCCAGCAAAGAGAGATTCAAACTGATGCAACATTGGACCAGAAACGTGGATCGTTGCATTTATATTCGGGTGCATTTGCTGCTGCCATTGTTATAACAAGCATTGGTGTATTGATCTATTTGAAGCGCTCTAAGATGTAA

Protein sequence

MSAFMASEGALYYPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGATNGASRPLLLYEDNEGDKVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQREIQTDATLDQKRGSLHLYSGAFAAAIVITSIGVLIYLKRSKM
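The protein sequence should be recoverable from the CDS by translation. A minimal sketch, assuming the standard genetic code (NCBI translation table 1) and checking only the first nine and the last six codons of the CDS above. The function and variable names are illustrative.

```python
from itertools import product

# Standard genetic code (assumed), laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 27 nt and last 18 nt of the CDS listed above.
cds_start = "ATGTCTGCATTCATGGCTTCTGAAGGG"
cds_end = "AAGCGCTCTAAGATGTAA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Passing the full CDS string to `translate` gives the same kind of check for the whole protein.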
BLAST of Lsi01G019840 vs. Swiss-Prot
Match: Y3295_ARATH (CBS domain-containing protein CBSCBSPB3 OS=Arabidopsis thaliana GN=CBSCBSPB3 PE=1 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 1.4e-35
Identity = 80/150 (53.33%), Positives = 104/150 (69.33%), Query Frame = 1

Query: 13  YPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGATNG--ASRPLLLYEDN 72
           YPSLGLGNSF+FKFEDLKGRVHR   G E L+EL+ +VMQR+G+ N     RP ++YED+
Sbjct: 408 YPSLGLGNSFSFKFEDLKGRVHRFTSGAENLEELMGIVMQRIGSDNNNVEQRPQIIYEDD 467

Query: 73  EGDKVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQREIQTDATLDQKRGSLHLYS 132
           EGDKV++ +D DL GAV  A+STG+KVLRLHLDF +S   R + ++ T  +K  S    S
Sbjct: 468 EGDKVLITSDSDLVGAVTLARSTGQKVLRLHLDFTES--TRSLSSETTQLKKGDSRDRGS 527

Query: 133 G--------AFAAAIVITSIGVLIYLKRSK 153
           G            A+V+TSI +++YLKRSK
Sbjct: 528 GWVSWRGGVVVTGAVVLTSIAIVVYLKRSK 555
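The percentages on each HSP statistics line are the match counts divided by the alignment length (here 150 columns, gaps included). A minimal sketch that recomputes them from such a line; the regular expression and the function name `hsp_percentages` are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# Matches "<label> = <count>/<alignment length>" pairs on an HSP statistics line.
HSP_STATS = re.compile(r"(\w+) = (\d+)/(\d+)")

def hsp_percentages(line: str) -> dict:
    """Return each count as a percentage of the alignment length, rounded to 2 decimals."""
    return {
        label: round(100 * int(count) / int(total), 2)
        for label, count, total in HSP_STATS.findall(line)
    }

# Statistics line of the first Swiss-Prot hit (Y3295_ARATH).
line = "Identity = 80/150 (53.33%), Positives = 104/150 (69.33%), Query Frame = 1"
print(hsp_percentages(line))
```

The "Query Frame = 1" field has no count/length pair, so the pattern skips it.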

BLAST of Lsi01G019840 vs. Swiss-Prot
Match: Y2650_ARATH (CBS domain-containing protein CBSCBSPB2 OS=Arabidopsis thaliana GN=CBSCBSPB2 PE=2 SV=1)

HSP 1 Score: 134.0 bits (336), Expect = 1.4e-30
Identity = 78/153 (50.98%), Positives = 102/153 (66.67%), Query Frame = 1

Query: 1   MSAFMA-SEGALYYPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGATNG 60
           MSA +  SEG    PS GL +SFAFKFED KGRV R N   E+ +EL+SVVMQR  A +G
Sbjct: 387 MSAMLINSEGKQSCPSQGLVSSFAFKFEDRKGRVQRFNSTGESFEELMSVVMQRCEADSG 446

Query: 61  ASRPLLLYEDNEGDKVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQREIQTDATL 120
                ++Y+D+EGDKV+++ D DL  AV  A+S G+KVLRLHLDF ++I   E   D + 
Sbjct: 447 LQ---IMYQDDEGDKVLISRDSDLVAAVTFARSLGQKVLRLHLDFTETIAPLETIADLS- 506

Query: 121 DQKRGSLHLYSGAFAAAIVITSIGVLIYLKRSK 153
           +   G +   +G  A AIV+TSIG+ +YLKRSK
Sbjct: 507 EGNGGCVWWQTGVLAGAIVLTSIGLFVYLKRSK 535

BLAST of Lsi01G019840 vs. Swiss-Prot
Match: Y5053_ARATH (CBS domain-containing protein CBSCBSPB4 OS=Arabidopsis thaliana GN=CBSCBSPB4 PE=1 SV=2)

HSP 1 Score: 109.0 bits (271), Expect = 4.7e-23
Identity = 59/140 (42.14%), Positives = 81/140 (57.86%), Query Frame = 1

Query: 15  SLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGAT-NGASRPLLLYEDNEGD 74
           S    N+FAFK +D KGR+HR  C T++L  L++ ++QR+G      + P ++YED + D
Sbjct: 407 SFSYPNTFAFKLQDKKGRMHRFMCETQSLTTLITAILQRMGDDIEPDNLPQIMYEDEDND 466

Query: 75  KVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQRE--IQTDATLDQKRGSLHLYSG 134
           KVVLA+D DL  AV HAKS G K L+LHLD+ +    R      D   DQ       Y  
Sbjct: 467 KVVLASDNDLGAAVEHAKSIGWKGLKLHLDYTEERGHRRGLSSEDMDYDQSNSWAAAYKT 526

Query: 135 AFAAAIVITSIGVLIYLKRS 152
             A A +   +GVL+YLKR+
Sbjct: 527 VAAGAALAAGLGVLVYLKRN 546

BLAST of Lsi01G019840 vs. Swiss-Prot
Match: Y5064_ARATH (CBS domain-containing protein CBSCBSPB5 OS=Arabidopsis thaliana GN=CBSCBSPB5 PE=1 SV=1)

HSP 1 Score: 109.0 bits (271), Expect = 4.7e-23
Identity = 59/140 (42.14%), Positives = 81/140 (57.86%), Query Frame = 1

Query: 15  SLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGAT-NGASRPLLLYEDNEGD 74
           S    N+FAFK +D KGR+HR  C T++L  L++ ++QR+G      + P ++YED + D
Sbjct: 407 SFSYPNTFAFKLQDKKGRMHRFMCETQSLTTLITAILQRMGDDIEPDNLPQIMYEDEDND 466

Query: 75  KVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQRE--IQTDATLDQKRGSLHLYSG 134
           KVVLA+D DL  AV HAKS G K L+LHLD+ +    R      D   DQ       Y  
Sbjct: 467 KVVLASDNDLGAAVEHAKSIGWKGLKLHLDYTEERGHRRGLSSEDMDYDQSNSWAAAYKT 526

Query: 135 AFAAAIVITSIGVLIYLKRS 152
             A A +   +GVL+YLKR+
Sbjct: 527 VAAGAALAAGLGVLVYLKRN 546

BLAST of Lsi01G019840 vs. Swiss-Prot
Match: Y5349_ARATH (CBS domain-containing protein CBSCBSPB1 OS=Arabidopsis thaliana GN=CBSCBSPB1 PE=1 SV=1)

HSP 1 Score: 90.5 bits (223), Expect = 1.7e-17
Identity = 55/150 (36.67%), Positives = 79/150 (52.67%), Query Frame = 1

Query: 15  SLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGATNGASR-PLLLYEDNEGD 74
           S    N+F+FK ED K R HR    T +L E+++ ++QRVG        P +LYED + D
Sbjct: 398 SFPFANTFSFKIEDKKHRKHRFISDTRSLTEVITAIIQRVGDDIDPDNFPQILYEDEDHD 457

Query: 75  KVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQRE-------------IQTDATLD 134
           KV+LA+D DL  A+ HAKS G K LRLHLD  +  + R              ++TDA   
Sbjct: 458 KVLLASDSDLQAAIEHAKSIGWKSLRLHLDDSREGKGRRRRRASGSAESMEYVETDAW-- 517

Query: 135 QKRGSLHLYSGAFAAAIVITSIGVLIYLKR 151
                   YSG  A A ++  +G + +L++
Sbjct: 518 -----AAAYSGVAAGAALVAGLGFMAFLRK 540

BLAST of Lsi01G019840 vs. TrEMBL
Match: A0A0A0KYM2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G495220 PE=4 SV=1)

HSP 1 Score: 266.5 bits (680), Expect = 1.9e-68
Identity = 134/153 (87.58%), Positives = 142/153 (92.81%), Query Frame = 1

Query: 1   MSAFMASEGALYYPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGATNGA 60
           MSAFMASEG L YPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQR+GAT+ A
Sbjct: 387 MSAFMASEGTLNYPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRIGATDSA 446

Query: 61  SRPLLLYEDNEGDKVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQREIQTDATLD 120
           +RPLLLYED+EGDKVVLATDGDLSGAVNHA+S G KVLRLHLDFP+SIQQ E Q DA LD
Sbjct: 447 NRPLLLYEDDEGDKVVLATDGDLSGAVNHARSIGLKVLRLHLDFPESIQQTEAQNDAMLD 506

Query: 121 QKRGSLHLYSGAFAAAIVITSIGVLIYLKRSKM 154
           QKRGSLHLYSGAFAAAI +TSIGVL YLKRSK+
Sbjct: 507 QKRGSLHLYSGAFAAAIALTSIGVLFYLKRSKV 539

BLAST of Lsi01G019840 vs. TrEMBL
Match: W9RJ31_9ROSA (CBS domain-containing protein OS=Morus notabilis GN=L484_010945 PE=4 SV=1)

HSP 1 Score: 186.4 bits (472), Expect = 2.6e-44
Identity = 94/155 (60.65%), Positives = 116/155 (74.84%), Query Frame = 1

Query: 1   MSAFMASEGALY--YPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGATN 60
           MSAFM S+G     YPSLGLGN+F+FKFED KGRVHR+NCGTE+LDEL+S VMQR+GA +
Sbjct: 383 MSAFMTSDGTEIGKYPSLGLGNTFSFKFEDFKGRVHRLNCGTESLDELLSTVMQRIGAES 442

Query: 61  GASRPLLLYEDNEGDKVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQREIQTDAT 120
           G+  P +LYED+EGDKV+LATD DL  AV HA+S G KVLRLHLDF  S QQR +++   
Sbjct: 443 GSDHPQILYEDDEGDKVLLATDSDLVSAVTHARSIGLKVLRLHLDFSDSNQQRTLESSTA 502

Query: 121 LDQKRGSLHLYSGAFAAAIVITSIGVLIYLKRSKM 154
             Q       ++G  A A V+TSIGVL+YLKR+ +
Sbjct: 503 TTQGTRWTSSHTGLLAGAAVLTSIGVLLYLKRTNL 537

BLAST of Lsi01G019840 vs. TrEMBL
Match: F6H4G2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0031g02750 PE=4 SV=1)

HSP 1 Score: 185.7 bits (470), Expect = 4.4e-44
Identity = 93/158 (58.86%), Positives = 123/158 (77.85%), Query Frame = 1

Query: 1   MSAFMASEGAL----YYPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGA 60
           +SA MA++GA      YPSLGLGNSFAFKFED+KGRVHR NCGTE+LDELVS VMQR+GA
Sbjct: 382 LSAVMAADGAEPGRNMYPSLGLGNSFAFKFEDIKGRVHRFNCGTESLDELVSAVMQRIGA 441

Query: 61  TNGASRPLLLYEDNEGDKVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQ-REIQT 120
           +    RP +LYED+EGDKV+L+TD DL  AV+HA+  G+KVLRL LD+ +SIQ+ R  QT
Sbjct: 442 STDQDRPQILYEDDEGDKVLLSTDSDLVSAVSHARVVGQKVLRLQLDYSESIQETRRPQT 501

Query: 121 DATLDQKRGSLHLYSGAFAAAIVITSIGVLIYLKRSKM 154
                +  G + L+SG  A+A++IT++G+++YLKR+K+
Sbjct: 502 GTDTVRGTGGVFLHSGILASAVIITAVGLMVYLKRAKL 539

BLAST of Lsi01G019840 vs. TrEMBL
Match: B9HF69_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s04880g PE=4 SV=2)

HSP 1 Score: 184.1 bits (466), Expect = 1.3e-43
Identity = 96/157 (61.15%), Positives = 116/157 (73.89%), Query Frame = 1

Query: 1   MSAFMASEGALY--YPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGATN 60
           MSA MAS+GA    YPSLGLGNSFAFKFEDLKGR+HR+NC TE LDEL+S V+QR+GA +
Sbjct: 395 MSALMASDGAELGRYPSLGLGNSFAFKFEDLKGRIHRLNCCTENLDELLSTVLQRIGAES 454

Query: 61  GASRPLLLYEDNEGDKVVLATDGDLSGAVNHAKSTGKKVLRLHLDF--PKSIQQREIQTD 120
              RP LLYED++GDKV+LATDGDL GAV+HA+S G KVLRLHLD+  P +     + T 
Sbjct: 455 EQDRPQLLYEDDDGDKVLLATDGDLIGAVSHARSVGLKVLRLHLDYYDPSNQTTSPLDTT 514

Query: 121 ATLDQKRGSLHLYSGAFAAAIVITSIGVLIYLKRSKM 154
            T  Q+ G +   SG F A +V+  I V+ YLKRSKM
Sbjct: 515 TTATQRIGLVSFRSGIFVAGVVLAGIAVVAYLKRSKM 551

BLAST of Lsi01G019840 vs. TrEMBL
Match: A0A061DRZ9_THECC (CBS / octicosapeptide/Phox/Bemp1 domains-containing protein isoform 2 OS=Theobroma cacao GN=TCM_001612 PE=4 SV=1)

HSP 1 Score: 183.0 bits (463), Expect = 2.8e-43
Identity = 98/157 (62.42%), Positives = 115/157 (73.25%), Query Frame = 1

Query: 1   MSAFMASEGA-----LYYPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVG 60
           MSA MAS+G        YPSLGLGNSFAFKFEDLKGRVHR NCGTE LDEL+S +M R+ 
Sbjct: 318 MSAIMASDGGDAGKLSSYPSLGLGNSFAFKFEDLKGRVHRFNCGTENLDELLSAIMPRIA 377

Query: 61  ATNGASRPLLLYEDNEGDKVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQREIQT 120
           ++N   RP LLYED+EGDKV+LATD DL  AVNHA+S G KVLRLHLD   S QQ++ Q+
Sbjct: 378 SSNDHGRPQLLYEDDEGDKVLLATDSDLIVAVNHARSRGLKVLRLHLDSADSDQQKKSQS 437

Query: 121 DATLDQKRGSLHLYSGAFAAAIVITSIGVLIYLKRSK 153
             T  ++ G + L SG  A  +VIT I VL+YLKRSK
Sbjct: 438 SIT-SKRTGWVSLRSGLLAGVVVITGISVLVYLKRSK 473

BLAST of Lsi01G019840 vs. TAIR10
Match: AT3G52950.1 (AT3G52950.1 CBS / octicosapeptide/Phox/Bemp1 (PB1) domains-containing protein)

HSP 1 Score: 150.6 bits (379), Expect = 7.9e-37
Identity = 80/150 (53.33%), Positives = 104/150 (69.33%), Query Frame = 1

Query: 13  YPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGATNG--ASRPLLLYEDN 72
           YPSLGLGNSF+FKFEDLKGRVHR   G E L+EL+ +VMQR+G+ N     RP ++YED+
Sbjct: 408 YPSLGLGNSFSFKFEDLKGRVHRFTSGAENLEELMGIVMQRIGSDNNNVEQRPQIIYEDD 467

Query: 73  EGDKVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQREIQTDATLDQKRGSLHLYS 132
           EGDKV++ +D DL GAV  A+STG+KVLRLHLDF +S   R + ++ T  +K  S    S
Sbjct: 468 EGDKVLITSDSDLVGAVTLARSTGQKVLRLHLDFTES--TRSLSSETTQLKKGDSRDRGS 527

Query: 133 G--------AFAAAIVITSIGVLIYLKRSK 153
           G            A+V+TSI +++YLKRSK
Sbjct: 528 GWVSWRGGVVVTGAVVLTSIAIVVYLKRSK 555

BLAST of Lsi01G019840 vs. TAIR10
Match: AT2G36500.1 (AT2G36500.1 CBS / octicosapeptide/Phox/Bemp1 (PB1) domains-containing protein)

HSP 1 Score: 134.0 bits (336), Expect = 7.6e-32
Identity = 78/153 (50.98%), Positives = 102/153 (66.67%), Query Frame = 1

Query: 1   MSAFMA-SEGALYYPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGATNG 60
           MSA +  SEG    PS GL +SFAFKFED KGRV R N   E+ +EL+SVVMQR  A +G
Sbjct: 387 MSAMLINSEGKQSCPSQGLVSSFAFKFEDRKGRVQRFNSTGESFEELMSVVMQRCEADSG 446

Query: 61  ASRPLLLYEDNEGDKVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQREIQTDATL 120
                ++Y+D+EGDKV+++ D DL  AV  A+S G+KVLRLHLDF ++I   E   D + 
Sbjct: 447 LQ---IMYQDDEGDKVLISRDSDLVAAVTFARSLGQKVLRLHLDFTETIAPLETIADLS- 506

Query: 121 DQKRGSLHLYSGAFAAAIVITSIGVLIYLKRSK 153
           +   G +   +G  A AIV+TSIG+ +YLKRSK
Sbjct: 507 EGNGGCVWWQTGVLAGAIVLTSIGLFVYLKRSK 535

BLAST of Lsi01G019840 vs. TAIR10
Match: AT5G50640.1 (AT5G50640.1 CBS / octicosapeptide/Phox/Bemp1 (PB1) domains-containing protein)

HSP 1 Score: 109.0 bits (271), Expect = 2.6e-24
Identity = 59/140 (42.14%), Positives = 81/140 (57.86%), Query Frame = 1

Query: 15  SLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGAT-NGASRPLLLYEDNEGD 74
           S    N+FAFK +D KGR+HR  C T++L  L++ ++QR+G      + P ++YED + D
Sbjct: 407 SFSYPNTFAFKLQDKKGRMHRFMCETQSLTTLITAILQRMGDDIEPDNLPQIMYEDEDND 466

Query: 75  KVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQRE--IQTDATLDQKRGSLHLYSG 134
           KVVLA+D DL  AV HAKS G K L+LHLD+ +    R      D   DQ       Y  
Sbjct: 467 KVVLASDNDLGAAVEHAKSIGWKGLKLHLDYTEERGHRRGLSSEDMDYDQSNSWAAAYKT 526

Query: 135 AFAAAIVITSIGVLIYLKRS 152
             A A +   +GVL+YLKR+
Sbjct: 527 VAAGAALAAGLGVLVYLKRN 546

BLAST of Lsi01G019840 vs. TAIR10
Match: AT5G50530.1 (AT5G50530.1 CBS / octicosapeptide/Phox/Bemp1 (PB1) domains-containing protein)

HSP 1 Score: 109.0 bits (271), Expect = 2.6e-24
Identity = 59/140 (42.14%), Positives = 81/140 (57.86%), Query Frame = 1

Query: 15  SLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGAT-NGASRPLLLYEDNEGD 74
           S    N+FAFK +D KGR+HR  C T++L  L++ ++QR+G      + P ++YED + D
Sbjct: 407 SFSYPNTFAFKLQDKKGRMHRFMCETQSLTTLITAILQRMGDDIEPDNLPQIMYEDEDND 466

Query: 75  KVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQRE--IQTDATLDQKRGSLHLYSG 134
           KVVLA+D DL  AV HAKS G K L+LHLD+ +    R      D   DQ       Y  
Sbjct: 467 KVVLASDNDLGAAVEHAKSIGWKGLKLHLDYTEERGHRRGLSSEDMDYDQSNSWAAAYKT 526

Query: 135 AFAAAIVITSIGVLIYLKRS 152
             A A +   +GVL+YLKR+
Sbjct: 527 VAAGAALAAGLGVLVYLKRN 546

BLAST of Lsi01G019840 vs. TAIR10
Match: AT5G63490.1 (AT5G63490.1 CBS / octicosapeptide/Phox/Bemp1 (PB1) domains-containing protein)

HSP 1 Score: 90.5 bits (223), Expect = 9.7e-19
Identity = 55/150 (36.67%), Positives = 79/150 (52.67%), Query Frame = 1

Query: 15  SLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGATNGASR-PLLLYEDNEGD 74
           S    N+F+FK ED K R HR    T +L E+++ ++QRVG        P +LYED + D
Sbjct: 398 SFPFANTFSFKIEDKKHRKHRFISDTRSLTEVITAIIQRVGDDIDPDNFPQILYEDEDHD 457

Query: 75  KVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQRE-------------IQTDATLD 134
           KV+LA+D DL  A+ HAKS G K LRLHLD  +  + R              ++TDA   
Sbjct: 458 KVLLASDSDLQAAIEHAKSIGWKSLRLHLDDSREGKGRRRRRASGSAESMEYVETDAW-- 517

Query: 135 QKRGSLHLYSGAFAAAIVITSIGVLIYLKR 151
                   YSG  A A ++  +G + +L++
Sbjct: 518 -----AAAYSGVAAGAALVAGLGFMAFLRK 540

BLAST of Lsi01G019840 vs. NCBI nr
Match: gi|449457321|ref|XP_004146397.1| (PREDICTED: CBS domain-containing protein CBSCBSPB3 [Cucumis sativus])

HSP 1 Score: 266.5 bits (680), Expect = 2.8e-68
Identity = 134/153 (87.58%), Positives = 142/153 (92.81%), Query Frame = 1

Query: 1   MSAFMASEGALYYPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGATNGA 60
           MSAFMASEG L YPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQR+GAT+ A
Sbjct: 387 MSAFMASEGTLNYPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRIGATDSA 446

Query: 61  SRPLLLYEDNEGDKVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQREIQTDATLD 120
           +RPLLLYED+EGDKVVLATDGDLSGAVNHA+S G KVLRLHLDFP+SIQQ E Q DA LD
Sbjct: 447 NRPLLLYEDDEGDKVVLATDGDLSGAVNHARSIGLKVLRLHLDFPESIQQTEAQNDAMLD 506

Query: 121 QKRGSLHLYSGAFAAAIVITSIGVLIYLKRSKM 154
           QKRGSLHLYSGAFAAAI +TSIGVL YLKRSK+
Sbjct: 507 QKRGSLHLYSGAFAAAIALTSIGVLFYLKRSKV 539

BLAST of Lsi01G019840 vs. NCBI nr
Match: gi|659083040|ref|XP_008442154.1| (PREDICTED: CBS domain-containing protein CBSCBSPB3 [Cucumis melo])

HSP 1 Score: 263.1 bits (671), Expect = 3.1e-67
Identity = 134/153 (87.58%), Positives = 140/153 (91.50%), Query Frame = 1

Query: 1   MSAFMASEGALYYPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGATNGA 60
           MSAFMASEG L YPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQR+GATN A
Sbjct: 387 MSAFMASEGTLNYPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRIGATNSA 446

Query: 61  SRPLLLYEDNEGDKVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQREIQTDATLD 120
           SRPLLLYED+EGDKVVLATDGDLSGAVNHA+S G KVLRLHLDFP+SIQQ E Q DA L 
Sbjct: 447 SRPLLLYEDDEGDKVVLATDGDLSGAVNHARSIGLKVLRLHLDFPESIQQTEAQNDAMLG 506

Query: 121 QKRGSLHLYSGAFAAAIVITSIGVLIYLKRSKM 154
            KRGSL LYSGAFAAA+V+TSIGVL YLKRSKM
Sbjct: 507 WKRGSLPLYSGAFAAAVVLTSIGVLFYLKRSKM 539

BLAST of Lsi01G019840 vs. NCBI nr
Match: gi|703111040|ref|XP_010099759.1| (CBS domain-containing protein [Morus notabilis])

HSP 1 Score: 186.4 bits (472), Expect = 3.7e-44
Identity = 94/155 (60.65%), Positives = 116/155 (74.84%), Query Frame = 1

Query: 1   MSAFMASEGALY--YPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGATN 60
           MSAFM S+G     YPSLGLGN+F+FKFED KGRVHR+NCGTE+LDEL+S VMQR+GA +
Sbjct: 383 MSAFMTSDGTEIGKYPSLGLGNTFSFKFEDFKGRVHRLNCGTESLDELLSTVMQRIGAES 442

Query: 61  GASRPLLLYEDNEGDKVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQREIQTDAT 120
           G+  P +LYED+EGDKV+LATD DL  AV HA+S G KVLRLHLDF  S QQR +++   
Sbjct: 443 GSDHPQILYEDDEGDKVLLATDSDLVSAVTHARSIGLKVLRLHLDFSDSNQQRTLESSTA 502

Query: 121 LDQKRGSLHLYSGAFAAAIVITSIGVLIYLKRSKM 154
             Q       ++G  A A V+TSIGVL+YLKR+ +
Sbjct: 503 TTQGTRWTSSHTGLLAGAAVLTSIGVLLYLKRTNL 537

BLAST of Lsi01G019840 vs. NCBI nr
Match: gi|296082621|emb|CBI21626.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 185.7 bits (470), Expect = 6.3e-44
Identity = 93/158 (58.86%), Positives = 123/158 (77.85%), Query Frame = 1

Query: 1   MSAFMASEGAL----YYPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGA 60
           +SA MA++GA      YPSLGLGNSFAFKFED+KGRVHR NCGTE+LDELVS VMQR+GA
Sbjct: 399 LSAVMAADGAEPGRNMYPSLGLGNSFAFKFEDIKGRVHRFNCGTESLDELVSAVMQRIGA 458

Query: 61  TNGASRPLLLYEDNEGDKVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQ-REIQT 120
           +    RP +LYED+EGDKV+L+TD DL  AV+HA+  G+KVLRL LD+ +SIQ+ R  QT
Sbjct: 459 STDQDRPQILYEDDEGDKVLLSTDSDLVSAVSHARVVGQKVLRLQLDYSESIQETRRPQT 518

Query: 121 DATLDQKRGSLHLYSGAFAAAIVITSIGVLIYLKRSKM 154
                +  G + L+SG  A+A++IT++G+++YLKR+K+
Sbjct: 519 GTDTVRGTGGVFLHSGILASAVIITAVGLMVYLKRAKL 556

BLAST of Lsi01G019840 vs. NCBI nr
Match: gi|225438337|ref|XP_002272502.1| (PREDICTED: CBS domain-containing protein CBSCBSPB3 [Vitis vinifera])

HSP 1 Score: 185.7 bits (470), Expect = 6.3e-44
Identity = 93/158 (58.86%), Positives = 123/158 (77.85%), Query Frame = 1

Query: 1   MSAFMASEGAL----YYPSLGLGNSFAFKFEDLKGRVHRVNCGTETLDELVSVVMQRVGA 60
           +SA MA++GA      YPSLGLGNSFAFKFED+KGRVHR NCGTE+LDELVS VMQR+GA
Sbjct: 382 LSAVMAADGAEPGRNMYPSLGLGNSFAFKFEDIKGRVHRFNCGTESLDELVSAVMQRIGA 441

Query: 61  TNGASRPLLLYEDNEGDKVVLATDGDLSGAVNHAKSTGKKVLRLHLDFPKSIQQ-REIQT 120
           +    RP +LYED+EGDKV+L+TD DL  AV+HA+  G+KVLRL LD+ +SIQ+ R  QT
Sbjct: 442 STDQDRPQILYEDDEGDKVLLSTDSDLVSAVSHARVVGQKVLRLQLDYSESIQETRRPQT 501

Query: 121 DATLDQKRGSLHLYSGAFAAAIVITSIGVLIYLKRSKM 154
                +  G + L+SG  A+A++IT++G+++YLKR+K+
Sbjct: 502 GTDTVRGTGGVFLHSGILASAVIITAVGLMVYLKRAKL 539

The following BLAST results are available for this feature:
Match Name     E-value   Identity (%)   Description
Y3295_ARATH    1.4e-35   53.33          CBS domain-containing protein CBSCBSPB3 OS=Arabidopsis thaliana GN=CBSCBSPB3 PE=...
Y2650_ARATH    1.4e-30   50.98          CBS domain-containing protein CBSCBSPB2 OS=Arabidopsis thaliana GN=CBSCBSPB2 PE=...
Y5053_ARATH    4.7e-23   42.14          CBS domain-containing protein CBSCBSPB4 OS=Arabidopsis thaliana GN=CBSCBSPB4 PE=...
Y5064_ARATH    4.7e-23   42.14          CBS domain-containing protein CBSCBSPB5 OS=Arabidopsis thaliana GN=CBSCBSPB5 PE=...
Y5349_ARATH    1.7e-17   36.67          CBS domain-containing protein CBSCBSPB1 OS=Arabidopsis thaliana GN=CBSCBSPB1 PE=...

Match Name          E-value   Identity (%)   Description
A0A0A0KYM2_CUCSA    1.9e-68   87.58          Uncharacterized protein OS=Cucumis sativus GN=Csa_4G495220 PE=4 SV=1
W9RJ31_9ROSA        2.6e-44   60.65          CBS domain-containing protein OS=Morus notabilis GN=L484_010945 PE=4 SV=1
F6H4G2_VITVI        4.4e-44   58.86          Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0031g02750 PE=4 SV=...
B9HF69_POPTR        1.3e-43   61.15          Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s04880g PE=4 SV=2
A0A061DRZ9_THECC    2.8e-43   62.42          CBS / octicosapeptide/Phox/Bemp1 domains-containing protein isoform 2 OS=Theobro...

Match Name     E-value   Identity (%)   Description
AT3G52950.1    7.9e-37   53.33          CBS / octicosapeptide/Phox/Bemp1 (PB1) domains-containing protein
AT2G36500.1    7.6e-32   50.98          CBS / octicosapeptide/Phox/Bemp1 (PB1) domains-containing protein
AT5G50640.1    2.6e-24   42.14          CBS / octicosapeptide/Phox/Bemp1 (PB1) domains-containing protein
AT5G50530.1    2.6e-24   42.14          CBS / octicosapeptide/Phox/Bemp1 (PB1) domains-containing protein
AT5G63490.1    9.7e-19   36.67          CBS / octicosapeptide/Phox/Bemp1 (PB1) domains-containing protein

Match Name                          E-value   Identity (%)   Description
gi|449457321|ref|XP_004146397.1|    2.8e-68   87.58          PREDICTED: CBS domain-containing protein CBSCBSPB3 [Cucumis sativus]
gi|659083040|ref|XP_008442154.1|    3.1e-67   87.58          PREDICTED: CBS domain-containing protein CBSCBSPB3 [Cucumis melo]
gi|703111040|ref|XP_010099759.1|    3.7e-44   60.65          CBS domain-containing protein [Morus notabilis]
gi|296082621|emb|CBI21626.3|        6.3e-44   58.86          unnamed protein product [Vitis vinifera]
gi|225438337|ref|XP_002272502.1|    6.3e-44   58.86          PREDICTED: CBS domain-containing protein CBSCBSPB3 [Vitis vinifera]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0005515    protein binding
Vocabulary: INTERPRO
Term          Definition
IPR000270     PB1_dom
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession   Term Name
biological_process    GO:0008150       biological_process
biological_process    GO:0055114       oxidation-reduction process
cellular_component    GO:0016021       integral component of membrane
cellular_component    GO:0005575       cellular_component
molecular_function    GO:0005515       protein binding
molecular_function    GO:0016491       oxidoreductase activity
molecular_function    GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
Lsi01G019840.1    Lsi01G019840.1    mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term     IPR Description     Source     Source Term          Source Description                                        Alignment
IPR000270    PB1 domain          PFAM       PF00564              PB1                                                       coord: 22..103; score: 1.4
IPR000270    PB1 domain          SMART      SM00666              PB1_new                                                   coord: 21..105; score: 1.1
IPR000270    PB1 domain          PROFILE    PS51745              PB1                                                       coord: 19..105; score: 18
None         No IPR available    GENE3D     G3DSA:3.10.20.240                                                              coord: 19..103; score: 6.7
None         No IPR available    PANTHER    PTHR13780            AMP-ACTIVATED PROTEIN KINASE, GAMMA REGULATORY SUBUNIT    coord: 38..153; score: 1.3
None         No IPR available    PANTHER    PTHR13780:SF48       CBS DOMAIN-CONTAINING PROTEIN CBSCBSPB1-RELATED           coord: 38..153; score: 1.3
None         No IPR available    unknown    SSF54277             CAD & PB1 domains                                         coord: 13..109; score: 2.0