Lsi03G004240 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi03G004240
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Acyl-coenzyme A:6-aminopenicillanic acid acyl-transferase
Location: chr03 : 4791402 .. 4794251 (-)
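
The location field can be read programmatically. The following is a minimal sketch, not part of the database record: it assumes the `chromosome : start .. end (strand)` layout used here, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state. The helper name `parse_location` is hypothetical.

```python
import re

# Hypothetical helper; assumes the "chr : start .. end (strand)" layout of this record.
LOCATION_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")

def parse_location(text):
    """Return (chromosome, start, end, strand) parsed from a location string."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("chr03 : 4791402 .. 4794251 (-)")
# Span length under the assumption of 1-based, inclusive coordinates.
span = end - start + 1
print(chrom, start, end, strand, span)
```

Under the inclusive-coordinate assumption the gene spans 2850 bp on the minus strand of chr03.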
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGGAGAACAAAGAAGTGGAGAATCCAGAAGTTGGTTCTCCATGCAAGATAGAAAAACAAGAAGATATGGAAAACAAAGAGCTGGAAATGTTTGAAGTTGGTCCATGTGAATCTGCATACCAGATGGGGTTCTTGATTGGGAAGAGGTTCTCTGATACCATAAGGAGCAGATTGGAGAAAGACATGGTTCTTCGCCAACAGCTGCTTCCTTTTGCTCAAACTCCAGAGTCAAAGCCTCTTATAGAAGCCCTCTGCAAAAATAACAAGGCCAAGTACCCAACGCATTGGGATGAACTCATGGGAATTGTAGATGGAAGTCAAGCACCTCGCCTTGAGGTAATAATAATGGGCGGCTGCACCTAATCCCAAGCAGCCCTTTTTTATACATAGTAGAAACAATATAAAAATAATCAATCTTTTTTTAAAAAAACTATCACATGGATGACGGAACAATAGATGAGTGCAGTCCAACACCCAAGTTTTTCTCATAATAATAAGAGAAGTACTTAGGTTCTACCTAGGTGTGTGCTACCTAAGCTGCACCTATTTTTGTGCAGCCCAGACAACTATTTAATAAAAAGTTGTCACGTGGCAATTCTTTTTTAACGTGTGTAAAGAAAAGAAGAGCATGGACTAGGTGCAGCCTGACTGCACCTAATCATTGCTCTAATGATAATATAATGCAAATGGTAGAGAGAATAGAGAGTGAAAAATGAGATATTTTGTTTTTGCTAACTTTGGAAAATCAATGTGCAGATTATTCTACTCAACTTCAGGAGGGAGATTCTTCCATTTATTCAGGACGAGGCTTCCATTGTTGATTGTACAGATGATTGTTCTGATATTCTTGTTGTTAGTGATTCATTAGCCCTTGCAGCACATAATGAAGATGCCAGCTTTGGTCTGTCTGGCTACACGTAAGTTCACAAAAACTAAACTTTATCCTAGTTTTTATCAAAGAGCTCAACATACTGTTAAACCAATTTCATTTTTCTTTTCTTCACTTTTGAAGCTATTTGATCAAAGGAAAACTGCAAAATGGGACATGCTACATTGCTTATACACACGCAGGGGAGATACCAAGTCGTGCTTTCAGTTTTAATAGCAATGGCCTGGTAAAGTTCTCATCCCATACCATCATTATTCAGACATTTTCTTTCTTTTTTATTTTTTCTGAAACCATCCATTCATTCTGAAGGAACAATTTAAAGAGCATGGATAGATCTTACATAGGAAATTATAGACTCAAGCTCTAAATAGATGTTGAAATATCGAAAACTGACACATAATATTTGTCCAACTATTTTGATACTGAATGATAAAACATATCTTTTCCAATCTCTTAGGCATTTACCATGAATACAGTGCGTCCAGTGAATGATGAGATTGAACCTGGGGCAATCGGACGAAACTTTATCTCCCAGGACCTCCTTGAATCTACAAGCTTCGAAGATGCAATAGCTGTAAGGCCCTTCATTAATAATGTCTTCTATTAATTATAGATTATGTTTCAGCTTAATTACCTTAGCAAGTTCAAAACATAACAAATAGGGTTGGATTGTCTTTTTCAGAGAATTCGCTCAGCAGAAATATCCCTCGGCCACAATTATAATGTGATTGATATTCAGACAAGAAGAATTGCGAGTGTAGAAACAGCATCAAAGTTTAGATTGTCAGTCCACGAGGTTGAGGCTACACCATTTTTCCATGCAAATATGTATTCCCACCTTCAGAATATTAGGCAGGTAATAATATCAGGAAAATCCACAAATAAAGCAATGACACCATGCAATTGTATAACTTATAAGGTCGGCCTTCTAATTGAAAGAAAGTTACATTCATTCTTCAGCAATAAAAATTTGTCCTCAAAGCCTTCTCCATCTCCATAACATTGGAATTTGAACTAGAAAAAAGTTTGGATTCTGCCTAGATGCACCTATCTTTGTGCAGCCCAGGCAGTTGGACAATAAAAAGATAAATTATTTTTTATGTTTCTTTTT
TTGTGTGTATAAATAGAGAGAAGCATGCGGCTGCACCTAGTGAGGTTGTACCTAATCATTACCCTTTGAACTATTAGCTAAATGAAATCTGCAAGTGATCAGTTTTGAAGTCAAAATAAATTGACACAGATAATAGATGAAAACTCCACAAGCAGAACAAGACGAGCTGATGTCATGGCAAAAGAAACAAAAGATGATTTCCTGTCGGTTATTGGAGATACAGACAACGAGGAATATCCTATCTATATGAAAGGTAAAATAAATAAAAAGATGATTTATATTGCCTAGCCTTTTCAAAACTGAATTCGAAAATTTTCTCAATGTCTTCAGGTCCTAAGCTTTACACAATGTGTAGTGTTCTAATTGACCTAGATGAAGAAACTCTATCAATTTTTCAAGGAAATCCGAAGAACAAAGAGATATCCCATGTCTTCTCCCTATCAGAGTTGAAGAAACCATAACCACACTGCTGGCATAAAGGCCCAACATATTGATATTAACCAGTAAAGGATTTCAGATTATGATCCACGTTTTATTTAAGAATTTTGATTGTTTTCCATTTTCCATTTTTGCTCTTCTAAGGATATTAAAGTAAGAGTATTTTAGTGTTTTTGTTTAGGGGAGATGATTTGGTATCAAATTAGAGTTTCATTCCTATCCACCACTATTAAGGTTAATTGAGAAATAAAATGGGCTATTTAACGCCCAAGTCACCTTCATGTAAGGGACGTTGAATTTGAATAAAATTGGAGTGTACCACACTTAAGAGCCATGAGCTTTATCACCTTGAAATTCTTGATCTAGGGCTGTCTGGTTGAACATTCAAATGAGTAAAGAACTAATTCTCCTT

mRNA sequence

ATGGAGAACAAAGAAGTGGAGAATCCAGAAGTTGGTTCTCCATGCAAGATAGAAAAACAAGAAGATATGGAAAACAAAGAGCTGGAAATGTTTGAAGTTGGTCCATGTGAATCTGCATACCAGATGGGGTTCTTGATTGGGAAGAGGTTCTCTGATACCATAAGGAGCAGATTGGAGAAAGACATGGTTCTTCGCCAACAGCTGCTTCCTTTTGCTCAAACTCCAGAGTCAAAGCCTCTTATAGAAGCCCTCTGCAAAAATAACAAGGCCAAGAGGGAGATTCTTCCATTTATTCAGGACGAGGCTTCCATTGTTGATTGTACAGATGATTGTTCTGATATTCTTGTTGTTAGTGATTCATTAGCCCTTGCAGCACATAATGAAGATGCCAGCTTTGGTCTGTCTGGCTACACCTATTTGATCAAAGGAAAACTGCAAAATGGGACATGCTACATTGCTTATACACACGCAGGGGAGATACCAAGTCGTGCTTTCAGTTTTAATAGCAATGGCCTGGCATTTACCATGAATACAGTGCGTCCAGTGAATGATGAGATTGAACCTGGGGCAATCGGACGAAACTTTATCTCCCAGGACCTCCTTGAATCTACAAGCTTCGAAGATGCAATAGCTAGAATTCGCTCAGCAGAAATATCCCTCGGCCACAATTATAATGTGATTGATATTCAGACAAGAAGAATTGCGAGTGTAGAAACAGCATCAAAGTTTAGATTGTCAGTCCACGAGGTTGAGGCTACACCATTTTTCCATGCAAATATGTATTCCCACCTTCAGAATATTAGGCAGATAATAGATGAAAACTCCACAAGCAGAACAAGACGAGCTGATGTCATGGCAAAAGAAACAAAAGATGATTTCCTGTCGGTTATTGGAGATACAGACAACGAGGAATATCCTATCTATATGAAAGGTCCTAAGCTTTACACAATGTGTAGTGTTCTAATTGACCTAGATGAAGAAACTCTATCAATTTTTCAAGGAAATCCGAAGAACAAAGAGATATCCCATGTCTTCTCCCTATCAGAGTTGAAGAAACCATAACCACACTGCTGGCATAAAGGCCCAACATATTGATATTAACCAGTAAAGGATTTCAGATTATGATCCACGTTTTATTTAAGAATTTTGATTGTTTTCCATTTTCCATTTTTGCTCTTCTAAGGATATTAAAGTAAGAGTATTTTAGTGTTTTTGTTTAGGGGAGATGATTTGGTATCAAATTAGAGTTTCATTCCTATCCACCACTATTAAGGTTAATTGAGAAATAAAATGGGCTATTTAACGCCCAAGTCACCTTCATGTAAGGGACGTTGAATTTGAATAAAATTGGAGTGTACCACACTTAAGAGCCATGAGCTTTATCACCTTGAAATTCTTGATCTAGGGCTGTCTGGTTGAACATTCAAATGAGTAAAGAACTAATTCTCCTT

Coding sequence (CDS)

ATGGAGAACAAAGAAGTGGAGAATCCAGAAGTTGGTTCTCCATGCAAGATAGAAAAACAAGAAGATATGGAAAACAAAGAGCTGGAAATGTTTGAAGTTGGTCCATGTGAATCTGCATACCAGATGGGGTTCTTGATTGGGAAGAGGTTCTCTGATACCATAAGGAGCAGATTGGAGAAAGACATGGTTCTTCGCCAACAGCTGCTTCCTTTTGCTCAAACTCCAGAGTCAAAGCCTCTTATAGAAGCCCTCTGCAAAAATAACAAGGCCAAGAGGGAGATTCTTCCATTTATTCAGGACGAGGCTTCCATTGTTGATTGTACAGATGATTGTTCTGATATTCTTGTTGTTAGTGATTCATTAGCCCTTGCAGCACATAATGAAGATGCCAGCTTTGGTCTGTCTGGCTACACCTATTTGATCAAAGGAAAACTGCAAAATGGGACATGCTACATTGCTTATACACACGCAGGGGAGATACCAAGTCGTGCTTTCAGTTTTAATAGCAATGGCCTGGCATTTACCATGAATACAGTGCGTCCAGTGAATGATGAGATTGAACCTGGGGCAATCGGACGAAACTTTATCTCCCAGGACCTCCTTGAATCTACAAGCTTCGAAGATGCAATAGCTAGAATTCGCTCAGCAGAAATATCCCTCGGCCACAATTATAATGTGATTGATATTCAGACAAGAAGAATTGCGAGTGTAGAAACAGCATCAAAGTTTAGATTGTCAGTCCACGAGGTTGAGGCTACACCATTTTTCCATGCAAATATGTATTCCCACCTTCAGAATATTAGGCAGATAATAGATGAAAACTCCACAAGCAGAACAAGACGAGCTGATGTCATGGCAAAAGAAACAAAAGATGATTTCCTGTCGGTTATTGGAGATACAGACAACGAGGAATATCCTATCTATATGAAAGGTCCTAAGCTTTACACAATGTGTAGTGTTCTAATTGACCTAGATGAAGAAACTCTATCAATTTTTCAAGGAAATCCGAAGAACAAAGAGATATCCCATGTCTTCTCCCTATCAGAGTTGAAGAAACCATAA

Protein sequence

MENKEVENPEVGSPCKIEKQEDMENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIEALCKNNKAKREILPFIQDEASIVDCTDDCSDILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSNGLAFTMNTVRPVNDEIEPGAIGRNFISQDLLESTSFEDAIARIRSAEISLGHNYNVIDIQTRRIASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETKDDFLSVIGDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLSELKKP
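
The relationship between the coding sequence and the protein sequence can be spot-checked by translation. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1), which the record does not name explicitly, and the function name `translate` is hypothetical. It is applied to the first 24 and last 12 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
# Assumption: the record does not state which translation table was used.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGAGAACAAAGAAGTGGAGAAT"  # first 24 bases of the CDS above
cds_end = "AAGAAACCATAA"                # last 12 bases, including the stop codon
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to `MENKEVEN` and `KKP`, which agree with the first eight and last three residues of the protein sequence shown above.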
BLAST of Lsi03G004240 vs. TrEMBL
Match: A0A0D2U9C1_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G251600 PE=4 SV=1)

HSP 1 Score: 411.8 bits (1057), Expect = 8.6e-112
Identity = 209/356 (58.71%), Positives = 265/356 (74.44%), Query Frame = 1

Query: 23  MENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIE 82
           ME K LEMFEVGPCE  YQ+GFLIG+RF + IRSRL  D++L+ QLLPFA+TP ++PL++
Sbjct: 1   MEGKLLEMFEVGPCEDDYQLGFLIGQRFCNQIRSRLAGDLILQNQLLPFARTPHAQPLLK 60

Query: 83  ALCKNNKAK----------------------------REILPFIQDEA--SIVDCTDDCS 142
           AL + N+ K                            +EILPFI      S  D TDDCS
Sbjct: 61  ALSETNQKKFPRYWAELLGTADGSGVPVLDIILVNFRKEILPFISKTTMNSNADTTDDCS 120

Query: 143 DILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSNGL 202
           D+L+V DS+A+AAHNEDA+  L G+TYLIKGKL NG  +IAYT+AGE+PS AF  NS GL
Sbjct: 121 DVLIVGDSMAVAAHNEDANVALVGHTYLIKGKLSNGLSFIAYTYAGELPSCAFGLNSQGL 180

Query: 203 AFTMNTVRPVNDEIEPGAIGRNFISQDLLESTSFEDAIARIRSAEISLGHNYNVIDIQTR 262
           AFT+N+V PV DEI P  IGRNF+S+DLLE+TS  DA+ARIRS+E+S+GH+YN+IDIQ R
Sbjct: 181 AFTLNSVPPVEDEIAPAGIGRNFVSRDLLEATSTADALARIRSSEVSVGHSYNLIDIQKR 240

Query: 263 RIASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETKDD 322
            I +VETASK R+SVHEV  TPFFHANMY HLQ ++Q+ DENS SR +RA V+ + +K D
Sbjct: 241 MILNVETASKSRVSVHEVGTTPFFHANMYLHLQ-VQQVHDENSISRQKRAAVLPQGSKTD 300

Query: 323 FLSVIGDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLS 349
           FLS++GDT++ +YPIYM GP LYT+C+ +IDLDE TL+I +GNPK  ++SHVFS+S
Sbjct: 301 FLSLLGDTEDTKYPIYMTGPTLYTLCTTVIDLDERTLTIIEGNPKYGKVSHVFSMS 355
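
The percentages in an HSP summary line are the reported counts divided by the alignment length. A minimal sketch of recomputing them follows; the function name `hsp_percentages` is hypothetical, and the regular expression only assumes the `label = matches/length` pattern seen in this record, not any particular BLAST output specification.

```python
import re

def hsp_percentages(summary_line):
    """Recompute percentages from 'label = matches/length' pairs in an HSP summary line."""
    pairs = re.findall(r"\w+ = (\d+)/(\d+)", summary_line)
    return [round(100 * int(matches) / int(length), 2) for matches, length in pairs]

# Counts taken from the first TrEMBL hit above (A0A0D2U9C1_GOSRA).
line = "Identity = 209/356 (58.71%), Positives = 265/356 (74.44%), Query Frame = 1"
print(hsp_percentages(line))
```

For this hit the recomputed values, 58.71 and 74.44, match the percentages printed in the summary line.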

BLAST of Lsi03G004240 vs. TrEMBL
Match: A0A059CHR6_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_D01785 PE=4 SV=1)

HSP 1 Score: 408.7 bits (1049), Expect = 7.3e-111
Identity = 205/363 (56.47%), Positives = 272/363 (74.93%), Query Frame = 1

Query: 23  MENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIE 82
           ME   LEMFEVGPCE+ +QMGFLIG+RFS  I+SRL +D++LR QLLP+A+ PES+PL+E
Sbjct: 1   MEEHALEMFEVGPCETPHQMGFLIGRRFSRLIQSRLSRDLILRNQLLPWARAPESRPLLE 60

Query: 83  ALCKNNKAK----------------------------REILPFIQDEASIVDCTD---DC 142
           ALC++N+ K                            +EILPFI D+ +  D  +   +C
Sbjct: 61  ALCEHNQTKFPRYWDELVGTAEGADVPVLDIVLINFRKEILPFIPDKETKSDLPEKAIEC 120

Query: 143 SDILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSNG 202
           SD+LVV +S+A+AAHNEDA+  L G+TYLIKG L +G C+I+YT+AGE+PS AF FN+NG
Sbjct: 121 SDVLVVGESMAVAAHNEDANVALVGHTYLIKGTLSSGLCFISYTYAGELPSCAFGFNNNG 180

Query: 203 LAFTMNTVRPVNDEIEPGAIGRNFISQDLLESTSFEDAIARIRSAEISLGHNYNVIDIQT 262
           + FT+N V P  +EI    IGRNFIS+DLLE+TS  DA ++IRSAE S+GH+YN+ID++ 
Sbjct: 181 MGFTLNAVPPSKEEIVASGIGRNFISRDLLEATSMTDATSKIRSAEASVGHSYNLIDLKA 240

Query: 263 RRIASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETKD 322
           RRI ++ETAS+ R+SV+EV+ TPFFHANMY HLQ ++Q+ DENS SR +RA V+ K++K 
Sbjct: 241 RRICNLETASRTRVSVNEVDDTPFFHANMYLHLQ-VKQVEDENSKSRQKRAAVLPKKSKQ 300

Query: 323 DFLSVIGDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLS--E 353
           DFLS++GDTD+ +YPIYM GP LYT+C+ L+DLDE+TLSI +GNPK  E+SH FSLS  +
Sbjct: 301 DFLSLLGDTDDAKYPIYMSGPTLYTLCTALVDLDEQTLSIIKGNPKKGEVSHTFSLSSPD 360

BLAST of Lsi03G004240 vs. TrEMBL
Match: A0A0A0LSY8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G404920 PE=4 SV=1)

HSP 1 Score: 404.4 bits (1038), Expect = 1.4e-109
Identity = 205/318 (64.47%), Positives = 252/318 (79.25%), Query Frame = 1

Query: 23  MENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIE 82
           ME+K+LE+FEVGPCESAYQMGFLIGKRFSDTI+SRL  D+VLR +LLPFAQ+P+S PLIE
Sbjct: 1   MEDKKLEIFEVGPCESAYQMGFLIGKRFSDTIKSRLHTDLVLRNELLPFAQSPQSHPLIE 60

Query: 83  ALCKNNKAK----------------------------REILPFIQDEA-SIVDCTDDCSD 142
           ALC NNK +                            +EILPF+Q E  S VDC+DDCSD
Sbjct: 61  ALCNNNKTRFPIYWDELVGTAEGSGVPILEIVLINFRKEILPFLQKEVPSSVDCSDDCSD 120

Query: 143 ILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSNGLA 202
           +L+VSD++A+AAHNEDA+  L G+TYL+KGKLQNG  ++AYT+AGE+PS AF FN +GLA
Sbjct: 121 LLLVSDNMAIAAHNEDANNALVGHTYLVKGKLQNGLSFLAYTYAGELPSCAFGFNDHGLA 180

Query: 203 FTMNTVRPVNDEIEPGAIGRNFISQDLLESTSFEDAIARIRSAEISLGHNYNVIDIQTRR 262
           FT+N+V P NDEI  GAIGRNFIS+DLLESTS E+AI RIRSAE+S+GH+YN+ID+QTRR
Sbjct: 181 FTLNSVPPTNDEIAAGAIGRNFISRDLLESTSLENAIFRIRSAEVSVGHSYNLIDVQTRR 240

Query: 263 IASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETKDDF 312
           I +VETAS++R SV EV ATPFFHANMY+HLQ I Q+ D NS SR +RA+ + KE+K+DF
Sbjct: 241 IVNVETASRYRFSVSEVGATPFFHANMYTHLQ-INQVQDPNSISRQKRANDLPKESKNDF 300

BLAST of Lsi03G004240 vs. TrEMBL
Match: A0A067JP62_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23947 PE=4 SV=1)

HSP 1 Score: 400.2 bits (1027), Expect = 2.6e-108
Identity = 202/363 (55.65%), Positives = 269/363 (74.10%), Query Frame = 1

Query: 23  MENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIE 82
           ME   LE FEVGPC++ YQMGFLIG+RFS+ IRSRL KD++L+ QLLPFAQT +S+PLI+
Sbjct: 1   MEGTRLEKFEVGPCDNGYQMGFLIGQRFSNEIRSRLSKDLILQNQLLPFAQTSQSQPLIK 60

Query: 83  ALCKNNKAK----------------------------REILPFIQDE----ASIVDCTDD 142
           AL  NN+ K                             EILPF+  +    + +   ++D
Sbjct: 61  ALIDNNRKKFPSFWDELIGTAQGSGVPLLDVILINLRMEILPFLPKKEDGNSKVESSSED 120

Query: 143 CSDILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSN 202
           CSDILVV D++A+AAHNEDA+  L G+TYLIK  L NG  +I YT+AGE+PS AF FNS+
Sbjct: 121 CSDILVVRDNMAIAAHNEDANVALLGHTYLIKANLSNGESFIGYTYAGELPSCAFGFNSH 180

Query: 203 GLAFTMNTVRPVNDEIEPGAIGRNFISQDLLESTSFEDAIARIRSAEISLGHNYNVIDIQ 262
           GLAFT+N+V P  DEI  G IGRNF+S+DLLE+TS E+A++RI S+++S+GH+YN++D +
Sbjct: 181 GLAFTLNSVPPSEDEIMAGGIGRNFVSRDLLEATSIENALSRIHSSQVSVGHSYNLMDTR 240

Query: 263 TRRIASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETK 322
            R+I +VETASK R+SVHE+  TPFFHANMY HLQ ++Q+ D+NS SR  RA+V+ KE+K
Sbjct: 241 ARKILNVETASKNRVSVHEIGRTPFFHANMYLHLQ-VKQVQDDNSKSRQERANVLPKESK 300

Query: 323 DDFLSVIGDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLS-- 352
           DDFLS++GDT++  YPIYM GP L+T+C+ LIDLD++TLSI +GNPK  ++S+VFS+S  
Sbjct: 301 DDFLSLLGDTNDTTYPIYMTGPMLHTLCTALIDLDDQTLSIIEGNPKKGKVSYVFSMSAG 360

BLAST of Lsi03G004240 vs. TrEMBL
Match: B9GX51_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s13500g PE=4 SV=2)

HSP 1 Score: 398.7 bits (1023), Expect = 7.6e-108
Identity = 200/357 (56.02%), Positives = 264/357 (73.95%), Query Frame = 1

Query: 28  LEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIEALCKN 87
           LEM E+GPC++ YQMGFL G+RFS+ IRSRL  D+VL+ QLLPFA+TPES+ LI+AL  N
Sbjct: 2   LEMLEIGPCDNPYQMGFLTGQRFSNKIRSRLSTDLVLQNQLLPFAKTPESQALIKALTIN 61

Query: 88  NKAK----------------------------REILPFIQD--------EASIVDCTDDC 147
           N+ K                            +EILPF+          + S+ +  DDC
Sbjct: 62  NQKKFPKYWDELLGTAEGSGVPVLYMILINFRKEILPFLPKSTAANSNADTSLYNTPDDC 121

Query: 148 SDILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSNG 207
           SD+LVVSDS+A+AAHNEDA+  L G+TY+IK  L NG  ++ YT+AGE+PS AF FNSNG
Sbjct: 122 SDVLVVSDSMAIAAHNEDANVALVGHTYIIKATLPNGLSFVGYTYAGELPSCAFGFNSNG 181

Query: 208 LAFTMNTVRPVNDEIEPGAIGRNFISQDLLESTSFEDAIARIRSAEISLGHNYNVIDIQT 267
           LAFT+N+V P   EI  G IGRNFIS+DLLE+TS +DA+++I+S+E+S+GH+YN+I+I T
Sbjct: 182 LAFTLNSVPPSEAEIVAGGIGRNFISRDLLEATSIDDALSKIQSSEVSVGHSYNLINIGT 241

Query: 268 RRIASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETKD 327
           RRI +VETAS+ R+SVHEV ATPFFHANMY HLQ + Q+ D+NS SR  RA V+ K +KD
Sbjct: 242 RRILNVETASRNRVSVHEVGATPFFHANMYLHLQ-VEQVDDDNSKSRQERAAVLPKRSKD 301

Query: 328 DFLSVIGDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLS 349
           DFLS++GDTD+  YPIYM GP LYT+C+ +IDLD++TLSI +GNPKN +++++FS+S
Sbjct: 302 DFLSLLGDTDHNRYPIYMTGPTLYTLCTAMIDLDDKTLSIIEGNPKNGKVAYIFSMS 357

BLAST of Lsi03G004240 vs. NCBI nr
Match: gi|449442062|ref|XP_004138801.1| (PREDICTED: uncharacterized protein LOC101214789 [Cucumis sativus])

HSP 1 Score: 463.0 bits (1190), Expect = 4.7e-127
Identity = 235/362 (64.92%), Positives = 287/362 (79.28%), Query Frame = 1

Query: 23  MENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIE 82
           ME+K+LE+FEVGPCESAYQMGFLIGKRFSDTI+SRL  D+VLR +LLPFAQ+P+S PLIE
Sbjct: 1   MEDKKLEIFEVGPCESAYQMGFLIGKRFSDTIKSRLHTDLVLRNELLPFAQSPQSHPLIE 60

Query: 83  ALCKNNKAK----------------------------REILPFIQDEA-SIVDCTDDCSD 142
           ALC NNK +                            +EILPF+Q E  S VDC+DDCSD
Sbjct: 61  ALCNNNKTRFPIYWDELVGTAEGSGVPILEIVLINFRKEILPFLQKEVPSSVDCSDDCSD 120

Query: 143 ILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSNGLA 202
           +L+VSD++A+AAHNEDA+  L G+TYL+KGKLQNG  ++AYT+AGE+PS AF FN +GLA
Sbjct: 121 LLLVSDNMAIAAHNEDANNALVGHTYLVKGKLQNGLSFLAYTYAGELPSCAFGFNDHGLA 180

Query: 203 FTMNTVRPVNDEIEPGAIGRNFISQDLLESTSFEDAIARIRSAEISLGHNYNVIDIQTRR 262
           FT+N+V P NDEI  GAIGRNFIS+DLLESTS E+AI RIRSAE+S+GH+YN+ID+QTRR
Sbjct: 181 FTLNSVPPTNDEIAAGAIGRNFISRDLLESTSLENAIFRIRSAEVSVGHSYNLIDVQTRR 240

Query: 263 IASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETKDDF 322
           I +VETAS++R SV EV ATPFFHANMY+HLQ I Q+ D NS SR +RA+ + KE+K+DF
Sbjct: 241 IVNVETASRYRFSVSEVGATPFFHANMYTHLQ-INQVQDPNSISRQKRANDLPKESKNDF 300

Query: 323 LSVIGDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLS--ELK 354
           LSV+GD DN++YPIYM GP LYT+C+ LIDLDE+TLSI QGNPK   ISHVFS+   EL+
Sbjct: 301 LSVLGDMDNKKYPIYMTGPMLYTLCTALIDLDEQTLSIIQGNPKKNVISHVFSMPSVELR 360

BLAST of Lsi03G004240 vs. NCBI nr
Match: gi|659081326|ref|XP_008441274.1| (PREDICTED: uncharacterized protein LOC103485456 [Cucumis melo])

HSP 1 Score: 459.5 bits (1181), Expect = 5.2e-126
Identity = 234/362 (64.64%), Positives = 286/362 (79.01%), Query Frame = 1

Query: 23  MENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIE 82
           ME+K+LEMFEVGPCESAYQMGFLIGKRF+DTI+SRL  D+VLR +LLPFAQ+P+S+PLIE
Sbjct: 1   MEDKKLEMFEVGPCESAYQMGFLIGKRFTDTIKSRLHTDLVLRNELLPFAQSPQSQPLIE 60

Query: 83  ALCKNNKAK----------------------------REILPFIQDEA-SIVDCTDDCSD 142
           ALC NNK +                            +EILPF+Q    S VDC+DDCSD
Sbjct: 61  ALCNNNKTRFPIYWDELVGIAEGSGVPILEIILINFRKEILPFLQKGVPSSVDCSDDCSD 120

Query: 143 ILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSNGLA 202
           +L+VS+++A+AAHNEDA+  L G+TYL+KG LQNG  ++AYT+AGE+PS AF FN +GLA
Sbjct: 121 LLLVSENMAIAAHNEDANHSLVGHTYLVKGILQNGLSFLAYTYAGELPSCAFGFNDHGLA 180

Query: 203 FTMNTVRPVNDEIEPGAIGRNFISQDLLESTSFEDAIARIRSAEISLGHNYNVIDIQTRR 262
           FT+N+V P NDEI  GAIGRNFIS+DLLESTS E+AI RIRSAE+S+GH+YN+ID+QTRR
Sbjct: 181 FTLNSVPPTNDEIAAGAIGRNFISRDLLESTSLENAIFRIRSAEVSVGHSYNLIDVQTRR 240

Query: 263 IASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETKDDF 322
           I +VETAS++R SV+EV ATP FHANMY+HLQ I Q+ D NS SR +RAD + KE+K+DF
Sbjct: 241 IVNVETASRYRFSVNEVGATPLFHANMYTHLQ-INQVQDPNSISRQKRADDLPKESKNDF 300

Query: 323 LSVIGDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFS--LSELK 354
           LSV+GD DN++YPIYM GP LYT+C+ LIDLDE+TLSI QGNPK   ISHVFS  L ELK
Sbjct: 301 LSVLGDMDNKKYPIYMTGPMLYTLCTALIDLDEQTLSIIQGNPKKNVISHVFSMPLVELK 360

BLAST of Lsi03G004240 vs. NCBI nr
Match: gi|823213975|ref|XP_012439733.1| (PREDICTED: uncharacterized protein LOC105765275 [Gossypium raimondii])

HSP 1 Score: 411.8 bits (1057), Expect = 1.2e-111
Identity = 209/356 (58.71%), Positives = 265/356 (74.44%), Query Frame = 1

Query: 23  MENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIE 82
           ME K LEMFEVGPCE  YQ+GFLIG+RF + IRSRL  D++L+ QLLPFA+TP ++PL++
Sbjct: 1   MEGKLLEMFEVGPCEDDYQLGFLIGQRFCNQIRSRLAGDLILQNQLLPFARTPHAQPLLK 60

Query: 83  ALCKNNKAK----------------------------REILPFIQDEA--SIVDCTDDCS 142
           AL + N+ K                            +EILPFI      S  D TDDCS
Sbjct: 61  ALSETNQKKFPRYWAELLGTADGSGVPVLDIILVNFRKEILPFISKTTMNSNADTTDDCS 120

Query: 143 DILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSNGL 202
           D+L+V DS+A+AAHNEDA+  L G+TYLIKGKL NG  +IAYT+AGE+PS AF  NS GL
Sbjct: 121 DVLIVGDSMAVAAHNEDANVALVGHTYLIKGKLSNGLSFIAYTYAGELPSCAFGLNSQGL 180

Query: 203 AFTMNTVRPVNDEIEPGAIGRNFISQDLLESTSFEDAIARIRSAEISLGHNYNVIDIQTR 262
           AFT+N+V PV DEI P  IGRNF+S+DLLE+TS  DA+ARIRS+E+S+GH+YN+IDIQ R
Sbjct: 181 AFTLNSVPPVEDEIAPAGIGRNFVSRDLLEATSTADALARIRSSEVSVGHSYNLIDIQKR 240

Query: 263 RIASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETKDD 322
            I +VETASK R+SVHEV  TPFFHANMY HLQ ++Q+ DENS SR +RA V+ + +K D
Sbjct: 241 MILNVETASKSRVSVHEVGTTPFFHANMYLHLQ-VQQVHDENSISRQKRAAVLPQGSKTD 300

Query: 323 FLSVIGDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLS 349
           FLS++GDT++ +YPIYM GP LYT+C+ +IDLDE TL+I +GNPK  ++SHVFS+S
Sbjct: 301 FLSLLGDTEDTKYPIYMTGPTLYTLCTTVIDLDERTLTIIEGNPKYGKVSHVFSMS 355

BLAST of Lsi03G004240 vs. NCBI nr
Match: gi|702323709|ref|XP_010053184.1| (PREDICTED: uncharacterized protein LOC104441694 [Eucalyptus grandis])

HSP 1 Score: 408.7 bits (1049), Expect = 1.0e-110
Identity = 205/363 (56.47%), Positives = 272/363 (74.93%), Query Frame = 1

Query: 23  MENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIE 82
           ME   LEMFEVGPCE+ +QMGFLIG+RFS  I+SRL +D++LR QLLP+A+ PES+PL+E
Sbjct: 1   MEEHALEMFEVGPCETPHQMGFLIGRRFSRLIQSRLSRDLILRNQLLPWARAPESRPLLE 60

Query: 83  ALCKNNKAK----------------------------REILPFIQDEASIVDCTD---DC 142
           ALC++N+ K                            +EILPFI D+ +  D  +   +C
Sbjct: 61  ALCEHNQTKFPRYWDELVGTAEGADVPVLDIVLINFRKEILPFIPDKETKSDLPEKAIEC 120

Query: 143 SDILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSNG 202
           SD+LVV +S+A+AAHNEDA+  L G+TYLIKG L +G C+I+YT+AGE+PS AF FN+NG
Sbjct: 121 SDVLVVGESMAVAAHNEDANVALVGHTYLIKGTLSSGLCFISYTYAGELPSCAFGFNNNG 180

Query: 203 LAFTMNTVRPVNDEIEPGAIGRNFISQDLLESTSFEDAIARIRSAEISLGHNYNVIDIQT 262
           + FT+N V P  +EI    IGRNFIS+DLLE+TS  DA ++IRSAE S+GH+YN+ID++ 
Sbjct: 181 MGFTLNAVPPSKEEIVASGIGRNFISRDLLEATSMTDATSKIRSAEASVGHSYNLIDLKA 240

Query: 263 RRIASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETKD 322
           RRI ++ETAS+ R+SV+EV+ TPFFHANMY HLQ ++Q+ DENS SR +RA V+ K++K 
Sbjct: 241 RRICNLETASRTRVSVNEVDDTPFFHANMYLHLQ-VKQVEDENSKSRQKRAAVLPKKSKQ 300

Query: 323 DFLSVIGDTDNEEYPIYMKGPKLYTMCSVLIDLDEETLSIFQGNPKNKEISHVFSLS--E 353
           DFLS++GDTD+ +YPIYM GP LYT+C+ L+DLDE+TLSI +GNPK  E+SH FSLS  +
Sbjct: 301 DFLSLLGDTDDAKYPIYMSGPTLYTLCTALVDLDEQTLSIIKGNPKKGEVSHTFSLSSPD 360

BLAST of Lsi03G004240 vs. NCBI nr
Match: gi|700208022|gb|KGN63141.1| (hypothetical protein Csa_2G404920 [Cucumis sativus])

HSP 1 Score: 404.4 bits (1038), Expect = 2.0e-109
Identity = 205/318 (64.47%), Positives = 252/318 (79.25%), Query Frame = 1

Query: 23  MENKELEMFEVGPCESAYQMGFLIGKRFSDTIRSRLEKDMVLRQQLLPFAQTPESKPLIE 82
           ME+K+LE+FEVGPCESAYQMGFLIGKRFSDTI+SRL  D+VLR +LLPFAQ+P+S PLIE
Sbjct: 1   MEDKKLEIFEVGPCESAYQMGFLIGKRFSDTIKSRLHTDLVLRNELLPFAQSPQSHPLIE 60

Query: 83  ALCKNNKAK----------------------------REILPFIQDEA-SIVDCTDDCSD 142
           ALC NNK +                            +EILPF+Q E  S VDC+DDCSD
Sbjct: 61  ALCNNNKTRFPIYWDELVGTAEGSGVPILEIVLINFRKEILPFLQKEVPSSVDCSDDCSD 120

Query: 143 ILVVSDSLALAAHNEDASFGLSGYTYLIKGKLQNGTCYIAYTHAGEIPSRAFSFNSNGLA 202
           +L+VSD++A+AAHNEDA+  L G+TYL+KGKLQNG  ++AYT+AGE+PS AF FN +GLA
Sbjct: 121 LLLVSDNMAIAAHNEDANNALVGHTYLVKGKLQNGLSFLAYTYAGELPSCAFGFNDHGLA 180

Query: 203 FTMNTVRPVNDEIEPGAIGRNFISQDLLESTSFEDAIARIRSAEISLGHNYNVIDIQTRR 262
           FT+N+V P NDEI  GAIGRNFIS+DLLESTS E+AI RIRSAE+S+GH+YN+ID+QTRR
Sbjct: 181 FTLNSVPPTNDEIAAGAIGRNFISRDLLESTSLENAIFRIRSAEVSVGHSYNLIDVQTRR 240

Query: 263 IASVETASKFRLSVHEVEATPFFHANMYSHLQNIRQIIDENSTSRTRRADVMAKETKDDF 312
           I +VETAS++R SV EV ATPFFHANMY+HLQ I Q+ D NS SR +RA+ + KE+K+DF
Sbjct: 241 IVNVETASRYRFSVSEVGATPFFHANMYTHLQ-INQVQDPNSISRQKRANDLPKESKNDF 300

The following BLAST results are available for this feature:
Match Name                          E-value    Identity  Description
A0A0D2U9C1_GOSRA                    8.6e-112   58.71     Uncharacterized protein OS=Gossypium raimondii GN=B456_008G251600 PE=4 SV=1
A0A059CHR6_EUCGR                    7.3e-111   56.47     Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_D01785 PE=4 SV=1
A0A0A0LSY8_CUCSA                    1.4e-109   64.47     Uncharacterized protein OS=Cucumis sativus GN=Csa_2G404920 PE=4 SV=1
A0A067JP62_JATCU                    2.6e-108   55.65     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23947 PE=4 SV=1
B9GX51_POPTR                        7.6e-108   56.02     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s13500g PE=4 SV=2

Match Name                          E-value    Identity  Description
gi|449442062|ref|XP_004138801.1|    4.7e-127   64.92     PREDICTED: uncharacterized protein LOC101214789 [Cucumis sativus]
gi|659081326|ref|XP_008441274.1|    5.2e-126   64.64     PREDICTED: uncharacterized protein LOC103485456 [Cucumis melo]
gi|823213975|ref|XP_012439733.1|    1.2e-111   58.71     PREDICTED: uncharacterized protein LOC105765275 [Gossypium raimondii]
gi|702323709|ref|XP_010053184.1|    1.0e-110   56.47     PREDICTED: uncharacterized protein LOC104441694 [Eucalyptus grandis]
gi|700208022|gb|KGN63141.1|         2.0e-109   64.47     hypothetical protein Csa_2G404920 [Cucumis sativus]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR005079   Peptidase_C45
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0016740       transferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
Lsi03G004240.1    Lsi03G004240.1    mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term    IPR Description    Source    Source Term   Source Description   Alignment
IPR005079   Peptidase C45      PFAM      PF03417       AAT                  coord: 123..338, score: 1.9
None        No IPR available   PANTHER   PTHR34180     FAMILY NOT NAMED     coord: 23..352, score: 2.1

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None