Moc04g00260 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc04g00260
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: ATP-dependent zinc metalloprotease
Location: chr4: 178842 .. 181261 (+)
RNA-Seq Expression: Moc04g00260
Synteny: Moc04g00260
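
As a rough illustration of how the Location field can be read programmatically, the sketch below parses the location string and derives the genomic span. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function name `parse_location` is hypothetical.

```python
import re


def parse_location(text):
    """Split a location such as 'chr4: 178842 .. 181261 (+)' into its parts."""
    match = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


seqid, start, end, strand = parse_location("chr4: 178842 .. 181261 (+)")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 2420 bp on the forward strand of chr4.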
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTATCCCGAGTCCTCCCAAACTCCAGATTTCCTCTTCTTCTCTCTATTTCCAACCTTTTCGGCACCAAATTTCCTTCCATTTCCTGCAAAAAACCCCTCGTGGAATCACTAGACATTTCCATTTAGAACGCCTTCAGCGTCTCCTCCATCTGCCCAGAGCTCTTCGTGAATGGCAAGACTACGAAGAGGCAGTGAAGCGCAAGGACCTCGCAGAAGCTCTTAGGTTTCTCGAGTCCTTTGACAGAGATAGCGCAATCGAACCCGTTAATGATTCAGCCGCTGCTGATTCAGCTCCGTCCGCTCTTCGAAATCCACGGTTGTCTGGCTGGGAGCGGGACTGGGAGGTACTCGACACTTGTTTGAATGCGGATGATATGAAGCTTGTTGCCAATGCTTATGGGTTTCTCAGGGATAGAGGATTTTTGCCCAATTTTGGAAAATGCAGGAACATTGGTAAACCCCTTTCATTGTCATAATCCTGAATTTATTTCATTAGCGTTAAATGTTTATCGCATTTTTCTCGTGTTTCTGTTGACATCAAATTGGGAAAAGAAGTAGTTTATCTTGGGAGATTAGTTCCCATCTGTTAACTTCGAATTTCATGAAGAAAGTTTTGGGTGAAGAGGCATGGGAACTGCGAAATTATTCTCTCTTACGGCTTTATCCACCACTTTCATACTTATTGGAGCATATTTCATACTGATGAACTGACCATCCCTGCAGTTTTGGAGGGTCGAAGAGACGTCACACCATCTGTGTTGGAATCTTCGACTGGATTACAAGGTGCCTTGAAAGATCTCTGCTTTTATTTTTCAGTTGTAGACCTTGAAACTGCATTCTCATATTCTACCAATCACTTACTTTTATTTTTCTTGCCTCCTGTGTTATTCCAGTGACCAAGTTGTCTCCAAAGAAGTGGGGTCTTTCAGGCAGCTCTAGTTACGCTTTGATTGCCTTTCTTGGTGGAACATCATTTCTGCTCTCACGGGACATTGATATTAGGCCGAACCTTTTGGCACTGCTGGGGCTTGCATTTTTGGACTCTATCCTCCTTGGTGGTACGTGTCTAGCGCAAATCTCTAGCTATTGGCCACCATATAGGCGTCGAATCCTTGTACATGAAGCTGGACATCTACTGACTGGTATGCTCCACTAACTAATATAGAATCGTATAGAAATCTACGGCCATTTCTACCAATTTTTCATATGCTGTTTACTTGGAATAGTTGTTCATGCAATTATGCAAGCACAAACTATTGAATGTTGTTAAAAGTTCCGTATACCCTATTGCAGCTTATCTCATGGGTTGCCCAATTCGTGGAGTTATTTTGGATCCAATTGTTGCCATGCAAATGGGGATACAGGGACAGGTAAACAAACCTTTCATATGCATTTCAAAACCCAGAATATGGTCTTAAAATGACGGTAATTGAAATTTTTTTAATATGAGATTCATGTGGGTTATGCATCAAACTGGTGGCTTTGTAAATAAAGGAAAATAATACAATAAAATTAACTCAATAATACATTGGATAACGAGCAGATTATCTCATATTCTGGATGGCTAAATGTGAAATCTTTGTTTTTGTACTAATGAGTCCTTTTGTTTTGAATTTAATAGGCAGGTACCCAGTTTTGGGATGAAAAAATGGCAAGCAACCTTGCTGAAGGACGTTTGGATGGTACTTCTTTTGACAGGTGATCTTGTTAAATAGCCGCATTTGAAAAAAGATCCCAACTTTTTCCATCGCTTCCCCTATTATACTATTATCAATCCAAAAAAGAGCTTAAATAGCAATTTGCATCTCTTGTGTGCATTCATTATCAATGGCAGCATATTTTTCATCCATTTTTTTCCCTACACTACTTTATATTTGTACTTCATTCTGCTTCCGGAGCCTTATATAAGCCTTCAAAAAGTTGATCTATATAATAAGGAAAGATATGGGACAGTTTGTAGGGATCACATGAACCCCAACTTTTTTAATCTGAAACT
CTGTATATGGTAACTAACTGGGCAATACCCTCGGGTTGTGACTGTGGTCAGGTACTGCATGATCCTTTTTGCGGGCATTGCAGCTGAAGCTCTTGTTTACGGTGAAGCAGAGGGTGGAGAGAATGATGAAAATTTGTTTAGAAGTATCTGCATTCTTTTGCAACCGCCACTATCTGTTGCGCAGGTTTTTCTCAATCTAATGTCTTCCAATATGCAGATATATTGACCTCTTTTCTGACTTCTGATGGCATTTTCAATTTATAGATGTCAAATCAAGCAAGGTGGGCTGTTCTACAATCTTACAATCTGCTGAAGTGGCACAAACATGCACACCAAGTTGCCGTTAAAGCTTTGGAAAGTGGAAGCAGTCTCAGTGTTGTAATTAGGAAAATCGAGGATGCTTTGTCAACCAATGGATGA

mRNA sequence

ATGGCTATCCCGAGTCCTCCCAAACTCCAGATTTCCTCTTCTTCTCTCTATTTCCAACCTTTTCGGCACCAAATTTCCTTCCATTTCCTGCAAAAAACCCCTCGTGGAATCACTAGACATTTCCATTTAGAACGCCTTCAGCGTCTCCTCCATCTGCCCAGAGCTCTTCGTGAATGGCAAGACTACGAAGAGGCAGTGAAGCGCAAGGACCTCGCAGAAGCTCTTAGGTTTCTCGAGTCCTTTGACAGAGATAGCGCAATCGAACCCGTTAATGATTCAGCCGCTGCTGATTCAGCTCCGTCCGCTCTTCGAAATCCACGGTTGTCTGGCTGGGAGCGGGACTGGGAGGTACTCGACACTTGTTTGAATGCGGATGATATGAAGCTTGTTGCCAATGCTTATGGGTTTCTCAGGGATAGAGGATTTTTGCCCAATTTTGGAAAATGCAGGAACATTGTTTTGGAGGGTCGAAGAGACGTCACACCATCTGTGTTGGAATCTTCGACTGGATTACAAGTGACCAAGTTGTCTCCAAAGAAGTGGGGTCTTTCAGGCAGCTCTAGTTACGCTTTGATTGCCTTTCTTGGTGGAACATCATTTCTGCTCTCACGGGACATTGATATTAGGCCGAACCTTTTGGCACTGCTGGGGCTTGCATTTTTGGACTCTATCCTCCTTGGTGGTACGTGTCTAGCGCAAATCTCTAGCTATTGGCCACCATATAGGCGTCGAATCCTTGTACATGAAGCTGGACATCTACTGACTGCTTATCTCATGGGTTGCCCAATTCGTGGAGTTATTTTGGATCCAATTGTTGCCATGCAAATGGGGATACAGGGACAGGCAGGTACCCAGTTTTGGGATGAAAAAATGGCAAGCAACCTTGCTGAAGGACGTTTGGATGGTACTTCTTTTGACAGGTACTGCATGATCCTTTTTGCGGGCATTGCAGCTGAAGCTCTTGTTTACGGTGAAGCAGAGGGTGGAGAGAATGATGAAAATTTGTTTAGAAGTATCTGCATTCTTTTGCAACCGCCACTATCTGTTGCGCAGATGTCAAATCAAGCAAGGTGGGCTGTTCTACAATCTTACAATCTGCTGAAGTGGCACAAACATGCACACCAAGTTGCCGTTAAAGCTTTGGAAAGTGGAAGCAGTCTCAGTGTTGTAATTAGGAAAATCGAGGATGCTTTGTCAACCAATGGATGA

Coding sequence (CDS)

ATGGCTATCCCGAGTCCTCCCAAACTCCAGATTTCCTCTTCTTCTCTCTATTTCCAACCTTTTCGGCACCAAATTTCCTTCCATTTCCTGCAAAAAACCCCTCGTGGAATCACTAGACATTTCCATTTAGAACGCCTTCAGCGTCTCCTCCATCTGCCCAGAGCTCTTCGTGAATGGCAAGACTACGAAGAGGCAGTGAAGCGCAAGGACCTCGCAGAAGCTCTTAGGTTTCTCGAGTCCTTTGACAGAGATAGCGCAATCGAACCCGTTAATGATTCAGCCGCTGCTGATTCAGCTCCGTCCGCTCTTCGAAATCCACGGTTGTCTGGCTGGGAGCGGGACTGGGAGGTACTCGACACTTGTTTGAATGCGGATGATATGAAGCTTGTTGCCAATGCTTATGGGTTTCTCAGGGATAGAGGATTTTTGCCCAATTTTGGAAAATGCAGGAACATTGTTTTGGAGGGTCGAAGAGACGTCACACCATCTGTGTTGGAATCTTCGACTGGATTACAAGTGACCAAGTTGTCTCCAAAGAAGTGGGGTCTTTCAGGCAGCTCTAGTTACGCTTTGATTGCCTTTCTTGGTGGAACATCATTTCTGCTCTCACGGGACATTGATATTAGGCCGAACCTTTTGGCACTGCTGGGGCTTGCATTTTTGGACTCTATCCTCCTTGGTGGTACGTGTCTAGCGCAAATCTCTAGCTATTGGCCACCATATAGGCGTCGAATCCTTGTACATGAAGCTGGACATCTACTGACTGCTTATCTCATGGGTTGCCCAATTCGTGGAGTTATTTTGGATCCAATTGTTGCCATGCAAATGGGGATACAGGGACAGGCAGGTACCCAGTTTTGGGATGAAAAAATGGCAAGCAACCTTGCTGAAGGACGTTTGGATGGTACTTCTTTTGACAGGTACTGCATGATCCTTTTTGCGGGCATTGCAGCTGAAGCTCTTGTTTACGGTGAAGCAGAGGGTGGAGAGAATGATGAAAATTTGTTTAGAAGTATCTGCATTCTTTTGCAACCGCCACTATCTGTTGCGCAGATGTCAAATCAAGCAAGGTGGGCTGTTCTACAATCTTACAATCTGCTGAAGTGGCACAAACATGCACACCAAGTTGCCGTTAAAGCTTTGGAAAGTGGAAGCAGTCTCAGTGTTGTAATTAGGAAAATCGAGGATGCTTTGTCAACCAATGGATGA

Protein sequence

MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTNG
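
The protein sequence is the conceptual translation of the CDS. A minimal sketch of that step is given below, assuming the standard nuclear genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and only the first and last codons of the CDS are used so the example stays short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame coding sequence, dropping one trailing stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First ten codons and last five codons of the CDS listed above.
print(translate("ATGGCTATCCCGAGTCCTCCCAAACTCCAG"))
print(translate("TCAACCAATGGATGA"))
```

The two fragments translate to the first ten and last four residues of the protein sequence listed above.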
Homology
BLAST of Moc04g00260 vs. NCBI nr
Match: XP_022147989.1 (uncharacterized protein LOC111016783 [Momordica charantia] >XP_022147990.1 uncharacterized protein LOC111016783 [Momordica charantia])

HSP 1 Score: 805.4 bits (2079), Expect = 2.2e-229
Identity = 402/402 (100.00%), Positives = 402/402 (100.00%), Query Frame = 0

Query: 1   MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQ 60
           MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQ
Sbjct: 1   MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQ 60

Query: 61  DYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVLDT 120
           DYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVLDT
Sbjct: 61  DYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVLDT 120

Query: 121 CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKK 180
           CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKK
Sbjct: 121 CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKK 180

Query: 181 WGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP 240
           WGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP
Sbjct: 181 WGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP 240

Query: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 300
           YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
Sbjct: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 300

Query: 301 DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAV 360
           DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAV
Sbjct: 301 DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAV 360

Query: 361 LQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTNG 403
           LQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTNG
Sbjct: 361 LQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTNG 402

BLAST of Moc04g00260 vs. NCBI nr
Match: XP_008447096.1 (PREDICTED: uncharacterized protein LOC103489633 isoform X1 [Cucumis melo] >XP_008447097.1 PREDICTED: uncharacterized protein LOC103489633 isoform X1 [Cucumis melo] >KAA0051124.1 uncharacterized protein E6C27_scaffold511G00710 [Cucumis melo var. makuwa])

HSP 1 Score: 723.8 bits (1867), Expect = 8.3e-205
Identity = 365/403 (90.57%), Positives = 379/403 (94.04%), Query Frame = 0

Query: 1   MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLER--LQRLLHLPRALRE 60
           MAI SPPKL ISSS L  Q F + I FHF QK P GI +HFHL+R   QRLL L RALRE
Sbjct: 1   MAILSPPKLLISSSLLQSQLFHYPIPFHFQQKNPNGINKHFHLQRHHYQRLLPLSRALRE 60

Query: 61  WQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVL 120
           WQDYEEAVKRKDLAEALRFLESFDRDSAIEP+NDSA A SAPSA+ N RLSGWERDWEVL
Sbjct: 61  WQDYEEAVKRKDLAEALRFLESFDRDSAIEPINDSAPAGSAPSAIGNLRLSGWERDWEVL 120

Query: 121 DTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSP 180
           DTCLNADDMKLVANAY FL+DRGFLPNFGKCRNIVLEG+RDVTPSVLES+TGL+V+KLSP
Sbjct: 121 DTCLNADDMKLVANAYRFLKDRGFLPNFGKCRNIVLEGQRDVTPSVLESTTGLEVSKLSP 180

Query: 181 KKWGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 240
           KKWGLSGSS YALIAFLGGTSFLLS+DIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW
Sbjct: 181 KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 240

Query: 241 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 300
           PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
Sbjct: 241 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 300

Query: 301 RLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARW 360
           RLDGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARW
Sbjct: 301 RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARW 360

Query: 361 AVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTN 402
           AVLQSYNLLKWHKHAHQVAVKA+ESGSSLSVVIR+IEDALSTN
Sbjct: 361 AVLQSYNLLKWHKHAHQVAVKAMESGSSLSVVIRRIEDALSTN 403
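
The percentages in each HSP summary line are the matched-position counts divided by the alignment length (gaps included). The sketch below recomputes them from the fractions for the Cucumis melo hit; the function name `hsp_percentages` is hypothetical and the regular expression is only an assumption about how such a line might be parsed.

```python
import re


def hsp_percentages(line):
    """Recompute percentages from 'Label = hits/length' fractions in an HSP line."""
    result = {}
    for label, hits, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        result[label] = round(100 * int(hits) / int(length), 2)
    return result


stats = hsp_percentages(
    "Identity = 365/403 (90.57%), Positives = 379/403 (94.04%), Query Frame = 0"
)
print(stats)
```

The recomputed values agree with the percentages printed in the summary line.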

BLAST of Moc04g00260 vs. NCBI nr
Match: XP_038888049.1 (uncharacterized protein LOC120077976 isoform X1 [Benincasa hispida])

HSP 1 Score: 723.8 bits (1867), Expect = 8.3e-205
Identity = 363/401 (90.52%), Positives = 378/401 (94.26%), Query Frame = 0

Query: 1   MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQ 60
           MA+ SPPKL ISSS L FQ   + I F+F QK P GI +HF+LER QRLL L RAL EWQ
Sbjct: 1   MAVLSPPKLLISSSLLQFQQLHYPIPFNFQQKNPNGINKHFYLERHQRLLPLSRALSEWQ 60

Query: 61  DYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVLDT 120
           DYEEAVKRKDLAEALRFLESFDRDSAIEP+NDSA A SAPSAL NPRLSGWERDWEVLDT
Sbjct: 61  DYEEAVKRKDLAEALRFLESFDRDSAIEPINDSAPAGSAPSALANPRLSGWERDWEVLDT 120

Query: 121 CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKK 180
           CLNADDMKLVA+AYGFLRDRGFLPNFGK RNIVLEGRRDVTPSVLES+TGL+V+KLSPKK
Sbjct: 121 CLNADDMKLVADAYGFLRDRGFLPNFGKFRNIVLEGRRDVTPSVLESTTGLEVSKLSPKK 180

Query: 181 WGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP 240
           WG+SGSS YALIAFLGGTSFLLS+DIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP
Sbjct: 181 WGVSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP 240

Query: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 300
           YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMAS+LAEGRL
Sbjct: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRL 300

Query: 301 DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAV 360
           DGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAV
Sbjct: 301 DGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAV 360

Query: 361 LQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTN 402
           LQSYNLLKWHKHAHQ AVKALESGSSLSVVIR+IEDALSTN
Sbjct: 361 LQSYNLLKWHKHAHQTAVKALESGSSLSVVIRRIEDALSTN 401

BLAST of Moc04g00260 vs. NCBI nr
Match: XP_004139896.1 (uncharacterized protein LOC101213430 [Cucumis sativus] >XP_011659042.1 uncharacterized protein LOC101213430 [Cucumis sativus])

HSP 1 Score: 719.5 bits (1856), Expect = 1.6e-203
Identity = 364/403 (90.32%), Positives = 377/403 (93.55%), Query Frame = 0

Query: 1   MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLER--LQRLLHLPRALRE 60
           MAI SPPKL ISSS    Q F + I FHF QK P GI ++FHLER   QRLL L RALRE
Sbjct: 1   MAILSPPKLLISSSLPQSQLFHYPIPFHFQQKNPNGINKYFHLERHHHQRLLPLSRALRE 60

Query: 61  WQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVL 120
           WQDYEEAVKRKDLAEALRFLESFDRDSAIEP+ DSA A SAPSA+RN RLSGWERDWEVL
Sbjct: 61  WQDYEEAVKRKDLAEALRFLESFDRDSAIEPIKDSAPAGSAPSAIRNLRLSGWERDWEVL 120

Query: 121 DTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSP 180
           DTCLNADDMKLVANAY FL+DRGFLPNFGKCRNIVLEGRRDVTPSVLE +TGL+V+KLSP
Sbjct: 121 DTCLNADDMKLVANAYRFLKDRGFLPNFGKCRNIVLEGRRDVTPSVLELTTGLEVSKLSP 180

Query: 181 KKWGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 240
           KKWGLSGSS YALIAFLGGTSFLLS+DIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW
Sbjct: 181 KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 240

Query: 241 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 300
           PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
Sbjct: 241 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 300

Query: 301 RLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARW 360
           RLDGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARW
Sbjct: 301 RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARW 360

Query: 361 AVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTN 402
           AVLQSYNLLKWHKHAHQVAVKA+ESGSSLSVVIRKIEDALSTN
Sbjct: 361 AVLQSYNLLKWHKHAHQVAVKAMESGSSLSVVIRKIEDALSTN 403

BLAST of Moc04g00260 vs. NCBI nr
Match: XP_022969425.1 (uncharacterized protein LOC111468437 isoform X1 [Cucurbita maxima] >XP_022969426.1 uncharacterized protein LOC111468437 isoform X1 [Cucurbita maxima])

HSP 1 Score: 711.4 bits (1835), Expect = 4.2e-201
Identity = 359/401 (89.53%), Positives = 374/401 (93.27%), Query Frame = 0

Query: 1   MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQ 60
           M+I SPPKL IS S L FQ F   + FHF QK   GI  HFHL+R QRLL LPRA+REWQ
Sbjct: 1   MSIHSPPKLLISPSLLQFQSFHCPLPFHFQQK--NGINEHFHLQRHQRLLLLPRAIREWQ 60

Query: 61  DYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVLDT 120
           +YEEAVKRKDLAEALRFLESF R+SAIEP NDSA ADSAPSAL NPRLSGWERDWEVLDT
Sbjct: 61  EYEEAVKRKDLAEALRFLESFGRESAIEPPNDSALADSAPSALGNPRLSGWERDWEVLDT 120

Query: 121 CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKK 180
           CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEG RDVTPSVLES+TGL+V+KLSPKK
Sbjct: 121 CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGPRDVTPSVLESTTGLEVSKLSPKK 180

Query: 181 WGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP 240
           WGLSGSS YALIA LGGTSFLLS+DIDIRPNL ALLGLAFLDSILLGGTCLAQISS WPP
Sbjct: 181 WGLSGSSRYALIACLGGTSFLLSQDIDIRPNLFALLGLAFLDSILLGGTCLAQISSCWPP 240

Query: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 300
           YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
Sbjct: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 300

Query: 301 DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAV 360
           DGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARWAV
Sbjct: 301 DGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARWAV 360

Query: 361 LQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTN 402
           LQSYNLLKWHKHAHQVAVKALESGSSLSVVIR++E+ALSTN
Sbjct: 361 LQSYNLLKWHKHAHQVAVKALESGSSLSVVIRRMENALSTN 399

BLAST of Moc04g00260 vs. ExPASy TrEMBL
Match: A0A6J1D1P2 (uncharacterized protein LOC111016783 OS=Momordica charantia OX=3673 GN=LOC111016783 PE=4 SV=1)

HSP 1 Score: 805.4 bits (2079), Expect = 1.0e-229
Identity = 402/402 (100.00%), Positives = 402/402 (100.00%), Query Frame = 0

Query: 1   MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQ 60
           MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQ
Sbjct: 1   MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQ 60

Query: 61  DYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVLDT 120
           DYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVLDT
Sbjct: 61  DYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVLDT 120

Query: 121 CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKK 180
           CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKK
Sbjct: 121 CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKK 180

Query: 181 WGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP 240
           WGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP
Sbjct: 181 WGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP 240

Query: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 300
           YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
Sbjct: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 300

Query: 301 DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAV 360
           DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAV
Sbjct: 301 DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAV 360

Query: 361 LQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTNG 403
           LQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTNG
Sbjct: 361 LQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTNG 402

BLAST of Moc04g00260 vs. ExPASy TrEMBL
Match: A0A1S3BH83 (uncharacterized protein LOC103489633 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103489633 PE=4 SV=1)

HSP 1 Score: 723.8 bits (1867), Expect = 4.0e-205
Identity = 365/403 (90.57%), Positives = 379/403 (94.04%), Query Frame = 0

Query: 1   MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLER--LQRLLHLPRALRE 60
           MAI SPPKL ISSS L  Q F + I FHF QK P GI +HFHL+R   QRLL L RALRE
Sbjct: 1   MAILSPPKLLISSSLLQSQLFHYPIPFHFQQKNPNGINKHFHLQRHHYQRLLPLSRALRE 60

Query: 61  WQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVL 120
           WQDYEEAVKRKDLAEALRFLESFDRDSAIEP+NDSA A SAPSA+ N RLSGWERDWEVL
Sbjct: 61  WQDYEEAVKRKDLAEALRFLESFDRDSAIEPINDSAPAGSAPSAIGNLRLSGWERDWEVL 120

Query: 121 DTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSP 180
           DTCLNADDMKLVANAY FL+DRGFLPNFGKCRNIVLEG+RDVTPSVLES+TGL+V+KLSP
Sbjct: 121 DTCLNADDMKLVANAYRFLKDRGFLPNFGKCRNIVLEGQRDVTPSVLESTTGLEVSKLSP 180

Query: 181 KKWGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 240
           KKWGLSGSS YALIAFLGGTSFLLS+DIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW
Sbjct: 181 KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 240

Query: 241 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 300
           PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
Sbjct: 241 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 300

Query: 301 RLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARW 360
           RLDGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARW
Sbjct: 301 RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARW 360

Query: 361 AVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTN 402
           AVLQSYNLLKWHKHAHQVAVKA+ESGSSLSVVIR+IEDALSTN
Sbjct: 361 AVLQSYNLLKWHKHAHQVAVKAMESGSSLSVVIRRIEDALSTN 403

BLAST of Moc04g00260 vs. ExPASy TrEMBL
Match: A0A5A7U732 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold511G00710 PE=4 SV=1)

HSP 1 Score: 723.8 bits (1867), Expect = 4.0e-205
Identity = 365/403 (90.57%), Positives = 379/403 (94.04%), Query Frame = 0

Query: 1   MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLER--LQRLLHLPRALRE 60
           MAI SPPKL ISSS L  Q F + I FHF QK P GI +HFHL+R   QRLL L RALRE
Sbjct: 1   MAILSPPKLLISSSLLQSQLFHYPIPFHFQQKNPNGINKHFHLQRHHYQRLLPLSRALRE 60

Query: 61  WQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVL 120
           WQDYEEAVKRKDLAEALRFLESFDRDSAIEP+NDSA A SAPSA+ N RLSGWERDWEVL
Sbjct: 61  WQDYEEAVKRKDLAEALRFLESFDRDSAIEPINDSAPAGSAPSAIGNLRLSGWERDWEVL 120

Query: 121 DTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSP 180
           DTCLNADDMKLVANAY FL+DRGFLPNFGKCRNIVLEG+RDVTPSVLES+TGL+V+KLSP
Sbjct: 121 DTCLNADDMKLVANAYRFLKDRGFLPNFGKCRNIVLEGQRDVTPSVLESTTGLEVSKLSP 180

Query: 181 KKWGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 240
           KKWGLSGSS YALIAFLGGTSFLLS+DIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW
Sbjct: 181 KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 240

Query: 241 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 300
           PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
Sbjct: 241 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 300

Query: 301 RLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARW 360
           RLDGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARW
Sbjct: 301 RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARW 360

Query: 361 AVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTN 402
           AVLQSYNLLKWHKHAHQVAVKA+ESGSSLSVVIR+IEDALSTN
Sbjct: 361 AVLQSYNLLKWHKHAHQVAVKAMESGSSLSVVIRRIEDALSTN 403

BLAST of Moc04g00260 vs. ExPASy TrEMBL
Match: A0A0A0K7I5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G239000 PE=4 SV=1)

HSP 1 Score: 719.5 bits (1856), Expect = 7.6e-204
Identity = 364/403 (90.32%), Positives = 377/403 (93.55%), Query Frame = 0

Query: 1   MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLER--LQRLLHLPRALRE 60
           MAI SPPKL ISSS    Q F + I FHF QK P GI ++FHLER   QRLL L RALRE
Sbjct: 1   MAILSPPKLLISSSLPQSQLFHYPIPFHFQQKNPNGINKYFHLERHHHQRLLPLSRALRE 60

Query: 61  WQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVL 120
           WQDYEEAVKRKDLAEALRFLESFDRDSAIEP+ DSA A SAPSA+RN RLSGWERDWEVL
Sbjct: 61  WQDYEEAVKRKDLAEALRFLESFDRDSAIEPIKDSAPAGSAPSAIRNLRLSGWERDWEVL 120

Query: 121 DTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSP 180
           DTCLNADDMKLVANAY FL+DRGFLPNFGKCRNIVLEGRRDVTPSVLE +TGL+V+KLSP
Sbjct: 121 DTCLNADDMKLVANAYRFLKDRGFLPNFGKCRNIVLEGRRDVTPSVLELTTGLEVSKLSP 180

Query: 181 KKWGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 240
           KKWGLSGSS YALIAFLGGTSFLLS+DIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW
Sbjct: 181 KKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYW 240

Query: 241 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 300
           PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
Sbjct: 241 PPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG 300

Query: 301 RLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARW 360
           RLDGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARW
Sbjct: 301 RLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARW 360

Query: 361 AVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTN 402
           AVLQSYNLLKWHKHAHQVAVKA+ESGSSLSVVIRKIEDALSTN
Sbjct: 361 AVLQSYNLLKWHKHAHQVAVKAMESGSSLSVVIRKIEDALSTN 403

BLAST of Moc04g00260 vs. ExPASy TrEMBL
Match: A0A6J1HZW5 (uncharacterized protein LOC111468437 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111468437 PE=4 SV=1)

HSP 1 Score: 711.4 bits (1835), Expect = 2.1e-201
Identity = 359/401 (89.53%), Positives = 374/401 (93.27%), Query Frame = 0

Query: 1   MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQ 60
           M+I SPPKL IS S L FQ F   + FHF QK   GI  HFHL+R QRLL LPRA+REWQ
Sbjct: 1   MSIHSPPKLLISPSLLQFQSFHCPLPFHFQQK--NGINEHFHLQRHQRLLLLPRAIREWQ 60

Query: 61  DYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSGWERDWEVLDT 120
           +YEEAVKRKDLAEALRFLESF R+SAIEP NDSA ADSAPSAL NPRLSGWERDWEVLDT
Sbjct: 61  EYEEAVKRKDLAEALRFLESFGRESAIEPPNDSALADSAPSALGNPRLSGWERDWEVLDT 120

Query: 121 CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKK 180
           CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEG RDVTPSVLES+TGL+V+KLSPKK
Sbjct: 121 CLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGPRDVTPSVLESTTGLEVSKLSPKK 180

Query: 181 WGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPP 240
           WGLSGSS YALIA LGGTSFLLS+DIDIRPNL ALLGLAFLDSILLGGTCLAQISS WPP
Sbjct: 181 WGLSGSSRYALIACLGGTSFLLSQDIDIRPNLFALLGLAFLDSILLGGTCLAQISSCWPP 240

Query: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 300
           YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
Sbjct: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL 300

Query: 301 DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAV 360
           DGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARWAV
Sbjct: 301 DGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMSNQARWAV 360

Query: 361 LQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTN 402
           LQSYNLLKWHKHAHQVAVKALESGSSLSVVIR++E+ALSTN
Sbjct: 361 LQSYNLLKWHKHAHQVAVKALESGSSLSVVIRRMENALSTN 399

BLAST of Moc04g00260 vs. TAIR 10
Match: AT1G56180.1 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G27290.1); Has 436 Blast hits to 436 proteins in 83 species: Archae - 0; Bacteria - 153; Metazoa - 0; Fungi - 0; Plants - 160; Viruses - 0; Other Eukaryotes - 123 (source: NCBI BLink). )

HSP 1 Score: 513.1 bits (1320), Expect = 2.1e-145
Identity = 265/404 (65.59%), Positives = 315/404 (77.97%), Query Frame = 0

Query: 5   SPPKLQISSSSLYFQPFRHQISFHF--LQKTPRGITRHFHLERLQRLLHLPRALREWQDY 64
           SPP L+  S S     F  QI F    +Q    G  R   L R       P ALREW++Y
Sbjct: 7   SPPCLRSLSPS-----FSRQIGFLVPRVQSLVFGSVRKHELRR-------PSALREWREY 66

Query: 65  EEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSG-----WERDWEV 124
           E+AVKRKDLA ALRFL+S + D   + V     A          +LSG      ERDW+V
Sbjct: 67  EDAVKRKDLAGALRFLKSIENDEQRDSVESIVTA----------KLSGLGALELERDWQV 126

Query: 125 LDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLS 184
           LD CLNADDM+LV +A+ FL++RG L NFGK  +IVLEG R+VTP+VL+S+TGL+VTKLS
Sbjct: 127 LDACLNADDMRLVGSAFRFLKERGLLANFGKFTSIVLEGTREVTPTVLKSATGLEVTKLS 186

Query: 185 PKKWGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSY 244
           PKKWGLSG SS AL A LGG S+LLS++ID+RPNL  +LGLA+LDS+ LGGTCLAQ+S Y
Sbjct: 187 PKKWGLSGGSSIALAALLGGVSYLLSQEIDVRPNLAVILGLAYLDSVFLGGTCLAQVSCY 246

Query: 245 WPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAE 304
           WPP++RRI+VHEAGHLL AYLMGCPIRGVILDP+VAMQMG+QGQAGTQFWD+KM S +AE
Sbjct: 247 WPPHKRRIVVHEAGHLLVAYLMGCPIRGVILDPVVAMQMGVQGQAGTQFWDQKMESEIAE 306

Query: 305 GRLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQAR 364
           GRL G+SFDRY M+LFAGIAAEALVYGEAEGGENDENLFRSI +LL+PPLSVAQMSNQAR
Sbjct: 307 GRLSGSSFDRYSMVLFAGIAAEALVYGEAEGGENDENLFRSISVLLEPPLSVAQMSNQAR 366

Query: 365 WAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTN 402
           W+VLQSYNLLKWHK AH+ AV+AL+ GS LS+VIR+IE+A+S++
Sbjct: 367 WSVLQSYNLLKWHKAAHRAAVEALQVGSPLSIVIRRIEEAMSSS 388

BLAST of Moc04g00260 vs. TAIR 10
Match: AT2G21960.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G56180.1); Has 224 Blast hits to 222 proteins in 59 species: Archae - 0; Bacteria - 65; Metazoa - 0; Fungi - 0; Plants - 134; Viruses - 0; Other Eukaryotes - 25 (source: NCBI BLink). )

HSP 1 Score: 102.1 bits (253), Expect = 1.1e-21
Identity = 59/167 (35.33%), Positives = 84/167 (50.30%), Query Frame = 0

Query: 231 LAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEK 290
           ++  S+++P Y+ RI  HEA H L AYL+G PI G  LD          G+      DE+
Sbjct: 174 ISGFSTFFPDYQERIAAHEAAHFLVAYLIGLPILGYSLD---------IGKEHVNLIDER 233

Query: 291 MASNLAEGRLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVA 350
           +A  +  G+LD    DR   +  AG+AAE L Y +  G   D    +      QP +S  
Sbjct: 234 LAKLIYSGKLDSKELDRLAAVAMAGLAAEGLKYDKVIGQSADLFSLQRFINRSQPKISNE 293

Query: 351 QMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDA 398
           Q  N  RWAVL S +LLK +K  H+  + A+   +S+   I+ IE A
Sbjct: 294 QQQNLTRWAVLYSASLLKNNKTIHEALMAAMSKNASVLECIQTIETA 331

BLAST of Moc04g00260 vs. TAIR 10
Match: AT5G27290.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G54680.3); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 95.5 bits (236), Expect = 1.0e-19
Identity = 55/160 (34.38%), Positives = 86/160 (53.75%), Query Frame = 0

Query: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQM--GIQGQAGTQFWDEKMASNLAEG 300
           Y  R++ HEAGH L AYL+G   RG  L  + A+Q    +  QAG+ F D +    +  G
Sbjct: 179 YHNRVVQHEAGHFLVAYLVGILPRGYTLSSLEALQKEGSLNIQAGSAFVDYEFLEEVNSG 238

Query: 301 RLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARW 360
           ++  T  +R+  I  AG+A E L+YG AEGG +D +    +   L    +  +  +Q RW
Sbjct: 239 KVSATMLNRFSCIALAGVATEYLLYGYAEGGLDDISKLDGLVKSL--GFTQKKADSQVRW 298

Query: 361 AVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDAL 399
           +VL +  LL+ H+ A     +A+  G S+   I+ IED++
Sbjct: 299 SVLNTILLLRRHEIARSKLAQAMSKGESVGSCIQIIEDSI 336

BLAST of Moc04g00260 vs. TAIR 10
Match: AT5G27290.2 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G54680.3); Has 199 Blast hits to 194 proteins in 57 species: Archae - 0; Bacteria - 61; Metazoa - 0; Fungi - 0; Plants - 129; Viruses - 0; Other Eukaryotes - 9 (source: NCBI BLink). )

HSP 1 Score: 60.5 bits (145), Expect = 3.7e-09
Identity = 31/84 (36.90%), Positives = 46/84 (54.76%), Query Frame = 0

Query: 241 YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQM--GIQGQAGTQFWDEKMASNLAEG 300
           Y  R++ HEAGH L AYL+G   RG  L  + A+Q    +  QAG+ F D +    +  G
Sbjct: 179 YHNRVVQHEAGHFLVAYLVGILPRGYTLSSLEALQKEGSLNIQAGSAFVDYEFLEEVNSG 238

Query: 301 RLDGTSFDRYCMILFAGIAAEALV 323
           ++  T  +R+  I  AG+A E L+
Sbjct: 239 KVSATMLNRFSCIALAGVATEYLL 262

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_022147989.1  2.2e-229  100.00    uncharacterized protein LOC111016783 [Momordica charantia] >XP_022147990.1 uncha...
XP_008447096.1  8.3e-205  90.57     PREDICTED: uncharacterized protein LOC103489633 isoform X1 [Cucumis melo] >XP_00...
XP_038888049.1  8.3e-205  90.52     uncharacterized protein LOC120077976 isoform X1 [Benincasa hispida]
XP_004139896.1  1.6e-203  90.32     uncharacterized protein LOC101213430 [Cucumis sativus] >XP_011659042.1 uncharact...
XP_022969425.1  4.2e-201  89.53     uncharacterized protein LOC111468437 isoform X1 [Cucurbita maxima] >XP_022969426...

Match Name      E-value   Identity  Description
A0A6J1D1P2      1.0e-229  100.00    uncharacterized protein LOC111016783 OS=Momordica charantia OX=3673 GN=LOC111016...
A0A1S3BH83      4.0e-205  90.57     uncharacterized protein LOC103489633 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5A7U732      4.0e-205  90.57     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A0A0K7I5      7.6e-204  90.32     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G239000 PE=4 SV=1
A0A6J1HZW5      2.1e-201  89.53     uncharacterized protein LOC111468437 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...

Match Name      E-value   Identity  Description
AT1G56180.1     2.1e-145  65.59     unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplas...
AT2G21960.1     1.1e-21   35.33     unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXP...
AT5G27290.1     1.0e-19   34.38     unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXP...
AT5G27290.2     3.7e-09   36.90     unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXP...
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term   IPR Description     Source       Source Term    Source Description                  Alignment
IPR037219  Peptidase M41-like  GENE3D       1.20.58.760    Peptidase M41                       coord: 236..388; e-value: 8.7E-10; score: 40.6
IPR037219  Peptidase M41-like  SUPERFAMILY  140990         FtsH protease domain-like           coord: 238..387
None       No IPR available    PANTHER      PTHR33471:SF7  ATP-DEPENDENT ZINC METALLOPROTEASE  coord: 44..400
None       No IPR available    PANTHER      PTHR33471      FAMILY NOT NAMED                    coord: 44..400

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Moc04g00260.1  Moc04g00260.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
cellular_component  GO:0016020      membrane
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0004176      ATP-dependent peptidase activity
molecular_function  GO:0004222      metalloendopeptidase activity