Moc02g02310 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc02g02310
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Golgin family A protein
Location: chr2: 1765465 .. 1768201 (-)
RNA-Seq Expression: Moc02g02310
Synteny: Moc02g02310
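
The location value above packs a chromosome name, start and end coordinates, and a strand into one string. The following is a minimal Python sketch of how it could be parsed. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), and the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split a location string such as 'chr2: 1765465 .. 1768201 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
    return {"chrom": chrom, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("chr2: 1765465 .. 1768201 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 2737 bp on the minus strand of chr2.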
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCGTGGGCAGAGCAGCAAACAGAGAAGAGATGTAAAATCAGAAAGCGAAAGGGCCCGTGGTCGTCGTCTTCTTCTTATTCGAGTTTGGTCCGCAAGTACAGATTCAAGAAATCCCCCACGTGGAAAATGAGTACAAAATCCCATTCTTCCGCCGGAAAAGGTGTGAAATTGAAGAACAACAACAATTCAGATGCTGTTGAACTTGAAGATAAGAAGGAGCTGATCAAAACCGGGGAAATGGTTTCTCAGATCTCGCATTCGTGTTTATCAGATCCGGATCCGAATTTCAACAACACCAAATCTGAGGTCAGCTTTTTTTTTTAAACCAATTTTTGCATTTATCGGGAATTGTGTTTGAGTTAATTTTAATTAAATTCGTTCGTGTAGAAAGTCGAGGGTGGCAGTGGGAGAGTACAGAGAAGAAGAAGATCGGCATGTTCTTTGAGGATTGGGATTGGTGAGATTGAAGTGGGTGGTTCCAATTTTCGGGGCAACGATTGTTTAATGGAGGTACCAAATAGAAATACCAAATACCAAAAATCTATTATTTCTGCCTTTCCACAAAATCATAATTTTCCGTTTTTATTTTTTTTCTCGCTGTTTTCTTTCAGTCCCATCAATGGTTTTCTTTGCTGTCATTTCCTACTTGTACTTCTCCACCAACCCTTTTTTCTTCTTTCCTTTTTCATGATACCCGTTTCTTTTCTTTTTTAATTTTTGCTTTCAGGAATTTCATGGTTATAGATTTAGAAACAATCTAAATCAATAATTATACATTTTTCCTCTTAATCAGGAGTAAGTAAGTGAAGGTTTTTTTGTTCTTGGTAAATATATGGTTACCATATATAAACCATAAATAATTACTTTTTTTTTTGTAATTACCTATTTTGTCTGGAGCTTAGGCTTTAATTCTAAATTTGGTTATCATTATTCAAAACTTCTAATTTAGCTTCTATAATTTTGCTAAATCACGTGTCATCTATAAATTGTTCTTTTCCCATCCAAGACTTACCTATTTTCTCCTATATTTTCAACTCTAAACTATAATACAGTGGATTAAATAAGAAATACATGCATTTTAACTTCTTTCCTTTTTTAAAGTAAACTAGAGGCATATTTTCAAAAGTTTTAAATTTTATCAGCTATATCATAAAAGCCAAAGCTAAATTCTCTGAACTATAACTATAAGAACTAAATTGTAATTAAACACAAAATATAAAGATTAAATTAAAACTTAGTTTGCAATTTAATGTTTTTTTTTCCTTTTAGAAAAAAGTTGCGAGGAACAAAAATTTGGGGTTTGAAAACTGAATGTAATAAATGTGGACAGAGGCACTGAATTTGACCACAAAATAAAGCTGTAGCAGTATTAAAACTCCATTATTGGCATGCGCACCATTTCATTATAATAGAAAGAAAAATTCACAATTATAATACACAGCTTTTTCAGTACATTAAAATAAGAATTTAACACTTTCTAGAATTGCAGATCGAAAATCGTAGCGAGGTAAAAACGACACGTCGGAAGAAGAAATTTACGGTGAAAACCCGTTTGAAGGAAGTGAGCAATTGCCTGACGACATCGAAGGAGCTCGTGAGAGTTCTAACCCACGTTTGGGGATCCCACGATGAGAAGCAGCCATCATCGGCGTCATCTCTGATGGCGGCTCTGAAATCGGAGCTGGACCGGGCCAAGACCCGAGTGGAGCATTTGATGAGAGACGAGCAGAGGTTGTTCCACGGCGACGAAATTGAGGCTCTGAGGAAGCGGTTTGCGGAGGAGAAAGCTGCGTGGAAGTACAAGGAGAGAGCGAGAGTTGGGAGTGCGATTAGTTCAATGGCGGAGGAAGTGGCGGTGGAGAGGAAGCTGAGGAGGCAAGCAGAGAGGCTGAACAAGAGGATCGGGAAAGAGCTTGGGGAGGCCAGAGTTGCGGTGGCGAAGGCGATGAAGGATGTGGACAGGGAAAAGCGAGCGAAGGAGATATTGGAGGAGATTT
GCGAGGAGTTGGCCAAAGGGATTGGAGAGGACAGAGCGGAATTCGAGGAGCTGAGGAAGGAGTCGGAAAAAGTTAGAGAGGAAGTGGAGAAAGAGAGGGAGATGCTTCAATTGGCCGACGTTTTACGCGAGGAGAGAGTTCAGATGAAGCTGTCGGAGGCGAAGTACCAATTTGAAGAGAAAAACGCTGCGGTGGAGCGCCTCAAGGACCAGCTCGAGGCCTATTTCGTTGAAAACAGAGATCATTCTCAAGATTCGTTCAATAACAAACTCGACAAAATCAAGGAGCTGGAGGCGTATTTGAAGAAGATTAATTTCGGGTCGTACAACAAGAACAAGGAAGAAGAGGATTGTAATTGGGATGAGGAGAGCGATCTGCACTCGATTGAGCTCAACATGGACAACAACAACAAGAGCTACAGGTGGAGTTTCGTGCATGGCTCACACAACGCCTCCAAAAGGAACTCATTCGAGAAGGAAAGGAAGTCGCTTTCGGAGAAAATCCAGTGGGGGAGCATTTGCTTCAACAGTAGTAGTAAAAATGGAGAATTTGAGGGGGACGGGGAGCGGGACGGTGAAATTCAAATTACCCACAAGCAGAAATCTGGTGGTGTGAGGTGCCTTCGGGACATTCTATTTCCAGTTTCAGGAGTCGAAGAAAATAAAGTGGAGAAGACAGAGGATGCCATGCCTCTCCAAATTGATGAACCTTGTTCGGTGGTGGTGATGAAGGGATGA

mRNA sequence

ATGTCGTGGGCAGAGCAGCAAACAGAGAAGAGATGTAAAATCAGAAAGCGAAAGGGCCCGTGGTCGTCGTCTTCTTCTTATTCGAGTTTGGTCCGCAAGTACAGATTCAAGAAATCCCCCACGTGGAAAATGAGTACAAAATCCCATTCTTCCGCCGGAAAAGGTGTGAAATTGAAGAACAACAACAATTCAGATGCTGTTGAACTTGAAGATAAGAAGGAGCTGATCAAAACCGGGGAAATGGTTTCTCAGATCTCGCATTCGTGTTTATCAGATCCGGATCCGAATTTCAACAACACCAAATCTGAGAAAGTCGAGGGTGGCAGTGGGAGAGTACAGAGAAGAAGAAGATCGGCATGTTCTTTGAGGATTGGGATTGGTGAGATTGAAGTGGGTGGTTCCAATTTTCGGGGCAACGATTGTTTAATGGAGATCGAAAATCGTAGCGAGGTAAAAACGACACGTCGGAAGAAGAAATTTACGGTGAAAACCCGTTTGAAGGAAGTGAGCAATTGCCTGACGACATCGAAGGAGCTCGTGAGAGTTCTAACCCACGTTTGGGGATCCCACGATGAGAAGCAGCCATCATCGGCGTCATCTCTGATGGCGGCTCTGAAATCGGAGCTGGACCGGGCCAAGACCCGAGTGGAGCATTTGATGAGAGACGAGCAGAGGTTGTTCCACGGCGACGAAATTGAGGCTCTGAGGAAGCGGTTTGCGGAGGAGAAAGCTGCGTGGAAGTACAAGGAGAGAGCGAGAGTTGGGAGTGCGATTAGTTCAATGGCGGAGGAAGTGGCGGTGGAGAGGAAGCTGAGGAGGCAAGCAGAGAGGCTGAACAAGAGGATCGGGAAAGAGCTTGGGGAGGCCAGAGTTGCGGTGGCGAAGGCGATGAAGGATGTGGACAGGGAAAAGCGAGCGAAGGAGATATTGGAGGAGATTTGCGAGGAGTTGGCCAAAGGGATTGGAGAGGACAGAGCGGAATTCGAGGAGCTGAGGAAGGAGTCGGAAAAAGTTAGAGAGGAAGTGGAGAAAGAGAGGGAGATGCTTCAATTGGCCGACGTTTTACGCGAGGAGAGAGTTCAGATGAAGCTGTCGGAGGCGAAGTACCAATTTGAAGAGAAAAACGCTGCGGTGGAGCGCCTCAAGGACCAGCTCGAGGCCTATTTCGTTGAAAACAGAGATCATTCTCAAGATTCGTTCAATAACAAACTCGACAAAATCAAGGAGCTGGAGGCGTATTTGAAGAAGATTAATTTCGGGTCGTACAACAAGAACAAGGAAGAAGAGGATTGTAATTGGGATGAGGAGAGCGATCTGCACTCGATTGAGCTCAACATGGACAACAACAACAAGAGCTACAGGTGGAGTTTCGTGCATGGCTCACACAACGCCTCCAAAAGGAACTCATTCGAGAAGGAAAGGAAGTCGCTTTCGGAGAAAATCCAGTGGGGGAGCATTTGCTTCAACAGTAGTAGTAAAAATGGAGAATTTGAGGGGGACGGGGAGCGGGACGGTGAAATTCAAATTACCCACAAGCAGAAATCTGGTGGTGTGAGGTGCCTTCGGGACATTCTATTTCCAGTTTCAGGAGTCGAAGAAAATAAAGTGGAGAAGACAGAGGATGCCATGCCTCTCCAAATTGATGAACCTTGTTCGGTGGTGGTGATGAAGGGATGA

Coding sequence (CDS)

ATGTCGTGGGCAGAGCAGCAAACAGAGAAGAGATGTAAAATCAGAAAGCGAAAGGGCCCGTGGTCGTCGTCTTCTTCTTATTCGAGTTTGGTCCGCAAGTACAGATTCAAGAAATCCCCCACGTGGAAAATGAGTACAAAATCCCATTCTTCCGCCGGAAAAGGTGTGAAATTGAAGAACAACAACAATTCAGATGCTGTTGAACTTGAAGATAAGAAGGAGCTGATCAAAACCGGGGAAATGGTTTCTCAGATCTCGCATTCGTGTTTATCAGATCCGGATCCGAATTTCAACAACACCAAATCTGAGAAAGTCGAGGGTGGCAGTGGGAGAGTACAGAGAAGAAGAAGATCGGCATGTTCTTTGAGGATTGGGATTGGTGAGATTGAAGTGGGTGGTTCCAATTTTCGGGGCAACGATTGTTTAATGGAGATCGAAAATCGTAGCGAGGTAAAAACGACACGTCGGAAGAAGAAATTTACGGTGAAAACCCGTTTGAAGGAAGTGAGCAATTGCCTGACGACATCGAAGGAGCTCGTGAGAGTTCTAACCCACGTTTGGGGATCCCACGATGAGAAGCAGCCATCATCGGCGTCATCTCTGATGGCGGCTCTGAAATCGGAGCTGGACCGGGCCAAGACCCGAGTGGAGCATTTGATGAGAGACGAGCAGAGGTTGTTCCACGGCGACGAAATTGAGGCTCTGAGGAAGCGGTTTGCGGAGGAGAAAGCTGCGTGGAAGTACAAGGAGAGAGCGAGAGTTGGGAGTGCGATTAGTTCAATGGCGGAGGAAGTGGCGGTGGAGAGGAAGCTGAGGAGGCAAGCAGAGAGGCTGAACAAGAGGATCGGGAAAGAGCTTGGGGAGGCCAGAGTTGCGGTGGCGAAGGCGATGAAGGATGTGGACAGGGAAAAGCGAGCGAAGGAGATATTGGAGGAGATTTGCGAGGAGTTGGCCAAAGGGATTGGAGAGGACAGAGCGGAATTCGAGGAGCTGAGGAAGGAGTCGGAAAAAGTTAGAGAGGAAGTGGAGAAAGAGAGGGAGATGCTTCAATTGGCCGACGTTTTACGCGAGGAGAGAGTTCAGATGAAGCTGTCGGAGGCGAAGTACCAATTTGAAGAGAAAAACGCTGCGGTGGAGCGCCTCAAGGACCAGCTCGAGGCCTATTTCGTTGAAAACAGAGATCATTCTCAAGATTCGTTCAATAACAAACTCGACAAAATCAAGGAGCTGGAGGCGTATTTGAAGAAGATTAATTTCGGGTCGTACAACAAGAACAAGGAAGAAGAGGATTGTAATTGGGATGAGGAGAGCGATCTGCACTCGATTGAGCTCAACATGGACAACAACAACAAGAGCTACAGGTGGAGTTTCGTGCATGGCTCACACAACGCCTCCAAAAGGAACTCATTCGAGAAGGAAAGGAAGTCGCTTTCGGAGAAAATCCAGTGGGGGAGCATTTGCTTCAACAGTAGTAGTAAAAATGGAGAATTTGAGGGGGACGGGGAGCGGGACGGTGAAATTCAAATTACCCACAAGCAGAAATCTGGTGGTGTGAGGTGCCTTCGGGACATTCTATTTCCAGTTTCAGGAGTCGAAGAAAATAAAGTGGAGAAGACAGAGGATGCCATGCCTCTCCAAATTGATGAACCTTGTTCGGTGGTGGTGATGAAGGGATGA

Protein sequence

MSWAEQQTEKRCKIRKRKGPWSSSSSYSSLVRKYRFKKSPTWKMSTKSHSSAGKGVKLKNNNNSDAVELEDKKELIKTGEMVSQISHSCLSDPDPNFNNTKSEKVEGGSGRVQRRRRSACSLRIGIGEIEVGGSNFRGNDCLMEIENRSEVKTTRRKKKFTVKTRLKEVSNCLTTSKELVRVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDEQRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIGKELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVREEVEKEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDSFNNKLDKIKELEAYLKKINFGSYNKNKEEEDCNWDEESDLHSIELNMDNNNKSYRWSFVHGSHNASKRNSFEKERKSLSEKIQWGSICFNSSSKNGEFEGDGERDGEIQITHKQKSGGVRCLRDILFPVSGVEENKVEKTEDAMPLQIDEPCSVVVMKG
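
The protein sequence is the conceptual translation of the CDS. As a rough illustration, the Python sketch below translates the first 36 nucleotides of the CDS. It assumes the standard nuclear genetic code (NCBI translation table 1) and reading frame 0; the helper name `translate` is hypothetical and is not part of any tool referenced on this page.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1); codons are ordered
# TTT, TTC, TTA, TTG, TCT, ... with the third base varying fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First 36 nt of the coding sequence listed on this page.
print(translate("ATGTCGTGGGCAGAGCAGCAAACAGAGAAGAGATGT"))
```

This prints `MSWAEQQTEKRC`, which matches the first twelve residues of the protein sequence.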
Homology
BLAST of Moc02g02310 vs. NCBI nr
Match: XP_022153077.1 (uncharacterized protein LOC111020667 isoform X1 [Momordica charantia])

HSP 1 Score: 1057.0 bits (2732), Expect = 5.7e-305
Identity = 558/558 (100.00%), Positives = 558/558 (100.00%), Query Frame = 0

Query: 1   MSWAEQQTEKRCKIRKRKGPWSSSSSYSSLVRKYRFKKSPTWKMSTKSHSSAGKGVKLKN 60
           MSWAEQQTEKRCKIRKRKGPWSSSSSYSSLVRKYRFKKSPTWKMSTKSHSSAGKGVKLKN
Sbjct: 1   MSWAEQQTEKRCKIRKRKGPWSSSSSYSSLVRKYRFKKSPTWKMSTKSHSSAGKGVKLKN 60

Query: 61  NNNSDAVELEDKKELIKTGEMVSQISHSCLSDPDPNFNNTKSEKVEGGSGRVQRRRRSAC 120
           NNNSDAVELEDKKELIKTGEMVSQISHSCLSDPDPNFNNTKSEKVEGGSGRVQRRRRSAC
Sbjct: 61  NNNSDAVELEDKKELIKTGEMVSQISHSCLSDPDPNFNNTKSEKVEGGSGRVQRRRRSAC 120

Query: 121 SLRIGIGEIEVGGSNFRGNDCLMEIENRSEVKTTRRKKKFTVKTRLKEVSNCLTTSKELV 180
           SLRIGIGEIEVGGSNFRGNDCLMEIENRSEVKTTRRKKKFTVKTRLKEVSNCLTTSKELV
Sbjct: 121 SLRIGIGEIEVGGSNFRGNDCLMEIENRSEVKTTRRKKKFTVKTRLKEVSNCLTTSKELV 180

Query: 181 RVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDEQRLFHGDEIEALRKRFA 240
           RVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDEQRLFHGDEIEALRKRFA
Sbjct: 181 RVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDEQRLFHGDEIEALRKRFA 240

Query: 241 EEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIGKELGEARVAVAKAMKDV 300
           EEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIGKELGEARVAVAKAMKDV
Sbjct: 241 EEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIGKELGEARVAVAKAMKDV 300

Query: 301 DREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVREEVEKEREMLQLADVLREERV 360
           DREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVREEVEKEREMLQLADVLREERV
Sbjct: 301 DREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVREEVEKEREMLQLADVLREERV 360

Query: 361 QMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDSFNNKLDKIKELEAYLKKINFG 420
           QMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDSFNNKLDKIKELEAYLKKINFG
Sbjct: 361 QMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDSFNNKLDKIKELEAYLKKINFG 420

Query: 421 SYNKNKEEEDCNWDEESDLHSIELNMDNNNKSYRWSFVHGSHNASKRNSFEKERKSLSEK 480
           SYNKNKEEEDCNWDEESDLHSIELNMDNNNKSYRWSFVHGSHNASKRNSFEKERKSLSEK
Sbjct: 421 SYNKNKEEEDCNWDEESDLHSIELNMDNNNKSYRWSFVHGSHNASKRNSFEKERKSLSEK 480

Query: 481 IQWGSICFNSSSKNGEFEGDGERDGEIQITHKQKSGGVRCLRDILFPVSGVEENKVEKTE 540
           IQWGSICFNSSSKNGEFEGDGERDGEIQITHKQKSGGVRCLRDILFPVSGVEENKVEKTE
Sbjct: 481 IQWGSICFNSSSKNGEFEGDGERDGEIQITHKQKSGGVRCLRDILFPVSGVEENKVEKTE 540

Query: 541 DAMPLQIDEPCSVVVMKG 559
           DAMPLQIDEPCSVVVMKG
Sbjct: 541 DAMPLQIDEPCSVVVMKG 558
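
Each HSP header on this page reports identities and positives as counts over the alignment length (gap columns included), followed by the percentage in parentheses. The Python sketch below recomputes those percentages from a header line. The helper name `hsp_percentages` is hypothetical, and the counts in the sample line are those of the Benincasa hispida hit on this page.

```python
import re

def hsp_percentages(header):
    """Recompute identity and positives percentages from a BLAST HSP header line."""
    # 'Pos\w*ves' tolerates spelling variants of the 'Positives' label.
    m = re.search(r"Identity = (\d+)/(\d+).*?Pos\w*ves = (\d+)/(\d+)", header)
    if m is None:
        raise ValueError("not an HSP identity line")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return (round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2))

line = "Identity = 388/604 (64.24%), Positives = 437/604 (72.35%), Query Frame = 0"
print(hsp_percentages(line))
```

The alignment length (604 here) can exceed the 558-residue query because gap columns are counted in the denominator.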

BLAST of Moc02g02310 vs. NCBI nr
Match: XP_022153078.1 (uncharacterized protein LOC111020667 isoform X2 [Momordica charantia] >XP_022153079.1 uncharacterized protein LOC111020667 isoform X2 [Momordica charantia])

HSP 1 Score: 971.8 bits (2511), Expect = 2.4e-279
Identity = 515/515 (100.00%), Positives = 515/515 (100.00%), Query Frame = 0

Query: 44  MSTKSHSSAGKGVKLKNNNNSDAVELEDKKELIKTGEMVSQISHSCLSDPDPNFNNTKSE 103
           MSTKSHSSAGKGVKLKNNNNSDAVELEDKKELIKTGEMVSQISHSCLSDPDPNFNNTKSE
Sbjct: 1   MSTKSHSSAGKGVKLKNNNNSDAVELEDKKELIKTGEMVSQISHSCLSDPDPNFNNTKSE 60

Query: 104 KVEGGSGRVQRRRRSACSLRIGIGEIEVGGSNFRGNDCLMEIENRSEVKTTRRKKKFTVK 163
           KVEGGSGRVQRRRRSACSLRIGIGEIEVGGSNFRGNDCLMEIENRSEVKTTRRKKKFTVK
Sbjct: 61  KVEGGSGRVQRRRRSACSLRIGIGEIEVGGSNFRGNDCLMEIENRSEVKTTRRKKKFTVK 120

Query: 164 TRLKEVSNCLTTSKELVRVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDE 223
           TRLKEVSNCLTTSKELVRVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDE
Sbjct: 121 TRLKEVSNCLTTSKELVRVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDE 180

Query: 224 QRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIG 283
           QRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIG
Sbjct: 181 QRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIG 240

Query: 284 KELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVREEVE 343
           KELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVREEVE
Sbjct: 241 KELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVREEVE 300

Query: 344 KEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDSFNNK 403
           KEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDSFNNK
Sbjct: 301 KEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDSFNNK 360

Query: 404 LDKIKELEAYLKKINFGSYNKNKEEEDCNWDEESDLHSIELNMDNNNKSYRWSFVHGSHN 463
           LDKIKELEAYLKKINFGSYNKNKEEEDCNWDEESDLHSIELNMDNNNKSYRWSFVHGSHN
Sbjct: 361 LDKIKELEAYLKKINFGSYNKNKEEEDCNWDEESDLHSIELNMDNNNKSYRWSFVHGSHN 420

Query: 464 ASKRNSFEKERKSLSEKIQWGSICFNSSSKNGEFEGDGERDGEIQITHKQKSGGVRCLRD 523
           ASKRNSFEKERKSLSEKIQWGSICFNSSSKNGEFEGDGERDGEIQITHKQKSGGVRCLRD
Sbjct: 421 ASKRNSFEKERKSLSEKIQWGSICFNSSSKNGEFEGDGERDGEIQITHKQKSGGVRCLRD 480

Query: 524 ILFPVSGVEENKVEKTEDAMPLQIDEPCSVVVMKG 559
           ILFPVSGVEENKVEKTEDAMPLQIDEPCSVVVMKG
Sbjct: 481 ILFPVSGVEENKVEKTEDAMPLQIDEPCSVVVMKG 515

BLAST of Moc02g02310 vs. NCBI nr
Match: XP_038900292.1 (uncharacterized protein At5g41620 isoform X1 [Benincasa hispida])

HSP 1 Score: 615.9 bits (1587), Expect = 3.4e-172
Identity = 388/604 (64.24%), Positives = 437/604 (72.35%), Query Frame = 0

Query: 2   SWAEQQTE-KRCKIRKRKGPWSSSSSYSSLVRKYRFKKSPTWKMSTKSHSS--------- 61
           SWAEQ+TE KRCKIRKR     SS S S+LVRKYRFKK PTWKMSTKSHSS         
Sbjct: 3   SWAEQKTEKKRCKIRKR--VCLSSPSSSTLVRKYRFKKPPTWKMSTKSHSSKLSTGDLPN 62

Query: 62  ---------AGKGVK---------------LKNNNNSDAVELEDKKELIKTGEMVSQISH 121
                     GKG +                K  NNSD V  E+KKEL+KT EMVSQISH
Sbjct: 63  RSPSCSLDGGGKGKEGSVSVSARKSAGNNSQKLKNNSDVV--EEKKELMKTREMVSQISH 122

Query: 122 SCLSDPDPNFNNTKSEKVEGGSGRVQRRRRSACSLRIGIGEIEVGGSNFRGNDCLMEIEN 181
           SCLSDPD +  NTK+EK E   GRV RRR SA SLRIGIGE+ VGGSNF GNDCLMEIEN
Sbjct: 123 SCLSDPDRSMKNTKTEKDE--VGRVHRRRGSASSLRIGIGEM-VGGSNFHGNDCLMEIEN 182

Query: 182 RSEVKTTRRKKKFTVKTRLKEVSNCLTTSKELVRVLTHVWGSHDEKQPSSASSLMAALKS 241
            +  KTTRRK K T+KTRLKEVSNCLTTSKEL+RVL HV G H+E  PSS SSL+ ALKS
Sbjct: 183 GNVGKTTRRKTKSTIKTRLKEVSNCLTTSKELLRVLHHVLG-HEEHPPSSTSSLITALKS 242

Query: 242 ELDRAKTRVEHLMRDEQRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAV 301
           ELDRAKTRV+HL++D+   FHGDEIE LRKRFAEEKAAWKY+ERAR GSAISSMAEEV V
Sbjct: 243 ELDRAKTRVDHLIKDQ--TFHGDEIEVLRKRFAEEKAAWKYRERARFGSAISSMAEEVEV 302

Query: 302 ERKLRRQAERLNKRIGKELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAE 361
           E+KLRRQAERLNK I KEL EA+V+V+KAMK+V+REKRAKEILE+ICEELAKGIGEDRAE
Sbjct: 303 EKKLRRQAERLNKTIAKELAEAKVSVSKAMKEVEREKRAKEILEQICEELAKGIGEDRAE 362

Query: 362 FEELRKESEKVREEVEKEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEA 421
           FEEL+KES KVREEVEKEREML LADVLREERVQMKLSEAKYQFEEKNAAVERLK QL+A
Sbjct: 363 FEELKKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLKHQLQA 422

Query: 422 YFV-------ENRDHSQDSFNNKLDKIKELEAYLKKINFGS-------YNKNKEEEDC-N 481
           Y V       +N   +Q+   N+ +KIKELEAYLKKINFGS         K +E EDC +
Sbjct: 423 YLVTQFGNEEQNGGENQEYSCNEFEKIKELEAYLKKINFGSCQDSEKLVRKEEENEDCSD 482

Query: 482 WDEESDLHSIELNMDNNNKSYRWSFVHGSHNASKRNSFEKERKSLSEKIQWGSICFNSSS 541
            +EESDLHSIELNMDNNNKSYRWSFVHG    + +      RKS+SEKIQWGSIC N+++
Sbjct: 483 EEEESDLHSIELNMDNNNKSYRWSFVHGEKADNNQIQSNNGRKSISEKIQWGSICLNTTN 542

BLAST of Moc02g02310 vs. NCBI nr
Match: KAG7035923.1 (hypothetical protein SDJN02_02723 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 575.5 bits (1482), Expect = 5.1e-160
Identity = 350/585 (59.83%), Positives = 424/585 (72.48%), Query Frame = 0

Query: 1   MSWAEQQTEKRCKIRKRKGPWSSSSSYSSLVRKYRFKKSPTWKMSTKSHSS--------A 60
           MSW EQ+TE+ CKIRKR+   SSSSS S+LV KYRFK +PTWKMSTKSHSS        A
Sbjct: 1   MSWPEQKTEEICKIRKRRCSSSSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCSIA 60

Query: 61  GKGVKLKNNNNSDAVE----------------LEDKKELIKTGEMVSQISHSCLSDPDPN 120
           G G K K  + S +V                 +EDK+EL+KT + VSQISHSCLSDPDP 
Sbjct: 61  GGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPDPC 120

Query: 121 FNNTKSEKVEGGSGRVQRRRRSACSLRIGIGEIEVGGSNFRGNDCLMEIENRS-EVKTTR 180
           FN++ S+KVEG   RV RRR SA S+R+G GE     +NF GN CL+EIEN S + +T R
Sbjct: 121 FNDSNSKKVEG--DRVHRRRTSASSMRLGTGE-----ANFHGNHCLIEIENPSNQGRTVR 180

Query: 181 RKKKFTVKTRLKEVSNCLTTSKELVRVLTHVWG--SHDEKQPSSASSLMAALKSELDRAK 240
           RK KF +KTRLKEV NCLTTSKEL+RVL HV     +D+ +PSS S L+ ALKSE++RAK
Sbjct: 181 RKTKFMLKTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKSEMERAK 240

Query: 241 TRVEHLMRDEQRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKLRR 300
            RV+HL++D+    HGDEIE + KRF EEK AWK +ERARV S+I+SMA+E+ +E+KLRR
Sbjct: 241 ARVDHLIKDQS--LHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRR 300

Query: 301 QAERLNKRIGKELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEELRK 360
           QAERLNK I KEL EA+++++KAMKD+ RE+RAKEI E+IC+ELAKGIGEDRA+FEE +K
Sbjct: 301 QAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKK 360

Query: 361 ESEKVREEVEKEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVENR 420
           ES KVREE+E+EREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKD+LEA+ +   
Sbjct: 361 ESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQF 420

Query: 421 DHSQDSFNNKLDKIKELEAYLKKINFGSYNKNK------EEEDCNWDEESDLHSIELNMD 480
            H      +   KIKELEAYLKKINFGS  ++       EE++C+ +++SDLHSIELNMD
Sbjct: 421 RHENREEEDYSGKIKELEAYLKKINFGSVQEHLEGDEKIEEQECSEEDDSDLHSIELNMD 480

Query: 481 NNNKSYRWSFVHGSHNASKRNSFEKE----RKSLSEKIQWGSICFN----SSSKNGEFEG 528
           NNNKSYRWSFVHG    SKRNSFEK+    RKS+SEKIQWGSIC N    + SKNGEF G
Sbjct: 481 NNNKSYRWSFVHG---GSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGEFVG 540

BLAST of Moc02g02310 vs. NCBI nr
Match: XP_022995046.1 (uncharacterized protein LOC111490718 isoform X1 [Cucurbita maxima] >XP_022995048.1 uncharacterized protein LOC111490718 isoform X1 [Cucurbita maxima])

HSP 1 Score: 575.5 bits (1482), Expect = 5.1e-160
Identity = 353/587 (60.14%), Positives = 426/587 (72.57%), Query Frame = 0

Query: 1   MSWAEQQTEKRCKIRKRK--GPWSSSSSYSSLVRKYRFKKSPTWKMSTKSHSS------- 60
           MSW EQ+TE+ CKIRKR+     SSSSS S+LV KYRFK +PTWKMSTKSHSS       
Sbjct: 1   MSWPEQKTEEICKIRKRRCSSSSSSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCS 60

Query: 61  -AGKGVKLKNNNNSDAVE----------------LEDKKELIKTGEMVSQISHSCLSDPD 120
            AG G K K  + S +V                 +EDK+EL+KT + VSQISHSCLSDPD
Sbjct: 61  VAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPD 120

Query: 121 PNFNNTKSEKVEGGSGRVQRRRRSACSLRIGIGEIEVGGSNFRGNDCLMEIENRS-EVKT 180
           P FN++ S+KVEG   RV RRR SA SLRIG GE     +NF GN CL+EIEN S + +T
Sbjct: 121 PCFNDSNSKKVEG--DRVHRRRTSASSLRIGTGE-----ANFHGNHCLIEIENPSNQGRT 180

Query: 181 TRRKKKFTVKTRLKEVSNCLTTSKELVRVLTHVWG--SHDEKQPSSASSLMAALKSELDR 240
            RRK KF +KTRLKEVSNCLTTSKELVRVL HV     +D+ +PSS S L+ ALKSE++R
Sbjct: 181 ARRKTKFMLKTRLKEVSNCLTTSKELVRVLNHVLAHEDNDQHRPSSISPLITALKSEMER 240

Query: 241 AKTRVEHLMRDEQRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKL 300
           AK RV+HL++D+   FHGDEIE + KRF EEK AWK +ERARV S+I+SMA+E+ +E+KL
Sbjct: 241 AKARVDHLIKDQS--FHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKL 300

Query: 301 RRQAERLNKRIGKELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEEL 360
           R+QAERLNK I KEL EA+++++KAMKD+ RE+RAKEI E+IC+ELAKGIGEDRA+FEE 
Sbjct: 301 RKQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEF 360

Query: 361 RKESEKVREEVEKEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVE 420
           +KES KVREE+E+EREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKD+LEA+ + 
Sbjct: 361 KKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLIT 420

Query: 421 NRDHSQDSFNNKLDKIKELEAYLKKINFGSYNKNK------EEEDCNWDEESDLHSIELN 480
              H      +   KIKELEAYLKKINFGS  ++       EE++C+ +++SDLHSIELN
Sbjct: 421 QFRHENREEEDYSGKIKELEAYLKKINFGSVQEHPDGDGKIEEQECSEEDDSDLHSIELN 480

Query: 481 MDNNNKSYRWSFVHGSHNASKRNSFEKE----RKSLSEKIQWGSICFN----SSSKNGEF 528
           MDNNNKSYRWSFVHG    SKRNSFEK+    RKS+SEKIQWGSIC N    + SKNG+F
Sbjct: 481 MDNNNKSYRWSFVHG---GSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGDF 540

BLAST of Moc02g02310 vs. ExPASy Swiss-Prot
Match: Q66GQ2 (Uncharacterized protein At5g41620 OS=Arabidopsis thaliana OX=3702 GN=At5g41620 PE=2 SV=2)

HSP 1 Score: 88.6 bits (218), Expect = 2.5e-16
Identity = 96/342 (28.07%), Positives = 178/342 (52.05%), Query Frame = 0

Query: 118 SACSLRIGIGE--IEVGGSNFRGNDCLMEIENR---SEVKTTRRKKKFTVKTRL------ 177
           SA SLR  IG+  I+   S  R N  L  +      S ++ T   K  T  + L      
Sbjct: 126 SAGSLRRQIGQMLIKHHQSIDRNNHALQPVSPASYGSSLEVTTYNKAVTPSSSLEFRGRP 185

Query: 178 -KEVSNCLTTSKELVRVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDEQR 237
            +E    L TS EL++VL  +W    E+Q  S  SL+ ALK+E+  ++ R++ L+R +Q 
Sbjct: 186 SREPHYNLKTSTELLKVLNRIWSL--EEQHVSNISLIKALKTEVAHSRVRIKELLRYQQA 245

Query: 238 LFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIGKE 297
             H  E++++ K+ AEEK   K KE  R+ SA+ S+ + +  ERKLR+++E L++++ +E
Sbjct: 246 DRH--ELDSVVKQLAEEKLLSKNKEVERMSSAVQSVRKALEDERKLRKRSESLHRKMARE 305

Query: 298 LGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEELRKES--EKVREEVE 357
           L E + +++  +K+++R  ++ +++E +C+E AKGI     E   L+K++  +       
Sbjct: 306 LSEVKSSLSNCVKELERGSKSNKMMELLCDEFAKGIKSYEEEIHGLKKKNLDKDWAGRGG 365

Query: 358 KEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDSFNNK 417
            ++ +L +A+   +ER+QM+L        +  + +++L+ ++E +  E R+    +  N 
Sbjct: 366 GDQLVLHIAESWLDERMQMRLEGGDTLNGKNRSVLDKLEVEIETFLQEKRNEIPRNRRNS 425

Query: 418 LDKIKELEAYLKKINFGSYNKNKEEEDCNWDE-ESDLHSIEL 445
           L+ +           F + +    + DC  D   SD +  EL
Sbjct: 426 LESVP----------FNTLSAPPRDVDCEEDSGGSDSNCFEL 453

BLAST of Moc02g02310 vs. ExPASy Swiss-Prot
Match: F4I878 (Protein BRANCHLESS TRICHOME OS=Arabidopsis thaliana OX=3702 GN=BLT PE=1 SV=1)

HSP 1 Score: 50.1 bits (118), Expect = 9.7e-05
Identity = 45/137 (32.85%), Positives = 73/137 (53.28%), Query Frame = 0

Query: 250 ERARVGSAISSMAEEVAVERKLRRQAERLNKRIGKELGEARVAVAKAMKDVDREKRAKEI 309
           E  +    I  +  E+  ERK RR+AE + K++               KDV+ E+ A+E 
Sbjct: 76  ELGKAQDEIKELKAELDYERKARRRAELMIKKLA--------------KDVEEERMAREA 135

Query: 310 LEEICEELAKGIGEDRAEFEELRKESEKVREEVEKEREMLQLADVLREERVQMKLSEAKY 369
            E   + L K +  +++E   +++       ++E+ER+M +LA+VLREERVQMKL +A+ 
Sbjct: 136 EEMQNKRLFKELSSEKSEMVRMKR-------DLEEERQMHRLAEVLREERVQMKLMDARL 191

Query: 370 QFEEKNAAVERLKDQLE 387
             EEK + +E    Q E
Sbjct: 196 FLEEKLSELEEANRQGE 191

BLAST of Moc02g02310 vs. ExPASy TrEMBL
Match: A0A6J1DHY9 (uncharacterized protein LOC111020667 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111020667 PE=4 SV=1)

HSP 1 Score: 1057.0 bits (2732), Expect = 2.8e-305
Identity = 558/558 (100.00%), Positives = 558/558 (100.00%), Query Frame = 0

Query: 1   MSWAEQQTEKRCKIRKRKGPWSSSSSYSSLVRKYRFKKSPTWKMSTKSHSSAGKGVKLKN 60
           MSWAEQQTEKRCKIRKRKGPWSSSSSYSSLVRKYRFKKSPTWKMSTKSHSSAGKGVKLKN
Sbjct: 1   MSWAEQQTEKRCKIRKRKGPWSSSSSYSSLVRKYRFKKSPTWKMSTKSHSSAGKGVKLKN 60

Query: 61  NNNSDAVELEDKKELIKTGEMVSQISHSCLSDPDPNFNNTKSEKVEGGSGRVQRRRRSAC 120
           NNNSDAVELEDKKELIKTGEMVSQISHSCLSDPDPNFNNTKSEKVEGGSGRVQRRRRSAC
Sbjct: 61  NNNSDAVELEDKKELIKTGEMVSQISHSCLSDPDPNFNNTKSEKVEGGSGRVQRRRRSAC 120

Query: 121 SLRIGIGEIEVGGSNFRGNDCLMEIENRSEVKTTRRKKKFTVKTRLKEVSNCLTTSKELV 180
           SLRIGIGEIEVGGSNFRGNDCLMEIENRSEVKTTRRKKKFTVKTRLKEVSNCLTTSKELV
Sbjct: 121 SLRIGIGEIEVGGSNFRGNDCLMEIENRSEVKTTRRKKKFTVKTRLKEVSNCLTTSKELV 180

Query: 181 RVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDEQRLFHGDEIEALRKRFA 240
           RVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDEQRLFHGDEIEALRKRFA
Sbjct: 181 RVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDEQRLFHGDEIEALRKRFA 240

Query: 241 EEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIGKELGEARVAVAKAMKDV 300
           EEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIGKELGEARVAVAKAMKDV
Sbjct: 241 EEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIGKELGEARVAVAKAMKDV 300

Query: 301 DREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVREEVEKEREMLQLADVLREERV 360
           DREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVREEVEKEREMLQLADVLREERV
Sbjct: 301 DREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVREEVEKEREMLQLADVLREERV 360

Query: 361 QMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDSFNNKLDKIKELEAYLKKINFG 420
           QMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDSFNNKLDKIKELEAYLKKINFG
Sbjct: 361 QMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDSFNNKLDKIKELEAYLKKINFG 420

Query: 421 SYNKNKEEEDCNWDEESDLHSIELNMDNNNKSYRWSFVHGSHNASKRNSFEKERKSLSEK 480
           SYNKNKEEEDCNWDEESDLHSIELNMDNNNKSYRWSFVHGSHNASKRNSFEKERKSLSEK
Sbjct: 421 SYNKNKEEEDCNWDEESDLHSIELNMDNNNKSYRWSFVHGSHNASKRNSFEKERKSLSEK 480

Query: 481 IQWGSICFNSSSKNGEFEGDGERDGEIQITHKQKSGGVRCLRDILFPVSGVEENKVEKTE 540
           IQWGSICFNSSSKNGEFEGDGERDGEIQITHKQKSGGVRCLRDILFPVSGVEENKVEKTE
Sbjct: 481 IQWGSICFNSSSKNGEFEGDGERDGEIQITHKQKSGGVRCLRDILFPVSGVEENKVEKTE 540

Query: 541 DAMPLQIDEPCSVVVMKG 559
           DAMPLQIDEPCSVVVMKG
Sbjct: 541 DAMPLQIDEPCSVVVMKG 558

BLAST of Moc02g02310 vs. ExPASy TrEMBL
Match: A0A6J1DGK1 (uncharacterized protein LOC111020667 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111020667 PE=4 SV=1)

HSP 1 Score: 971.8 bits (2511), Expect = 1.2e-279
Identity = 515/515 (100.00%), Positives = 515/515 (100.00%), Query Frame = 0

Query: 44  MSTKSHSSAGKGVKLKNNNNSDAVELEDKKELIKTGEMVSQISHSCLSDPDPNFNNTKSE 103
           MSTKSHSSAGKGVKLKNNNNSDAVELEDKKELIKTGEMVSQISHSCLSDPDPNFNNTKSE
Sbjct: 1   MSTKSHSSAGKGVKLKNNNNSDAVELEDKKELIKTGEMVSQISHSCLSDPDPNFNNTKSE 60

Query: 104 KVEGGSGRVQRRRRSACSLRIGIGEIEVGGSNFRGNDCLMEIENRSEVKTTRRKKKFTVK 163
           KVEGGSGRVQRRRRSACSLRIGIGEIEVGGSNFRGNDCLMEIENRSEVKTTRRKKKFTVK
Sbjct: 61  KVEGGSGRVQRRRRSACSLRIGIGEIEVGGSNFRGNDCLMEIENRSEVKTTRRKKKFTVK 120

Query: 164 TRLKEVSNCLTTSKELVRVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDE 223
           TRLKEVSNCLTTSKELVRVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDE
Sbjct: 121 TRLKEVSNCLTTSKELVRVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDE 180

Query: 224 QRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIG 283
           QRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIG
Sbjct: 181 QRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIG 240

Query: 284 KELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVREEVE 343
           KELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVREEVE
Sbjct: 241 KELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVREEVE 300

Query: 344 KEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDSFNNK 403
           KEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDSFNNK
Sbjct: 301 KEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDSFNNK 360

Query: 404 LDKIKELEAYLKKINFGSYNKNKEEEDCNWDEESDLHSIELNMDNNNKSYRWSFVHGSHN 463
           LDKIKELEAYLKKINFGSYNKNKEEEDCNWDEESDLHSIELNMDNNNKSYRWSFVHGSHN
Sbjct: 361 LDKIKELEAYLKKINFGSYNKNKEEEDCNWDEESDLHSIELNMDNNNKSYRWSFVHGSHN 420

Query: 464 ASKRNSFEKERKSLSEKIQWGSICFNSSSKNGEFEGDGERDGEIQITHKQKSGGVRCLRD 523
           ASKRNSFEKERKSLSEKIQWGSICFNSSSKNGEFEGDGERDGEIQITHKQKSGGVRCLRD
Sbjct: 421 ASKRNSFEKERKSLSEKIQWGSICFNSSSKNGEFEGDGERDGEIQITHKQKSGGVRCLRD 480

Query: 524 ILFPVSGVEENKVEKTEDAMPLQIDEPCSVVVMKG 559
           ILFPVSGVEENKVEKTEDAMPLQIDEPCSVVVMKG
Sbjct: 481 ILFPVSGVEENKVEKTEDAMPLQIDEPCSVVVMKG 515

BLAST of Moc02g02310 vs. ExPASy TrEMBL
Match: A0A6J1K0X3 (uncharacterized protein LOC111490718 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111490718 PE=4 SV=1)

HSP 1 Score: 575.5 bits (1482), Expect = 2.4e-160
Identity = 353/587 (60.14%), Positives = 426/587 (72.57%), Query Frame = 0

Query: 1   MSWAEQQTEKRCKIRKRK--GPWSSSSSYSSLVRKYRFKKSPTWKMSTKSHSS------- 60
           MSW EQ+TE+ CKIRKR+     SSSSS S+LV KYRFK +PTWKMSTKSHSS       
Sbjct: 1   MSWPEQKTEEICKIRKRRCSSSSSSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCS 60

Query: 61  -AGKGVKLKNNNNSDAVE----------------LEDKKELIKTGEMVSQISHSCLSDPD 120
            AG G K K  + S +V                 +EDK+EL+KT + VSQISHSCLSDPD
Sbjct: 61  VAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPD 120

Query: 121 PNFNNTKSEKVEGGSGRVQRRRRSACSLRIGIGEIEVGGSNFRGNDCLMEIENRS-EVKT 180
           P FN++ S+KVEG   RV RRR SA SLRIG GE     +NF GN CL+EIEN S + +T
Sbjct: 121 PCFNDSNSKKVEG--DRVHRRRTSASSLRIGTGE-----ANFHGNHCLIEIENPSNQGRT 180

Query: 181 TRRKKKFTVKTRLKEVSNCLTTSKELVRVLTHVWG--SHDEKQPSSASSLMAALKSELDR 240
            RRK KF +KTRLKEVSNCLTTSKELVRVL HV     +D+ +PSS S L+ ALKSE++R
Sbjct: 181 ARRKTKFMLKTRLKEVSNCLTTSKELVRVLNHVLAHEDNDQHRPSSISPLITALKSEMER 240

Query: 241 AKTRVEHLMRDEQRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKL 300
           AK RV+HL++D+   FHGDEIE + KRF EEK AWK +ERARV S+I+SMA+E+ +E+KL
Sbjct: 241 AKARVDHLIKDQS--FHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKL 300

Query: 301 RRQAERLNKRIGKELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEEL 360
           R+QAERLNK I KEL EA+++++KAMKD+ RE+RAKEI E+IC+ELAKGIGEDRA+FEE 
Sbjct: 301 RKQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEF 360

Query: 361 RKESEKVREEVEKEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVE 420
           +KES KVREE+E+EREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKD+LEA+ + 
Sbjct: 361 KKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLIT 420

Query: 421 NRDHSQDSFNNKLDKIKELEAYLKKINFGSYNKNK------EEEDCNWDEESDLHSIELN 480
              H      +   KIKELEAYLKKINFGS  ++       EE++C+ +++SDLHSIELN
Sbjct: 421 QFRHENREEEDYSGKIKELEAYLKKINFGSVQEHPDGDGKIEEQECSEEDDSDLHSIELN 480

Query: 481 MDNNNKSYRWSFVHGSHNASKRNSFEKE----RKSLSEKIQWGSICFN----SSSKNGEF 528
           MDNNNKSYRWSFVHG    SKRNSFEK+    RKS+SEKIQWGSIC N    + SKNG+F
Sbjct: 481 MDNNNKSYRWSFVHG---GSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGDF 540

BLAST of Moc02g02310 vs. ExPASy TrEMBL
Match: A0A6J1H037 (uncharacterized protein LOC111459204 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111459204 PE=4 SV=1)

HSP 1 Score: 571.2 bits (1471), Expect = 4.6e-159
Identity = 348/585 (59.49%), Positives = 423/585 (72.31%), Query Frame = 0

Query: 1   MSWAEQQTEKRCKIRKRKGPWSSSSSYSSLVRKYRFKKSPTWKMSTKSHSS--------A 60
           MSW EQ+TE+ CKIRKR+    SSSS S+LV KYRFK +PTWKMSTKSHSS        A
Sbjct: 1   MSWPEQKTEEICKIRKRRCSSLSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCSIA 60

Query: 61  GKGVKLKNNNNSDAVE----------------LEDKKELIKTGEMVSQISHSCLSDPDPN 120
           G G K K  + S +V                 +EDK+EL+KT + VSQISHSCLSDPDP 
Sbjct: 61  GGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPDPC 120

Query: 121 FNNTKSEKVEGGSGRVQRRRRSACSLRIGIGEIEVGGSNFRGNDCLMEIENRS-EVKTTR 180
           FN++ S+KVEG   RV RRR SA S+R+G GE     +NF G+ CL+EIEN S + KT R
Sbjct: 121 FNDSNSKKVEG--DRVHRRRTSASSMRLGTGE-----ANFHGDHCLIEIENPSNQGKTAR 180

Query: 181 RKKKFTVKTRLKEVSNCLTTSKELVRVLTHVWG--SHDEKQPSSASSLMAALKSELDRAK 240
           RK KF +KTRLKEV NCLTTSKEL+RVL HV     +D+ +PSS S L+ ALKSE++RAK
Sbjct: 181 RKTKFMLKTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKSEMERAK 240

Query: 241 TRVEHLMRDEQRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKLRR 300
            RV+HL++D+    HGDEIE + KRF EEK AWK +ERARV S+I+SMA+E+ +E+KLR+
Sbjct: 241 ARVDHLIKDQS--LHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRK 300

Query: 301 QAERLNKRIGKELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEELRK 360
           QAERLNK I KEL EA+++++KAMKD+ RE+RAKEI E+IC+ELAKGIGEDRA+FEE +K
Sbjct: 301 QAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKK 360

Query: 361 ESEKVREEVEKEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVENR 420
           ES KVREE+E+EREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKD+LEA+ +   
Sbjct: 361 ESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQF 420

Query: 421 DHSQDSFNNKLDKIKELEAYLKKINFGSYNKNK------EEEDCNWDEESDLHSIELNMD 480
            H      +   KIKELEAYLKKINFGS  ++       EE++C+ +++SDLHSIELNMD
Sbjct: 421 RHENREEEDYSGKIKELEAYLKKINFGSVQEHLEGDEKIEEQECSEEDDSDLHSIELNMD 480

Query: 481 NNNKSYRWSFVHGSHNASKRNSFEKE----RKSLSEKIQWGSICFN----SSSKNGEFEG 528
           NNNKSYRWSFVHG    SKRNSFEK+    RKS+SEKIQWGSIC N    + SKNGEF G
Sbjct: 481 NNNKSYRWSFVHG---GSKRNSFEKDEINGRKSVSEKIQWGSICLNRKASNGSKNGEFVG 540

BLAST of Moc02g02310 vs. ExPASy TrEMBL
Match: A0A0A0KEV9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G425750 PE=4 SV=1)

HSP 1 Score: 539.3 bits (1388), Expect = 1.9e-149
Identity = 359/609 (58.95%), Positives = 425/609 (69.79%), Query Frame = 0

Query: 2   SWAEQQTE-KRCKIRKRKGPWSSSSSYSSLVRKYRFKKSPTWKMSTKSHSSAGK---GVK 61
           SWAEQ+TE K+CKIRKR     SS S S+ VRKYRFKK PTWKMSTKS S + K      
Sbjct: 3   SWAEQKTEKKKCKIRKR--VCLSSPSSSTFVRKYRFKKPPTWKMSTKSKSHSSKLSTTDD 62

Query: 62  LKNNNNSDAVE------------------LEDKKELI--KTGEMVSQISHSCLSDPDPNF 121
           + N + S +V                   L+   E++  K+ E+VS+IS + LSDPD + 
Sbjct: 63  IVNRSPSCSVNKGKEEEEGGGGGGSVSRILKKNSEVVEDKSRELVSEISETNLSDPDRSV 122

Query: 122 NNTK-SEKVE-GGSGRVQRRRRSACS---LRIGIGEIEVGGSNFRGNDCL-MEIENRSEV 181
            NTK +EK E G   RV RRRRSA +   LRIG GE+ VGGSNF GNDCL MEIEN +  
Sbjct: 123 KNTKTTEKDEIGTMKRVHRRRRSAATEPCLRIGNGEM-VGGSNFHGNDCLTMEIENGNVE 182

Query: 182 KTTRRKKKFTVKTRLKEVSNCLTTSKELVRVLTHVWGSHDEKQPSSASSLMAALKSELDR 241
           KTTRRK K TVKTRLKEVSNCLTTSKEL+RVL H+   H++  PSS SSL++ALKSELDR
Sbjct: 183 KTTRRKTKTTVKTRLKEVSNCLTTSKELLRVLHHIL-LHEDHLPSSTSSLISALKSELDR 242

Query: 242 AKTRVEHLMRDEQRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKL 301
           AKTRV+HL++D+   F+ DEIE L++R AEEKAAWKY+ERAR GSAISSMAEE+ +E+KL
Sbjct: 243 AKTRVDHLIKDQ--TFNVDEIEVLKRRLAEEKAAWKYRERARFGSAISSMAEEMEIEKKL 302

Query: 302 RRQAERLNKRIGKELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEEL 361
           RRQAERLNK I KEL EA+V+V+KAMK+V+REKRAKEILE+ICEELAKGIGEDRAEFEEL
Sbjct: 303 RRQAERLNKSIAKELAEAKVSVSKAMKEVEREKRAKEILEQICEELAKGIGEDRAEFEEL 362

Query: 362 RKESEKVREEVEKEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFV- 421
           +KES KVREEVEKEREML LADVLREERVQMKLSEAKYQFEEKNAAVERLK QL+ YFV 
Sbjct: 363 KKESAKVREEVEKEREMLHLADVLREERVQMKLSEAKYQFEEKNAAVERLKHQLQGYFVI 422

Query: 422 ----ENRDHSQDSFNNKLDKIKELEAYLKKINFGSYN--------------KNKEEEDCN 481
               +N   +++   N+ +KIKELEAYLKKINFGS                 ++EEE+  
Sbjct: 423 GNEEQNAGENREYSCNEFEKIKELEAYLKKINFGSCQDTEKMGKKEENGDCSDEEEEEEE 482

Query: 482 WDEESDLHSIELNMDNNNKSYRWSFVHGSHNASKRNSFEKERKSLSEKIQWGSICFNSSS 541
            +EESD+HSIELNMDNNNKSYRWSFV  + N   +      RKS+SEKIQWGSIC N+S+
Sbjct: 483 EEEESDMHSIELNMDNNNKSYRWSFVEKADN--NQIQINNGRKSVSEKIQWGSICLNTSN 542

BLAST of Moc02g02310 vs. TAIR 10
Match: AT3G11590.1 (unknown protein; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G22310.1); Has 22320 Blast hits to 15179 proteins in 1213 species: Archae - 372; Bacteria - 2307; Metazoa - 10906; Fungi - 1700; Plants - 1146; Viruses - 65; Other Eukaryotes - 5824 (source: NCBI BLink). )

HSP 1 Score: 273.9 bits (699), Expect = 2.9e-73
Identity = 184/402 (45.77%), Positives = 269/402 (66.92%), Query Frame = 0

Query: 90  LSDPDPNFNNTKSEKVEGGSGRVQRRRRSACSLRIGIGEIEVGGSNFRGNDCLMEIENRS 149
           LSDP    ++  SE++E      ++RR S+   ++ +G+  VG  +   +   M+IE RS
Sbjct: 151 LSDPS---HSPVSERMERSGTGSRQRRASSTVQKLRLGDCNVGARDPINSGSFMDIETRS 210

Query: 150 EVKTTRRKKKFTVKTRLKEVSNCLTTSKELVRVLTHVWGSHDEKQPSSASSLMAALKSEL 209
            V+T        VKTRLK+ SN LTTSKEL++++  +WG  D  +PSS+ SL++AL SEL
Sbjct: 211 RVETP-TGSTVGVKTRLKDCSNALTTSKELLKIINRMWGQDD--RPSSSMSLVSALHSEL 270

Query: 210 DRAKTRVEHLMRDEQRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVER 269
           +RA+ +V  L+ + +     ++I  L KRFAEEKA WK  E+  V +AI S+A E+ VER
Sbjct: 271 ERARLQVNQLIHEHKP--ENNDISYLMKRFAEEKAVWKSNEQEVVEAAIESVAGELEVER 330

Query: 270 KLRRQAERLNKRIGKELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFE 329
           KLRR+ E LNK++GKEL E + A+ KA+K+++ EKRA+ ++E++C+ELA+ I ED+AE E
Sbjct: 331 KLRRRFESLNKKLGKELAETKSALMKAVKEIENEKRARVMVEKVCDELARDISEDKAEVE 390

Query: 330 ELRKESEKVREEVEKEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYF 389
           EL++ES KV+EEVEKEREMLQLAD LREERVQMKLSEAK+Q EEKNAAV++L++QL+ Y 
Sbjct: 391 ELKRESFKVKEEVEKEREMLQLADALREERVQMKLSEAKHQLEEKNAAVDKLRNQLQTYL 450

Query: 390 VENR--DHSQDSFNNKLDKIKELEAYLKKINFGSYNKNKEEEDCNWDE---ESDLHSIEL 449
              R  + +++    +L   +  +     I+FGSYN    E +   +E   ESDLHSIEL
Sbjct: 451 KAKRCKEKTREPPQTQLHNEEAGDYLNHHISFGSYNIEDGEVENGNEEGSGESDLHSIEL 510

Query: 450 NMDNNNKSYRWSFVHGSHNASKRNSFEKE---RKSLSEKIQW 484
           N+D  NKSY+W +  G  N  ++++  K    ++S+S+ + W
Sbjct: 511 NID--NKSYKWPY--GEENRGRKSTPRKSLSLQRSISDCVDW 540

BLAST of Moc02g02310 vs. TAIR 10
Match: AT5G22310.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G11590.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 169.1 bits (427), Expect = 1.0e-41
Identity = 167/479 (34.86%), Positives = 246/479 (51.36%), Query Frame = 0

Query: 6   QQTEKRCKIRKRKGPWSSSSSYSSLVRKYRFKKSPTWKMSTKSHSSAGKGVKLKNNNNSD 65
           +Q +K CKIRKR G   SSSS SSL R+ RFK++  +     +    G G  +K+   + 
Sbjct: 2   EQRKKGCKIRKRGG---SSSSSSSLARRNRFKRA-IFAGKRAAQDDGGSGTPVKSITAAK 61

Query: 66  A-VELEDKKELIKTGEMVSQISHSCLS-------------DPDPNFNNTKSEKVEGGSGR 125
             V L    E +       Q+  SC+S             D DP  N+ K         R
Sbjct: 62  TPVLLSFSPENLPIDH--HQLQKSCVSARKLAATLWEINDDADPPVNSDKDCLRSKKPSR 121

Query: 126 VQRRRRSACS------------LRIGIGEIEVGGSNFRGNDCLMEIENRSEVKTTRRKKK 185
            + ++ +  S             R+    I++     R      +  N  E K       
Sbjct: 122 YRAKKSTEFSSIDFPPRSSDPISRLSSERIDLCDDMIRRRSTNPQKLNPIEYKIIGAN-- 181

Query: 186 FTVKTRLKEVSNCLTTSKELVRVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHL 245
            +VKTR K VS+ LTTSKELV+VL  + G   +   ++++ L++AL  ELDRA++ ++HL
Sbjct: 182 -SVKTRFKNVSDGLTTSKELVKVLKRI-GELGDDHKTASNRLISALLCELDRARSSLKHL 241

Query: 246 MRDEQRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLN 305
           M +       DE E  ++R                   I S+ EE  VERKLRR+ E++N
Sbjct: 242 MSEL------DEEEEEKRRL------------------IESLQEEAMVERKLRRRTEKMN 301

Query: 306 KRIGKELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVR 365
           +R+G+EL EA+    K  +++ REKRAK++LEE+C+EL KGIG+D              +
Sbjct: 302 RRLGRELTEAKETERKMKEEMKREKRAKDVLEEVCDELTKGIGDD--------------K 361

Query: 366 EEVEKEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDS 425
           +E+EKEREM+ +ADVLREERVQMKL+EAK++FE+K AAVERLK +L        D  +  
Sbjct: 362 KEMEKEREMMHIADVLREERVQMKLTEAKFEFEDKYAAVERLKKELRRVL----DGEEGK 410

Query: 426 FNNKLDKIKELEAYLKKINFGSYNKNKEEEDCNWDEESDLHSIELNMDNNNKSYRWSFV 459
            ++++ +I E       +  GS + +        DEESDL SIELNM++ +K   W +V
Sbjct: 422 GSSEIRRILE-------VIDGSGSDD--------DEESDLKSIELNMESGSK---WGYV 410

BLAST of Moc02g02310 vs. TAIR 10
Match: AT1G50660.1 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G20350.1); Has 21445 Blast hits to 15134 proteins in 1325 species: Archae - 461; Bacteria - 2309; Metazoa - 11052; Fungi - 1737; Plants - 1035; Viruses - 42; Other Eukaryotes - 4809 (source: NCBI BLink). )

HSP 1 Score: 136.7 bits (343), Expect = 5.6e-32
Identity = 103/318 (32.39%), Positives = 188/318 (59.12%), Query Frame = 0

Query: 172 CLTTSKELVRVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDEQRLFHGDE 231
           CL T +E+ ++ +++       Q  +A SL+++L++EL+ A  R+E L  + ++  H  +
Sbjct: 212 CLDTMEEVHQIYSNM---KRIDQQVNAVSLVSSLEAELEEAHARIEDL--ESEKRSHKKK 271

Query: 232 IEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIGKELGEARV 291
           +E   ++ +EE+AAW+ +E  +V + I  M  ++  E+K R++ E +N ++  EL ++++
Sbjct: 272 LEQFLRKVSEERAAWRSREHEKVRAIIDDMKTDMNREKKTRQRLEIVNHKLVNELADSKL 331

Query: 292 AVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVREEVEKEREMLQL 351
           AV + M+D ++E++A+E++EE+C+ELAK IGED+AE E L++ES  +REEV+ ER MLQ+
Sbjct: 332 AVKRYMQDYEKERKARELIEEVCDELAKEIGEDKAEIEALKRESMSLREEVDDERRMLQM 391

Query: 352 ADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQD------------- 411
           A+V REERVQMKL +AK   EE+ + + +L   LE+ F+ +RD   D             
Sbjct: 392 AEVWREERVQMKLIDAKVALEERYSQMNKLVGDLES-FLRSRDIVTDVKEVREAELLRET 451

Query: 412 SFNNKLDKIKE----------LEAYLKKINFGSYNKNKEEEDCNW---DEESDLHSIELN 464
           + +  + +IKE          + A  +++N G  +  + E+   +     +S +H++ L+
Sbjct: 452 AASVNIQEIKEFTYVPANPDDIYAVFEEMNLGEAHDREMEKSVAYSPISHDSKVHTVSLD 511

BLAST of Moc02g02310 vs. TAIR 10
Match: AT3G20350.1 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: cotyledon; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G50660.1); Has 15095 Blast hits to 11224 proteins in 1051 species: Archae - 223; Bacteria - 1586; Metazoa - 7000; Fungi - 1255; Plants - 746; Viruses - 40; Other Eukaryotes - 4245 (source: NCBI BLink). )

HSP 1 Score: 119.4 bits (298), Expect = 9.2e-27
Identity = 108/349 (30.95%), Positives = 196/349 (56.16%), Query Frame = 0

Query: 172 CLTTSKELVRVLTHV-WGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDEQRLFHGD 231
           CL T  ++ ++ T+V W +      S ASS    ++ +L  A+  ++ L  + ++     
Sbjct: 189 CLDTRDDVHQIYTNVKWNNQQVNDVSLASS----IELKLQEARACIKDL--ESEKRSQKK 248

Query: 232 EIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIGKELGEAR 291
           ++E   K+ +EE+AAW+ +E  +V + I  M  ++  E+K R++ E +N ++  EL +++
Sbjct: 249 KLEQFLKKVSEERAAWRSREHEKVRAIIDDMKADMNQEKKTRQRLEIVNSKLVNELADSK 308

Query: 292 VAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVREEVEKEREMLQ 351
           +AV + M D  +E++A+E++EE+C+ELAK I ED+AE E L+ ES  +REEV+ ER MLQ
Sbjct: 309 LAVKRYMHDYQQERKARELIEEVCDELAKEIEEDKAEIEALKSESMNLREEVDDERRMLQ 368

Query: 352 LADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDSFNNKLDKIKEL 411
           +A+V REERVQMKL +AK   EEK + + +L   +EA F+ +R+ +        + ++E 
Sbjct: 369 MAEVWREERVQMKLIDAKVTLEEKYSQMNKLVGDMEA-FLSSRNTTGVKEVRVAELLRET 428

Query: 412 EA---YLKKINFGSYNKNKEEEDCNWDEESDLHSIELNMDNNNKSYRWSFVHGSHNASKR 471
            A    +++I   +Y   K ++     E+ ++     N D  ++ Y  ++   SH ASK 
Sbjct: 429 AASVDNIQEIKEFTYEPAKPDDILMLFEQMNMGE---NQDRESEQY-VAYSPVSH-ASKA 488

Query: 472 NSFEKERKSLSEKIQWGSICFNSSSKNGEFEGDGERDGEIQITHKQKSG 517
           ++   +   +++    G      + +NGEFE D    G   ++H ++ G
Sbjct: 489 HTVSPDVNLINK----GRHSNAFTDQNGEFEEDD--SGWETVSHSEEHG 519

BLAST of Moc02g02310 vs. TAIR 10
Match: AT5G41620.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast, plasma membrane; EXPRESSED IN: 9 plant structures; EXPRESSED DURING: 6 growth stages; BEST Arabidopsis thaliana protein match is: intracellular protein transport protein USO1-related (TAIR:AT1G64180.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 88.6 bits (218), Expect = 1.7e-17
Identity = 96/342 (28.07%), Positives = 178/342 (52.05%), Query Frame = 0

Query: 118 SACSLRIGIGE--IEVGGSNFRGNDCLMEIENR---SEVKTTRRKKKFTVKTRL------ 177
           SA SLR  IG+  I+   S  R N  L  +      S ++ T   K  T  + L      
Sbjct: 126 SAGSLRRQIGQMLIKHHQSIDRNNHALQPVSPASYGSSLEVTTYNKAVTPSSSLEFRGRP 185

Query: 178 -KEVSNCLTTSKELVRVLTHVWGSHDEKQPSSASSLMAALKSELDRAKTRVEHLMRDEQR 237
            +E    L TS EL++VL  +W    E+Q  S  SL+ ALK+E+  ++ R++ L+R +Q 
Sbjct: 186 SREPHYNLKTSTELLKVLNRIWSL--EEQHVSNISLIKALKTEVAHSRVRIKELLRYQQA 245

Query: 238 LFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLNKRIGKE 297
             H  E++++ K+ AEEK   K KE  R+ SA+ S+ + +  ERKLR+++E L++++ +E
Sbjct: 246 DRH--ELDSVVKQLAEEKLLSKNKEVERMSSAVQSVRKALEDERKLRKRSESLHRKMARE 305

Query: 298 LGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEELRKES--EKVREEVE 357
           L E + +++  +K+++R  ++ +++E +C+E AKGI     E   L+K++  +       
Sbjct: 306 LSEVKSSLSNCVKELERGSKSNKMMELLCDEFAKGIKSYEEEIHGLKKKNLDKDWAGRGG 365

Query: 358 KEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDSFNNK 417
            ++ +L +A+   +ER+QM+L        +  + +++L+ ++E +  E R+    +  N 
Sbjct: 366 GDQLVLHIAESWLDERMQMRLEGGDTLNGKNRSVLDKLEVEIETFLQEKRNEIPRNRRNS 425

Query: 418 LDKIKELEAYLKKINFGSYNKNKEEEDCNWDE-ESDLHSIEL 445
           L+ +           F + +    + DC  D   SD +  EL
Sbjct: 426 LESVP----------FNTLSAPPRDVDCEEDSGGSDSNCFEL 453
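
The percentages on each HSP summary line above are the matching-column count divided by the alignment length. The following is a minimal Python sketch for reading such a line and rechecking the arithmetic; the function names `parse_hsp_stats` and `percent` are hypothetical helpers for illustration and are not part of BLAST or of this database.

```python
import re

# Pattern for the Identity/Positives summary line printed under each HSP.
# "Posi?tives" also accepts the spelling "Postives" found in some exports.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract (count, alignment length, reported percent) for both fields."""
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


if __name__ == "__main__":
    # Summary line taken from the Cucumis sativus (A0A0A0KEV9) hit.
    line = "Identity = 359/609 (58.95%), Positives = 425/609 (69.79%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    for name, (count, length, reported) in stats.items():
        # Recompute the percentage and compare it with the reported value.
        print(name, count, length, reported, percent(count, length) == reported)
```

Because the denominator is the alignment length (gaps included) rather than the full protein length, percentages from hits that cover different query ranges are not directly comparable.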

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_022153077.1  5.7e-305  100.00    uncharacterized protein LOC111020667 isoform X1 [Momordica charantia]
XP_022153078.1  2.4e-279  100.00    uncharacterized protein LOC111020667 isoform X2 [Momordica charantia] >XP_022153...
XP_038900292.1  3.4e-172  64.24     uncharacterized protein At5g41620 isoform X1 [Benincasa hispida]
KAG7035923.1    5.1e-160  59.83     hypothetical protein SDJN02_02723 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022995046.1  5.1e-160  60.14     uncharacterized protein LOC111490718 isoform X1 [Cucurbita maxima] >XP_022995048...

Match Name  E-value  Identity  Description
Q66GQ2      2.5e-16  28.07     Uncharacterized protein At5g41620 OS=Arabidopsis thaliana OX=3702 GN=At5g41620 P...
F4I878      9.7e-05  32.85     Protein BRANCHLESS TRICHOME OS=Arabidopsis thaliana OX=3702 GN=BLT PE=1 SV=1

Match Name  E-value   Identity  Description
A0A6J1DHY9  2.8e-305  100.00    uncharacterized protein LOC111020667 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1DGK1  1.2e-279  100.00    uncharacterized protein LOC111020667 isoform X2 OS=Momordica charantia OX=3673 G...
A0A6J1K0X3  2.4e-160  60.14     uncharacterized protein LOC111490718 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1H037  4.6e-159  59.49     uncharacterized protein LOC111459204 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A0A0KEV9  1.9e-149  58.95     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G425750 PE=4 SV=1

Match Name   E-value  Identity  Description
AT3G11590.1  2.9e-73  45.77     unknown protein; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures;...
AT5G22310.1  1.0e-41  34.86     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G50660.1  5.6e-32  32.39     unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplas...
AT3G20350.1  9.2e-27  30.95     unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: plasma mem...
AT5G41620.1  1.7e-17  28.07     FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term   IPR Description                   Source       Source Term     Source Description    Alignment
None       No IPR available                  COILS        Coil            Coil                  coord: 364..398
None       No IPR available                  COILS        Coil            Coil                  coord: 202..222
None       No IPR available                  COILS        Coil            Coil                  coord: 293..313
None       No IPR available                  COILS        Coil            Coil                  coord: 321..352
None       No IPR available                  COILS        Coil            Coil                  coord: 261..281
None       No IPR available                  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 43..60
None       No IPR available                  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 88..106
None       No IPR available                  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 88..112
None       No IPR available                  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 41..60
None       No IPR available                  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 1..26
None       No IPR available                  PANTHER      PTHR31071:SF16  OS04G0382800 PROTEIN  coord: 10..505
IPR043424  Protein BRANCHLESS TRICHOME-like  PANTHER      PTHR31071       GB|AAF24581.1         coord: 10..505

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Moc02g02310.1  Moc02g02310.1  mRNA