Moc10g07840 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc10g07840
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionHydroxyproline-rich glycoprotein family protein
Locationchr10: 5761926 .. 5764031 (+)
RNA-Seq ExpressionMoc10g07840
SyntenyMoc10g07840
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGGCGACGTCCGGATGCGGATGCTGATGCCGATCTGAGCCCCGTGAATAACACCTTCCAAACCATAACGGCTGCCGCCGATGCGATCGCCACCGTCGATCATCGTTTCCCTCGGGCTACTGCCGTCCAGGTATTGTTGGAATTGGAATTTGTGATTTTAGCGCGGTTTCCTCGTGTGGATTGATACTGGTCCTGTTCTGGAAATTATGGTTCATTCTTTATTCCGCTAGGTAGTAGGATGATGCGATCATTGGTCTTAAGGCTGGAGCGTTTGTGTTTCTCTTCTTTCGTATGTCTTATGATATCGCGGAATCTGTGTTGTCCTCGATTGCAGAATTCTATATAAGGCTTCAATTTCCCCCTTTTTTTGGTCCTTCTGTTTGTTTTTGCTTAAAAACTCCCATCACATGGACACACGGAAAAAGGGAATCACGAGTTTCTTTTCTCTTTTTCTCTTTTTCTGTTTTATCGCATGATTATTAGATTTTTAAGGTTTGAAAAGTACACGATTAGTGTCTCGGGTTGGATGACCGTGTCAGGAAGGCAATGCTCTGTTCCAGCCAATGTTATTTGTCTTTTTGTAATGAACACTGCCTTTCCTTTTATAATCTGAAAGGCATCTCCCCATTCCTTTTTTTCAAAAAGAGAGAGAGAGAGGGAGAGGTTTTTTTAAAAAAAAATTAGCATTCTATATTGAGGTAGAGCAATGCTTTTTATATAAAGATTCCAACAAATGGGGCTGTCGTTTGTCCCAACTCTTTACACTGTTGTGTACTGATGTCATTATCACTTGTTCCTCTTTTTCTGTTTGTTTGATTACCCCCAGAAAAGGAGATGGGGCAGCTGCTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAGAGAATTGGACACGCTGTACTGGTCCCAGAACCAAGTCCTTCAACTGAACCTCCTGAAAATACATTACAATCACCAGACATTGTGCTTCCCTTTGCTGCACCTCCCTCTTCCCCTGTATCCTTCCTTCAATCAGAGCCGCCTTCTGCTACACAATCACCAACAGCCATACTCTCTTTCACTTCTCTCACTGCTAACATGTATTCCCCTGATGGGCCTTCCTCTATTTTTGCCGTTGGCCCATTTGCTCATGAAACACAGTTAGTGTCTCCACCTCTGAATTTCTCCACTGTCACTACTCAGCCATCAACTGCTCCCTTCACTCCTCCTGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTATCTGCAACCTAGCCATCAGAAAGTTGAGTCTGATCACCAATATGATCAGTTTCCTAATGATGACTTTCAATCTTATCAATTCTATCCCGGAAGCCCAGTTAGTCACCTCATATCACCACGCTCGGTCATTTCCCGTTCCGGGGCATCGTCGCCTTTGCCTGATTGTGATTTTACTCCCTCTGGTTCTTCATTTTCTAACTTCCCAATAGAAGTTCCTCCTACGCTGTTGAACCTTGACCAACATTCTATTCAGGACTGGCGACTACAGCAAAGTTCTGATTCTTGCACTCAGAATTCTGTAGGGTACAAATCTAGTAACGATTTTGTTTTGAATCCTCAAACTTCAGAATCTGTGTCAGACTACCACGCGTCAAATGAATATCATAATATTCAGATCCTCACTGATGGAAGCCAAAGGGATGAAGCTGCTGCTGCTAATCATAGATTCTCATTTGAGTTGTCTGATGAAGATGCTTTATTAAAAAGCGTAGAAAATAAACCACTGGAATCAAATGAACTTGCAGTTGCATCATCTCCAATACACGAACCACTTGAAACGGCAAAAGAAACTTCTCATGTTGGCGGTCATACCTCAAATGATACAGAAGAACAGGAAAAAGCAGATGGTGAAGAAGTACATGGGCATCAAGAAGTAGAACATCATTCCGTTACTCTTGGGACTGTGAAAGAATTCAATTTTGATAATGGCAATGGATGTGATACACTTAAGCCTAATATCAACTCAGCGTGGTGGGCTAATGGGAAGGATGCAGAGACAGAAGGTACGACCACCGGGGCCTGGTCGTTCTTTCCAATTACGCAGCAGCCAAGATGA

mRNA sequence

ATGAGGCGACGTCCGGATGCGGATGCTGATGCCGATCTGAGCCCCGTGAATAACACCTTCCAAACCATAACGGCTGCCGCCGATGCGATCGCCACCGTCGATCATCGTTTCCCTCGGGCTACTGCCGTCCAGAAAAGGAGATGGGGCAGCTGCTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAGAGAATTGGACACGCTGTACTGGTCCCAGAACCAAGTCCTTCAACTGAACCTCCTGAAAATACATTACAATCACCAGACATTGTGCTTCCCTTTGCTGCACCTCCCTCTTCCCCTGTATCCTTCCTTCAATCAGAGCCGCCTTCTGCTACACAATCACCAACAGCCATACTCTCTTTCACTTCTCTCACTGCTAACATGTATTCCCCTGATGGGCCTTCCTCTATTTTTGCCGTTGGCCCATTTGCTCATGAAACACAGTTAGTGTCTCCACCTCTGAATTTCTCCACTGTCACTACTCAGCCATCAACTGCTCCCTTCACTCCTCCTGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTATCTGCAACCTAGCCATCAGAAAGTTGAGTCTGATCACCAATATGATCAGTTTCCTAATGATGACTTTCAATCTTATCAATTCTATCCCGGAAGCCCAGTTAGTCACCTCATATCACCACGCTCGGTCATTTCCCGTTCCGGGGCATCGTCGCCTTTGCCTGATTGTGATTTTACTCCCTCTGGTTCTTCATTTTCTAACTTCCCAATAGAAGTTCCTCCTACGCTGTTGAACCTTGACCAACATTCTATTCAGGACTGGCGACTACAGCAAAGTTCTGATTCTTGCACTCAGAATTCTGTAGGGTACAAATCTAGTAACGATTTTGTTTTGAATCCTCAAACTTCAGAATCTGTGTCAGACTACCACGCGTCAAATGAATATCATAATATTCAGATCCTCACTGATGGAAGCCAAAGGGATGAAGCTGCTGCTGCTAATCATAGATTCTCATTTGAGTTGTCTGATGAAGATGCTTTATTAAAAAGCGTAGAAAATAAACCACTGGAATCAAATGAACTTGCAGTTGCATCATCTCCAATACACGAACCACTTGAAACGGCAAAAGAAACTTCTCATGTTGGCGGTCATACCTCAAATGATACAGAAGAACAGGAAAAAGCAGATGGTGAAGAAGTACATGGGCATCAAGAAGTAGAACATCATTCCGTTACTCTTGGGACTGTGAAAGAATTCAATTTTGATAATGGCAATGGATGTGATACACTTAAGCCTAATATCAACTCAGCGTGGTGGGCTAATGGGAAGGATGCAGAGACAGAAGGTACGACCACCGGGGCCTGGTCGTTCTTTCCAATTACGCAGCAGCCAAGATGA

Coding sequence (CDS)

ATGAGGCGACGTCCGGATGCGGATGCTGATGCCGATCTGAGCCCCGTGAATAACACCTTCCAAACCATAACGGCTGCCGCCGATGCGATCGCCACCGTCGATCATCGTTTCCCTCGGGCTACTGCCGTCCAGAAAAGGAGATGGGGCAGCTGCTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAGAGAATTGGACACGCTGTACTGGTCCCAGAACCAAGTCCTTCAACTGAACCTCCTGAAAATACATTACAATCACCAGACATTGTGCTTCCCTTTGCTGCACCTCCCTCTTCCCCTGTATCCTTCCTTCAATCAGAGCCGCCTTCTGCTACACAATCACCAACAGCCATACTCTCTTTCACTTCTCTCACTGCTAACATGTATTCCCCTGATGGGCCTTCCTCTATTTTTGCCGTTGGCCCATTTGCTCATGAAACACAGTTAGTGTCTCCACCTCTGAATTTCTCCACTGTCACTACTCAGCCATCAACTGCTCCCTTCACTCCTCCTGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTATCTGCAACCTAGCCATCAGAAAGTTGAGTCTGATCACCAATATGATCAGTTTCCTAATGATGACTTTCAATCTTATCAATTCTATCCCGGAAGCCCAGTTAGTCACCTCATATCACCACGCTCGGTCATTTCCCGTTCCGGGGCATCGTCGCCTTTGCCTGATTGTGATTTTACTCCCTCTGGTTCTTCATTTTCTAACTTCCCAATAGAAGTTCCTCCTACGCTGTTGAACCTTGACCAACATTCTATTCAGGACTGGCGACTACAGCAAAGTTCTGATTCTTGCACTCAGAATTCTGTAGGGTACAAATCTAGTAACGATTTTGTTTTGAATCCTCAAACTTCAGAATCTGTGTCAGACTACCACGCGTCAAATGAATATCATAATATTCAGATCCTCACTGATGGAAGCCAAAGGGATGAAGCTGCTGCTGCTAATCATAGATTCTCATTTGAGTTGTCTGATGAAGATGCTTTATTAAAAAGCGTAGAAAATAAACCACTGGAATCAAATGAACTTGCAGTTGCATCATCTCCAATACACGAACCACTTGAAACGGCAAAAGAAACTTCTCATGTTGGCGGTCATACCTCAAATGATACAGAAGAACAGGAAAAAGCAGATGGTGAAGAAGTACATGGGCATCAAGAAGTAGAACATCATTCCGTTACTCTTGGGACTGTGAAAGAATTCAATTTTGATAATGGCAATGGATGTGATACACTTAAGCCTAATATCAACTCAGCGTGGTGGGCTAATGGGAAGGATGCAGAGACAGAAGGTACGACCACCGGGGCCTGGTCGTTCTTTCCAATTACGCAGCAGCCAAGATGA

Protein sequence

MRRRPDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTAILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHLTTPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSNDFVLNPQTSESVSDYHASNEYHNIQILTDGSQRDEAAAANHRFSFELSDEDALLKSVENKPLESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSVTLGTVKEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQPR
Homology
BLAST of Moc10g07840 vs. NCBI nr
Match: XP_022136623.1 (uncharacterized protein At1g76660-like [Momordica charantia])

HSP 1 Score: 932.6 bits (2409), Expect = 1.4e-267
Identity = 469/469 (100.00%), Postives = 469/469 (100.00%), Query Frame = 0

Query: 1   MRRRPDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS 60
           MRRRPDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS
Sbjct: 1   MRRRPDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS 60

Query: 61  LKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA 120
           LKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA
Sbjct: 61  LKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA 120

Query: 121 ILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHLT 180
           ILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHLT
Sbjct: 121 ILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHLT 180

Query: 181 TPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240
           TPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA
Sbjct: 181 TPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240

Query: 241 SSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSNDF 300
           SSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSNDF
Sbjct: 241 SSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSNDF 300

Query: 301 VLNPQTSESVSDYHASNEYHNIQILTDGSQRDEAAAANHRFSFELSDEDALLKSVENKPL 360
           VLNPQTSESVSDYHASNEYHNIQILTDGSQRDEAAAANHRFSFELSDEDALLKSVENKPL
Sbjct: 301 VLNPQTSESVSDYHASNEYHNIQILTDGSQRDEAAAANHRFSFELSDEDALLKSVENKPL 360

Query: 361 ESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSVTLGTV 420
           ESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSVTLGTV
Sbjct: 361 ESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSVTLGTV 420

Query: 421 KEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQPR 470
           KEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQPR
Sbjct: 421 KEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQPR 469

BLAST of Moc10g07840 vs. NCBI nr
Match: XP_038884079.1 (uncharacterized protein LOC120075005 isoform X2 [Benincasa hispida])

HSP 1 Score: 728.0 bits (1878), Expect = 5.1e-206
Identity = 380/470 (80.85%), Postives = 413/470 (87.87%), Query Frame = 0

Query: 1   MRRRPDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS 60
           MRRR D D   D  PVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS
Sbjct: 1   MRRRTDTD---DSRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS 60

Query: 61  LKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA 120
           LKQRKRIGHAVLVPE SPS+E  EN+LQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA
Sbjct: 61  LKQRKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA 120

Query: 121 ILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHLT 180
           ++SFTSLTANMYSPDGPSSIFA+GPFAHETQLVSPPLNFST+TT+PSTAPFTPPESIHLT
Sbjct: 121 LISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLT 180

Query: 181 TPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240
           TPSSPEVPFAQ+LQP+ QK ESDHQY  FPNDDFQSYQFYPGSPVSHLISPRSVISRSGA
Sbjct: 181 TPSSPEVPFAQFLQPTLQKSESDHQY-PFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240

Query: 241 SSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSNDF 300
           SSPLPD DF   GS F NFP+EVPPTLLNLD+ SI +WR +QS+DSCTQ+S+  KSSNDF
Sbjct: 241 SSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQSTDSCTQDSIELKSSNDF 300

Query: 301 VLNPQTSESVSDYHASNEYHNIQILTDGSQRDE--AAAANHRFSFELSDEDALLKSVENK 360
           VLNPQTSES+SD+HA+NE  NIQIL DG+Q++E    A NHRFSFELSD DALL+SV +K
Sbjct: 301 VLNPQTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFSFELSDGDALLQSVGSK 360

Query: 361 PLESNELAVASSPIHEPLETAKETSHV-GGHTSNDTEEQEKADGEEVHGHQEVEHHSVTL 420
           PL+SNE+AVASSPIHEP ETAKE S V   HTSN TE + KA+ EE H HQ  EHHS+TL
Sbjct: 361 PLDSNEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEVEEAHQHQ--EHHSITL 420

Query: 421 GTVKEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQ 468
           G+VKEFNFDNGNG DT K N+NS WW N KD +TEGTT GAWSFFP+TQQ
Sbjct: 421 GSVKEFNFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFPMTQQ 464

BLAST of Moc10g07840 vs. NCBI nr
Match: KAA0044829.1 (mucin-2 [Cucumis melo var. makuwa])

HSP 1 Score: 713.8 bits (1841), Expect = 1.0e-201
Identity = 373/469 (79.53%), Postives = 402/469 (85.71%), Query Frame = 0

Query: 1   MRRRPDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS 60
           MRRR D D   D  PVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGS
Sbjct: 1   MRRRTDTD---DFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGS 60

Query: 61  LKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA 120
           LKQRKRIGHAVLVPEPSPS+EP ENTLQSPDIVLPFAAPPSSPVS LQSEPPSA QSPTA
Sbjct: 61  LKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTA 120

Query: 121 ILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHLT 180
           ++SFTSLTANMYSPDGPSSIFA+GPFAHE QLVSPPLNFST+TT+PST PFTPPESIHLT
Sbjct: 121 LISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLT 180

Query: 181 TPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240
           TPSSPEVPFAQ++ PSHQKVESD+QY  FPNDDFQSYQFYPGSPVSHLISPRSVISRSGA
Sbjct: 181 TPSSPEVPFAQFVPPSHQKVESDNQY-TFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240

Query: 241 SSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSNDF 300
           SSPLPD DF   GS F NFP++VPPTL N+D+HSI +WR +QS+DSCTQ+S+ +KSSNDF
Sbjct: 241 SSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDF 300

Query: 301 VLNPQTSESVSDYHASNEYHNIQIL-TDGSQR-DEAAAANHRFSFELSDEDALLKSVENK 360
           VLNP TSES+ D+HA+NE  NIQIL  DGS+R +E  A NHRFSFELSD D L +SV +K
Sbjct: 301 VLNPHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSK 360

Query: 361 PLESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSVTLG 420
           PLESNEL V SSPIHEP ET KE S  G HTSN  EE+ KADG+E H HQ  EHHSV LG
Sbjct: 361 PLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQ--EHHSVALG 420

Query: 421 TVKEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQ 468
           +VKEFNFDN NG DT  P INS WW N KD  TEGTTTGAWSFFP TQQ
Sbjct: 421 SVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQ 463

BLAST of Moc10g07840 vs. NCBI nr
Match: XP_008452033.1 (PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo] >TYK16635.1 mucin-2 [Cucumis melo var. makuwa])

HSP 1 Score: 711.8 bits (1836), Expect = 3.8e-201
Identity = 374/469 (79.74%), Postives = 401/469 (85.50%), Query Frame = 0

Query: 1   MRRRPDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS 60
           MRRR D D   D  PVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGS
Sbjct: 1   MRRRTDTD---DFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGS 60

Query: 61  LKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA 120
           LKQRKRIGHAVLVPEPSPS+EP ENTLQSPDIVLPFAAPPSSPVS LQSEPPSA QSPTA
Sbjct: 61  LKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTA 120

Query: 121 ILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHLT 180
           ++SFTSLTANMYSPDGPSSIFA+GPFAHE QLVSPPLNFST+TT+PST PFTPPESIHLT
Sbjct: 121 LISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLT 180

Query: 181 TPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240
           TPSSPEVPFAQ++ PS QKVESD+QY  FPNDDFQSYQFYPGSPVSHLISPRSVISRSGA
Sbjct: 181 TPSSPEVPFAQFVPPSLQKVESDNQY-TFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240

Query: 241 SSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSNDF 300
           SSPLPD DF   GS F NFP+EVPPTL NLD+HSI +WR +QS+DSCTQ+S+ +KSSNDF
Sbjct: 241 SSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDF 300

Query: 301 VLNPQTSESVSDYHASNEYHNIQIL-TDGSQR-DEAAAANHRFSFELSDEDALLKSVENK 360
           VLNP TSES+ D+HA+NE  NIQIL  DGS+R +E  A NHRFSFELSD D L +SV +K
Sbjct: 301 VLNPHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSK 360

Query: 361 PLESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSVTLG 420
           PLESNEL V SSPIHEP ET KE S  G HTSN  EE+ KADG+E H HQ  EHHSV LG
Sbjct: 361 PLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQ--EHHSVALG 420

Query: 421 TVKEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQ 468
           +VKEFNFDN NG DT  P INS WW N KD  TEGTTTGAWSFFP TQQ
Sbjct: 421 SVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQ 463

BLAST of Moc10g07840 vs. NCBI nr
Match: XP_004146564.1 (uncharacterized protein LOC101220378 isoform X1 [Cucumis sativus])

HSP 1 Score: 709.5 bits (1830), Expect = 1.9e-200
Identity = 375/470 (79.79%), Postives = 406/470 (86.38%), Query Frame = 0

Query: 1   MRRRPDADADADLSPV-NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFG 60
           MRRR D D   D  PV NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFG
Sbjct: 1   MRRRTDTD---DFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFG 60

Query: 61  SLKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPT 120
           S+KQRKRIGHAVLVPEPSPS+EP ENTLQSPDIVLPFAAPPSSPVS LQSEPPSA QSPT
Sbjct: 61  SIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPT 120

Query: 121 AILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHL 180
           A++SFTSLTANMYSPDGPSSIFA+GPFAHE QLVSPPLNFST+TT+PST PFTPPESIHL
Sbjct: 121 ALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHL 180

Query: 181 TTPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSG 240
           TTPSSPEVPFAQ++QP+  KVESD+QY  FPNDDFQSYQFYPGSPVSHLISPRSVISRSG
Sbjct: 181 TTPSSPEVPFAQFVQPTLPKVESDNQY-TFPNDDFQSYQFYPGSPVSHLISPRSVISRSG 240

Query: 241 ASSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSND 300
           ASSPLPD DF   GS F NFP+EVPPTLLNLD+HSI +WR +QS+DSCTQ+S+ +KSSND
Sbjct: 241 ASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSND 300

Query: 301 FVLNPQTSESVSDYHASNEYHNIQIL-TDGSQR-DEAAAANHRFSFELSDEDALLKSVEN 360
           FVLNPQTSES+SD+HA+NE  NIQIL  DGS++ +E  A NHRFSFELSD D LL+SV +
Sbjct: 301 FVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGS 360

Query: 361 KPLESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSVTL 420
           KPLESNELAV SSPIHEP ET KE S  G HTSN  EE+ KADG+E   HQ  EHHSVTL
Sbjct: 361 KPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDE--AHQRQEHHSVTL 420

Query: 421 GTVKEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQ 468
           G+VKEFNFDNGNG DT  PNINS WW N KD  TE T TG WSFFP+TQQ
Sbjct: 421 GSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQ 463

BLAST of Moc10g07840 vs. ExPASy Swiss-Prot
Match: Q9SRE5 (Uncharacterized protein At1g76660 OS=Arabidopsis thaliana OX=3702 GN=At1g76660 PE=2 SV=1)

HSP 1 Score: 139.4 bits (350), Expect = 1.0e-31
Identity = 136/348 (39.08%), Postives = 170/348 (48.85%), Query Frame = 0

Query: 44  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE-PSPSTEPPE--------NTLQSPDIVL 103
           Q++RWG C  ++ CF S K  KRI  A  +PE  + S   P         N   +  I L
Sbjct: 7   QRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINL 66

Query: 104 PFAAPPSSPVSFLQSEPPSATQSPTAILSFTSLTANMYSPDGP-SSIFAVGPFAHETQLV 163
              APPSSP SF  S  PS TQSP   L   SL AN  SP GP SS++A GP+AHETQLV
Sbjct: 67  SLLAPPSSPASFTNSALPSTTQSPNCYL---SLAAN--SPGGPSSSMYATGPYAHETQLV 126

Query: 164 SPPLNFSTVTTQPSTAPFT-PPESIHLTTPSSPEVPFAQYLQPSHQKVESDHQYDQFPND 223
           SPP+ FST TT+PSTAPFT PPE   LT PSSP+VP+A++L  S     S   +    ND
Sbjct: 127 SPPV-FSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGH---YND 186

Query: 224 DFQSYQFYPGSPVSHLISPRSVISRSGASSPL-PDCDFTPSGSSF-------------SN 283
              +Y  YPGSP S L SP S  S  G  SP    C  + SG++F             SN
Sbjct: 187 LQATYSLYPGSPASALRSPISRASGDGLLSPQNGKCSRSDSGNTFGYDTNGVSTPLQESN 246

Query: 284 F--PIEVPPTLLNLDQHSIQD-WRLQQSSDSCTQNSVGYKSSNDFVLN---PQTSESVSD 343
           F  P       L+ D    Q+  RL  S DS    + GY + N    N    Q  E +  
Sbjct: 247 FFCPETFAKFYLDHDPSVPQNGGRLSVSKDSDVYPTNGYGNGNQNRQNRSPKQDMEELEA 306

Query: 344 YHASNEYHNIQILTDGSQRDEAAAANHRF---SFELSDEDALLKSVEN 358
           Y AS  +   +I+T     +     +  F   ++  SD   LL+   N
Sbjct: 307 YRASFGFSADEIITTSQYVEITDVMDGSFNTSAYSPSDGQKLLRREAN 345

BLAST of Moc10g07840 vs. ExPASy TrEMBL
Match: A0A6J1C828 (uncharacterized protein At1g76660-like OS=Momordica charantia OX=3673 GN=LOC111008285 PE=4 SV=1)

HSP 1 Score: 932.6 bits (2409), Expect = 6.6e-268
Identity = 469/469 (100.00%), Postives = 469/469 (100.00%), Query Frame = 0

Query: 1   MRRRPDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS 60
           MRRRPDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS
Sbjct: 1   MRRRPDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS 60

Query: 61  LKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA 120
           LKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA
Sbjct: 61  LKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA 120

Query: 121 ILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHLT 180
           ILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHLT
Sbjct: 121 ILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHLT 180

Query: 181 TPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240
           TPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA
Sbjct: 181 TPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240

Query: 241 SSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSNDF 300
           SSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSNDF
Sbjct: 241 SSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSNDF 300

Query: 301 VLNPQTSESVSDYHASNEYHNIQILTDGSQRDEAAAANHRFSFELSDEDALLKSVENKPL 360
           VLNPQTSESVSDYHASNEYHNIQILTDGSQRDEAAAANHRFSFELSDEDALLKSVENKPL
Sbjct: 301 VLNPQTSESVSDYHASNEYHNIQILTDGSQRDEAAAANHRFSFELSDEDALLKSVENKPL 360

Query: 361 ESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSVTLGTV 420
           ESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSVTLGTV
Sbjct: 361 ESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSVTLGTV 420

Query: 421 KEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQPR 470
           KEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQPR
Sbjct: 421 KEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQPR 469

BLAST of Moc10g07840 vs. ExPASy TrEMBL
Match: A0A5A7TUB1 (Mucin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G001210 PE=4 SV=1)

HSP 1 Score: 713.8 bits (1841), Expect = 4.8e-202
Identity = 373/469 (79.53%), Postives = 402/469 (85.71%), Query Frame = 0

Query: 1   MRRRPDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS 60
           MRRR D D   D  PVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGS
Sbjct: 1   MRRRTDTD---DFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGS 60

Query: 61  LKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA 120
           LKQRKRIGHAVLVPEPSPS+EP ENTLQSPDIVLPFAAPPSSPVS LQSEPPSA QSPTA
Sbjct: 61  LKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTA 120

Query: 121 ILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHLT 180
           ++SFTSLTANMYSPDGPSSIFA+GPFAHE QLVSPPLNFST+TT+PST PFTPPESIHLT
Sbjct: 121 LISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLT 180

Query: 181 TPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240
           TPSSPEVPFAQ++ PSHQKVESD+QY  FPNDDFQSYQFYPGSPVSHLISPRSVISRSGA
Sbjct: 181 TPSSPEVPFAQFVPPSHQKVESDNQY-TFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240

Query: 241 SSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSNDF 300
           SSPLPD DF   GS F NFP++VPPTL N+D+HSI +WR +QS+DSCTQ+S+ +KSSNDF
Sbjct: 241 SSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDF 300

Query: 301 VLNPQTSESVSDYHASNEYHNIQIL-TDGSQR-DEAAAANHRFSFELSDEDALLKSVENK 360
           VLNP TSES+ D+HA+NE  NIQIL  DGS+R +E  A NHRFSFELSD D L +SV +K
Sbjct: 301 VLNPHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSK 360

Query: 361 PLESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSVTLG 420
           PLESNEL V SSPIHEP ET KE S  G HTSN  EE+ KADG+E H HQ  EHHSV LG
Sbjct: 361 PLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQ--EHHSVALG 420

Query: 421 TVKEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQ 468
           +VKEFNFDN NG DT  P INS WW N KD  TEGTTTGAWSFFP TQQ
Sbjct: 421 SVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQ 463

BLAST of Moc10g07840 vs. ExPASy TrEMBL
Match: A0A5D3CYQ2 (Mucin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G004630 PE=4 SV=1)

HSP 1 Score: 711.8 bits (1836), Expect = 1.8e-201
Identity = 374/469 (79.74%), Postives = 401/469 (85.50%), Query Frame = 0

Query: 1   MRRRPDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS 60
           MRRR D D   D  PVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGS
Sbjct: 1   MRRRTDTD---DFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGS 60

Query: 61  LKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA 120
           LKQRKRIGHAVLVPEPSPS+EP ENTLQSPDIVLPFAAPPSSPVS LQSEPPSA QSPTA
Sbjct: 61  LKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTA 120

Query: 121 ILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHLT 180
           ++SFTSLTANMYSPDGPSSIFA+GPFAHE QLVSPPLNFST+TT+PST PFTPPESIHLT
Sbjct: 121 LISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLT 180

Query: 181 TPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240
           TPSSPEVPFAQ++ PS QKVESD+QY  FPNDDFQSYQFYPGSPVSHLISPRSVISRSGA
Sbjct: 181 TPSSPEVPFAQFVPPSLQKVESDNQY-TFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240

Query: 241 SSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSNDF 300
           SSPLPD DF   GS F NFP+EVPPTL NLD+HSI +WR +QS+DSCTQ+S+ +KSSNDF
Sbjct: 241 SSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDF 300

Query: 301 VLNPQTSESVSDYHASNEYHNIQIL-TDGSQR-DEAAAANHRFSFELSDEDALLKSVENK 360
           VLNP TSES+ D+HA+NE  NIQIL  DGS+R +E  A NHRFSFELSD D L +SV +K
Sbjct: 301 VLNPHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSK 360

Query: 361 PLESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSVTLG 420
           PLESNEL V SSPIHEP ET KE S  G HTSN  EE+ KADG+E H HQ  EHHSV LG
Sbjct: 361 PLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQ--EHHSVALG 420

Query: 421 TVKEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQ 468
           +VKEFNFDN NG DT  P INS WW N KD  TEGTTTGAWSFFP TQQ
Sbjct: 421 SVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQ 463

BLAST of Moc10g07840 vs. ExPASy TrEMBL
Match: A0A1S3BSY8 (uncharacterized protein LOC103493162 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103493162 PE=4 SV=1)

HSP 1 Score: 711.8 bits (1836), Expect = 1.8e-201
Identity = 374/469 (79.74%), Postives = 401/469 (85.50%), Query Frame = 0

Query: 1   MRRRPDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS 60
           MRRR D D   D  PVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGS
Sbjct: 1   MRRRTDTD---DFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGS 60

Query: 61  LKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA 120
           LKQRKRIGHAVLVPEPSPS+EP ENTLQSPDIVLPFAAPPSSPVS LQSEPPSA QSPTA
Sbjct: 61  LKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTA 120

Query: 121 ILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHLT 180
           ++SFTSLTANMYSPDGPSSIFA+GPFAHE QLVSPPLNFST+TT+PST PFTPPESIHLT
Sbjct: 121 LISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLT 180

Query: 181 TPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240
           TPSSPEVPFAQ++ PS QKVESD+QY  FPNDDFQSYQFYPGSPVSHLISPRSVISRSGA
Sbjct: 181 TPSSPEVPFAQFVPPSLQKVESDNQY-TFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240

Query: 241 SSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSNDF 300
           SSPLPD DF   GS F NFP+EVPPTL NLD+HSI +WR +QS+DSCTQ+S+ +KSSNDF
Sbjct: 241 SSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDF 300

Query: 301 VLNPQTSESVSDYHASNEYHNIQIL-TDGSQR-DEAAAANHRFSFELSDEDALLKSVENK 360
           VLNP TSES+ D+HA+NE  NIQIL  DGS+R +E  A NHRFSFELSD D L +SV +K
Sbjct: 301 VLNPHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSK 360

Query: 361 PLESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSVTLG 420
           PLESNEL V SSPIHEP ET KE S  G HTSN  EE+ KADG+E H HQ  EHHSV LG
Sbjct: 361 PLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQ--EHHSVALG 420

Query: 421 TVKEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQ 468
           +VKEFNFDN NG DT  P INS WW N KD  TEGTTTGAWSFFP TQQ
Sbjct: 421 SVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQ 463

BLAST of Moc10g07840 vs. ExPASy TrEMBL
Match: A0A1S3BSB0 (uncharacterized protein LOC103493162 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103493162 PE=4 SV=1)

HSP 1 Score: 707.2 bits (1824), Expect = 4.5e-200
Identity = 374/470 (79.57%), Postives = 401/470 (85.32%), Query Frame = 0

Query: 1   MRRRPDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFG 60
           MRRR D D   D  PVNNTFQTITAAADAIATVDHRFPRATAV QKRRWGSC SIYWCFG
Sbjct: 1   MRRRTDTD---DFRPVNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFG 60

Query: 61  SLKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPT 120
           SLKQRKRIGHAVLVPEPSPS+EP ENTLQSPDIVLPFAAPPSSPVS LQSEPPSA QSPT
Sbjct: 61  SLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPT 120

Query: 121 AILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHL 180
           A++SFTSLTANMYSPDGPSSIFA+GPFAHE QLVSPPLNFST+TT+PST PFTPPESIHL
Sbjct: 121 ALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHL 180

Query: 181 TTPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSG 240
           TTPSSPEVPFAQ++ PS QKVESD+QY  FPNDDFQSYQFYPGSPVSHLISPRSVISRSG
Sbjct: 181 TTPSSPEVPFAQFVPPSLQKVESDNQY-TFPNDDFQSYQFYPGSPVSHLISPRSVISRSG 240

Query: 241 ASSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSND 300
           ASSPLPD DF   GS F NFP+EVPPTL NLD+HSI +WR +QS+DSCTQ+S+ +KSSND
Sbjct: 241 ASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSND 300

Query: 301 FVLNPQTSESVSDYHASNEYHNIQIL-TDGSQR-DEAAAANHRFSFELSDEDALLKSVEN 360
           FVLNP TSES+ D+HA+NE  NIQIL  DGS+R +E  A NHRFSFELSD D L +SV +
Sbjct: 301 FVLNPHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGS 360

Query: 361 KPLESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSVTL 420
           KPLESNEL V SSPIHEP ET KE S  G HTSN  EE+ KADG+E H HQ  EHHSV L
Sbjct: 361 KPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQ--EHHSVAL 420

Query: 421 GTVKEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQ 468
           G+VKEFNFDN NG DT  P INS WW N KD  TEGTTTGAWSFFP TQQ
Sbjct: 421 GSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQ 464

BLAST of Moc10g07840 vs. TAIR 10
Match: AT5G52430.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 229.9 bits (585), Expect = 4.1e-60
Identity = 182/468 (38.89%), Postives = 246/468 (52.56%), Query Frame = 0

Query: 16  VNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE 75
           VNN+ +T+ AAA AI T + R  + ++ QK RWG CWS+Y CFG+ K  KRIG+AVLVPE
Sbjct: 5   VNNSVETVNAAATAIVTAESRV-QPSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPE 64

Query: 76  PSPSTEP---PENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTAILSFTSLTANMY 135
           P  S  P    +N+  S  +VLPF APPSSP SFLQS+P S + SP   L   SLT+N +
Sbjct: 65  PVTSGVPVVTVQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPVGPL---SLTSNTF 124

Query: 136 SPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPP--ESIHLTTPSSPEVPFA 195
           SP  P S+F VGP+A+ETQ V+PP+ FS   T+PSTAP+TPP   S+H+TTPSSPEVPFA
Sbjct: 125 SPKEPQSVFTVGPYANETQPVTPPV-FSAFITEPSTAPYTPPPESSVHITTPSSPEVPFA 184

Query: 196 QYLQPSHQKVESD-----HQYDQFPNDDFQSYQFYPGSP-VSHLISPRSVISRSGASSPL 255
           Q L  S +    D     +Q     + +F+S Q  PGSP   +LISP SVIS SG SSP 
Sbjct: 185 QLLTSSLELTRRDSTSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPY 244

Query: 256 PDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSS-NDFVLN 315
                 P  S    F I  PP  L  +  + + W  +  S S T   VG+ S      L 
Sbjct: 245 ------PGKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSIT--PVGHGSGLASGALT 304

Query: 316 PQTSESVS--------DYHASNEYHNIQILTDGSQRDEAAAANHRFSFELSDEDALLKSV 375
           P   E VS         +   N+   +  L +     E   A+HR SFEL+ ED + + +
Sbjct: 305 PNGPEIVSGNLTPNNTTWPLQNQISEVASLANSDHGSEVMVADHRVSFELTGED-VARCL 364

Query: 376 ENKPLESNELAVASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGEEVHGHQEVEHHSV 435
            +K   S++        ++ +ET + +S     T      ++++   E   H+  +  S 
Sbjct: 365 ASKLNRSHD----RMNNNDRIETEESSS-----TDIRRNIEKRSGDRENEQHRIQKLSSS 424

Query: 436 TLGTVKEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFP 464
           ++G+ KEF FD                  N KD   E     +WSFFP
Sbjct: 425 SIGSSKEFKFD------------------NTKDENIEKVAGNSWSFFP 431

BLAST of Moc10g07840 vs. TAIR 10
Match: AT1G63720.1 (BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1); Has 490 Blast hits to 394 proteins in 96 species: Archae - 0; Bacteria - 2; Metazoa - 132; Fungi - 88; Plants - 175; Viruses - 14; Other Eukaryotes - 79 (source: NCBI BLink). )

HSP 1 Score: 209.1 bits (531), Expect = 7.4e-54
Identity = 134/263 (50.95%), Postives = 169/263 (64.26%), Query Frame = 0

Query: 17  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE 76
           NN F TI AAA AIA+ D R  +++ + +KR+W + WS+  CFGS +QRKRIG++VLVPE
Sbjct: 8   NNVFDTINAAASAIASSDDRLHQSSPIHKKRKWWNRWSLLKCFGSSRQRKRIGNSVLVPE 67

Query: 77  P----SPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTAILSFTSLTANM 136
           P    S ++    +  +S    LPF APPSSP SF QSEPPSATQSP  ILSF+ L  N 
Sbjct: 68  PVSMSSSNSTTSNSGYRSVITTLPFIAPPSSPASFFQSEPPSATQSPVGILSFSPLPCN- 127

Query: 137 YSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPP---ESIHL--TTPSSPE 196
                  SIFA+GP+AHETQLVSPP+ FST TT+PS+AP TPP    SI+L  TTPSSPE
Sbjct: 128 ----NRPSIFAIGPYAHETQLVSPPV-FSTYTTEPSSAPITPPLDDSSIYLTTTTPSSPE 187

Query: 197 VPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPD 256
           VPFAQ    +HQ     +++    + +FQ YQ  PGSP+  LISP      SG +SP PD
Sbjct: 188 VPFAQLFNSNHQTGSYGYKFPMSSSYEFQFYQLPPGSPLGQLISPS---PGSGPTSPFPD 247

Query: 257 CDFTPSGSSFSNFPIEVPPTLLN 270
            +     S F +F +  PP LL+
Sbjct: 248 GE----TSLFPHFQVSDPPKLLS 257

BLAST of Moc10g07840 vs. TAIR 10
Match: AT4G25620.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 199.9 bits (507), Expect = 4.5e-51
Identity = 171/469 (36.46%), Postives = 235/469 (50.11%), Query Frame = 0

Query: 17  NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEP 76
           N++  T+ AAA AI + + R  + ++VQK+R GS WS+YWCFGS K  KRIGHAVLVPEP
Sbjct: 6   NSSVDTVNAAASAIVSAESR-TQPSSVQKKR-GSWWSLYWCFGSKKNNKRIGHAVLVPEP 65

Query: 77  SPS---TEPPEN-TLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTAILSFTSLTANMY 136
           + S     P +N +  S  I +PF APPSSP SFL S PPSA+ +P   L   SLT N  
Sbjct: 66  AASGAAVAPVQNSSSNSTSIFMPFIAPPSSPASFLPSGPPSASHTPDPGL-LCSLTVN-- 125

Query: 137 SPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHLTTPSSPEVPFAQY 196
               P S F +GP+AHETQ V+PP+ FS  TT+PSTAPFTPP      +PSSPEVPFAQ 
Sbjct: 126 ---EPPSAFTIGPYAHETQPVTPPV-FSAFTTEPSTAPFTPPPE----SPSSPEVPFAQL 185

Query: 197 LQPSHQKVESD-----HQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDC 256
           L  S ++   +     +Q     + +F+S Q YPGSP  +LISP      SG SSP    
Sbjct: 186 LTSSLERARRNSGGGMNQKFSAAHYEFKSCQVYPGSPGGNLISP-----GSGTSSPY--- 245

Query: 257 DFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSNDFVLNPQTS 316
              P   S   F I  PP  L  +  + + W  +  S S T    G +  +   L P  S
Sbjct: 246 ---PGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAGQGSRLGSG-ALTPDGS 305

Query: 317 ESVSDYHASNEYHNIQILTDGSQRDEAAAANHRFSFELSDEDALLKSVENKPLESNELAV 376
           +  S     N    +  ++ G+              ++S+  +L  S       ++E  V
Sbjct: 306 KLTSGVVTPNGAETVIRMSYGNL---TPLEGSLLDSQISEVASLANSDHGSSRHNDEALV 365

Query: 377 ASSPIHEPLETAKETSHVGGHTSNDTEEQEKADGE-----------EVHGHQEVEHHSVT 436
               +   L T ++ +       N +   EKA GE           E    Q  +  S +
Sbjct: 366 VPHRVSFEL-TGEDVARCLASKLNRSGSHEKASGEHLRPNCCKTSGETESEQSQKLRSFS 425

Query: 437 LGTVKEFNFDNGNGCDTLKPNINSAWWANGKDA-ETEGTTTGAWSFFPI 465
            G+ KEF FD+ N  + +   I S WWAN K A + + +   +W+FFP+
Sbjct: 426 TGSNKEFKFDSTN--EEMIEKIRSEWWANEKVAGKGDHSPRNSWTFFPV 443

BLAST of Moc10g07840 vs. TAIR 10
Match: AT1G76660.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1); Has 353 Blast hits to 231 proteins in 60 species: Archae - 0; Bacteria - 6; Metazoa - 57; Fungi - 22; Plants - 125; Viruses - 4; Other Eukaryotes - 139 (source: NCBI BLink). )

HSP 1 Score: 139.4 bits (350), Expect = 7.2e-33
Identity = 136/348 (39.08%), Postives = 170/348 (48.85%), Query Frame = 0

Query: 44  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE-PSPSTEPPE--------NTLQSPDIVL 103
           Q++RWG C  ++ CF S K  KRI  A  +PE  + S   P         N   +  I L
Sbjct: 7   QRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINL 66

Query: 104 PFAAPPSSPVSFLQSEPPSATQSPTAILSFTSLTANMYSPDGP-SSIFAVGPFAHETQLV 163
              APPSSP SF  S  PS TQSP   L   SL AN  SP GP SS++A GP+AHETQLV
Sbjct: 67  SLLAPPSSPASFTNSALPSTTQSPNCYL---SLAAN--SPGGPSSSMYATGPYAHETQLV 126

Query: 164 SPPLNFSTVTTQPSTAPFT-PPESIHLTTPSSPEVPFAQYLQPSHQKVESDHQYDQFPND 223
           SPP+ FST TT+PSTAPFT PPE   LT PSSP+VP+A++L  S     S   +    ND
Sbjct: 127 SPPV-FSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGH---YND 186

Query: 224 DFQSYQFYPGSPVSHLISPRSVISRSGASSPL-PDCDFTPSGSSF-------------SN 283
              +Y  YPGSP S L SP S  S  G  SP    C  + SG++F             SN
Sbjct: 187 LQATYSLYPGSPASALRSPISRASGDGLLSPQNGKCSRSDSGNTFGYDTNGVSTPLQESN 246

Query: 284 F--PIEVPPTLLNLDQHSIQD-WRLQQSSDSCTQNSVGYKSSNDFVLN---PQTSESVSD 343
           F  P       L+ D    Q+  RL  S DS    + GY + N    N    Q  E +  
Sbjct: 247 FFCPETFAKFYLDHDPSVPQNGGRLSVSKDSDVYPTNGYGNGNQNRQNRSPKQDMEELEA 306

Query: 344 YHASNEYHNIQILTDGSQRDEAAAANHRF---SFELSDEDALLKSVEN 358
           Y AS  +   +I+T     +     +  F   ++  SD   LL+   N
Sbjct: 307 YRASFGFSADEIITTSQYVEITDVMDGSFNTSAYSPSDGQKLLRREAN 345

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022136623.11.4e-267100.00uncharacterized protein At1g76660-like [Momordica charantia][more]
XP_038884079.15.1e-20680.85uncharacterized protein LOC120075005 isoform X2 [Benincasa hispida][more]
KAA0044829.11.0e-20179.53mucin-2 [Cucumis melo var. makuwa][more]
XP_008452033.13.8e-20179.74PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo] >TYK16... [more]
XP_004146564.11.9e-20079.79uncharacterized protein LOC101220378 isoform X1 [Cucumis sativus][more]
Match NameE-valueIdentityDescription
Q9SRE51.0e-3139.08Uncharacterized protein At1g76660 OS=Arabidopsis thaliana OX=3702 GN=At1g76660 P... [more]
Match NameE-valueIdentityDescription
A0A6J1C8286.6e-268100.00uncharacterized protein At1g76660-like OS=Momordica charantia OX=3673 GN=LOC1110... [more]
A0A5A7TUB14.8e-20279.53Mucin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G001210 PE=4 S... [more]
A0A5D3CYQ21.8e-20179.74Mucin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G004630 PE=4 S... [more]
A0A1S3BSY81.8e-20179.74uncharacterized protein LOC103493162 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A1S3BSB04.5e-20079.57uncharacterized protein LOC103493162 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
Match NameE-valueIdentityDescription
AT5G52430.14.1e-6038.89hydroxyproline-rich glycoprotein family protein [more]
AT1G63720.17.4e-5450.95BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein fam... [more]
AT4G25620.14.5e-5136.46hydroxyproline-rich glycoprotein family protein [more]
AT1G76660.17.2e-3339.08FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 367..413
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 377..413
NoneNo IPR availablePANTHERPTHR31798:SF2HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY PROTEINcoord: 12..466
IPR040420Uncharacterized protein At1g76660-likePANTHERPTHR31798HYDROXYPROLINE-RICH GLYCOPROTEIN-LIKEcoord: 12..466

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc10g07840.1Moc10g07840.1mRNA