Bhi01G001808 (gene) Wax gourd (B227) v1

Overview
NameBhi01G001808
Typegene
OrganismBenincasa hispida (Wax gourd (B227) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Locationchr1: 57111933 .. 57118489 (-)
RNA-Seq ExpressionBhi01G001808
SyntenyBhi01G001808
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCTTCTTCCTTCTCTCTCGCTAACGAACCTTCTCTTTACTTTCTTACTGCAAAATCTCCTATTTTGATTTTGCCTAAGAATTTCACGATCCCTTTCTTGTGTATGAGCAACTACGATTCTTTCTATGAAACTGATCGGCGATTCTCTGATTACGAACAGCGATAGCGATGAGACGACGTACGGATACTGATGATTCGAGGCCTGTTAACAATACTTTCCAAACCATTACTGCCGCCGCTGATGCGATCGCCACCGTCGATCATCGTTTTCCTCGGGCTACTGCCGTCCAGGTATTGTATTCTTCACATCAGTTCACCTTCTAATTCAATCATTTGGAATTTTTAGATTTTAGTGTTGGTTACTTATGGATCGTGAATACTGACCTTGTTTATATGGAAATTACTGTGTTCATTCTTAATTCCGTTAGGTATTAAGATGTAGTTTTTTTTCGAAGGATGGGATTATATTATACCGGAATGGAATCTGTGTTGTTGGATTGTGGATTGTCGATTGTCGAATTCTATATAAGGCTTTTAATTTCCCCTTTTCTTTTTCCTGCTCGCTCCTTCTGTTTGTTTGCTTAGAAACTGTGAATCGTCACATGGAAACATGAAAAAAAGGGAATCACGAGCTGAGCTTCGTTTCTGCTTTTTCTGCTTTTTCTGCTTTTGGCTTTTCTTCTCGCATGATTATTAGATTTTTAAGGATTAGAAAGTACCAAGATTGGTGTTTATTGGGTTGGATGACTGTGTCAGGAAGCAATGTGCTGTTGTTCTGAGCCCAATGTTATTTGTCTTTATGTAATAACACTGTCCTTTATAACCAGAAAGGCATCTCTCCATTCTTTATCTTTTTCAAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGTTAGCTTTTAAAATGCATTCTATATAGAAAAGTATTGCTTTTTATATAAAGATTCCAACAAATAGGGCTCTCATTTGTCCAAACTCTTCCCACTGTTTTGTACTGATGTCATTATCAGTTTCCTCTTTTTCTCTTTGTTTGGTTACTCACAGAAAAGAAGATGGGGTAGTTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAAAGAATTGGGCATGCTGTTTTGGTACCGGAATCAAGTCCTTCTTCCGAGTCTCATGAAAATTCATTGCAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGTGTCCTTCCTTCAATCTGAGCCACCTTCTGCTACACAATCACCTACTGCTTTAATCTCTTTCACCTCTCTTACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAAACCCAACTAGTGTCTCCACCTCTGAATTTCTCTACTCTCACTACTGAACCATCAACTGCTCCCTTCACTCCTCCCGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCTTTTGCTCAGTTTCTTCAACCTACACTCCAGAAATCTGAATCTGATCATCAATATCCATTTCCTAATGATGACTTCCAATCTTACCAATTCTATCCAGGTAGTCCGGTCAGTCATCTCATATCACCACGATCAGTTATTTCTCGTTCTGGGGCTTCGTCGCCTTTGCCTGACTATGATTTTGCCTCCTTTGGTTCTCAATTTTTGAATTTCCCATTAGAAGTTCCACCTACCTTGTTGAACCTTGACAAACAGTCCATTCATAACTGGCGACAACGACAAAGTACCGATTCTTGCACTCAAGATTCTATAGAATTAAAATCGAGTAATGACTTTGTTTTGAATCCCCAAACTTCAGAATCTATGTCGGATCACCACGCAACGAATGAATCTCAAAATATCCAAATTCTCATTGATGGAAACCAAAAGGAGGAGGAGGTGCCAGGTGCTACTAACCATAGATTCTCATTCGAGTTATCTGATGGAGATGCTTTATTACAAAGCGTAGGAAGTAAGCCACTGGATTCAAATGAAGTTGCAGTTGCATCGTCTCCAATACATGAACCATTTGAAACGGCTAAAGAAAATTCTCCTGTTGATGATGACCATACTTCAAATGTTACAGAAGGAAAGACAAAAGCAGAGGTTGAAGAAGCACATCAGCATCAAGAACATCATTCCATTACTCTTGGGTCTGTGAAGGAATTCAATTTTGATAATGGCAATGGAAGTGATACACATAAGGCAAATCTAAATTCAGAATGGTGGACTAATGCAAAGGATGTTGACACAGAAGGCACGACCAACGGGGCCTGGTCATTCTTTCCAATGACGCAGCAAAGATAACTGACTGGTGCTTACTTATCTTCTAGAATCTCCTCATGCCATCATCTTTTGCAGTTGCAAATTGATAGGCAAGACAAATACAAGACGTCAAATCTTGAAAGAGCCAAACTGGAAGCCTGTTTTTTTCAACAATATGACCTAAGACAAACAAAGCCAGATATTATTAGAATGATAGAGAAATTTGTAGATTCGATAGGGCCTTATTGACAAACGATTGTGGCTCGTCACTTGAATTGTAATAGATATTAGTGGTCTGATAGAAATTGGAAGTGTATAAATATGGTAATAAAAAAGTGTTTTTTTTTTATCTTCATAATATTTCGTTTATTGATTTGAATTAGTAGAAAACATAAAAAGTTGAGAACATATTGATTTGGTATACCCAAAAATTGCAGAATCTTTTTGTAATTAGAAATGTTTCATTCTATGAAATGAATGAGTTGCAAATAGAACAAGAGATACGATTTGCAATGGTTCTGAAGTCTTTTTTCATTCAAGTCTCTCATAAGCATGAGTTGAATGATGTTTCTTTTAACATTTGAATAATTTTTTATTCATTCTTTGATATTTAAGACGTTTGGGAGAGCAAACTTCTTCTCTTATTCTCCTTACCTTGAGAGGTTTGAGTCACTTGTTTCTCGAATTAATGAGATAAAAAATGCACTTCTAAAAATAAATCGGAAGATATTGGTTGTTCCTTGAAAAATTAGAAAAGGCATGTTAATTTTTTCATCTTTACAGAGTGCTTCTCTCACCAATTTTTATTATTATTATATTTCCTGTTAGCTTTCGTCACCCTTTTTTTAAAATTTCTGTTTTTAGTTTTACCTCTCTTTTTTCCTTAAATTTCTGTTCTTGATTTTACCTTTTTTTTAATTTTGTTATTAATTTTACCACTTTCTTTTTCTATTTTAAACTACTTTTTCCTAGTAGTAAGAGATACACTCCAGATTCAATGTTTTTAGTTTTACCACATTCAGATATCAATTCAGCTAAGCTCATGTCCATAGTTGTTTATACTTTTTTAATATTTGAACATTCTCTTTCATTTCAAAGCCAAAAAGAAGAAAATGGTCCATTTGGATATCTTATCTGAGAAAGAAATAGAATAGGCATCCAGTAACCAAAACTTGTCTATAAAACAAATAAATTTCCCTTTCTAATTGTAAGTGCTTAAAAGGCGATGTTGAGCACTACAAACCCTAAAACATCGAATTATGAAATCAATAGGAAACACAATTGTACTCGTTTTCAAACAAAAGTATTACAATCGTGCACTCATATTTGTTCCCACAGGCCACATTCTGTTTATGTTTCAAGGCAAAGGGGCATTAAAAACTACAAATACTGATTTGTCTAACTTCACCATTTAAAGTGTCCATGTATTACATGAAGCCTGAGATAAGAAGTTAAGAGTAAGAAAGGTTTCTCTTAAGATTCAATTATTTTGGGCTGGTTAGATCCACACTGAAATCCCACAGTTTCTTGGCCAATTCCATATCTTTAGCATGATTGGTTGGGTTGGCTATATTACTATCAACAAAATACTCTCCACTAACCCCTTTAACTTGGGGATTCAGTGCTACATAACACTGAGTTGCCGCTCCCTACATCAAATGAAAACTTAATCACATAAAAACACCACCTAAGAGAAACAAAATAAATAGTGTTCAAGAAAATCTGGTGAAATCAAATCATTAACATACCTGCTGGACATTTTTAAGCACAAACTTAGCAACCGAATTAGTAACAGCTGTAACAAAAGAAACAAGTTATAAACAGACACCAACACATGACAGAAACTCAAGGACAAAAAATAAGCAGGAAATAAGAAAAAGGTAAACAAACGTGTGAAAGCATAATTAGTAACATGGCAGATGTTTATATTTGTTCGGAAAGCATTTTGTCAAACTAAATATGTCTAAGCCAACCTAAACACAGCTCAACTGGTTAAGGCAAAAGAAAAAAAATCAAAACATATGCCTTCGACTAAGAGGTCAGAGGTTCCAGTCCTCCCTAAATAGGCCTACACCATAGTTCAGAATAAAATACAGGAACCTTTTGAATTGTTTTCAGCAACTTTTACTATTTTGAGTTGAGAAAAAGATATCTAGGTCCAGTGCCTAGCACCCTTCTTTGGAAAGGAACATACTCAGACCAAGCATAAACCATCGCCTTCAGTCCAAATGCAAATTAAAGTGCCCTACATACTACAATCTATACAAGCATCTGATTGATGTAAAAAGAACATCTGATTGACAGATTCATGCCCAGTTGAAGTTAAAAGGCAAGCCTTATATTTTTCATTTACTAAAGCACATTTGCCTTATAAATGTAATAGGCAAGAAAGTTACCATTAATGAAACCATGGAAGCGTAGTAGATTGGTAGCAATTGCTCCCGGATGAAGGGCGTTGGCTGTTATTTCTACCCCTTCTGCCTATTGAAACAAGTAGAGAAACGATAATCGTTCATCAAATTTGACTTGACAAGATAACTTGGAACAATAATAACGAACAAAAGAAACTGCACAAAAACCAGTTAGCGAAACTCCATCAGAGACTACCAAATATATGATATCTTTCTATAAAACTGTGGAAAAAGTTTATCAGTTAACATGCCATTTGTTAGTAAAAAATTTACTCACGTGACCAAGTCATTTGATAAATTCATTAAAAAAATTAAATAAATAAATAAAAAAAGAAGATGCTGAATCATTCAAAAATGGCTGTTAAGCGCACAAGGAACGTTATTGTGCTAAAAAAATATGGATTTTGAACACATATTTGGATAAAAGTCTTGGGTAAAGGGATGTGGGAATTTCACCTAGTTCACTACTAAGTATTCTGAAAGCAATGAAATGCCTAATCTATGAAAGGATGGAAGTTTCTAGGTTCACTTGATTATTCAACCTCCTCCTAGCCCAATTATTGTACAAAAAAAATCCATGCGTCGTTTTAAGATTCAAGTTCTAAAGCTCCAGTTGTAAATTTCATAAATAAACCAATAAGCACCTTTAACCGCCTGGCAAGCTCTTTTGCATGCAATATGTTGGACAGCTTTGATTGTCCATAAGCTAAGATAGTTCTGTACCTGAAATTTTACATGATACAGAGGATTACTAAATCACATTCAAGGAAACTTTTGAAGAAGTCATCATCAACAAATAAGCAACTTAGATTGTTACTCTGATTCATCGTTGATTTTATCAAAACGAATTCCTTCACTGTATGTCAATCGGTGACCCTCTGATGACAGGTTGACAATCCTTCCCTCCTTTTTGCTTTCAAGCACAGTTTTTTTCATGGTTTCCATTAGAAGGTTTGTCAGAAGAAAATGTCCTACAAAAGAATATGAATTTGTGTTACAACTTGATCATGGGCAGGCATTATAAGAATTGTAAAGCATTAAATGAATATATTTAAGCATCAAGCGAAGCTCAATAAAAAGAATTGTACTCTCGTTACAAAGTTTTTATTGAAACATACCTAAATGATTTGTTGCAAACTGCAATTCTATGCCATCATGGGAAAGCATAAAAGGAGTAGCCATAACGCCAGCATTGTTCCTTCAATCAAAAGAAGAGAAGTATTATTATCCAAAGTCATGAAATATACACACAACAAGCCCTTCAAATAAGCCCACATAACAGGTTAACAACAAGGTTCCAGGTCCAATAAGACTGAAGGGCAACTTAACAAAGCCTACCCCCCGAAAATAATAAAATAATAACAAAATAATCTGTTTGAATATACTCCAACAGATATGTAATTCATCAATGGCTATTTTAGCTGAGTTGGATCTAAATTCCCTCTTTTTGGGGCAATTGTTAACTTTTCAGCCAGTTATCCAATAAAGTGTCATTGTCAAGACGACTAATGAAATAGAAGCACAAGAACTTTCATTCTAATAAGTCATAAAACTTGTGCATAGCATCAGACAGGTTCAACTGAGAGAATCCTAGACAAAGAATTCTACTCTACCAGCTCCAAATCAGTTCTTACATGAGAATATTCAGTGCACGGCCTGATGCAATATAATCTGCAGCAAATTTCCTTACAGATTCCATTGAAGAAAGATCTAACTCCATGATATCAATTTTAGCAGAGGGGGATTCCTTCAGTACTGCTTCTTTTACTTTTCTTCCTGCTTCAACATTCCTTACAGCCATAATGACATAGACTCCACGTAATGCAAGAACACGTGTAGTCTCTTCGCCAAGACCACTTGAAGCTCCTG

mRNA sequence

TCTTCTTCCTTCTCTCTCGCTAACGAACCTTCTCTTTACTTTCTTACTGCAAAATCTCCTATTTTGATTTTGCCTAAGAATTTCACGATCCCTTTCTTGTGTATGAGCAACTACGATTCTTTCTATGAAACTGATCGGCGATTCTCTGATTACGAACAGCGATAGCGATGAGACGACGTACGGATACTGATGATTCGAGGCCTGTTAACAATACTTTCCAAACCATTACTGCCGCCGCTGATGCGATCGCCACCGTCGATCATCGTTTTCCTCGGGCTACTGCCGTCCAGAAAAGAAGATGGGGTAGTTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAAAGAATTGGGCATGCTGTTTTGGTACCGGAATCAAGTCCTTCTTCCGAGTCTCATGAAAATTCATTGCAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGTGTCCTTCCTTCAATCTGAGCCACCTTCTGCTACACAATCACCTACTGCTTTAATCTCTTTCACCTCTCTTACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAAACCCAACTAGTGTCTCCACCTCTGAATTTCTCTACTCTCACTACTGAACCATCAACTGCTCCCTTCACTCCTCCCGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCTTTTGCTCAGTTTCTTCAACCTACACTCCAGAAATCTGAATCTGATCATCAATATCCATTTCCTAATGATGACTTCCAATCTTACCAATTCTATCCAGGTAGTCCGGTCAGTCATCTCATATCACCACGATCAGTTATTTCTCGTTCTGGGGCTTCGTCGCCTTTGCCTGACTATGATTTTGCCTCCTTTGGTTCTCAATTTTTGAATTTCCCATTAGAAGTTCCACCTACCTTGTTGAACCTTGACAAACAGTCCATTCATAACTGGCGACAACGACAAAGTACCGATTCTTGCACTCAAGATTCTATAGAATTAAAATCGAGTAATGACTTTGTTTTGAATCCCCAAACTTCAGAATCTATGTCGGATCACCACGCAACGAATGAATCTCAAAATATCCAAATTCTCATTGATGGAAACCAAAAGGAGGAGGAGGTGCCAGGTGCTACTAACCATAGATTCTCATTCGAGTTATCTGATGGAGATGCTTTATTACAAAGCGTAGGAAGTAAGCCACTGGATTCAAATGAAGTTGCAGTTGCATCGTCTCCAATACATGAACCATTTGAAACGGCTAAAGAAAATTCTCCTGTTGATGATGACCATACTTCAAATGTTACAGAAGGAAAGACAAAAGCAGAGGTTGAAGAAGCACATCAGCATCAAGAACATCATTCCATTACTCTTGGGTCTGTGAAGGAATTCAATTTTGATAATGGCAATGGAAGTGATACACATAAGGCAAATCTAAATTCAGAATGGTGGACTAATGCAAAGGATGTTGACACAGAAGGCACGACCAACGGGGCCTGGTCATTCTTTCCAATGACGCAGCAAAGATAACTGACTGGTGCTTACTTATCTTCTAGAATCTCCTCATGCCATCATCTTTTGCAGTTGCAAATTGATAGGCAAGACAAATACAAGACGTCAAATCTTGAAAGAGCCAAACTGGAAGCCTGTTTTTTTCAACAATATGACCTAAGACAAACAAAGCCAGATATTATTAGAATGATAGAGAAATTTGTAGATTCGATAGGGCCTTATTGACAAACGATTGTGGCTCGTCACTTGAATTGTAATAGATATTAGTGGTCTGATAGAAATTGGAAGTGTATAAATATGGTAATAAAAAAGTGTTTTTTTTTTATCTTCATAATATTTCGTTTATTGATTTGAATTAGTAGAAAACATAAAAAGTTGAGAACATATTGATTTGGTATACCCAAAAATTGCAGAATCTTTTTGTAATTAGAAATGTTTCATTCTATGAAATGAATGAGTTGCAAATAGAACAAGAGATACGATTTGCAATGGTTCTGAAGTCTTTTTTCATTCAAGTCTCTCATAAGCATGAGTTGAATGATGTTTCTTTTAACATTTGAATAATTTTTTATTCATTCTTTGATATTTAAGACGTTTGGGAGAGCAAACTTCTTCTCTTATTCTCCTTACCTTGAGAGGTTTGAGTCACTTGTTTCTCGAATTAATGAGATAAAAAATGCACTTCTAAAAATAAATCGGAAGATATTGGTTGTTCCTTGAAAAATTAGAAAAGGCATGTTAATTTTTTCATCTTTACAGAGTGCTTCTCTCACCAATTTTTATTATTATTATATTTCCTGTTAGCTTTCGTCACCCTTTTTTTAAAATTTCTGTTTTTAGTTTTACCTCTCTTTTTTCCTTAAATTTCTGTTCTTGATTTTACCTTTTTTTTAATTTTGTTATTAATTTTACCACTTTCTTTTTCTATTTTAAACTACTTTTTCCTAGTAGTAAGAGATACACTCCAGATTCAATGTTTTTAGTTTTACCACATTCAGATATCAATTCAGCTAAGCTCATGTCCATAGTTGTTTATACTTTTTTAATATTTGAACATTCTCTTTCATTTCAAAGCCAAAAAGAAGAAAATGGTCCATTTGGATATCTTATCTGAGAAAGAAATAGAATAGGCATCCAGTAACCAAAACTTGTCTATAAAACAAATAAATTTCCCTTTCTAATTGTAAGTGCTTAAAAGGCGATGTTGAGCACTACAAACCCTAAAACATCGAATTATGAAATCAATAGGAAACACAATTGTACTCGTTTTCAAACAAAAGTATTACAATCGTGCACTCATATTTGTTCCCACAGGCCACATTCTGTTTATGTTTCAAGGCAAAGGGGCATTAAAAACTACAAATACTGATTTGTCTAACTTCACCATTTAAAGTGTCCATGTATTACATGAAGCCTGAGATAAGAAGTTAAGAGTAAGAAAGGTTTCTCTTAAGATTCAATTATTTTGGGCTGGTTAGATCCACACTGAAATCCCACAGTTTCTTGGCCAATTCCATATCTTTAGCATGATTGGTTGGGTTGGCTATATTACTATCAACAAAATACTCTCCACTAACCCCTTTAACTTGGGGATTCAGTGCTACATAACACTGAGTTGCCGCTCCCTACATCAAATGAAAACTTAATCACATAAAAACACCACCTAAGAGAAACAAAATAAATAGTGTTCAAGAAAATCTGGTGAAATCAAATCATTAACATACCTGCTGGACATTTTTAAGCACAAACTTAGCAACCGAATTAGTAACAGCTGTAACAAAAGAAACAAGTTATAAACAGACACCAACACATGACAGAAACTCAAGGACAAAAAATAAGCAGGAAATAAGAAAAAGGTAAACAAACGTGTGAAAGCATAATTAGTAACATGGCAGATGTTTATATTTGTTCGGAAAGCATTTTGTCAAACTAAATATGTCTAAGCCAACCTAAACACAGCTCAACTGGTTAAGGCAAAAGAAAAAAAATCAAAACATATGCCTTCGACTAAGAGGTCAGAGGTTCCAGTCCTCCCTAAATAGGCCTACACCATAGTTCAGAATAAAATACAGGAACCTTTTGAATTGTTTTCAGCAACTTTTACTATTTTGAGTTGAGAAAAAGATATCTAGGTCCAGTGCCTAGCACCCTTCTTTGGAAAGGAACATACTCAGACCAAGCATAAACCATCGCCTTCAGTCCAAATGCAAATTAAAGTGCCCTACATACTACAATCTATACAAGCATCTGATTGATGTAAAAAGAACATCTGATTGACAGATTCATGCCCAGTTGAAGTTAAAAGGCAAGCCTTATATTTTTCATTTACTAAAGCACATTTGCCTTATAAATGTAATAGGCAAGAAAGTTACCATTAATGAAACCATGGAAGCGTAGTAGATTGGTAGCAATTGCTCCCGGATGAAGGGCGTTGGCTGTTATTTCTACCCCTTCTGCCTATTGAAACAAGTAGAGAAACGATAATCGTTCATCAAATTTGACTTGACAAGATAACTTGGAACAATAATAACGAACAAAAGAAACTGCACAAAAACCAGTTAGCGAAACTCCATCAGAGACTACCAAATATATGATATCTTTCTATAAAACTGTGGAAAAAGTTTATCAGTTAACATGCCATTTGTTAGTAAAAAATTTACTCACGTGACCAAGTCATTTGATAAATTCATTAAAAAAATTAAATAAATAAATAAAAAAAGAAGATGCTGAATCATTCAAAAATGGCTGTTAAGCGCACAAGGAACGTTATTGTGCTAAAAAAATATGGATTTTGAACACATATTTGGATAAAAGTCTTGGGTAAAGGGATGTGGGAATTTCACCTAGTTCACTACTAAGTATTCTGAAAGCAATGAAATGCCTAATCTATGAAAGGATGGAAGTTTCTAGGTTCACTTGATTATTCAACCTCCTCCTAGCCCAATTATTGTACAAAAAAAATCCATGCGTCGTTTTAAGATTCAAGTTCTAAAGCTCCAGTTGTAAATTTCATAAATAAACCAATAAGCACCTTTAACCGCCTGGCAAGCTCTTTTGCATGCAATATGTTGGACAGCTTTGATTGTCCATAAGCTAAGATAGTTCTGTACCTGAAATTTTACATGATACAGAGGATTACTAAATCACATTCAAGGAAACTTTTGAAGAAGTCATCATCAACAAATAAGCAACTTAGATTGTTACTCTGATTCATCGTTGATTTTATCAAAACGAATTCCTTCACTGTATGTCAATCGGTGACCCTCTGATGACAGGTTGACAATCCTTCCCTCCTTTTTGCTTTCAAGCACAGTTTTTTTCATGGTTTCCATTAGAAGGTTTGTCAGAAGAAAATGTCCTACAAAAGAATATGAATTTGTGTTACAACTTGATCATGGGCAGGCATTATAAGAATTGTAAAGCATTAAATGAATATATTTAAGCATCAAGCGAAGCTCAATAAAAAGAATTGTACTCTCGTTACAAAGTTTTTATTGAAACATACCTAAATGATTTGTTGCAAACTGCAATTCTATGCCATCATGGGAAAGCATAAAAGGAGTAGCCATAACGCCAGCATTGTTCCTTCAATCAAAAGAAGAGAAGTATTATTATCCAAAGTCATGAAATATACACACAACAAGCCCTTCAAATAAGCCCACATAACAGGTTAACAACAAGGTTCCAGGTCCAATAAGACTGAAGGGCAACTTAACAAAGCCTACCCCCCGAAAATAATAAAATAATAACAAAATAATCTGTTTGAATATACTCCAACAGATATGTAATTCATCAATGGCTATTTTAGCTGAGTTGGATCTAAATTCCCTCTTTTTGGGGCAATTGTTAACTTTTCAGCCAGTTATCCAATAAAGTGTCATTGTCAAGACGACTAATGAAATAGAAGCACAAGAACTTTCATTCTAATAAGTCATAAAACTTGTGCATAGCATCAGACAGGTTCAACTGAGAGAATCCTAGACAAAGAATTCTACTCTACCAGCTCCAAATCAGTTCTTACATGAGAATATTCAGTGCACGGCCTGATGCAATATAATCTGCAGCAAATTTCCTTACAGATTCCATTGAAGAAAGATCTAACTCCATGATATCAATTTTAGCAGAGGGGGATTCCTTCAGTACTGCTTCTTTTACTTTTCTTCCTGCTTCAACATTCCTTACAGCCATAATGACATAGACTCCACGTAATGCAAGAACACGTGTAGTCTCTTCGCCAAGACCACTTGAAGCTCCTG

Coding sequence (CDS)

ATGAGACGACGTACGGATACTGATGATTCGAGGCCTGTTAACAATACTTTCCAAACCATTACTGCCGCCGCTGATGCGATCGCCACCGTCGATCATCGTTTTCCTCGGGCTACTGCCGTCCAGAAAAGAAGATGGGGTAGTTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAAAGAATTGGGCATGCTGTTTTGGTACCGGAATCAAGTCCTTCTTCCGAGTCTCATGAAAATTCATTGCAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGTGTCCTTCCTTCAATCTGAGCCACCTTCTGCTACACAATCACCTACTGCTTTAATCTCTTTCACCTCTCTTACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAAACCCAACTAGTGTCTCCACCTCTGAATTTCTCTACTCTCACTACTGAACCATCAACTGCTCCCTTCACTCCTCCCGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCTTTTGCTCAGTTTCTTCAACCTACACTCCAGAAATCTGAATCTGATCATCAATATCCATTTCCTAATGATGACTTCCAATCTTACCAATTCTATCCAGGTAGTCCGGTCAGTCATCTCATATCACCACGATCAGTTATTTCTCGTTCTGGGGCTTCGTCGCCTTTGCCTGACTATGATTTTGCCTCCTTTGGTTCTCAATTTTTGAATTTCCCATTAGAAGTTCCACCTACCTTGTTGAACCTTGACAAACAGTCCATTCATAACTGGCGACAACGACAAAGTACCGATTCTTGCACTCAAGATTCTATAGAATTAAAATCGAGTAATGACTTTGTTTTGAATCCCCAAACTTCAGAATCTATGTCGGATCACCACGCAACGAATGAATCTCAAAATATCCAAATTCTCATTGATGGAAACCAAAAGGAGGAGGAGGTGCCAGGTGCTACTAACCATAGATTCTCATTCGAGTTATCTGATGGAGATGCTTTATTACAAAGCGTAGGAAGTAAGCCACTGGATTCAAATGAAGTTGCAGTTGCATCGTCTCCAATACATGAACCATTTGAAACGGCTAAAGAAAATTCTCCTGTTGATGATGACCATACTTCAAATGTTACAGAAGGAAAGACAAAAGCAGAGGTTGAAGAAGCACATCAGCATCAAGAACATCATTCCATTACTCTTGGGTCTGTGAAGGAATTCAATTTTGATAATGGCAATGGAAGTGATACACATAAGGCAAATCTAAATTCAGAATGGTGGACTAATGCAAAGGATGTTGACACAGAAGGCACGACCAACGGGGCCTGGTCATTCTTTCCAATGACGCAGCAAAGATAA

Protein sequence

MRRRTDTDDSRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPSSPEVPFAQFLQPTLQKSESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQSTDSCTQDSIELKSSNDFVLNPQTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFSFELSDGDALLQSVGSKPLDSNEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEVEEAHQHQEHHSITLGSVKEFNFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFPMTQQR
Homology
BLAST of Bhi01G001808 vs. TAIR 10
Match: AT5G52430.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 230.3 bits (586), Expect = 3.1e-60
Identity = 173/471 (36.73%), Postives = 244/471 (51.80%), Query Frame = 0

Query: 13  VNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE 72
           VNN+ +T+ AAA AI T + R  + ++ QK RWG CWS+Y CFG+ K  KRIG+AVLVPE
Sbjct: 5   VNNSVETVNAAATAIVTAESRV-QPSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPE 64

Query: 73  SSPSS---ESHENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTALISFTSLTANMY 132
              S     + +NS  S  +VLPF APPSSP SFLQS+P S + SP   +   SLT+N +
Sbjct: 65  PVTSGVPVVTVQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPVGPL---SLTSNTF 124

Query: 133 SPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPP--ESIHLTTPSSPEVPFA 192
           SP  P S+F +GP+A+ETQ V+PP+ FS   TEPSTAP+TPP   S+H+TTPSSPEVPFA
Sbjct: 125 SPKEPQSVFTVGPYANETQPVTPPV-FSAFITEPSTAPYTPPPESSVHITTPSSPEVPFA 184

Query: 193 QFLQPTLQKSESD------HQYPFPNDDFQSYQFYPGSP-VSHLISPRSVISRSGASSPL 252
           Q L  +L+ +  D       ++   + +F+S Q  PGSP   +LISP SVIS SG SSP 
Sbjct: 185 QLLTSSLELTRRDSTSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPY 244

Query: 253 PDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQSTDSCTQDSIELKSSNDFVLNP 312
           P        S  + F +  PP  L  +  +   W  R  + S T        ++   L P
Sbjct: 245 PG------KSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGLASG-ALTP 304

Query: 313 QTSESMSDHHATN------ESQNIQILIDGNQKEEEVPGATNHRFSFELSDGDALLQSVG 372
              E +S +   N      ++Q  ++    N          +HR SFEL+ G+ + + + 
Sbjct: 305 NGPEIVSGNLTPNNTTWPLQNQISEVASLANSDHGSEVMVADHRVSFELT-GEDVARCLA 364

Query: 373 SKPLDSNEVAVASSPIHEPFETAKENSPVDDDHTSNV-----TEGKTKAEVEEAHQHQEH 432
           SK             ++   +    N  ++ + +S+       E ++     E H+ Q+ 
Sbjct: 365 SK-------------LNRSHDRMNNNDRIETEESSSTDIRRNIEKRSGDRENEQHRIQKL 424

Query: 433 HSITLGSVKEFNFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFP 461
            S ++GS KEF FD                  N KD + E     +WSFFP
Sbjct: 425 SSSSIGSSKEFKFD------------------NTKDENIEKVAGNSWSFFP 431

BLAST of Bhi01G001808 vs. TAIR 10
Match: AT1G63720.1 (BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1); Has 490 Blast hits to 394 proteins in 96 species: Archae - 0; Bacteria - 2; Metazoa - 132; Fungi - 88; Plants - 175; Viruses - 14; Other Eukaryotes - 79 (source: NCBI BLink). )

HSP 1 Score: 210.3 bits (534), Expect = 3.3e-54
Identity = 135/263 (51.33%), Postives = 169/263 (64.26%), Query Frame = 0

Query: 14  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE 73
           NN F TI AAA AIA+ D R  +++ + +KR+W + WS+  CFGS +QRKRIG++VLVPE
Sbjct: 8   NNVFDTINAAASAIASSDDRLHQSSPIHKKRKWWNRWSLLKCFGSSRQRKRIGNSVLVPE 67

Query: 74  ----SSPSSESHENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTALISFTSLTANM 133
               SS +S +  +  +S    LPF APPSSP SF QSEPPSATQSP  ++SF+ L  N 
Sbjct: 68  PVSMSSSNSTTSNSGYRSVITTLPFIAPPSSPASFFQSEPPSATQSPVGILSFSPLPCN- 127

Query: 134 YSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPP---ESIHL--TTPSSPE 193
                  SIFAIGP+AHETQLVSPP+ FST TTEPS+AP TPP    SI+L  TTPSSPE
Sbjct: 128 ----NRPSIFAIGPYAHETQLVSPPV-FSTYTTEPSSAPITPPLDDSSIYLTTTTPSSPE 187

Query: 194 VPFAQFLQPTLQKSESDHQYPFPND-DFQSYQFYPGSPVSHLISPRSVISRSGASSPLPD 253
           VPFAQ      Q     +++P  +  +FQ YQ  PGSP+  LISP      SG +SP PD
Sbjct: 188 VPFAQLFNSNHQTGSYGYKFPMSSSYEFQFYQLPPGSPLGQLISPS---PGSGPTSPFPD 247

Query: 254 YDFASFGSQFLNFPLEVPPTLLN 266
            +     S F +F +  PP LL+
Sbjct: 248 GE----TSLFPHFQVSDPPKLLS 257

BLAST of Bhi01G001808 vs. TAIR 10
Match: AT4G25620.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 203.0 bits (515), Expect = 5.3e-52
Identity = 177/491 (36.05%), Postives = 240/491 (48.88%), Query Frame = 0

Query: 11  RPVNN-TFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVL 70
           R VNN +  T+ AAA AI + + R  + ++VQK+R GS WS+YWCFGS K  KRIGHAVL
Sbjct: 2   RSVNNSSVDTVNAAASAIVSAESR-TQPSSVQKKR-GSWWSLYWCFGSKKNNKRIGHAVL 61

Query: 71  VPESSPSSES----HENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTALISFTSLT 130
           VPE + S  +      +S  S  I +PF APPSSP SFL S PPSA+ +P   +   SLT
Sbjct: 62  VPEPAASGAAVAPVQNSSSNSTSIFMPFIAPPSSPASFLPSGPPSASHTPDPGL-LCSLT 121

Query: 131 ANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPSSPEVP 190
            N      P S F IGP+AHETQ V+PP+ FS  TTEPSTAPFTPP      +PSSPEVP
Sbjct: 122 VN-----EPPSAFTIGPYAHETQPVTPPV-FSAFTTEPSTAPFTPPPE----SPSSPEVP 181

Query: 191 FAQFLQPTLQKSE------SDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP 250
           FAQ L  +L+++        + ++   + +F+S Q YPGSP  +LISP      SG SSP
Sbjct: 182 FAQLLTSSLERARRNSGGGMNQKFSAAHYEFKSCQVYPGSPGGNLISP-----GSGTSSP 241

Query: 251 LPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQSTDS--------------CTQ 310
            P           + F +  PP  L  +  +   W  R  + S               T 
Sbjct: 242 YPG------KCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAGQGSRLGSGALTP 301

Query: 311 DSIELKSSNDFVLNPQTSESMSDHHATNESQNIQILID--------------GNQKEEEV 370
           D  +L S    V+ P  +E++      N +     L+D              G+ +  + 
Sbjct: 302 DGSKLTSG---VVTPNGAETVIRMSYGNLTPLEGSLLDSQISEVASLANSDHGSSRHNDE 361

Query: 371 PGATNHRFSFELSDGDALLQSVGSKPLDSNEVAVASSPIHEPFETAKENSPVDDDHTSNV 430
                HR SFEL+ G+ + + + SK   S     AS     P                  
Sbjct: 362 ALVVPHRVSFELT-GEDVARCLASKLNRSGSHEKASGEHLRP--------------NCCK 421

Query: 431 TEGKTKAEVEEAHQHQEHHSITLGSVKEFNFDNGNGSDTHKANLNSEWWTNAKDVDT-EG 462
           T G+T++E     Q Q+  S + GS KEF FD+ N     K  + SEWW N K     + 
Sbjct: 422 TSGETESE-----QSQKLRSFSTGSNKEFKFDSTNEEMIEK--IRSEWWANEKVAGKGDH 443

BLAST of Bhi01G001808 vs. TAIR 10
Match: AT1G76660.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1); Has 353 Blast hits to 231 proteins in 60 species: Archae - 0; Bacteria - 6; Metazoa - 57; Fungi - 22; Plants - 125; Viruses - 4; Other Eukaryotes - 139 (source: NCBI BLink). )

HSP 1 Score: 149.1 bits (375), Expect = 9.1e-36
Identity = 102/210 (48.57%), Postives = 121/210 (57.62%), Query Frame = 0

Query: 41  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPESSPSSESHENSLQSPDIV---------L 100
           Q++RWG C  ++ CF S K  KRI  A  +PE    S S  N      ++         L
Sbjct: 7   QRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINL 66

Query: 101 PFAAPPSSPVSFLQSEPPSATQSPTALISFTSLTANMYSPDGP-SSIFAIGPFAHETQLV 160
              APPSSP SF  S  PS TQSP     + SL AN  SP GP SS++A GP+AHETQLV
Sbjct: 67  SLLAPPSSPASFTNSALPSTTQSPNC---YLSLAAN--SPGGPSSSMYATGPYAHETQLV 126

Query: 161 SPPLNFSTLTTEPSTAPFT-PPESIHLTTPSSPEVPFAQFLQPTLQKSESDHQYPFPNDD 220
           SPP+ FST TTEPSTAPFT PPE   LT PSSP+VP+A+FL  ++    S   +   ND 
Sbjct: 127 SPPV-FSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGH--YNDL 186

Query: 221 FQSYQFYPGSPVSHLISPRSVISRSGASSP 240
             +Y  YPGSP S L SP S  S  G  SP
Sbjct: 187 QATYSLYPGSPASALRSPISRASGDGLLSP 208

BLAST of Bhi01G001808 vs. ExPASy Swiss-Prot
Match: Q9SRE5 (Uncharacterized protein At1g76660 OS=Arabidopsis thaliana OX=3702 GN=At1g76660 PE=2 SV=1)

HSP 1 Score: 149.1 bits (375), Expect = 1.3e-34
Identity = 102/210 (48.57%), Postives = 121/210 (57.62%), Query Frame = 0

Query: 41  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPESSPSSESHENSLQSPDIV---------L 100
           Q++RWG C  ++ CF S K  KRI  A  +PE    S S  N      ++         L
Sbjct: 7   QRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINL 66

Query: 101 PFAAPPSSPVSFLQSEPPSATQSPTALISFTSLTANMYSPDGP-SSIFAIGPFAHETQLV 160
              APPSSP SF  S  PS TQSP     + SL AN  SP GP SS++A GP+AHETQLV
Sbjct: 67  SLLAPPSSPASFTNSALPSTTQSPNC---YLSLAAN--SPGGPSSSMYATGPYAHETQLV 126

Query: 161 SPPLNFSTLTTEPSTAPFT-PPESIHLTTPSSPEVPFAQFLQPTLQKSESDHQYPFPNDD 220
           SPP+ FST TTEPSTAPFT PPE   LT PSSP+VP+A+FL  ++    S   +   ND 
Sbjct: 127 SPPV-FSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGH--YNDL 186

Query: 221 FQSYQFYPGSPVSHLISPRSVISRSGASSP 240
             +Y  YPGSP S L SP S  S  G  SP
Sbjct: 187 QATYSLYPGSPASALRSPISRASGDGLLSP 208

BLAST of Bhi01G001808 vs. NCBI nr
Match: XP_038884079.1 (uncharacterized protein LOC120075005 isoform X2 [Benincasa hispida])

HSP 1 Score: 923.7 bits (2386), Expect = 6.3e-265
Identity = 465/465 (100.00%), Postives = 465/465 (100.00%), Query Frame = 0

Query: 1   MRRRTDTDDSRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQ 60
           MRRRTDTDDSRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQ
Sbjct: 1   MRRRTDTDDSRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQ 60

Query: 61  RKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTALIS 120
           RKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTALIS
Sbjct: 61  RKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTALIS 120

Query: 121 FTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPS 180
           FTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPS
Sbjct: 121 FTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPS 180

Query: 181 SPEVPFAQFLQPTLQKSESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240
           SPEVPFAQFLQPTLQKSESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL
Sbjct: 181 SPEVPFAQFLQPTLQKSESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240

Query: 241 PDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQSTDSCTQDSIELKSSNDFVLNP 300
           PDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQSTDSCTQDSIELKSSNDFVLNP
Sbjct: 241 PDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQSTDSCTQDSIELKSSNDFVLNP 300

Query: 301 QTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFSFELSDGDALLQSVGSKPLDS 360
           QTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFSFELSDGDALLQSVGSKPLDS
Sbjct: 301 QTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFSFELSDGDALLQSVGSKPLDS 360

Query: 361 NEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEVEEAHQHQEHHSITLGSVKEF 420
           NEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEVEEAHQHQEHHSITLGSVKEF
Sbjct: 361 NEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEVEEAHQHQEHHSITLGSVKEF 420

Query: 421 NFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFPMTQQR 466
           NFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFPMTQQR
Sbjct: 421 NFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFPMTQQR 465

BLAST of Bhi01G001808 vs. NCBI nr
Match: XP_038884072.1 (uncharacterized protein LOC120075005 isoform X1 [Benincasa hispida])

HSP 1 Score: 847.8 bits (2189), Expect = 4.4e-242
Identity = 425/425 (100.00%), Postives = 425/425 (100.00%), Query Frame = 0

Query: 41  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSP 100
           QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSP
Sbjct: 45  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSP 104

Query: 101 VSFLQSEPPSATQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLT 160
           VSFLQSEPPSATQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLT
Sbjct: 105 VSFLQSEPPSATQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLT 164

Query: 161 TEPSTAPFTPPESIHLTTPSSPEVPFAQFLQPTLQKSESDHQYPFPNDDFQSYQFYPGSP 220
           TEPSTAPFTPPESIHLTTPSSPEVPFAQFLQPTLQKSESDHQYPFPNDDFQSYQFYPGSP
Sbjct: 165 TEPSTAPFTPPESIHLTTPSSPEVPFAQFLQPTLQKSESDHQYPFPNDDFQSYQFYPGSP 224

Query: 221 VSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQST 280
           VSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQST
Sbjct: 225 VSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQST 284

Query: 281 DSCTQDSIELKSSNDFVLNPQTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFS 340
           DSCTQDSIELKSSNDFVLNPQTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFS
Sbjct: 285 DSCTQDSIELKSSNDFVLNPQTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFS 344

Query: 341 FELSDGDALLQSVGSKPLDSNEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEV 400
           FELSDGDALLQSVGSKPLDSNEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEV
Sbjct: 345 FELSDGDALLQSVGSKPLDSNEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEV 404

Query: 401 EEAHQHQEHHSITLGSVKEFNFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFP 460
           EEAHQHQEHHSITLGSVKEFNFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFP
Sbjct: 405 EEAHQHQEHHSITLGSVKEFNFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFP 464

Query: 461 MTQQR 466
           MTQQR
Sbjct: 465 MTQQR 469

BLAST of Bhi01G001808 vs. NCBI nr
Match: XP_004146564.1 (uncharacterized protein LOC101220378 isoform X1 [Cucumis sativus])

HSP 1 Score: 813.9 bits (2101), Expect = 7.0e-232
Identity = 418/466 (89.70%), Postives = 429/466 (92.06%), Query Frame = 0

Query: 1   MRRRTDTDDSRPV-NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLK 60
           MRRRTDTDD RPV NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGS+K
Sbjct: 1   MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIK 60

Query: 61  QRKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTALI 120
           QRKRIGHAVLVPE SPSSE HEN+LQSPDIVLPFAAPPSSPVS LQSEPPSA QSPTALI
Sbjct: 61  QRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALI 120

Query: 121 SFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTP 180
           SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTP
Sbjct: 121 SFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTP 180

Query: 181 SSPEVPFAQFLQPTLQKSESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP 240
           SSPEVPFAQF+QPTL K ESD+QY FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP
Sbjct: 181 SSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP 240

Query: 241 LPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQSTDSCTQDSIELKSSNDFVLN 300
           LPDYDFASFGSQFLNFPLEVPPTLLNLDK SIHNWRQRQSTDSCTQDSIE KSSNDFVLN
Sbjct: 241 LPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN 300

Query: 301 PQTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFSFELSDGDALLQSVGSKPLD 360
           PQTSESMSDHHATNESQNIQILID   K+EE PGATNHRFSFELSDGD LLQSVGSKPL+
Sbjct: 301 PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLE 360

Query: 361 SNEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEVEEAHQHQEHHSITLGSVKE 420
           SNE+AV SSPIHEPFET KENSP   DHTSNV E KTKA+ +EAHQ QEHHS+TLGSVKE
Sbjct: 361 SNELAVESSPIHEPFETTKENSP-HGDHTSNVIEEKTKADGDEAHQRQEHHSVTLGSVKE 420

Query: 421 FNFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFPMTQQR 466
           FNFDNGNGSDTH  N+NSEWW NAKD  TE T  G WSFFPMTQQR
Sbjct: 421 FNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR 464

BLAST of Bhi01G001808 vs. NCBI nr
Match: XP_008452033.1 (PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo] >TYK16635.1 mucin-2 [Cucumis melo var. makuwa])

HSP 1 Score: 811.6 bits (2095), Expect = 3.5e-231
Identity = 413/465 (88.82%), Postives = 424/465 (91.18%), Query Frame = 0

Query: 1   MRRRTDTDDSRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQ 60
           MRRRTDTDD RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQ
Sbjct: 1   MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQ 60

Query: 61  RKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTALIS 120
           RKRIGHAVLVPE SPSSE HEN+LQSPDIVLPFAAPPSSPVS LQSEPPSA QSPTALIS
Sbjct: 61  RKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALIS 120

Query: 121 FTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPS 180
           FTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTPS
Sbjct: 121 FTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPS 180

Query: 181 SPEVPFAQFLQPTLQKSESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240
           SPEVPFAQF+ P+LQK ESD+QY FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL
Sbjct: 181 SPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240

Query: 241 PDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQSTDSCTQDSIELKSSNDFVLNP 300
           PDYDFASFGSQFLNFPLEVPPTL NLDK SIHNWRQRQSTDSCTQDSIE KSSNDFVLNP
Sbjct: 241 PDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP 300

Query: 301 QTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFSFELSDGDALLQSVGSKPLDS 360
            TSESM DHHATNESQNIQILID   K EE PGATNHRFSFELSDGD L QSVGSKPL+S
Sbjct: 301 HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLES 360

Query: 361 NEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEVEEAHQHQEHHSITLGSVKEF 420
           NE+ V SSPIHEPFET KENSP   DHTSNV E KTKA+ +EAHQHQEHHS+ LGSVKEF
Sbjct: 361 NELPVESSPIHEPFETTKENSP-HGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKEF 420

Query: 421 NFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFPMTQQR 466
           NFDN NGSDTH   +NS+WWTNAKD  TEGTT GAWSFFP TQQR
Sbjct: 421 NFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 464

BLAST of Bhi01G001808 vs. NCBI nr
Match: XP_008452032.1 (PREDICTED: uncharacterized protein LOC103493162 isoform X1 [Cucumis melo])

HSP 1 Score: 807.0 bits (2083), Expect = 8.6e-230
Identity = 413/466 (88.63%), Postives = 424/466 (90.99%), Query Frame = 0

Query: 1   MRRRTDTDDSRPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLK 60
           MRRRTDTDD RPVNNTFQTITAAADAIATVDHRFPRATAV QKRRWGSC SIYWCFGSLK
Sbjct: 1   MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSLK 60

Query: 61  QRKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTALI 120
           QRKRIGHAVLVPE SPSSE HEN+LQSPDIVLPFAAPPSSPVS LQSEPPSA QSPTALI
Sbjct: 61  QRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALI 120

Query: 121 SFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTP 180
           SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTP
Sbjct: 121 SFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTP 180

Query: 181 SSPEVPFAQFLQPTLQKSESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP 240
           SSPEVPFAQF+ P+LQK ESD+QY FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP
Sbjct: 181 SSPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP 240

Query: 241 LPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQSTDSCTQDSIELKSSNDFVLN 300
           LPDYDFASFGSQFLNFPLEVPPTL NLDK SIHNWRQRQSTDSCTQDSIE KSSNDFVLN
Sbjct: 241 LPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN 300

Query: 301 PQTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFSFELSDGDALLQSVGSKPLD 360
           P TSESM DHHATNESQNIQILID   K EE PGATNHRFSFELSDGD L QSVGSKPL+
Sbjct: 301 PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLE 360

Query: 361 SNEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEVEEAHQHQEHHSITLGSVKE 420
           SNE+ V SSPIHEPFET KENSP   DHTSNV E KTKA+ +EAHQHQEHHS+ LGSVKE
Sbjct: 361 SNELPVESSPIHEPFETTKENSP-HGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKE 420

Query: 421 FNFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFPMTQQR 466
           FNFDN NGSDTH   +NS+WWTNAKD  TEGTT GAWSFFP TQQR
Sbjct: 421 FNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 465

BLAST of Bhi01G001808 vs. ExPASy TrEMBL
Match: A0A5D3CYQ2 (Mucin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G004630 PE=4 SV=1)

HSP 1 Score: 811.6 bits (2095), Expect = 1.7e-231
Identity = 413/465 (88.82%), Postives = 424/465 (91.18%), Query Frame = 0

Query: 1   MRRRTDTDDSRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQ 60
           MRRRTDTDD RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQ
Sbjct: 1   MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQ 60

Query: 61  RKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTALIS 120
           RKRIGHAVLVPE SPSSE HEN+LQSPDIVLPFAAPPSSPVS LQSEPPSA QSPTALIS
Sbjct: 61  RKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALIS 120

Query: 121 FTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPS 180
           FTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTPS
Sbjct: 121 FTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPS 180

Query: 181 SPEVPFAQFLQPTLQKSESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240
           SPEVPFAQF+ P+LQK ESD+QY FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL
Sbjct: 181 SPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240

Query: 241 PDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQSTDSCTQDSIELKSSNDFVLNP 300
           PDYDFASFGSQFLNFPLEVPPTL NLDK SIHNWRQRQSTDSCTQDSIE KSSNDFVLNP
Sbjct: 241 PDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP 300

Query: 301 QTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFSFELSDGDALLQSVGSKPLDS 360
            TSESM DHHATNESQNIQILID   K EE PGATNHRFSFELSDGD L QSVGSKPL+S
Sbjct: 301 HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLES 360

Query: 361 NEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEVEEAHQHQEHHSITLGSVKEF 420
           NE+ V SSPIHEPFET KENSP   DHTSNV E KTKA+ +EAHQHQEHHS+ LGSVKEF
Sbjct: 361 NELPVESSPIHEPFETTKENSP-HGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKEF 420

Query: 421 NFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFPMTQQR 466
           NFDN NGSDTH   +NS+WWTNAKD  TEGTT GAWSFFP TQQR
Sbjct: 421 NFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 464

BLAST of Bhi01G001808 vs. ExPASy TrEMBL
Match: A0A1S3BSY8 (uncharacterized protein LOC103493162 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103493162 PE=4 SV=1)

HSP 1 Score: 811.6 bits (2095), Expect = 1.7e-231
Identity = 413/465 (88.82%), Postives = 424/465 (91.18%), Query Frame = 0

Query: 1   MRRRTDTDDSRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQ 60
           MRRRTDTDD RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQ
Sbjct: 1   MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQ 60

Query: 61  RKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTALIS 120
           RKRIGHAVLVPE SPSSE HEN+LQSPDIVLPFAAPPSSPVS LQSEPPSA QSPTALIS
Sbjct: 61  RKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALIS 120

Query: 121 FTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPS 180
           FTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTPS
Sbjct: 121 FTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPS 180

Query: 181 SPEVPFAQFLQPTLQKSESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240
           SPEVPFAQF+ P+LQK ESD+QY FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL
Sbjct: 181 SPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240

Query: 241 PDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQSTDSCTQDSIELKSSNDFVLNP 300
           PDYDFASFGSQFLNFPLEVPPTL NLDK SIHNWRQRQSTDSCTQDSIE KSSNDFVLNP
Sbjct: 241 PDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP 300

Query: 301 QTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFSFELSDGDALLQSVGSKPLDS 360
            TSESM DHHATNESQNIQILID   K EE PGATNHRFSFELSDGD L QSVGSKPL+S
Sbjct: 301 HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLES 360

Query: 361 NEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEVEEAHQHQEHHSITLGSVKEF 420
           NE+ V SSPIHEPFET KENSP   DHTSNV E KTKA+ +EAHQHQEHHS+ LGSVKEF
Sbjct: 361 NELPVESSPIHEPFETTKENSP-HGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKEF 420

Query: 421 NFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFPMTQQR 466
           NFDN NGSDTH   +NS+WWTNAKD  TEGTT GAWSFFP TQQR
Sbjct: 421 NFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 464

BLAST of Bhi01G001808 vs. ExPASy TrEMBL
Match: A0A1S3BSB0 (uncharacterized protein LOC103493162 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103493162 PE=4 SV=1)

HSP 1 Score: 807.0 bits (2083), Expect = 4.2e-230
Identity = 413/466 (88.63%), Postives = 424/466 (90.99%), Query Frame = 0

Query: 1   MRRRTDTDDSRPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLK 60
           MRRRTDTDD RPVNNTFQTITAAADAIATVDHRFPRATAV QKRRWGSC SIYWCFGSLK
Sbjct: 1   MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSLK 60

Query: 61  QRKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTALI 120
           QRKRIGHAVLVPE SPSSE HEN+LQSPDIVLPFAAPPSSPVS LQSEPPSA QSPTALI
Sbjct: 61  QRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALI 120

Query: 121 SFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTP 180
           SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTP
Sbjct: 121 SFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTP 180

Query: 181 SSPEVPFAQFLQPTLQKSESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP 240
           SSPEVPFAQF+ P+LQK ESD+QY FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP
Sbjct: 181 SSPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSP 240

Query: 241 LPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQSTDSCTQDSIELKSSNDFVLN 300
           LPDYDFASFGSQFLNFPLEVPPTL NLDK SIHNWRQRQSTDSCTQDSIE KSSNDFVLN
Sbjct: 241 LPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN 300

Query: 301 PQTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFSFELSDGDALLQSVGSKPLD 360
           P TSESM DHHATNESQNIQILID   K EE PGATNHRFSFELSDGD L QSVGSKPL+
Sbjct: 301 PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLE 360

Query: 361 SNEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEVEEAHQHQEHHSITLGSVKE 420
           SNE+ V SSPIHEPFET KENSP   DHTSNV E KTKA+ +EAHQHQEHHS+ LGSVKE
Sbjct: 361 SNELPVESSPIHEPFETTKENSP-HGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKE 420

Query: 421 FNFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFPMTQQR 466
           FNFDN NGSDTH   +NS+WWTNAKD  TEGTT GAWSFFP TQQR
Sbjct: 421 FNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 465

BLAST of Bhi01G001808 vs. ExPASy TrEMBL
Match: A0A5A7TUB1 (Mucin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G001210 PE=4 SV=1)

HSP 1 Score: 806.6 bits (2082), Expect = 5.4e-230
Identity = 410/465 (88.17%), Postives = 423/465 (90.97%), Query Frame = 0

Query: 1   MRRRTDTDDSRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQ 60
           MRRRTDTDD RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQ
Sbjct: 1   MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQ 60

Query: 61  RKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTALIS 120
           RKRIGHAVLVPE SPSSE HEN+LQSPDIVLPFAAPPSSPVS LQSEPPSA QSPTALIS
Sbjct: 61  RKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALIS 120

Query: 121 FTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLTTPS 180
           FTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTPS
Sbjct: 121 FTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPS 180

Query: 181 SPEVPFAQFLQPTLQKSESDHQYPFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240
           SPEVPFAQF+ P+ QK ESD+QY FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL
Sbjct: 181 SPEVPFAQFVPPSHQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPL 240

Query: 241 PDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQSTDSCTQDSIELKSSNDFVLNP 300
           PDYDFASFGSQFLNFPL+VPPTL N+DK SIHNWRQRQSTDSCTQDSIE KSSNDFVLNP
Sbjct: 241 PDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP 300

Query: 301 QTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFSFELSDGDALLQSVGSKPLDS 360
            TSESM DHHATNESQNIQILID   K EE PGATNHRFSFELSDGD L QSVGSKPL+S
Sbjct: 301 HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLES 360

Query: 361 NEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEVEEAHQHQEHHSITLGSVKEF 420
           NE+ V SSPIHEPFET KENSP   DHTSNV E KTKA+ +EAHQHQEHHS+ LGSVKEF
Sbjct: 361 NELPVESSPIHEPFETTKENSP-HGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKEF 420

Query: 421 NFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFPMTQQR 466
           NFDN NGSDTH   +NS+WWTNAKD  TEGTT GAWSFFP TQQR
Sbjct: 421 NFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR 464

BLAST of Bhi01G001808 vs. ExPASy TrEMBL
Match: A0A6J1C828 (uncharacterized protein At1g76660-like OS=Momordica charantia OX=3673 GN=LOC111008285 PE=4 SV=1)

HSP 1 Score: 732.3 bits (1889), Expect = 1.3e-207
Identity = 380/470 (80.85%), Postives = 413/470 (87.87%), Query Frame = 0

Query: 1   MRRRTDTD---DSRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS 60
           MRRR D D   D  PVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS
Sbjct: 1   MRRRPDADADADLSPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGS 60

Query: 61  LKQRKRIGHAVLVPESSPSSESHENSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA 120
           LKQRKRIGHAVLVPE SPS+E  EN+LQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA
Sbjct: 61  LKQRKRIGHAVLVPEPSPSTEPPENTLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPTA 120

Query: 121 LISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTAPFTPPESIHLT 180
           ++SFTSLTANMYSPDGPSSIFA+GPFAHETQLVSPPLNFST+TT+PSTAPFTPPESIHLT
Sbjct: 121 ILSFTSLTANMYSPDGPSSIFAVGPFAHETQLVSPPLNFSTVTTQPSTAPFTPPESIHLT 180

Query: 181 TPSSPEVPFAQFLQPTLQKSESDHQY-PFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240
           TPSSPEVPFAQ+LQP+ QK ESDHQY  FPNDDFQSYQFYPGSPVSHLISPRSVISRSGA
Sbjct: 181 TPSSPEVPFAQYLQPSHQKVESDHQYDQFPNDDFQSYQFYPGSPVSHLISPRSVISRSGA 240

Query: 241 SSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWRQRQSTDSCTQDSIELKSSNDF 300
           SSPLPD DF   GS F NFP+EVPPTLLNLD+ SI +WR +QS+DSCTQ+S+  KSSNDF
Sbjct: 241 SSPLPDCDFTPSGSSFSNFPIEVPPTLLNLDQHSIQDWRLQQSSDSCTQNSVGYKSSNDF 300

Query: 301 VLNPQTSESMSDHHATNESQNIQILIDGNQKEEEVPGATNHRFSFELSDGDALLQSVGSK 360
           VLNPQTSES+SD+HA+NE  NIQIL DG+Q++E    A NHRFSFELSD DALL+SV +K
Sbjct: 301 VLNPQTSESVSDYHASNEYHNIQILTDGSQRDE--AAAANHRFSFELSDEDALLKSVENK 360

Query: 361 PLDSNEVAVASSPIHEPFETAKENSPVDDDHTSNVTEGKTKAEVEEAHQHQ--EHHSITL 420
           PL+SNE+AVASSPIHEP ETAKE S V   HTSN TE + KA+ EE H HQ  EHHS+TL
Sbjct: 361 PLESNELAVASSPIHEPLETAKETSHV-GGHTSNDTEEQEKADGEEVHGHQEVEHHSVTL 420

Query: 421 GSVKEFNFDNGNGSDTHKANLNSEWWTNAKDVDTEGTTNGAWSFFPMTQQ 465
           G+VKEFNFDNGNG DT K N+NS WW N KD +TEGTT GAWSFFP+TQQ
Sbjct: 421 GTVKEFNFDNGNGCDTLKPNINSAWWANGKDAETEGTTTGAWSFFPITQQ 467

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AT5G52430.13.1e-6036.73hydroxyproline-rich glycoprotein family protein [more]
AT1G63720.13.3e-5451.33BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein fam... [more]
AT4G25620.15.3e-5236.05hydroxyproline-rich glycoprotein family protein [more]
AT1G76660.19.1e-3648.57FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... [more]
Match NameE-valueIdentityDescription
Q9SRE51.3e-3448.57Uncharacterized protein At1g76660 OS=Arabidopsis thaliana OX=3702 GN=At1g76660 P... [more]
Match NameE-valueIdentityDescription
XP_038884079.16.3e-265100.00uncharacterized protein LOC120075005 isoform X2 [Benincasa hispida][more]
XP_038884072.14.4e-242100.00uncharacterized protein LOC120075005 isoform X1 [Benincasa hispida][more]
XP_004146564.17.0e-23289.70uncharacterized protein LOC101220378 isoform X1 [Cucumis sativus][more]
XP_008452033.13.5e-23188.82PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo] >TYK16... [more]
XP_008452032.18.6e-23088.63PREDICTED: uncharacterized protein LOC103493162 isoform X1 [Cucumis melo][more]
Match NameE-valueIdentityDescription
A0A5D3CYQ21.7e-23188.82Mucin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G004630 PE=4 S... [more]
A0A1S3BSY81.7e-23188.82uncharacterized protein LOC103493162 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A1S3BSB04.2e-23088.63uncharacterized protein LOC103493162 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A5A7TUB15.4e-23088.17Mucin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G001210 PE=4 S... [more]
A0A6J1C8281.3e-20780.85uncharacterized protein At1g76660-like OS=Momordica charantia OX=3673 GN=LOC1110... [more]
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 367..410
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 160..181
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 377..410
NoneNo IPR availablePANTHERPTHR31798:SF2HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY PROTEINcoord: 9..463
IPR040420Uncharacterized protein At1g76660-likePANTHERPTHR31798HYDROXYPROLINE-RICH GLYCOPROTEIN-LIKEcoord: 9..463

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Bhi01M001808Bhi01M001808mRNA