Bhi10G000196 (gene) Wax gourd

Name: Bhi10G000196
Type: gene
Organism: Benincasa hispida (Wax gourd)
Description: Hydroxyproline O-arabinosyltransferase
Location: chr10 : 5804284 .. 5808413 (-)
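
The location field can be read programmatically. The snippet below is a minimal sketch that assumes the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function name `parse_location` is hypothetical.

```python
import re

# Pattern for strings such as "chr10 : 5804284 .. 5808413 (-)".
LOCATION = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")

def parse_location(text):
    """Return (chromosome, start, end, strand, length) for a location string.

    Assumption: start and end are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = LOCATION.search(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return chrom, start, end, strand, end - start + 1

chrom, start, end, strand, length = parse_location("chr10 : 5804284 .. 5808413 (-)")
print(chrom, strand, length)
```

Under that assumption the gene spans 4130 bp on the minus strand of chr10.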
The following sequences are available for this feature:

Gene sequence (with intron)

TTTTGATTCCACTCAAACTAAACTTTTTTGTCTAAATGGGAATTTAATTTAAAACAAAAAAGGAAAAAAGGAAAAAGCTTTTAAATTGATCCATAAAAGTTTGTCGTAATCCAAAATGTCGCACCAAAAAGATGGCCCCATGTATATATATCACAAGGAACACGGCTGGTTACCACATAGATTTCCATAAAAAAAATTGACCCACAACCTTATTTTTCTTTAATTTTTTATGGGCTTCGAAAGTTTTTGCGAATCCCCCACGTTTCGCCCATTTTCCCCTTCGTTTTTGTTTTCTTCTTTGGTCTAAAGGGTTGGCGTTGAAATACCCTTTTGAACCCAAATTCTATTGACTTTTTTCTTTCTGAACTTTCTTCTGGATTTCTGGGTTCCTTCGTTATTTGTACGTTTATTTCTTATGGTATGAATCAATGTCGTTTCAGTTACTCTGTAGTTTCCGTTTCTTTCATCTTCTTATTTGACTGATTATCTCCATTTCCCATTTCCTTTTTCATCTGGGTTGGTTTTTGTGTTTGTACTCTGTTGAGTTTTGACGGCTTCAGCTGAATTTCTGAATTCCAGATCTGGGTTTGTCATTTCACATTTTGATATGTTTATATATGCATGCCCTTCCTCTTTTTTAGCCATTTTGTTCTCTGATATTCCTTTTGTTGTGTTGAATTTAGTTGAATGTTTGAGGATGAGTAGATTTTGAAGTTATTGTGCTGTGCTGCCACTGATGTAGCTTCATTAGTTTAGTTGTGGTGCTCCTTACATTTTATCCGTTGATTTTCTGATGCAGCATATACATGTCCTTCCTCTTTTATAGCTATGTTGTTCTCTGGTATTCCTTTTGTTGTGTTGATTTTAGTTGAATGTTTGGGGATTAGTAGATTGTGAAGTTATTGTGCTATGCTGCCATTGATGTAGCTTCATTAGTTTAGTTGTGGTGCTCCTTACATTTTACTTGTTTGATTTTCTGGTGTAGCATTCTGTGAAACGACAAGAAGCTGCTAAAGGTTGAGAAATTGAAACCAATGATAGGGAGAAAGACTTCACCAGGTTTTCTGGTGCTTTTGGCTCTTGGGTTTCTTTTTGCTTCTTATAACTTGCTTACCATGTCTGTACACTACAAAGCTTCGAAAGGGAGTTGGTTGGCAGATGGGTTTAACTTGTTTGATCCAGTTATTCGAGTGCCTGGTCGAGCTGAACGGGCAGGGAAAACCAATTCAAAATACCATGTTGCTGTCACTGCAACTGATGCTCCTTACAGTCAATGGCAGTGCAGGATCATGTATTATTGGTATAAGAAAGTGAAGGATCTGCCTGGTTCTGACATGGGAAGCTTCACTAGAGTTTTGCATTCAGGAACTCCGGATAATTTAATGAAGGAGATTCCAACTTTTATTGTTGATCCTTTGCCAGAAGGCTTGGATCGGGTGAGATTCTGATGTTTCTTGTTCTGTTTTTAGTAGTTTTTCCAGTTGCTCTGCTTGAAACTTCACTGGTTTGTGTTAGTTTATGTCAGCAGTGTTTCCTTTGTTCCTTTTTCACTCTTTAAGTTCATTTAGACATTTGTTTTTGGCTTGTTGTTCTTATATTCTGTTCAGTTTTGGTTGTCTTCAACTATGGCTGAACTTAATGCATTGAATTTTAATGAAAAATTCAGGGTTATGTTGTGTTAAACCGACCGTGGGCTTTTGTTCAATGGCTGGAGAAAGCAAACATTGAAGAAGAGTGAGATGGTTTCCAAGTTTTTCTGGTGAAGCTTTTCTTTTCCTTTTCCATTTGATGAAATCTGAGGACAAGTGACTTTTTGACTGATGCAGATATATACTAATGGCAGAACCCGATCATATATTTGTTAAACCGTTACCAAACTTAGCTCACGGGAAGAATCCGGCTGGATTTCCGTTCTTCTACATAAAGCCAACTGAACATGAGAATATCATCAGGAAATTCTATCCTGAGGAGAATGGTCCAGTGACCAACATTGACCCCAT
TGGAAATTCTCCTGTCATCATTGAAAAGGTAACTTTTTTGACTGCCTATGAACTTTTGTGATTGATGCGAAGGCAGAATTACAAAAACAGCCAAACAAATTACCTCTGCATGAAAATGTATATTAGAGTCGAAGTTGCTATCGTATAGCTTAAGTGGTTGTTATATTGACCTGCTTCTTCAATTTCTAGGATGATTGATAGTACTGAATCTACTGACTAAGTTTTCTATATTAGACGCTGCTGGAGGAGATTGCACCAACTTGGGTGAACATATCCTTGAGAATGAAAGACGACCCAGCTACTGATAAAGCATTCGGTTGGGTGCTTGAGATGTGAGTTCTACTGAGATTGTTCGTCAAATTGGCACTTGGTAGTGGACTTGGGATCAATGTAACAGCTTTTGGCTTTTTTTTTTTTTTGTGTGTGTGTGTGTGTGTAGGTATGCTTATGCTGTAGCTTCTGCATTGCACGGTGTGCGGCATACGCTCCGCAAGGATTTTATGCTGCAGGTTTCGCCTATTTTGATCGTTCTATCCATCCATTAGTGGTTATGTTTTGGAGCCCTTAACAGATTATTGACTTCCTGTGCTTGAGTGTTTCAGCCTCCTTGGGATTTAGAAGTTGGTAGGAAATTTATCATCCACTATACCTATGGATGTGACTACACTATGAAGGTACAGATTCATCTCCAGTCTCTGGGCTGGAGTAAGAAAAAACTGGAATTGACAGTCATATCAAGTGAAAGCCACAAAGTTGTTGATTTTGTGTTTGTTTGTTTACTTCTCATTCTTCTCTTTTCTTTCTTTTGCTGTATTTGTCGATATTGATCAGTGTCACATCTGAGTGTTTCATCAGTAATTCTCAGCAGTCTATATGTTAACTCATCTTTTGATGGTCAGGGAGAACTGACATATGGTAAGATCGGGGAGTGGCGCTTCGACAAGAGATCGTTTCTCAATGGTCCACCACCAAGAAATCTTTCCTTACCTCCTCCTGGAGTACCCGAAAGTGTAGTAAGTATTTGTTCATGGAATCGTATCCTCTTTTGATGTTTCTTATCCCAAAAAAAGAAAAAAACAAAGAAAAATCTCATCTCATCTTCTGTAAAATCTCTTTGAGAAACTGTGAATTAATAACCTGTTTCTAACCTATAACCATCACCACCATCATATGAAGATTTCATCTATGTTCTAGTCTATCACCACCATCACACAATCATAATCCTTCCATGATAGGCCATAGCTTTTAACCATTTATGTATTCTTCCCATACAAGGAGAACAACTTAGCCCATTCGGATTATGTTTTCACCGATAACTCTATCTACTGGATTGAAGTTTGAACTTTAATGCATGTTCGAGAGTGATTTTGAAATTGTTAAAATCACTTATGTTCAAAATATACACACTTTTAATCACTCAAAAATTAATTTACTATAAATATTTAATTTTACACTTTTAAATGCAATTTTCATATAGTCATAATTTATCGAGAATGATTTAAAGCATATTTTAGAATGGTTTTGCAATGCCCAGAATTAGAATCAATCTCAAATATAATTACTACTACCTACTCCATAGTCTTTCGTTTTTCTTCTTTGACGTCATCGTTGATTACCTTTAAGATATGCTTTGTTAGACAAGAACGCTGCGAGGTGAATCAAAACCCTGTTTATAGAAACATCTCGTGCTGAGTCAGGACGAGATTTGCTCGGTTGTTTTCGTGTGTTTAGAAGTCAACCCTCTTGAGGAAAATTGAATGAACTCTTGCACTGCTTTTGATTTTGCAGGTAAGGCTTGTAAAGATGGTAAATGAGGCAACTGCAAATATCCCTGGTTGGGGGGAGTCATAAACTGGTGAGGGAAGTAACAACAATCAAACCTTGGTTTTGGTAAATATAATACACACTGTTACTGTTAGCAGCAAAAACTATGGTGGGGTTTATGTTTAAAGTCACCAAAATACCAATCAGCTTGAGAAGAGAGAACAATATGAAGAGGG
GAATGTAACCTGAGAAAATAATTGTATAGTTTTTTAAATGAAACTTTTGAGATAAACTTTATAATATACCATTAAATTCTTTAGCGCACAGTATATTCTCGTGATTTGTACGTAGCTCAAAGACTTTTGC

mRNA sequence

TTTTGATTCCACTCAAACTAAACTTTTTTGTCTAAATGGGAATTTAATTTAAAACAAAAAAGGAAAAAAGGAAAAAGCTTTTAAATTGATCCATAAAAGTTTGTCGTAATCCAAAATGTCGCACCAAAAAGATGGCCCCATGTATATATATCACAAGGAACACGGCTGGTTACCACATAGATTTCCATAAAAAAAATTGACCCACAACCTTATTTTTCTTTAATTTTTTATGGGCTTCGAAAGTTTTTGCGAATCCCCCACGTTTCGCCCATTTTCCCCTTCGTTTTTGTTTTCTTCTTTGGTCTAAAGGGTTGGCGTTGAAATACCCTTTTGAACCCAAATTCTATTGACTTTTTTCTTTCTGAACTTTCTTCTGGATTTCTGGGTTCCTTCGTTATTTGTACGTTTATTTCTTATGCATTCTGTGAAACGACAAGAAGCTGCTAAAGGTTGAGAAATTGAAACCAATGATAGGGAGAAAGACTTCACCAGGTTTTCTGGTGCTTTTGGCTCTTGGGTTTCTTTTTGCTTCTTATAACTTGCTTACCATGTCTGTACACTACAAAGCTTCGAAAGGGAGTTGGTTGGCAGATGGGTTTAACTTGTTTGATCCAGTTATTCGAGTGCCTGGTCGAGCTGAACGGGCAGGGAAAACCAATTCAAAATACCATGTTGCTGTCACTGCAACTGATGCTCCTTACAGTCAATGGCAGTGCAGGATCATGTATTATTGGTATAAGAAAGTGAAGGATCTGCCTGGTTCTGACATGGGAAGCTTCACTAGAGTTTTGCATTCAGGAACTCCGGATAATTTAATGAAGGAGATTCCAACTTTTATTGTTGATCCTTTGCCAGAAGGCTTGGATCGGGGTTATGTTGTGTTAAACCGACCGTGGGCTTTTGTTCAATGGCTGGAGAAAGCAAACATTGAAGAAGAATATATACTAATGGCAGAACCCGATCATATATTTGTTAAACCGTTACCAAACTTAGCTCACGGGAAGAATCCGGCTGGATTTCCGTTCTTCTACATAAAGCCAACTGAACATGAGAATATCATCAGGAAATTCTATCCTGAGGAGAATGGTCCAGTGACCAACATTGACCCCATTGGAAATTCTCCTGTCATCATTGAAAAGACGCTGCTGGAGGAGATTGCACCAACTTGGGTGAACATATCCTTGAGAATGAAAGACGACCCAGCTACTGATAAAGCATTCGGTTGGGTGCTTGAGATGTATGCTTATGCTGTAGCTTCTGCATTGCACGGTGTGCGGCATACGCTCCGCAAGGATTTTATGCTGCAGCCTCCTTGGGATTTAGAAGTTGGTAGGAAATTTATCATCCACTATACCTATGGATGTGACTACACTATGAAGGGAGAACTGACATATGGTAAGATCGGGGAGTGGCGCTTCGACAAGAGATCGTTTCTCAATGGTCCACCACCAAGAAATCTTTCCTTACCTCCTCCTGGAGTACCCGAAAGTGTAGTAAGGCTTGTAAAGATGGTAAATGAGGCAACTGCAAATATCCCTGGTTGGGGGGAGTCATAAACTGGTGAGGGAAGTAACAACAATCAAACCTTGGTTTTGGTAAATATAATACACACTGTTACTGTTAGCAGCAAAAACTATGGTGGGGTTTATGTTTAAAGTCACCAAAATACCAATCAGCTTGAGAAGAGAGAACAATATGAAGAGGGGAATGTAACCTGAGAAAATAATTGTATAGTTTTTTAAATGAAACTTTTGAGATAAACTTTATAATATACCATTAAATTCTTTAGCGCACAGTATATTCTCGTGATTTGTACGTAGCTCAAAGACTTTTGC

Coding sequence (CDS)

ATGATAGGGAGAAAGACTTCACCAGGTTTTCTGGTGCTTTTGGCTCTTGGGTTTCTTTTTGCTTCTTATAACTTGCTTACCATGTCTGTACACTACAAAGCTTCGAAAGGGAGTTGGTTGGCAGATGGGTTTAACTTGTTTGATCCAGTTATTCGAGTGCCTGGTCGAGCTGAACGGGCAGGGAAAACCAATTCAAAATACCATGTTGCTGTCACTGCAACTGATGCTCCTTACAGTCAATGGCAGTGCAGGATCATGTATTATTGGTATAAGAAAGTGAAGGATCTGCCTGGTTCTGACATGGGAAGCTTCACTAGAGTTTTGCATTCAGGAACTCCGGATAATTTAATGAAGGAGATTCCAACTTTTATTGTTGATCCTTTGCCAGAAGGCTTGGATCGGGGTTATGTTGTGTTAAACCGACCGTGGGCTTTTGTTCAATGGCTGGAGAAAGCAAACATTGAAGAAGAATATATACTAATGGCAGAACCCGATCATATATTTGTTAAACCGTTACCAAACTTAGCTCACGGGAAGAATCCGGCTGGATTTCCGTTCTTCTACATAAAGCCAACTGAACATGAGAATATCATCAGGAAATTCTATCCTGAGGAGAATGGTCCAGTGACCAACATTGACCCCATTGGAAATTCTCCTGTCATCATTGAAAAGACGCTGCTGGAGGAGATTGCACCAACTTGGGTGAACATATCCTTGAGAATGAAAGACGACCCAGCTACTGATAAAGCATTCGGTTGGGTGCTTGAGATGTATGCTTATGCTGTAGCTTCTGCATTGCACGGTGTGCGGCATACGCTCCGCAAGGATTTTATGCTGCAGCCTCCTTGGGATTTAGAAGTTGGTAGGAAATTTATCATCCACTATACCTATGGATGTGACTACACTATGAAGGGAGAACTGACATATGGTAAGATCGGGGAGTGGCGCTTCGACAAGAGATCGTTTCTCAATGGTCCACCACCAAGAAATCTTTCCTTACCTCCTCCTGGAGTACCCGAAAGTGTAGTAAGGCTTGTAAAGATGGTAAATGAGGCAACTGCAAATATCCCTGGTTGGGGGGAGTCATAA
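
The coding sequence appears to be a contiguous stretch of the mRNA sequence, with untranslated regions on either side. As a rough sketch, the offset of the CDS within the mRNA can be found with a plain substring search. The two strings below are short fragments copied from the sequences above, not the full records, and `cds_offset` is a hypothetical helper name.

```python
def cds_offset(mrna, cds):
    """Return the 0-based position where `cds` starts inside `mrna`.

    The number returned equals the length of the 5' UTR in bases.
    Raises ValueError if the CDS is not a substring of the mRNA.
    """
    position = mrna.upper().find(cds.upper())
    if position < 0:
        raise ValueError("CDS not found in mRNA")
    return position

# Fragments only: the last bases of the 5' UTR followed by the start of the CDS.
mrna_fragment = "GAAATTGAAACCAATGATAGGGAGAAAGACT"
cds_fragment = "ATGATAGGGAGAAAGACT"
print(cds_offset(mrna_fragment, cds_fragment))
```

Run on the full mRNA and CDS strings, the same call would give the full 5' UTR length; the 3' UTR length then follows as `len(mrna) - offset - len(cds)`.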

Protein sequence

MIGRKTSPGFLVLLALGFLFASYNLLTMSVHYKASKGSWLADGFNLFDPVIRVPGRAERAGKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEIPTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKNPAGFPFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLRMKDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHYTYGCDYTMKGELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGWGES
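
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code; the `translate` function is written here for illustration and is not part of the database. Only the first and last codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

print(translate("ATGATAGGGAGAAAGACTTCACCAGGTTTT"))  # first 10 codons of the CDS
print(translate("GGGGAGTCATAA"))                    # last 4 codons, including the stop
```

The two fragments translate to the start (MIGRKTSPGF) and end (GES) of the protein sequence listed above.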
BLAST of Bhi10G000196 vs. TAIR10
Match: AT5G13500.1 (unknown protein)

HSP 1 Score: 564.7 bits (1454), Expect = 4.1e-161
Identity = 258/355 (72.68%), Positives = 293/355 (82.54%), Query Frame = 0

Query: 5   KTSPGFLVLLALGFLFASYNLLTMSVHYKASKGSWLADGFNLFDPVIRVPGRAERAGKTN 64
           K S   L LL  GF   +YNLLT+ VH ++   +  +DG  L DPV+++P    +A  + 
Sbjct: 3   KASGLLLFLLGFGFFVVTYNLLTLIVHNRSGVSN--SDGSPLLDPVVQMPLNIRKAKSSP 62

Query: 65  SKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEIPTFI 124
           + +HVA+TATDAPY++WQCRIMYYWYK+ K LPGSDMG FTR+LHSG  DNLM EIPTF+
Sbjct: 63  APFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTRILHSGNSDNLMDEIPTFV 122

Query: 125 VDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKNPAGF 184
           VDPLP GLDRGYVVLNRPWAFVQWLE+A I+E+Y+LMAEPDH+FV PLPNLA G  PA F
Sbjct: 123 VDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDHVFVNPLPNLAVGGFPAAF 182

Query: 185 PFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLRMKDD 244
           PFFYI P ++ENI+RK+YP E GPVTNIDPIGNSPVII K  LE+IAPTW+N+SL MK+D
Sbjct: 183 PFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKESLEKIAPTWMNVSLTMKND 242

Query: 245 PATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHYTYGCDYTMK 304
           P TDKAFGWVLEMY YA+ASA+HGVRH LRKDFMLQPPWDL    KFIIHYTYGCDY MK
Sbjct: 243 PETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMK 302

Query: 305 GELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGW 360
           GELTYGKIGEWRFDKRS L GPPPRN+SLPPPGVPESVV LVKMVNEATA IP W
Sbjct: 303 GELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLVKMVNEATATIPNW 355
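
The percentages in an HSP header follow directly from the residue counts, and the identity count of an alignment block can be recounted from its Query and Sbjct rows. The sketch below assumes the header layout shown in these reports; `parse_hsp_header` and `count_identities` are hypothetical helper names, and the regular expression tolerates spelling variants of the "Positives" label.

```python
import re

HSP_HEADER = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w* = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_header(line):
    """Extract counts from an HSP header and recompute the percentages."""
    match = HSP_HEADER.search(line)
    if match is None:
        raise ValueError(f"not an HSP header: {line!r}")
    identical, aligned, positives, _ = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        "identity_pct": round(100 * identical / aligned, 2),
        "positives_pct": round(100 * positives / aligned, 2),
    }

def count_identities(query, sbjct):
    """Count columns where both rows carry the same residue (gaps excluded)."""
    return sum(q == s and q != "-" for q, s in zip(query, sbjct))

header = "Identity = 258/355 (72.68%), Positives = 293/355 (82.54%), Query Frame = 0"
print(parse_hsp_header(header))

# Final block of the alignment against AT5G13500.1, copied from above.
query = "GELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGW"
sbjct = "GELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLVKMVNEATATIPNW"
print(count_identities(query, sbjct), "of", len(query))
```

Summing `count_identities` over all blocks of one HSP should reproduce the numerator of the Identity field, which gives a quick consistency check on a copied alignment.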

BLAST of Bhi10G000196 vs. TAIR10
Match: AT2G25260.1 (unknown protein)

HSP 1 Score: 489.2 bits (1258), Expect = 2.2e-138
Identity = 212/298 (71.14%), Positives = 253/298 (84.90%), Query Frame = 0

Query: 62  KTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEIP 121
           KT   +H AVTATD+ YS WQCR+MYYWY + +D PGSDMG +TR+LHSG PD LM EIP
Sbjct: 59  KTKRLFHTAVTATDSVYSTWQCRVMYYWYNRFRDEPGSDMGGYTRILHSGRPDGLMDEIP 118

Query: 122 TFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKNP 181
           TF+ DPLP G+D+GYVVLNRPWAFVQWL++A+IEE+YILMAEPDHI VKP+PNLA G   
Sbjct: 119 TFVADPLPSGVDKGYVVLNRPWAFVQWLQQAHIEEDYILMAEPDHIIVKPIPNLARGNLA 178

Query: 182 AGFPFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLRM 241
           A FPFFYI+P ++E+++RKF+P+ENGP++ IDPIGNSPVI+ K  L +IAPTW+N+SL M
Sbjct: 179 AAFPFFYIEPKKYESVLRKFFPKENGPISRIDPIGNSPVIVTKNALMKIAPTWMNVSLAM 238

Query: 242 KDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHYTYGCDY 301
           K+DP TDKAFGWVLEMYAYAV+SALHGV + L KDFM+QPPWD E  + FIIHYTYGCD+
Sbjct: 239 KNDPQTDKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQPPWDTETKKTFIIHYTYGCDF 298

Query: 302 TMKGELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGW 360
            MKG++  GKIGEWRFDKRS+ + PPPRNL+LPP GVPESVV LV M+NEATANIP W
Sbjct: 299 DMKGKMMVGKIGEWRFDKRSYGDKPPPRNLTLPPRGVPESVVTLVTMINEATANIPNW 356

BLAST of Bhi10G000196 vs. TAIR10
Match: AT5G25265.1 (unknown protein)

HSP 1 Score: 488.8 bits (1257), Expect = 2.9e-138
Identity = 217/318 (68.24%), Positives = 261/318 (82.08%), Query Frame = 0

Query: 48  DPVIRVP---GRAERAGKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDL--PGSDMG 107
           DPVI +P   G     GK    +H AVTA+D+ Y+ WQCR+MYYW+KK++    PGS+MG
Sbjct: 48  DPVIELPRGGGSRNNDGKRIRLFHTAVTASDSVYNTWQCRVMYYWFKKIQASAGPGSEMG 107

Query: 108 SFTRVLHSGTPDNLMKEIPTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMA 167
            FTR+LHSG PD  M EIPTF+  PLP G+D+GYVVLNRPWAFVQWL++ +I+E+YILM+
Sbjct: 108 GFTRILHSGKPDQYMDEIPTFVAQPLPSGMDQGYVVLNRPWAFVQWLQQTDIKEDYILMS 167

Query: 168 EPDHIFVKPLPNLAHGKNPAGFPFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVII 227
           EPDHI VKP+PNLA     A FPFFYI+P ++E ++RK+YPE  GPVTNIDPIGNSPVI+
Sbjct: 168 EPDHIIVKPIPNLAKDGLGAAFPFFYIEPKKYEKVLRKYYPEVRGPVTNIDPIGNSPVIV 227

Query: 228 EKTLLEEIAPTWVNISLRMKDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPP 287
            K  L++IAPTW+N+SL MK DP  DKAFGWVLEMYAYAV+SALHGV + L KDFM+QPP
Sbjct: 228 GKDALKKIAPTWMNVSLAMKKDPEADKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQPP 287

Query: 288 WDLEVGRKFIIHYTYGCDYTMKGELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESV 347
           WD+EVG K+IIHYTYGCDY MKG+LTYGKIGEWRFDKRS+ + PPPRNL++PPPGV +SV
Sbjct: 288 WDIEVGDKYIIHYTYGCDYDMKGKLTYGKIGEWRFDKRSYDSKPPPRNLTMPPPGVSQSV 347

Query: 348 VRLVKMVNEATANIPGWG 361
           V LVKM+NEATANIP WG
Sbjct: 348 VTLVKMINEATANIPNWG 365

BLAST of Bhi10G000196 vs. Swiss-Prot
Match: sp|Q9FY51|HPAT3_ARATH (Hydroxyproline O-arabinosyltransferase 3 OS=Arabidopsis thaliana OX=3702 GN=HPAT3 PE=1 SV=1)

HSP 1 Score: 564.7 bits (1454), Expect = 7.5e-160
Identity = 258/355 (72.68%), Positives = 293/355 (82.54%), Query Frame = 0

Query: 5   KTSPGFLVLLALGFLFASYNLLTMSVHYKASKGSWLADGFNLFDPVIRVPGRAERAGKTN 64
           K S   L LL  GF   +YNLLT+ VH ++   +  +DG  L DPV+++P    +A  + 
Sbjct: 3   KASGLLLFLLGFGFFVVTYNLLTLIVHNRSGVSN--SDGSPLLDPVVQMPLNIRKAKSSP 62

Query: 65  SKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEIPTFI 124
           + +HVA+TATDAPY++WQCRIMYYWYK+ K LPGSDMG FTR+LHSG  DNLM EIPTF+
Sbjct: 63  APFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTRILHSGNSDNLMDEIPTFV 122

Query: 125 VDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKNPAGF 184
           VDPLP GLDRGYVVLNRPWAFVQWLE+A I+E+Y+LMAEPDH+FV PLPNLA G  PA F
Sbjct: 123 VDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDHVFVNPLPNLAVGGFPAAF 182

Query: 185 PFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLRMKDD 244
           PFFYI P ++ENI+RK+YP E GPVTNIDPIGNSPVII K  LE+IAPTW+N+SL MK+D
Sbjct: 183 PFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKESLEKIAPTWMNVSLTMKND 242

Query: 245 PATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHYTYGCDYTMK 304
           P TDKAFGWVLEMY YA+ASA+HGVRH LRKDFMLQPPWDL    KFIIHYTYGCDY MK
Sbjct: 243 PETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMK 302

Query: 305 GELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGW 360
           GELTYGKIGEWRFDKRS L GPPPRN+SLPPPGVPESVV LVKMVNEATA IP W
Sbjct: 303 GELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLVKMVNEATATIPNW 355

BLAST of Bhi10G000196 vs. Swiss-Prot
Match: sp|Q494Q2|HPAT2_ARATH (Hydroxyproline O-arabinosyltransferase 2 OS=Arabidopsis thaliana OX=3702 GN=HPAT2 PE=1 SV=1)

HSP 1 Score: 489.2 bits (1258), Expect = 4.0e-137
Identity = 212/298 (71.14%), Positives = 253/298 (84.90%), Query Frame = 0

Query: 62  KTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEIP 121
           KT   +H AVTATD+ YS WQCR+MYYWY + +D PGSDMG +TR+LHSG PD LM EIP
Sbjct: 59  KTKRLFHTAVTATDSVYSTWQCRVMYYWYNRFRDEPGSDMGGYTRILHSGRPDGLMDEIP 118

Query: 122 TFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKNP 181
           TF+ DPLP G+D+GYVVLNRPWAFVQWL++A+IEE+YILMAEPDHI VKP+PNLA G   
Sbjct: 119 TFVADPLPSGVDKGYVVLNRPWAFVQWLQQAHIEEDYILMAEPDHIIVKPIPNLARGNLA 178

Query: 182 AGFPFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLRM 241
           A FPFFYI+P ++E+++RKF+P+ENGP++ IDPIGNSPVI+ K  L +IAPTW+N+SL M
Sbjct: 179 AAFPFFYIEPKKYESVLRKFFPKENGPISRIDPIGNSPVIVTKNALMKIAPTWMNVSLAM 238

Query: 242 KDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHYTYGCDY 301
           K+DP TDKAFGWVLEMYAYAV+SALHGV + L KDFM+QPPWD E  + FIIHYTYGCD+
Sbjct: 239 KNDPQTDKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQPPWDTETKKTFIIHYTYGCDF 298

Query: 302 TMKGELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGW 360
            MKG++  GKIGEWRFDKRS+ + PPPRNL+LPP GVPESVV LV M+NEATANIP W
Sbjct: 299 DMKGKMMVGKIGEWRFDKRSYGDKPPPRNLTLPPRGVPESVVTLVTMINEATANIPNW 356

BLAST of Bhi10G000196 vs. Swiss-Prot
Match: sp|Q8W4E6|HPAT1_ARATH (Hydroxyproline O-arabinosyltransferase 1 OS=Arabidopsis thaliana OX=3702 GN=HPAT1 PE=1 SV=1)

HSP 1 Score: 488.8 bits (1257), Expect = 5.2e-137
Identity = 217/318 (68.24%), Positives = 261/318 (82.08%), Query Frame = 0

Query: 48  DPVIRVP---GRAERAGKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDL--PGSDMG 107
           DPVI +P   G     GK    +H AVTA+D+ Y+ WQCR+MYYW+KK++    PGS+MG
Sbjct: 48  DPVIELPRGGGSRNNDGKRIRLFHTAVTASDSVYNTWQCRVMYYWFKKIQASAGPGSEMG 107

Query: 108 SFTRVLHSGTPDNLMKEIPTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMA 167
            FTR+LHSG PD  M EIPTF+  PLP G+D+GYVVLNRPWAFVQWL++ +I+E+YILM+
Sbjct: 108 GFTRILHSGKPDQYMDEIPTFVAQPLPSGMDQGYVVLNRPWAFVQWLQQTDIKEDYILMS 167

Query: 168 EPDHIFVKPLPNLAHGKNPAGFPFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVII 227
           EPDHI VKP+PNLA     A FPFFYI+P ++E ++RK+YPE  GPVTNIDPIGNSPVI+
Sbjct: 168 EPDHIIVKPIPNLAKDGLGAAFPFFYIEPKKYEKVLRKYYPEVRGPVTNIDPIGNSPVIV 227

Query: 228 EKTLLEEIAPTWVNISLRMKDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPP 287
            K  L++IAPTW+N+SL MK DP  DKAFGWVLEMYAYAV+SALHGV + L KDFM+QPP
Sbjct: 228 GKDALKKIAPTWMNVSLAMKKDPEADKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQPP 287

Query: 288 WDLEVGRKFIIHYTYGCDYTMKGELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESV 347
           WD+EVG K+IIHYTYGCDY MKG+LTYGKIGEWRFDKRS+ + PPPRNL++PPPGV +SV
Sbjct: 288 WDIEVGDKYIIHYTYGCDYDMKGKLTYGKIGEWRFDKRSYDSKPPPRNLTMPPPGVSQSV 347

Query: 348 VRLVKMVNEATANIPGWG 361
           V LVKM+NEATANIP WG
Sbjct: 348 VTLVKMINEATANIPNWG 365

BLAST of Bhi10G000196 vs. TrEMBL
Match: tr|A0A1S3BN61|A0A1S3BN61_CUCME (uncharacterized protein LOC103491472 OS=Cucumis melo OX=3656 GN=LOC103491472 PE=4 SV=1)

HSP 1 Score: 752.7 bits (1942), Expect = 3.9e-214
Identity = 350/362 (96.69%), Positives = 356/362 (98.34%), Query Frame = 0

Query: 1   MIGRKTSPGFLVLLALGFLFASYNLLTMSVHYKASKGSWLADGFNLFDPVIRVPGRAERA 60
           M+GRKTSPGFLVLLALGFLFASYNL+TMSV+YKASKGSWLADGF+LFDPVIRVPGRAERA
Sbjct: 1   MMGRKTSPGFLVLLALGFLFASYNLITMSVNYKASKGSWLADGFDLFDPVIRVPGRAERA 60

Query: 61  GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEI 120
           GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEI
Sbjct: 61  GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEI 120

Query: 121 PTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKN 180
           PTFIVDPLPEGLDRGY+VLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKN
Sbjct: 121 PTFIVDPLPEGLDRGYIVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKN 180

Query: 181 PAGFPFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR 240
           PAGFPFFYIKP EHE IIRKFYPEE GPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR
Sbjct: 181 PAGFPFFYIKPAEHEKIIRKFYPEEKGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR 240

Query: 241 MKDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHYTYGCD 300
           MKDDP TDK FGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGR FIIHYTYGCD
Sbjct: 241 MKDDPTTDKTFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRNFIIHYTYGCD 300

Query: 301 YTMKGELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGWG 360
           YTMKGELTYGKIGEWRFDKRS+LNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGWG
Sbjct: 301 YTMKGELTYGKIGEWRFDKRSYLNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGWG 360

Query: 361 ES 363
           ES
Sbjct: 361 ES 362

BLAST of Bhi10G000196 vs. TrEMBL
Match: tr|A0A0A0KEZ8|A0A0A0KEZ8_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G430740 PE=4 SV=1)

HSP 1 Score: 706.1 bits (1821), Expect = 4.2e-200
Identity = 333/362 (91.99%), Positives = 338/362 (93.37%), Query Frame = 0

Query: 1   MIGRKTSPGFLVLLALGFLFASYNLLTMSVHYKASKGSWLADGFNLFDPVIRVPGRAERA 60
           MIGRKTSPGFLVLLALGFL ASYNL+TMSVHYKA KGSWL                AERA
Sbjct: 1   MIGRKTSPGFLVLLALGFLLASYNLITMSVHYKAPKGSWL----------------AERA 60

Query: 61  GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEI 120
           GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEI
Sbjct: 61  GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEI 120

Query: 121 PTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKN 180
           PTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKN
Sbjct: 121 PTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKN 180

Query: 181 PAGFPFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR 240
           PAGFPFFYIKP +HE IIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR
Sbjct: 181 PAGFPFFYIKPADHEKIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR 240

Query: 241 MKDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHYTYGCD 300
           MKDDP TDK FGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGR FIIHYTYGCD
Sbjct: 241 MKDDPTTDKTFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRNFIIHYTYGCD 300

Query: 301 YTMKGELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGWG 360
           YTMKGELTYGKIGEWRFDKR++LNGPPPRNLSLPPPGVPE+VVRLVKMVNEATANIP WG
Sbjct: 301 YTMKGELTYGKIGEWRFDKRTYLNGPPPRNLSLPPPGVPETVVRLVKMVNEATANIPDWG 346

Query: 361 ES 363
           ES
Sbjct: 361 ES 346

BLAST of Bhi10G000196 vs. TrEMBL
Match: tr|A0A2C9WE38|A0A2C9WE38_MANES (Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_02G139400 PE=4 SV=1)

HSP 1 Score: 643.3 bits (1658), Expect = 3.3e-181
Identity = 295/364 (81.04%), Positives = 324/364 (89.01%), Query Frame = 0

Query: 1   MIGRK----TSPGFLVLLALGFLFASYNLLTMSVHYKASKGSWLADGFNLFDPVIRVPGR 60
           M GRK     SP FLVLL+LGF FA+YNLLT+ + YK+S      DG  L DP+  +P  
Sbjct: 1   MTGRKNMGRVSPLFLVLLSLGFFFATYNLLTLVIQYKSSNS---GDGLELADPITDMPHE 60

Query: 61  AERAGKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNL 120
            +R GK+N +YHVA+TATDAPYSQWQCRIMYYWYKK+KD+PGSDMG FTRVLHSG PD L
Sbjct: 61  VKRLGKSNPRYHVALTATDAPYSQWQCRIMYYWYKKMKDMPGSDMGKFTRVLHSGKPDKL 120

Query: 121 MKEIPTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLA 180
           M EIPTF+VDPLPEGLDRGY+VLNRPWAFVQWLEKA IEEEYILMAEPDHIFV PLPNLA
Sbjct: 121 MDEIPTFVVDPLPEGLDRGYIVLNRPWAFVQWLEKATIEEEYILMAEPDHIFVNPLPNLA 180

Query: 181 HGKNPAGFPFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVN 240
           HG +PAGFPFFYIKP +HENIIRKFYP+E GPVTN+DPIGNSPVII+KT+LEEI+PTWVN
Sbjct: 181 HGDHPAGFPFFYIKPAQHENIIRKFYPKEKGPVTNVDPIGNSPVIIKKTILEEISPTWVN 240

Query: 241 ISLRMKDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHYT 300
           ISLRMKDDP TDKAFGWVLEMYAYAVASALHGVRH LRKDFM+QPPWDL+VG++FIIHYT
Sbjct: 241 ISLRMKDDPETDKAFGWVLEMYAYAVASALHGVRHILRKDFMIQPPWDLKVGKRFIIHYT 300

Query: 301 YGCDYTMKGELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESVVRLVKMVNEATANI 360
           YGCDY +KGELTYGKIGEWRFDKRS+L+G PPRNLSLPP GVPESVVRLVKMVNEATANI
Sbjct: 301 YGCDYNLKGELTYGKIGEWRFDKRSYLSGSPPRNLSLPPAGVPESVVRLVKMVNEATANI 360

BLAST of Bhi10G000196 vs. TrEMBL
Match: tr|A0A2I4GF59|A0A2I4GF59_9ROSI (uncharacterized protein LOC109007348 isoform X2 OS=Juglans regia OX=51240 GN=LOC109007348 PE=4 SV=1)

HSP 1 Score: 638.6 bits (1646), Expect = 8.2e-180
Identity = 296/364 (81.32%), Positives = 322/364 (88.46%), Query Frame = 0

Query: 1   MIGRK----TSPGFLVLLALGFLFASYNLLTMSVHYKAS-KGSWLADGFNLFDPVIRVPG 60
           MIGRK     SP  LVLLALGF FASYN L++ +H KAS  GSW+AD  + FDP+I +P 
Sbjct: 1   MIGRKNTGRASPVLLVLLALGFFFASYNFLSIVIHNKASNSGSWVADRLDWFDPIIGIPE 60

Query: 61  RAERAGKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDN 120
           + +R+   NSK+HVA+TATDA YSQWQCRIMYYWYKKVK++PGSDMG FTR+LHSG+PD 
Sbjct: 61  KVKRSRDFNSKFHVALTATDASYSQWQCRIMYYWYKKVKEMPGSDMGEFTRILHSGSPDK 120

Query: 121 LMKEIPTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNL 180
           LM+EIPTF+VDPLPEGLDRGYVVLNRPWAFVQWLEKA IEEEYILMAEPDHIFV PLPNL
Sbjct: 121 LMEEIPTFVVDPLPEGLDRGYVVLNRPWAFVQWLEKAKIEEEYILMAEPDHIFVNPLPNL 180

Query: 181 AHGKNPAGFPFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWV 240
           A G  PAGFPFFYIKP EHE IIRKFYPEENGPV +IDPIGNSPVII+K+ LEEIAPTWV
Sbjct: 181 ADGIQPAGFPFFYIKPAEHEKIIRKFYPEENGPVADIDPIGNSPVIIKKSSLEEIAPTWV 240

Query: 241 NISLRMKDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHY 300
           N+SLRMKDDP TDKAFGWVLEMYAYAVASALH VRH LRKDFMLQPPWDL VG+KFIIHY
Sbjct: 241 NVSLRMKDDPETDKAFGWVLEMYAYAVASALHDVRHILRKDFMLQPPWDLNVGKKFIIHY 300

Query: 301 TYGCDYTMKGELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESVVRLVKMVNEATAN 360
           TYGCDY  KGELTYGK+GEWRFDKRS+L  PPPRNLSLPPPGVPESVVRLVKMVNEAT+N
Sbjct: 301 TYGCDYNSKGELTYGKVGEWRFDKRSYLTSPPPRNLSLPPPGVPESVVRLVKMVNEATSN 360

BLAST of Bhi10G000196 vs. TrEMBL
Match: tr|A0A2I4GF70|A0A2I4GF70_9ROSI (uncharacterized protein LOC109007348 isoform X1 OS=Juglans regia OX=51240 GN=LOC109007348 PE=4 SV=1)

HSP 1 Score: 638.6 bits (1646), Expect = 8.2e-180
Identity = 296/364 (81.32%), Positives = 322/364 (88.46%), Query Frame = 0

Query: 1   MIGRK----TSPGFLVLLALGFLFASYNLLTMSVHYKAS-KGSWLADGFNLFDPVIRVPG 60
           MIGRK     SP  LVLLALGF FASYN L++ +H KAS  GSW+AD  + FDP+I +P 
Sbjct: 13  MIGRKNTGRASPVLLVLLALGFFFASYNFLSIVIHNKASNSGSWVADRLDWFDPIIGIPE 72

Query: 61  RAERAGKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDN 120
           + +R+   NSK+HVA+TATDA YSQWQCRIMYYWYKKVK++PGSDMG FTR+LHSG+PD 
Sbjct: 73  KVKRSRDFNSKFHVALTATDASYSQWQCRIMYYWYKKVKEMPGSDMGEFTRILHSGSPDK 132

Query: 121 LMKEIPTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNL 180
           LM+EIPTF+VDPLPEGLDRGYVVLNRPWAFVQWLEKA IEEEYILMAEPDHIFV PLPNL
Sbjct: 133 LMEEIPTFVVDPLPEGLDRGYVVLNRPWAFVQWLEKAKIEEEYILMAEPDHIFVNPLPNL 192

Query: 181 AHGKNPAGFPFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWV 240
           A G  PAGFPFFYIKP EHE IIRKFYPEENGPV +IDPIGNSPVII+K+ LEEIAPTWV
Sbjct: 193 ADGIQPAGFPFFYIKPAEHEKIIRKFYPEENGPVADIDPIGNSPVIIKKSSLEEIAPTWV 252

Query: 241 NISLRMKDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHY 300
           N+SLRMKDDP TDKAFGWVLEMYAYAVASALH VRH LRKDFMLQPPWDL VG+KFIIHY
Sbjct: 253 NVSLRMKDDPETDKAFGWVLEMYAYAVASALHDVRHILRKDFMLQPPWDLNVGKKFIIHY 312

Query: 301 TYGCDYTMKGELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESVVRLVKMVNEATAN 360
           TYGCDY  KGELTYGK+GEWRFDKRS+L  PPPRNLSLPPPGVPESVVRLVKMVNEAT+N
Sbjct: 313 TYGCDYNSKGELTYGKVGEWRFDKRSYLTSPPPRNLSLPPPGVPESVVRLVKMVNEATSN 372

BLAST of Bhi10G000196 vs. NCBI nr
Match: XP_008449655.1 (PREDICTED: uncharacterized protein LOC103491472 [Cucumis melo] >XP_008449656.1 PREDICTED: uncharacterized protein LOC103491472 [Cucumis melo])

HSP 1 Score: 752.7 bits (1942), Expect = 5.9e-214
Identity = 350/362 (96.69%), Positives = 356/362 (98.34%), Query Frame = 0

Query: 1   MIGRKTSPGFLVLLALGFLFASYNLLTMSVHYKASKGSWLADGFNLFDPVIRVPGRAERA 60
           M+GRKTSPGFLVLLALGFLFASYNL+TMSV+YKASKGSWLADGF+LFDPVIRVPGRAERA
Sbjct: 1   MMGRKTSPGFLVLLALGFLFASYNLITMSVNYKASKGSWLADGFDLFDPVIRVPGRAERA 60

Query: 61  GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEI 120
           GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEI
Sbjct: 61  GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEI 120

Query: 121 PTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKN 180
           PTFIVDPLPEGLDRGY+VLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKN
Sbjct: 121 PTFIVDPLPEGLDRGYIVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKN 180

Query: 181 PAGFPFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR 240
           PAGFPFFYIKP EHE IIRKFYPEE GPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR
Sbjct: 181 PAGFPFFYIKPAEHEKIIRKFYPEEKGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR 240

Query: 241 MKDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHYTYGCD 300
           MKDDP TDK FGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGR FIIHYTYGCD
Sbjct: 241 MKDDPTTDKTFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRNFIIHYTYGCD 300

Query: 301 YTMKGELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGWG 360
           YTMKGELTYGKIGEWRFDKRS+LNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGWG
Sbjct: 301 YTMKGELTYGKIGEWRFDKRSYLNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGWG 360

Query: 361 ES 363
           ES
Sbjct: 361 ES 362

BLAST of Bhi10G000196 vs. NCBI nr
Match: XP_011657614.1 (PREDICTED: uncharacterized protein LOC101207236 [Cucumis sativus] >KGN48098.1 hypothetical protein Csa_6G430740 [Cucumis sativus])

HSP 1 Score: 706.1 bits (1821), Expect = 6.3e-200
Identity = 333/362 (91.99%), Positives = 338/362 (93.37%), Query Frame = 0

Query: 1   MIGRKTSPGFLVLLALGFLFASYNLLTMSVHYKASKGSWLADGFNLFDPVIRVPGRAERA 60
           MIGRKTSPGFLVLLALGFL ASYNL+TMSVHYKA KGSWL                AERA
Sbjct: 1   MIGRKTSPGFLVLLALGFLLASYNLITMSVHYKAPKGSWL----------------AERA 60

Query: 61  GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEI 120
           GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEI
Sbjct: 61  GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEI 120

Query: 121 PTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKN 180
           PTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKN
Sbjct: 121 PTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKN 180

Query: 181 PAGFPFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR 240
           PAGFPFFYIKP +HE IIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR
Sbjct: 181 PAGFPFFYIKPADHEKIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR 240

Query: 241 MKDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHYTYGCD 300
           MKDDP TDK FGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGR FIIHYTYGCD
Sbjct: 241 MKDDPTTDKTFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRNFIIHYTYGCD 300

Query: 301 YTMKGELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGWG 360
           YTMKGELTYGKIGEWRFDKR++LNGPPPRNLSLPPPGVPE+VVRLVKMVNEATANIP WG
Sbjct: 301 YTMKGELTYGKIGEWRFDKRTYLNGPPPRNLSLPPPGVPETVVRLVKMVNEATANIPDWG 346

Query: 361 ES 363
           ES
Sbjct: 361 ES 346

BLAST of Bhi10G000196 vs. NCBI nr
Match: XP_022153085.1 (hydroxyproline O-arabinosyltransferase 3-like [Momordica charantia] >XP_022153086.1 hydroxyproline O-arabinosyltransferase 3-like [Momordica charantia] >XP_022153087.1 hydroxyproline O-arabinosyltransferase 3-like [Momordica charantia])

HSP 1 Score: 700.7 bits (1807), Expect = 2.7e-198
Identity = 328/360 (91.11%), Positives = 342/360 (95.00%), Query Frame = 0

Query: 1   MIGRKTSPGFLVLLALGFLFASYNLLTMSVHYKASKGSWLADGFNLFDPVIRVPGRAERA 60
           MIGRKTSP  LVL ALGFLFA+YNLLTM++H KASKGSW++DG   FDPV+R+PG   RA
Sbjct: 1   MIGRKTSPALLVLFALGFLFATYNLLTMTIHNKASKGSWVSDG---FDPVLRLPG---RA 60

Query: 61  GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEI 120
           GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKD+PGSDMG+FTRVLHSG PDNLMKEI
Sbjct: 61  GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDMPGSDMGNFTRVLHSGRPDNLMKEI 120

Query: 121 PTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKN 180
            TFIVDPLPEGLDRGYVVLNRPWAFVQWLEKA+IEEEYILMAEPDHIFV PLPNLAHGKN
Sbjct: 121 STFIVDPLPEGLDRGYVVLNRPWAFVQWLEKASIEEEYILMAEPDHIFVNPLPNLAHGKN 180

Query: 181 PAGFPFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR 240
           PAGFPFFYIKP +HE IIR FYPEE+GPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR
Sbjct: 181 PAGFPFFYIKPEDHEKIIRAFYPEESGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR 240

Query: 241 MKDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHYTYGCD 300
           MKDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHYTYGCD
Sbjct: 241 MKDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHYTYGCD 300

Query: 301 YTMKGELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGWG 360
           Y MKGELTYGKIGEWRFDKRS+L+ PPPRNLSLPPPGVPESVVRLVKMVNEATANIPGWG
Sbjct: 301 YNMKGELTYGKIGEWRFDKRSYLSSPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGWG 354

BLAST of Bhi10G000196 vs. NCBI nr
Match: XP_022944726.1 (hydroxyproline O-arabinosyltransferase 3-like isoform X1 [Cucurbita moschata] >XP_022944727.1 hydroxyproline O-arabinosyltransferase 3-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 684.1 bits (1764), Expect = 2.6e-193
Identity = 316/362 (87.29%), Positives = 334/362 (92.27%), Query Frame = 0

Query: 1   MIGRKTSPGFLVLLALGFLFASYNLLTMSVHYKASKGSWLADGFNLFDPVIRVPGRAERA 60
           MIGRKTSPGFLVLLALGF FASYNLLTMSV YKASKGS       LFDPV+R PG  +RA
Sbjct: 1   MIGRKTSPGFLVLLALGFFFASYNLLTMSVRYKASKGS------ELFDPVVRAPGGTDRA 60

Query: 61  GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEI 120
           GK   K+HVAVTAT + YSQWQCRIMYYWYKKVKD+PGSDMG FTRVLHSG PDNLMK I
Sbjct: 61  GKGIQKFHVAVTATASTYSQWQCRIMYYWYKKVKDMPGSDMGGFTRVLHSGLPDNLMKNI 120

Query: 121 PTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKN 180
           PTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEE YILMAEPDHIFV+PLPNLAHGKN
Sbjct: 121 PTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEENYILMAEPDHIFVRPLPNLAHGKN 180

Query: 181 PAGFPFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR 240
           P GFPFFYI+PT+HE I+RKFYPEENGPVTNIDPIGNSPVIIEK LLEEIAPTWVN+SLR
Sbjct: 181 PVGFPFFYIRPTDHEKILRKFYPEENGPVTNIDPIGNSPVIIEKGLLEEIAPTWVNVSLR 240

Query: 241 MKDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHYTYGCD 300
           MKDDP TDK FGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVG+KFIIHYTYGCD
Sbjct: 241 MKDDPVTDKTFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGKKFIIHYTYGCD 300

Query: 301 YTMKGELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGWG 360
           YTMKGELTYGK+GEWRFDKR+++ GPPPRNLSLPPPGVPESVVRLVK+VNEATANI GWG
Sbjct: 301 YTMKGELTYGKVGEWRFDKRAYIKGPPPRNLSLPPPGVPESVVRLVKLVNEATANIAGWG 356

Query: 361 ES 363
           ++
Sbjct: 361 DA 356

BLAST of Bhi10G000196 vs. NCBI nr
Match: XP_023511837.1 (hydroxyproline O-arabinosyltransferase 3-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023511838.1 hydroxyproline O-arabinosyltransferase 3-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 683.3 bits (1762), Expect = 4.4e-193
Identity = 316/362 (87.29%), Positives = 334/362 (92.27%), Query Frame = 0

Query: 1   MIGRKTSPGFLVLLALGFLFASYNLLTMSVHYKASKGSWLADGFNLFDPVIRVPGRAERA 60
           MIGRKTSPGFLVLLALGF FASYNLLTMSV YKASKGS       LFDPV+R PG  ERA
Sbjct: 1   MIGRKTSPGFLVLLALGFCFASYNLLTMSVRYKASKGS------ELFDPVVRAPGGTERA 60

Query: 61  GKTNSKYHVAVTATDAPYSQWQCRIMYYWYKKVKDLPGSDMGSFTRVLHSGTPDNLMKEI 120
           GK   K+HVAVTAT + YSQWQCRIMYYWY KVKD+PGSDMG FTRVLHSG PDNLMK+I
Sbjct: 61  GKGIQKFHVAVTATASTYSQWQCRIMYYWYNKVKDMPGSDMGGFTRVLHSGLPDNLMKKI 120

Query: 121 PTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHIFVKPLPNLAHGKN 180
           PTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEE YILMAEPDHIFV+PLPNLAHGKN
Sbjct: 121 PTFIVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEENYILMAEPDHIFVRPLPNLAHGKN 180

Query: 181 PAGFPFFYIKPTEHENIIRKFYPEENGPVTNIDPIGNSPVIIEKTLLEEIAPTWVNISLR 240
           P GFPFFYI+PT+HE I+RKFYPEENGPVTNIDPIGNSPVIIEK LLEEIAPTWVN+SLR
Sbjct: 181 PVGFPFFYIRPTDHEKILRKFYPEENGPVTNIDPIGNSPVIIEKGLLEEIAPTWVNVSLR 240

Query: 241 MKDDPATDKAFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGRKFIIHYTYGCD 300
           MKDDP TDK FGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVG+KFIIHYTYGCD
Sbjct: 241 MKDDPVTDKTFGWVLEMYAYAVASALHGVRHTLRKDFMLQPPWDLEVGKKFIIHYTYGCD 300

Query: 301 YTMKGELTYGKIGEWRFDKRSFLNGPPPRNLSLPPPGVPESVVRLVKMVNEATANIPGWG 360
           YTMKGELTYGK+GEWRFDKR+++ GPPPRNLSLPPPGVPESVVRLVK+VNEATANI GWG
Sbjct: 301 YTMKGELTYGKVGEWRFDKRAYIKGPPPRNLSLPPPGVPESVVRLVKLVNEATANIAGWG 356

Query: 361 ES 363
           ++
Sbjct: 361 DA 356
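
The summary figures reported for both alignments above can be re-derived from the raw counts. The following is a minimal Python sketch, not part of the original record: the helper names `percent` and `count_midline` are hypothetical, and it assumes the usual BLAST midline convention in which a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap.

```python
from typing import Tuple


def percent(matches: int, aligned: int) -> float:
    """Return matches/aligned as a percentage rounded to two decimals."""
    return round(100.0 * matches / aligned, 2)


def count_midline(midline: str) -> Tuple[int, int]:
    """Count (identities, positives) in one BLAST midline string.

    Assumed convention: letter = identical residue, '+' = conservative
    substitution, space = mismatch or gap. Positives include identities.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


# Counts taken from the HSP header lines above: 316 and 334 out of 362.
identity_pct = percent(316, 362)
positive_pct = percent(334, 362)
print(identity_pct, positive_pct)

# The final two-residue block (query "ES", midline "++") has no identities.
print(count_midline("++"))
```

Summing `count_midline` over every midline of an HSP should reproduce the numerators in its header line, which is a quick consistency check when alignments are copied by hand.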

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
AT5G13500.1	4.1e-161	72.68	unknown protein
AT2G25260.1	2.2e-138	71.14	unknown protein
AT5G25265.1	2.9e-138	68.24	unknown protein

Match Name	E-value	Identity	Description
sp|Q9FY51|HPAT3_ARATH	7.5e-160	72.68	Hydroxyproline O-arabinosyltransferase 3 OS=Arabidopsis thaliana OX=3702 GN=HPAT...
sp|Q494Q2|HPAT2_ARATH	4.0e-137	71.14	Hydroxyproline O-arabinosyltransferase 2 OS=Arabidopsis thaliana OX=3702 GN=HPAT...
sp|Q8W4E6|HPAT1_ARATH	5.2e-137	68.24	Hydroxyproline O-arabinosyltransferase 1 OS=Arabidopsis thaliana OX=3702 GN=HPAT...

Match Name	E-value	Identity	Description
tr|A0A1S3BN61|A0A1S3BN61_CUCME	3.9e-214	96.69	uncharacterized protein LOC103491472 OS=Cucumis melo OX=3656 GN=LOC103491472 PE=...
tr|A0A0A0KEZ8|A0A0A0KEZ8_CUCSA	4.2e-200	91.99	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G430740 PE=4 SV=1
tr|A0A2C9WE38|A0A2C9WE38_MANES	3.3e-181	81.04	Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_02G139400 PE=4 SV=...
tr|A0A2I4GF59|A0A2I4GF59_9ROSI	8.2e-180	81.32	uncharacterized protein LOC109007348 isoform X2 OS=Juglans regia OX=51240 GN=LOC...
tr|A0A2I4GF70|A0A2I4GF70_9ROSI	8.2e-180	81.32	uncharacterized protein LOC109007348 isoform X1 OS=Juglans regia OX=51240 GN=LOC...

Match Name	E-value	Identity	Description
XP_008449655.1	5.9e-214	96.69	PREDICTED: uncharacterized protein LOC103491472 [Cucumis melo] >XP_008449656.1 P...
XP_011657614.1	6.3e-200	91.99	PREDICTED: uncharacterized protein LOC101207236 [Cucumis sativus] >KGN48098.1 hy...
XP_022153085.1	2.7e-198	91.11	hydroxyproline O-arabinosyltransferase 3-like [Momordica charantia] >XP_02215308...
XP_022944726.1	2.6e-193	87.29	hydroxyproline O-arabinosyltransferase 3-like isoform X1 [Cucurbita moschata] >X...
XP_023511837.1	4.4e-193	87.29	hydroxyproline O-arabinosyltransferase 3-like isoform X1 [Cucurbita pepo subsp. ...
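
As an illustration of how such hit lists are typically compared, here is a short Python sketch that ranks the five RefSeq (XP_) hits above by E-value and filters them by percent identity. The function names `rank_hits` and `filter_by_identity` and the 90% cutoff are assumptions made for this example only; the accessions, E-values, and identities are copied from the list above.

```python
# (accession, E-value as text, percent identity), copied from the hit list above.
hits = [
    ("XP_008449655.1", "5.9e-214", 96.69),
    ("XP_011657614.1", "6.3e-200", 91.99),
    ("XP_022153085.1", "2.7e-198", 91.11),
    ("XP_022944726.1", "2.6e-193", 87.29),
    ("XP_023511837.1", "4.4e-193", 87.29),
]


def rank_hits(rows):
    """Sort hits from most to least significant (smallest E-value first)."""
    # These E-values are still representable as Python floats.
    return sorted(rows, key=lambda row: float(row[1]))


def filter_by_identity(rows, min_identity):
    """Keep hits whose percent identity is at least min_identity."""
    return [row for row in rows if row[2] >= min_identity]


ranked = rank_hits(hits)
close_homologs = filter_by_identity(hits, 90.0)  # hypothetical cutoff

for accession, evalue, identity in ranked:
    print(accession, evalue, identity)
```

E-values are kept as text and converted only for sorting, so the values print exactly as they appear in the record.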
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005575 cellular_component
cellular_component GO:0016020 membrane
cellular_component GO:0005801 cis-Golgi network
cellular_component GO:0005768 endosome
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0005886 plasma membrane
cellular_component GO:0005802 trans-Golgi network
cellular_component GO:0005774 vacuolar membrane
molecular_function GO:0016740 transferase activity
molecular_function GO:0003674 molecular_function
molecular_function GO:1990585 hydroxyproline O-arabinosyltransferase activity
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
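
The GO table above mixes the three ontology root terms (GO:0008150, GO:0005575, GO:0003674) with more specific annotations. The following Python sketch groups the rows by category and drops the roots; the names `group_by_category` and `ROOT_TERMS` are hypothetical, and the rows are copied from the table above.

```python
from collections import defaultdict

# The three root terms of the Gene Ontology; they carry no specific information.
ROOT_TERMS = {"GO:0008150", "GO:0005575", "GO:0003674"}

# (category, accession, term name), copied from the GO table above.
go_rows = [
    ("biological_process", "GO:0008152", "metabolic process"),
    ("biological_process", "GO:0008150", "biological_process"),
    ("cellular_component", "GO:0016021", "integral component of membrane"),
    ("cellular_component", "GO:0005575", "cellular_component"),
    ("cellular_component", "GO:0016020", "membrane"),
    ("cellular_component", "GO:0005801", "cis-Golgi network"),
    ("cellular_component", "GO:0005768", "endosome"),
    ("cellular_component", "GO:0005794", "Golgi apparatus"),
    ("cellular_component", "GO:0005886", "plasma membrane"),
    ("cellular_component", "GO:0005802", "trans-Golgi network"),
    ("cellular_component", "GO:0005774", "vacuolar membrane"),
    ("molecular_function", "GO:0016740", "transferase activity"),
    ("molecular_function", "GO:0003674", "molecular_function"),
    ("molecular_function", "GO:1990585", "hydroxyproline O-arabinosyltransferase activity"),
    ("molecular_function", "GO:0016757", "transferase activity, transferring glycosyl groups"),
]


def group_by_category(rows, skip_roots=True):
    """Map each GO category to its list of (accession, term name) pairs."""
    grouped = defaultdict(list)
    for category, accession, name in rows:
        if skip_roots and accession in ROOT_TERMS:
            continue
        grouped[category].append((accession, name))
    return dict(grouped)


grouped = group_by_category(go_rows)
for category, terms in grouped.items():
    print(category, len(terms))
```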

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Bhi10M000196	Bhi10M000196	mRNA


Analysis Name: InterPro Annotations of wax gourd
Date Performed: 2019-11-17
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	PANTHER	PTHR31485	FAMILY NOT NAMED	coord: 6..360
None	No IPR available	PANTHER	PTHR31485:SF4	SUBFAMILY NOT NAMED	coord: 6..360