CcUC04G063670 (gene) Watermelon (PI 537277) v1

Overview
NameCcUC04G063670
Typegene
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionO-fucosyltransferase family protein
LocationCicolChr04: 19799927 .. 19804306 (-)
RNA-Seq ExpressionCcUC04G063670
SyntenyCcUC04G063670
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGTGTTTTAGGACTTGAAGGCACAAAGCTTGATGATTGGACACATGGTGGTTAGGTGAGAGGCAAAACCACAAACATCTTTTGCATCAAAATTTCTTTGATTCAAAGTTATATATCAATCAACAATTCCATTTATAATTAATTTACAATTCTCAGTACTCACTGCTACTTATGCTCGATCCAAGTTCTTGTCCTTCAGCGGATGCGATCGATTTCTATAGCATTTCCTTTGTTCGAGTGAATCCAATTTCAATCACTCTTGCGCTATACATTTCAATTTCTCTTCCACTCTGCAAATTCCATGGCATTTCCCAGAGCCCATAAGCCAAAACCCAAACCCAGATCCCCAATCAGCCTCTTCTTCGTTGCCCTCGCCGCCATTGCCTTTCTTTTCTTCTTTTCCTCACTGATTTCTACCAATGGGGCTTCTTCCTTACCATCCTCGAATTCAATTCAGAAAATCTTCAGATTCAAGAATCTAACCCAGAAACAGAGACGTAATCGCCATGTTTTTAGTGTCAATGACAAGTTCCTGTACTGGGGCAACAGAATCGACTGCCCGGGGAAGCACTGCGAGTCTTGTGAGGGTTTGGGTCACCAGGAATCCAGCTTGAGGTGTGCCCTTGAAGAAGCCATGTTCCTCCAAAGGTAATTTTTCATGATGTTCTTCTGAAGTCTGGGGTTTAATCCTTACTTGGAGATGTTTTATCCCTGCTTAATTTGTTGGATTGGTATGATAAGGTTCTGTATGAAAGGGATTTGGAATCGGTGAGCAGTAACTTCTCAGCATTGTTGTAGAACTAAATGGAGAATTATCATATTTAGAAACTTCTGTTTGCCCCTTGCTTGATTTTAGTTCTATGTTATTGCTATGGGGTGCCATTTAGTTTTGTCTCACTTTTCTCTTTCTAATGGTAGTGGATTGGTCCAGTGCTCTTAAGTTCAGAATAATTTATTCGTAGATTGCTTTTGCATAACCTACCTACGCACTTAACTACTCTTTCTTTTTGGCTGCAGAACATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACACAATAAGAAAGGGCTTCTTCATCAGTCCACTAATGCAAGCTCAGAGGAAAGGTGTTTCTTCCCTCTTTTCTTCTCCAATAATGTTTTCCTTGTATGCTCTCTTTCTTGCAACTGTGTTGATCCATGTTGTTTCTAATCATATAAATATTTTACCTATTGAATGTTGCATTTTCTAACAGGCATATACTTGATGGTAGTTGAAGCTGTATTAAATTGGTTCTCTTTGGGCTTCAAACTAAGATGCAAACTTCATTGCATAAAAAATTCACTGTTCTTTCATATATTTGTGATCTTTTCTAGAGAAAATGTAGAACCTGAATTTGTTCTGTGCATTGCAGTTGGGAAGCAAACTCTTGTGCCATGGACTCGTTGTACGATATGGATCTCATATCTGACACCGTGCCAGTGATTTTAGACAACTCAAAATTATGGTATCAGGTGCTATCAACTGGTATGAAATTAGGAGCTAGAGCAGTTGCCCACGTCGAGCAAGTTAGTCGTGTTGAACTCAGAGACAACAGCCACTACTCCAATCTTTTGCTAATAAATCGAACTGCCAGCCCGCTTTCATGGTAAATCAAACTTGGGGAAATAAGATTCTTTTTATCACTCTTATGTGTATGAATGTGTCTAGATAGATCCAAGTGCGATCATATGCTCAACTAGGCTTATCTTTTTGATGCTTTGTTTTTATGCTTCTATTTGCAACTTGGTATGACAGATGTTTCAATTTAAGATCCCAAGTTATGAATTCCTTGGGTGTTAATGTCATGATAACTCTTTAGGTTTATGGAATGCAAGGACAGAAACAACCGCAGTGCCATATTGTTGCCCTATAAATTTCTTCCTTCTATGGCCGCAGAAAACTTGAGGGATGCAGCTGAGAAGGTATTTTCTTGAAATCATATTAGGTTCTTTCTTTTGCTATCCATTGCATCACATGCATAGATATTTGCACTCTTCTATAAATCATGACATAGGGCTTAACAAAGTTGCAAGCTATTACCCATTGCTTATGAAACCTAGTTTCAGTGCTTTTCCATATTTATAGGTTTGTGTTTTATAGTCGTAGTAGATGCTTAAGTTCAGTTTTACAAGATTAGGTATTCGGTTTATTCAGACTTCGCAGATGTTTCTTAGCATATACTTGCCACCTTAGGAAATATAATTGTTGAACTCTGGGTAGAATGTAAATACTTTTCTTTAGATACTCTTTGGCAAGTATAGCAAAGGAAAATCTAATTAAAGACTCATTGTTTATTTATTTTATTATTATTATTATTTTATTTTATTTATTATTATTATTATTATTTTTTTGCGATAACAGATATCAAGGGGTGCCACTCTGTTTTGTGCTCATGGCCCACCTTTACTATTTCCTTAGAAAGCTCTCTGCAAGGGTTTTATGATAATCGATTTGTTGTTGAAACTCTTGATAATCAATTTGTTGTTTTTCAAGTTTTAGCTTATAAGTACTGTCTACACTTATGCGTTTCTTTGCTTTATCTTCCACTTTTTGAAAATGTTTTCGAAATCTAAGCCAAATTTTAAAACCAAAAATAGTATTTCTTAAAAACTTGTTTTCTTTTTGGAGTTTGTCTTCAAATTCATATGCATTTGCAAAAGTAGTGAAAATATACCATTAAAATTGAGGGAAGTAGGCATAGTCTTTAGAAACAAAAAACTAAAATGGGGCCACTCCTAGGATGACCTGAGTTTAAGACTTTCCACTCGTAGGAATTCAATTTAGTCCACCATCCAAAGTACTTGGAAAATCTTTATTTTTCTGTCTTTGTTGAACTTTGAAGTTCTGTCGTCAGGCCAAAGAATGAGAAAGAAGGGGCTGTAAGCATGACAACCACTTGCAGAAGTAAGCCAGTTGTAGCCATTATACTTGGAGTAGGTTTGTAGAAAATTTGTAAGTAGCTGAAACTTTATGGTTTTGGTGCCTTCTTGGAGAGTGGAAACTGAATGGAAGGCACACCATTGATTAGGGATCCTGATATGCTTCATGAAGCCAGGTTAAGATCAAAGGTGTTCGTTTAACCTCTTAGCAGCATTAGATAAATTCAAAAAGAGAAATTAGCTATGGCGGGTAGGTGGTGAGGCTTGCATTGGTTAGAGACGGAGTTTGCTTGCAGTTCTCTGCTTATGTTTGGGTTACTGATTTCTTATACCGTGCGTAATAACGTTGAGAAAAGACTTACAAGTTGTCTCTCTTTCTTTTTGGAACATTTCTATAAGTAACCGTGTCTCCCCAAATCTTATTCCTTGAATTTTGTGTCGTTTAATATTTCTTAATGGTGATGATAGTGAAAAAATATCTCATTTTACTGATAATCTTGAAAAGAACATCTGATCGACAGATTAAGGGACTACTCGGTGATTATGATGCCATCCATGTTCGTCGTGGAGATAAAATAAAAACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCGCATCTTGACAGGGATACACGGCCCGAGTTTATGCTAAAGAGAATAGCAAAGTGGGTTCCGGCAGGGCGGACACTTTTTATTGCTTCAAATGAGAGAATTCCTGGATTCTTCTCTCCCCTCTCTGCTCGGTGAGCCCATCTCACTCACCTTTTTAGTTCACTAGAGTTGGAGACGAAGTATTAGTTCTTTGCCATTTAGTAGTAGAAAGACTTCACTCAAAGGATCGTCTTGTGTTCTGATAGCTATAATATAAATCACCAAACCGTTTTTCATTTAGTTGCATATCCCCTTGGGGATTCCTGTTAACCAAGAATAAAGAAGCTTTGTGCTTCAATATTATTTCACATCACATTGATTCATGATACCCTTCTGATTTTGCAGGTACAGGTTAGCTTATTCCTCTAACTATAGCGATATTCTAGATCCTGTGGTTAAGAACAATTATCAGCTGTTCATGATCGAAAGGCTCATTATGGCGGGTGCAAAGACATTCATCAGAACATTCAAAGAAGACGACACGGATCTAAGCCTCACTGACGACCTAAAGAAGAACACGAAAATATGGCAAATACCTGTCTACACAGCTGATGAAGAAAGAAGCTGAGGAATTATTGCGCAGTTATTCCTGTGGGAAAATGTCTTGGAGAGATCTTTTTGATCCACCAACAAAAGCCTTTCAATGTTCATTGAAAAGTTTCATAGTGAAGATAAGGATGTTCTAGGAAAGGTAGATTGCTTGGCCAATGTTGTAAATGTGTGGTATAAATTACGATCCAGGTTTCTGAATTTCATTAGTTTCATTGTTTACTGGCCCCCATTAGTTTGTGACCAGTTTTGTTTTGTTTTTGTTCTCT

mRNA sequence

TGTGTTTTAGGACTTGAAGGCACAAAGCTTGATGATTGGACACATGGTGGTTAGTACTCACTGCTACTTATGCTCGATCCAAGTTCTTGTCCTTCAGCGGATGCGATCGATTTCTATAGCATTTCCTTTGTTCGAGTGAATCCAATTTCAATCACTCTTGCGCTATACATTTCAATTTCTCTTCCACTCTGCAAATTCCATGGCATTTCCCAGAGCCCATAAGCCAAAACCCAAACCCAGATCCCCAATCAGCCTCTTCTTCGTTGCCCTCGCCGCCATTGCCTTTCTTTTCTTCTTTTCCTCACTGATTTCTACCAATGGGGCTTCTTCCTTACCATCCTCGAATTCAATTCAGAAAATCTTCAGATTCAAGAATCTAACCCAGAAACAGAGACGTAATCGCCATGTTTTTAGTGTCAATGACAAGTTCCTGTACTGGGGCAACAGAATCGACTGCCCGGGGAAGCACTGCGAGTCTTGTGAGGGTTTGGGTCACCAGGAATCCAGCTTGAGGTGTGCCCTTGAAGAAGCCATGTTCCTCCAAAGAACATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACACAATAAGAAAGGGCTTCTTCATCAGTCCACTAATGCAAGCTCAGAGGAAAGTTGGGAAGCAAACTCTTGTGCCATGGACTCGTTGTACGATATGGATCTCATATCTGACACCGTGCCAGTGATTTTAGACAACTCAAAATTATGGTATCAGGTGCTATCAACTGGTATGAAATTAGGAGCTAGAGCAGTTGCCCACGTCGAGCAAGTTAGTCGTGTTGAACTCAGAGACAACAGCCACTACTCCAATCTTTTGCTAATAAATCGAACTGCCAGCCCGCTTTCATGGTTTATGGAATGCAAGGACAGAAACAACCGCAGTGCCATATTGTTGCCCTATAAATTTCTTCCTTCTATGGCCGCAGAAAACTTGAGGGATGCAGCTGAGAAGATTAAGGGACTACTCGGTGATTATGATGCCATCCATGTTCGTCGTGGAGATAAAATAAAAACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCGCATCTTGACAGGGATACACGGCCCGAGTTTATGCTAAAGAGAATAGCAAAGTGGGTTCCGGCAGGGCGGACACTTTTTATTGCTTCAAATGAGAGAATTCCTGGATTCTTCTCTCCCCTCTCTGCTCGGTACAGGTTAGCTTATTCCTCTAACTATAGCGATATTCTAGATCCTGTGGTTAAGAACAATTATCAGCTGTTCATGATCGAAAGGCTCATTATGGCGGGTGCAAAGACATTCATCAGAACATTCAAAGAAGACGACACGGATCTAAGCCTCACTGACGACCTAAAGAAGAACACGAAAATATGGCAAATACCTGTCTACACAGCTGATGAAGAAAGAAGCTGAGGAATTATTGCGCAGTTATTCCTGTGGGAAAATGTCTTGGAGAGATCTTTTTGATCCACCAACAAAAGCCTTTCAATGTTCATTGAAAAGTTTCATAGTGAAGATAAGGATGTTCTAGGAAAGGTAGATTGCTTGGCCAATGTTGTAAATGTGTGGTATAAATTACGATCCAGGTTTCTGAATTTCATTAGTTTCATTGTTTACTGGCCCCCATTAGTTTGTGACCAGTTTTGTTTTGTTTTTGTTCTCT

Coding sequence (CDS)

ATGGCATTTCCCAGAGCCCATAAGCCAAAACCCAAACCCAGATCCCCAATCAGCCTCTTCTTCGTTGCCCTCGCCGCCATTGCCTTTCTTTTCTTCTTTTCCTCACTGATTTCTACCAATGGGGCTTCTTCCTTACCATCCTCGAATTCAATTCAGAAAATCTTCAGATTCAAGAATCTAACCCAGAAACAGAGACGTAATCGCCATGTTTTTAGTGTCAATGACAAGTTCCTGTACTGGGGCAACAGAATCGACTGCCCGGGGAAGCACTGCGAGTCTTGTGAGGGTTTGGGTCACCAGGAATCCAGCTTGAGGTGTGCCCTTGAAGAAGCCATGTTCCTCCAAAGAACATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACACAATAAGAAAGGGCTTCTTCATCAGTCCACTAATGCAAGCTCAGAGGAAAGTTGGGAAGCAAACTCTTGTGCCATGGACTCGTTGTACGATATGGATCTCATATCTGACACCGTGCCAGTGATTTTAGACAACTCAAAATTATGGTATCAGGTGCTATCAACTGGTATGAAATTAGGAGCTAGAGCAGTTGCCCACGTCGAGCAAGTTAGTCGTGTTGAACTCAGAGACAACAGCCACTACTCCAATCTTTTGCTAATAAATCGAACTGCCAGCCCGCTTTCATGGTTTATGGAATGCAAGGACAGAAACAACCGCAGTGCCATATTGTTGCCCTATAAATTTCTTCCTTCTATGGCCGCAGAAAACTTGAGGGATGCAGCTGAGAAGATTAAGGGACTACTCGGTGATTATGATGCCATCCATGTTCGTCGTGGAGATAAAATAAAAACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCGCATCTTGACAGGGATACACGGCCCGAGTTTATGCTAAAGAGAATAGCAAAGTGGGTTCCGGCAGGGCGGACACTTTTTATTGCTTCAAATGAGAGAATTCCTGGATTCTTCTCTCCCCTCTCTGCTCGGTACAGGTTAGCTTATTCCTCTAACTATAGCGATATTCTAGATCCTGTGGTTAAGAACAATTATCAGCTGTTCATGATCGAAAGGCTCATTATGGCGGGTGCAAAGACATTCATCAGAACATTCAAAGAAGACGACACGGATCTAAGCCTCACTGACGACCTAAAGAAGAACACGAAAATATGGCAAATACCTGTCTACACAGCTGATGAAGAAAGAAGCTGA

Protein sequence

MAFPRAHKPKPKPRSPISLFFVALAAIAFLFFFSSLISTNGASSLPSSNSIQKIFRFKNLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRVELRDNSHYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYRLAYSSNYSDILDPVVKNNYQLFMIERLIMAGAKTFIRTFKEDDTDLSLTDDLKKNTKIWQIPVYTADEERS
Homology
BLAST of CcUC04G063670 vs. NCBI nr
Match: XP_038881641.1 (uncharacterized protein LOC120073097 [Benincasa hispida])

HSP 1 Score: 783.1 bits (2021), Expect = 1.2e-222
Identity = 387/408 (94.85%), Postives = 393/408 (96.32%), Query Frame = 0

Query: 1   MAFPRAHKPKPKPRSPISLFFVALAAIAFLFFFSSLISTNGASSLPSSNSIQKIFRFKNL 60
           MAFPR  KPKPKPRSP+  FFVALAAIAFLF FSSL+STNGASS  SSNSIQKIFRFKNL
Sbjct: 1   MAFPRTQKPKPKPRSPLIFFFVALAAIAFLFLFSSLVSTNGASSFSSSNSIQKIFRFKNL 60

Query: 61  TQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120
           TQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR FVM
Sbjct: 61  TQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRIFVM 120

Query: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQ 180
           PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQ
Sbjct: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQ 180

Query: 181 VLSTGMKLGARAVAHVEQVSRVELRDNSHYSNLLLINRTASPLSWFMECKDRNNRSAILL 240
           VLSTGMKLGARAVAHVEQVSR+ELRDNS YS+LLLINRTASPLSWFMECKDRNNRSAILL
Sbjct: 181 VLSTGMKLGARAVAHVEQVSRLELRDNSRYSDLLLINRTASPLSWFMECKDRNNRSAILL 240

Query: 241 PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300
           PYKFLPSMAAEN+RDAAEKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Sbjct: 241 PYKFLPSMAAENMRDAAEKIKALLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300

Query: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYRLAYSSNYSDILDPVVKNNYQLF 360
           PEFMLKRIAKWVPAGRTLFIASNER PGFFSPLS RY+LAYS NYS ILDPVVKNNYQLF
Sbjct: 301 PEFMLKRIAKWVPAGRTLFIASNERTPGFFSPLSDRYKLAYSLNYSSILDPVVKNNYQLF 360

Query: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDLKKNTKIWQIPVYTADEER 409
           MIERLIMAGAKTFIRTFKEDDTDLSLTDD KKNTKIWQIPVYTADEER
Sbjct: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQIPVYTADEER 408

BLAST of CcUC04G063670 vs. NCBI nr
Match: XP_008455718.1 (PREDICTED: uncharacterized protein LOC103495824 [Cucumis melo] >KAA0025926.1 uncharacterized protein E6C27_scaffold34G002270 [Cucumis melo var. makuwa] >TYK27788.1 uncharacterized protein E5676_scaffold749G00060 [Cucumis melo var. makuwa])

HSP 1 Score: 782.7 bits (2020), Expect = 1.5e-222
Identity = 384/408 (94.12%), Postives = 394/408 (96.57%), Query Frame = 0

Query: 1   MAFPRAHKPKPKPRSPISLFFVALAAIAFLFFFSSLISTNGASSLPSSNSIQKIFRFKNL 60
           MAFPR  KPKPK RSP+  FFV+LAAIAFLF FSSLISTNG+SS PSSNSIQKIFRFKNL
Sbjct: 1   MAFPRTQKPKPKHRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNL 60

Query: 61  TQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120
           TQKQRR RH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM
Sbjct: 61  TQKQRRGRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120

Query: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQ 180
           PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQ
Sbjct: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQ 180

Query: 181 VLSTGMKLGARAVAHVEQVSRVELRDNSHYSNLLLINRTASPLSWFMECKDRNNRSAILL 240
           VLST MKLGARAVAHVEQVSR+ELRD+SHYSNLLLINRTASPLSWFMECKDRNNRSA++L
Sbjct: 181 VLSTSMKLGARAVAHVEQVSRIELRDSSHYSNLLLINRTASPLSWFMECKDRNNRSAVML 240

Query: 241 PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300
           PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Sbjct: 241 PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300

Query: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYRLAYSSNYSDILDPVVKNNYQLF 360
           PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARY+LAYSSNYSDILDPVVKNNYQLF
Sbjct: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLF 360

Query: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDLKKNTKIWQIPVYTADEER 409
           MIERLIMAGAKTFIRTFKEDDTDLSLTDD KKNTK+WQIPVYT +E R
Sbjct: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIPVYTDEERR 408

BLAST of CcUC04G063670 vs. NCBI nr
Match: XP_004144331.1 (uncharacterized protein LOC101219097 [Cucumis sativus] >KGN54704.1 hypothetical protein Csa_012613 [Cucumis sativus])

HSP 1 Score: 766.1 bits (1977), Expect = 1.5e-217
Identity = 377/408 (92.40%), Postives = 390/408 (95.59%), Query Frame = 0

Query: 1   MAFPRAHKPKPKPRSPISLFFVALAAIAFLFFFSSLISTNGASSLPSSNSIQKIFRFKNL 60
           MAFPR  KPKPKPRSP+  FFV+L+AIAFLF FSSLISTNG+SS PSSNSIQKIFR KNL
Sbjct: 1   MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNL 60

Query: 61  TQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120
           TQKQRRNRH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM
Sbjct: 61  TQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120

Query: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQ 180
           PSRMCINPIHNKKGLLHQS N+SSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQ
Sbjct: 121 PSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQ 180

Query: 181 VLSTGMKLGARAVAHVEQVSRVELRDNSHYSNLLLINRTASPLSWFMECKDRNNRSAILL 240
           VLSTGMKLGARAV HVE+VSR+ELRD+S YSNLLLINRTASPLSWFMECKDRNN SA++L
Sbjct: 181 VLSTGMKLGARAVGHVEKVSRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVML 240

Query: 241 PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300
           PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Sbjct: 241 PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300

Query: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYRLAYSSNYSDILDPVVKNNYQLF 360
           PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARY+LAYSSNYSDILDPVV+NNYQLF
Sbjct: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLF 360

Query: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDLKKNTKIWQIPVYTADEER 409
           MIERLIMAGAKT IRTFKEDDTDLSLTDD KKNTK WQIPVYT +E R
Sbjct: 361 MIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIPVYTDEERR 407

BLAST of CcUC04G063670 vs. NCBI nr
Match: XP_022153942.1 (uncharacterized protein LOC111021332 [Momordica charantia])

HSP 1 Score: 751.1 bits (1938), Expect = 4.9e-213
Identity = 372/411 (90.51%), Postives = 386/411 (93.92%), Query Frame = 0

Query: 1   MAFPRAHKPKPKPRSPISLFFVALAAIAFLFFFSSLISTNGASS--LPSSNSIQKIFRFK 60
           MAFPRA K KPKPRSP+  FFVALAAIAFLF FSSLISTNGASS    SSNSIQKIFRF 
Sbjct: 1   MAFPRAQKAKPKPRSPLFFFFVALAAIAFLFLFSSLISTNGASSSTFSSSNSIQKIFRFN 60

Query: 61  NLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTF 120
           N+ +K +RNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFL+R F
Sbjct: 61  NVNEKPKRNRHVFSANDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLRRIF 120

Query: 121 VMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKG+LHQS NASSEESWEA SCAMDSLYD+DLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHQSNNASSEESWEAKSCAMDSLYDIDLISDTVPVILDNSKLW 180

Query: 181 YQVLSTGMKLGARAVAHVEQVSRVELRDNSHYSNLLLINRTASPLSWFMECKDRNNRSAI 240
           YQVLSTGMKLGARAVAHVE+VSR EL+DN+ YSNLLLINRTASPLSWFMECKDRNNRSAI
Sbjct: 181 YQVLSTGMKLGARAVAHVERVSRAELKDNNRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYRLAYSSNYSDILDPVVKNNYQ 360
           TRPEFMLKR+AKWV  GRTLFIASNER PGFFSPLSARY+LAYSSNYS ILDP+VKNNYQ
Sbjct: 301 TRPEFMLKRLAKWVAPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPMVKNNYQ 360

Query: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDLKKNTKIWQIPVYTADEERS 410
           LFMIERLIMAGAKTFIRTFKEDDTDLSLTDD KKNTKIWQ PVYT DEE+S
Sbjct: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTDDEEKS 411

BLAST of CcUC04G063670 vs. NCBI nr
Match: XP_022967807.1 (uncharacterized protein LOC111467213 isoform X1 [Cucurbita maxima])

HSP 1 Score: 747.3 bits (1928), Expect = 7.1e-212
Identity = 369/410 (90.00%), Postives = 382/410 (93.17%), Query Frame = 0

Query: 1   MAFPRAHKPKPKPRSPISLFFVALAAIAFLFFFSSLISTNG-ASSLPSSNSIQKIFRFKN 60
           MA  +  K KPKPRSP   FFVALA IAFLF FSSLISTNG +SS PSSNSI++IFRFKN
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSFPSSNSIREIFRFKN 60

Query: 61  LTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFV 120
           L QKQRRNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR FV
Sbjct: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120

Query: 121 MPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWY 180
           MPSRMCINPIHNKKG+LHQSTNASSEE WE NSCAMDSLYDMDLISDTVPVILDNSKLWY
Sbjct: 121 MPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180

Query: 181 QVLSTGMKLGARAVAHVEQVSRVELRDNSHYSNLLLINRTASPLSWFMECKDRNNRSAIL 240
           QV STGMKLG+R VAHV+QVSR+ELRD+S YSNLLLINRTASPLSWFMECKDRNNRSAIL
Sbjct: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAIL 240

Query: 241 LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300
           LPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Sbjct: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300

Query: 301 RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYRLAYSSNYSDILDPVVKNNYQL 360
           RPEFMLKRIAKWVP GRTLFIASNER PGFFSPLSARY+LAYSSNYS ILDPVVKNNYQL
Sbjct: 301 RPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQL 360

Query: 361 FMIERLIMAGAKTFIRTFKEDDTDLSLTDDLKKNTKIWQIPVYTADEERS 410
           FMIERLIMAGAKTFIRTFKEDDTDLSLTDD KKNTK+WQ P+YT DEE S
Sbjct: 361 FMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 410

BLAST of CcUC04G063670 vs. ExPASy TrEMBL
Match: A0A5D3DWC1 (O-fucosyltransferase family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold749G00060 PE=3 SV=1)

HSP 1 Score: 782.7 bits (2020), Expect = 7.4e-223
Identity = 384/408 (94.12%), Postives = 394/408 (96.57%), Query Frame = 0

Query: 1   MAFPRAHKPKPKPRSPISLFFVALAAIAFLFFFSSLISTNGASSLPSSNSIQKIFRFKNL 60
           MAFPR  KPKPK RSP+  FFV+LAAIAFLF FSSLISTNG+SS PSSNSIQKIFRFKNL
Sbjct: 1   MAFPRTQKPKPKHRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNL 60

Query: 61  TQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120
           TQKQRR RH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM
Sbjct: 61  TQKQRRGRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120

Query: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQ 180
           PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQ
Sbjct: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQ 180

Query: 181 VLSTGMKLGARAVAHVEQVSRVELRDNSHYSNLLLINRTASPLSWFMECKDRNNRSAILL 240
           VLST MKLGARAVAHVEQVSR+ELRD+SHYSNLLLINRTASPLSWFMECKDRNNRSA++L
Sbjct: 181 VLSTSMKLGARAVAHVEQVSRIELRDSSHYSNLLLINRTASPLSWFMECKDRNNRSAVML 240

Query: 241 PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300
           PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Sbjct: 241 PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300

Query: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYRLAYSSNYSDILDPVVKNNYQLF 360
           PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARY+LAYSSNYSDILDPVVKNNYQLF
Sbjct: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLF 360

Query: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDLKKNTKIWQIPVYTADEER 409
           MIERLIMAGAKTFIRTFKEDDTDLSLTDD KKNTK+WQIPVYT +E R
Sbjct: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIPVYTDEERR 408

BLAST of CcUC04G063670 vs. ExPASy TrEMBL
Match: A0A1S3C1H9 (O-fucosyltransferase family protein OS=Cucumis melo OX=3656 GN=LOC103495824 PE=3 SV=1)

HSP 1 Score: 782.7 bits (2020), Expect = 7.4e-223
Identity = 384/408 (94.12%), Postives = 394/408 (96.57%), Query Frame = 0

Query: 1   MAFPRAHKPKPKPRSPISLFFVALAAIAFLFFFSSLISTNGASSLPSSNSIQKIFRFKNL 60
           MAFPR  KPKPK RSP+  FFV+LAAIAFLF FSSLISTNG+SS PSSNSIQKIFRFKNL
Sbjct: 1   MAFPRTQKPKPKHRSPLIFFFVSLAAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRFKNL 60

Query: 61  TQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120
           TQKQRR RH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM
Sbjct: 61  TQKQRRGRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120

Query: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQ 180
           PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQ
Sbjct: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQ 180

Query: 181 VLSTGMKLGARAVAHVEQVSRVELRDNSHYSNLLLINRTASPLSWFMECKDRNNRSAILL 240
           VLST MKLGARAVAHVEQVSR+ELRD+SHYSNLLLINRTASPLSWFMECKDRNNRSA++L
Sbjct: 181 VLSTSMKLGARAVAHVEQVSRIELRDSSHYSNLLLINRTASPLSWFMECKDRNNRSAVML 240

Query: 241 PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300
           PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Sbjct: 241 PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300

Query: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYRLAYSSNYSDILDPVVKNNYQLF 360
           PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARY+LAYSSNYSDILDPVVKNNYQLF
Sbjct: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQLF 360

Query: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDLKKNTKIWQIPVYTADEER 409
           MIERLIMAGAKTFIRTFKEDDTDLSLTDD KKNTK+WQIPVYT +E R
Sbjct: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIPVYTDEERR 408

BLAST of CcUC04G063670 vs. ExPASy TrEMBL
Match: A0A0A0L0X3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G430860 PE=4 SV=1)

HSP 1 Score: 766.1 bits (1977), Expect = 7.2e-218
Identity = 377/408 (92.40%), Postives = 390/408 (95.59%), Query Frame = 0

Query: 1   MAFPRAHKPKPKPRSPISLFFVALAAIAFLFFFSSLISTNGASSLPSSNSIQKIFRFKNL 60
           MAFPR  KPKPKPRSP+  FFV+L+AIAFLF FSSLISTNG+SS PSSNSIQKIFR KNL
Sbjct: 1   MAFPRTQKPKPKPRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLKNL 60

Query: 61  TQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120
           TQKQRRNRH FSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM
Sbjct: 61  TQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVM 120

Query: 121 PSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQ 180
           PSRMCINPIHNKKGLLHQS N+SSEESWEANSCAMDSLYDMDLISDTVPVILDNSK WYQ
Sbjct: 121 PSRMCINPIHNKKGLLHQS-NSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWYQ 180

Query: 181 VLSTGMKLGARAVAHVEQVSRVELRDNSHYSNLLLINRTASPLSWFMECKDRNNRSAILL 240
           VLSTGMKLGARAV HVE+VSR+ELRD+S YSNLLLINRTASPLSWFMECKDRNN SA++L
Sbjct: 181 VLSTGMKLGARAVGHVEKVSRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVML 240

Query: 241 PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300
           PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR
Sbjct: 241 PYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTR 300

Query: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYRLAYSSNYSDILDPVVKNNYQLF 360
           PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARY+LAYSSNYSDILDPVV+NNYQLF
Sbjct: 301 PEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQLF 360

Query: 361 MIERLIMAGAKTFIRTFKEDDTDLSLTDDLKKNTKIWQIPVYTADEER 409
           MIERLIMAGAKT IRTFKEDDTDLSLTDD KKNTK WQIPVYT +E R
Sbjct: 361 MIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIPVYTDEERR 407

BLAST of CcUC04G063670 vs. ExPASy TrEMBL
Match: A0A6J1DKB9 (O-fucosyltransferase family protein OS=Momordica charantia OX=3673 GN=LOC111021332 PE=3 SV=1)

HSP 1 Score: 751.1 bits (1938), Expect = 2.4e-213
Identity = 372/411 (90.51%), Postives = 386/411 (93.92%), Query Frame = 0

Query: 1   MAFPRAHKPKPKPRSPISLFFVALAAIAFLFFFSSLISTNGASS--LPSSNSIQKIFRFK 60
           MAFPRA K KPKPRSP+  FFVALAAIAFLF FSSLISTNGASS    SSNSIQKIFRF 
Sbjct: 1   MAFPRAQKAKPKPRSPLFFFFVALAAIAFLFLFSSLISTNGASSSTFSSSNSIQKIFRFN 60

Query: 61  NLTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTF 120
           N+ +K +RNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFL+R F
Sbjct: 61  NVNEKPKRNRHVFSANDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLRRIF 120

Query: 121 VMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKG+LHQS NASSEESWEA SCAMDSLYD+DLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHQSNNASSEESWEAKSCAMDSLYDIDLISDTVPVILDNSKLW 180

Query: 181 YQVLSTGMKLGARAVAHVEQVSRVELRDNSHYSNLLLINRTASPLSWFMECKDRNNRSAI 240
           YQVLSTGMKLGARAVAHVE+VSR EL+DN+ YSNLLLINRTASPLSWFMECKDRNNRSAI
Sbjct: 181 YQVLSTGMKLGARAVAHVERVSRAELKDNNRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYRLAYSSNYSDILDPVVKNNYQ 360
           TRPEFMLKR+AKWV  GRTLFIASNER PGFFSPLSARY+LAYSSNYS ILDP+VKNNYQ
Sbjct: 301 TRPEFMLKRLAKWVAPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPMVKNNYQ 360

Query: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDLKKNTKIWQIPVYTADEERS 410
           LFMIERLIMAGAKTFIRTFKEDDTDLSLTDD KKNTKIWQ PVYT DEE+S
Sbjct: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTDDEEKS 411

BLAST of CcUC04G063670 vs. ExPASy TrEMBL
Match: A0A6J1HRU4 (O-fucosyltransferase family protein OS=Cucurbita maxima OX=3661 GN=LOC111467213 PE=3 SV=1)

HSP 1 Score: 747.3 bits (1928), Expect = 3.4e-212
Identity = 369/410 (90.00%), Postives = 382/410 (93.17%), Query Frame = 0

Query: 1   MAFPRAHKPKPKPRSPISLFFVALAAIAFLFFFSSLISTNG-ASSLPSSNSIQKIFRFKN 60
           MA  +  K KPKPRSP   FFVALA IAFLF FSSLISTNG +SS PSSNSI++IFRFKN
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSFPSSNSIREIFRFKN 60

Query: 61  LTQKQRRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFV 120
           L QKQRRNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR FV
Sbjct: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120

Query: 121 MPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWY 180
           MPSRMCINPIHNKKG+LHQSTNASSEE WE NSCAMDSLYDMDLISDTVPVILDNSKLWY
Sbjct: 121 MPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180

Query: 181 QVLSTGMKLGARAVAHVEQVSRVELRDNSHYSNLLLINRTASPLSWFMECKDRNNRSAIL 240
           QV STGMKLG+R VAHV+QVSR+ELRD+S YSNLLLINRTASPLSWFMECKDRNNRSAIL
Sbjct: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAIL 240

Query: 241 LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300
           LPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Sbjct: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300

Query: 301 RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYRLAYSSNYSDILDPVVKNNYQL 360
           RPEFMLKRIAKWVP GRTLFIASNER PGFFSPLSARY+LAYSSNYS ILDPVVKNNYQL
Sbjct: 301 RPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQL 360

Query: 361 FMIERLIMAGAKTFIRTFKEDDTDLSLTDDLKKNTKIWQIPVYTADEERS 410
           FMIERLIMAGAKTFIRTFKEDDTDLSLTDD KKNTK+WQ P+YT DEE S
Sbjct: 361 FMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 410

BLAST of CcUC04G063670 vs. TAIR 10
Match: AT3G56750.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G41150.2); Has 128 Blast hits to 128 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 117; Viruses - 0; Other Eukaryotes - 11 (source: NCBI BLink). )

HSP 1 Score: 535.8 bits (1379), Expect = 3.0e-152
Identity = 266/404 (65.84%), Postives = 324/404 (80.20%), Query Frame = 0

Query: 5   RAHKPKPKPRSPISLFFVALAAIAFLFFFSSLISTNGASSLPSSNSIQKIFRFKNLTQKQ 64
           +A + KP   S   + F  +   +FL  FSS+IST G   LP   ++   F +       
Sbjct: 6   KAQRTKPTSGSQRLVLF-CIVVFSFLLLFSSVIST-GKLGLPYQQTLIDYFVW------S 65

Query: 65  RRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVMPSRM 124
            R +   S+++K+LYWGNRIDCPGK+CE+C GLGHQESSLRCALEEAMFL RTFVMPS M
Sbjct: 66  PRGKRQHSLSEKYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMPSGM 125

Query: 125 CINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLST 184
           CINPIHNKKG+L++S N ++EE W  +SCAMDSLYD+DLIS+ +PVILD+SK W+ VLST
Sbjct: 126 CINPIHNKKGILNRSDNKTTEEGWLGSSCAMDSLYDIDLISEKIPVILDDSKTWHIVLST 185

Query: 185 GMKLGARAVAHVEQVSRVELRDNSHYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKF 244
            MKLG R +AHV  V+R  L++ SHYSNLL+INRTASPL+WF+ECKDR+NRSA++LPY F
Sbjct: 186 SMKLGERGIAHVSGVTRHRLKE-SHYSNLLIINRTASPLAWFVECKDRSNRSAVMLPYSF 245

Query: 245 LPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFM 304
           LP+MAA  LR+AAEKIK  LGDYDAIHVRRGDK+KTRKDRFGV+R   PHLDRDTRPEF+
Sbjct: 246 LPNMAAAKLRNAAEKIKAQLGDYDAIHVRRGDKLKTRKDRFGVERIQFPHLDRDTRPEFI 305

Query: 305 LKRIAKWVPAGRTLFIASNERIPGFFSPLSARYRLAYSSNYSDILDPVVKNNYQLFMIER 364
           L+RI K +P GRTLFI SNER PGFFSPL+ RY+LAYSSN+S+ILDP+++NNYQLFM+ER
Sbjct: 306 LRRIEKRIPRGRTLFIGSNERKPGFFSPLAVRYKLAYSSNFSEILDPIIENNYQLFMMER 365

Query: 365 LIMAGAKTFIRTFKEDDTDLSLTDDLKKNTKIWQIPVYTADEER 409
           L+M GAKT+ +TFKE +TDL+LTDD KKN K W+IPVYT DE R
Sbjct: 366 LVMMGAKTYFKTFKEYETDLTLTDDPKKN-KNWEIPVYTMDERR 399

BLAST of CcUC04G063670 vs. TAIR 10
Match: AT2G41150.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: leaf; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G56750.1); Has 127 Blast hits to 127 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 117; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 533.1 bits (1372), Expect = 2.0e-151
Identity = 264/402 (65.67%), Postives = 321/402 (79.85%), Query Frame = 0

Query: 5   RAHKPKPKPRSPISLFFVALAAIAFLFFFSSLISTNGASSLPSSNSIQKIFRFKNLTQKQ 64
           + HK K  P S   L  + + A+AFL  F+S+IST G  +LP   ++   F       + 
Sbjct: 6   KPHKLKATPGSQ-RLVLLCIVAVAFLLLFTSVISTGGL-ALPYRTTLIGYF------VRS 65

Query: 65  RRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVMPSRM 124
            RN+   S++DK+LYWGNRIDCPGK+CE+C GLGHQESSLRCALEEAMFL RTFVMPSRM
Sbjct: 66  TRNKTQHSLSDKYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMPSRM 125

Query: 125 CINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLST 184
           CINPIHNKKG+L++S N + EESWE +SCAM+SLYD+DLIS+ +PVILD+S+ W+ +LST
Sbjct: 126 CINPIHNKKGILNRSNNETREESWEVSSCAMESLYDIDLISEKIPVILDDSETWHIMLST 185

Query: 185 GMKLGARAVAHVEQVSRVELRDNSHYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKF 244
            MKL  R  AHV   +R EL D+S ++NLLLINRTASPL+WF+ECKDR NRS ++LPY F
Sbjct: 186 SMKLKERGSAHVYGANRHELNDSSDFTNLLLINRTASPLAWFVECKDRGNRSDVMLPYSF 245

Query: 245 LPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFM 304
           L +MAA  LRDAAEKIK  LGDYDAIHVRRGDK+KTRKDRF V+RS  PHLDRDTRPEF+
Sbjct: 246 LQTMAASRLRDAAEKIKAKLGDYDAIHVRRGDKLKTRKDRFRVERSQFPHLDRDTRPEFI 305

Query: 305 LKRIAKWVPAGRTLFIASNERIPGFFSPLSARYRLAYSSNYSDILDPVVKNNYQLFMIER 364
           + RI K +P GRTLFI SNER P FFSPL+ RY++AYSSN+S+ILDP+++NNYQLFM+ER
Sbjct: 306 IGRIQKQIPPGRTLFIGSNERTPDFFSPLAIRYKVAYSSNFSEILDPIIENNYQLFMVER 365

Query: 365 LIMAGAKTFIRTFKEDDTDLSLTDDLKKNTKIWQIPVYTADE 407
           LIM GAKTF +TF+E +TDL+LTDD KKN K W+IPVYT DE
Sbjct: 366 LIMMGAKTFFKTFREYETDLTLTDDPKKN-KNWEIPVYTMDE 398

BLAST of CcUC04G063670 vs. TAIR 10
Match: AT2G41150.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: leaf; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G56750.1); Has 57 Blast hits to 57 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 56; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 313.9 bits (803), Expect = 1.9e-85
Identity = 158/259 (61.00%), Postives = 197/259 (76.06%), Query Frame = 0

Query: 5   RAHKPKPKPRSPISLFFVALAAIAFLFFFSSLISTNGASSLPSSNSIQKIFRFKNLTQKQ 64
           + HK K  P S   L  + + A+AFL  F+S+IST G  +LP   ++   F       + 
Sbjct: 6   KPHKLKATPGSQ-RLVLLCIVAVAFLLLFTSVISTGGL-ALPYRTTLIGYF------VRS 65

Query: 65  RRNRHVFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFVMPSRM 124
            RN+   S++DK+LYWGNRIDCPGK+CE+C GLGHQESSLRCALEEAMFL RTFVMPSRM
Sbjct: 66  TRNKTQHSLSDKYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMPSRM 125

Query: 125 CINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKLWYQVLST 184
           CINPIHNKKG+L++S N + EESWE +SCAM+SLYD+DLIS+ +PVILD+S+ W+ +LST
Sbjct: 126 CINPIHNKKGILNRSNNETREESWEVSSCAMESLYDIDLISEKIPVILDDSETWHIMLST 185

Query: 185 GMKLGARAVAHVEQVSRVELRDNSHYSNLLLINRTASPLSWFMECKDRNNRSAILLPYKF 244
            MKL  R  AHV   +R EL D+S ++NLLLINRTASPL+WF+ECKDR NRS ++LPY F
Sbjct: 186 SMKLKERGSAHVYGANRHELNDSSDFTNLLLINRTASPLAWFVECKDRGNRSDVMLPYSF 245

Query: 245 LPSMAAENLRDAAEKIKGL 264
           L +MAA  LRDAAEK+K L
Sbjct: 246 LQTMAASRLRDAAEKVKEL 256

BLAST of CcUC04G063670 vs. TAIR 10
Match: AT4G12700.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G04280.1); Has 136 Blast hits to 136 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 125; Viruses - 0; Other Eukaryotes - 11 (source: NCBI BLink). )

HSP 1 Score: 91.3 bits (225), Expect = 2.0e-18
Identity = 76/311 (24.44%), Postives = 132/311 (42.44%), Query Frame = 0

Query: 92  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEAN 151
           + C+ + H   S  CAL EA +L RT VM   +C++ ++   G   +  +      +E  
Sbjct: 264 DRCKSMNHFLWSFLCALGEAQYLNRTLVMDLTLCLSSVYTLSGQNEEGKDFRFYFDFE-- 323

Query: 152 SCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRVELRDNSHYS 211
                 L +   + D V    D  K WY+    G+KL       V  +  V+++D     
Sbjct: 324 -----HLKEAASMLDQVQFWADWGK-WYK--KNGLKLHLVEDFRVTPMKLVDVKDT---- 383

Query: 212 NLLLINR--TASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAAEKIKGLLG-DYD 271
             L++ +  T  P +++    +    S +  P+  L    ++ L +    I   L  DYD
Sbjct: 384 --LIMRKFGTVEPDNYWYRVCEGETESVVQRPWNLL--WKSKRLMEIVSAIASRLNWDYD 443

Query: 272 AIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPG 331
           AIH+ RGDK +        ++ + P+L++DT P  +L  +   +  GR L+IA+NE    
Sbjct: 444 AIHIERGDKAR--------NKEVWPNLEKDTSPSSILSTLQDKIEQGRNLYIATNEPELS 503

Query: 332 FFSPLSARYRLAYSSNYSDILD----------------PVVKNNYQLFMIERLIMAGAKT 384
           FF+PL  +Y+  +   + D+ D                PV  + Y    ++  +    K 
Sbjct: 504 FFNPLKDKYKPHFLDEFKDLWDESSEWYSETTKLNGGNPVEFDGYMRASVDTEVFLRGKK 548

BLAST of CcUC04G063670 vs. TAIR 10
Match: AT2G04280.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G12700.1); Has 130 Blast hits to 130 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 124; Viruses - 0; Other Eukaryotes - 6 (source: NCBI BLink). )

HSP 1 Score: 90.9 bits (224), Expect = 2.6e-18
Identity = 79/319 (24.76%), Postives = 137/319 (42.95%), Query Frame = 0

Query: 92  ESCEGLGHQESSLRCALEEAMFLQRTFVMPSRMCINPIHNKKGLLHQSTNASSEESWEAN 151
           + C+ + H   S  CAL EA +L RT VM   +C++ I+   G         +EE  +  
Sbjct: 269 DRCKSMNHFLWSFLCALGEAQYLNRTLVMDLTLCLSSIYTSSG--------QNEEGKD-- 328

Query: 152 SCAMDSLYDMDLISDTVPVILDNSKLWYQVLSTGMKLGARAVAHVEQVSRVELRDNSHYS 211
                  +D + + +   V LD ++ W Q      K   R   H+ +  RV     +   
Sbjct: 329 ---FRFYFDFEHLKEAASV-LDEAQFWAQWGKLRKKRRNRLNLHLVEDFRVTPMKLAAVK 388

Query: 212 NLLLINRTAS--PLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDAAEKIKGLLG-DYD 271
           + L++ +  S  P +++    + +  S +  P+  L    +  L +    I   L  DYD
Sbjct: 389 DTLIMRKFGSVEPDNYWYRVCEGDAESVVKRPWHLL--WKSRRLMEIVSAIASRLNWDYD 448

Query: 272 AIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPG 331
           A+H+ RG+K +        ++ + P+L+ DT P  +L  +   V  GR L+IA+NE    
Sbjct: 449 AVHIERGEKAR--------NKEVWPNLEADTSPSALLSTLQDKVEEGRHLYIATNEGELS 508

Query: 332 FFSPLSARYRLAYSSNYSDILD----------------PVVKNNYQLFMIERLIMAGAKT 391
           FF+PL  +Y   +  +Y D+ D                PV  + Y    ++       + 
Sbjct: 509 FFNPLKDKYATHFLYDYKDLWDESSEWYSETTKLNGGNPVEFDGYMRASVD------TEV 557

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038881641.11.2e-22294.85uncharacterized protein LOC120073097 [Benincasa hispida][more]
XP_008455718.11.5e-22294.12PREDICTED: uncharacterized protein LOC103495824 [Cucumis melo] >KAA0025926.1 unc... [more]
XP_004144331.11.5e-21792.40uncharacterized protein LOC101219097 [Cucumis sativus] >KGN54704.1 hypothetical ... [more]
XP_022153942.14.9e-21390.51uncharacterized protein LOC111021332 [Momordica charantia][more]
XP_022967807.17.1e-21290.00uncharacterized protein LOC111467213 isoform X1 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5D3DWC17.4e-22394.12O-fucosyltransferase family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5... [more]
A0A1S3C1H97.4e-22394.12O-fucosyltransferase family protein OS=Cucumis melo OX=3656 GN=LOC103495824 PE=3... [more]
A0A0A0L0X37.2e-21892.40Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G430860 PE=4 SV=1[more]
A0A6J1DKB92.4e-21390.51O-fucosyltransferase family protein OS=Momordica charantia OX=3673 GN=LOC1110213... [more]
A0A6J1HRU43.4e-21290.00O-fucosyltransferase family protein OS=Cucurbita maxima OX=3661 GN=LOC111467213 ... [more]
Match NameE-valueIdentityDescription
AT3G56750.13.0e-15265.84unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G41150.22.0e-15165.67unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT2G41150.11.9e-8561.00unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT4G12700.12.0e-1824.44unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G04280.12.6e-1824.76unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D3.40.50.11350coord: 236..378
e-value: 2.3E-6
score: 29.5
NoneNo IPR availablePANTHERPTHR31469:SF8PLANT/PROTEINcoord: 3..407
NoneNo IPR availablePANTHERPTHR31469OS07G0633600 PROTEINcoord: 3..407
IPR019378GDP-fucose protein O-fucosyltransferasePFAMPF10250O-FucTcoord: 92..377
e-value: 1.5E-8
score: 34.8

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CcUC04G063670.1CcUC04G063670.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006004 fucose metabolic process
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016740 transferase activity