Cp4.1LG10g11110 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG10g11110
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: O-fucosyltransferase family protein
Location: Cp4.1LG10: 7650709 .. 7654916 (+)
RNA-Seq Expression: Cp4.1LG10g11110
Synteny: Cp4.1LG10g11110
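
The Location field packs a sequence identifier, a start and end coordinate, and a strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), the genomic span can be derived as below. `parse_location` and `span_length` are hypothetical helper names, not part of the database.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG10: 7650709 .. 7654916 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 4208 bases on the plus strand of Cp4.1LG10.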
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGATTCACAAGACCCAGAAGGCAAAACCCAAACCCAGATCCCCATTCCTCTTCTTCTTCGTTGCCCTCGCCGTCATTGCGTTTCTTTTCCTATTTTCCTCTCTGATTTCCACTAATGGGGTTTCTTCTTCTTCTTTTCCATCATCAAATTCGATTCGTGAAATCTTCAGATTCAAGAATTTGAACCAGAAACAGAGACGTAATCGGCACGTTTTTAGTACGAACGACAAGTTCTTGTACTGGGGCAACAGAATCGACTGTCCTGGGAAGCATTGCGAGTCTTGCGAGGGTTTGGGTCACCAGGAGTCCAGCTTGAGGTGCGCCCTTGAGGAAGCCATGTTCCTCCAGAGGTAATTTTCATGAAGTTTTTCTGATTTCTGGGGCTTAATCTTTACATGAACATGTTTTCTCCTAGTTTAATTTGGTGGATTGGCATGTTAAGATTTCTGTGTGAAAGGGGTTTGGAATTGGATTAGCAGTAACTCATCAGCACTCGGGTGGTTCTATCACTAAATGTAGAATTAACCGTTGAGGATGGTTGGGAGGGAGTCCCACGTTGGCTAATACATCTCTAGAGGAAAGCCATGAGTGCTTACACTCAATGTGGACAATAGCATACCATTGTGGAGATACGTGGTATTAGAGTCATGCCTTTAACTTAACCATGTCAATAGAATCCTCAAATGTTGAACAAAGAAGTCGTGAGCCTCAATTAAGGGGAGGTTGTTTGAGGACTCTATAGGCCTCAAGGGAGGCTCTATGGCGTACTTCGTTCGAGGGGAGGATTGTGGAGATTCGTGATTCCTAACATTATCATATTGTGAAACTCACGAGTTCAAGAACTTCTGTTGGCCCCTTGCTTGATCTTTATGCTGTGTTATTGCTATGAGGTGCCATTTAGCTTGTCTCAGCTCTTTGTAATGGTAATGGATTGGTCCAACGCCCTTAAATTCAGCATAATTTATTGGTAGAATGCTTTTGCATAACCTACTTAGACAATTTACTTTACTTTCTTTTTGGTGCAGAGTATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACACAATAAGAAAGGGATTCTTCATCAGTCAACCAATGCAAGCTCAGAGGAAAGGTGCTTCTTCCTCTTTGCTAATCCTATAATGAGTTATACTTGATGATAGTTGAAACGGTATTCAATTGATTCGCTTCATGCTTCAAACTAAGACGCGAACTCCCTTGCATAAAAATCTTTGTTCTTCATGTGTTTGTGATCTTTTCTAGAAAGAACACAGAATCTGAATTTGTTCTCTGCATTGCAGATGGGAAACAAACTCTTGTGCCATGGATTCGTTGTACGATATGGATCTTATATCTGATACCGTACCGGTGATTTTAGACAACTCAAAATTATGGTATCAGGTGCAGTCAACTGGTATGAAATTAGGATCTAGAGTGGTTGCCCATGTTGATCAAGTTAGTCGTATTGAACTCAGAGACGACAGCCGCTACTCCAATCTTTTGCTAATAAATCGAACTGCCAGCCCTCTTTCATGGTAAATCAAACTTGGGGTAATAAGATTCTTTTATGTCTTTTGCATGTGTCTGTGAGATCCCATATCGGTTGGAGAGAGGAACGAAACATTCCTTATAAGAGTGTAGAAACCTCTCCCTAGCAGACGAGTTTTAAAACCTTGAAGGGAAGCCCAGAAGGGAAAACCCAAAGAGGACAATATCTAGTGGCGGTGGGATTGGGCTGTTACAAATGTTATCAGAGCCAAACACCGGGCGGTGTGCCAGCGAGGACGTTGGTCCTCCAAGGGGGGTGGATTGCGAGATCCCACATCGGTTGGAGAGGGGAACAAAACATTCCTTATAAGAGTGTAGAAACCTTTCCCTAGCAAACACGTTTTAAAATATTGAGGGGAAGCCCAGAAGGAAAAACCCAAAGAGGACACTATCTACTAGCGGTGAGCTTGGGCTGTTACAAATAGTATCAGAACCAGACACCAGG
CAGTGTGCCACCGGGCGGTAAAAGCTCAATTAGGCTCGTCTTTTTAATGTTTTGTTTTCATGCATTTCTTTGCAACTTGGTACGACGGATATTTCAATTTAAGATCCCAATTTATGAATGATTAGGGCGTTAATGCCGTGATAACTCTTTAGGTTTATGGAATGCAAGGACAGAAACAACCGCAGTGCCATATTGTTGCCCTATAAATTTCTTCCTTCTATGGCAGCAGAAAACTTGAGGGATGCATCTGAGAAGGTATTTTTTGAAACCATATTAGGTTCTTTCTTTTGTTATTCATATTTGCAGTCTTCTACGAATCATGACATACGGTAGAGCCTATCGCTGTTAACTCGCTGCTTACGAAACCTAGCTTTCGTGCATTTCTTTATTTCTACGTTGCTGTTGAATAGTTGTAGTAGATGCTCAACTTCAGTGTTATAAGATAAGGCATTTTCTTTATTTGGTCTTCCCGGATGCTTCTTAGCATAAACTCACCCTCTTAGAAAATATAATTACTGAACTCTTGATCGACCGAAAAGACTTCACTTTAGATACCCTGACACGAATAACGAACCCTAATTTAAAGACTCACGAGGGAAGTAGGCATAATTTTTAGAAACAAGAAACTAAAAACATAATGGTTACCTATTGAGGCCTAATTTTCCTCTCCTAGGATGACCCTAGCCAGTTCCCTTGCCGGGCTTGGAGGTTTGATTGAAAGTAGCGCTCGAGGATTGTAATTGGAAATGGAAGCTTGGTTCCCCTCGCACTCCACTCTATTTTGAATTAGGGATTTGCCTCTTTCTTTCTCTACCGCCTGTGTCTATCTCCTGTCAGACTTTCCACCCGTAGGATCGCTTCTAAGTCCAATATCCAAAGTACTTGGAAATTCGTCTTTTTCCGTCTTTGATGAACTTTGAAGTTTTGTCATCAGGCCATGGTTGTATAATGGTGAAAGGCAGAAGAACATAGAAGGGCTTGTATGCAAACATTTGCAGAAGCAAGCCAGTTGTAACCATTATTTATGGATAGGTTTGTAGAAAATCTTGGAAGTACCTGAAACTTTATGGTTTTGGTGCCTTGGAAACTGTACAGAAGGCATATTTCTTTATTTCATAGGAAAATTCGATAAATCCGAAGCGAGTAATTAGCTACAGTGAGTGGTTGGTAAGCCTTGTATTAGCTCGAGACCGAGTTTGTTTGCAGTTCTCTGTTTCTGTTTGGGTTACTTATATGTTATACTGTGTGTAATAGCTGTGAGAAAAGATTACAAGTTGTTTCTCATTCATTTTGGAATATAGTTCTATAAGTAATACTGAAAAGAATGTCTGATCAATAGATTAAAGAGCTACTCGGTGATTACGACGCCATCCATGTTCGTCGTGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCACATCTCGACAGGGATACACGGCCTGAGTTTATGCTAAAGAGAATAGCAAAGTGGGTTCCGCCAGGGCGGACACTTTTTATTGCTTCAAATGAGAGAACGCCGGGGTTCTTCTCGCCCCTCTCTGCTCGGTGAGTCCATCTCACTTACCTTTTTAGTTCACTCGAGTTAGAGCTGAAATGTTGGTTAGACTTCGTTCAAAAGATCGATCTTCTTGTGTTCTGATAGCGATATTATAAATCATTGAACCGTCTCTCATTTAGTCCTACCACCACCACCACCACCACCACCACCTTGATTCATGATACCCCAATTTTGCAGGTACAAGTTGGCTTATTCCTCGAACTATAGCCATATTCTGGATCCTGTGGTTAAGAACAATTATCAGTTGTTCATGATCGAAAGGCTCGTTATGGCGGGTGCCAAGACATTCATCAGAACGTTCAAAGAAGACGATACAGATCTTAGCCTCACCGATGACCCAAAGAAGAACACGAAAGTGTGGCAAAAACCAATCTACACAGATGACGAAGAAAGTAGCTGAGGATTTGTTATTTCACTG
AGATGTCATTTTGATCCATGCTAATGCCAAACCAACAAAAGCCTTTCATTGCTCGTTGTAAAGCTTCATATGAATGAATGGTAATAGAAAGGGTAGATTCTTGGCCGATGATGTTGTATATGTGGTATAAATTATGATCCTGGTTTCTGAATTTCATTACTTCCATTTCTTTAGCGTCACGTTTTTTTTTTTTTTTTTTTTTTTTTTT

mRNA sequence

ATGGCGATTCACAAGACCCAGAAGGCAAAACCCAAACCCAGATCCCCATTCCTCTTCTTCTTCGTTGCCCTCGCCGTCATTGCGTTTCTTTTCCTATTTTCCTCTCTGATTTCCACTAATGGGGTTTCTTCTTCTTCTTTTCCATCATCAAATTCGATTCGTGAAATCTTCAGATTCAAGAATTTGAACCAGAAACAGAGACGTAATCGGCACGTTTTTAGTACGAACGACAAGTTCTTGTACTGGGGCAACAGAATCGACTGTCCTGGGAAGCATTGCGAGTCTTGCGAGGGTTTGGGTCACCAGGAGTCCAGCTTGAGGTGCGCCCTTGAGGAAGCCATGTTCCTCCAGAGAGTATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACACAATAAGAAAGGGATTCTTCATCAGTCAACCAATGCAAGCTCAGAGGAAAGATGGGAAACAAACTCTTGTGCCATGGATTCGTTGTACGATATGGATCTTATATCTGATACCGTACCGGTGATTTTAGACAACTCAAAATTATGGTTTATGGAATGCAAGGACAGAAACAACCGCAGTGCCATATTGTTGCCCTATAAATTTCTTCCTTCTATGGCAGCAGAAAACTTGAGGGATGCATCTGAGAAGATTAAAGAGCTACTCGGTGATTACGACGCCATCCATGTTCGTCGTGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCACATCTCGACAGGGATACACGGCCTGAGTTTATGCTAAAGAGAATAGCAAAGTGGGTTCCGCCAGGGCGGACACTTTTTATTGCTTCAAATGAGAGAACGCCGGGGTTCTTCTCGCCCCTCTCTGCTCGGTACAAGTTGGCTTATTCCTCGAACTATAGCCATATTCTGGATCCTGTGGTTAAGAACAATTATCAGTTGTTCATGATCGAAAGGCTCGTTATGGCGGGTGCCAAGACATTCATCAGAACGTTCAAAGAAGACGATACAGATCTTAGCCTCACCGATGACCCAAAGAAGAACACGAAAGTGTGGCAAAAACCAATCTACACAGATGACGAAGAAAGTAGCTGAGGATTTGTTATTTCACTGAGATGTCATTTTGATCCATGCTAATGCCAAACCAACAAAAGCCTTTCATTGCTCGTTGTAAAGCTTCATATGAATGAATGGTAATAGAAAGGGTAGATTCTTGGCCGATGATGTTGTATATGTGGTATAAATTATGATCCTGGTTTCTGAATTTCATTACTTCCATTTCTTTAGCGTCACGTTTTTTTTTTTTTTTTTTTTTTTTTTT

Coding sequence (CDS)

ATGGCGATTCACAAGACCCAGAAGGCAAAACCCAAACCCAGATCCCCATTCCTCTTCTTCTTCGTTGCCCTCGCCGTCATTGCGTTTCTTTTCCTATTTTCCTCTCTGATTTCCACTAATGGGGTTTCTTCTTCTTCTTTTCCATCATCAAATTCGATTCGTGAAATCTTCAGATTCAAGAATTTGAACCAGAAACAGAGACGTAATCGGCACGTTTTTAGTACGAACGACAAGTTCTTGTACTGGGGCAACAGAATCGACTGTCCTGGGAAGCATTGCGAGTCTTGCGAGGGTTTGGGTCACCAGGAGTCCAGCTTGAGGTGCGCCCTTGAGGAAGCCATGTTCCTCCAGAGAGTATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACACAATAAGAAAGGGATTCTTCATCAGTCAACCAATGCAAGCTCAGAGGAAAGATGGGAAACAAACTCTTGTGCCATGGATTCGTTGTACGATATGGATCTTATATCTGATACCGTACCGGTGATTTTAGACAACTCAAAATTATGGTTTATGGAATGCAAGGACAGAAACAACCGCAGTGCCATATTGTTGCCCTATAAATTTCTTCCTTCTATGGCAGCAGAAAACTTGAGGGATGCATCTGAGAAGATTAAAGAGCTACTCGGTGATTACGACGCCATCCATGTTCGTCGTGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCACATCTCGACAGGGATACACGGCCTGAGTTTATGCTAAAGAGAATAGCAAAGTGGGTTCCGCCAGGGCGGACACTTTTTATTGCTTCAAATGAGAGAACGCCGGGGTTCTTCTCGCCCCTCTCTGCTCGGTACAAGTTGGCTTATTCCTCGAACTATAGCCATATTCTGGATCCTGTGGTTAAGAACAATTATCAGTTGTTCATGATCGAAAGGCTCGTTATGGCGGGTGCCAAGACATTCATCAGAACGTTCAAAGAAGACGATACAGATCTTAGCCTCACCGATGACCCAAAGAAGAACACGAAAGTGTGGCAAAAACCAATCTACACAGATGACGAAGAAAGTAGCTGA

Protein sequence

MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFKNLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFVMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWFMECKDRNNRSAILLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQLFMIERLVMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS
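
The protein sequence is the CDS read codon by codon. As a sketch, assuming the standard genetic code (NCBI translation table 1; the page does not say which table was used), the first and last codons of the CDS can be checked against the two ends of the protein. `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA string; stop codons are returned as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 18 and last 15 nucleotides of the CDS shown above.
print(translate("ATGGCGATTCACAAGACC"))  # start of the protein
print(translate("GAAGAAAGTAGCTGA"))     # end of the protein plus the stop codon
```

The two fragments translate to `MAIHKT` and `EESS*`, which agree with the start (`MAIHKT...`) and end (`...DDEESS`) of the protein sequence listed above.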
Homology
BLAST of Cp4.1LG10g11110 vs. NCBI nr
Match: XP_023545162.1 (uncharacterized protein LOC111804548 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 714 bits (1842), Expect = 1.98e-258
Identity = 364/411 (88.56%), Positives = 364/411 (88.56%), Query Frame = 0
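
The percentages on this line are the count of matching columns divided by the alignment length, and the alignment length includes gap columns. That is why a query that ends at position 364 in the alignment below can match at every residue and still score 88.56% against a 411-column alignment. A minimal sketch of that arithmetic follows; `parse_identity` and `percent` are hypothetical helper names.

```python
import re

def parse_identity(line):
    """Return (matching columns, alignment length) from an HSP summary line."""
    m = re.search(r"Identity\s*=\s*(\d+)/(\d+)", line)
    if m is None:
        raise ValueError("no 'Identity = a/b' field found")
    return int(m.group(1)), int(m.group(2))

def percent(matches, length):
    # Percentage rounded to two decimals, as displayed in the HSP lines.
    return round(100.0 * matches / length, 2)

matches, length = parse_identity("Identity = 364/411 (88.56%)")
print(matches, length, percent(matches, length))
```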

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60
           MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60

Query: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120
           NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF
Sbjct: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120

Query: 121 VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180

Query: 181 -----------------------------------------------FMECKDRNNRSAI 240
                                                          FMECKDRNNRSAI
Sbjct: 181 YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360
           TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ
Sbjct: 301 TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360

Query: 361 LFMIERLVMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 364
           LFMIERLVMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS
Sbjct: 361 LFMIERLVMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 411

BLAST of Cp4.1LG10g11110 vs. NCBI nr
Match: KAG7033584.1 (hypothetical protein SDJN02_03308 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 709 bits (1830), Expect = 1.33e-256
Identity = 361/411 (87.83%), Positives = 362/411 (88.08%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60
           MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60

Query: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120
           NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF
Sbjct: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120

Query: 121 VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKGILH STNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180

Query: 181 -----------------------------------------------FMECKDRNNRSAI 240
                                                          FMECKDRNNRSAI
Sbjct: 181 YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360
           TRPEFMLKRIAKWVPPGRTLFIASNER PGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ
Sbjct: 301 TRPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360

Query: 361 LFMIERLVMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 364
           LFMIERL+MAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS
Sbjct: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 411

BLAST of Cp4.1LG10g11110 vs. NCBI nr
Match: XP_022967807.1 (uncharacterized protein LOC111467213 isoform X1 [Cucurbita maxima])

HSP 1 Score: 707 bits (1825), Expect = 7.43e-256
Identity = 362/411 (88.08%), Positives = 363/411 (88.32%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60
           MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSS FPSSNSIREIFRFK
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSS-FPSSNSIREIFRFK 60

Query: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120
           NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF
Sbjct: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120

Query: 121 VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180

Query: 181 -----------------------------------------------FMECKDRNNRSAI 240
                                                          FMECKDRNNRSAI
Sbjct: 181 YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360
           TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ
Sbjct: 301 TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360

Query: 361 LFMIERLVMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 364
           LFMIERL+MAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS
Sbjct: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 410

BLAST of Cp4.1LG10g11110 vs. NCBI nr
Match: XP_022932889.1 (uncharacterized protein LOC111439414 isoform X1 [Cucurbita moschata])

HSP 1 Score: 695 bits (1793), Expect = 5.58e-251
Identity = 357/411 (86.86%), Positives = 359/411 (87.35%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60
           MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLIST GVSSS FPSSNSIREIFRFK
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSS-FPSSNSIREIFRFK 60

Query: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120
           NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF
Sbjct: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120

Query: 121 VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKGILH STNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180

Query: 181 -----------------------------------------------FMECKDRNNRSAI 240
                                                          FMECKDRNNRSAI
Sbjct: 181 YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360
           TRPEFMLKRIAKWVPPGRTLFIASNER PGFFSPLSARYKLAYSSNYSHIL PVVKNNYQ
Sbjct: 301 TRPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQ 360

Query: 361 LFMIERLVMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 364
           LFMIERL+MAGAKTFIRTFKED+TDLSLTDDPKKNTKVWQKPIYTDDEESS
Sbjct: 361 LFMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS 410

BLAST of Cp4.1LG10g11110 vs. NCBI nr
Match: XP_022153942.1 (uncharacterized protein LOC111021332 [Momordica charantia])

HSP 1 Score: 658 bits (1697), Expect = 2.44e-236
Identity = 330/411 (80.29%), Positives = 346/411 (84.18%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60
           MA  + QKAKPKPRSP  FFFVALA IAFLFLFSSLISTNG SSS+F SSNSI++IFRF 
Sbjct: 1   MAFPRAQKAKPKPRSPLFFFFVALAAIAFLFLFSSLISTNGASSSTFSSSNSIQKIFRFN 60

Query: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120
           N+N+K +RNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFL+R+F
Sbjct: 61  NVNEKPKRNRHVFSANDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLRRIF 120

Query: 121 VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKGILHQS NASSEE WE  SCAMDSLYD+DLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHQSNNASSEESWEAKSCAMDSLYDIDLISDTVPVILDNSKLW 180

Query: 181 -----------------------------------------------FMECKDRNNRSAI 240
                                                          FMECKDRNNRSAI
Sbjct: 181 YQVLSTGMKLGARAVAHVERVSRAELKDNNRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360
           TRPEFMLKR+AKWV PGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDP+VKNNYQ
Sbjct: 301 TRPEFMLKRLAKWVAPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPMVKNNYQ 360

Query: 361 LFMIERLVMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 364
           LFMIERL+MAGAKTFIRTFKEDDTDLSLTDDPKKNTK+WQKP+YTDDEE S
Sbjct: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTDDEEKS 411

BLAST of Cp4.1LG10g11110 vs. ExPASy TrEMBL
Match: A0A6J1HRU4 (O-fucosyltransferase family protein OS=Cucurbita maxima OX=3661 GN=LOC111467213 PE=3 SV=1)

HSP 1 Score: 707 bits (1825), Expect = 3.60e-256
Identity = 362/411 (88.08%), Positives = 363/411 (88.32%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60
           MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSS FPSSNSIREIFRFK
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSS-FPSSNSIREIFRFK 60

Query: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120
           NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF
Sbjct: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120

Query: 121 VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180

Query: 181 -----------------------------------------------FMECKDRNNRSAI 240
                                                          FMECKDRNNRSAI
Sbjct: 181 YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360
           TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ
Sbjct: 301 TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360

Query: 361 LFMIERLVMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 364
           LFMIERL+MAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS
Sbjct: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 410

BLAST of Cp4.1LG10g11110 vs. ExPASy TrEMBL
Match: A0A6J1EY95 (uncharacterized protein LOC111439414 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439414 PE=4 SV=1)

HSP 1 Score: 695 bits (1793), Expect = 2.70e-251
Identity = 357/411 (86.86%), Positives = 359/411 (87.35%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60
           MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLIST GVSSS FPSSNSIREIFRFK
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSS-FPSSNSIREIFRFK 60

Query: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120
           NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF
Sbjct: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120

Query: 121 VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKGILH STNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180

Query: 181 -----------------------------------------------FMECKDRNNRSAI 240
                                                          FMECKDRNNRSAI
Sbjct: 181 YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360
           TRPEFMLKRIAKWVPPGRTLFIASNER PGFFSPLSARYKLAYSSNYSHIL PVVKNNYQ
Sbjct: 301 TRPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQ 360

Query: 361 LFMIERLVMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 364
           LFMIERL+MAGAKTFIRTFKED+TDLSLTDDPKKNTKVWQKPIYTDDEESS
Sbjct: 361 LFMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS 410

BLAST of Cp4.1LG10g11110 vs. ExPASy TrEMBL
Match: A0A6J1DKB9 (O-fucosyltransferase family protein OS=Momordica charantia OX=3673 GN=LOC111021332 PE=3 SV=1)

HSP 1 Score: 658 bits (1697), Expect = 1.18e-236
Identity = 330/411 (80.29%), Positives = 346/411 (84.18%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60
           MA  + QKAKPKPRSP  FFFVALA IAFLFLFSSLISTNG SSS+F SSNSI++IFRF 
Sbjct: 1   MAFPRAQKAKPKPRSPLFFFFVALAAIAFLFLFSSLISTNGASSSTFSSSNSIQKIFRFN 60

Query: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120
           N+N+K +RNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFL+R+F
Sbjct: 61  NVNEKPKRNRHVFSANDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLRRIF 120

Query: 121 VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKGILHQS NASSEE WE  SCAMDSLYD+DLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHQSNNASSEESWEAKSCAMDSLYDIDLISDTVPVILDNSKLW 180

Query: 181 -----------------------------------------------FMECKDRNNRSAI 240
                                                          FMECKDRNNRSAI
Sbjct: 181 YQVLSTGMKLGARAVAHVERVSRAELKDNNRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360
           TRPEFMLKR+AKWV PGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDP+VKNNYQ
Sbjct: 301 TRPEFMLKRLAKWVAPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPMVKNNYQ 360

Query: 361 LFMIERLVMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 364
           LFMIERL+MAGAKTFIRTFKEDDTDLSLTDDPKKNTK+WQKP+YTDDEE S
Sbjct: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTDDEEKS 411

BLAST of Cp4.1LG10g11110 vs. ExPASy TrEMBL
Match: A0A5D3DWC1 (O-fucosyltransferase family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold749G00060 PE=3 SV=1)

HSP 1 Score: 644 bits (1660), Expect = 4.57e-231
Identity = 328/408 (80.39%), Positives = 340/408 (83.33%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60
           MA  +TQK KPK RSP +FFFV+LA IAFLFLFSSLISTNG  SSSFPSSNSI++IFRFK
Sbjct: 1   MAFPRTQKPKPKHRSPLIFFFVSLAAIAFLFLFSSLISTNG--SSSFPSSNSIQKIFRFK 60

Query: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120
           NL QKQRR RH FS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR F
Sbjct: 61  NLTQKQRRGRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTF 120

Query: 121 VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKG+LHQSTNASSEE WE NSCAMDSLYDMDLISDTVPVILDNSK W
Sbjct: 121 VMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSW 180

Query: 181 -----------------------------------------------FMECKDRNNRSAI 240
                                                          FMECKDRNNRSA+
Sbjct: 181 YQVLSTSMKLGARAVAHVEQVSRIELRDSSHYSNLLLINRTASPLSWFMECKDRNNRSAV 240

Query: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           +LPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 MLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360
           TRPEFMLKRIAKWVP GRTLFIASNER PGFFSPLSARYKLAYSSNYS ILDPVVKNNYQ
Sbjct: 301 TRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQ 360

BLAST of Cp4.1LG10g11110 vs. ExPASy TrEMBL
Match: A0A1S3C1H9 (O-fucosyltransferase family protein OS=Cucumis melo OX=3656 GN=LOC103495824 PE=3 SV=1)

HSP 1 Score: 644 bits (1660), Expect = 4.57e-231
Identity = 328/408 (80.39%), Positives = 340/408 (83.33%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60
           MA  +TQK KPK RSP +FFFV+LA IAFLFLFSSLISTNG  SSSFPSSNSI++IFRFK
Sbjct: 1   MAFPRTQKPKPKHRSPLIFFFVSLAAIAFLFLFSSLISTNG--SSSFPSSNSIQKIFRFK 60

Query: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120
           NL QKQRR RH FS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR F
Sbjct: 61  NLTQKQRRGRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTF 120

Query: 121 VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKG+LHQSTNASSEE WE NSCAMDSLYDMDLISDTVPVILDNSK W
Sbjct: 121 VMPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSW 180

Query: 181 -----------------------------------------------FMECKDRNNRSAI 240
                                                          FMECKDRNNRSA+
Sbjct: 181 YQVLSTSMKLGARAVAHVEQVSRIELRDSSHYSNLLLINRTASPLSWFMECKDRNNRSAV 240

Query: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           +LPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 MLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360
           TRPEFMLKRIAKWVP GRTLFIASNER PGFFSPLSARYKLAYSSNYS ILDPVVKNNYQ
Sbjct: 301 TRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQ 360

BLAST of Cp4.1LG10g11110 vs. TAIR 10
Match: AT2G41150.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: leaf; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G56750.1); Has 127 Blast hits to 127 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 117; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 451.8 bits (1161), Expect = 5.1e-127
Identity = 236/404 (58.42%), Positives = 283/404 (70.05%), Query Frame = 0

Query: 5   KTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFKNLNQ 64
           K  K K  P S  L     +AV AFL LF+S+IST G+   + P   ++   F       
Sbjct: 6   KPHKLKATPGSQRLVLLCIVAV-AFLLLFTSVISTGGL---ALPYRTTLIGYF------V 65

Query: 65  KQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFVMPS 124
           +  RN+   S +DK+LYWGNRIDCPGK+CE+C GLGHQESSLRCALEEAMFL R FVMPS
Sbjct: 66  RSTRNKTQHSLSDKYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMPS 125

Query: 125 RMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSK------ 184
           RMCINPIHNKKGIL++S N + EE WE +SCAM+SLYD+DLIS+ +PVILD+S+      
Sbjct: 126 RMCINPIHNKKGILNRSNNETREESWEVSSCAMESLYDIDLISEKIPVILDDSETWHIML 185

Query: 185 -----------------------------------------LWFMECKDRNNRSAILLPY 244
                                                     WF+ECKDR NRS ++LPY
Sbjct: 186 STSMKLKERGSAHVYGANRHELNDSSDFTNLLLINRTASPLAWFVECKDRGNRSDVMLPY 245

Query: 245 KFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPE 304
            FL +MAA  LRDA+EKIK  LGDYDAIHVRRGDK+KTRKDRF V+RS  PHLDRDTRPE
Sbjct: 246 SFLQTMAASRLRDAAEKIKAKLGDYDAIHVRRGDKLKTRKDRFRVERSQFPHLDRDTRPE 305

Query: 305 FMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQLFMI 362
           F++ RI K +PPGRTLFI SNERTP FFSPL+ RYK+AYSSN+S ILDP+++NNYQLFM+
Sbjct: 306 FIIGRIQKQIPPGRTLFIGSNERTPDFFSPLAIRYKVAYSSNFSEILDPIIENNYQLFMV 365

BLAST of Cp4.1LG10g11110 vs. TAIR 10
Match: AT3G56750.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G41150.2); Has 128 Blast hits to 128 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 117; Viruses - 0; Other Eukaryotes - 11 (source: NCBI BLink). )

HSP 1 Score: 451.1 bits (1159), Expect = 8.7e-127
Identity = 236/403 (58.56%), Positives = 285/403 (70.72%), Query Frame = 0

Query: 5   KTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFKNLNQ 64
           K Q+ KP   S  L  F  + V +FL LFSS+IST  +     P   ++ + F +    +
Sbjct: 6   KAQRTKPTSGSQRLVLF-CIVVFSFLLLFSSVISTGKL---GLPYQQTLIDYFVWSPRGK 65

Query: 65  KQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFVMPS 124
           +Q       S ++K+LYWGNRIDCPGK+CE+C GLGHQESSLRCALEEAMFL R FVMPS
Sbjct: 66  RQH------SLSEKYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMPS 125

Query: 125 RMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSK------ 184
            MCINPIHNKKGIL++S N ++EE W  +SCAMDSLYD+DLIS+ +PVILD+SK      
Sbjct: 126 GMCINPIHNKKGILNRSDNKTTEEGWLGSSCAMDSLYDIDLISEKIPVILDDSKTWHIVL 185

Query: 185 ----------------------------------------LWFMECKDRNNRSAILLPYK 244
                                                    WF+ECKDR+NRSA++LPY 
Sbjct: 186 STSMKLGERGIAHVSGVTRHRLKESHYSNLLIINRTASPLAWFVECKDRSNRSAVMLPYS 245

Query: 245 FLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEF 304
           FLP+MAA  LR+A+EKIK  LGDYDAIHVRRGDK+KTRKDRFGV+R   PHLDRDTRPEF
Sbjct: 246 FLPNMAAAKLRNAAEKIKAQLGDYDAIHVRRGDKLKTRKDRFGVERIQFPHLDRDTRPEF 305

Query: 305 MLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQLFMIE 362
           +L+RI K +P GRTLFI SNER PGFFSPL+ RYKLAYSSN+S ILDP+++NNYQLFM+E
Sbjct: 306 ILRRIEKRIPRGRTLFIGSNERKPGFFSPLAVRYKLAYSSNFSEILDPIIENNYQLFMME 365

BLAST of Cp4.1LG10g11110 vs. TAIR 10
Match: AT2G41150.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: leaf; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G56750.1); Has 57 Blast hits to 57 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 56; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 233.0 bits (593), Expect = 3.7e-61
Identity = 130/261 (49.81%), Positives = 159/261 (60.92%), Query Frame = 0

Query: 5   KTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFKNLNQ 64
           K  K K  P S  L     +AV AFL LF+S+IST G+   + P   ++   F       
Sbjct: 6   KPHKLKATPGSQRLVLLCIVAV-AFLLLFTSVISTGGL---ALPYRTTLIGYF------V 65

Query: 65  KQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFVMPS 124
           +  RN+   S +DK+LYWGNRIDCPGK+CE+C GLGHQESSLRCALEEAMFL R FVMPS
Sbjct: 66  RSTRNKTQHSLSDKYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMPS 125

Query: 125 RMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSK------ 184
           RMCINPIHNKKGIL++S N + EE WE +SCAM+SLYD+DLIS+ +PVILD+S+      
Sbjct: 126 RMCINPIHNKKGILNRSNNETREESWEVSSCAMESLYDIDLISEKIPVILDDSETWHIML 185

Query: 185 -----------------------------------------LWFMECKDRNNRSAILLPY 219
                                                     WF+ECKDR NRS ++LPY
Sbjct: 186 STSMKLKERGSAHVYGANRHELNDSSDFTNLLLINRTASPLAWFVECKDRGNRSDVMLPY 245

BLAST of Cp4.1LG10g11110 vs. TAIR 10
Match: AT4G08810.1 (calcium ion binding )

HSP 1 Score: 77.8 bits (190), Expect = 2.0e-14
Identity = 75/313 (23.96%), Positives = 120/313 (38.34%), Query Frame = 0

Query: 78  KFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFVMPSRMCINPIHNKKG- 137
           K+LY+    D        C+G+     S  C L EAM+L R FVM   +C++  ++ KG 
Sbjct: 249 KYLYYSRGGD-------YCKGMNQYMWSFLCGLGEAMYLNRTFVMDLSLCLSSSYSSKGK 308

Query: 138 ------------ILHQSTNASSEERWE-----------------TNSCAMDSLYDMDLIS 197
                         H    AS  E  E                         +  + L  
Sbjct: 309 DEEGKDFRYYFDFEHLKETASIVEEGEFLRDWKKWNRLHKRKVPVRKVKTHRVSPLQLSK 368

Query: 198 DTVPVIL------DNSKLWFMECKDRNNRSAILLPYKFLPSMAAENLRDASEKIKELLGD 257
           D   +I       +    W+  C+ + ++      +    S    N+   SE   ++  D
Sbjct: 369 DKSTIIWRQFDTPEPENYWYRVCEGQASKYVERPWHALWKSKRLMNI--VSEISGKMDWD 428

Query: 258 YDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPPGRTLFIASNERT 317
           +DA+HV RG+K K +K        L PHLD DT P+ +L ++   V   R L++A+NE  
Sbjct: 429 FDAVHVVRGEKAKNKK--------LWPHLDADTWPDAILTKLKGLVQVWRNLYVATNEPF 488

Query: 318 PGFFSPLSARYKLAYSSNYSHIL----------------DPVVKNNYQLFMIERLVMAGA 339
             +F  L ++YK+    +YS++                  PV  + Y    ++  V    
Sbjct: 489 YNYFDKLRSQYKVHLLDDYSYLWGNKSEWYNETSLLNNGKPVEFDGYMRVAVDTEVFYRG 544

BLAST of Cp4.1LG10g11110 vs. TAIR 10
Match: AT4G12700.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G04280.1); Has 136 Blast hits to 136 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 125; Viruses - 0; Other Eukaryotes - 11 (source: NCBI BLink). )

HSP 1 Score: 76.6 bits (187), Expect = 4.5e-14
Identity = 68/296 (22.97%), Positives = 118/296 (39.86%), Query Frame = 0

Query: 94  ESCEGLGHQESSLRCALEEAMFLQRVFVMPSRMCINPIHNKKG-------------ILHQ 153
           + C+ + H   S  CAL EA +L R  VM   +C++ ++   G               H 
Sbjct: 264 DRCKSMNHFLWSFLCALGEAQYLNRTLVMDLTLCLSSVYTLSGQNEEGKDFRFYFDFEHL 323

Query: 154 STNASSEER----------WETNSCAMDSLYDMDL-------ISDTVPV----ILDNSKL 213
              AS  ++          ++ N   +  + D  +       + DT+ +     ++    
Sbjct: 324 KEAASMLDQVQFWADWGKWYKKNGLKLHLVEDFRVTPMKLVDVKDTLIMRKFGTVEPDNY 383

Query: 214 WFMECKDRNNRSAILLPYKFLPSMAAENLRDASEKIKELLG-DYDAIHVRRGDKIKTRKD 273
           W+  C +    S +  P+  L    ++ L +    I   L  DYDAIH+ RGDK +    
Sbjct: 384 WYRVC-EGETESVVQRPWNLL--WKSKRLMEIVSAIASRLNWDYDAIHIERGDKAR---- 443

Query: 274 RFGVDRSLHPHLDRDTRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSS 333
               ++ + P+L++DT P  +L  +   +  GR L+IA+NE    FF+PL  +YK  +  
Sbjct: 444 ----NKEVWPNLEKDTSPSSILSTLQDKIEQGRNLYIATNEPELSFFNPLKDKYKPHFLD 503

Query: 334 NYSHILD----------------PVVKNNYQLFMIERLVMAGAKTFIRTFKEDDTD 339
            +  + D                PV  + Y    ++  V    K  I TF +   D
Sbjct: 504 EFKDLWDESSEWYSETTKLNGGNPVEFDGYMRASVDTEVFLRGKKQIETFNDLTND 548
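
The identity and positives percentages in the HSP header lines follow directly from the match counts and the alignment length. The sketch below recomputes them from one header line; the function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Header line copied from the AT4G08810.1 hit above, with "Positives" spelled out.
HSP_LINE = "Identity = 75/313 (23.96%), Positives = 120/313 (38.34%), Query Frame = 0"

def hsp_percentages(line):
    """Return {label: (matches, aligned_length, percent)} recomputed from the counts."""
    stats = {}
    # Matches fields of the form "<label> = <matches>/<length>"; "Query Frame = 0" is skipped.
    for label, hits, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        hits, length = int(hits), int(length)
        stats[label] = (hits, length, round(100 * hits / length, 2))
    return stats

print(hsp_percentages(HSP_LINE))
```

The recomputed values agree with the percentages printed in the header, which confirms that both are expressed relative to the full alignment length (including gap columns) rather than the query length.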

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
XP_023545162.1  1.98e-258  88.56     uncharacterized protein LOC111804548 isoform X1 [Cucurbita pepo subsp. pepo]
KAG7033584.1    1.33e-256  87.83     hypothetical protein SDJN02_03308 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022967807.1  7.43e-256  88.08     uncharacterized protein LOC111467213 isoform X1 [Cucurbita maxima]
XP_022932889.1  5.58e-251  86.86     uncharacterized protein LOC111439414 isoform X1 [Cucurbita moschata]
XP_022153942.1  2.44e-236  80.29     uncharacterized protein LOC111021332 [Momordica charantia]

Match Name  E-value    Identity  Description
A0A6J1HRU4  3.60e-256  88.08     O-fucosyltransferase family protein OS=Cucurbita maxima OX=3661 GN=LOC111467213 ...
A0A6J1EY95  2.70e-251  86.86     uncharacterized protein LOC111439414 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1DKB9  1.18e-236  80.29     O-fucosyltransferase family protein OS=Momordica charantia OX=3673 GN=LOC1110213...
A0A5D3DWC1  4.57e-231  80.39     O-fucosyltransferase family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5...
A0A1S3C1H9  4.57e-231  80.39     O-fucosyltransferase family protein OS=Cucumis melo OX=3656 GN=LOC103495824 PE=3...

Match Name   E-value   Identity  Description
AT2G41150.2  5.1e-127  58.42     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G56750.1  8.7e-127  58.56     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G41150.1  3.7e-61   49.81     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G08810.1  2.0e-14   23.96     calcium ion binding
AT4G12700.1  4.5e-14   22.97     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
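
The Arabidopsis hits fall into two clearly separated groups: three with E-values of 3.7e-61 or smaller and at least about 50% identity, and two near 1e-14 with about 23% identity. A minimal sketch of separating them by an E-value cutoff follows; the cutoff of 1e-50 and the names `EVALUE_CUTOFF` and `strong_hits` are assumptions chosen only for illustration and are not settings used by the database.

```python
# Rows transcribed from the TAIR 10 hit table: (match name, E-value, percent identity).
TAIR_HITS = [
    ("AT2G41150.2", 5.1e-127, 58.42),
    ("AT3G56750.1", 8.7e-127, 58.56),
    ("AT2G41150.1", 3.7e-61, 49.81),
    ("AT4G08810.1", 2.0e-14, 23.96),
    ("AT4G12700.1", 4.5e-14, 22.97),
]

# Hypothetical cutoff for illustration only; not a value taken from the source page.
EVALUE_CUTOFF = 1e-50

def strong_hits(hits, cutoff=EVALUE_CUTOFF):
    """Return match names whose E-value is at or below the cutoff, lowest E-value first."""
    kept = [hit for hit in hits if hit[1] <= cutoff]
    return [name for name, _, _ in sorted(kept, key=lambda hit: hit[1])]

print(strong_hits(TAIR_HITS))
```

With this cutoff the two low-identity matches (AT4G08810.1 and AT4G12700.1) are excluded; a different threshold may be more appropriate depending on the analysis.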
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source   Source Term    Source Description    Alignment
None      No IPR available  GENE3D   3.40.50.11350  -                     coord: 187..333, e-value: 3.3E-7, score: 32.2
None      No IPR available  PANTHER  PTHR31469:SF8  PLANT/PROTEIN         coord: 3..180
None      No IPR available  PANTHER  PTHR31469      OS07G0633600 PROTEIN  coord: 3..180
None      No IPR available  PANTHER  PTHR31469      OS07G0633600 PROTEIN  coord: 180..362
None      No IPR available  PANTHER  PTHR31469:SF8  PLANT/PROTEIN         coord: 180..362

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG10g11110.1  Cp4.1LG10g11110.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006004 fucose metabolic process
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016740 transferase activity