CmoCh03G001510 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh03G001510
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionO-fucosyltransferase family protein
LocationCmo_Chr03: 2116830 .. 2121333 (-)
RNA-Seq ExpressionCmoCh03G001510
SyntenyCmoCh03G001510
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ACATAATCACAGACATCTTTTGCTTCTCAAATTTCTTCGATTCAAAGTTACATATCAATCAATCAATCCATAATATCCTTTACGATTAGTTTATAATTCTCAGTACTCACTACGATACTTGCTCGATCCAAGTTCTTGTCCTTCACCGGATGCGATCTATTAGCATCCCTGTTTGCGCCGAAGATTACAATCTCTCTTCCACTCTGCAATTTCCATGGCGATTCACAAGACCCAGAAGGCAAAACCCAAACCCAGATCCCCATTCCTCTTCTTCTTCGTTGCCCTCGCCGTCATTGCGTTTCTTTTCCTATTTTCCTCTCTGATTTCCACTATTGGGGTTTCTTCTTCTTTTCCATCATCAAATTCGATTCGTGAAATCTTCAGATTCAAGAATTTGAACCAGAAACAGAGACGTAATCGGCACGTTTTTAGTACGAACGACAAGTTCTTGTACTGGGGCAACAGAATCGACTGTCCTGGGAAGCATTGCGAGTCTTGCGAGGGTTTGGGTCACCAGGAGTCCAGCTTGAGGTGCGCCCTTGAGGAAGCCATGTTCCTCCAGAGGTAATTTTCATGAAGTTTTTCTGATTTCTGGGGCTTAATCTTTACATGAACATGTTTTCTCCTAGTTTAATTTGGTGGATTGGCATGTTAGGATTTCTGTGTGAAAGGGGTTTGGAATTGGATTAGCAGTAACTCATCAGCACTGTAGTGGTTCTATCATCACTAAACGTAGAATTATCCGTTGAGGATGGTTGGGAGGGAGTCACACGTTGGCTAATACATCTCTAGATGAAAGCCATGTGGACAATAGCATACCATTGTGGAGATACTTGGTATCAGAGTCATGCCTTTAACTTAACCATGTCAATAGAATCCTCAAATGTCGAACCAAGAAGTTGTGAGCCTTAATTAAGGGGAGGTTGTTCGAGGACTCTATAGGCCTCAAGGGAGGCTCTATGGTGTACTTCGTTCGAGGGGAGGACTGTGGAGATTCGTGATTCCTAACATTATCATATCTTGAAACTCACGAGTTCAAGAACTTCTGTTGGCCCCTTGCTTGATCTTTATGCTGTGTTATTGGTATGAGGTGCCATTTAGTTTGTCTCAGCTCTTTGTAATGGTAATGGATTGGTCCAACGCCGTTAAGTTCAGCATAATTTATTGGTAGAATGCTTTTGCATAACCTACTTAGACAATTTACTTTACTTTCTTTTTGGTGCAGAGTATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACACAATAAGAAAGGGATTCTTCATCACTCCACCAATGCAAGCTCAGAGGAAAGGTGCTTCTTCCTCTTTTCTAATCCTATAATGAGTTATACTTGATGATAGTTGAAACGGGTATTCAATTGATTCGCTTCATGCTTCAAACTAAGACGCGAACTCCCTTGGATAAAGATCTTTGTTCTTCATGTGTTTGTGATCTTTTCTAGAAAGAACACAGAATCTGAATTTGTTCTCTGCATTGCAGATGGGAAACAAACTCTTGTGCCATGGATTCGTTGTACGATATGGATCTTATATCTGATACCGTACCGGTGATTTTAGACAACTCAAAATTATGGTATCAGGTGCAGTCAACTGGTATGAAATTAGGATCTAGAGTGGTTGCCCATGTTGATCAAGTTAGTCGTATTGAACTCAGAGACGACAGCCGCTACTCCAATCTTTTGATAATAAATCGAACTGCCAGCCCTCTTTCATGGTAAATCAAACTTGGGGTAATAAGATTCTTTTATGTCTTTTGTATGTGTCTGTGAGATCCCATATCGGTGGGAGAGAGGAACGAAACATTCCTTATAAGAGTGTAGAAACCTCTCCCTAGCAGACGAGTTTTAAAACGTTGAAGGGAATCCCAGAAGGGAAAACCCAAAGAGGACAATATCTACTAGCGGTGGGATTGGGCTGTTACAAATGTTATCAGAGCCAACCGGGCGGTGTGCCAGCGAGGACGTTGGTCCTCCAAGGGGGGTGGATTGCGAGATCCCACATCGGTTGGAGAGGGGAACAAAACATTCCTTATAAGAGTGTAGAAATCTTTCCCTAGCAGACACGTTTTAAAATATTGAGGGGAAGCCCAGAAGGAAAAACCCAAAGAGGACACTATCTACTAACGGTGAGCTTGGGCTGTTACAAATAGTATCAGAACCAGACACCAGACAGTATGCCACCGGGCGGTAAAAGCTCAATTAGGCTCGTCTTTTTAATGTTTTGTTTTCATGCATTTCTTTGCAACTTGGTACGACGGATATTTCAATTTAAGATCCCAATTTATGAATGATTAGGGCGTTAATGCCGTGATAACTCTTTAGGTTTATGGAATGCAAGGACAGAAACAACCGCAGTGCCATATTGTTGCCCTATAAATTTCTTCCTTCTATGGCAGCAGAAAACTTGAGGGATGCATCTGAGAAGGTATTTTTTTGAAACCATATTAGGTTCTTTCTTTTGTTATTCATATTTGCAGTCTTCTACGAATCATGACATACGGTAGAGCCTATCGCTGTTAGCTCGCTGCTTACGAAACCTAGCTTTCGTGCATTTCTTTATTTCTACGTTGCTGTTGAATAGTTGTAGTAGATGCTCAACTTCCGTGTTATAAGATTAGGCATTTGCTTTATTTGGTTTTCCTGGATGCTTCTTAGCATAAACTCACCCTCTTAGAAAATATAATTACTGAACTCTTGATCGACCGAAAAGACTTCACTTTAGATACCCTGACACGAATAACGAACCCTAATTTAAAGACTCACGAGGGAAGTAGGCATAATTTTTAGAAACAAGAAACTAAAAACATAATGGTTACCTATTGAGGCCTAATTTTCCTCTCCTAGGATGACCCTAGCCAGTTCCCTTGCCGGGCTTGGAGGTTTGATTGAAAGTAGCGCTCGAGGATTGTAATTGGAAATGGAAGCTCGGTTCCCCTCGCACTCTCACTCTATTTTGAATTAGGGATTTGCCTCTTTCTTTCTCTACCGCCTGTGTCTATCTCCTGTCAGACTTTCCACCCGTAGGATCGCTTCTAAGTCCAATATCCAAAGTACTTGGAAATTCGTCTTTTTCCGTCTTTGATGAACTTTGAAGTTTTGTCATCAGGCCATGGTTGTATAATGGTGAAAGGCAGAAGAACATAGAAGGGCTTGTATGCAAACATTTGCAGAAGCAAGCCAGTTGTAACCATTATTTATGGATAGGTTTGTAGAAAATCTTGGAAGTACCTGAAACTTTATGGTTTTGGTGCCTTGGAAACTGTACAGAAGGCACATTTCTTTATTTCATAGGAAAATTCGATAAATCCGAAGCGAGTAATTAGCTACAGTGAGTGGTTGGTAAGCCTTGTATTAGCTCGAGACCGAGTTTGTTTGCAGTTCTCTGTTTCTGTTTGGGTTACTTATATGTTATACTGTGTGTAATAGCTGTGAGAAAAGATTACAAGTTGTTTCTCATTCATTTTGGAATATAGTTCTATAAGTAATACTGAAAAGAATGTCTGATCAATAGATTAAAGAGCTACTCGGTGATTACGACGCCATCCATGTTCGTCGTGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCACATCTCGACAGGGATACACGGCCTGAGTTTATGCTAAAGAGAATAGCAAAGTGGGTTCCGCCAGGGCGGACACTTTTTATTGCTTCAAATGAGAGAATGCCGGGGTTCTTCTCGCCCCTCTCTGCTCGGTGAGTCCATCTCACTTAACGTTTTAGTTCGCTCGAGTTAGCTGAAATGTTGGTTCTTTGCCATTTAGTAGTACGAAGACTTGGTTCAAAAGATCGATCTTCTTGTGTTCCGATAGCGATATTATAAATCATTGAACTGTCTCTCATATAGTCCTACCACCACCTTGATTCATGATACCCCAATTCTTCTGCAGGTACAAGTTGGCTTATTCCTCGAACTATAGCCATATTCTGGGTCCTGTGGTTAAGAACAATTATCAGTTGTTCATGATCGAAAGGCTCATTATGGCGGGTGCCAAGACATTCATCAGAACGTTCAAAGAAGACAATACAGATCTTAGCCTCACCGATGACCCAAAGAAGAACACGAAAGTGTGGCAAAAACCAATCTACACAGATGACGAAGAAAGTAGCTGAGGATTTGTTATTTCACTGGGAAAACATCTTGAAGACATCTTTTTGATCCATGCTAATGCCAACCAACAAAAGCCTTTCATTGCTCGTTGTAAAGCTTCGTATGAATGAATGGTAATAGGAAAGGTAGATTCTTGGCCGATGTTGTTGTATATGTGGTATAAATTATGATCCTGGTTTCTGAATTTCATTACTTCCATTTCTTTAGTGTCACGTGTATTCTTTTTTTTTTGAATGGAATACATCTTTCGTTACTTGAAATTTTTTGATGTAGCGTGTAATAACTCTAGTTGTTCGTTGTTCAAAGATTCTTGTTTAGAATTA

mRNA sequence

ACATAATCACAGACATCTTTTGCTTCTCAAATTTCTTCGATTCAAAGTTACATATCAATCAATCAATCCATAATATCCTTTACGATTAGTTTATAATTCTCAGTACTCACTACGATACTTGCTCGATCCAAGTTCTTGTCCTTCACCGGATGCGATCTATTAGCATCCCTGTTTGCGCCGAAGATTACAATCTCTCTTCCACTCTGCAATTTCCATGGCGATTCACAAGACCCAGAAGGCAAAACCCAAACCCAGATCCCCATTCCTCTTCTTCTTCGTTGCCCTCGCCGTCATTGCGTTTCTTTTCCTATTTTCCTCTCTGATTTCCACTATTGGGGTTTCTTCTTCTTTTCCATCATCAAATTCGATTCGTGAAATCTTCAGATTCAAGAATTTGAACCAGAAACAGAGACGTAATCGGCACGTTTTTAGTACGAACGACAAGTTCTTGTACTGGGGCAACAGAATCGACTGTCCTGGGAAGCATTGCGAGTCTTGCGAGGGTTTGGGTCACCAGGAGTCCAGCTTGAGGTGCGCCCTTGAGGAAGCCATGTTCCTCCAGAGAGTATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACACAATAAGAAAGGGATTCTTCATCACTCCACCAATGCAAGCTCAGAGGAAAGATGGGAAACAAACTCTTGTGCCATGGATTCGTTGTACGATATGGATCTTATATCTGATACCGTACCGGTGATTTTAGACAACTCAAAATTATGGTATCAGGTGCAGTCAACTGGTATGAAATTAGGATCTAGAGTGGTTGCCCATGTTGATCAAGTTAGTCGTATTGAACTCAGAGACGACAGCCGCTACTCCAATCTTTTGATAATAAATCGAACTGCCAGCCCTCTTTCATGGTTTATGGAATGCAAGGACAGAAACAACCGCAGTGCCATATTGTTGCCCTATAAATTTCTTCCTTCTATGGCAGCAGAAAACTTGAGGGATGCATCTGAGAAGATTAAAGAGCTACTCGGTGATTACGACGCCATCCATGTTCGTCGTGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCACATCTCGACAGGGATACACGGCCTGAGTTTATGCTAAAGAGAATAGCAAAGTGGGTTCCGCCAGGGCGGACACTTTTTATTGCTTCAAATGAGAGAATGCCGGGGTTCTTCTCGCCCCTCTCTGCTCGGTACAAGTTGGCTTATTCCTCGAACTATAGCCATATTCTGGGTCCTGTGGTTAAGAACAATTATCAGTTGTTCATGATCGAAAGGCTCATTATGGCGGGTGCCAAGACATTCATCAGAACGTTCAAAGAAGACAATACAGATCTTAGCCTCACCGATGACCCAAAGAAGAACACGAAAGTGTGGCAAAAACCAATCTACACAGATGACGAAGAAAGTAGCTGAGGATTTGTTATTTCACTGGGAAAACATCTTGAAGACATCTTTTTGATCCATGCTAATGCCAACCAACAAAAGCCTTTCATTGCTCGTTGTAAAGCTTCGTATGAATGAATGGTAATAGGAAAGGTAGATTCTTGGCCGATGTTGTTGTATATGTGGTATAAATTATGATCCTGGTTTCTGAATTTCATTACTTCCATTTCTTTAGTGTCACGTGTATTCTTTTTTTTTTGAATGGAATACATCTTTCGTTACTTGAAATTTTTTGATGTAGCGTGTAATAACTCTAGTTGTTCGTTGTTCAAAGATTCTTGTTTAGAATTA

Coding sequence (CDS)

ATGGCGATTCACAAGACCCAGAAGGCAAAACCCAAACCCAGATCCCCATTCCTCTTCTTCTTCGTTGCCCTCGCCGTCATTGCGTTTCTTTTCCTATTTTCCTCTCTGATTTCCACTATTGGGGTTTCTTCTTCTTTTCCATCATCAAATTCGATTCGTGAAATCTTCAGATTCAAGAATTTGAACCAGAAACAGAGACGTAATCGGCACGTTTTTAGTACGAACGACAAGTTCTTGTACTGGGGCAACAGAATCGACTGTCCTGGGAAGCATTGCGAGTCTTGCGAGGGTTTGGGTCACCAGGAGTCCAGCTTGAGGTGCGCCCTTGAGGAAGCCATGTTCCTCCAGAGAGTATTTGTAATGCCCTCTAGAATGTGTATCAACCCTATACACAATAAGAAAGGGATTCTTCATCACTCCACCAATGCAAGCTCAGAGGAAAGATGGGAAACAAACTCTTGTGCCATGGATTCGTTGTACGATATGGATCTTATATCTGATACCGTACCGGTGATTTTAGACAACTCAAAATTATGGTATCAGGTGCAGTCAACTGGTATGAAATTAGGATCTAGAGTGGTTGCCCATGTTGATCAAGTTAGTCGTATTGAACTCAGAGACGACAGCCGCTACTCCAATCTTTTGATAATAAATCGAACTGCCAGCCCTCTTTCATGGTTTATGGAATGCAAGGACAGAAACAACCGCAGTGCCATATTGTTGCCCTATAAATTTCTTCCTTCTATGGCAGCAGAAAACTTGAGGGATGCATCTGAGAAGATTAAAGAGCTACTCGGTGATTACGACGCCATCCATGTTCGTCGTGGAGATAAAATAAAGACCAGAAAGGACAGGTTTGGTGTTGATAGAAGCTTACATCCACATCTCGACAGGGATACACGGCCTGAGTTTATGCTAAAGAGAATAGCAAAGTGGGTTCCGCCAGGGCGGACACTTTTTATTGCTTCAAATGAGAGAATGCCGGGGTTCTTCTCGCCCCTCTCTGCTCGGTACAAGTTGGCTTATTCCTCGAACTATAGCCATATTCTGGGTCCTGTGGTTAAGAACAATTATCAGTTGTTCATGATCGAAAGGCTCATTATGGCGGGTGCCAAGACATTCATCAGAACGTTCAAAGAAGACAATACAGATCTTAGCCTCACCGATGACCCAAAGAAGAACACGAAAGTGTGGCAAAAACCAATCTACACAGATGACGAAGAAAGTAGCTGA

Protein sequence

MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSSFPSSNSIREIFRFKNLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFVMPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWYQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQLFMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS
Homology
BLAST of CmoCh03G001510 vs. ExPASy TrEMBL
Match: A0A6J1EY95 (uncharacterized protein LOC111439414 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439414 PE=4 SV=1)

HSP 1 Score: 827.8 bits (2137), Expect = 2.0e-236
Identity = 410/410 (100.00%), Postives = 410/410 (100.00%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSSFPSSNSIREIFRFKN 60
           MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSSFPSSNSIREIFRFKN
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSSFPSSNSIREIFRFKN 60

Query: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120
           LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV
Sbjct: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120

Query: 121 MPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180
           MPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY
Sbjct: 121 MPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180

Query: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAIL 240
           QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAIL
Sbjct: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAIL 240

Query: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300
           LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Sbjct: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300

Query: 301 RPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQL 360
           RPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQL
Sbjct: 301 RPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQL 360

Query: 361 FMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS 411
           FMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS
Sbjct: 361 FMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS 410

BLAST of CmoCh03G001510 vs. ExPASy TrEMBL
Match: A0A6J1HRU4 (O-fucosyltransferase family protein OS=Cucurbita maxima OX=3661 GN=LOC111467213 PE=3 SV=1)

HSP 1 Score: 814.3 bits (2102), Expect = 2.3e-232
Identity = 404/410 (98.54%), Postives = 406/410 (99.02%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSSFPSSNSIREIFRFKN 60
           MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLIST GVSSSFPSSNSIREIFRFKN
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSFPSSNSIREIFRFKN 60

Query: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120
           LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV
Sbjct: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120

Query: 121 MPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180
           MPSRMCINPIHNKKGILH STNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY
Sbjct: 121 MPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180

Query: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAIL 240
           QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLL+INRTASPLSWFMECKDRNNRSAIL
Sbjct: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAIL 240

Query: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300
           LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Sbjct: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300

Query: 301 RPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQL 360
           RPEFMLKRIAKWVPPGRTLFIASNER PGFFSPLSARYKLAYSSNYSHIL PVVKNNYQL
Sbjct: 301 RPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQL 360

Query: 361 FMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS 411
           FMIERLIMAGAKTFIRTFKED+TDLSLTDDPKKNTKVWQKPIYTDDEESS
Sbjct: 361 FMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 410

BLAST of CmoCh03G001510 vs. ExPASy TrEMBL
Match: A0A6J1DKB9 (O-fucosyltransferase family protein OS=Momordica charantia OX=3673 GN=LOC111021332 PE=3 SV=1)

HSP 1 Score: 740.7 bits (1911), Expect = 3.2e-210
Identity = 363/411 (88.32%), Postives = 385/411 (93.67%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSS-FPSSNSIREIFRFK 60
           MA  + QKAKPKPRSP  FFFVALA IAFLFLFSSLIST G SSS F SSNSI++IFRF 
Sbjct: 1   MAFPRAQKAKPKPRSPLFFFFVALAAIAFLFLFSSLISTNGASSSTFSSSNSIQKIFRFN 60

Query: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120
           N+N+K +RNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFL+R+F
Sbjct: 61  NVNEKPKRNRHVFSANDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLRRIF 120

Query: 121 VMPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKGILH S NASSEE WE  SCAMDSLYD+DLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHQSNNASSEESWEAKSCAMDSLYDIDLISDTVPVILDNSKLW 180

Query: 181 YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAI 240
           YQV STGMKLG+R VAHV++VSR EL+D++RYSNLL+INRTASPLSWFMECKDRNNRSAI
Sbjct: 181 YQVLSTGMKLGARAVAHVERVSRAELKDNNRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQ 360
           TRPEFMLKR+AKWV PGRTLFIASNER PGFFSPLSARYKLAYSSNYSHIL P+VKNNYQ
Sbjct: 301 TRPEFMLKRLAKWVAPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPMVKNNYQ 360

Query: 361 LFMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS 411
           LFMIERLIMAGAKTFIRTFKED+TDLSLTDDPKKNTK+WQKP+YTDDEE S
Sbjct: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTDDEEKS 411

BLAST of CmoCh03G001510 vs. ExPASy TrEMBL
Match: A0A5D3DWC1 (O-fucosyltransferase family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold749G00060 PE=3 SV=1)

HSP 1 Score: 736.1 bits (1899), Expect = 8.0e-209
Identity = 364/407 (89.43%), Postives = 380/407 (93.37%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSSFPSSNSIREIFRFKN 60
           MA  +TQK KPK RSP +FFFV+LA IAFLFLFSSLIST G SSSFPSSNSI++IFRFKN
Sbjct: 1   MAFPRTQKPKPKHRSPLIFFFVSLAAIAFLFLFSSLISTNG-SSSFPSSNSIQKIFRFKN 60

Query: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120
           L QKQRR RH FS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR FV
Sbjct: 61  LTQKQRRGRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFV 120

Query: 121 MPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180
           MPSRMCINPIHNKKG+LH STNASSEE WE NSCAMDSLYDMDLISDTVPVILDNSK WY
Sbjct: 121 MPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWY 180

Query: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAIL 240
           QV ST MKLG+R VAHV+QVSRIELRD S YSNLL+INRTASPLSWFMECKDRNNRSA++
Sbjct: 181 QVLSTSMKLGARAVAHVEQVSRIELRDSSHYSNLLLINRTASPLSWFMECKDRNNRSAVM 240

Query: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300
           LPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Sbjct: 241 LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300

Query: 301 RPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQL 360
           RPEFMLKRIAKWVP GRTLFIASNER+PGFFSPLSARYKLAYSSNYS IL PVVKNNYQL
Sbjct: 301 RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQL 360

Query: 361 FMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDE 408
           FMIERLIMAGAKTFIRTFKED+TDLSLTDDPKKNTKVWQ P+YTD+E
Sbjct: 361 FMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIPVYTDEE 406

BLAST of CmoCh03G001510 vs. ExPASy TrEMBL
Match: A0A1S3C1H9 (O-fucosyltransferase family protein OS=Cucumis melo OX=3656 GN=LOC103495824 PE=3 SV=1)

HSP 1 Score: 736.1 bits (1899), Expect = 8.0e-209
Identity = 364/407 (89.43%), Postives = 380/407 (93.37%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSSFPSSNSIREIFRFKN 60
           MA  +TQK KPK RSP +FFFV+LA IAFLFLFSSLIST G SSSFPSSNSI++IFRFKN
Sbjct: 1   MAFPRTQKPKPKHRSPLIFFFVSLAAIAFLFLFSSLISTNG-SSSFPSSNSIQKIFRFKN 60

Query: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120
           L QKQRR RH FS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR FV
Sbjct: 61  LTQKQRRGRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTFV 120

Query: 121 MPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180
           MPSRMCINPIHNKKG+LH STNASSEE WE NSCAMDSLYDMDLISDTVPVILDNSK WY
Sbjct: 121 MPSRMCINPIHNKKGLLHQSTNASSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWY 180

Query: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAIL 240
           QV ST MKLG+R VAHV+QVSRIELRD S YSNLL+INRTASPLSWFMECKDRNNRSA++
Sbjct: 181 QVLSTSMKLGARAVAHVEQVSRIELRDSSHYSNLLLINRTASPLSWFMECKDRNNRSAVM 240

Query: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300
           LPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Sbjct: 241 LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300

Query: 301 RPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQL 360
           RPEFMLKRIAKWVP GRTLFIASNER+PGFFSPLSARYKLAYSSNYS IL PVVKNNYQL
Sbjct: 301 RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVKNNYQL 360

Query: 361 FMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDE 408
           FMIERLIMAGAKTFIRTFKED+TDLSLTDDPKKNTKVWQ P+YTD+E
Sbjct: 361 FMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQIPVYTDEE 406

BLAST of CmoCh03G001510 vs. NCBI nr
Match: XP_022932889.1 (uncharacterized protein LOC111439414 isoform X1 [Cucurbita moschata])

HSP 1 Score: 827.8 bits (2137), Expect = 4.1e-236
Identity = 410/410 (100.00%), Postives = 410/410 (100.00%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSSFPSSNSIREIFRFKN 60
           MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSSFPSSNSIREIFRFKN
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSSFPSSNSIREIFRFKN 60

Query: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120
           LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV
Sbjct: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120

Query: 121 MPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180
           MPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY
Sbjct: 121 MPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180

Query: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAIL 240
           QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAIL
Sbjct: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAIL 240

Query: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300
           LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Sbjct: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300

Query: 301 RPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQL 360
           RPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQL
Sbjct: 301 RPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQL 360

Query: 361 FMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS 411
           FMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS
Sbjct: 361 FMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS 410

BLAST of CmoCh03G001510 vs. NCBI nr
Match: KAG7033584.1 (hypothetical protein SDJN02_03308 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 815.1 bits (2104), Expect = 2.8e-232
Identity = 406/411 (98.78%), Postives = 408/411 (99.27%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGV-SSSFPSSNSIREIFRFK 60
           MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLIST GV SSSFPSSNSIREIFRFK
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60

Query: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120
           NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF
Sbjct: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120

Query: 121 VMPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180

Query: 181 YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAI 240
           YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLL+INRTASPLSWFMECKDRNNRSAI
Sbjct: 181 YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQ 360
           TRPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHIL PVVKNNYQ
Sbjct: 301 TRPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360

Query: 361 LFMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS 411
           LFMIERLIMAGAKTFIRTFKED+TDLSLTDDPKKNTKVWQKPIYTDDEESS
Sbjct: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 411

BLAST of CmoCh03G001510 vs. NCBI nr
Match: XP_022967807.1 (uncharacterized protein LOC111467213 isoform X1 [Cucurbita maxima])

HSP 1 Score: 814.3 bits (2102), Expect = 4.7e-232
Identity = 404/410 (98.54%), Postives = 406/410 (99.02%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSSFPSSNSIREIFRFKN 60
           MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLIST GVSSSFPSSNSIREIFRFKN
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSFPSSNSIREIFRFKN 60

Query: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120
           LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV
Sbjct: 61  LNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFV 120

Query: 121 MPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180
           MPSRMCINPIHNKKGILH STNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY
Sbjct: 121 MPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWY 180

Query: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAIL 240
           QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLL+INRTASPLSWFMECKDRNNRSAIL
Sbjct: 181 QVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAIL 240

Query: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300
           LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT
Sbjct: 241 LPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 300

Query: 301 RPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQL 360
           RPEFMLKRIAKWVPPGRTLFIASNER PGFFSPLSARYKLAYSSNYSHIL PVVKNNYQL
Sbjct: 301 RPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQL 360

Query: 361 FMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS 411
           FMIERLIMAGAKTFIRTFKED+TDLSLTDDPKKNTKVWQKPIYTDDEESS
Sbjct: 361 FMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 410

BLAST of CmoCh03G001510 vs. NCBI nr
Match: XP_023545162.1 (uncharacterized protein LOC111804548 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 809.3 bits (2089), Expect = 1.5e-230
Identity = 403/411 (98.05%), Postives = 406/411 (98.78%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGV-SSSFPSSNSIREIFRFK 60
           MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLIST GV SSSFPSSNSIREIFRFK
Sbjct: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTNGVSSSSFPSSNSIREIFRFK 60

Query: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120
           NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF
Sbjct: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120

Query: 121 VMPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKGILH STNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHQSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180

Query: 181 YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAI 240
           YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLL+INRTASPLSWFMECKDRNNRSAI
Sbjct: 181 YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQ 360
           TRPEFMLKRIAKWVPPGRTLFIASNER PGFFSPLSARYKLAYSSNYSHIL PVVKNNYQ
Sbjct: 301 TRPEFMLKRIAKWVPPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPVVKNNYQ 360

Query: 361 LFMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS 411
           LFMIERL+MAGAKTFIRTFKED+TDLSLTDDPKKNTKVWQKPIYTDDEESS
Sbjct: 361 LFMIERLVMAGAKTFIRTFKEDDTDLSLTDDPKKNTKVWQKPIYTDDEESS 411

BLAST of CmoCh03G001510 vs. NCBI nr
Match: XP_022153942.1 (uncharacterized protein LOC111021332 [Momordica charantia])

HSP 1 Score: 740.7 bits (1911), Expect = 6.7e-210
Identity = 363/411 (88.32%), Postives = 385/411 (93.67%), Query Frame = 0

Query: 1   MAIHKTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSS-FPSSNSIREIFRFK 60
           MA  + QKAKPKPRSP  FFFVALA IAFLFLFSSLIST G SSS F SSNSI++IFRF 
Sbjct: 1   MAFPRAQKAKPKPRSPLFFFFVALAAIAFLFLFSSLISTNGASSSTFSSSNSIQKIFRFN 60

Query: 61  NLNQKQRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVF 120
           N+N+K +RNRHVFS NDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFL+R+F
Sbjct: 61  NVNEKPKRNRHVFSANDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLRRIF 120

Query: 121 VMPSRMCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLW 180
           VMPSRMCINPIHNKKGILH S NASSEE WE  SCAMDSLYD+DLISDTVPVILDNSKLW
Sbjct: 121 VMPSRMCINPIHNKKGILHQSNNASSEESWEAKSCAMDSLYDIDLISDTVPVILDNSKLW 180

Query: 181 YQVQSTGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAI 240
           YQV STGMKLG+R VAHV++VSR EL+D++RYSNLL+INRTASPLSWFMECKDRNNRSAI
Sbjct: 181 YQVLSTGMKLGARAVAHVERVSRAELKDNNRYSNLLLINRTASPLSWFMECKDRNNRSAI 240

Query: 241 LLPYKFLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300
           LLPYKFLPSMAAENLRDA+EKIK LLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD
Sbjct: 241 LLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRD 300

Query: 301 TRPEFMLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQ 360
           TRPEFMLKR+AKWV PGRTLFIASNER PGFFSPLSARYKLAYSSNYSHIL P+VKNNYQ
Sbjct: 301 TRPEFMLKRLAKWVAPGRTLFIASNERTPGFFSPLSARYKLAYSSNYSHILDPMVKNNYQ 360

Query: 361 LFMIERLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDEESS 411
           LFMIERLIMAGAKTFIRTFKED+TDLSLTDDPKKNTK+WQKP+YTDDEE S
Sbjct: 361 LFMIERLIMAGAKTFIRTFKEDDTDLSLTDDPKKNTKIWQKPVYTDDEEKS 411

BLAST of CmoCh03G001510 vs. TAIR 10
Match: AT3G56750.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G41150.2); Has 128 Blast hits to 128 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 117; Viruses - 0; Other Eukaryotes - 11 (source: NCBI BLink). )

HSP 1 Score: 520.8 bits (1340), Expect = 1.0e-147
Identity = 264/403 (65.51%), Postives = 317/403 (78.66%), Query Frame = 0

Query: 5   KTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSSFPSSNSIREIFRFKNLNQK 64
           K Q+ KP   S  L  F  + V +FL LFSS+IST       P   ++ + F +    ++
Sbjct: 6   KAQRTKPTSGSQRLVLF-CIVVFSFLLLFSSVIST--GKLGLPYQQTLIDYFVWSPRGKR 65

Query: 65  QRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFVMPSR 124
           Q       S ++K+LYWGNRIDCPGK+CE+C GLGHQESSLRCALEEAMFL R FVMPS 
Sbjct: 66  QH------SLSEKYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMPSG 125

Query: 125 MCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWYQVQS 184
           MCINPIHNKKGIL+ S N ++EE W  +SCAMDSLYD+DLIS+ +PVILD+SK W+ V S
Sbjct: 126 MCINPIHNKKGILNRSDNKTTEEGWLGSSCAMDSLYDIDLISEKIPVILDDSKTWHIVLS 185

Query: 185 TGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAILLPYK 244
           T MKLG R +AHV  V+R  L+ +S YSNLLIINRTASPL+WF+ECKDR+NRSA++LPY 
Sbjct: 186 TSMKLGERGIAHVSGVTRHRLK-ESHYSNLLIINRTASPLAWFVECKDRSNRSAVMLPYS 245

Query: 245 FLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEF 304
           FLP+MAA  LR+A+EKIK  LGDYDAIHVRRGDK+KTRKDRFGV+R   PHLDRDTRPEF
Sbjct: 246 FLPNMAAAKLRNAAEKIKAQLGDYDAIHVRRGDKLKTRKDRFGVERIQFPHLDRDTRPEF 305

Query: 305 MLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQLFMIE 364
           +L+RI K +P GRTLFI SNER PGFFSPL+ RYKLAYSSN+S IL P+++NNYQLFM+E
Sbjct: 306 ILRRIEKRIPRGRTLFIGSNERKPGFFSPLAVRYKLAYSSNFSEILDPIIENNYQLFMME 365

Query: 365 RLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDE 408
           RL+M GAKT+ +TFKE  TDL+LTDDPKKN K W+ P+YT DE
Sbjct: 366 RLVMMGAKTYFKTFKEYETDLTLTDDPKKN-KNWEIPVYTMDE 397

BLAST of CmoCh03G001510 vs. TAIR 10
Match: AT2G41150.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: leaf; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G56750.1); Has 127 Blast hits to 127 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 117; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 514.6 bits (1324), Expect = 7.2e-146
Identity = 261/403 (64.76%), Postives = 312/403 (77.42%), Query Frame = 0

Query: 5   KTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSSFPSSNSIREIFRFKNLNQK 64
           K  K K  P S  L     +AV AFL LF+S+IST G+  + P   ++   F       +
Sbjct: 6   KPHKLKATPGSQRLVLLCIVAV-AFLLLFTSVISTGGL--ALPYRTTLIGYF------VR 65

Query: 65  QRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFVMPSR 124
             RN+   S +DK+LYWGNRIDCPGK+CE+C GLGHQESSLRCALEEAMFL R FVMPSR
Sbjct: 66  STRNKTQHSLSDKYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMPSR 125

Query: 125 MCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWYQVQS 184
           MCINPIHNKKGIL+ S N + EE WE +SCAM+SLYD+DLIS+ +PVILD+S+ W+ + S
Sbjct: 126 MCINPIHNKKGILNRSNNETREESWEVSSCAMESLYDIDLISEKIPVILDDSETWHIMLS 185

Query: 185 TGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAILLPYK 244
           T MKL  R  AHV   +R EL D S ++NLL+INRTASPL+WF+ECKDR NRS ++LPY 
Sbjct: 186 TSMKLKERGSAHVYGANRHELNDSSDFTNLLLINRTASPLAWFVECKDRGNRSDVMLPYS 245

Query: 245 FLPSMAAENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEF 304
           FL +MAA  LRDA+EKIK  LGDYDAIHVRRGDK+KTRKDRF V+RS  PHLDRDTRPEF
Sbjct: 246 FLQTMAASRLRDAAEKIKAKLGDYDAIHVRRGDKLKTRKDRFRVERSQFPHLDRDTRPEF 305

Query: 305 MLKRIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILGPVVKNNYQLFMIE 364
           ++ RI K +PPGRTLFI SNER P FFSPL+ RYK+AYSSN+S IL P+++NNYQLFM+E
Sbjct: 306 IIGRIQKQIPPGRTLFIGSNERTPDFFSPLAIRYKVAYSSNFSEILDPIIENNYQLFMVE 365

Query: 365 RLIMAGAKTFIRTFKEDNTDLSLTDDPKKNTKVWQKPIYTDDE 408
           RLIM GAKTF +TF+E  TDL+LTDDPKKN K W+ P+YT DE
Sbjct: 366 RLIMMGAKTFFKTFREYETDLTLTDDPKKN-KNWEIPVYTMDE 398

BLAST of CmoCh03G001510 vs. TAIR 10
Match: AT2G41150.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: leaf; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G56750.1); Has 57 Blast hits to 57 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 56; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 301.2 bits (770), Expect = 1.3e-81
Identity = 156/260 (60.00%), Postives = 191/260 (73.46%), Query Frame = 0

Query: 5   KTQKAKPKPRSPFLFFFVALAVIAFLFLFSSLISTIGVSSSFPSSNSIREIFRFKNLNQK 64
           K  K K  P S  L     +AV AFL LF+S+IST G+  + P   ++   F       +
Sbjct: 6   KPHKLKATPGSQRLVLLCIVAV-AFLLLFTSVISTGGL--ALPYRTTLIGYF------VR 65

Query: 65  QRRNRHVFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFVMPSR 124
             RN+   S +DK+LYWGNRIDCPGK+CE+C GLGHQESSLRCALEEAMFL R FVMPSR
Sbjct: 66  STRNKTQHSLSDKYLYWGNRIDCPGKNCETCAGLGHQESSLRCALEEAMFLNRTFVMPSR 125

Query: 125 MCINPIHNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWYQVQS 184
           MCINPIHNKKGIL+ S N + EE WE +SCAM+SLYD+DLIS+ +PVILD+S+ W+ + S
Sbjct: 126 MCINPIHNKKGILNRSNNETREESWEVSSCAMESLYDIDLISEKIPVILDDSETWHIMLS 185

Query: 185 TGMKLGSRVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAILLPYK 244
           T MKL  R  AHV   +R EL D S ++NLL+INRTASPL+WF+ECKDR NRS ++LPY 
Sbjct: 186 TSMKLKERGSAHVYGANRHELNDSSDFTNLLLINRTASPLAWFVECKDRGNRSDVMLPYS 245

Query: 245 FLPSMAAENLRDASEKIKEL 265
           FL +MAA  LRDA+EK+KEL
Sbjct: 246 FLQTMAASRLRDAAEKVKEL 256

BLAST of CmoCh03G001510 vs. TAIR 10
Match: AT4G08810.1 (calcium ion binding )

HSP 1 Score: 90.1 bits (222), Expect = 4.4e-18
Identity = 79/329 (24.01%), Postives = 139/329 (42.25%), Query Frame = 0

Query: 77  KFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFVMPSRMCINPIHNKKGI 136
           K+LY+    D        C+G+     S  C L EAM+L R FVM   +C++  ++ KG 
Sbjct: 249 KYLYYSRGGD-------YCKGMNQYMWSFLCGLGEAMYLNRTFVMDLSLCLSSSYSSKG- 308

Query: 137 LHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVI-----LDNSKLWYQVQSTGMKLGS 196
                    + R+          +D + + +T  ++     L + K W ++    + +  
Sbjct: 309 ---KDEEGKDFRY---------YFDFEHLKETASIVEEGEFLRDWKKWNRLHKRKVPV-R 368

Query: 197 RVVAHVDQVSRIELRDDSRYSNLLIINRTASPLSWFMECKDRNNRSAILLPYKFLPSMAA 256
           +V  H  +VS ++L  D         +       W+  C+ + ++      +    S   
Sbjct: 369 KVKTH--RVSPLQLSKDKSTIIWRQFDTPEPENYWYRVCEGQASKYVERPWHALWKSKRL 428

Query: 257 ENLRDASEKIKELLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLKRIAK 316
            N+   SE   ++  D+DA+HV RG+K K +K        L PHLD DT P+ +L ++  
Sbjct: 429 MNI--VSEISGKMDWDFDAVHVVRGEKAKNKK--------LWPHLDADTWPDAILTKLKG 488

Query: 317 WVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHILG----------------PVVK 376
            V   R L++A+NE    +F  L ++YK+    +YS++ G                PV  
Sbjct: 489 LVQVWRNLYVATNEPFYNYFDKLRSQYKVHLLDDYSYLWGNKSEWYNETSLLNNGKPVEF 544

Query: 377 NNYQLFMIERLIMAGAKTFIRTFKEDNTD 385
           + Y    ++  +    KT + TF    TD
Sbjct: 549 DGYMRVAVDTEVFYRGKTRVETFYNLTTD 544

BLAST of CmoCh03G001510 vs. TAIR 10
Match: AT2G04280.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G12700.1); Has 130 Blast hits to 130 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 124; Viruses - 0; Other Eukaryotes - 6 (source: NCBI BLink). )

HSP 1 Score: 87.0 bits (214), Expect = 3.7e-17
Identity = 78/333 (23.42%), Postives = 136/333 (40.84%), Query Frame = 0

Query: 71  VFSTNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRVFVMPSRMCINPI 130
           VF T    +Y G          + C+ + H   S  CAL EA +L R  VM   +C++ I
Sbjct: 255 VFKTGKYLVYVGGG--------DRCKSMNHFLWSFLCALGEAQYLNRTLVMDLTLCLSSI 314

Query: 131 HNKKGILHHSTNASSEERWETNSCAMDSLYDMDLISDTVPVILDNSKLWYQVQSTGMKLG 190
           +   G         +EE  +         +D + + +   V LD ++ W Q      K  
Sbjct: 315 YTSSG--------QNEEGKD-----FRFYFDFEHLKEAASV-LDEAQFWAQWGKLRKKRR 374

Query: 191 SRVVAHVDQVSRIELRDDSRYSNLLIINRTAS--PLSWFMECKDRNNRSAILLPYKFLPS 250
           +R+  H+ +  R+     +   + LI+ +  S  P +++    + +  S +  P+  L  
Sbjct: 375 NRLNLHLVEDFRVTPMKLAAVKDTLIMRKFGSVEPDNYWYRVCEGDAESVVKRPWHLL-- 434

Query: 251 MAAENLRDASEKIKELLG-DYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDTRPEFMLK 310
             +  L +    I   L  DYDA+H+ RG+K +        ++ + P+L+ DT P  +L 
Sbjct: 435 WKSRRLMEIVSAIASRLNWDYDAVHIERGEKAR--------NKEVWPNLEADTSPSALLS 494

Query: 311 RIAKWVPPGRTLFIASNERMPGFFSPLSARYKLAYSSNYSHIL----------------G 370
            +   V  GR L+IA+NE    FF+PL  +Y   +  +Y  +                  
Sbjct: 495 TLQDKVEEGRHLYIATNEGELSFFNPLKDKYATHFLYDYKDLWDESSEWYSETTKLNGGN 554

Query: 371 PVVKNNYQLFMIERLIMAGAKTFIRTFKEDNTD 385
           PV  + Y    ++  +    K  I TF +   D
Sbjct: 555 PVEFDGYMRASVDTEVFLRGKKQIETFNDLTND 555

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1EY952.0e-236100.00uncharacterized protein LOC111439414 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1HRU42.3e-23298.54O-fucosyltransferase family protein OS=Cucurbita maxima OX=3661 GN=LOC111467213 ... [more]
A0A6J1DKB93.2e-21088.32O-fucosyltransferase family protein OS=Momordica charantia OX=3673 GN=LOC1110213... [more]
A0A5D3DWC18.0e-20989.43O-fucosyltransferase family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5... [more]
A0A1S3C1H98.0e-20989.43O-fucosyltransferase family protein OS=Cucumis melo OX=3656 GN=LOC103495824 PE=3... [more]
Match NameE-valueIdentityDescription
XP_022932889.14.1e-236100.00uncharacterized protein LOC111439414 isoform X1 [Cucurbita moschata][more]
KAG7033584.12.8e-23298.78hypothetical protein SDJN02_03308 [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_022967807.14.7e-23298.54uncharacterized protein LOC111467213 isoform X1 [Cucurbita maxima][more]
XP_023545162.11.5e-23098.05uncharacterized protein LOC111804548 isoform X1 [Cucurbita pepo subsp. pepo][more]
XP_022153942.16.7e-21088.32uncharacterized protein LOC111021332 [Momordica charantia][more]
Match NameE-valueIdentityDescription
AT3G56750.11.0e-14765.51unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G41150.27.2e-14664.76unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT2G41150.11.3e-8160.00unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT4G08810.14.4e-1824.01calcium ion binding [more]
AT2G04280.13.7e-1723.42unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D3.40.50.11350coord: 235..379
e-value: 1.9E-6
score: 29.7
NoneNo IPR availablePANTHERPTHR31469:SF8PLANT/PROTEINcoord: 3..408
NoneNo IPR availablePANTHERPTHR31469OS07G0633600 PROTEINcoord: 3..408

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh03G001510.1CmoCh03G001510.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006004 fucose metabolic process
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016740 transferase activity