Cp4.1LG02g12440 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG02g12440
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionGlycosyltransferase family protein 2
LocationCp4.1LG02 : 11812438 .. 11816231 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTGTGATTGTTCCGAAATCGTGTTGTGGAAGATGGGAATGTTTCGGAATCCTACGACGCGAAACGGGGAGTATTTAGAAGGGATGATAAATGAGTATGTTGGAGGAGGAAAGGGAAAGTTAAGAGCTCAAAGAAATACCTCAACAAAGCTTGTTACTGCTCTTACATGTCTCCAGTTTGCCTTTGCAATATATGCAACGTTTTTACTTTATTATGTGAGCCCTGCAATAGACTTGAGAACCAAACCAGATTTCTCTTGGGCTACTAGAATTGCTCAGCAATGGAAACAGTTTGTAATACCGCCCCACGTCGTGGGACGATATCAAGAACAAACTTCTTTTGCGGAATTCAGACCGATCACACCGGAGGAAGCTTGTGAGAACGAGAAGATTGATTTCGAGCAAAAGAAGTCGAATGATGCACAGATGATCAAGTTGAAAACAGAGCTTTATAATGAGGTTTTAGATTTTCAAAGCCAAAGCTTTGGAACTGAAACTCTTTCTCAGCTTATGGCAATGAAGTCCAAATGGGATTTGAGAGGGAAAAACAAGCCAAAAGTTACTGTGATCTTGAACCATTTCAAGAGGAAAACTCTGTGTGCACAGATTAACTCTTTGCTGCATCAAACTCTTCCTTTCCACCATGTTTGGGTGCTTGCATTTGGGAGCCCAAATGAGCTCTCATTGAAGAGAATTGTAGATAGCTATAACAACTCAAGAATTAGCTTCATTAGTTCAAGCTATGACTTCAAGTACTATGGAAGGTTCCAAATGGCCTTACAAACCGAGGCCGATTTAGTATACATTCTCGACGACGACATGATTCCTGGCCATAAAATGCTACAGATTTTGTCTCATGTAGCAGGGACGGAGAAGTACAAGAATGCAGTTTTGGGCAGTATAGGGAGGATTTTGCCATTTCGACAGAAGGATTTCACGTTCCCGAGTTATCGGAAGTTTCGTTCCAAGGAGGCGGGGCTTTACTTGCCTGATCCTGCTTATAACATCACAGTCAATAAAATTGTGCAGGTTGATTTTCTCTCCAGCTCTTGGTTCTTATCTGCAGAGCTTGTTAAGACGCTTTTCATTGAAACTCCATTCACCTTTGCAACAGGGGAAGATCTCCATCTAAGGTAACTTTAACTCGATTTACATTCCTGCTTTCTTCATTTCTACGGTACTACAAATCAACTGAGTCATAACATAATGTGAGATCCCACATCGGTTAGAGAGGGGAATAAAGCATTCCTTACAAGGGTGTGGAGACCTCTCCCTAGTATACGCATTTTAAAACCTCAAGGGAAAGCCCAAAGAGGACAATATTTGTTAACGGTGTGCTTGGGCCGTTACAAGTGGTAACAGAGCCAGTCACCTAACGGTGCGAGACACTGGTGGCGTAGGGCTAGACCCTCTCCAGTGGGTTTGGACTGTTACAAATGGTATCAGAGCTAGACACTAAGCAGTGTGCCAGCGAGGACACTGGCCCCCAAGGGGGTGGATTTTGAAATCCCACATCAGTTGGAGNGGTGCGAGACACTGGTGGCGTAGGGCTAGACCCTCTCCAGTGGGTTTGGACTGTTACAAATGGTATCAGAGCTAGACACTAAGCAGTGTGCCAGCGAGGACACTGGCCCCCAAGGGGGGTGGATTTTGAAATCCCACATCAATTGGAGAAAAGAACAAAAGAACGAAACAGTCCTTATAAGGGTGTGGAAACCTCTTCCTAATAGAGGCGTTTTAAAACCTTGAGGGGAAATCCATAAGGAAAAGCCCAAAAAAGACAATATCTGCTAGCAGTGGGCTTGGGCAGTAACACATATCGTGACAAACGGAAGCTGTCTTTTTGGTGCATTATTTGCAACAGACTAGCAGTCATGGAATGGAAATTGTGGAATAAAATTCCTTTGAGTGAAGCATTATCAGAATTTCACATTATAAATGGCTTTAGCACTGAAATTTGATCTCATCTTTCCAGTAAATTCTGGTCAGCTAACTGTTATCCATCCCAAATTATGCAGCTATCAGCTTCAAAAATACAGAAACGCAGGCTCGTTTGTTCTTCCTGTAGATCCAAAAGACAAGGAAACCTGGGGTGATAGTGAACACAGGCTGGCTTACGTGTCCGAGACAACTGTGATATTTAAGGACATCGTTCAAGTTCGGGACGATCAATGGTGGAAAGCGCTGTCTACTGGCTATGTCACACAATGGGCAGCCATGCATCCTCAAAAGATAGATGCTCTATTCTATGCTCATTCTGTTGATGAAGTTAAAGCACTAGCCCCACTTCTTGAAAAGTTCAGGTCCACTATTGGCAAGAAAGCTTATATTGTAGTGTCGGGAGGCACTTTTTGCCCATGTGAAGATGCTGCAGCTGCTCTTAAATGGCCTAAGTTGGTGTGTAAAGAACGGAGGTTCAAGATATTTGACTTAGCTATTGGGGCTCTCTCTGGCCTATCTAATTCTGAGGTGCCCGTGGTGCAAGCAGTGTATGCTAGTATGAAGGGATTGATCCAAATACACAATCCAAGCATCATCGTCACGGTAGCCGATCTCGATCCTAACGTAAAGAAGGCTTTGAAAATGGCGTCCGAGGCTAACATGAATACTACAACACTGATTCTTTTACCAAGACCTTCCATTTCAAAAGTTCTTTGGATGGCTGATCTTCGACCAACCGCAATTCCAAGTAAGAACACTACAACACTTTTTACGCTCAGTTTCTCCCGTCGAAAATTCTTGCCATAGTGTAACACTCGTGCGTTTTTTCCTTCTTCAGATTGGAACAAGATGCGGATTTCAATCAACATTATCACACAAAACCGTGCCAATTCGTTAACAAGGCTTCTCAAATCACTCAAAGATGCATATTACCTAGGGGATGAGATACCTATCAGCTTCAACATGGACAGCAAAGTGGATGAGGAAACTATCAAATTAGTAAGCTCATTTGAGTGGCCCCATGGCCCCAAAAGCCTTAGAAGGAGAATCATCCAAGGAGGGCTAATACGAGCAGTGAGCGAGAGTTGGTATCCAGCTTCAGACGACGATTACGGTCTCTTACTCGAAGATGATATCGAAGTCTCTCCATACTACTACCTATGGATCAAATATGCCCTCCTAGCATACCACTATGATCCACAAATATCTCTACCTGAGCTATCGACGATATCGCTTTACACACCTCGGCTAGTCGAAGTGGTGAAGGAAAGGCCTAAATGGAATGCAACAGAGTTTTTCCAGCGGATTCATCCAAACACACCTTACCTCCACCAGCTGCCCTGCAGCTGGGGAGCACTTTTCTTCCCCAAACATTGGAGGGAATTTTATGTTTACATGAACTCAAGATTCACAGAAAATGCCAAGGAAAATCCAGTTCAAATCCCTAAATCAAGAACAAACGGTTGGCAGGCATCATGGAAGAAGTTCCTAATCGACATGATGTACCTAAGAGGCTACGTCAGTTTGTATCCAAATTTCCCAAATCAAGCCAGTTTTTCAACAAATCACATGGAACCAGGCGCTCACATAAGTGCAAAAGACAATATCGTGAAGCATAACAAGGAAGATTTCGAGGTTCCATTACTGAAAGAAAACTTCGTGAATTTCTTACCTAACGGCAAATTGCCGGCGGCTTCGAGACTGCCGTCGCTGAACCTCTTCAATCAACCGGTTTCTCTGAAGGGACTCAAATCCGCCGGAGCCAAACTAAGGCAAGATGTTCTGAAATGCGAAGTTTCGGAGATCGTAGCGGTGAATCATCAAACTGGTCTGCCTTCGCATTGCGCAAAATTC

mRNA sequence

GTTGTGATTGTTCCGAAATCGTGTTGTGGAAGATGGGAATGTTTCGGAATCCTACGACGCGAAACGGGGAGTATTTAGAAGGGATGATAAATGAGTATGTTGGAGGAGGAAAGGGAAAGTTAAGAGCTCAAAGAAATACCTCAACAAAGCTTGTTACTGCTCTTACATGTCTCCAGTTTGCCTTTGCAATATATGCAACGTTTTTACTTTATTATGTGAGCCCTGCAATAGACTTGAGAACCAAACCAGATTTCTCTTGGGCTACTAGAATTGCTCAGCAATGGAAACAGTTTGTAATACCGCCCCACGTCGTGGGACGATATCAAGAACAAACTTCTTTTGCGGAATTCAGACCGATCACACCGGAGGAAGCTTGTGAGAACGAGAAGATTGATTTCGAGCAAAAGAAGTCGAATGATGCACAGATGATCAAGTTGAAAACAGAGCTTTATAATGAGGTTTTAGATTTTCAAAGCCAAAGCTTTGGAACTGAAACTCTTTCTCAGCTTATGGCAATGAAGTCCAAATGGGATTTGAGAGGGAAAAACAAGCCAAAAGTTACTGTGATCTTGAACCATTTCAAGAGGAAAACTCTGTGTGCACAGATTAACTCTTTGCTGCATCAAACTCTTCCTTTCCACCATGTTTGGGTGCTTGCATTTGGGAGCCCAAATGAGCTCTCATTGAAGAGAATTGTAGATAGCTATAACAACTCAAGAATTAGCTTCATTAGTTCAAGCTATGACTTCAAGTACTATGGAAGGTTCCAAATGGCCTTACAAACCGAGGCCGATTTAGTATACATTCTCGACGACGACATGATTCCTGGCCATAAAATGCTACAGATTTTGTCTCATGTAGCAGGGACGGAGAAGTACAAGAATGCAGTTTTGGGCAGTATAGGGAGGATTTTGCCATTTCGACAGAAGGATTTCACGTTCCCGAGTTATCGGAAGTTTCGTTCCAAGGAGGCGGGGCTTTACTTGCCTGATCCTGCTTATAACATCACAGTCAATAAAATTGTGCAGGTTGATTTTCTCTCCAGCTCTTGGTTCTTATCTGCAGAGCTTCTTCAAAAATACAGAAACGCAGGCTCGTTTGTTCTTCCTGTAGATCCAAAAGACAAGGAAACCTGGGGTGATAGTGAACACAGGCTGGCTTACGTGTCCGAGACAACTGTGATATTTAAGGACATCGTTCAAGTTCGGGACGATCAATGGTGGAAAGCGCTGTCTACTGGCTATGTCACACAATGGGCAGCCATGCATCCTCAAAAGATAGATGCTCTATTCTATGCTCATTCTGTTGATGAAGTTAAAGCACTAGCCCCACTTCTTGAAAAGTTCAGGTCCACTATTGGCAAGAAAGCTTATATTGTAGTGTCGGGAGGCACTTTTTGCCCATGTGAAGATGCTGCAGCTGCTCTTAAATGGCCTAAGTTGGTGTGTAAAGAACGGAGGTTCAAGATATTTGACTTAGCTATTGGGGCTCTCTCTGGCCTATCTAATTCTGAGGTGCCCGTGGTGCAAGCAGTGTATGCTAGTATGAAGGGATTGATCCAAATACACAATCCAAGCATCATCGTCACGGTAGCCGATCTCGATCCTAACGTAAAGAAGGCTTTGAAAATGGCGTCCGAGGCTAACATGAATACTACAACACTGATTCTTTTACCAAGACCTTCCATTTCAAAAGTTCTTTGGATGGCTGATCTTCGACCAACCGCAATTCCAAATTGGAACAAGATGCGGATTTCAATCAACATTATCACACAAAACCGTGCCAATTCGTTAACAAGGCTTCTCAAATCACTCAAAGATGCATATTACCTAGGGGATGAGATACCTATCAGCTTCAACATGGACAGCAAAGTGGATGAGGAAACTATCAAATTAGTAAGCTCATTTGAGTGGCCCCATGGCCCCAAAAGCCTTAGAAGGAGAATCATCCAAGGAGGGCTAATACGAGCAGTGAGCGAGAGTTGGTATCCAGCTTCAGACGACGATTACGGTCTCTTACTCGAAGATGATATCGAAGTCTCTCCATACTACTACCTATGGATCAAATATGCCCTCCTAGCATACCACTATGATCCACAAATATCTCTACCTGAGCTATCGACGATATCGCTTTACACACCTCGGCTAGTCGAAGTGGTGAAGGAAAGGCCTAAATGGAATGCAACAGAGTTTTTCCAGCGGATTCATCCAAACACACCTTACCTCCACCAGCTGCCCTGCAGCTGGGGAGCACTTTTCTTCCCCAAACATTGGAGGGAATTTTATGTTTACATGAACTCAAGATTCACAGAAAATGCCAAGGAAAATCCAGTTCAAATCCCTAAATCAAGAACAAACGGTTGGCAGGCATCATGGAAGAAGTTCCTAATCGACATGATGTACCTAAGAGGCTACGTCAGTTTGTATCCAAATTTCCCAAATCAAGCCAGTTTTTCAACAAATCACATGGAACCAGGCGCTCACATAATTTCGGAGATCGTAGCGGTGAATCATCAAACTGGTCTGCCTTCGCATTGCGCAAAATTC

Coding sequence (CDS)

ATGGGAATGTTTCGGAATCCTACGACGCGAAACGGGGAGTATTTAGAAGGGATGATAAATGAGTATGTTGGAGGAGGAAAGGGAAAGTTAAGAGCTCAAAGAAATACCTCAACAAAGCTTGTTACTGCTCTTACATGTCTCCAGTTTGCCTTTGCAATATATGCAACGTTTTTACTTTATTATGTGAGCCCTGCAATAGACTTGAGAACCAAACCAGATTTCTCTTGGGCTACTAGAATTGCTCAGCAATGGAAACAGTTTGTAATACCGCCCCACGTCGTGGGACGATATCAAGAACAAACTTCTTTTGCGGAATTCAGACCGATCACACCGGAGGAAGCTTGTGAGAACGAGAAGATTGATTTCGAGCAAAAGAAGTCGAATGATGCACAGATGATCAAGTTGAAAACAGAGCTTTATAATGAGGTTTTAGATTTTCAAAGCCAAAGCTTTGGAACTGAAACTCTTTCTCAGCTTATGGCAATGAAGTCCAAATGGGATTTGAGAGGGAAAAACAAGCCAAAAGTTACTGTGATCTTGAACCATTTCAAGAGGAAAACTCTGTGTGCACAGATTAACTCTTTGCTGCATCAAACTCTTCCTTTCCACCATGTTTGGGTGCTTGCATTTGGGAGCCCAAATGAGCTCTCATTGAAGAGAATTGTAGATAGCTATAACAACTCAAGAATTAGCTTCATTAGTTCAAGCTATGACTTCAAGTACTATGGAAGGTTCCAAATGGCCTTACAAACCGAGGCCGATTTAGTATACATTCTCGACGACGACATGATTCCTGGCCATAAAATGCTACAGATTTTGTCTCATGTAGCAGGGACGGAGAAGTACAAGAATGCAGTTTTGGGCAGTATAGGGAGGATTTTGCCATTTCGACAGAAGGATTTCACGTTCCCGAGTTATCGGAAGTTTCGTTCCAAGGAGGCGGGGCTTTACTTGCCTGATCCTGCTTATAACATCACAGTCAATAAAATTGTGCAGGTTGATTTTCTCTCCAGCTCTTGGTTCTTATCTGCAGAGCTTCTTCAAAAATACAGAAACGCAGGCTCGTTTGTTCTTCCTGTAGATCCAAAAGACAAGGAAACCTGGGGTGATAGTGAACACAGGCTGGCTTACGTGTCCGAGACAACTGTGATATTTAAGGACATCGTTCAAGTTCGGGACGATCAATGGTGGAAAGCGCTGTCTACTGGCTATGTCACACAATGGGCAGCCATGCATCCTCAAAAGATAGATGCTCTATTCTATGCTCATTCTGTTGATGAAGTTAAAGCACTAGCCCCACTTCTTGAAAAGTTCAGGTCCACTATTGGCAAGAAAGCTTATATTGTAGTGTCGGGAGGCACTTTTTGCCCATGTGAAGATGCTGCAGCTGCTCTTAAATGGCCTAAGTTGGTGTGTAAAGAACGGAGGTTCAAGATATTTGACTTAGCTATTGGGGCTCTCTCTGGCCTATCTAATTCTGAGGTGCCCGTGGTGCAAGCAGTGTATGCTAGTATGAAGGGATTGATCCAAATACACAATCCAAGCATCATCGTCACGGTAGCCGATCTCGATCCTAACGTAAAGAAGGCTTTGAAAATGGCGTCCGAGGCTAACATGAATACTACAACACTGATTCTTTTACCAAGACCTTCCATTTCAAAAGTTCTTTGGATGGCTGATCTTCGACCAACCGCAATTCCAAATTGGAACAAGATGCGGATTTCAATCAACATTATCACACAAAACCGTGCCAATTCGTTAACAAGGCTTCTCAAATCACTCAAAGATGCATATTACCTAGGGGATGAGATACCTATCAGCTTCAACATGGACAGCAAAGTGGATGAGGAAACTATCAAATTAGTAAGCTCATTTGAGTGGCCCCATGGCCCCAAAAGCCTTAGAAGGAGAATCATCCAAGGAGGGCTAATACGAGCAGTGAGCGAGAGTTGGTATCCAGCTTCAGACGACGATTACGGTCTCTTACTCGAAGATGATATCGAAGTCTCTCCATACTACTACCTATGGATCAAATATGCCCTCCTAGCATACCACTATGATCCACAAATATCTCTACCTGAGCTATCGACGATATCGCTTTACACACCTCGGCTAGTCGAAGTGGTGAAGGAAAGGCCTAAATGGAATGCAACAGAGTTTTTCCAGCGGATTCATCCAAACACACCTTACCTCCACCAGCTGCCCTGCAGCTGGGGAGCACTTTTCTTCCCCAAACATTGGAGGGAATTTTATGTTTACATGAACTCAAGATTCACAGAAAATGCCAAGGAAAATCCAGTTCAAATCCCTAAATCAAGAACAAACGGTTGGCAGGCATCATGGAAGAAGTTCCTAATCGACATGATGTACCTAAGAGGCTACGTCAGTTTGTATCCAAATTTCCCAAATCAAGCCAGTTTTTCAACAAATCACATGGAACCAGGCGCTCACATAATTTCGGAGATCGTAGCGGTGAATCATCAAACTGGTCTGCCTTCGCATTGCGCAAAATTC

Protein sequence

MGMFRNPTTRNGEYLEGMINEYVGGGKGKLRAQRNTSTKLVTALTCLQFAFAIYATFLLYYVSPAIDLRTKPDFSWATRIAQQWKQFVIPPHVVGRYQEQTSFAEFRPITPEEACENEKIDFEQKKSNDAQMIKLKTELYNEVLDFQSQSFGTETLSQLMAMKSKWDLRGKNKPKVTVILNHFKRKTLCAQINSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNSRISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGHKMLQILSHVAGTEKYKNAVLGSIGRILPFRQKDFTFPSYRKFRSKEAGLYLPDPAYNITVNKIVQVDFLSSSWFLSAELLQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDDQWWKALSTGYVTQWAAMHPQKIDALFYAHSVDEVKALAPLLEKFRSTIGKKAYIVVSGGTFCPCEDAAAALKWPKLVCKERRFKIFDLAIGALSGLSNSEVPVVQAVYASMKGLIQIHNPSIIVTVADLDPNVKKALKMASEANMNTTTLILLPRPSISKVLWMADLRPTAIPNWNKMRISINIITQNRANSLTRLLKSLKDAYYLGDEIPISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPELSTISLYTPRLVEVVKERPKWNATEFFQRIHPNTPYLHQLPCSWGALFFPKHWREFYVYMNSRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHIISEIVAVNHQTGLPSHCAKF
BLAST of Cp4.1LG02g12440 vs. TrEMBL
Match: A0A0A0LV36_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G124520 PE=4 SV=1)

HSP 1 Score: 1568.9 bits (4061), Expect = 0.0e+00
Identity = 767/854 (89.81%), Postives = 802/854 (93.91%), Query Frame = 1

Query: 1   MGMFRNPTTRNGEYLEGMINEYVGGGKGKLRAQRNTSTKLVTALTCLQFAFAIYATFLLY 60
           MGMFRNPT  NG+ +EGMI +YVGG KGKLR QR++STK+V  LTCLQFAFA+YATFLLY
Sbjct: 1   MGMFRNPTMGNGDCIEGMIKDYVGG-KGKLRPQRSSSTKIVAGLTCLQFAFALYATFLLY 60

Query: 61  YVSPAIDLRTKPDFSWATRIAQQWKQFVIPPHVVGRYQEQTSF---AEFRPITPEEACEN 120
           YVSPAIDLRTKPDFSWATRIAQQWKQFVIPPHVVGRYQE  S    AE RPITPEEACEN
Sbjct: 61  YVSPAIDLRTKPDFSWATRIAQQWKQFVIPPHVVGRYQEPNSMMMQAELRPITPEEACEN 120

Query: 121 EKIDFEQKKSNDAQMIKLKTELYNEVLDFQSQSFGTETLSQLMAMKSKWDLRGKNKPKVT 180
           EKIDFEQKKSND QMIKLKTELYNE+LDFQS+SFGTETLSQLMAMKSKWDL+G NKPKVT
Sbjct: 121 EKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVT 180

Query: 181 VILNHFKRKTLCAQINSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNSRISFISSSY 240
           VILNHFKRKTLCAQ+NSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNS+ISFISSSY
Sbjct: 181 VILNHFKRKTLCAQLNSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSY 240

Query: 241 DFKYYGRFQMALQTEADLVYILDDDMIPGHKMLQILSHVAGTEKYKNAVLGSIGRILPFR 300
           DFKYYGRFQMALQTEADLVYILDDDMIPG KMLQILSHVAGT+KYKN+VLGSIGRILPFR
Sbjct: 241 DFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNSVLGSIGRILPFR 300

Query: 301 QKDFTFPSYRKFRSKEAGLYLPDPAYNITVNKIVQVDFLSSSWFLSAEL----------- 360
           QKDFTFPSYRKFRSKEAGLYLPDPAY+ITVNKIVQVDFLSSSWFLSAEL           
Sbjct: 301 QKDFTFPSYRKFRSKEAGLYLPDPAYDITVNKIVQVDFLSSSWFLSAELVKTLFIETPFT 360

Query: 361 ------------LQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDD 420
                       LQKYR+AGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDD
Sbjct: 361 FATGEDLHLSYQLQKYRDAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDD 420

Query: 421 QWWKALSTGYVTQWAAMHPQKIDALFYAHSVDEVKALAPLLEKFRSTIGKKAYIVVSGGT 480
           QWWKALSTGY+TQWAAMHPQKIDALFYAHSVDE KALAPLLEKFRST+GKKAYIVVSGG 
Sbjct: 421 QWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGN 480

Query: 481 FCPCEDAAAALKWPKLVCKERRFKIFDLAIGALSGLSNSEVPVVQAVYASMKGLIQIHNP 540
           FCPCED A ALKWPKLVCKERRFKIFDLAIGALSG+SNSEVPVVQAVYASMKGLI+IHNP
Sbjct: 481 FCPCEDVADALKWPKLVCKERRFKIFDLAIGALSGISNSEVPVVQAVYASMKGLIKIHNP 540

Query: 541 SIIVTVADLDPNVKKALKMASEANMNTTTLILLPRPSISKVLWMADLRPTAIPNWNKMRI 600
           S+I+TVAD+DPNVKKALKMASEAN+N TTL+LLPRPSISKVLWMA+LR TA+PNWNKMRI
Sbjct: 541 SVIITVADIDPNVKKALKMASEANLNGTTLVLLPRPSISKVLWMANLRSTALPNWNKMRI 600

Query: 601 SINIITQNRANSLTRLLKSLKDAYYLGDEIPISFNMDSKVDEETIKLVSSFEWPHGPKSL 660
           SINIITQNRA+SLTRLLKSLKDAYYLGDEIPISFNMDSKVDEETIKLVSSFEWPHGPKSL
Sbjct: 601 SINIITQNRASSLTRLLKSLKDAYYLGDEIPISFNMDSKVDEETIKLVSSFEWPHGPKSL 660

Query: 661 RRRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPE 720
           RRRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPE
Sbjct: 661 RRRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPE 720

Query: 721 LSTISLYTPRLVEVVKERPKWNATEFFQRIHPNTPYLHQLPCSWGALFFPKHWREFYVYM 780
           LS+ISLYTPRLVEVVKERPKWNATEFF+RIHPNTPYLHQLPCSWGA+FFPKHWREFYVYM
Sbjct: 721 LSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYM 780

Query: 781 NSRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEP 829
           NSRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFP+QASFSTNHMEP
Sbjct: 781 NSRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPDQASFSTNHMEP 840

BLAST of Cp4.1LG02g12440 vs. TrEMBL
Match: B9HQB9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0009s01610g PE=4 SV=2)

HSP 1 Score: 1485.3 bits (3844), Expect = 0.0e+00
Identity = 715/852 (83.92%), Postives = 780/852 (91.55%), Query Frame = 1

Query: 1   MGMFRNPTTRNGEYLEGMINEYVGGGKGKLRAQRNTSTKLVTALTCLQFAFAIYATFLLY 60
           MG+ RN T ++G+YLEGM+++YVGG K K + QR++S +LVTALTCLQFAFA+YATFLLY
Sbjct: 1   MGLIRNSTMKSGDYLEGMLSDYVGG-KAKSKVQRSSSARLVTALTCLQFAFAVYATFLLY 60

Query: 61  YVSPAIDLRTKPDFSWATRIAQQWKQFVIPPHVVGRYQEQTSF--AEFRPITPEEACENE 120
           Y+SP IDLRTKPDF+WATRIAQQWK F+IPPHV+GRYQE  S   AE  PI P E CE+E
Sbjct: 61  YMSPTIDLRTKPDFAWATRIAQQWKHFIIPPHVLGRYQEAASLVTAEIGPINPSEVCEHE 120

Query: 121 KIDFEQKKSNDAQMIKLKTELYNEVLDFQSQSFGTETLSQLMAMKSKWDLRGKNKPKVTV 180
           KIDF+QKKSNDAQMIKLK ELY+EVLDFQS+S GTETLS+LMAMKSKWDLRG NKP+VTV
Sbjct: 121 KIDFQQKKSNDAQMIKLKRELYDEVLDFQSKSTGTETLSELMAMKSKWDLRGPNKPRVTV 180

Query: 181 ILNHFKRKTLCAQINSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNSRISFISSSYD 240
           ILNHFKRKTLCAQ++SLLHQTLPFHHVWVL+FGSPNELSLKRIV+SYN+SRISFISSSYD
Sbjct: 181 ILNHFKRKTLCAQLDSLLHQTLPFHHVWVLSFGSPNELSLKRIVNSYNDSRISFISSSYD 240

Query: 241 FKYYGRFQMALQTEADLVYILDDDMIPGHKMLQILSHVAGTEKYKNAVLGSIGRILPFRQ 300
           FKYYGRFQMALQTEADLVYI+DDDMIPG KMLQILSHVAGTEKYKN+VLGSIGRILPFRQ
Sbjct: 241 FKYYGRFQMALQTEADLVYIVDDDMIPGRKMLQILSHVAGTEKYKNSVLGSIGRILPFRQ 300

Query: 301 KDFTFPSYRKFRSKEAGLYLPDPAYNITVNKIVQVDFLSSSWFLSAEL------------ 360
           KDFTFPSYRKFRSKEAGLYLPDPAY+ITV+KIVQVDFLSSSWFLSAEL            
Sbjct: 301 KDFTFPSYRKFRSKEAGLYLPDPAYDITVDKIVQVDFLSSSWFLSAELVKTLFIEAPMTF 360

Query: 361 -----------LQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDDQ 420
                      LQKYRNAGSFVLPVDP DKETWGDSEHRLAYVSETTVIFKDIVQVRDDQ
Sbjct: 361 MTGEDLHLSYQLQKYRNAGSFVLPVDPNDKETWGDSEHRLAYVSETTVIFKDIVQVRDDQ 420

Query: 421 WWKALSTGYVTQWAAMHPQKIDALFYAHSVDEVKALAPLLEKFRSTIGKKAYIVVSGGTF 480
           WWKA STGYVTQWAAMHPQKIDALFYAHSVDEVKALAPL+EKFRST+GKKAYIVVSGG F
Sbjct: 421 WWKAFSTGYVTQWAAMHPQKIDALFYAHSVDEVKALAPLIEKFRSTVGKKAYIVVSGGNF 480

Query: 481 CPCEDAAAALKWPKLVCKERRFKIFDLAIGALSGLSNSEVPVVQAVYASMKGLIQIHNPS 540
           CPCEDAA AL WPK+VCKERRFKIFDLA+ A + +SNSEVPV+QAVY+S+KGLI+IHNPS
Sbjct: 481 CPCEDAATALNWPKIVCKERRFKIFDLAVAAQTEISNSEVPVIQAVYSSVKGLIKIHNPS 540

Query: 541 IIVTVADLDPNVKKALKMASEANMNTTTLILLPRPSISKVLWMADLRPTAIPNWNKMRIS 600
           +++ V D+DPNVKKALKMA+E N N TT++LLPRPSISKVLWMADLR TA+PNWNKMRIS
Sbjct: 541 VLIAVNDIDPNVKKALKMATETNTNGTTMVLLPRPSISKVLWMADLRSTALPNWNKMRIS 600

Query: 601 INIITQNRANSLTRLLKSLKDAYYLGDEIPISFNMDSKVDEETIKLVSSFEWPHGPKSLR 660
           +NIITQNRA SLTRLLKSL DAYY+GDEIPISFN+DSKVDEETI+LVSSF WPHGPK+LR
Sbjct: 601 VNIITQNRAPSLTRLLKSLSDAYYVGDEIPISFNVDSKVDEETIRLVSSFNWPHGPKTLR 660

Query: 661 RRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPEL 720
           RRIIQGGLIRAVSESWYP+SDDDYGLLLEDDIEVSP+YYLWIKYALLAYHYDPQ+SLPEL
Sbjct: 661 RRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPFYYLWIKYALLAYHYDPQVSLPEL 720

Query: 721 STISLYTPRLVEVVKERPKWNATEFFQRIHPNTPYLHQLPCSWGALFFPKHWREFYVYMN 780
           S+ISLYTP+LVEVVKERP+WNATEFF+RIHPNTPYLHQLPCSWGA+FFPK WREFYVYMN
Sbjct: 721 SSISLYTPKLVEVVKERPRWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKQWREFYVYMN 780

Query: 781 SRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPG 828
            RFTE+AK NPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPG
Sbjct: 781 MRFTEDAKANPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPG 840

BLAST of Cp4.1LG02g12440 vs. TrEMBL
Match: A0A0D2R2N2_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_004G230400 PE=4 SV=1)

HSP 1 Score: 1479.2 bits (3828), Expect = 0.0e+00
Identity = 711/852 (83.45%), Postives = 777/852 (91.20%), Query Frame = 1

Query: 1   MGMFRNPTTRNGEYLEGMINEYVGGGKGKLRAQRNTSTKLVTALTCLQFAFAIYATFLLY 60
           MG+ RNP  R+G+ LEGM+++YVGG K K +A +N S++LVTALTCLQFAFA+YATFLLY
Sbjct: 1   MGLVRNPNMRSGDMLEGMLSDYVGG-KAKAKAPKNASSRLVTALTCLQFAFAVYATFLLY 60

Query: 61  YVSPAIDLRTKPDFSWATRIAQQWKQFVIPPHVVGRYQEQTSF--AEFRPITPEEACENE 120
           Y+SPAIDLRTKPDF+WATRIA+  KQF+IPPHV+GRYQE  SF  AE  PITP   CE E
Sbjct: 61  YMSPAIDLRTKPDFAWATRIARNMKQFIIPPHVLGRYQEAASFIRAEIPPITPSTICETE 120

Query: 121 KIDFEQKKSNDAQMIKLKTELYNEVLDFQSQSFGTETLSQLMAMKSKWDLRGKNKPKVTV 180
           K+DF QKKSND QMIKLK ELY+E+LDFQS++ GTETL+QLMAMKSKWD+RG NKPKVTV
Sbjct: 121 KLDFMQKKSNDVQMIKLKRELYDEILDFQSKTIGTETLAQLMAMKSKWDIRGPNKPKVTV 180

Query: 181 ILNHFKRKTLCAQINSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNSRISFISSSYD 240
           +LNHFKRKTLCAQ++SLLHQTLPFHHVWV++FGSPNE SLKRIV++YN+SRISFISSSYD
Sbjct: 181 LLNHFKRKTLCAQLDSLLHQTLPFHHVWVISFGSPNEQSLKRIVETYNDSRISFISSSYD 240

Query: 241 FKYYGRFQMALQTEADLVYILDDDMIPGHKMLQILSHVAGTEKYKNAVLGSIGRILPFRQ 300
           FKYYGRFQMALQTEADLVYILDDDMIPG KMLQILSHVAGTEKYKN+VLGSIGRILPFRQ
Sbjct: 241 FKYYGRFQMALQTEADLVYILDDDMIPGKKMLQILSHVAGTEKYKNSVLGSIGRILPFRQ 300

Query: 301 KDFTFPSYRKFRSKEAGLYLPDPAYNITVNKIVQVDFLSSSWFLSAEL------------ 360
           KDFTFPSYRKFRSKEAGLYLPDPAY+ITV+KIVQVDFLSSSWFLS+EL            
Sbjct: 301 KDFTFPSYRKFRSKEAGLYLPDPAYDITVDKIVQVDFLSSSWFLSSELVKSLFIETPFTF 360

Query: 361 -----------LQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDDQ 420
                      LQKYRNAGSFVLPVDP DKETWGDSEHRLAYVSETTVIFKDIVQVRDDQ
Sbjct: 361 MTGEDLHLSYQLQKYRNAGSFVLPVDPTDKETWGDSEHRLAYVSETTVIFKDIVQVRDDQ 420

Query: 421 WWKALSTGYVTQWAAMHPQKIDALFYAHSVDEVKALAPLLEKFRSTIGKKAYIVVSGGTF 480
           WW+ALS GYVTQWAAM+PQKIDAL YAHS+DEVKALAPLLEKFRST+GKKAYIVV GG+F
Sbjct: 421 WWRALSAGYVTQWAAMYPQKIDALLYAHSLDEVKALAPLLEKFRSTVGKKAYIVVPGGSF 480

Query: 481 CPCEDAAAALKWPKLVCKERRFKIFDLAIGALSGLSNSEVPVVQAVYASMKGLIQIHNPS 540
           CPCEDAAAAL WPKLVC+ERRFKIFDL IGALSG SNSEVPVVQAVY+SMKGLI+IHNPS
Sbjct: 481 CPCEDAAAALNWPKLVCRERRFKIFDLQIGALSGTSNSEVPVVQAVYSSMKGLIKIHNPS 540

Query: 541 IIVTVADLDPNVKKALKMASEANMNTTTLILLPRPSISKVLWMADLRPTAIPNWNKMRIS 600
           +++TV D+DPNVKKALKMA+E N+N T L+LLPRPS+SKVLWMADLR TA+PNWN+MR+S
Sbjct: 541 VVITVTDIDPNVKKALKMATETNVNGTALVLLPRPSVSKVLWMADLRSTALPNWNRMRVS 600

Query: 601 INIITQNRANSLTRLLKSLKDAYYLGDEIPISFNMDSKVDEETIKLVSSFEWPHGPKSLR 660
           +NIITQNRA+SLTRLLKSL DAYY+GDEIPISFNMDSKVDE TIKLV SFEWPHGPK+LR
Sbjct: 601 VNIITQNRASSLTRLLKSLSDAYYIGDEIPISFNMDSKVDEATIKLVESFEWPHGPKTLR 660

Query: 661 RRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPEL 720
           RRIIQGGLIRAVSESWYP SDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPEL
Sbjct: 661 RRIIQGGLIRAVSESWYPTSDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPEL 720

Query: 721 STISLYTPRLVEVVKERPKWNATEFFQRIHPNTPYLHQLPCSWGALFFPKHWREFYVYMN 780
           S+ISLYTPR+VEVVKERPKWN T+FF+RIHPNTPYLHQLPCSWGA+FFPKHWREFYVYMN
Sbjct: 721 SSISLYTPRIVEVVKERPKWNPTDFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMN 780

Query: 781 SRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPG 828
            RFTE+AK NPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPG
Sbjct: 781 MRFTEDAKSNPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPG 840

BLAST of Cp4.1LG02g12440 vs. TrEMBL
Match: M5WLD4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001021mg PE=4 SV=1)

HSP 1 Score: 1478.0 bits (3825), Expect = 0.0e+00
Identity = 708/851 (83.20%), Postives = 779/851 (91.54%), Query Frame = 1

Query: 2   GMFRNPTTRNGEYLEGMINEYVGGGKGKLRAQRNTSTKLVTALTCLQFAFAIYATFLLYY 61
           G+ RN   R+G+YLEGM+N+YVGG K KL+A ++TS +LVTALTCLQFAFA+YATFLLYY
Sbjct: 3   GLARNQNARSGDYLEGMLNDYVGG-KAKLKAHKSTSARLVTALTCLQFAFAVYATFLLYY 62

Query: 62  VSPAIDLRTKPDFSWATRIAQQWKQFVIPPHVVGRYQEQTSF--AEFRPITPEEACENEK 121
           +SP+IDLRTKPDF+WAT+IAQQWK F+IPPH++  YQ  +S   AE +PITP + CE EK
Sbjct: 63  MSPSIDLRTKPDFAWATKIAQQWKHFIIPPHILNHYQVSSSLVGAEIQPITPSDVCEQEK 122

Query: 122 IDFEQKKSNDAQMIKLKTELYNEVLDFQSQSFGTETLSQLMAMKSKWDLRGKNKPKVTVI 181
           IDF QKKSNDAQMIKLKTELY EVLDFQS+S GTETL+QLMAMKSKWDL+G N+PK+TVI
Sbjct: 123 IDFMQKKSNDAQMIKLKTELYKEVLDFQSKSIGTETLAQLMAMKSKWDLKGPNRPKITVI 182

Query: 182 LNHFKRKTLCAQINSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNSRISFISSSYDF 241
           LNHFKRKTLCAQ+++L  QTLPFHHVWVL+FGSPNELSLKRIVDSYN+SRISFISSSYDF
Sbjct: 183 LNHFKRKTLCAQLDTLHEQTLPFHHVWVLSFGSPNELSLKRIVDSYNDSRISFISSSYDF 242

Query: 242 KYYGRFQMALQTEADLVYILDDDMIPGHKMLQILSHVAGTEKYKNAVLGSIGRILPFRQK 301
           KYYGRFQMALQTEADLVYILDDDMIPG KMLQILSHVAGTEKYKNAVLGSIGRILPFRQK
Sbjct: 243 KYYGRFQMALQTEADLVYILDDDMIPGKKMLQILSHVAGTEKYKNAVLGSIGRILPFRQK 302

Query: 302 DFTFPSYRKFRSKEAGLYLPDPAYNITVNKIVQVDFLSSSWFLSAEL------------- 361
           DFTFPSYRKFRSKEAGLYLPDPAY+IT++KIVQVDFLSSSWFLSAEL             
Sbjct: 303 DFTFPSYRKFRSKEAGLYLPDPAYDITLDKIVQVDFLSSSWFLSAELVKTLFIETPFTFS 362

Query: 362 ----------LQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDDQW 421
                     LQKYRNAGSFVLPVDPKD+ETWGDSEHRLAYVSETTVIFKDIVQVRDDQW
Sbjct: 363 TGEDLHLSYQLQKYRNAGSFVLPVDPKDRETWGDSEHRLAYVSETTVIFKDIVQVRDDQW 422

Query: 422 WKALSTGYVTQWAAMHPQKIDALFYAHSVDEVKALAPLLEKFRSTIGKKAYIVVSGGTFC 481
           WKALSTGY+TQWAAM+PQKIDALFYAHSVDEVKALAPLLEKFRST+GKKAYI VSGG +C
Sbjct: 423 WKALSTGYITQWAAMYPQKIDALFYAHSVDEVKALAPLLEKFRSTVGKKAYIAVSGGNYC 482

Query: 482 PCEDAAAALKWPKLVCKERRFKIFDLAIGALSGLSNSEVPVVQAVYASMKGLIQIHNPSI 541
            CEDAA ALKWP+LVCKERRFKIFDLA+GALSG+SNSEV V+Q VY+SMKGLI+IHNPS+
Sbjct: 483 ACEDAATALKWPQLVCKERRFKIFDLAVGALSGVSNSEVVVLQGVYSSMKGLIKIHNPSV 542

Query: 542 IVTVADLDPNVKKALKMASEANMNTTTLILLPRPSISKVLWMADLRPTAIPNWNKMRISI 601
           ++TVAD+DPNVKKALKMA+E N+N TTL+LLPRPSI KVLWMADLR TA+PNWN+MRISI
Sbjct: 543 VITVADIDPNVKKALKMATETNLNATTLVLLPRPSIPKVLWMADLRTTALPNWNRMRISI 602

Query: 602 NIITQNRANSLTRLLKSLKDAYYLGDEIPISFNMDSKVDEETIKLVSSFEWPHGPKSLRR 661
           NIITQNR +SLTRLLKSL DAYYLGDE+PISFNMDSKVDE T++LVSSFEWPHGPK+L+R
Sbjct: 603 NIITQNRVHSLTRLLKSLSDAYYLGDEVPISFNMDSKVDEATVRLVSSFEWPHGPKTLKR 662

Query: 662 RIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPELS 721
           RIIQGGLIRAVSESWYP+SDDD+GLLLEDDIEVSPYYYLWIKYALLAYHYDPQ+SLPELS
Sbjct: 663 RIIQGGLIRAVSESWYPSSDDDFGLLLEDDIEVSPYYYLWIKYALLAYHYDPQVSLPELS 722

Query: 722 TISLYTPRLVEVVKERPKWNATEFFQRIHPNTPYLHQLPCSWGALFFPKHWREFYVYMNS 781
           +ISLYTPRLVEVVKERPKWN TEFF++IHPNTPY HQLPCSWGA+FFPK WREFYVYMN 
Sbjct: 723 SISLYTPRLVEVVKERPKWNPTEFFKKIHPNTPYFHQLPCSWGAVFFPKQWREFYVYMNM 782

Query: 782 RFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGA 828
           RFTE+AK+NPVQIPKSRTNGWQASWKKFLIDMMYLRGYV+LYPNFPNQASFSTNHMEPGA
Sbjct: 783 RFTEDAKKNPVQIPKSRTNGWQASWKKFLIDMMYLRGYVTLYPNFPNQASFSTNHMEPGA 842

BLAST of Cp4.1LG02g12440 vs. TrEMBL
Match: A0A067LHF7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_05502 PE=4 SV=1)

HSP 1 Score: 1473.8 bits (3814), Expect = 0.0e+00
Identity = 708/852 (83.10%), Postives = 774/852 (90.85%), Query Frame = 1

Query: 1   MGMFRNPTTRNGEYLEGMINEYVGGGKGKLRAQRNTSTKLVTALTCLQFAFAIYATFLLY 60
           MG+ RN + +NG+YLEGMINEYVGG + KL+A ++ S +LVTALTCLQFAFA+YATFLLY
Sbjct: 1   MGIIRNSSMKNGDYLEGMINEYVGG-RAKLKANKSISARLVTALTCLQFAFAVYATFLLY 60

Query: 61  YVSPAIDLRTKPDFSWATRIAQQWKQFVIPPHVVGRYQEQTSFA--EFRPITPEEACENE 120
           Y+SPA+DLRTKPDF+WATR AQ WK+F++PPHV+GRYQ+  S    E +PI P E CE+E
Sbjct: 61  YMSPAVDLRTKPDFAWATRFAQHWKEFIVPPHVIGRYQDSASLVGTEIQPINPSEVCEHE 120

Query: 121 KIDFEQKKSNDAQMIKLKTELYNEVLDFQSQSFGTETLSQLMAMKSKWDLRGKNKPKVTV 180
           KIDFEQKKS+D QMIKLKTELY E+LDFQS+S GTETLS+LM+MKSKWDL G NKPKVTV
Sbjct: 121 KIDFEQKKSSDVQMIKLKTELYKEILDFQSKSIGTETLSELMSMKSKWDLHGPNKPKVTV 180

Query: 181 ILNHFKRKTLCAQINSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNSRISFISSSYD 240
           ILNHFKRKTLCAQ++SLL QTLPFHHVWVLAFGSPNE+SLKRIV SYN+SRISFISSSYD
Sbjct: 181 ILNHFKRKTLCAQLDSLLQQTLPFHHVWVLAFGSPNEVSLKRIVQSYNDSRISFISSSYD 240

Query: 241 FKYYGRFQMALQTEADLVYILDDDMIPGHKMLQILSHVAGTEKYKNAVLGSIGRILPFRQ 300
           FKYYGRFQMALQTEADLVYI+DDDMIPG KMLQILSHVAGTEKYKN+VLGSIGRILPFRQ
Sbjct: 241 FKYYGRFQMALQTEADLVYIVDDDMIPGKKMLQILSHVAGTEKYKNSVLGSIGRILPFRQ 300

Query: 301 KDFTFPSYRKFRSKEAGLYLPDPAYNITVNKIVQVDFLSSSWFLSAEL------------ 360
           KDFTFPSYRKFRSKEAGLYLPDPAY+IT++KIVQVDFLSSSWFLSAEL            
Sbjct: 301 KDFTFPSYRKFRSKEAGLYLPDPAYDITIDKIVQVDFLSSSWFLSAELVKTLFVEAPMTF 360

Query: 361 -----------LQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDDQ 420
                      LQKYRNAGSFVLPVDP DKETWGDSEHRLAYVSETTVIFKDIVQVRDDQ
Sbjct: 361 MTGEDLHLSYQLQKYRNAGSFVLPVDPNDKETWGDSEHRLAYVSETTVIFKDIVQVRDDQ 420

Query: 421 WWKALSTGYVTQWAAMHPQKIDALFYAHSVDEVKALAPLLEKFRSTIGKKAYIVVSGGTF 480
           WWKALSTGYVTQWAAM+PQKIDALFYAHSVDEVKALAPLLEKFRST+GKKAYIVVSGG F
Sbjct: 421 WWKALSTGYVTQWAAMYPQKIDALFYAHSVDEVKALAPLLEKFRSTVGKKAYIVVSGGNF 480

Query: 481 CPCEDAAAALKWPKLVCKERRFKIFDLAIGALSGLSNSEVPVVQAVYASMKGLIQIHNPS 540
           CPCEDAA AL WPKLVCKERRFKIFDL +G LSG+SNSEVPVVQAVY+SMKGLI+IHNPS
Sbjct: 481 CPCEDAATALNWPKLVCKERRFKIFDLDVGKLSGISNSEVPVVQAVYSSMKGLIKIHNPS 540

Query: 541 IIVTVADLDPNVKKALKMASEANMNTTTLILLPRPSISKVLWMADLRPTAIPNWNKMRIS 600
           +++TVAD+DP+VKKALKMA+E + N TT++LLPR SISKVLWMADLR TA+PNWNKMRIS
Sbjct: 541 VVITVADIDPDVKKALKMATETSTNVTTMVLLPRTSISKVLWMADLRSTALPNWNKMRIS 600

Query: 601 INIITQNRANSLTRLLKSLKDAYYLGDEIPISFNMDSKVDEETIKLVSSFEWPHGPKSLR 660
           +NIITQNR+ SLTRLL SL++AYYLGDEIPISFNMDSKVD  TI+LV+SF W HGPK+LR
Sbjct: 601 VNIITQNRSPSLTRLLNSLRNAYYLGDEIPISFNMDSKVDAATIRLVNSFNWTHGPKTLR 660

Query: 661 RRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPEL 720
           RRIIQGGLIRAVSESWYPASDDD+GLLLEDDIEVSPYYYLWIKYALLAYHYDPQ+S PEL
Sbjct: 661 RRIIQGGLIRAVSESWYPASDDDFGLLLEDDIEVSPYYYLWIKYALLAYHYDPQVSFPEL 720

Query: 721 STISLYTPRLVEVVKERPKWNATEFFQRIHPNTPYLHQLPCSWGALFFPKHWREFYVYMN 780
           S+ISLYTP+LVEVVKERPKWN TEFF+RIHPNTPYLHQLPCSWGA+FFPK WREFYVYMN
Sbjct: 721 SSISLYTPKLVEVVKERPKWNPTEFFKRIHPNTPYLHQLPCSWGAVFFPKQWREFYVYMN 780

Query: 781 SRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPG 828
            RFTE+AK NPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPG
Sbjct: 781 MRFTEDAKANPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPG 840

BLAST of Cp4.1LG02g12440 vs. TAIR10
Match: AT5G60700.1 (AT5G60700.1 glycosyltransferase family protein 2)

HSP 1 Score: 1024.6 bits (2648), Expect = 3.4e-299
Identity = 483/588 (82.14%), Postives = 534/588 (90.82%), Query Frame = 1

Query: 263 MIPGHKMLQILSHVAGTEKYKNAVLGSIGRILPFRQKDFTFPSYRKFRSKEAGLYLPDPA 322
           MIPG KMLQ+LSHVAGTEKY+N+VLGSIGRILPFRQKDFTFPSYRKFRSKEAGLYLPDPA
Sbjct: 1   MIPGKKMLQMLSHVAGTEKYENSVLGSIGRILPFRQKDFTFPSYRKFRSKEAGLYLPDPA 60

Query: 323 YNITVNKIVQVDFLSSSWFLSAEL-----------------------LQKYRNAGSFVLP 382
           Y+IT+++I+QVDFLSSSWFLSAEL                       LQKYRNAGSFVLP
Sbjct: 61  YDITLDRILQVDFLSSSWFLSAELVKALFIEKPFTFSTGEDLHLSYQLQKYRNAGSFVLP 120

Query: 383 VDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDDQWWKALSTGYVTQWAAMHPQKIDAL 442
           VDP DKETWGDSEHRLAYVSETTVIFK+IV+VRD+QWWKALSTGYVTQWAAM+PQKIDAL
Sbjct: 121 VDPNDKETWGDSEHRLAYVSETTVIFKNIVEVRDNQWWKALSTGYVTQWAAMYPQKIDAL 180

Query: 443 FYAHSVDEVKALAPLLEKFRSTIGKKAYIVVSGGTFCPCEDAAAALKWPKLVCKERRFKI 502
           FYAHS+DEVKAL PLLEKFR T+GKKAYI VSGG FCPCEDAA+AL+WPK+VCKERRFKI
Sbjct: 181 FYAHSIDEVKALGPLLEKFRGTVGKKAYIAVSGGKFCPCEDAASALRWPKVVCKERRFKI 240

Query: 503 FDLAIGALSGLSNSEVPVVQAVYASMKGLIQIHNPSIIVTVADLDPNVKKALKMASEANM 562
           FDL +GA+ G+SNSEVPV QAVY+SMKGLI+IHNPS+++TVAD DPNVKKALKMA+E N 
Sbjct: 241 FDLEVGAILGVSNSEVPVFQAVYSSMKGLIKIHNPSVVITVADADPNVKKALKMATETNS 300

Query: 563 NTTTLILLPRPSISKVLWMADLRPTAIPNWNKMRISINIITQNRANSLTRLLKSLKDAYY 622
           N T L+LLPR SISKVLWMADLR TA+PNWNKMR+S+NIITQNRA SL RLL+SL +AYY
Sbjct: 301 NGTALVLLPRASISKVLWMADLRSTALPNWNKMRVSVNIITQNRAQSLLRLLRSLSNAYY 360

Query: 623 LGDEIPISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPASDDDY 682
           LGDEI +SFNMDSKVDEETI +VS+F+WPHGPK+LRRRIIQGGLIRAVSESWYPASDDD+
Sbjct: 361 LGDEISLSFNMDSKVDEETINVVSTFDWPHGPKTLRRRIIQGGLIRAVSESWYPASDDDF 420

Query: 683 GLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPELSTISLYTPRLVEVVKERPKWNATE 742
           GLLLEDDIEVSPYY+LWIKYALLAYHYDPQ+S PELS+ISLYTP++VEVVKERPKWN T+
Sbjct: 421 GLLLEDDIEVSPYYFLWIKYALLAYHYDPQVSFPELSSISLYTPKIVEVVKERPKWNPTD 480

Query: 743 FFQRIHPNTPYLHQLPCSWGALFFPKHWREFYVYMNSRFTENAKENPVQIPKSRTNGWQA 802
           FF++IHP+TPYLHQLPCSWGA+FFPK WREFYVYMN RFTENAK NPVQIPKSRTNGWQA
Sbjct: 481 FFKQIHPHTPYLHQLPCSWGAVFFPKQWREFYVYMNMRFTENAKANPVQIPKSRTNGWQA 540

Query: 803 SWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHIISEIVAVNH 828
           SWKKFLIDMMYLRGYVSLYPNFPNQ+SFSTNHMEPGAHI ++   V H
Sbjct: 541 SWKKFLIDMMYLRGYVSLYPNFPNQSSFSTNHMEPGAHIAAKDNVVKH 588

BLAST of Cp4.1LG02g12440 vs. TAIR10
Match: AT5G12260.1 (AT5G12260.1 BEST Arabidopsis thaliana protein match is: glycosyltransferase family protein 2 (TAIR:AT5G60700.1))

HSP 1 Score: 125.6 bits (314), Expect = 1.5e-28
Identity = 84/266 (31.58%), Postives = 132/266 (49.62%), Query Frame = 1

Query: 576 INIITQNRANSLTRLLKSLKDAYY--LGDEIPIS-------FNM---DSKVDE------E 635
           I ++T NR +SL+R L+SL  A Y   GD   I        FN+   D+ V++      E
Sbjct: 73  IKVLTFNRLHSLSRCLRSLSAADYGVSGDRGRIHLHVYIDHFNLARNDTPVEDNLQIARE 132

Query: 636 TIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWI 695
            +  V  FEW  G K +  R    GL     E+W+P SD ++  ++EDD+EVSP YY  +
Sbjct: 133 ILGFVDRFEWRFGEKVVHYRTDNAGLQAQWLEAWWPISDHEFAFVVEDDLEVSPLYYGIL 192

Query: 696 KYALLAYHYDPQISLPELSTISLYTPRLVEVVKERPKWNATEFFQRIHPNTP-YLHQLPC 755
           +  +L Y+YD     P +   SL  PR V      P  +  +    + P T   L+QL  
Sbjct: 193 ERLILKYYYDTSNFNPSIYGASLQRPRFV------PGKHGNKL--HVDPKTNLILYQLVG 252

Query: 756 SWGALFFPKHWREFYVYMNSRFTENAKENPVQIPKSRTNGW-----QASWKKFLIDMMYL 815
           +WG L FPK W+EF ++ +   ++  K     +    +NGW     +  W  + I  ++ 
Sbjct: 253 TWGQLLFPKPWKEFRLWYDEHKSKGKKP---FLDGMVSNGWYKRLGERIWTPWFIKFVHS 312

Query: 816 RGYVSLYPNFPNQASFSTNHMEPGAH 818
           RGY ++Y +FPN+ + S +H + G +
Sbjct: 313 RGYFNIYTSFPNEGALSVSHRDAGVN 327

BLAST of Cp4.1LG02g12440 vs. NCBI nr
Match: gi|778659191|ref|XP_004139558.2| (PREDICTED: uncharacterized protein LOC101202906 [Cucumis sativus])

HSP 1 Score: 1568.9 bits (4061), Expect = 0.0e+00
Identity = 767/854 (89.81%), Postives = 802/854 (93.91%), Query Frame = 1

Query: 1   MGMFRNPTTRNGEYLEGMINEYVGGGKGKLRAQRNTSTKLVTALTCLQFAFAIYATFLLY 60
           MGMFRNPT  NG+ +EGMI +YVGG KGKLR QR++STK+V  LTCLQFAFA+YATFLLY
Sbjct: 1   MGMFRNPTMGNGDCIEGMIKDYVGG-KGKLRPQRSSSTKIVAGLTCLQFAFALYATFLLY 60

Query: 61  YVSPAIDLRTKPDFSWATRIAQQWKQFVIPPHVVGRYQEQTSF---AEFRPITPEEACEN 120
           YVSPAIDLRTKPDFSWATRIAQQWKQFVIPPHVVGRYQE  S    AE RPITPEEACEN
Sbjct: 61  YVSPAIDLRTKPDFSWATRIAQQWKQFVIPPHVVGRYQEPNSMMMQAELRPITPEEACEN 120

Query: 121 EKIDFEQKKSNDAQMIKLKTELYNEVLDFQSQSFGTETLSQLMAMKSKWDLRGKNKPKVT 180
           EKIDFEQKKSND QMIKLKTELYNE+LDFQS+SFGTETLSQLMAMKSKWDL+G NKPKVT
Sbjct: 121 EKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVT 180

Query: 181 VILNHFKRKTLCAQINSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNSRISFISSSY 240
           VILNHFKRKTLCAQ+NSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNS+ISFISSSY
Sbjct: 181 VILNHFKRKTLCAQLNSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSY 240

Query: 241 DFKYYGRFQMALQTEADLVYILDDDMIPGHKMLQILSHVAGTEKYKNAVLGSIGRILPFR 300
           DFKYYGRFQMALQTEADLVYILDDDMIPG KMLQILSHVAGT+KYKN+VLGSIGRILPFR
Sbjct: 241 DFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNSVLGSIGRILPFR 300

Query: 301 QKDFTFPSYRKFRSKEAGLYLPDPAYNITVNKIVQVDFLSSSWFLSAEL----------- 360
           QKDFTFPSYRKFRSKEAGLYLPDPAY+ITVNKIVQVDFLSSSWFLSAEL           
Sbjct: 301 QKDFTFPSYRKFRSKEAGLYLPDPAYDITVNKIVQVDFLSSSWFLSAELVKTLFIETPFT 360

Query: 361 ------------LQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDD 420
                       LQKYR+AGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDD
Sbjct: 361 FATGEDLHLSYQLQKYRDAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDD 420

Query: 421 QWWKALSTGYVTQWAAMHPQKIDALFYAHSVDEVKALAPLLEKFRSTIGKKAYIVVSGGT 480
           QWWKALSTGY+TQWAAMHPQKIDALFYAHSVDE KALAPLLEKFRST+GKKAYIVVSGG 
Sbjct: 421 QWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGN 480

Query: 481 FCPCEDAAAALKWPKLVCKERRFKIFDLAIGALSGLSNSEVPVVQAVYASMKGLIQIHNP 540
           FCPCED A ALKWPKLVCKERRFKIFDLAIGALSG+SNSEVPVVQAVYASMKGLI+IHNP
Sbjct: 481 FCPCEDVADALKWPKLVCKERRFKIFDLAIGALSGISNSEVPVVQAVYASMKGLIKIHNP 540

Query: 541 SIIVTVADLDPNVKKALKMASEANMNTTTLILLPRPSISKVLWMADLRPTAIPNWNKMRI 600
           S+I+TVAD+DPNVKKALKMASEAN+N TTL+LLPRPSISKVLWMA+LR TA+PNWNKMRI
Sbjct: 541 SVIITVADIDPNVKKALKMASEANLNGTTLVLLPRPSISKVLWMANLRSTALPNWNKMRI 600

Query: 601 SINIITQNRANSLTRLLKSLKDAYYLGDEIPISFNMDSKVDEETIKLVSSFEWPHGPKSL 660
           SINIITQNRA+SLTRLLKSLKDAYYLGDEIPISFNMDSKVDEETIKLVSSFEWPHGPKSL
Sbjct: 601 SINIITQNRASSLTRLLKSLKDAYYLGDEIPISFNMDSKVDEETIKLVSSFEWPHGPKSL 660

Query: 661 RRRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPE 720
           RRRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPE
Sbjct: 661 RRRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPE 720

Query: 721 LSTISLYTPRLVEVVKERPKWNATEFFQRIHPNTPYLHQLPCSWGALFFPKHWREFYVYM 780
           LS+ISLYTPRLVEVVKERPKWNATEFF+RIHPNTPYLHQLPCSWGA+FFPKHWREFYVYM
Sbjct: 721 LSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYM 780

Query: 781 NSRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEP 829
           NSRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFP+QASFSTNHMEP
Sbjct: 781 NSRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPDQASFSTNHMEP 840

BLAST of Cp4.1LG02g12440 vs. NCBI nr
Match: gi|659071945|ref|XP_008462729.1| (PREDICTED: uncharacterized protein LOC103501011 isoform X2 [Cucumis melo])

HSP 1 Score: 1562.0 bits (4043), Expect = 0.0e+00
Identity = 767/853 (89.92%), Postives = 795/853 (93.20%), Query Frame = 1

Query: 1   MGMFRNPTTRNGEYLEGMINEYVGGGKGKLRAQRNTSTKLVTALTCLQFAFAIYATFLLY 60
           MG FRN    NG+ LEGMIN+YVGG KGKLR QR++STK+V  LTCLQFAFA+YATFLLY
Sbjct: 3   MGKFRNSAMGNGDCLEGMINDYVGG-KGKLRPQRSSSTKIVAGLTCLQFAFALYATFLLY 62

Query: 61  YVSPAIDLRTKPDFSWATRIAQQWKQFVIPPHVVGRYQEQTSF---AEFRPITPEEACEN 120
           YVSPAIDLRTKPDFSWATRIAQQW QFVIPPHVVGRYQE TS    AE RPITPEEACEN
Sbjct: 63  YVSPAIDLRTKPDFSWATRIAQQWTQFVIPPHVVGRYQEPTSMMMQAELRPITPEEACEN 122

Query: 121 EKIDFEQKKSNDAQMIKLKTELYNEVLDFQSQSFGTETLSQLMAMKSKWDLRGKNKPKVT 180
           EKIDFEQKKSND QMIKLKTELYNE+LDFQS+SFGTETL QLMAMKSKWDL+G  KPKVT
Sbjct: 123 EKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLPQLMAMKSKWDLKGPKKPKVT 182

Query: 181 VILNHFKRKTLCAQINSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNSRISFISSSY 240
           VILNHFKRKTLCAQ+NSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNS+ISFISSSY
Sbjct: 183 VILNHFKRKTLCAQLNSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSY 242

Query: 241 DFKYYGRFQMALQTEADLVYILDDDMIPGHKMLQILSHVAGTEKYKNAVLGSIGRILPFR 300
           DFKYYGRFQMALQTEADLVYILDDDMIPG KMLQILSHVAGT+KYKNAVLGSIGRILPFR
Sbjct: 243 DFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR 302

Query: 301 QKDFTFPSYRKFRSKEAGLYLPDPAYNITVNKIVQVDFLSSSWFLSAEL----------- 360
           QKDFTFPSYRKFRSKEAGLYLPDPAY+ITVNKIVQVDFLSSSWFLSAEL           
Sbjct: 303 QKDFTFPSYRKFRSKEAGLYLPDPAYDITVNKIVQVDFLSSSWFLSAELVKTLFIETPFT 362

Query: 361 ------------LQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDD 420
                       LQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDD
Sbjct: 363 FATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDD 422

Query: 421 QWWKALSTGYVTQWAAMHPQKIDALFYAHSVDEVKALAPLLEKFRSTIGKKAYIVVSGGT 480
           QWWKALSTGYVTQWAAMHPQKIDALFYAHSVDE KALAPLLEKFRST+GKKAYIVVSGG 
Sbjct: 423 QWWKALSTGYVTQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGR 482

Query: 481 FCPCEDAAAALKWPKLVCKERRFKIFDLAIGALSGLSNSEVPVVQAVYASMKGLIQIHNP 540
           FCPCED   ALKWPKLVCKERRFKIFDLAIGALSGLSNSEVPVVQAVYASMKGLI+IHNP
Sbjct: 483 FCPCEDVTDALKWPKLVCKERRFKIFDLAIGALSGLSNSEVPVVQAVYASMKGLIKIHNP 542

Query: 541 SIIVTVADLDPNVKKALKMASEANMNTTTLILLPRPSISKVLWMADLRPTAIPNWNKMRI 600
           S+I+TVAD+DPNVKKALKMASEAN+N TTLILLPRPSISKVLWMADLR TA+PNWNKM+I
Sbjct: 543 SVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMKI 602

Query: 601 SINIITQNRANSLTRLLKSLKDAYYLGDEIPISFNMDSKVDEETIKLVSSFEWPHGPKSL 660
           SINIITQNR +SLTRLLKSLKDAYYLGDEIPISFNMDSKVDE+TIKLVSSFEWPHGPKSL
Sbjct: 603 SINIITQNRVSSLTRLLKSLKDAYYLGDEIPISFNMDSKVDEKTIKLVSSFEWPHGPKSL 662

Query: 661 RRRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPE 720
           RRRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPE
Sbjct: 663 RRRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPE 722

Query: 721 LSTISLYTPRLVEVVKERPKWNATEFFQRIHPNTPYLHQLPCSWGALFFPKHWREFYVYM 780
           LS+ISLYTPRLVEVVKERPKWNATEFF+RIHPNTPYLHQLPCSWGA+FFPKHWREFYVYM
Sbjct: 723 LSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYM 782

Query: 781 NSRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEP 828
           NSRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEP
Sbjct: 783 NSRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEP 842

BLAST of Cp4.1LG02g12440 vs. NCBI nr
Match: gi|659071941|ref|XP_008462712.1| (PREDICTED: uncharacterized protein LOC103501011 isoform X1 [Cucumis melo])

HSP 1 Score: 1562.0 bits (4043), Expect = 0.0e+00
Identity = 767/853 (89.92%), Postives = 795/853 (93.20%), Query Frame = 1

Query: 1   MGMFRNPTTRNGEYLEGMINEYVGGGKGKLRAQRNTSTKLVTALTCLQFAFAIYATFLLY 60
           MG FRN    NG+ LEGMIN+YVGG KGKLR QR++STK+V  LTCLQFAFA+YATFLLY
Sbjct: 22  MGKFRNSAMGNGDCLEGMINDYVGG-KGKLRPQRSSSTKIVAGLTCLQFAFALYATFLLY 81

Query: 61  YVSPAIDLRTKPDFSWATRIAQQWKQFVIPPHVVGRYQEQTSF---AEFRPITPEEACEN 120
           YVSPAIDLRTKPDFSWATRIAQQW QFVIPPHVVGRYQE TS    AE RPITPEEACEN
Sbjct: 82  YVSPAIDLRTKPDFSWATRIAQQWTQFVIPPHVVGRYQEPTSMMMQAELRPITPEEACEN 141

Query: 121 EKIDFEQKKSNDAQMIKLKTELYNEVLDFQSQSFGTETLSQLMAMKSKWDLRGKNKPKVT 180
           EKIDFEQKKSND QMIKLKTELYNE+LDFQS+SFGTETL QLMAMKSKWDL+G  KPKVT
Sbjct: 142 EKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLPQLMAMKSKWDLKGPKKPKVT 201

Query: 181 VILNHFKRKTLCAQINSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNSRISFISSSY 240
           VILNHFKRKTLCAQ+NSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNS+ISFISSSY
Sbjct: 202 VILNHFKRKTLCAQLNSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSY 261

Query: 241 DFKYYGRFQMALQTEADLVYILDDDMIPGHKMLQILSHVAGTEKYKNAVLGSIGRILPFR 300
           DFKYYGRFQMALQTEADLVYILDDDMIPG KMLQILSHVAGT+KYKNAVLGSIGRILPFR
Sbjct: 262 DFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR 321

Query: 301 QKDFTFPSYRKFRSKEAGLYLPDPAYNITVNKIVQVDFLSSSWFLSAEL----------- 360
           QKDFTFPSYRKFRSKEAGLYLPDPAY+ITVNKIVQVDFLSSSWFLSAEL           
Sbjct: 322 QKDFTFPSYRKFRSKEAGLYLPDPAYDITVNKIVQVDFLSSSWFLSAELVKTLFIETPFT 381

Query: 361 ------------LQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDD 420
                       LQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDD
Sbjct: 382 FATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDD 441

Query: 421 QWWKALSTGYVTQWAAMHPQKIDALFYAHSVDEVKALAPLLEKFRSTIGKKAYIVVSGGT 480
           QWWKALSTGYVTQWAAMHPQKIDALFYAHSVDE KALAPLLEKFRST+GKKAYIVVSGG 
Sbjct: 442 QWWKALSTGYVTQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGR 501

Query: 481 FCPCEDAAAALKWPKLVCKERRFKIFDLAIGALSGLSNSEVPVVQAVYASMKGLIQIHNP 540
           FCPCED   ALKWPKLVCKERRFKIFDLAIGALSGLSNSEVPVVQAVYASMKGLI+IHNP
Sbjct: 502 FCPCEDVTDALKWPKLVCKERRFKIFDLAIGALSGLSNSEVPVVQAVYASMKGLIKIHNP 561

Query: 541 SIIVTVADLDPNVKKALKMASEANMNTTTLILLPRPSISKVLWMADLRPTAIPNWNKMRI 600
           S+I+TVAD+DPNVKKALKMASEAN+N TTLILLPRPSISKVLWMADLR TA+PNWNKM+I
Sbjct: 562 SVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMKI 621

Query: 601 SINIITQNRANSLTRLLKSLKDAYYLGDEIPISFNMDSKVDEETIKLVSSFEWPHGPKSL 660
           SINIITQNR +SLTRLLKSLKDAYYLGDEIPISFNMDSKVDE+TIKLVSSFEWPHGPKSL
Sbjct: 622 SINIITQNRVSSLTRLLKSLKDAYYLGDEIPISFNMDSKVDEKTIKLVSSFEWPHGPKSL 681

Query: 661 RRRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPE 720
           RRRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPE
Sbjct: 682 RRRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPE 741

Query: 721 LSTISLYTPRLVEVVKERPKWNATEFFQRIHPNTPYLHQLPCSWGALFFPKHWREFYVYM 780
           LS+ISLYTPRLVEVVKERPKWNATEFF+RIHPNTPYLHQLPCSWGA+FFPKHWREFYVYM
Sbjct: 742 LSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYM 801

Query: 781 NSRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEP 828
           NSRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEP
Sbjct: 802 NSRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEP 861

BLAST of Cp4.1LG02g12440 vs. NCBI nr
Match: gi|743902142|ref|XP_011044401.1| (PREDICTED: uncharacterized protein LOC105139601 [Populus euphratica])

HSP 1 Score: 1486.9 bits (3848), Expect = 0.0e+00
Identity = 714/852 (83.80%), Postives = 781/852 (91.67%), Query Frame = 1

Query: 1   MGMFRNPTTRNGEYLEGMINEYVGGGKGKLRAQRNTSTKLVTALTCLQFAFAIYATFLLY 60
           MG+ RN T ++G+YLEGM+++YVGG K K + QR++S +LVTALTCLQFAFA+YATFLLY
Sbjct: 1   MGLIRNSTMKSGDYLEGMLSDYVGG-KAKSKVQRSSSARLVTALTCLQFAFAVYATFLLY 60

Query: 61  YVSPAIDLRTKPDFSWATRIAQQWKQFVIPPHVVGRYQEQTSF--AEFRPITPEEACENE 120
           Y+SP IDLRTKPDF+WATRIAQQWK F+IPPHV+GRYQE  S   AE RPI P E CE+E
Sbjct: 61  YMSPTIDLRTKPDFTWATRIAQQWKHFIIPPHVLGRYQEAASLVTAEIRPINPSEVCEHE 120

Query: 121 KIDFEQKKSNDAQMIKLKTELYNEVLDFQSQSFGTETLSQLMAMKSKWDLRGKNKPKVTV 180
           KIDF+QKKSNDAQMIKLK ELY+EVLDFQS+S GTETLS+LMAMKSKWDLRG NKP+VTV
Sbjct: 121 KIDFQQKKSNDAQMIKLKRELYDEVLDFQSKSIGTETLSELMAMKSKWDLRGPNKPRVTV 180

Query: 181 ILNHFKRKTLCAQINSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNSRISFISSSYD 240
           ILNHFKRKTLCAQ++SLLHQTLPFHHVWVL+FGSPNELSLKRIV+SYN+SRISFISSSYD
Sbjct: 181 ILNHFKRKTLCAQLDSLLHQTLPFHHVWVLSFGSPNELSLKRIVNSYNDSRISFISSSYD 240

Query: 241 FKYYGRFQMALQTEADLVYILDDDMIPGHKMLQILSHVAGTEKYKNAVLGSIGRILPFRQ 300
           FKYYGRFQMALQTEADLVYI+DDDMIPG KMLQILSHVAGTEKYKN+VLGSIGRILPFRQ
Sbjct: 241 FKYYGRFQMALQTEADLVYIVDDDMIPGRKMLQILSHVAGTEKYKNSVLGSIGRILPFRQ 300

Query: 301 KDFTFPSYRKFRSKEAGLYLPDPAYNITVNKIVQVDFLSSSWFLSAEL------------ 360
           KDFTFPSYRKFRSK+AGLYLPDPAY+ITV+KIVQVDFLSSSWFLSAEL            
Sbjct: 301 KDFTFPSYRKFRSKDAGLYLPDPAYDITVDKIVQVDFLSSSWFLSAELVKTLFVEAPMTF 360

Query: 361 -----------LQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDDQ 420
                      LQKYRNAGSFVLPVDP DKETWGDSEHRLAYVSETTVIFKDIVQVRDDQ
Sbjct: 361 MTGEDLHLSYQLQKYRNAGSFVLPVDPNDKETWGDSEHRLAYVSETTVIFKDIVQVRDDQ 420

Query: 421 WWKALSTGYVTQWAAMHPQKIDALFYAHSVDEVKALAPLLEKFRSTIGKKAYIVVSGGTF 480
           WWKALSTGYVTQWAAMHPQKIDALFYAHSVDEVKALAPL+EKFRST+GKKAYIVVSGG F
Sbjct: 421 WWKALSTGYVTQWAAMHPQKIDALFYAHSVDEVKALAPLIEKFRSTVGKKAYIVVSGGNF 480

Query: 481 CPCEDAAAALKWPKLVCKERRFKIFDLAIGALSGLSNSEVPVVQAVYASMKGLIQIHNPS 540
           CPCEDAA AL WPK+VCKERRFKIFDLA+ A + +SNSEVPV+Q VY S+KGLI+IHNPS
Sbjct: 481 CPCEDAATALNWPKIVCKERRFKIFDLAVAAQTEISNSEVPVIQTVYTSVKGLIKIHNPS 540

Query: 541 IIVTVADLDPNVKKALKMASEANMNTTTLILLPRPSISKVLWMADLRPTAIPNWNKMRIS 600
           +++ V D+DPNVKKALKMA+E + N TTL+LLPRPSISK+LWMADLR TA+PNWNKMRIS
Sbjct: 541 VLIAVNDIDPNVKKALKMATETSTNGTTLVLLPRPSISKILWMADLRSTALPNWNKMRIS 600

Query: 601 INIITQNRANSLTRLLKSLKDAYYLGDEIPISFNMDSKVDEETIKLVSSFEWPHGPKSLR 660
           +NIITQNRA+SLTRLLKSL DAYY+GDEIPISFN+DSKVDEETI+LVSSF WPHGPK+LR
Sbjct: 601 VNIITQNRASSLTRLLKSLSDAYYVGDEIPISFNVDSKVDEETIRLVSSFNWPHGPKTLR 660

Query: 661 RRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPEL 720
           RRIIQGGLIRAVSESWYP+SDDDYGLLLEDDIEVSP+YYLWIKYALLAYHYDPQ+SLPEL
Sbjct: 661 RRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPFYYLWIKYALLAYHYDPQVSLPEL 720

Query: 721 STISLYTPRLVEVVKERPKWNATEFFQRIHPNTPYLHQLPCSWGALFFPKHWREFYVYMN 780
           S+ISLYTP+LVEVVKERP+WNATEFF+RIHPNTPYLHQLPCSWGA+FFPK WREFYVYMN
Sbjct: 721 SSISLYTPKLVEVVKERPRWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKQWREFYVYMN 780

Query: 781 SRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPG 828
            RFTE+AK NPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPG
Sbjct: 781 MRFTEDAKANPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPG 840

BLAST of Cp4.1LG02g12440 vs. NCBI nr
Match: gi|566185783|ref|XP_002314281.2| (hypothetical protein POPTR_0009s01610g [Populus trichocarpa])

HSP 1 Score: 1485.3 bits (3844), Expect = 0.0e+00
Identity = 715/852 (83.92%), Postives = 780/852 (91.55%), Query Frame = 1

Query: 1   MGMFRNPTTRNGEYLEGMINEYVGGGKGKLRAQRNTSTKLVTALTCLQFAFAIYATFLLY 60
           MG+ RN T ++G+YLEGM+++YVGG K K + QR++S +LVTALTCLQFAFA+YATFLLY
Sbjct: 1   MGLIRNSTMKSGDYLEGMLSDYVGG-KAKSKVQRSSSARLVTALTCLQFAFAVYATFLLY 60

Query: 61  YVSPAIDLRTKPDFSWATRIAQQWKQFVIPPHVVGRYQEQTSF--AEFRPITPEEACENE 120
           Y+SP IDLRTKPDF+WATRIAQQWK F+IPPHV+GRYQE  S   AE  PI P E CE+E
Sbjct: 61  YMSPTIDLRTKPDFAWATRIAQQWKHFIIPPHVLGRYQEAASLVTAEIGPINPSEVCEHE 120

Query: 121 KIDFEQKKSNDAQMIKLKTELYNEVLDFQSQSFGTETLSQLMAMKSKWDLRGKNKPKVTV 180
           KIDF+QKKSNDAQMIKLK ELY+EVLDFQS+S GTETLS+LMAMKSKWDLRG NKP+VTV
Sbjct: 121 KIDFQQKKSNDAQMIKLKRELYDEVLDFQSKSTGTETLSELMAMKSKWDLRGPNKPRVTV 180

Query: 181 ILNHFKRKTLCAQINSLLHQTLPFHHVWVLAFGSPNELSLKRIVDSYNNSRISFISSSYD 240
           ILNHFKRKTLCAQ++SLLHQTLPFHHVWVL+FGSPNELSLKRIV+SYN+SRISFISSSYD
Sbjct: 181 ILNHFKRKTLCAQLDSLLHQTLPFHHVWVLSFGSPNELSLKRIVNSYNDSRISFISSSYD 240

Query: 241 FKYYGRFQMALQTEADLVYILDDDMIPGHKMLQILSHVAGTEKYKNAVLGSIGRILPFRQ 300
           FKYYGRFQMALQTEADLVYI+DDDMIPG KMLQILSHVAGTEKYKN+VLGSIGRILPFRQ
Sbjct: 241 FKYYGRFQMALQTEADLVYIVDDDMIPGRKMLQILSHVAGTEKYKNSVLGSIGRILPFRQ 300

Query: 301 KDFTFPSYRKFRSKEAGLYLPDPAYNITVNKIVQVDFLSSSWFLSAEL------------ 360
           KDFTFPSYRKFRSKEAGLYLPDPAY+ITV+KIVQVDFLSSSWFLSAEL            
Sbjct: 301 KDFTFPSYRKFRSKEAGLYLPDPAYDITVDKIVQVDFLSSSWFLSAELVKTLFIEAPMTF 360

Query: 361 -----------LQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDDQ 420
                      LQKYRNAGSFVLPVDP DKETWGDSEHRLAYVSETTVIFKDIVQVRDDQ
Sbjct: 361 MTGEDLHLSYQLQKYRNAGSFVLPVDPNDKETWGDSEHRLAYVSETTVIFKDIVQVRDDQ 420

Query: 421 WWKALSTGYVTQWAAMHPQKIDALFYAHSVDEVKALAPLLEKFRSTIGKKAYIVVSGGTF 480
           WWKA STGYVTQWAAMHPQKIDALFYAHSVDEVKALAPL+EKFRST+GKKAYIVVSGG F
Sbjct: 421 WWKAFSTGYVTQWAAMHPQKIDALFYAHSVDEVKALAPLIEKFRSTVGKKAYIVVSGGNF 480

Query: 481 CPCEDAAAALKWPKLVCKERRFKIFDLAIGALSGLSNSEVPVVQAVYASMKGLIQIHNPS 540
           CPCEDAA AL WPK+VCKERRFKIFDLA+ A + +SNSEVPV+QAVY+S+KGLI+IHNPS
Sbjct: 481 CPCEDAATALNWPKIVCKERRFKIFDLAVAAQTEISNSEVPVIQAVYSSVKGLIKIHNPS 540

Query: 541 IIVTVADLDPNVKKALKMASEANMNTTTLILLPRPSISKVLWMADLRPTAIPNWNKMRIS 600
           +++ V D+DPNVKKALKMA+E N N TT++LLPRPSISKVLWMADLR TA+PNWNKMRIS
Sbjct: 541 VLIAVNDIDPNVKKALKMATETNTNGTTMVLLPRPSISKVLWMADLRSTALPNWNKMRIS 600

Query: 601 INIITQNRANSLTRLLKSLKDAYYLGDEIPISFNMDSKVDEETIKLVSSFEWPHGPKSLR 660
           +NIITQNRA SLTRLLKSL DAYY+GDEIPISFN+DSKVDEETI+LVSSF WPHGPK+LR
Sbjct: 601 VNIITQNRAPSLTRLLKSLSDAYYVGDEIPISFNVDSKVDEETIRLVSSFNWPHGPKTLR 660

Query: 661 RRIIQGGLIRAVSESWYPASDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPEL 720
           RRIIQGGLIRAVSESWYP+SDDDYGLLLEDDIEVSP+YYLWIKYALLAYHYDPQ+SLPEL
Sbjct: 661 RRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPFYYLWIKYALLAYHYDPQVSLPEL 720

Query: 721 STISLYTPRLVEVVKERPKWNATEFFQRIHPNTPYLHQLPCSWGALFFPKHWREFYVYMN 780
           S+ISLYTP+LVEVVKERP+WNATEFF+RIHPNTPYLHQLPCSWGA+FFPK WREFYVYMN
Sbjct: 721 SSISLYTPKLVEVVKERPRWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKQWREFYVYMN 780

Query: 781 SRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPG 828
            RFTE+AK NPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPG
Sbjct: 781 MRFTEDAKANPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPG 840

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0LV36_CUCSA0.0e+0089.81Uncharacterized protein OS=Cucumis sativus GN=Csa_1G124520 PE=4 SV=1[more]
B9HQB9_POPTR0.0e+0083.92Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0009s01610g PE=4 SV=2[more]
A0A0D2R2N2_GOSRA0.0e+0083.45Uncharacterized protein OS=Gossypium raimondii GN=B456_004G230400 PE=4 SV=1[more]
M5WLD4_PRUPE0.0e+0083.20Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001021mg PE=4 SV=1[more]
A0A067LHF7_JATCU0.0e+0083.10Uncharacterized protein OS=Jatropha curcas GN=JCGZ_05502 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G60700.13.4e-29982.14 glycosyltransferase family protein 2[more]
AT5G12260.11.5e-2831.58 BEST Arabidopsis thaliana protein match is: glycosyltransferase fami... [more]
Match NameE-valueIdentityDescription
gi|778659191|ref|XP_004139558.2|0.0e+0089.81PREDICTED: uncharacterized protein LOC101202906 [Cucumis sativus][more]
gi|659071945|ref|XP_008462729.1|0.0e+0089.92PREDICTED: uncharacterized protein LOC103501011 isoform X2 [Cucumis melo][more]
gi|659071941|ref|XP_008462712.1|0.0e+0089.92PREDICTED: uncharacterized protein LOC103501011 isoform X1 [Cucumis melo][more]
gi|743902142|ref|XP_011044401.1|0.0e+0083.80PREDICTED: uncharacterized protein LOC105139601 [Populus euphratica][more]
gi|566185783|ref|XP_002314281.2|0.0e+0083.92hypothetical protein POPTR_0009s01610g [Populus trichocarpa][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG02g12440.1Cp4.1LG02g12440.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR33604FAMILY NOT NAMEDcoord: 160..826
score:
NoneNo IPR availablePANTHERPTHR33604:SF2GLYCOSYLTRANSFERASE FAMILY PROTEIN 2coord: 160..826
score:

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG02g12440Cp4.1LG20g07270Cucurbita pepo (Zucchini)cpecpeB432
The following block(s) are covering this gene:

None