Cp4.1LG10g09830 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG10g09830
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Location: Cp4.1LG10 : 4393135 .. 4397171 (+)
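The location string can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF-style annotation); the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into a dict.

    Assumption: START and END are 1-based, inclusive coordinates,
    so the span length is END - START + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG10 : 4393135 .. 4397171 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the reported span covers 4037 positions on the plus strand of Cp4.1LG10.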
The following sequences are available for this feature:

Gene sequence (with intron)

AGATGTCTTCCAAAACACTCCAAGTTCCGTTCACATTTCGCCACCGCCATCCCCTCCCCTTTTCTCATGAGTTTTTGTTTATTAATCAGCTCCCCGGCGAAACCGCCACCTGTTCCATGTATTCCTTTCTCTCTCTTGTTTTGGTCTGGTATTCCTTTCCGATCCATTGCATGAAACTTCTTTTGTTCATTTGCAGGAAGAACAATGAAGAAGAAAGGTCCTCTAACGCCAGGGCGCAACCTTATCTGGTTCAGCTGGAAGCTTGTCATCACCTTCTCTCTTGCACTTTGCGTTTTCGCTCTTATTAGGCTTCATTCGTCTTCTCGGACCAACCTCGCCTCTGCCTCATTATCCCGTAGATTGCGCCCCCCTCCTGGTTCATTTTTGGGCCGCCCTAAGATTGCCTTCTTGTTTCTCACTCGTCGAAACCTCCCTCTTGATTTTCTTTGGGGAAGCTTCTTCCAGGTGTTTAAAAGATTCCACTCTCGCTAAAACTATCTTCGATTAGGCATTGTCTTTGAGTTGGGAAAATAATTTGCAATTTTCTATTCCAGAATGGCGACGTTGCGAACTTCTCGATTTACATTCACTCGGCGCCCGGATTTGTGTTCGATGAATCGACGACAAGGTCGCATTTCTTTTTTGGACGGCAATTGGAGAATAGCATTCAGGTAACATGATGTGTTCTTGTGAAATGAAGTTTTTATGTTTTTCCGTACCAATGCTTACCATCAAACTGGAGAATAATGGAAAATAGATGCCTTTTGGCTTGTTCTTTACCCTTTTGTGCGTGCGTGTGTGTGTGTGTGTGTGTGTGGAATTACTTAGAAGTATGAGGGACGAAGTCTTCAAAGTGGAATGTTCCCGTTGTATTAAATTGAGGAGCGTAGTAGGCTACTCTCCAATGTGTAGTAGCCTTAGTGCTCCAACAAGTCTAACCTTTATCAAGGTGAGAGAACAACTTATGTAAATACTTCACCCTAGCTAGTCAAAACTTGACAATGTCCTAGGACTATTGTTCTCAATAGGTAGATACTATAATCATGATGAAATTATCATTTACTCTCTGTTTTTAAATAGTGGGGTTGAATTCATGTTGAGGCATGTAATGTTTTCGATAATTTTTTAAGTTTTGATTTTTGGTAATTGGTTGCATTGAAGTGACGTTATGTTGCAGGTGGACTGGGGAAAGTCGAGTATGATTGCTGCAGAGAGGTTCTTACTTGAAGCAGCTCTTGAGGACACTGCAAATCAGAGATTTGTTCTTCTATCGGACAGGTGTTTTGGGATCACCACTGCCACTTTCCGTTGCTCTCTGAACATTTTAATCTATGAAGTTGAGGTGTACAACTTCCCTTAAGATTTGTCTTCTCATGCTGCAGTTGCGTTCCACTATACAACTTTAGCTATATATATAGCTATCTCATGGCCTCTCCCAGGAGTTTTGTGGACAGGTAGATGGAGGGCTTTGATTCTGAGGTGCCAAGCACTGATTCTTTGTTGGCCATTTGCTGCCAAAATTTAGATATTTGCCAAGTTTTTATAGCTTCCGTAATTAAATTTCTCTGTATTGAAATATATATTCTCATTTTTTCTTGTAGCTTTCTTGATGCAAAGGAGGGTCGCTATAACCCAAAAATGTCACCTGCTATAACTAAGGGCAAATGGAGGAAAGGGTCCCAGGTCATCCATCACAATAGTTTTTTCTTGGATCTTTTGCTTCGTTTATTTTTTGGTTTCTTTTCTTAGTTATCCAGATTCAACCATGAGACATAGGATTTGTGTCCATGTTTTGTGTTTCACTAGTCGGTATTTTCATCCCTAGCTGTTATGTTCTTTCACATACCCAAAGGAAAGAAGTATGCCTTATTTATTGCTTGATAAATTATTTTAGAGGTACTAGGCAATATTGTTTTCGCTCTCGTGGGTGTAGATGTTTGTTAGTTGAATTTTAAACAATGGTGATGTGTGCAATTATAATTGCTTTGTGGTTAAATTGC
AGTGGATCAGTTTGATTCGTAGTCACGCAGAAGTAATTGTGGATGATGATATTATATTCCCAGTCTTTGGATTATTCTGCAAGGTATGTCTTTTGCAGAAAATTCCTTTTTCATTTGTCAACTATGATCCAAGTGCAGTTAACTATTTCTTCTGTGTGGAGACACTATCATTCGTTCACACACTGACACACATTACACATGAACACAATATATTGGTCACATTTTCCCCAAATGTATTATCTTTAAATTGAATGGCATACTGGAATCCACTTCCCTCAGCACATTCTTAAGATTTTTATGAAACAACATTCACTCTTTTTGGTATCGCTTATAAAGTTTATATTTATTCTAAACTACAGCGAAGGCCACCTGTGGATGCCAGCAAAGGAGCTATGAACATTGTAAGTGATCATTTGAGTATATGAAGATATTCAGTTCAACAATATGAATGTTATTACGAAATAGTTTATTCATTATTCTGGATGTTATCTTCTAAGATTGTTGGTGCTGCTATCAATATCAAATCTTTCCCACTAGAAATCAATCTTCCTTAATAAAATATGCATCAATAACCTGGCCTAAATTCTGATTGTAGAAACTTCAAAAGCAGCACAACTGTATTCCAGATGAACACTATGTCCAGACATTGCTTGCTGTGAGTACATATTTGTCTTCTGAGGCCATGATATCTTATCCATCGCTATGGATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCATCTTGGGAGATGAGTTTATATAACCCTTTCCAGGCAGTTACTGTAAATTGTTGTCTGAAATAGTTTAGTTTACAGATAATTTACATTGTAACATTCTTTTCTTGTGTTTACTCTGTAGTTAAATGAACTTGAAGGTGAACTTGAACGAAGAACATTAACTTACACTCTATGGAATCAGTCAACCAATAATATGGAGAACAAGGGGTGGCATCCTATTACCTTTAACTATGCTAAGGCTGGACCAAGGCAGATCGAGGAAATAAAGGTTTGATTATATTGCATTATTTCTCTTTTTGTCTTCGTGTTTGTGTTTGTTTATTTAAGAGGGTGGGAGGGCGGGCGGATAAACGTGGTTTCTTGCAAAATTAAATGCTCTAGTTTTTATTTAATGCAATCAAGCTTTTTTTATTTTTTATTTTATTTTTACATTGTTTGTTGGACTGCTTGGCTTATAATGATTCAATGAGAATTGGTGTCTATTCTATGTCATTATCAACCATATTTGAGTAATTAGTGTATTGGGGTATATTTGTGGCATAAATTTGTTGCTCTGGGTTCCTTTTCTGAAATTATTCATTATCTAGTGTGTACTATGTTGGTTGACGTGTATTATTTTGACAGGGAATCAACCATGTCTACTATGAGACCGAATTCAGGACGGAATGGTGTCGAAATAATTCAACTTTTGTTCCTTGTTTTCTATTTGCCAGAAAATTTTCTCAGGGAGCTGCTATGCGATTATTAAGTGAGGGTGTTGCGAGTGACTTTGATGCCTAAGCATTGTCAGAGATCAAAAAGTTCAACTAAATTGAACTATCATTCATCGTAGTCAAGCAGTTAAGATATTAAATATTCTAGGAAGTTTCTAACTGTGATGTCTCCTGAAATTAGTAAAATCAGTGGAGGCGTGCATGGGTTAGATATTCTTGCTCTCCTGAAATTAGTAAAAAGAAAACCAGGTTTGATAGTTCAATCATACTTCACGGCACCTATATTTGTCCATCTTACTTAACATTGTTGGTTCATGTTTTCATCAACTGTGTCTTCAAATGAAAAAATTGATCGTTCAAGGAAGCCACTCTGTTCATGGTGCGTCCGCATTGTCAGTCCTTTGAGAAGAGGCTTCTGCAGAGCTGCCTTGTACATGTCAAAAAGATGGTTACCAATACTTGTGATTTTGATGCTGATGTAATATAGACTCAACGTGTTGTTTATTTATTTTAAGGACTTCTATGCGGACATT
ATTGTTTGTATCAGCAAAAAGAATGTCATATTAATTG

mRNA sequence

AGATGTCTTCCAAAACACTCCAAGTTCCGTTCACATTTCGCCACCGCCATCCCCTCCCCTTTTCTCATGAGTTTTTGTTTATTAATCAGCTCCCCGGCGAAACCGCCACCTGTTCCATGAAGAACAATGAAGAAGAAAGGTCCTCTAACGCCAGGGCGCAACCTTATCTGGTTCAGCTGGAAGCTTGTCATCACCTTCTCTCTTGCACTTTGCGTTTTCGCTCTTATTAGGCTTCATTCGTCTTCTCGGACCAACCTCGCCTCTGCCTCATTATCCCGTAGATTGCGCCCCCCTCCTGGTTCATTTTTGGGCCGCCCTAAGATTGCCTTCTTGTTTCTCACTCGTCGAAACCTCCCTCTTGATTTTCTTTGGGGAAGCTTCTTCCAGAATGGCGACGTTGCGAACTTCTCGATTTACATTCACTCGGCGCCCGGATTTGTGTTCGATGAATCGACGACAAGGTCGCATTTCTTTTTTGGACGGCAATTGGAGAATAGCATTCAGGTGGACTGGGGAAAGTCGAGTATGATTGCTGCAGAGAGGTTCTTACTTGAAGCAGCTCTTGAGGACACTGCAAATCAGAGATTTGTTCTTCTATCGGACAGTTGCGTTCCACTATACAACTTTAGCTATATATATAGCTATCTCATGGCCTCTCCCAGGAGTTTTGTGGACAGCTTTCTTGATGCAAAGGAGGGTCGCTATAACCCAAAAATGTCACCTGCTATAACTAAGGGCAAATGGAGGAAAGGGTCCCAGTGGATCAGTTTGATTCGTAGTCACGCAGAAGTAATTGTGGATGATGATATTATATTCCCAGTCTTTGGATTATTCTGCAAGCGAAGGCCACCTGTGGATGCCAGCAAAGGAGCTATGAACATTAAACTTCAAAAGCAGCACAACTGTATTCCAGATGAACACTATGTCCAGACATTGCTTGCTTTAAATGAACTTGAAGGTGAACTTGAACGAAGAACATTAACTTACACTCTATGGAATCAGTCAACCAATAATATGGAGAACAAGGGGTGGCATCCTATTACCTTTAACTATGCTAAGGCTGGACCAAGGCAGATCGAGGAAATAAAGGGAATCAACCATGTCTACTATGAGACCGAATTCAGGACGGAATGGTGTCGAAATAATTCAACTTTTGTTCCTTGTTTTCTATTTGCCAGAAAATTTTCTCAGGGAGCTGCTATGCGATTATTAAGTGAGGGTGTTGCGAGTGACTTTGATGCCTAAGCATTGTCAGAGATCAAAAAGTTCAACTAAATTGAACTATCATTCATCGTAGTCAAGCAGTTAAGATATTAAATATTCTAGGAAGTTTCTAACTGTGATGTCTCCTGAAATTAGTAAAATCAGTGGAGGCGTGCATGGGTTAGATATTCTTGCTCTCCTGAAATTAGTAAAAAGAAAACCAGGTTTGATAGTTCAATCATACTTCACGGCACCTATATTTGTCCATCTTACTTAACATTGTTGGTTCATGTTTTCATCAACTGTGTCTTCAAATGAAAAAATTGATCGTTCAAGGAAGCCACTCTGTTCATGGTGCGTCCGCATTGTCAGTCCTTTGAGAAGAGGCTTCTGCAGAGCTGCCTTGTACATGTCAAAAAGATGGTTACCAATACTTGTGATTTTGATGCTGATGTAATATAGACTCAACGTGTTGTTTATTTATTTTAAGGACTTCTATGCGGACATTATTGTTTGTATCAGCAAAAAGAATGTCATATTAATTG

Coding sequence (CDS)

ATGAAGAAGAAAGGTCCTCTAACGCCAGGGCGCAACCTTATCTGGTTCAGCTGGAAGCTTGTCATCACCTTCTCTCTTGCACTTTGCGTTTTCGCTCTTATTAGGCTTCATTCGTCTTCTCGGACCAACCTCGCCTCTGCCTCATTATCCCGTAGATTGCGCCCCCCTCCTGGTTCATTTTTGGGCCGCCCTAAGATTGCCTTCTTGTTTCTCACTCGTCGAAACCTCCCTCTTGATTTTCTTTGGGGAAGCTTCTTCCAGAATGGCGACGTTGCGAACTTCTCGATTTACATTCACTCGGCGCCCGGATTTGTGTTCGATGAATCGACGACAAGGTCGCATTTCTTTTTTGGACGGCAATTGGAGAATAGCATTCAGGTGGACTGGGGAAAGTCGAGTATGATTGCTGCAGAGAGGTTCTTACTTGAAGCAGCTCTTGAGGACACTGCAAATCAGAGATTTGTTCTTCTATCGGACAGTTGCGTTCCACTATACAACTTTAGCTATATATATAGCTATCTCATGGCCTCTCCCAGGAGTTTTGTGGACAGCTTTCTTGATGCAAAGGAGGGTCGCTATAACCCAAAAATGTCACCTGCTATAACTAAGGGCAAATGGAGGAAAGGGTCCCAGTGGATCAGTTTGATTCGTAGTCACGCAGAAGTAATTGTGGATGATGATATTATATTCCCAGTCTTTGGATTATTCTGCAAGCGAAGGCCACCTGTGGATGCCAGCAAAGGAGCTATGAACATTAAACTTCAAAAGCAGCACAACTGTATTCCAGATGAACACTATGTCCAGACATTGCTTGCTTTAAATGAACTTGAAGGTGAACTTGAACGAAGAACATTAACTTACACTCTATGGAATCAGTCAACCAATAATATGGAGAACAAGGGGTGGCATCCTATTACCTTTAACTATGCTAAGGCTGGACCAAGGCAGATCGAGGAAATAAAGGGAATCAACCATGTCTACTATGAGACCGAATTCAGGACGGAATGGTGTCGAAATAATTCAACTTTTGTTCCTTGTTTTCTATTTGCCAGAAAATTTTCTCAGGGAGCTGCTATGCGATTATTAAGTGAGGGTGTTGCGAGTGACTTTGATGCCTAA

Protein sequence

MKKKGPLTPGRNLIWFSWKLVITFSLALCVFALIRLHSSSRTNLASASLSRRLRPPPGSFLGRPKIAFLFLTRRNLPLDFLWGSFFQNGDVANFSIYIHSAPGFVFDESTTRSHFFFGRQLENSIQVDWGKSSMIAAERFLLEAALEDTANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDSFLDAKEGRYNPKMSPAITKGKWRKGSQWISLIRSHAEVIVDDDIIFPVFGLFCKRRPPVDASKGAMNIKLQKQHNCIPDEHYVQTLLALNELEGELERRTLTYTLWNQSTNNMENKGWHPITFNYAKAGPRQIEEIKGINHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSEGVASDFDA
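The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this with the standard genetic code, which is assumed here because the record does not state a translation table. The helper `translate` is hypothetical, and only the first ten and last five codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (third base varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing characters other than A/C/G/T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAAGAAGAAAGGTCCTCTAACGCCAGGG"  # first 10 codons of the CDS
cds_end = "GACTTTGATGCCTAA"                  # last 5 codons, including the stop

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues (MKKKGPLTPG) and the last four residues (DFDA) of the protein sequence listed above. Passing the full CDS string to `translate` would be the natural way to check the whole record.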
BLAST of Cp4.1LG10g09830 vs. TrEMBL
Match: A0A0A0KHS2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G513470 PE=4 SV=1)

HSP 1 Score: 693.0 bits (1787), Expect = 2.0e-196
Identity = 335/373 (89.81%), Positives = 350/373 (93.83%), Query Frame = 1

Query: 1   MKKKGPLTPGRNLIWFSWKLVITFSLALCVFALIRLHSS-SRTNLASASLSRRLRPPPGS 60
           MKKK  LTP R+L WFSWKL++TFSLALC+FAL+ LHSS S T+LASASLSRRLRPP  S
Sbjct: 1   MKKKALLTPPRSLFWFSWKLLVTFSLALCIFALVSLHSSPSTTDLASASLSRRLRPPSDS 60

Query: 61  FLGRPKIAFLFLTRRNLPLDFLWGSFFQNGDVANFSIYIHSAPGFVFDESTTRSHFFFGR 120
           FLGRPKIAFLFLTRRNLPLDFLWGSFF+NGDVANFSIYIHSAPGFVFDESTTRSHFFFGR
Sbjct: 61  FLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSAPGFVFDESTTRSHFFFGR 120

Query: 121 QLENSIQVDWGKSSMIAAERFLLEAALEDTANQRFVLLSDSCVPLYNFSYIYSYLMASPR 180
           QLENSIQV WGKSSMIAAER LLEAALED ANQRF+LLSDSCVPLYNFSYIYSYLMASP+
Sbjct: 121 QLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFILLSDSCVPLYNFSYIYSYLMASPK 180

Query: 181 SFVDSFLDAKEGRYNPKMSPAITKGKWRKGSQWISLIRSHAEVIVDDDIIFPVFGLFCKR 240
           SFVDSFLDAKEGRYNPKMSPAI K KWRKGSQWISLIRSHAEV+VDDDIIFP+FGLFCKR
Sbjct: 181 SFVDSFLDAKEGRYNPKMSPAIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKR 240

Query: 241 RPPVDASKGAMNIKLQKQHNCIPDEHYVQTLLALNELEGELERRTLTYTLWNQSTNNMEN 300
           RPPVD SKG MN KLQKQHNCIPDEHYVQTLLALNELEGELERRT+TYTLWNQST  MEN
Sbjct: 241 RPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN 300

Query: 301 KGWHPITFNYAKAGPRQIEEIKGINHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAM 360
           KGWHPITF YA AGPRQ++EIKGI+HVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAM
Sbjct: 301 KGWHPITFTYANAGPRQVKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAM 360

Query: 361 RLLSEGVASDFDA 373
           RLLSEGV S FDA
Sbjct: 361 RLLSEGVVSHFDA 373
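The percentages in an HSP header follow from the match counts divided by the alignment length. The following minimal sketch recomputes them from a header line. The function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches 'Identity = 335/373 (89.81%)' and the Positives field;
# 'Pos\w*' tolerates spelling variants of the Positives label.
HSP_STAT = re.compile(r"(Identity|Pos\w*)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def hsp_percentages(line):
    """Return {field: (recomputed_percent, reported_percent)} for an HSP header."""
    stats = {}
    for name, matched, length, reported in HSP_STAT.findall(line):
        key = "identity" if name == "Identity" else "positives"
        recomputed = round(100 * int(matched) / int(length), 2)
        stats[key] = (recomputed, float(reported))
    return stats

header = "Identity = 335/373 (89.81%), Positives = 350/373 (93.83%), Query Frame = 1"
for field, (recomputed, reported) in hsp_percentages(header).items():
    print(field, recomputed, reported)
```

For the Cucumis sativus hit above, 335 identical positions over an alignment length of 373 reproduce the reported 89.81% identity.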

BLAST of Cp4.1LG10g09830 vs. TrEMBL
Match: E0CVP2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0116g00980 PE=4 SV=1)

HSP 1 Score: 572.0 bits (1473), Expect = 5.3e-160
Identity = 281/378 (74.34%), Positives = 309/378 (81.75%), Query Frame = 1

Query: 1   MKKKGPLTPGRNLIWFSWKLVITFSLALCVFALIRLHSSSRTNLASASLSRRLRPPPGS- 60
           M KK P    R++ WF WKLVI  S+ALCV AL+RL S+S   L+S SL     PP G  
Sbjct: 1   MTKKAPSFSIRHVFWFGWKLVILVSVALCVLALLRLQSNSE--LSSISL-----PPQGPR 60

Query: 61  ------FLGRPKIAFLFLTRRNLPLDFLWGSFFQNGDVANFSIYIHSAPGFVFDESTTRS 120
                 + G PKIAFLFL RR+LPLDFLWGSFF+N D ANFSIYIHS PGFVFDE+T+RS
Sbjct: 61  FYRVSVYQGNPKIAFLFLVRRSLPLDFLWGSFFENADAANFSIYIHSQPGFVFDETTSRS 120

Query: 121 HFFFGRQLENSIQVDWGKSSMIAAERFLLEAALEDTANQRFVLLSDSCVPLYNFSYIYSY 180
            FF+ RQL NSIQV WG+SSMI AER L EAALED ANQRFVLLSDSCVPLYNFSYIY+Y
Sbjct: 121 RFFYNRQLSNSIQVAWGESSMIQAERLLFEAALEDPANQRFVLLSDSCVPLYNFSYIYNY 180

Query: 181 LMASPRSFVDSFLDAKEGRYNPKMSPAITKGKWRKGSQWISLIRSHAEVIVDDDIIFPVF 240
           +MASPRS+VDSFLD KEGRYNPKMSP I K KWRKGSQWISL+RSHAEVIVDD +IF VF
Sbjct: 181 MMASPRSYVDSFLDVKEGRYNPKMSPVIPKAKWRKGSQWISLVRSHAEVIVDDQVIFSVF 240

Query: 241 GLFCKRRPPVDASKGAMNIKLQKQHNCIPDEHYVQTLLALNELEGELERRTLTYTLWNQS 300
             FCKRRPP+DA KG  NIKLQKQHNCIPDEHYVQTLLA++ELE ELERRTLTYT WN S
Sbjct: 241 KKFCKRRPPIDARKGKQNIKLQKQHNCIPDEHYVQTLLAMSELESELERRTLTYTEWNLS 300

Query: 301 TNNMENKGWHPITFNYAKAGPRQIEEIKGINHVYYETEFRTEWCRNNSTFVPCFLFARKF 360
              ME +GWHPITF+YA AGP++I+EIK +NHVYYETEFRTEWCR NST VPCFLFARKF
Sbjct: 301 VTKMEREGWHPITFSYANAGPQRIKEIKDVNHVYYETEFRTEWCRANSTSVPCFLFARKF 360

Query: 361 SQGAAMRLLSEGVASDFD 372
           S+GAAMRLLSEGV   FD
Sbjct: 361 SRGAAMRLLSEGVVGSFD 371

BLAST of Cp4.1LG10g09830 vs. TrEMBL
Match: A0A061GHY7_THECC (Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 2 OS=Theobroma cacao GN=TCM_030284 PE=4 SV=1)

HSP 1 Score: 569.7 bits (1467), Expect = 2.6e-159
Identity = 279/375 (74.40%), Positives = 308/375 (82.13%), Query Frame = 1

Query: 1   MKKKGPLTPGRNLIWFSWKLVITFSLALCVFALIRLHSS---SRTNLASASLSRRLRPPP 60
           MKK     P R ++W  WKLVI  S+ALC  AL+RLH S   S  N  S     R R   
Sbjct: 2   MKKLPAPVPARQVLWLGWKLVILLSVALCFVALLRLHFSPDLSSPNSLSRPARVRSRISG 61

Query: 61  GSFLGRPKIAFLFLTRRNLPLDFLWGSFFQNGDVANFSIYIHSAPGFVFDESTTRSHFFF 120
           G+F G PKIAFLFL R NLPLDFLWGSFF+N DVANFSIYIHSAPGFVFDESTTRS FF+
Sbjct: 62  GTFDGIPKIAFLFLARFNLPLDFLWGSFFENADVANFSIYIHSAPGFVFDESTTRSLFFY 121

Query: 121 GRQLENSIQVDWGKSSMIAAERFLLEAALEDTANQRFVLLSDSCVPLYNFSYIYSYLMAS 180
            RQL NSIQV WG+SSMI AER LLE+ALED ANQRFVLLSDSCVPLYNFSYIY YLM+S
Sbjct: 122 DRQLTNSIQVIWGESSMIEAERLLLESALEDPANQRFVLLSDSCVPLYNFSYIYRYLMSS 181

Query: 181 PRSFVDSFLDAKEGRYNPKMSPAITKGKWRKGSQWISLIRSHAEVIVDDDIIFPVFGLFC 240
            RSFVDSFLDAK+GRY+PKMSP I K KWRKGSQWISL+RSHAEVIVDD+++ PVF  FC
Sbjct: 182 SRSFVDSFLDAKDGRYHPKMSPVIPKEKWRKGSQWISLLRSHAEVIVDDEVVLPVFKKFC 241

Query: 241 KRRPPVDASKGAMNIKLQKQHNCIPDEHYVQTLLALNELEGELERRTLTYTLWNQSTNNM 300
           KRRPP+D  KG +NIKLQKQHNCIPDEHYVQTL A++ELEGELERRTLTYTLWNQS   M
Sbjct: 242 KRRPPMDTGKGKLNIKLQKQHNCIPDEHYVQTLFAMSELEGELERRTLTYTLWNQSAAKM 301

Query: 301 ENKGWHPITFNYAKAGPRQIEEIKGINHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGA 360
           +NK WHP+ FNYA A P++I+EIK INHVYYE+EFRTEWC+ NST VPCFLFARKFS+GA
Sbjct: 302 DNKAWHPVMFNYADASPKKIKEIKDINHVYYESEFRTEWCQTNSTSVPCFLFARKFSRGA 361

Query: 361 AMRLLSEGVASDFDA 373
           AMRLLSEGV   F+A
Sbjct: 362 AMRLLSEGVVGPFEA 376

BLAST of Cp4.1LG10g09830 vs. TrEMBL
Match: B9RXG6_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0903630 PE=4 SV=1)

HSP 1 Score: 568.5 bits (1464), Expect = 5.8e-159
Identity = 280/384 (72.92%), Positives = 316/384 (82.29%), Query Frame = 1

Query: 1   MKKKGPLTPGRNLIWFSWKLVITFSLALCVFALIRLH---------SSSRTNLASASLSR 60
           M KK P  P R++IW  WKLVI  S++LCVFAL+RLH         SSS ++ +S+S  R
Sbjct: 14  MTKKAPPVPPRHVIWLGWKLVIILSVSLCVFALLRLHFQSDHYSSPSSSSSSSSSSSFYR 73

Query: 61  ---RLRPPPGSFLGRPKIAFLFLTRRNLPLDFLWGSFFQNGDVANFSIYIHSAPGFVFDE 120
              RL      F G PK+AFLFL R++LPLDFLWGSFF+N DVA+FSI+IHS+PGF FDE
Sbjct: 74  PRSRLSRANLEFHGPPKLAFLFLVRQDLPLDFLWGSFFENADVASFSIFIHSSPGFEFDE 133

Query: 121 STTRSHFFFGRQLENSIQVDWGKSSMIAAERFLLEAALEDTANQRFVLLSDSCVPLYNFS 180
           STTRSHFF+GRQL+NSIQV WG+SSMI AER LL AALED ANQRFVLLSDSCVPLYNFS
Sbjct: 134 STTRSHFFYGRQLKNSIQVAWGESSMIEAERLLLSAALEDPANQRFVLLSDSCVPLYNFS 193

Query: 181 YIYSYLMASPRSFVDSFLDAKEGRYNPKMSPAITKGKWRKGSQWISLIRSHAEVIVDDDI 240
           YIYSY+MASPRSFVDSFLD KE RYN KMSP I K KWRKGSQWI+LIRSHAEVIVDD++
Sbjct: 194 YIYSYVMASPRSFVDSFLDTKEDRYNQKMSPIIQKHKWRKGSQWITLIRSHAEVIVDDEV 253

Query: 241 IFPVFGLFCKRRPPVDASKGAMNIKLQKQHNCIPDEHYVQTLLALNELEGELERRTLTYT 300
           IFP F  +CKRR P+DASKG +N KLQKQ+NCIPDEHYVQTLL++ ELEGELERRTLTYT
Sbjct: 254 IFPEFQKYCKRRLPLDASKGKLNAKLQKQNNCIPDEHYVQTLLSMAELEGELERRTLTYT 313

Query: 301 LWNQSTNNMENKGWHPITFNYAKAGPRQIEEIKGINHVYYETEFRTEWCRNNSTFVPCFL 360
           +WN S   ME+KGWHP+TF Y  AGP++I EIK INHVYYETE+RTEWC  NST VPCFL
Sbjct: 314 VWNLSVTRMESKGWHPMTFTYGNAGPQKIREIKAINHVYYETEYRTEWCHTNSTSVPCFL 373

Query: 361 FARKFSQGAAMRLLSEGVASDFDA 373
           FARKFS+GAAMRLLSEGV S FDA
Sbjct: 374 FARKFSRGAAMRLLSEGVVSPFDA 397

BLAST of Cp4.1LG10g09830 vs. TrEMBL
Match: B9HZ98_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0011s01410g PE=4 SV=2)

HSP 1 Score: 567.8 bits (1462), Expect = 9.9e-159
Identity = 278/378 (73.54%), Positives = 308/378 (81.48%), Query Frame = 1

Query: 1   MKKKGPLTP------GRNLIWFSWKLVITFSLALCVFALIRLHSSSRTNLASASLSRRLR 60
           M KK  L P       R +IW  WKLVI  S+ LCVFAL R+H SS      +   RR  
Sbjct: 1   MTKKSSLLPILLQQSRRRVIWSGWKLVIILSMGLCVFALFRIHLSSPPETLLSR--RRSF 60

Query: 61  PPPGSFLGRPKIAFLFLTRRNLPLDFLWGSFFQNGDVANFSIYIHSAPGFVFDESTTRSH 120
                F G PK+AFLFL RR LPLDFLWGSFF+N D  NFSI++HS PGF FDESTTRSH
Sbjct: 61  SREVVFSGPPKVAFLFLVRRGLPLDFLWGSFFENADTGNFSIHVHSEPGFEFDESTTRSH 120

Query: 121 FFFGRQLENSIQVDWGKSSMIAAERFLLEAALEDTANQRFVLLSDSCVPLYNFSYIYSYL 180
           FF+GRQL+NSIQV WG+SSMI AER LL+AALED ANQRFVLLSDSCVPLYNFSYIYSYL
Sbjct: 121 FFYGRQLKNSIQVIWGESSMIEAERLLLDAALEDPANQRFVLLSDSCVPLYNFSYIYSYL 180

Query: 181 MASPRSFVDSFLDAKEGRYNPKMSPAITKGKWRKGSQWISLIRSHAEVIVDDDIIFPVFG 240
           MASPRSFVDSFLD KEGRY+PKMSP I K KWRKGSQWI+LIRSHAEVIVDD +I PVF 
Sbjct: 181 MASPRSFVDSFLDVKEGRYHPKMSPVIPKDKWRKGSQWIALIRSHAEVIVDDVVILPVFK 240

Query: 241 LFCKRRPPVDASKGAMNIKLQKQHNCIPDEHYVQTLLALNELEGELERRTLTYTLWNQST 300
             CKRRPP+DA+KG +NIKLQKQHNCIPDEHYVQTLL+++ LEGELERRT+TYT+WNQS 
Sbjct: 241 KLCKRRPPLDATKGKLNIKLQKQHNCIPDEHYVQTLLSMSGLEGELERRTVTYTVWNQSA 300

Query: 301 NNMENKGWHPITFNYAKAGPRQIEEIKGINHVYYETEFRTEWCRNNSTFVPCFLFARKFS 360
             MENKGWHP TF+YA A PR+I+EIKGINH+ YETE+RTEWCR NSTFVPCFLFARKFS
Sbjct: 301 TKMENKGWHPKTFSYANASPRKIKEIKGINHIDYETEYRTEWCRTNSTFVPCFLFARKFS 360

Query: 361 QGAAMRLLSEGVASDFDA 373
           +GAAMRLLS+GV   FDA
Sbjct: 361 RGAAMRLLSDGVTGPFDA 376

BLAST of Cp4.1LG10g09830 vs. TAIR10
Match: AT1G62305.1 (AT1G62305.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein)

HSP 1 Score: 474.6 bits (1220), Expect = 5.8e-134
Identity = 236/362 (65.19%), Positives = 282/362 (77.90%), Query Frame = 1

Query: 11  RNLIWFSWKLVITFSLALCVFALIRLH----SSSRTNLASASLS-RRLRPPPGSFLG-RP 70
           R ++WF WK++IT S ALC+ AL  ++    S++ T   S+SLS  R R P   + G RP
Sbjct: 9   RGVVWFRWKILITISTALCILALFCINRQSNSTATTTTLSSSLSVARSRIPLVKYSGDRP 68

Query: 71  KIAFLFLTRRNLPLDFLWGSFFQNGDVANFSIYIHSAPGFVFDESTTRSHFFFGRQLENS 130
           K+AFLFL RR+LPLDFLW  FF++ D  NFSIY+HS PGFVFDES+TRSHFF+ RQL+NS
Sbjct: 69  KLAFLFLARRDLPLDFLWDRFFKSADQRNFSIYVHSIPGFVFDESSTRSHFFYNRQLKNS 128

Query: 131 IQVDWGKSSMIAAERFLLEAALEDTANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDS 190
           I+V WG+SSMIAAER LL +ALED +NQRFVLLSDSCVPLY+F YIY YL++SP+SFVDS
Sbjct: 129 IEVVWGESSMIAAERLLLASALEDPSNQRFVLLSDSCVPLYDFGYIYRYLVSSPKSFVDS 188

Query: 191 FLDAKEGRYNPKMSPAITKGKWRKGSQWISLIRSHAEVIVDDDIIFPVFGLFCKRRPPVD 250
           FLD K+ RY  KM P I K KWRKGSQWISLIRSHAEVIV+DD +FPVF  FCKR  P+D
Sbjct: 189 FLD-KDNRYTMKMFPVIRKEKWRKGSQWISLIRSHAEVIVNDDTVFPVFQKFCKRSLPLD 248

Query: 251 ASKGAMNIKLQKQHNCIPDEHYVQTLLALNELEGELERRTLTYTLWNQSTNNMENKGWHP 310
             K  + +K +++HNCIPDEHYVQTLL +  LE E+ERRT+TYT WN S    E K WHP
Sbjct: 249 PRKNWLYLK-KRRHNCIPDEHYVQTLLTMRGLENEMERRTVTYTTWNLSAKKAEAKSWHP 308

Query: 311 ITFNYAKAGPRQIEEIKGINHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLLSE 367
           +TF     GP +IE IK INHVYYE+E+RTEWCR NS  VPCFLFARKF++GAAMRLLSE
Sbjct: 309 LTFTSDNCGPEEIEGIKKINHVYYESEYRTEWCRANSKPVPCFLFARKFTRGAAMRLLSE 368

BLAST of Cp4.1LG10g09830 vs. TAIR10
Match: AT1G11940.1 (AT1G11940.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein)

HSP 1 Score: 455.3 bits (1170), Expect = 3.6e-128
Identity = 225/364 (61.81%), Positives = 276/364 (75.82%), Query Frame = 1

Query: 6   PLTPGRNLIWFSWKLVITFSLALCVFALIR--LHSSSRTNLASASLSRRLRPPPGSFLG- 65
           PL+    ++W  WKLVI FS+ALC+ AL+R  L  +S T L+      R + P   + G 
Sbjct: 12  PLSRRGGVVWLGWKLVIAFSVALCLLALLRIQLQYNSFTTLSFPLSVARSQTPLHKYSGD 71

Query: 66  RPKIAFLFLTRRNLPLDFLWGSFFQNGDVANFSIYIHSAPGFVFDESTTRSHFFFGRQLE 125
           RPK+AFLFL RR+LPLDF+W  FF+  D ANFSIYIHS PGFVF+E TTRS +F+ RQL 
Sbjct: 72  RPKLAFLFLARRDLPLDFMWDRFFKGVDHANFSIYIHSVPGFVFNEETTRSQYFYNRQLN 131

Query: 126 NSIQVDWGKSSMIAAERFLLEAALEDTANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFV 185
           NSI+V WG+SSMI AER LL +ALED +NQRFVLLSD C PLY+F YIY YL++SPRSFV
Sbjct: 132 NSIKVVWGESSMIEAERLLLASALEDHSNQRFVLLSDRCAPLYDFGYIYKYLISSPRSFV 191

Query: 186 DSFLDAKEGRYNPKMSPAITKGKWRKGSQWISLIRSHAEVIVDDDIIFPVFGLFCKRRPP 245
           DSFL  KE RY+ KMSP I + KWRKGSQWI+LIRSHAEVIV+D I+FPVF  FCKR PP
Sbjct: 192 DSFLHTKETRYSVKMSPVIPEEKWRKGSQWIALIRSHAEVIVNDGIVFPVFKEFCKRCPP 251

Query: 246 VDASKGAMNIKLQKQHNCIPDEHYVQTLLALNELEGELERRTLTYTLWNQSTNNMENKGW 305
           +  ++  + +K QK+ NCIPDEHYVQTLL +  LE E+ERRT+TYT+WN S    E K W
Sbjct: 252 LGTNEAWLFLK-QKRRNCIPDEHYVQTLLTMQGLESEMERRTVTYTVWNVSGTKYEAKSW 311

Query: 306 HPITFNYAKAGPRQIEEIKGINHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAMRLL 365
           HP+TF    +GP +I+EIK I+HVYYE+E RTEWC+ +S  VPCFLFARKF+  AAMR++
Sbjct: 312 HPVTFTLENSGPEEIKEIKKIDHVYYESESRTEWCKADSKPVPCFLFARKFTNEAAMRIV 371

Query: 366 SEGV 367
           SEG+
Sbjct: 372 SEGL 374

BLAST of Cp4.1LG10g09830 vs. TAIR10
Match: AT5G14550.1 (AT5G14550.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein)

HSP 1 Score: 359.0 bits (920), Expect = 3.5e-99
Identity = 185/375 (49.33%), Positives = 250/375 (66.67%), Query Frame = 1

Query: 1   MKKKGPLTPGRNLIWFSWK------LVITFSLALCVFALIRLHS-SSRTNLASASLSRRL 60
           MKKK      +  + + WK      L+  F     VF   R  S  +R N  SASL    
Sbjct: 1   MKKK----VSQQKLLYRWKRKVYATLMFAFCFGTFVFIQARFASIQARFNRISASLDSLK 60

Query: 61  RPPPGSFLGRPKIAFLFLTRRNLPLDFLWGSFFQNGDVANFSIYIHSAPGFVFDESTTRS 120
           +P       RP+IAFLF+ R  LPL+F+W +FF+ G+   FSIY+HS PGFV +E+TTRS
Sbjct: 61  KPRLDQ---RPQIAFLFIARNRLPLEFVWDAFFK-GEDGKFSIYVHSRPGFVLNEATTRS 120

Query: 121 HFFFGRQLENSIQVDWGKSSMIAAERFLLEAALEDTANQRFVLLSDSCVPLYNFSYIYSY 180
            +F  RQL +SIQVDWG+S+MI AER LL  AL D+ N RFV LSDSC+PLY+FSY Y+Y
Sbjct: 121 KYFLDRQLNDSIQVDWGESTMIEAERVLLRHALRDSFNHRFVFLSDSCIPLYSFSYTYNY 180

Query: 181 LMASPRSFVDSFLDAKEGRYNPKMSPAITKGKWRKGSQWISLIRSHAEVIVDDDIIFPVF 240
           +M++P SFVDSF D K+ RYNP+M+P I    WRKGSQW+ L R HAE++V+D  +FP+F
Sbjct: 181 IMSTPTSFVDSFADTKDSRYNPRMNPIIPVRNWRKGSQWVVLNRKHAEIVVNDTSVFPMF 240

Query: 241 GLFCKRRP-PVDASKGAMNIKLQKQHNCIPDEHYVQTLLALNELEGELERRTLTYTLWN- 300
              C+R+  P       +  +  K+HNCIPDEHYVQTLL+   ++ EL RR+LT++ W+ 
Sbjct: 241 QQHCRRKSLPEFWRDRPVPAEGWKEHNCIPDEHYVQTLLSQKGVDSELTRRSLTHSAWDL 300

Query: 301 QSTNNMENKGWHPITFNYAKAGPRQIEEIKGINHVYYETEFRTEWCRNNSTFVPCFLFAR 360
            S+ + E +GWHP+T+ ++ A P  I+ IKGI+++ YETE+R EWC +     PCFLFAR
Sbjct: 301 SSSKSNERRGWHPMTYKFSDATPDLIQSIKGIDNINYETEYRREWCSSKGKPSPCFLFAR 360

Query: 361 KFSQGAAMRLLSEGV 367
           KF++ AA+RLL E +
Sbjct: 361 KFTRPAALRLLRETI 367

BLAST of Cp4.1LG10g09830 vs. TAIR10
Match: AT4G31350.1 (AT4G31350.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein)

HSP 1 Score: 185.7 bits (470), Expect = 5.4e-47
Identity = 116/314 (36.94%), Positives = 166/314 (52.87%), Query Frame = 1

Query: 57  PGSFLGRPKIAFLFLTRRNLPLDFLWGSFFQNGDVANFSIYIHSAPGFVFDESTTRSHFF 116
           P S    PK+AF+FLT   LP + LW  FF+ G    FS+Y+H++           S +F
Sbjct: 81  PQSKTANPKLAFMFLTPGTLPFEPLWEMFFR-GHENKFSVYVHASK----KSPVHTSSYF 140

Query: 117 FGRQLENSIQVDWGKSSMIAAERFLLEAALEDTANQRFVLLSDSCVPLYNFSYIYSYLMA 176
            GR + +S +V WG+ SM+ AER LL  AL D  NQ F+LLSDSCVPL++F+YIY++L+ 
Sbjct: 141 VGRDI-HSHKVAWGQISMVDAERRLLAHALVDPDNQHFILLSDSCVPLFDFNYIYNHLIF 200

Query: 177 SPRSFVDSFLDA---KEGRYNPKMSPAITKGKWRKGSQWISLIRSHAEVIVDDDIIFPVF 236
           +  SF+D F D      GRY+  M P + K  +RKGSQW S+ R HA V++ D + +  F
Sbjct: 201 ANLSFIDCFEDPGPHGSGRYSQHMLPEVEKKDFRKGSQWFSMKRRHAIVVMADSLYYTKF 260

Query: 237 GLFCKRRPPVDASKGAMNIKLQKQHNCIPDEHYVQTLLALNELEGELERRTLTYTLWNQS 296
            L+C  RP ++              NC  DEHY  TL  + + +G +   ++T+  W++ 
Sbjct: 261 KLYC--RPNMEG------------RNCYADEHYFPTLFNMIDPDG-IANWSVTHVDWSEG 320

Query: 297 TNNMENKGWHPITFNYAKAGPRQIEEIKGINHVYYETE-----FRTEWCRNNSTFVPCFL 356
                   WHP  +N     P  I +IK I   Y+ T         + C       PC+L
Sbjct: 321 K-------WHPKLYNARDITPYLIRKIKSIQLAYHVTSDLKKVTTVKPCLWKGEQRPCYL 366

Query: 357 FARKFSQGAAMRLL 363
           FARKF+     RL+
Sbjct: 381 FARKFNPETLDRLM 366

BLAST of Cp4.1LG10g09830 vs. TAIR10
Match: AT1G73810.1 (AT1G73810.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein)

HSP 1 Score: 180.3 bits (456), Expect = 2.3e-45
Identity = 116/302 (38.41%), Positives = 155/302 (51.32%), Query Frame = 1

Query: 65  KIAFLFLTRRNLPLDFLWGSFFQNGDVANFSIYIHSAPGFVFDESTTRSHFFFGRQLENS 124
           K AF+FLTR  LPL  LW  FF+ G    FSIYIH++  F FD+ T  +  F+ R++ + 
Sbjct: 147 KAAFMFLTRGKLPLAKLWERFFK-GHEGLFSIYIHTSDPFYFDDHTPETSPFYRRRIPSK 206

Query: 125 IQVDWGKSSMIAAERFLLEAALEDTANQRFVLLSDSCVPLYNFSYIYSYLMASPRSFVDS 184
            +V WG  SM+AAER LL  AL D  N RFVLLS+S +PL+NFS IYSYL+ S  S+VD 
Sbjct: 207 -EVGWGMVSMVAAERRLLANALLDAGNHRFVLLSESDIPLFNFSTIYSYLINSQHSYVDV 266

Query: 185 F---LDAKEGRYNPKMSPAITKGKWRKGSQWISLIRSHAEVIVDDDIIFPVFGLFCKRRP 244
           +     A  GRYN +MSP I++  WRKGSQW  + R  A  +V D   FPVF  +C    
Sbjct: 267 YDLPGPAGRGRYNRRMSPVISRTNWRKGSQWFEIDREVALAVVSDTTYFPVFEKYC---- 326

Query: 245 PVDASKGAMNIKLQKQHNCIPDEHYVQTLLALNELEGELERRTLTYTLWNQSTNNMENKG 304
                            NC  DEHY+ T +      G+   R+LT+T W++       +G
Sbjct: 327 ---------------LWNCYADEHYLSTFVHA-MFPGKNANRSLTWTDWSR-------RG 386

Query: 305 WHPITFNYAKAGPRQIEEIKGINHVYYETEFRTEWC-RNNSTFVPCFLFARKFSQGAAMR 363
            HP  +         +  ++           R + C  N      C+LFARKF      +
Sbjct: 387 PHPRKYTRRSVTGEFLRRVRN----------REQGCVYNGKKSEKCYLFARKFDGSTLDK 409

BLAST of Cp4.1LG10g09830 vs. NCBI nr
Match: gi|449433986|ref|XP_004134777.1| (PREDICTED: uncharacterized protein LOC101222689 [Cucumis sativus])

HSP 1 Score: 693.0 bits (1787), Expect = 2.9e-196
Identity = 335/373 (89.81%), Positives = 350/373 (93.83%), Query Frame = 1

Query: 1   MKKKGPLTPGRNLIWFSWKLVITFSLALCVFALIRLHSS-SRTNLASASLSRRLRPPPGS 60
           MKKK  LTP R+L WFSWKL++TFSLALC+FAL+ LHSS S T+LASASLSRRLRPP  S
Sbjct: 1   MKKKALLTPPRSLFWFSWKLLVTFSLALCIFALVSLHSSPSTTDLASASLSRRLRPPSDS 60

Query: 61  FLGRPKIAFLFLTRRNLPLDFLWGSFFQNGDVANFSIYIHSAPGFVFDESTTRSHFFFGR 120
           FLGRPKIAFLFLTRRNLPLDFLWGSFF+NGDVANFSIYIHSAPGFVFDESTTRSHFFFGR
Sbjct: 61  FLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSAPGFVFDESTTRSHFFFGR 120

Query: 121 QLENSIQVDWGKSSMIAAERFLLEAALEDTANQRFVLLSDSCVPLYNFSYIYSYLMASPR 180
           QLENSIQV WGKSSMIAAER LLEAALED ANQRF+LLSDSCVPLYNFSYIYSYLMASP+
Sbjct: 121 QLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFILLSDSCVPLYNFSYIYSYLMASPK 180

Query: 181 SFVDSFLDAKEGRYNPKMSPAITKGKWRKGSQWISLIRSHAEVIVDDDIIFPVFGLFCKR 240
           SFVDSFLDAKEGRYNPKMSPAI K KWRKGSQWISLIRSHAEV+VDDDIIFP+FGLFCKR
Sbjct: 181 SFVDSFLDAKEGRYNPKMSPAIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKR 240

Query: 241 RPPVDASKGAMNIKLQKQHNCIPDEHYVQTLLALNELEGELERRTLTYTLWNQSTNNMEN 300
           RPPVD SKG MN KLQKQHNCIPDEHYVQTLLALNELEGELERRT+TYTLWNQST  MEN
Sbjct: 241 RPPVDESKGIMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN 300

Query: 301 KGWHPITFNYAKAGPRQIEEIKGINHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAM 360
           KGWHPITF YA AGPRQ++EIKGI+HVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAM
Sbjct: 301 KGWHPITFTYANAGPRQVKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAM 360

Query: 361 RLLSEGVASDFDA 373
           RLLSEGV S FDA
Sbjct: 361 RLLSEGVVSHFDA 373

BLAST of Cp4.1LG10g09830 vs. NCBI nr
Match: gi|659079022|ref|XP_008440033.1| (PREDICTED: uncharacterized protein LOC103484630 isoform X1 [Cucumis melo])

HSP 1 Score: 686.0 bits (1769), Expect = 3.6e-194
Identity = 334/373 (89.54%), Positives = 347/373 (93.03%), Query Frame = 1

Query: 1   MKKKGPLTPGRNLIWFSWKLVITFSLALCVFALIRLHSS-SRTNLASASLSRRLRPPPGS 60
           MKKK  LTP R L WFSWKL++ FSLALC+ ALI LHSS S T+LA+ASLSRR RPP  S
Sbjct: 1   MKKKALLTPPRRLFWFSWKLLVAFSLALCILALISLHSSPSTTDLAAASLSRRSRPPSDS 60

Query: 61  FLGRPKIAFLFLTRRNLPLDFLWGSFFQNGDVANFSIYIHSAPGFVFDESTTRSHFFFGR 120
           FLGRPKIAFLFLTRRNLPLDFLWGSFF+NGDVANFSIYIHSAPGFVFDESTTRSHFFFGR
Sbjct: 61  FLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSAPGFVFDESTTRSHFFFGR 120

Query: 121 QLENSIQVDWGKSSMIAAERFLLEAALEDTANQRFVLLSDSCVPLYNFSYIYSYLMASPR 180
           QLENSIQV WGKSSMIAAER LLEAALED ANQRFVLLSDSCVPLYNFSYIYSYL+ASP+
Sbjct: 121 QLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYIYSYLIASPK 180

Query: 181 SFVDSFLDAKEGRYNPKMSPAITKGKWRKGSQWISLIRSHAEVIVDDDIIFPVFGLFCKR 240
           SFVDSFLDAKEGRYNPKMSPAI K KWRKGSQWISLIRSHAEV+VDDDIIFP+FGLFCKR
Sbjct: 181 SFVDSFLDAKEGRYNPKMSPAIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKR 240

Query: 241 RPPVDASKGAMNIKLQKQHNCIPDEHYVQTLLALNELEGELERRTLTYTLWNQSTNNMEN 300
           RPPVDASKG MN KLQKQHNCIPDEHYVQTLLALNELEGELERRT+TYTLWNQST  MEN
Sbjct: 241 RPPVDASKGNMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN 300

Query: 301 KGWHPITFNYAKAGPRQIEEIKGINHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAM 360
           KGWHPITF YA AGPRQI+EIKGI+HVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAM
Sbjct: 301 KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAM 360

Query: 361 RLLSEGVASDFDA 373
           RLLSEGV S FDA
Sbjct: 361 RLLSEGVVSHFDA 373

BLAST of Cp4.1LG10g09830 vs. NCBI nr
Match: gi|659079026|ref|XP_008440035.1| (PREDICTED: uncharacterized protein LOC103484630 isoform X2 [Cucumis melo])

HSP 1 Score: 626.3 bits (1614), Expect = 3.4e-176
Identity = 312/373 (83.65%), Positives = 323/373 (86.60%), Query Frame = 1

Query: 1   MKKKGPLTPGRNLIWFSWKLVITFSLALCVFALIRLHSS-SRTNLASASLSRRLRPPPGS 60
           MKKK  LTP R L WFSWKL++ FSLALC+ ALI LHSS S T+LA+ASLSRR RPP  S
Sbjct: 1   MKKKALLTPPRRLFWFSWKLLVAFSLALCILALISLHSSPSTTDLAAASLSRRSRPPSDS 60

Query: 61  FLGRPKIAFLFLTRRNLPLDFLWGSFFQNGDVANFSIYIHSAPGFVFDESTTRSHFFFGR 120
           FLGRPKIAFLFLTRRNLPLDFLWGSFF+NGDVANFSIYIHSAPGFVFDESTTRSHFFFGR
Sbjct: 61  FLGRPKIAFLFLTRRNLPLDFLWGSFFENGDVANFSIYIHSAPGFVFDESTTRSHFFFGR 120

Query: 121 QLENSIQVDWGKSSMIAAERFLLEAALEDTANQRFVLLSDSCVPLYNFSYIYSYLMASPR 180
           QLENSIQV WGKSSMIAAER LLEAALED ANQRFVLLSD                    
Sbjct: 121 QLENSIQVAWGKSSMIAAERLLLEAALEDPANQRFVLLSD-------------------- 180

Query: 181 SFVDSFLDAKEGRYNPKMSPAITKGKWRKGSQWISLIRSHAEVIVDDDIIFPVFGLFCKR 240
               SFLDAKEGRYNPKMSPAI K KWRKGSQWISLIRSHAEV+VDDDIIFP+FGLFCKR
Sbjct: 181 ----SFLDAKEGRYNPKMSPAIPKSKWRKGSQWISLIRSHAEVVVDDDIIFPIFGLFCKR 240

Query: 241 RPPVDASKGAMNIKLQKQHNCIPDEHYVQTLLALNELEGELERRTLTYTLWNQSTNNMEN 300
           RPPVDASKG MN KLQKQHNCIPDEHYVQTLLALNELEGELERRT+TYTLWNQST  MEN
Sbjct: 241 RPPVDASKGNMNTKLQKQHNCIPDEHYVQTLLALNELEGELERRTVTYTLWNQSTTKMEN 300

Query: 301 KGWHPITFNYAKAGPRQIEEIKGINHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAM 360
           KGWHPITF YA AGPRQI+EIKGI+HVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAM
Sbjct: 301 KGWHPITFTYANAGPRQIKEIKGIDHVYYETEFRTEWCRNNSTFVPCFLFARKFSQGAAM 349

Query: 361 RLLSEGVASDFDA 373
           RLLSEGV S FDA
Sbjct: 361 RLLSEGVVSHFDA 349

BLAST of Cp4.1LG10g09830 vs. NCBI nr
Match: gi|731404263|ref|XP_010655375.1| (PREDICTED: uncharacterized protein LOC100262450 isoform X2 [Vitis vinifera])

HSP 1 Score: 572.0 bits (1473), Expect = 7.5e-160
Identity = 281/378 (74.34%), Positives = 309/378 (81.75%), Query Frame = 1

Query: 1   MKKKGPLTPGRNLIWFSWKLVITFSLALCVFALIRLHSSSRTNLASASLSRRLRPPPGS- 60
           M KK P    R++ WF WKLVI  S+ALCV AL+RL S+S   L+S SL     PP G  
Sbjct: 1   MTKKAPSFSIRHVFWFGWKLVILVSVALCVLALLRLQSNSE--LSSISL-----PPQGPR 60

Query: 61  ------FLGRPKIAFLFLTRRNLPLDFLWGSFFQNGDVANFSIYIHSAPGFVFDESTTRS 120
                 + G PKIAFLFL RR+LPLDFLWGSFF+N D ANFSIYIHS PGFVFDE+T+RS
Sbjct: 61  FYRVSVYQGNPKIAFLFLVRRSLPLDFLWGSFFENADAANFSIYIHSQPGFVFDETTSRS 120

Query: 121 HFFFGRQLENSIQVDWGKSSMIAAERFLLEAALEDTANQRFVLLSDSCVPLYNFSYIYSY 180
            FF+ RQL NSIQV WG+SSMI AER L EAALED ANQRFVLLSDSCVPLYNFSYIY+Y
Sbjct: 121 RFFYNRQLSNSIQVAWGESSMIQAERLLFEAALEDPANQRFVLLSDSCVPLYNFSYIYNY 180

Query: 181 LMASPRSFVDSFLDAKEGRYNPKMSPAITKGKWRKGSQWISLIRSHAEVIVDDDIIFPVF 240
           +MASPRS+VDSFLD KEGRYNPKMSP I K KWRKGSQWISL+RSHAEVIVDD +IF VF
Sbjct: 181 MMASPRSYVDSFLDVKEGRYNPKMSPVIPKAKWRKGSQWISLVRSHAEVIVDDQVIFSVF 240

Query: 241 GLFCKRRPPVDASKGAMNIKLQKQHNCIPDEHYVQTLLALNELEGELERRTLTYTLWNQS 300
             FCKRRPP+DA KG  NIKLQKQHNCIPDEHYVQTLLA++ELE ELERRTLTYT WN S
Sbjct: 241 KKFCKRRPPIDARKGKQNIKLQKQHNCIPDEHYVQTLLAMSELESELERRTLTYTEWNLS 300

Query: 301 TNNMENKGWHPITFNYAKAGPRQIEEIKGINHVYYETEFRTEWCRNNSTFVPCFLFARKF 360
              ME +GWHPITF+YA AGP++I+EIK +NHVYYETEFRTEWCR NST VPCFLFARKF
Sbjct: 301 VTKMEREGWHPITFSYANAGPQRIKEIKDVNHVYYETEFRTEWCRANSTSVPCFLFARKF 360

Query: 361 SQGAAMRLLSEGVASDFD 372
           S+GAAMRLLSEGV   FD
Sbjct: 361 SRGAAMRLLSEGVVGSFD 371

BLAST of Cp4.1LG10g09830 vs. NCBI nr
Match: gi|731404261|ref|XP_002264137.2| (PREDICTED: uncharacterized protein LOC100262450 isoform X1 [Vitis vinifera])

HSP 1 Score: 572.0 bits (1473), Expect = 7.5e-160
Identity = 281/378 (74.34%), Positives = 309/378 (81.75%), Query Frame = 1

Query: 1   MKKKGPLTPGRNLIWFSWKLVITFSLALCVFALIRLHSSSRTNLASASLSRRLRPPPGS- 60
           M KK P    R++ WF WKLVI  S+ALCV AL+RL S+S   L+S SL     PP G  
Sbjct: 14  MTKKAPSFSIRHVFWFGWKLVILVSVALCVLALLRLQSNSE--LSSISL-----PPQGPR 73

Query: 61  ------FLGRPKIAFLFLTRRNLPLDFLWGSFFQNGDVANFSIYIHSAPGFVFDESTTRS 120
                 + G PKIAFLFL RR+LPLDFLWGSFF+N D ANFSIYIHS PGFVFDE+T+RS
Sbjct: 74  FYRVSVYQGNPKIAFLFLVRRSLPLDFLWGSFFENADAANFSIYIHSQPGFVFDETTSRS 133

Query: 121 HFFFGRQLENSIQVDWGKSSMIAAERFLLEAALEDTANQRFVLLSDSCVPLYNFSYIYSY 180
            FF+ RQL NSIQV WG+SSMI AER L EAALED ANQRFVLLSDSCVPLYNFSYIY+Y
Sbjct: 134 RFFYNRQLSNSIQVAWGESSMIQAERLLFEAALEDPANQRFVLLSDSCVPLYNFSYIYNY 193

Query: 181 LMASPRSFVDSFLDAKEGRYNPKMSPAITKGKWRKGSQWISLIRSHAEVIVDDDIIFPVF 240
           +MASPRS+VDSFLD KEGRYNPKMSP I K KWRKGSQWISL+RSHAEVIVDD +IF VF
Sbjct: 194 MMASPRSYVDSFLDVKEGRYNPKMSPVIPKAKWRKGSQWISLVRSHAEVIVDDQVIFSVF 253

Query: 241 GLFCKRRPPVDASKGAMNIKLQKQHNCIPDEHYVQTLLALNELEGELERRTLTYTLWNQS 300
             FCKRRPP+DA KG  NIKLQKQHNCIPDEHYVQTLLA++ELE ELERRTLTYT WN S
Sbjct: 254 KKFCKRRPPIDARKGKQNIKLQKQHNCIPDEHYVQTLLAMSELESELERRTLTYTEWNLS 313

Query: 301 TNNMENKGWHPITFNYAKAGPRQIEEIKGINHVYYETEFRTEWCRNNSTFVPCFLFARKF 360
              ME +GWHPITF+YA AGP++I+EIK +NHVYYETEFRTEWCR NST VPCFLFARKF
Sbjct: 314 VTKMEREGWHPITFSYANAGPQRIKEIKDVNHVYYETEFRTEWCRANSTSVPCFLFARKF 373

Query: 361 SQGAAMRLLSEGVASDFD 372
           S+GAAMRLLSEGV   FD
Sbjct: 374 SRGAAMRLLSEGVVGSFD 384

The following BLAST results are available for this feature:
TrEMBL
Match Name          E-value    Identity (%)  Description
A0A0A0KHS2_CUCSA    2.0e-196   89.81         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G513470 PE=4 SV=1
E0CVP2_VITVI        5.3e-160   74.34         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0116g00980 PE=4 SV=...
A0A061GHY7_THECC    2.6e-159   74.40         Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isofo...
B9RXG6_RICCO        5.8e-159   72.92         Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0903630 PE=4 SV=1
B9HZ98_POPTR        9.9e-159   73.54         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0011s01410g PE=4 SV=2

TAIR10
Match Name          E-value    Identity (%)  Description
AT1G62305.1         5.8e-134   65.19         Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family p...
AT1G11940.1         3.6e-128   61.81         Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family p...
AT5G14550.1         3.5e-99    49.33         Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family p...
AT4G31350.1         5.4e-47    36.94         Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family p...
AT1G73810.1         2.3e-45    38.41         Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family p...

NCBI nr
Match Name                          E-value    Identity (%)  Description
gi|449433986|ref|XP_004134777.1|    2.9e-196   89.81         PREDICTED: uncharacterized protein LOC101222689 [Cucumis sativus]
gi|659079022|ref|XP_008440033.1|    3.6e-194   89.54         PREDICTED: uncharacterized protein LOC103484630 isoform X1 [Cucumis melo]
gi|659079026|ref|XP_008440035.1|    3.4e-176   83.65         PREDICTED: uncharacterized protein LOC103484630 isoform X2 [Cucumis melo]
gi|731404263|ref|XP_010655375.1|    7.5e-160   74.34         PREDICTED: uncharacterized protein LOC100262450 isoform X2 [Vitis vinifera]
gi|731404261|ref|XP_002264137.2|    7.5e-160   74.34         PREDICTED: uncharacterized protein LOC100262450 isoform X1 [Vitis vinifera]
The following terms have been associated with this gene:
Vocabulary: Cellular Component
Term          Definition
GO:0016020    membrane
Vocabulary: Molecular Function
Term          Definition
GO:0008375    acetylglucosaminyltransferase activity
Vocabulary: INTERPRO
Term          Definition
IPR003406     Glyco_trans_14
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0016020 membrane
molecular_function GO:0008375 acetylglucosaminyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG10g09830.1    Cp4.1LG10g09830.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                  Source   Source Term    Source Description   Alignment
IPR003406  Glycosyl transferase, family 14  PFAM     PF02485        Branch               coord: 66..321, score: 8.1
None       No IPR available                 PANTHER  PTHR31042      FAMILY NOT NAMED     coord: 46..241, score: 1.3E-172; coord: 257..364, score: 1.3E
None       No IPR available                 PANTHER  PTHR31042:SF2  CORE-2/I-BRANCHING BETA-1,6-N-ACETYLGLUCOSAMINYLTRANSFERASE FAMILY PROTEIN  coord: 257..364, score: 1.3E-172; coord: 46..241, score: 1.3E