CSPI04G09440 (gene) Wild cucumber (PI 183967)

Name: CSPI04G09440
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Glycosyltransferase AER61, uncharacterized
Location: Chr4 : 7384496 .. 7389210 (+)
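The Location field can be read programmatically to get the chromosome, strand, and span of the gene. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(text):
    """Parse a string like 'Chr4 : 7384496 .. 7389210 (+)'.

    Returns (seqid, start, end, strand). The accepted layout is taken
    from this record only; other pages may format locations differently.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


seqid, start, end, strand = parse_location("Chr4 : 7384496 .. 7389210 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span covers 4715 positions on the plus strand of Chr4.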
The following sequences are available for this feature:

Gene sequence (with intron)

AAAGAAAAGCACAAAAGCTCATCAAAACCCCAAATGTGACCCATTATACATCCCATTATCATTTTTGTTCCAAAAATTTCCTTCTCTTTCCAAATTCATTGAAAGAAGAAAACATCATGGTTAAAGCTTTGCAACAATCAAAATCGCAATCAAGAACAACAAAAACAACCAATAATCTTGTTTCTCCAAAGCTTTTCCTCTATCTCCTCTCCATCTCAGCTCTCCTTTTCATCCTCTTCCACATCCACTCCCTCCACCACCATGTCCCTCCACCACCTTCCTCCATCGTTGTTGCGAAGCTCCGCCGTTCCGTTACGTTTCTTCCCCTCAAGGACTTGCGTTACTCCAACAAAGCCCTTGTAGGCCATACATGGTTCATGAGCTCCTTGTATGACATCCAGGAGGAAGGTGAGGTTCAATACCAGCAGTTTCCATCACCGGTGGTGGACGGCGACGAGCGGATGCTTTGCCTGAAAGGGCGCGACACTCACGACGGGTCTTGGAATTACTATGGGTTGGCGTGGCCTGAAGGTTTGCCGGAAAATGCGAGGGTCAAGAAAGGTGTGAGCTTTGTGTCTTACAATCATTATGATTATCAAAATATTTGGCATGGCTTGTCCGCTCTCATGCCTTTCGTTGCTTGGCATCAGATTCAAGGTAACGCCATTTGTTTTTTTTTGGATTTATTCATCTCATTACTCATTTCTTTGCTTCATTTCTGGGCTTAGCACTACTTGGAAATTGTGTCAAGTCTTAAAACCTTCATTATATATATATATAACTTGATTTGGAAGTAAAAGTAATTTATTATTAATTTAAGCATTTTTTTTATATAGTTTTTGTTGTCTATAAAAATACTTGTTGTATCAATACTCTATTATATATTCATTTATTCAAAATGATATATTACTATAAATTATATCAATATCTTAATAAATAATTACATTGCCTAACACTCTAATTAGGAATAATAATTGAGAGGTTTACTAAACAAAACAAAACAATAACAATGATAGACGATACGGTAATTTTTAATTTAATTATCTTAGTTTAACGTTAACATTAAACTAAAATAATTAAATTAAATTATCTTTAAAATTATACATTTGTAATTTTATACTATTTTGGTAATGAATATTAGAAAACAAAAAGTTGTATATTTTGCTTTAATTTTGATATGAATTTATGATACAACAAAGGAAAGTGTGAAGTACCAGAGAGATGGATATTATACCACTGGGGGGAACTGAGATTGAGAATGGGAAAATGGGTTTCCACATTAATGGAGGCCACATTCGGAGCCCCGCTTCAGTTCGAAGCTTTCGAAGATATCAGCGAAGGGCAGCCAGTTTGCTTCGAGAAGGCGGTGGTGATGAGACACAACGAGGGCGGAATGTCGAGGCAGCGCCGGATGGAGACCTACGACTTTATGAGGTGCAAGGCACGGTTGTTCTGCAACTTAACCTCGCCTGAGCCGTTGTCAGCGGCGGTGGGTATGACGATGTTAATGAGAACGGGGCCCAGGTCGTTCAGGAATGAGACGACGGTGGTGGAGATTTTTGGGAAGGAATGCGCAAAAGTCGCCGGCTGTCGCCTCACGGTGGCTTATTCCAATAATCTCACCTTCTGTGAACAGGTAACCCGTCCTTTAATCTACCTTCTCTTCTTTCTTTTTTATTTATTGAGATTTGGAATATTAAATTTTGTTGTTATTCTTTTTAAATTATTGGTAAATTGAAATTATCACTTTGAAAGTTTTACACCATCAAGGTTTTAGCCTACAAATTTTAAAATTTATATAACAATCACATCACATGTGCATATTATATAATGGGAGAGTATCCTATTTATCTGATTAAGATTATTATAAAATCTTTTATTTCTCCAATTTGATCTAGCAATAGATATCAACAACTTTTCTATTATCTTTCTTTAAAAAAAAAAATGAAAAAAAACAACTTTTCTATTAATAAATTATATGATATAATCGTATTTAGAGA
ATATTTATATTTGAGAATTTTCGATAAATTTAAGTGTAAATTTGAGAATTTTGAAATATTAAAATCAAAATTTAGAAAACTCTAAAAGATTTAGGATCGTTATTTATATTTCTAAACTTTCATTAAAATTAATATATTAGGAAAAATCAACTTTTTTAAATATAGAAAAATAGGTTAAATTGTATGTAATAGAAATCAATACTTAATCTTAGACGTGAATAGAAACCTATTAATGTCTTTTAATAATAGAAATTGGTAAAAATCAATAATTGATCAAGCCCATAGATATATTATATATTATGATTAGAGATTTATATAAGTTTATCATTGTCTACAAGTTTTTTTTTCTATTTCTATAAATAGTTTAACGTTTTTCTACAAAAATTTATCCAAGAAAAAACTTAAATTTGATGGTGTCGAGAGGTTGCAAATTTTACATTAAAAAAACTAAACAAACTCATACGTCTGATAAGGTAGGTTAAGCATTAATTACTATTCTTATTTCCAATGACGAGATCCATAGATTTTCGTTAATGTTTTTCTTCAAAGCTTTTAATATCATGGAAGATATGTATCTATTTATATTCTAAATGCAGGTTGGGAGGTGCATGTTATGACATTAAAATTTATTATTGGGAATAGTCCAAATGACTTTATTAAAGCATATCTTATAAAATACTTGAACAACAACCCACATTATATTGGATAATAAAGTGATGAATCCAGTACTCTGGTCACCCACTCAAATTTTCATAAACAAGTTTATTTGTGTTTGGTATTTCTTTTTTCTAAGTATGACGGCCTTGGTCATAAATCCAACCTTTGATCTTTATATAGAAAAAGTCTCTCTTTTAGTTATTATTAAGCTAAACTAACTTTGATGTACTTGTTTGGTATCCATATATTTTTTATTTTGCAACCTATAATATAGAAAATTAAAATTTTAGATTCTCACTAAAATAAAATTTGATTTTATATCAAAATATCACTAAATTAGCTTTAAATTTTTAAATACAATAAAGGTATAATCGAAACAATTTTAGTTAGTTTATGAGAGTAGTTGAAGAACTTGTAAGAGTTTCTAAAAATTGTTTAAAACAAATAAACATTATAATATAATATAGAACCTTAAGGGAAGTCTAGGAAGAAAATAGGCAAAACATTTCGGGAACATTTTAACCCTTTAGAACGTAGAATTAAGTTTTAAAGTCTAAAGTCAATATGTTGAAGTTAGGAAGTCTACGTTTTGGAGTATTGTTATTTAGCGTGGAGCTAAGAAACTCTGAAAAGAAAAGAGAAAAAGATCAAATTTCTTGATAAATATGGTTTAATAGTAAAGAATGTTGAAGTTAATGAAATTAAGTTATTTATTATCTATTTTTAAATTGGATCAGTCAAACACACTTGTTGTTTATTTCCTTTCACAATATACAAAACATGTATATCAAAGTTTGTATTTTGATGATTAACTAATGAAAAATGTGAATTAATTAACAGGTGAGTTTGATGGGGAAGACGGACATATTGATATCCCCACATGGAGCACAACTGACAAACATGATTCTAATGAACAGAAACAGTAGCGTAATGGAATTCTTTCCCAAAGGGTGGTTGGAACTTGCGGGCATTGGCCAATATGTGTACCATTGGCTCGCTAGCTGGTCTGGAATGAGGCATCAAGGTGCTTGGAGAGACCCTAATAGCACTCTTCCCTGTCCTTATTCTCCCGGCGATCGTCGATGCATGTCCATTTACAAAGCTGGCACTATTGGGTAGTACATTTCATATCGTTCTATTTCTGTTTTAATTTGTTTTTTCAAAAATTATCTTGTTCTTGATTGAATTTTCGACTATAGTTTATTGGAAATTAACTTTTTTAGTCCATTAGTTTTTGTACCGATTTAAATGGTTTTTTGAAGTTAATTTTGTCATTTGATATCAAATACATCCTACTTAAAAAAAAGGTAGGAGCCATTTCAATTTGTTTCTCAAAATTTAGG
AGACTTGTGAAGTCTATTTCAAAATTTACTTTTTTCATATTTTTCGAATTTGCTCCTTGAGTTTGCACAGCGATTTCAAGTTTGGGGTCTTTAAACTTTTAGAGTTGTTATGTTTGATGTATTCGTAAACTTTGAATATTGTGCTCATGGATTGTTGATAGATTTCTTGTAAACTTTTGATATGATACTTAATATTTATCAAAACTTTCGTCAGACCCTGTAGATAGCTCCACTTTCAATTTTCGATCCATGACCTTATAGACCTATTTTCGACCCATTTTCAATTTTGAATACAAAAGAGTCCATTGTATTTTGTACAAACTTGTATGATTTCTTGTTGGATAGTCTACATATGCAGAAGTAGGCTTTTCCTAATTAAAATTGAAGTTCAGATGCGTTTGGTGTGTTAGATTTCTGTTGAATATAGTATTAGACACATCCCAACGGATAACATTTCCTACCCATGCATGATGAACTAAAAGAATTATTAATCCTAATTCATACTAATAATGTTACTATTTTTTTATTATGTGTTAACAGATACAACAGAACACACTTTTCTGAGTGGGCTAAGAGTGTTCTGAATGAGGTGAAGATGAGAAAGATGGAGGAAGCAACAAAGGTCACTACAAATCAAATTCATGAGTGTTCTTGTATCTAATTTTTCCCCATCTTCTAGGCACATTTCTTTAGTAATTTATATACCCCATTGACC

mRNA sequence

ATGGTTAAAGCTTTGCAACAATCAAAATCGCAATCAAGAACAACAAAAACAACCAATAATCTTGTTTCTCCAAAGCTTTTCCTCTATCTCCTCTCCATCTCAGCTCTCCTTTTCATCCTCTTCCACATCCACTCCCTCCACCACCATGTCCCTCCACCACCTTCCTCCATCGTTGTTGCGAAGCTCCGCCGTTCCGTTACGTTTCTTCCCCTCAAGGACTTGCGTTACTCCAACAAAGCCCTTGTAGGCCATACATGGTTCATGAGCTCCTTGTATGACATCCAGGAGGAAGGTGAGGTTCAATACCAGCAGTTTCCATCACCGGTGGTGGACGGCGACGAGCGGATGCTTTGCCTGAAAGGGCGCGACACTCACGACGGGTCTTGGAATTACTATGGGTTGGCGTGGCCTGAAGGTTTGCCGGAAAATGCGAGGGTCAAGAAAGGTGTGAGCTTTGTGTCTTACAATCATTATGATTATCAAAATATTTGGCATGGCTTGTCCGCTCTCATGCCTTTCGTTGCTTGGCATCAGATTCAAGGAAAGTGTGAAGTACCAGAGAGATGGATATTATACCACTGGGGGGAACTGAGATTGAGAATGGGAAAATGGGTTTCCACATTAATGGAGGCCACATTCGGAGCCCCGCTTCAGTTCGAAGCTTTCGAAGATATCAGCGAAGGGCAGCCAGTTTGCTTCGAGAAGGCGGTGGTGATGAGACACAACGAGGGCGGAATGTCGAGGCAGCGCCGGATGGAGACCTACGACTTTATGAGGTGCAAGGCACGGTTGTTCTGCAACTTAACCTCGCCTGAGCCGTTGTCAGCGGCGGTGGGTATGACGATGTTAATGAGAACGGGGCCCAGGTCGTTCAGGAATGAGACGACGGTGGTGGAGATTTTTGGGAAGGAATGCGCAAAAGTCGCCGGCTGTCGCCTCACGGTGGCTTATTCCAATAATCTCACCTTCTGTGAACAGGTGAGTTTGATGGGGAAGACGGACATATTGATATCCCCACATGGAGCACAACTGACAAACATGATTCTAATGAACAGAAACAGTAGCGTAATGGAATTCTTTCCCAAAGGGTGGTTGGAACTTGCGGGCATTGGCCAATATGTGTACCATTGGCTCGCTAGCTGGTCTGGAATGAGGCATCAAGGTGCTTGGAGAGACCCTAATAGCACTCTTCCCTGTCCTTATTCTCCCGGCGATCGTCGATGCATGTCCATTTACAAAGCTGGCACTATTGGATACAACAGAACACACTTTTCTGAGTGGGCTAAGAGTGTTCTGAATGAGGTGAAGATGAGAAAGATGGAGGAAGCAACAAAGGTCACTACAAATCAAATTCATGAGTGTTCTTGTATCTAA

Coding sequence (CDS)

ATGGTTAAAGCTTTGCAACAATCAAAATCGCAATCAAGAACAACAAAAACAACCAATAATCTTGTTTCTCCAAAGCTTTTCCTCTATCTCCTCTCCATCTCAGCTCTCCTTTTCATCCTCTTCCACATCCACTCCCTCCACCACCATGTCCCTCCACCACCTTCCTCCATCGTTGTTGCGAAGCTCCGCCGTTCCGTTACGTTTCTTCCCCTCAAGGACTTGCGTTACTCCAACAAAGCCCTTGTAGGCCATACATGGTTCATGAGCTCCTTGTATGACATCCAGGAGGAAGGTGAGGTTCAATACCAGCAGTTTCCATCACCGGTGGTGGACGGCGACGAGCGGATGCTTTGCCTGAAAGGGCGCGACACTCACGACGGGTCTTGGAATTACTATGGGTTGGCGTGGCCTGAAGGTTTGCCGGAAAATGCGAGGGTCAAGAAAGGTGTGAGCTTTGTGTCTTACAATCATTATGATTATCAAAATATTTGGCATGGCTTGTCCGCTCTCATGCCTTTCGTTGCTTGGCATCAGATTCAAGGAAAGTGTGAAGTACCAGAGAGATGGATATTATACCACTGGGGGGAACTGAGATTGAGAATGGGAAAATGGGTTTCCACATTAATGGAGGCCACATTCGGAGCCCCGCTTCAGTTCGAAGCTTTCGAAGATATCAGCGAAGGGCAGCCAGTTTGCTTCGAGAAGGCGGTGGTGATGAGACACAACGAGGGCGGAATGTCGAGGCAGCGCCGGATGGAGACCTACGACTTTATGAGGTGCAAGGCACGGTTGTTCTGCAACTTAACCTCGCCTGAGCCGTTGTCAGCGGCGGTGGGTATGACGATGTTAATGAGAACGGGGCCCAGGTCGTTCAGGAATGAGACGACGGTGGTGGAGATTTTTGGGAAGGAATGCGCAAAAGTCGCCGGCTGTCGCCTCACGGTGGCTTATTCCAATAATCTCACCTTCTGTGAACAGGTGAGTTTGATGGGGAAGACGGACATATTGATATCCCCACATGGAGCACAACTGACAAACATGATTCTAATGAACAGAAACAGTAGCGTAATGGAATTCTTTCCCAAAGGGTGGTTGGAACTTGCGGGCATTGGCCAATATGTGTACCATTGGCTCGCTAGCTGGTCTGGAATGAGGCATCAAGGTGCTTGGAGAGACCCTAATAGCACTCTTCCCTGTCCTTATTCTCCCGGCGATCGTCGATGCATGTCCATTTACAAAGCTGGCACTATTGGATACAACAGAACACACTTTTCTGAGTGGGCTAAGAGTGTTCTGAATGAGGTGAAGATGAGAAAGATGGAGGAAGCAACAAAGGTCACTACAAATCAAATTCATGAGTGTTCTTGTATCTAA
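The protein used as the BLAST query below corresponds to a translation of this coding sequence. As a minimal sketch, assuming the standard nuclear genetic code (the page does not name the translation table), the first and last codons of the CDS can be translated as follows. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    Trailing bases that do not fill a codon are ignored.
    """
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 bases of the CDS shown above.
print(translate("ATGGTTAAAGCTTTGCAACAATCAAAATCG"))
print(translate("TGTTCTTGTATCTAA"))
```

The two fragments translate to `MVKALQQSKS` and `CSCI`, which agree with the start and end of the query protein in the alignments on this page.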
BLAST of CSPI04G09440 vs. TrEMBL
Match: A0A0A0KXZ9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G112630 PE=4 SV=1)

HSP 1 Score: 956.1 bits (2470), Expect = 1.6e-275
Identity = 456/457 (99.78%), Positives = 456/457 (99.78%), Query Frame = 1

Query: 1   MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVVA 60
           MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIV A
Sbjct: 1   MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVAA 60

Query: 61  KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK 120
           KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK
Sbjct: 61  KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK 120

Query: 121 GRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ 180
           GRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ
Sbjct: 121 GRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ 180

Query: 181 GKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMR 240
           GKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMR
Sbjct: 181 GKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMR 240

Query: 241 HNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEI 300
           HNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEI
Sbjct: 241 HNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEI 300

Query: 301 FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF 360
           FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF
Sbjct: 301 FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF 360

Query: 361 PKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYN 420
           PKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYN
Sbjct: 361 PKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYN 420

Query: 421 RTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSCI 458
           RTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSCI
Sbjct: 421 RTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSCI 457
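The Identity figure in an HSP header is the number of identical aligned positions divided by the alignment length. The sketch below recomputes it for the first 60-column block of the alignment above and for the reported totals. The function name `percent_identity` is hypothetical, and gap handling assumes gaps are written as `-`.

```python
def percent_identity(query, sbjct):
    """Return (matches, aligned_length, percent) for two aligned strings.

    Both strings must have the same length; a column counts as a match
    only when the residues are identical and not a gap character.
    """
    if len(query) != len(sbjct):
        raise ValueError("aligned sequences must have equal length")
    aligned = len(query)
    matches = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    return matches, aligned, round(100.0 * matches / aligned, 2)


# Columns 1..60 of the query and subject from the alignment above.
query = "MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVVA"
sbjct = "MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVAA"
print(percent_identity(query, sbjct))

# Whole-HSP figure from the header: 456 identical positions out of 457.
print(round(100.0 * 456 / 457, 2))
```

The single V/A difference at position 59 is the only mismatch in the HSP, which is why the header reports 456/457.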

BLAST of CSPI04G09440 vs. TrEMBL
Match: V4S1C6_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10006690mg PE=4 SV=1)

HSP 1 Score: 609.0 bits (1569), Expect = 4.8e-171
Identity = 300/487 (61.60%), Positives = 357/487 (73.31%), Query Frame = 1

Query: 3   KALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSI----- 62
           +A    K  +R   ++ +  SPKL L +L+I   LF++F I  LH  +   PSS      
Sbjct: 10  RAASSEKPLNRNVSSSFSCYSPKLSLLILAIFVTLFVVFQIRFLH--ISQSPSSCYPSPA 69

Query: 63  ----------------------VVAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQ 122
                                 +  KLR SVTFLPLKDLRYS+KALVGHTWF+SSLYD  
Sbjct: 70  SWPFIYQQWQKLKNNCTQDLDPITEKLRISVTFLPLKDLRYSDKALVGHTWFISSLYDCH 129

Query: 123 EEGEVQYQQFPSPVVDGDERMLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSY 182
           +EGEVQYQQFPS       R+LC+KGRD HDGSWNYY LAWP+ LP NA + KG++FVSY
Sbjct: 130 QEGEVQYQQFPSK--SSQNRLLCIKGRDNHDGSWNYYALAWPKALPYNATLMKGLTFVSY 189

Query: 183 NHYDYQNIWHGLSALMPFVAWHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEATF-G 242
           NHY Y+NIWHGLSA++PFVAWHQ +  CE+P RWILYHWGELRL MG W+ TLM ATF G
Sbjct: 190 NHYSYENIWHGLSAMVPFVAWHQ-KNNCELPTRWILYHWGELRLGMGPWLQTLMHATFDG 249

Query: 243 APL--QFEAFEDISEGQPVCFEKAVVMRHNEGGMSRQRRMETYDFMRCKARLFCN--LTS 302
            P+  +F+  +D   G PVCFEKAVVMRHNEGGMSR+RRME YD MRCKAR++CN  L +
Sbjct: 250 EPVIERFDGIKDEDGGDPVCFEKAVVMRHNEGGMSRERRMEVYDLMRCKARMYCNVSLDN 309

Query: 303 PEPLSAAVGMTMLMRTGPRSFRNETTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLM 362
            +    AVGMT+LMRTGPRSF NE  ++ IF KECAK+ GCR+TVAYSNNLTFCEQV LM
Sbjct: 310 KDDNHKAVGMTLLMRTGPRSFTNEPAIIGIFEKECAKIDGCRMTVAYSNNLTFCEQVKLM 369

Query: 363 GKTDILISPHGAQLTNMILMNRNSSVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAW 422
             TDIL+SPHGAQLTN+ LM+RNSSVMEFFPKGWL+LAG+GQYV+HW+ASWSGMRHQGAW
Sbjct: 370 SMTDILVSPHGAQLTNIFLMDRNSSVMEFFPKGWLKLAGVGQYVFHWIASWSGMRHQGAW 429

Query: 423 RDPNSTLPCPYSPGDRRCMSIYKAGTIGYNRTHFSEWAKSVLNEVKMRKMEEA-TKVTTN 457
           RDPN    C YS  DRRCMSIYK G IGYN T+FSEWA++VLNEVK  K+E++ +  + +
Sbjct: 430 RDPNGE-NCTYSEDDRRCMSIYKNGRIGYNETYFSEWARNVLNEVKTMKLEKSQSNGSAS 489

BLAST of CSPI04G09440 vs. TrEMBL
Match: B9S9E8_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0884460 PE=4 SV=1)

HSP 1 Score: 608.6 bits (1568), Expect = 6.2e-171
Identity = 294/461 (63.77%), Positives = 343/461 (74.40%), Query Frame = 1

Query: 23  SPKLFLYLLSISALLFILFHIHSLHHHVPPPPSS----------------------IVVA 82
           +PKL ++LLSI   LF LF I  LH+  PP  SS                      +   
Sbjct: 32  TPKLSVFLLSICVTLFTLFQIQCLHY--PPSLSSPTWSLMQKWQEFATTCNQELKSMAEM 91

Query: 83  KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK 142
           +L+++VTFLPLKD+RY  KAL GHTWFMSS+YD  EEGEVQYQQFPS    G  R+LCLK
Sbjct: 92  RLKQAVTFLPLKDIRYQEKALQGHTWFMSSMYDTHEEGEVQYQQFPSESSKG--RLLCLK 151

Query: 143 GRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ 202
           G DTHDGSWN Y LAWPE LP NA + KG++FVSYNHYDY NIWHGLSA++PFVAWH+  
Sbjct: 152 GNDTHDGSWNSYALAWPETLPLNATLLKGLTFVSYNHYDYNNIWHGLSAIVPFVAWHKGN 211

Query: 203 GKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMR 262
           G  E+P RWILYHWGELR  MG W+STL EATFG+P   E F   +  +P+CFEKAVVMR
Sbjct: 212 GG-ELPSRWILYHWGELRFNMGLWLSTLTEATFGSPPNIEGFGWANNNEPICFEKAVVMR 271

Query: 263 HNEGGMSRQRRMETYDFMRCKARLFCNLTSP------EPLSAAVGMTMLMRTGPRSFRNE 322
           HNEGGMS  RR+ETYDFMRCKAR +CN++        E     +GMT+ MRTGPRSF+NE
Sbjct: 272 HNEGGMSTDRRIETYDFMRCKARAYCNVSLEGGNMVSEKGLPVIGMTLFMRTGPRSFKNE 331

Query: 323 TTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNS 382
           + V+ IF KECAKV GCRL VAYSNNLTFCEQV LM  TDILISPHGAQLTNM LMN+NS
Sbjct: 332 SAVIRIFEKECAKVDGCRLMVAYSNNLTFCEQVKLMSMTDILISPHGAQLTNMFLMNKNS 391

Query: 383 SVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKA 442
           SVMEFFPKGWL+LAG+GQ+VYHW+ASWSGM+HQGAWRDP+    CPY   DRRCMSIYK 
Sbjct: 392 SVMEFFPKGWLKLAGVGQFVYHWIASWSGMKHQGAWRDPDGD-HCPYPDDDRRCMSIYKG 451

Query: 443 GTIGYNRTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECS 456
           G IG+N THFSEW ++VLNEVK+RK EE +  + + I+ C+
Sbjct: 452 GKIGFNETHFSEWGRNVLNEVKLRKAEEMSHKSNDLIYSCA 486

BLAST of CSPI04G09440 vs. TrEMBL
Match: A0A061E286_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_007404 PE=4 SV=1)

HSP 1 Score: 605.9 bits (1561), Expect = 4.0e-170
Identity = 295/466 (63.30%), Positives = 346/466 (74.25%), Query Frame = 1

Query: 23  SPKLFLYLLSISALLFILFHIHSLHHHVPP-PPSSI------------------------ 82
           SPKL LY+L+    L +L  I SLH   PP  PS +                        
Sbjct: 28  SPKLSLYILAFCVTLLLLLQIRSLH--TPPISPSPLPSWSFLQQWQEVINKTLASPNCTQ 87

Query: 83  -----VVAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDG 142
                +  KLR SVTFLPLKDLRY+N+ L GHTWFMSS+YD  EEGEVQYQQFPS   +G
Sbjct: 88  DVLESMTQKLRDSVTFLPLKDLRYANQPLPGHTWFMSSMYDTHEEGEVQYQQFPSDSSNG 147

Query: 143 DERMLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMP 202
             R+LCLKGRDTHDGSWNYY LAWPE LP NA + KG++FV+YNHY+Y NIWHGLSA++P
Sbjct: 148 --RLLCLKGRDTHDGSWNYYALAWPEALPSNATLMKGLTFVAYNHYNYDNIWHGLSAMVP 207

Query: 203 FVAWHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEG-QPV 262
           FVAWH+ +  CE P RWILY WGELR +MG W++TLM+ATFG     E F  I +  QPV
Sbjct: 208 FVAWHR-KNSCETPTRWILYRWGELRFKMGTWLNTLMKATFGQAPYIEGFNGIEDDDQPV 267

Query: 263 CFEKAVVMRHNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSF 322
           CFEKAVVMRHNEGGMSR+RRME YD +RCKAR++CN++  +     +GMT+LMRTGPRSF
Sbjct: 268 CFEKAVVMRHNEGGMSRERRMEVYDLIRCKARVYCNVSGDQK-RPGIGMTLLMRTGPRSF 327

Query: 323 RNETTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMN 382
           RNET V+ IF KEC KV GC+L VAYSNNLT CEQV LM  TDILISPHGAQLTN+ LM+
Sbjct: 328 RNETAVIGIFEKECMKVEGCQLIVAYSNNLTICEQVKLMSLTDILISPHGAQLTNLFLMD 387

Query: 383 RNSSVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSI 442
           RNSSVMEFFPKGWL+LAG+GQYVYHW+ASWSGM H+G WRDP+    CPYS  DRRCMS+
Sbjct: 388 RNSSVMEFFPKGWLKLAGVGQYVYHWMASWSGMIHRGDWRDPDGE-NCPYSDDDRRCMSL 447

Query: 443 YKAGTIGYNRTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHE-CSC 457
           YK+G IGYN THF+EWA++VLN+VK  K+EEA+K   N I + C C
Sbjct: 448 YKSGRIGYNETHFAEWARNVLNDVKTSKLEEASKHAQNSISKTCDC 486

BLAST of CSPI04G09440 vs. TrEMBL
Match: B9IJB4_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0017s07560g PE=4 SV=2)

HSP 1 Score: 605.5 bits (1560), Expect = 5.3e-170
Identity = 293/460 (63.70%), Positives = 341/460 (74.13%), Query Frame = 1

Query: 21  LVSPKLFLYLLSISALLFILFHIHSLHHHVPP-PPSSIV----------------VAKLR 80
           L SPKL ++LL++   LF L+HI SLH      PP S V                  KLR
Sbjct: 6   LYSPKLSIFLLALCVSLFTLYHIQSLHARTTSSPPWSFVHQWERFTNCTQEHGSMAEKLR 65

Query: 81  RSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLKGRD 140
           +SVTFLPLKDLRY++KA  GHTWFMSS YD +EEG VQYQQFPS       R+LCLKG++
Sbjct: 66  QSVTFLPLKDLRYADKARQGHTWFMSSTYDTREEGGVQYQQFPSE--SSKRRLLCLKGKE 125

Query: 141 THDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQGKC 200
           THDGSWN Y LAWPE LP NA + KG++FVSYNHYDY NIWHGLSA++PFVAWH I+  C
Sbjct: 126 THDGSWNSYALAWPEALPFNATLLKGLTFVSYNHYDYDNIWHGLSAMVPFVAWH-IRNGC 185

Query: 201 EVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMRHNE 260
           E P RWILYHWGELR  MG W+ TL  ATFG     E+FE +++GQP+CFEKAVVMRHNE
Sbjct: 186 ESPSRWILYHWGELRFEMGPWLRTLTGATFGGAPYTESFEGVNDGQPLCFEKAVVMRHNE 245

Query: 261 GGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAA-------VGMTMLMRTGPRSFRNETT 320
           GGMSR RR ETYD MRCKAR++CN++    +          +GMT+ MRTG RSF NE+ 
Sbjct: 246 GGMSRDRRTETYDLMRCKARMYCNVSLEGRIPEVNKQGFLVIGMTLFMRTGSRSFTNESA 305

Query: 321 VVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSV 380
           V+ IF KECAKV GCRL VAYSNNLTFCEQV +M  TDIL+S HGAQLTNM LM++NSSV
Sbjct: 306 VIGIFEKECAKVDGCRLMVAYSNNLTFCEQVKMMSLTDILVSTHGAQLTNMFLMDKNSSV 365

Query: 381 MEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGT 440
           MEFFPKGWL++AG+GQYVYHW+ASWSGMRHQGAWRD N    CPY+  DRRCMSIYK G 
Sbjct: 366 MEFFPKGWLKVAGVGQYVYHWIASWSGMRHQGAWRDLNGD-ECPYAEDDRRCMSIYKNGK 425

Query: 441 IGYNRTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSC 457
           +G+N T+FSEWA+ VLNEVK+RK+EEA   T      CSC
Sbjct: 426 VGFNETYFSEWARDVLNEVKIRKLEEAASKTIASTSACSC 461

BLAST of CSPI04G09440 vs. TAIR10
Match: AT4G33600.1 (AT4G33600.1 unknown protein)

HSP 1 Score: 546.6 bits (1407), Expect = 1.5e-155
Identity = 268/475 (56.42%), Positives = 335/475 (70.53%), Query Frame = 1

Query: 9   KSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHI---HSLHHHVPPPPS---------- 68
           +S SR   T   L SPK  L +L +   +F+L  I   H     +  PPS          
Sbjct: 4   RSVSRNLVTC--LASPKFSLNVLCLVVTVFVLLQIWSFHITQQPILLPPSLFTYLKEQQQ 63

Query: 69  -----------SIVVAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQ 128
                      + +V KLR SVTFLPLKDLR+SNK L GHTWFMSSLYD Q +GEVQYQ+
Sbjct: 64  EPEQIKSENETAYLVEKLRESVTFLPLKDLRFSNKPLEGHTWFMSSLYDNQTKGEVQYQE 123

Query: 129 FPSPVVDGDERMLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIW 188
           FPS    G  R+LCLKG D HDGSWNYY LAWP+ LP NA +++G++FVSYNHYDY N+W
Sbjct: 124 FPSESSKG--RLLCLKGVDEHDGSWNYYALAWPQALPVNASLQEGLTFVSYNHYDYGNMW 183

Query: 189 HGLSALMPFVAWHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFED 248
           HGLSA++PFVAW  ++ +CE P+RW+LYHWGELR +MG W++ ++ AT+G   +F  F D
Sbjct: 184 HGLSAMVPFVAW-SLRHQCENPQRWVLYHWGELRFKMGNWLNEIITATYGQNTEFLRFRD 243

Query: 249 ISEGQPVCFEKAVVMRHNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLM 308
             + +PVCFEKAVVMRHNEGGMSR+RRME +D +RCKAR +CN++  E   + +GMT+LM
Sbjct: 244 --KNRPVCFEKAVVMRHNEGGMSRERRMEVFDLIRCKARHYCNISLSETSKSRIGMTLLM 303

Query: 309 RTGPRSFRNETTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQL 368
           RTGPRSF+NE+ V++IF +EC  V GC L V+YSNNLTFCEQV LM  TD+L+SPHGAQL
Sbjct: 304 RTGPRSFKNESAVIDIFKRECKNVEGCELKVSYSNNLTFCEQVELMRMTDVLVSPHGAQL 363

Query: 369 TNMILMNRNSSVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPG 428
           TN++LM+RNSSVMEF PKGW +LAG+GQ VY W   WSGMRH+G+W DP+  + C +   
Sbjct: 364 TNLVLMDRNSSVMEFLPKGWRKLAGVGQLVYQWGTRWSGMRHEGSWHDPDGEI-CQFPDT 423

Query: 429 DRRCM-SIYKAGTIGYNRTHFSEWAKSVLNEVKMRKMEEAT--KVTTNQIHECSC 457
           DRRCM S+YK G IGYN T+F EWAKSVL + K RKM      K +   +  C C
Sbjct: 424 DRRCMSSVYKNGRIGYNETYFGEWAKSVLGKFKERKMANVVGRKHSYGSLDGCWC 470

BLAST of CSPI04G09440 vs. TAIR10
Match: AT4G33590.1 (AT4G33590.1 unknown protein)

HSP 1 Score: 532.3 bits (1370), Expect = 2.9e-151
Identity = 244/411 (59.37%), Positives = 313/411 (76.16%), Query Frame = 1

Query: 32  SISALLFILFHIHSLHHHVPPPPSSIVVAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSL 91
           S+S    +L ++   H  V    ++ +V KLR SVTFLPLKD R+SNK L GHTWFMSSL
Sbjct: 46  SLSLPPALLTYLKHNHEEVSENKTASLVEKLRESVTFLPLKDYRFSNKPLEGHTWFMSSL 105

Query: 92  YDIQEEGEVQYQQFPSPVVDGDERMLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVS 151
           YD Q +GE QYQ+FPS    G  R+LCLKG D HDGSWN Y LAWPE LP NA ++ G++
Sbjct: 106 YDNQTKGEAQYQEFPSDSSKG--RLLCLKGVDEHDGSWNSYALAWPEALPTNAILQDGLT 165

Query: 152 FVSYNHYDYQNIWHGLSALMPFVAWHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEA 211
           FVSYN YDY N+WHGL+A++PF+AW  ++ +CE P++W+LYHWGELR  MG W+S ++ A
Sbjct: 166 FVSYNQYDYGNLWHGLTAVVPFIAW-SLRNQCEKPQKWVLYHWGELRFGMGHWLSEIVTA 225

Query: 212 TFGAPLQFEAFEDISEGQPVCFEKAVVMRHNEGGMSRQRRMETYDFMRCKARLFCNLTSP 271
           T+G    F  F D  + +PVCFEKAVVMRHNEGGMSR+RRME +D +RCKAR +CN++S 
Sbjct: 226 TYGQEPDFLRFVD--DDKPVCFEKAVVMRHNEGGMSRERRMEAFDLIRCKARNYCNISSS 285

Query: 272 EPLSAAVGMTMLMRTGPRSFRNETTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMG 331
                 +GMT+L+RTG RSFRNE+ V+++F KEC +V GC ++V+YSNNL+FCEQV LM 
Sbjct: 286 VASKPRIGMTLLLRTGARSFRNESMVIDVFKKECKRVDGCEISVSYSNNLSFCEQVELMK 345

Query: 332 KTDILISPHGAQLTNMILMNRNSSVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWR 391
           KTD+L+SPHGAQLTN+ LM++NSSVMEFFPKGWL+LAG+GQ V+ W A+WSGMRH+G+W 
Sbjct: 346 KTDVLVSPHGAQLTNLFLMDKNSSVMEFFPKGWLKLAGVGQLVFQWGANWSGMRHEGSWH 405

Query: 392 DPNSTLPCPYSPGDRRCMSIYKAGTIGYNRTHFSEWAKSVLNEVKMRKMEE 443
           DP   + C +   DRRCMSIYK   IGYN T+F EWA+ VL +  +R+M+E
Sbjct: 406 DPVGEI-CQFPDTDRRCMSIYKNAMIGYNETYFGEWARRVLGKFSIREMKE 450

BLAST of CSPI04G09440 vs. NCBI nr
Match: gi|778692014|ref|XP_011653390.1| (PREDICTED: uncharacterized protein LOC101219216 [Cucumis sativus])

HSP 1 Score: 956.1 bits (2470), Expect = 2.3e-275
Identity = 456/457 (99.78%), Positives = 456/457 (99.78%), Query Frame = 1

Query: 1   MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVVA 60
           MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIV A
Sbjct: 1   MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVAA 60

Query: 61  KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK 120
           KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK
Sbjct: 61  KLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLK 120

Query: 121 GRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ 180
           GRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ
Sbjct: 121 GRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQ 180

Query: 181 GKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMR 240
           GKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMR
Sbjct: 181 GKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMR 240

Query: 241 HNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEI 300
           HNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEI
Sbjct: 241 HNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEI 300

Query: 301 FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF 360
           FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF
Sbjct: 301 FGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFF 360

Query: 361 PKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYN 420
           PKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYN
Sbjct: 361 PKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYN 420

Query: 421 RTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSCI 458
           RTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSCI
Sbjct: 421 RTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSCI 457

BLAST of CSPI04G09440 vs. NCBI nr
Match: gi|659125841|ref|XP_008462883.1| (PREDICTED: uncharacterized protein LOC103501161 isoform X1 [Cucumis melo])

HSP 1 Score: 822.4 bits (2123), Expect = 3.9e-235
Identity = 382/410 (93.17%), Positives = 390/410 (95.12%), Query Frame = 1

Query: 48  HHVPPPPSSIVVAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPS 107
           HH+PP         LRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPS
Sbjct: 5   HHLPPSLXXXXXXXLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPS 64

Query: 108 PVVDGDERMLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGL 167
           PVVDGDERMLCLKGRDTHDGSWNYYGLAWPEGLPENA V KGVSFVSYNHYDYQNIWHGL
Sbjct: 65  PVVDGDERMLCLKGRDTHDGSWNYYGLAWPEGLPENATVMKGVSFVSYNHYDYQNIWHGL 124

Query: 168 SALMPFVAWHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISE 227
           SALMPFVAWHQIQGKCEVPERWILYHWGELRLRMGKWV+TLMEATFGAP++ EAFE ISE
Sbjct: 125 SALMPFVAWHQIQGKCEVPERWILYHWGELRLRMGKWVNTLMEATFGAPIRIEAFEGISE 184

Query: 228 GQPVCFEKAVVMRHNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTG 287
           GQPVCFEKAVVMRHNEGGMSRQRRMETYDFMRCKARL CNLTSPEPLS AVGMTMLMRTG
Sbjct: 185 GQPVCFEKAVVMRHNEGGMSRQRRMETYDFMRCKARLLCNLTSPEPLSGAVGMTMLMRTG 244

Query: 288 PRSFRNETTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNM 347
           PRSFRNETTV EIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNM
Sbjct: 245 PRSFRNETTVAEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNM 304

Query: 348 ILMNRNSSVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRR 407
           ILMNRNSSVMEFFPKGWLELAGIGQYVYHWLASWSGM+HQGAWRDPNSTLPCPYSP DRR
Sbjct: 305 ILMNRNSSVMEFFPKGWLELAGIGQYVYHWLASWSGMKHQGAWRDPNSTLPCPYSPNDRR 364

Query: 408 CMSIYKAGTIGYNRTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSCI 458
           CMS YK GTIGYNRT+FSEWAKSVLNEVKMRK+EEATK TTNQ+HECSCI
Sbjct: 365 CMSFYKGGTIGYNRTYFSEWAKSVLNEVKMRKIEEATKFTTNQVHECSCI 414

BLAST of CSPI04G09440 vs. NCBI nr
Match: gi|659125843|ref|XP_008462884.1| (PREDICTED: uncharacterized protein LOC103501161 isoform X2 [Cucumis melo])

HSP 1 Score: 748.0 bits (1930), Expect = 9.5e-213
Identity = 347/371 (93.53%), Positives = 352/371 (94.88%), Query Frame = 1

Query: 48  HHVPPPPSSIVVAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPS 107
           HH+PP         LRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPS
Sbjct: 5   HHLPPSLXXXXXXXLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPS 64

Query: 108 PVVDGDERMLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGL 167
           PVVDGDERMLCLKGRDTHDGSWNYYGLAWPEGLPENA V KGVSFVSYNHYDYQNIWHGL
Sbjct: 65  PVVDGDERMLCLKGRDTHDGSWNYYGLAWPEGLPENATVMKGVSFVSYNHYDYQNIWHGL 124

Query: 168 SALMPFVAWHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISE 227
           SALMPFVAWHQIQGKCEVPERWILYHWGELRLRMGKWV+TLMEATFGAP++ EAFE ISE
Sbjct: 125 SALMPFVAWHQIQGKCEVPERWILYHWGELRLRMGKWVNTLMEATFGAPIRIEAFEGISE 184

Query: 228 GQPVCFEKAVVMRHNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTG 287
           GQPVCFEKAVVMRHNEGGMSRQRRMETYDFMRCKARL CNLTSPEPLS AVGMTMLMRTG
Sbjct: 185 GQPVCFEKAVVMRHNEGGMSRQRRMETYDFMRCKARLLCNLTSPEPLSGAVGMTMLMRTG 244

Query: 288 PRSFRNETTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNM 347
           PRSFRNETTV EIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNM
Sbjct: 245 PRSFRNETTVAEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNM 304

Query: 348 ILMNRNSSVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRR 407
           ILMNRNSSVMEFFPKGWLELAGIGQYVYHWLASWSGM+HQGAWRDPNSTLPCPYSP DRR
Sbjct: 305 ILMNRNSSVMEFFPKGWLELAGIGQYVYHWLASWSGMKHQGAWRDPNSTLPCPYSPNDRR 364

Query: 408 CMSIYKAGTIG 419
           CMS YK GTIG
Sbjct: 365 CMSFYKGGTIG 375

BLAST of CSPI04G09440 vs. NCBI nr
Match: gi|1009151277|ref|XP_015893468.1| (PREDICTED: uncharacterized protein LOC107427594 [Ziziphus jujuba])

HSP 1 Score: 614.8 bits (1584), Expect = 1.2e-172
Identity = 304/486 (62.55%), Positives = 355/486 (73.05%), Query Frame = 1

Query: 3   KALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLH--HHVPPPPSSI--- 62
           KA    K  SRT  +   L SP+L +++L+    LF+LFHI SL      P PP S+   
Sbjct: 18  KAHHHPKQHSRTIFSILPLYSPRLSIFILATCVALFVLFHIQSLQTPPSSPSPPWSLMHQ 77

Query: 63  -----------------------VVAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDI 122
                                  V  KLR SVTFLPLKDLRY++ AL GHTWFMSS+YD 
Sbjct: 78  YWQRATTTRIFTNCTTNQLANTTVTDKLRDSVTFLPLKDLRYAHAALDGHTWFMSSMYDT 137

Query: 123 QEEGEVQYQQFPSPVVDGDERMLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVS 182
            EEGEVQYQQFPS    G  R+LCLKGRDTHDGSWN Y LAW E LP NA   KG++FVS
Sbjct: 138 HEEGEVQYQQFPSESSKG--RILCLKGRDTHDGSWNSYALAWQEALPHNATFMKGLTFVS 197

Query: 183 YNHYDYQNIWHGLSALMPFVAWHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEATFG 242
           YNHY+Y+NIWHGLSA+MPFVAW++  G  ++P+RW+LYHWGELR +MG W+ TLMEATF 
Sbjct: 198 YNHYNYENIWHGLSAVMPFVAWYKKNGCTQLPQRWVLYHWGELRFKMGLWLKTLMEATFD 257

Query: 243 APLQFEAFE--DISEGQPVCFEKAVVMRHNEGGMSRQRRMETYDFMRCKARLFCNLTSPE 302
            P   E FE  +  E  PVCFE AVVMRHNEGGMSR++RME YD +RCKAR++CN++S +
Sbjct: 258 GPQHIEGFEWVENDEFSPVCFETAVVMRHNEGGMSREKRMEVYDLIRCKARIYCNVSSEK 317

Query: 303 PLSA-AVGMTMLMRTGPRSFRNETTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMG 362
             +   +GMT+ MR GPRSF+NET V+EIF KEC KV GCRL VAYSNNLT CEQV LM 
Sbjct: 318 KTTVPEIGMTLFMRMGPRSFKNETAVIEIFAKECGKVEGCRLMVAYSNNLTVCEQVKLMS 377

Query: 363 KTDILISPHGAQLTNMILMNRNSSVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWR 422
            TDIL+SPHGAQLTNM LM+RNSSVMEFFPKGWL+LAG+GQYV+HWLASWSGM+HQGAWR
Sbjct: 378 STDILVSPHGAQLTNMFLMDRNSSVMEFFPKGWLKLAGVGQYVHHWLASWSGMKHQGAWR 437

Query: 423 DPNSTLPCPYSPGDRRCMSIYKAGTIGYNRTHFSEWAKSVLNEVKMRKMEE-ATKVTTNQ 457
           DPN    CPYS  DRRCMSIYK+G IG+N T+FSEWA++VLNEVK RKMEE A K T   
Sbjct: 438 DPNGD-HCPYSEDDRRCMSIYKSGKIGFNNTYFSEWARNVLNEVKARKMEEYALKGTFPS 497

BLAST of CSPI04G09440 vs. NCBI nr
Match: gi|743939586|ref|XP_011014243.1| (PREDICTED: uncharacterized protein LOC105118077 [Populus euphratica])

HSP 1 Score: 613.2 bits (1580), Expect = 3.6e-172
Identity = 296/470 (62.98%), Positives = 351/470 (74.68%), Query Frame = 1

Query: 11  QSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLH-HHVPPPPSSIV----------- 70
           + RTT +   L SPKL ++LL++   LF L+HI SLH      PP S V           
Sbjct: 23  KKRTTMSVFCLYSPKLSIFLLTLCVSLFTLYHIQSLHARKTSSPPWSFVHKWERVTNCTR 82

Query: 71  -----VAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGD 130
                  KLR+SVTFLPLKDLRY++KAL GHTWFMSS+YD +EEGEVQYQQFPS      
Sbjct: 83  EHESMADKLRQSVTFLPLKDLRYADKALQGHTWFMSSMYDTREEGEVQYQQFPSK--SSK 142

Query: 131 ERMLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPF 190
            R+LCLKG++THDGS N Y LAWPE LP NA + KG++FVSYNHY+Y NIWHGLSA++PF
Sbjct: 143 RRLLCLKGKETHDGSRNSYALAWPEALPFNATLLKGLTFVSYNHYNYDNIWHGLSAMVPF 202

Query: 191 VAWHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCF 250
           VAWH I+  CE P RWILYHWGELR  M  W+ TL  ATFG     E+FE++++GQP+CF
Sbjct: 203 VAWH-IRNGCESPSRWILYHWGELRFEMSPWLRTLTGATFGGAPYTESFEEVNDGQPLCF 262

Query: 251 EKAVVMRHNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAA-------VGMTMLMRT 310
           EKAVVMRHNEGGMSR RR ETYD MRC+AR++CN++    +          +GMT+ MRT
Sbjct: 263 EKAVVMRHNEGGMSRDRRTETYDLMRCRARMYCNVSLEGRIHEVNKQGLPVIGMTLFMRT 322

Query: 311 GPRSFRNETTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTN 370
           GPRSF NE+ V+ IF KECA+V GCRL VAYSNNLTFCEQV +M  TDIL+SPHGAQLTN
Sbjct: 323 GPRSFTNESAVIAIFEKECARVDGCRLMVAYSNNLTFCEQVKVMSLTDILVSPHGAQLTN 382

Query: 371 MILMNRNSSVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDR 430
           M LM++NSSVMEFFPKGWL+LAG+GQYV+HW+ASWSGMRHQGAWRDPN    CPY+  DR
Sbjct: 383 MFLMDKNSSVMEFFPKGWLKLAGVGQYVFHWIASWSGMRHQGAWRDPNGD-ECPYAEDDR 442

Query: 431 RCMSIYKAGTIGYNRTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSC 457
           RCMSIYK G IG+N T+FSEWA+ VLNEVK+RK+EEA   T      C+C
Sbjct: 443 RCMSIYKNGKIGFNETYFSEWARDVLNEVKIRKLEEAANKTNASTSACAC 488

The following BLAST results are available for this feature:

BLAST vs. TrEMBL
Match Name                           E-value     Identity (%)   Description
A0A0A0KXZ9_CUCSA                     1.6e-275    99.78          Uncharacterized protein OS=Cucumis sativus GN=Csa_4G112630 PE=4 SV=1
V4S1C6_9ROSI                         4.8e-171    61.60          Uncharacterized protein OS=Citrus clementina GN=CICLE_v10006690mg PE=4 SV=1
B9S9E8_RICCO                         6.2e-171    63.77          Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0884460 PE=4 SV=1
A0A061E286_THECC                     4.0e-170    63.30          Uncharacterized protein OS=Theobroma cacao GN=TCM_007404 PE=4 SV=1
B9IJB4_POPTR                         5.3e-170    63.70          Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0017s07560g PE=4 SV=2

BLAST vs. TAIR10
Match Name                           E-value     Identity (%)   Description
AT4G33600.1                          1.5e-155    56.42          unknown protein
AT4G33590.1                          2.9e-151    59.37          unknown protein

BLAST vs. NCBI nr
Match Name                           E-value     Identity (%)   Description
gi|778692014|ref|XP_011653390.1|     2.3e-275    99.78          PREDICTED: uncharacterized protein LOC101219216 [Cucumis sativus]
gi|659125841|ref|XP_008462883.1|     3.9e-235    93.17          PREDICTED: uncharacterized protein LOC103501161 isoform X1 [Cucumis melo]
gi|659125843|ref|XP_008462884.1|     9.5e-213    93.53          PREDICTED: uncharacterized protein LOC103501161 isoform X2 [Cucumis melo]
gi|1009151277|ref|XP_015893468.1|    1.2e-172    62.55          PREDICTED: uncharacterized protein LOC107427594 [Ziziphus jujuba]
gi|743939586|ref|XP_011014243.1|     3.6e-172    62.98          PREDICTED: uncharacterized protein LOC105118077 [Populus euphratica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term          Definition
IPR007657     Glycosyltransferase_61
Vocabulary: Molecular Function
Term          Definition
GO:0016757    transferase activity, transferring glycosyl groups
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession   Term Name
biological_process    GO:0008152       metabolic process
biological_process    GO:0009664       plant-type cell wall organization
biological_process    GO:0008150       biological_process
biological_process    GO:0048767       root hair elongation
biological_process    GO:0010155       regulation of proton transport
biological_process    GO:0010817       regulation of hormone levels
biological_process    GO:0016567       protein ubiquitination
biological_process    GO:0046777       protein autophosphorylation
biological_process    GO:0000271       polysaccharide biosynthetic process
biological_process    GO:0009832       plant-type cell wall biogenesis
biological_process    GO:0009638       phototropism
biological_process    GO:0009825       multidimensional cell growth
biological_process    GO:0009932       cell tip growth
biological_process    GO:0009785       blue light signaling pathway
biological_process    GO:0043481       anthocyanin accumulation in tissues in response to UV light
cellular_component    GO:0005575       cellular_component
cellular_component    GO:0005886       plasma membrane
cellular_component    GO:0016021       integral component of membrane
molecular_function    GO:0005515       protein binding
molecular_function    GO:0016757       transferase activity, transferring glycosyl groups

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CSPI04G09440.1    CSPI04G09440.1    mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term     IPR Description                              Source    Source Term       Source Description     Alignment
IPR007657    Glycosyltransferase AER61, uncharacterised   PFAM      PF04577           DUF563                 coord: 224..387; score: 4.4
None         No IPR available                             PANTHER   PTHR20961         GLYCOSYLTRANSFERASE    coord: 61..451; score: 1.6E
None         No IPR available                             PANTHER   PTHR20961:SF33    SUBFAMILY NOT NAMED    coord: 61..451; score: 1.6E