CmoCh04G004260.1 (mRNA) Cucurbita moschata (Rifu)

Name: CmoCh04G004260.1
Type: mRNA
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: BnaA08g11520D protein
Location: Cmo_Chr04 : 2116929 .. 2119250 (-)
Sequence length: 1574
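
As a quick sanity check on the fields above, the genomic span can be derived from the Location value and compared with the transcript length. This is only a sketch: it assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` with its regular expression is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'Cmo_Chr04 : 2116929 .. 2119250 (-)' into its parts (hypothetical helper)."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cmo_Chr04 : 2116929 .. 2119250 (-)")
span = end - start + 1        # assumption: 1-based, inclusive coordinates
mrna_length = 1574            # "Sequence length" reported for this mRNA
intronic = span - mrna_length # bases inside the span that are not in the spliced mRNA
print(seqid, strand, span, intronic)
```

Under that assumption the feature spans 2322 bases on the minus strand, which would leave 748 bases of intron once the 1574-base spliced transcript is subtracted.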
The following sequences are available for this feature:

Gene sequence (with intron)

AATTTCCATTACAACTTAAAGTAGTGGGGGAAGAGAGAGAAACTCAAAATTCCATTATCATTGTTTGAGAGAGAAAATCAAACCACCACCACCGCCGCCGCCGCCATGGTTAAACCTTCCCACACCTCTAAGCCACACCAAAGAACAACCTCCATTCTCTTCTCTCCAAAGCTCTTCATCTATCTCCTCTCCATCTCCGCCATCCTCTTCATCTTTTTCCACATCCAATCTCTCCACCGCCATGTTCCTCCGCGCCCACAAAACAACCCCTCCTCCTCCTCCTCCTCCGCCGCCAAACTCCGCCGCTCCGTCACCTTCCTTCCCCTCAAAGACTTGCGTTACTCCCACAAACCCCTCGAGGGCCACACGTGGTTCATGAGCTCCATGTATGACATTCACGAGGATGGCGAGGTTCAATTCCAGCAATTCCCCTCTCCGGCAGCTGACGGCGATGCCCGGCTGTTATGCCTTAAAGGCAACGACACCCACGACGGCTCTTGGAACTATTACGCCGTCGCCTGGCCGGAAACACTGCCGGAAAATGCCACGGTGATGAAAGGCCTGAGCTTTGTGTCTTATAATCATTATAATTATGATAATATTTGGCACGGCTTGTCCGCTCTCATGCCCTTCGTTGCTTGGCACCAAATTCAAGGTACCCATTTCAAACTGATCATCAAAATCCATATATATATATATATATATATATATATACATTCATATTGGTCAATTTTTTGCTTAAATTTTAAGTTTTGTTCTTCTTATAATTAATGGAAGGAATTAAAATTTAATTCACGGAAAAATTAGGTAATCCGATTTTTTGTATGTTAATTTATTTTTTGAGGGTTCTCGACCCGCTCATAAATTAAGAGTGGAAGATTATGTCGTAATATCATGTGTCATTTTTACTAAATGATATGAATTTACGATAGGAAAGTGTGAAATCCCGGAGAGATGGATACTATACCATTGGGGGGAGTTGAGGTTGAAGATGGGGACATGGGTTAGTACAATAATGGAGGTGACATTCGGCGGGCCGCCAAAGATTGAAGCTTTTGACGGTATCAGCGAGGGGCAGCCGGTTTGTTTCGAGAAGGCGGTGGTGATGAGGCACAATGAGGGCGGAATGTCGAGGCAGCGTCGGATGGAGACTTATGATCTCATGAGATGCAAGGCTAGATTGTTTTGCAACTTCACCTCACCGAAGCCATCAGTGGCGACAGTTGGGATGACGTTGTTCATGAGAACTGGGGCTAGGTCGTTTAAGAATGAGACGGCGGTGGTGGAGATTTTTGGGGCGGAATGCACTAAAGTTGTCGGTTGTAGGCTTAGGGTGGCTCATTCTAATAATCTCACGTTTTGTGAGCAGGTATTGTTCTGTTCATCCAATCCAACTTGCGAACACCGATATATCGAGCTCGGGAAGTGAAATTTTGTGTATTTGTAGGTGAGTTTAATGGGGAAGACAGACATATTGGTGTCCCCACATGGAGCACAGCTGACAAACATGTTTCTAATGGATAGAAACAGCAGTGTAATGGAGTTCTTCCCAAAAGGCTGGCTTAAACTTGCAGGCATTGGCCAGTTTGTGTACCAATGGATGGCAAGCTGGTCTGGAATGAGGCATCAAGGTGCTTGGAGAGACCCTAATGGCTTAACCTGTCCCTATAATGAAGACGATCGTCGCTGCATGTCGATTTTCAAAGGTGGCACCATCGGGTACGTTCCTTTCCCTTTTCATGTTCGATATCGTTCTACTTTCGTCATTAAACTTATAATATATGTAGTAACTGTCCAAACCCACTCCTAGCCGATATTTGTTCTAATTGAGATTTCCGAGGCTTCTCCTCAAAATAAAAATAAAAAAAAAACTGTTTTAGGGAGAGGTTTCCACATCGATATGGGATATCATAATCCTCCTCCTTCAGAATCCAGCGTCCTTGCTGTTACACCCCCTTGTGTCCACCCTTTACAAGGCTCAGCCTCCACACTGGCACATC
GCCTGATGTTTGGCTCTAATACCATTTGTAACGACCTAAGCTCACCGCTAGCAAACATATTGTCCTCTTTAGACTTTTCTTTTAATACGTTTGTTTTTTATAATGTAAGCAGATATAATAGAACGTACTTTTCGGAGTGGGCTAAGAATGTTCTGAACGAGGTGAAGACGAGGAAGATGGATGAAGCAGCACAGGCCACTGCAAATCATGTTCATCAATGTTCTTGTAACTAATAATTATTTTTTGCCCAATTTTAGTGTTTTTTTTCCCTAATAAACAGTTTAAAATATTTATAATGGAATCAAAGAGACTGTTATAATGT

mRNA sequence

AATTTCCATTACAACTTAAAGTAGTGGGGGAAGAGAGAGAAACTCAAAATTCCATTATCATTGTTTGAGAGAGAAAATCAAACCACCACCACCGCCGCCGCCGCCATGGTTAAACCTTCCCACACCTCTAAGCCACACCAAAGAACAACCTCCATTCTCTTCTCTCCAAAGCTCTTCATCTATCTCCTCTCCATCTCCGCCATCCTCTTCATCTTTTTCCACATCCAATCTCTCCACCGCCATGTTCCTCCGCGCCCACAAAACAACCCCTCCTCCTCCTCCTCCTCCGCCGCCAAACTCCGCCGCTCCGTCACCTTCCTTCCCCTCAAAGACTTGCGTTACTCCCACAAACCCCTCGAGGGCCACACGTGGTTCATGAGCTCCATGTATGACATTCACGAGGATGGCGAGGTTCAATTCCAGCAATTCCCCTCTCCGGCAGCTGACGGCGATGCCCGGCTGTTATGCCTTAAAGGCAACGACACCCACGACGGCTCTTGGAACTATTACGCCGTCGCCTGGCCGGAAACACTGCCGGAAAATGCCACGGTGATGAAAGGCCTGAGCTTTGTGTCTTATAATCATTATAATTATGATAATATTTGGCACGGCTTGTCCGCTCTCATGCCCTTCGTTGCTTGGCACCAAATTCAAGGAAAGTGTGAAATCCCGGAGAGATGGATACTATACCATTGGGGGGAGTTGAGGTTGAAGATGGGGACATGGGTTAGTACAATAATGGAGGTGACATTCGGCGGGCCGCCAAAGATTGAAGCTTTTGACGGTATCAGCGAGGGGCAGCCGGTTTGTTTCGAGAAGGCGGTGGTGATGAGGCACAATGAGGGCGGAATGTCGAGGCAGCGTCGGATGGAGACTTATGATCTCATGAGATGCAAGGCTAGATTGTTTTGCAACTTCACCTCACCGAAGCCATCAGTGGCGACAGTTGGGATGACGTTGTTCATGAGAACTGGGGCTAGGTCGTTTAAGAATGAGACGGCGGTGGTGGAGATTTTTGGGGCGGAATGCACTAAAGTTGTCGGTTGTAGGCTTAGGGTGGCTCATTCTAATAATCTCACGTTTTGTGAGCAGGTGAGTTTAATGGGGAAGACAGACATATTGGTGTCCCCACATGGAGCACAGCTGACAAACATGTTTCTAATGGATAGAAACAGCAGTGTAATGGAGTTCTTCCCAAAAGGCTGGCTTAAACTTGCAGGCATTGGCCAGTTTGTGTACCAATGGATGGCAAGCTGGTCTGGAATGAGGCATCAAGGTGCTTGGAGAGACCCTAATGGCTTAACCTGTCCCTATAATGAAGACGATCGTCGCTGCATGTCGATTTTCAAAGGTGGCACCATCGGATATAATAGAACGTACTTTTCGGAGTGGGCTAAGAATGTTCTGAACGAGGTGAAGACGAGGAAGATGGATGAAGCAGCACAGGCCACTGCAAATCATGTTCATCAATGTTCTTGTAACTAATAATTATTTTTTGCCCAATTTTAGTGTTTTTTTTCCCTAATAAACAGTTTAAAATATTTATAATGGAATCAAAGAGACTGTTATAATGT

Coding sequence (CDS)

ATGGTTAAACCTTCCCACACCTCTAAGCCACACCAAAGAACAACCTCCATTCTCTTCTCTCCAAAGCTCTTCATCTATCTCCTCTCCATCTCCGCCATCCTCTTCATCTTTTTCCACATCCAATCTCTCCACCGCCATGTTCCTCCGCGCCCACAAAACAACCCCTCCTCCTCCTCCTCCTCCGCCGCCAAACTCCGCCGCTCCGTCACCTTCCTTCCCCTCAAAGACTTGCGTTACTCCCACAAACCCCTCGAGGGCCACACGTGGTTCATGAGCTCCATGTATGACATTCACGAGGATGGCGAGGTTCAATTCCAGCAATTCCCCTCTCCGGCAGCTGACGGCGATGCCCGGCTGTTATGCCTTAAAGGCAACGACACCCACGACGGCTCTTGGAACTATTACGCCGTCGCCTGGCCGGAAACACTGCCGGAAAATGCCACGGTGATGAAAGGCCTGAGCTTTGTGTCTTATAATCATTATAATTATGATAATATTTGGCACGGCTTGTCCGCTCTCATGCCCTTCGTTGCTTGGCACCAAATTCAAGGAAAGTGTGAAATCCCGGAGAGATGGATACTATACCATTGGGGGGAGTTGAGGTTGAAGATGGGGACATGGGTTAGTACAATAATGGAGGTGACATTCGGCGGGCCGCCAAAGATTGAAGCTTTTGACGGTATCAGCGAGGGGCAGCCGGTTTGTTTCGAGAAGGCGGTGGTGATGAGGCACAATGAGGGCGGAATGTCGAGGCAGCGTCGGATGGAGACTTATGATCTCATGAGATGCAAGGCTAGATTGTTTTGCAACTTCACCTCACCGAAGCCATCAGTGGCGACAGTTGGGATGACGTTGTTCATGAGAACTGGGGCTAGGTCGTTTAAGAATGAGACGGCGGTGGTGGAGATTTTTGGGGCGGAATGCACTAAAGTTGTCGGTTGTAGGCTTAGGGTGGCTCATTCTAATAATCTCACGTTTTGTGAGCAGGTGAGTTTAATGGGGAAGACAGACATATTGGTGTCCCCACATGGAGCACAGCTGACAAACATGTTTCTAATGGATAGAAACAGCAGTGTAATGGAGTTCTTCCCAAAAGGCTGGCTTAAACTTGCAGGCATTGGCCAGTTTGTGTACCAATGGATGGCAAGCTGGTCTGGAATGAGGCATCAAGGTGCTTGGAGAGACCCTAATGGCTTAACCTGTCCCTATAATGAAGACGATCGTCGCTGCATGTCGATTTTCAAAGGTGGCACCATCGGATATAATAGAACGTACTTTTCGGAGTGGGCTAAGAATGTTCTGAACGAGGTGAAGACGAGGAAGATGGATGAAGCAGCACAGGCCACTGCAAATCATGTTCATCAATGTTCTTGTAACTAA
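
The coding sequence can be checked against the BLAST query lines by translating it. The sketch below assumes the standard genetic code (NCBI translation table 1), which is usual for plant nuclear genes but is not stated on this page. The `translate` function is illustrative and only the first 36 nucleotides of the CDS are used.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... (assumed table 1)
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_prefix = "ATGGTTAAACCTTCCCACACCTCTAAGCCACACCAA"  # first 36 nt of the CDS above
peptide = translate(cds_prefix)
print(peptide)
```

The resulting peptide, MVKPSHTSKPHQ, matches the first twelve residues of the Query lines in the alignments that follow.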
BLAST of CmoCh04G004260.1 vs. TrEMBL
Match: A0A0A0KXZ9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G112630 PE=4 SV=1)

HSP 1 Score: 768.5 bits (1983), Expect = 4.7e-219
Identity = 362/462 (78.35%), Positives = 394/462 (85.28%), Query Frame = 1

Query: 1   MVKPSHTSKPHQRTTSI---LFSPKLFIYLLSISAILFIFFHIQSLHRHVPPRPQNNPSS 60
           MVK    SK   RTT     L SPKLF+YLLSISA+LFI FHI SLH HVPP P      
Sbjct: 1   MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPP------ 60

Query: 61  SSSSAAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAADGDA 120
           SS  AAKLRRSVTFLPLKDLRYS+K L GHTWFMSS+YDI E+GEVQ+QQFPSP  DGD 
Sbjct: 61  SSIVAAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDE 120

Query: 121 RLLCLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALMPFV 180
           R+LCLKG DTHDGSWNYY +AWPE LPENA V KG+SFVSYNHY+Y NIWHGLSALMPFV
Sbjct: 121 RMLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFV 180

Query: 181 AWHQIQGKCEIPERWILYHWGELRLKMGTWVSTIMEVTFGGPPKIEAFDGISEGQPVCFE 240
           AWHQIQGKCE+PERWILYHWGELRL+MG WVST+ME TFG P + EAF+ ISEGQPVCFE
Sbjct: 181 AWHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFE 240

Query: 241 KAVVMRHNEGGMSRQRRMETYDLMRCKARLFCNFTSPKPSVATVGMTLFMRTGARSFKNE 300
           KAVVMRHNEGGMSRQRRMETYD MRCKARLFCN TSP+P  A VGMT+ MRTG RSF+NE
Sbjct: 241 KAVVMRHNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNE 300

Query: 301 TAVVEIFGAECTKVVGCRLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMDRNS 360
           T VVEIFG EC KV GCRL VA+SNNLTFCEQVSLMGKTDIL+SPHGAQLTNM LM+RNS
Sbjct: 301 TTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNS 360

Query: 361 SVMEFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPNG-LTCPYNEDDRRCMSIFKG 420
           SVMEFFPKGWL+LAGIGQ+VY W+ASWSGMRHQGAWRDPN  L CPY+  DRRCMSI+K 
Sbjct: 361 SVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKA 420

Query: 421 GTIGYNRTYFSEWAKNVLNEVKTRKMDEAAQATANHVHQCSC 459
           GTIGYNRT+FSEWAK+VLNEVK RKM+EA + T N +H+CSC
Sbjct: 421 GTIGYNRTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSC 456
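
The Identity and Positives percentages in the HSP header are ratios over the alignment length, which in standard BLAST reports includes gap columns. A minimal sketch that reproduces the figures for this first hit follows. The `percent` helper is illustrative, not part of any BLAST tool.

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage string with two decimals."""
    return f"{100 * matches / aligned_length:.2f}"

# Counts taken from the HSP header of the A0A0A0KXZ9_CUCSA hit.
identity = percent(362, 462)
positives = percent(394, 462)
print(identity, positives)
```

The same calculation applies to every HSP header on this page, using the counts shown before each percentage.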

BLAST of CmoCh04G004260.1 vs. TrEMBL
Match: B9S9E8_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0884460 PE=4 SV=1)

HSP 1 Score: 632.5 bits (1630), Expect = 4.0e-178
Identity = 302/483 (62.53%), Positives = 364/483 (75.36%), Query Frame = 1

Query: 1   MVKPSHTS--KPHQRTTS--ILFSPKLFIYLLSISAILFIFFHIQSLHRHVPPRPQN--- 60
           +++P   S  K HQ + S    F+PKL ++LLSI   LF  F IQ LH   PP   +   
Sbjct: 9   VIRPKFHSPKKAHQLSMSSFCFFTPKLSVFLLSICVTLFTLFQIQCLH--YPPSLSSPTW 68

Query: 61  -------------NPSSSSSSAAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHED 120
                        N    S +  +L+++VTFLPLKD+RY  K L+GHTWFMSSMYD HE+
Sbjct: 69  SLMQKWQEFATTCNQELKSMAEMRLKQAVTFLPLKDIRYQEKALQGHTWFMSSMYDTHEE 128

Query: 121 GEVQFQQFPSPAADGDARLLCLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNH 180
           GEVQ+QQFPS ++ G  RLLCLKGNDTHDGSWN YA+AWPETLP NAT++KGL+FVSYNH
Sbjct: 129 GEVQYQQFPSESSKG--RLLCLKGNDTHDGSWNSYALAWPETLPLNATLLKGLTFVSYNH 188

Query: 181 YNYDNIWHGLSALMPFVAWHQIQGKCEIPERWILYHWGELRLKMGTWVSTIMEVTFGGPP 240
           Y+Y+NIWHGLSA++PFVAWH+  G  E+P RWILYHWGELR  MG W+ST+ E TFG PP
Sbjct: 189 YDYNNIWHGLSAIVPFVAWHKGNGG-ELPSRWILYHWGELRFNMGLWLSTLTEATFGSPP 248

Query: 241 KIEAFDGISEGQPVCFEKAVVMRHNEGGMSRQRRMETYDLMRCKARLFCNFTSP------ 300
            IE F   +  +P+CFEKAVVMRHNEGGMS  RR+ETYD MRCKAR +CN +        
Sbjct: 249 NIEGFGWANNNEPICFEKAVVMRHNEGGMSTDRRIETYDFMRCKARAYCNVSLEGGNMVS 308

Query: 301 KPSVATVGMTLFMRTGARSFKNETAVVEIFGAECTKVVGCRLRVAHSNNLTFCEQVSLMG 360
           +  +  +GMTLFMRTG RSFKNE+AV+ IF  EC KV GCRL VA+SNNLTFCEQV LM 
Sbjct: 309 EKGLPVIGMTLFMRTGPRSFKNESAVIRIFEKECAKVDGCRLMVAYSNNLTFCEQVKLMS 368

Query: 361 KTDILVSPHGAQLTNMFLMDRNSSVMEFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWR 420
            TDIL+SPHGAQLTNMFLM++NSSVMEFFPKGWLKLAG+GQFVY W+ASWSGM+HQGAWR
Sbjct: 369 MTDILISPHGAQLTNMFLMNKNSSVMEFFPKGWLKLAGVGQFVYHWIASWSGMKHQGAWR 428

Query: 421 DPNGLTCPYNEDDRRCMSIFKGGTIGYNRTYFSEWAKNVLNEVKTRKMDEAAQATANHVH 458
           DP+G  CPY +DDRRCMSI+KGG IG+N T+FSEW +NVLNEVK RK +E +  + + ++
Sbjct: 429 DPDGDHCPYPDDDRRCMSIYKGGKIGFNETHFSEWGRNVLNEVKLRKAEEMSHKSNDLIY 486

BLAST of CmoCh04G004260.1 vs. TrEMBL
Match: A0A061E286_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_007404 PE=4 SV=1)

HSP 1 Score: 631.3 bits (1627), Expect = 9.0e-178
Identity = 300/466 (64.38%), Positives = 360/466 (77.25%), Query Frame = 1

Query: 19  FSPKLFIYLLSISAILFIFFHIQSLHRHVPPRPQNNPSSS-------------------- 78
           +SPKL +Y+L+    L +   I+SLH   P  P   PS S                    
Sbjct: 27  YSPKLSLYILAFCVTLLLLLQIRSLHTP-PISPSPLPSWSFLQQWQEVINKTLASPNCTQ 86

Query: 79  ---SSSAAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAADG 138
               S   KLR SVTFLPLKDLRY+++PL GHTWFMSSMYD HE+GEVQ+QQFPS +++G
Sbjct: 87  DVLESMTQKLRDSVTFLPLKDLRYANQPLPGHTWFMSSMYDTHEEGEVQYQQFPSDSSNG 146

Query: 139 DARLLCLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALMP 198
             RLLCLKG DTHDGSWNYYA+AWPE LP NAT+MKGL+FV+YNHYNYDNIWHGLSA++P
Sbjct: 147 --RLLCLKGRDTHDGSWNYYALAWPEALPSNATLMKGLTFVAYNHYNYDNIWHGLSAMVP 206

Query: 199 FVAWHQIQGKCEIPERWILYHWGELRLKMGTWVSTIMEVTFGGPPKIEAFDGISEG-QPV 258
           FVAWH+ +  CE P RWILY WGELR KMGTW++T+M+ TFG  P IE F+GI +  QPV
Sbjct: 207 FVAWHR-KNSCETPTRWILYRWGELRFKMGTWLNTLMKATFGQAPYIEGFNGIEDDDQPV 266

Query: 259 CFEKAVVMRHNEGGMSRQRRMETYDLMRCKARLFCNFTSPKPSVATVGMTLFMRTGARSF 318
           CFEKAVVMRHNEGGMSR+RRME YDL+RCKAR++CN +  +     +GMTL MRTG RSF
Sbjct: 267 CFEKAVVMRHNEGGMSRERRMEVYDLIRCKARVYCNVSGDQKRPG-IGMTLLMRTGPRSF 326

Query: 319 KNETAVVEIFGAECTKVVGCRLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMD 378
           +NETAV+ IF  EC KV GC+L VA+SNNLT CEQV LM  TDIL+SPHGAQLTN+FLMD
Sbjct: 327 RNETAVIGIFEKECMKVEGCQLIVAYSNNLTICEQVKLMSLTDILISPHGAQLTNLFLMD 386

Query: 379 RNSSVMEFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPNGLTCPYNEDDRRCMSIF 438
           RNSSVMEFFPKGWLKLAG+GQ+VY WMASWSGM H+G WRDP+G  CPY++DDRRCMS++
Sbjct: 387 RNSSVMEFFPKGWLKLAGVGQYVYHWMASWSGMIHRGDWRDPDGENCPYSDDDRRCMSLY 446

Query: 439 KGGTIGYNRTYFSEWAKNVLNEVKTRKMDEAAQATANHVHQ-CSCN 460
           K G IGYN T+F+EWA+NVLN+VKT K++EA++   N + + C C+
Sbjct: 447 KSGRIGYNETHFAEWARNVLNDVKTSKLEEASKHAQNSISKTCDCS 487

BLAST of CmoCh04G004260.1 vs. TrEMBL
Match: A0A067LA02_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_24948 PE=4 SV=1)

HSP 1 Score: 627.9 bits (1618), Expect = 1.0e-176
Identity = 300/451 (66.52%), Positives = 350/451 (77.61%), Query Frame = 1

Query: 23  LFIYLLSISAILFIFFHIQSLHRHVPP-------RPQNNPSSSS---SSAAKLRRSVTFL 82
           L I+LL+I   LF  F IQSLH    P       R Q   + +    S + KL+ SVTFL
Sbjct: 6   LSIFLLTIFVALFTLFQIQSLHTTTSPSFSPLIHRWQKLTTCNQEIKSLSEKLKESVTFL 65

Query: 83  PLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAADGDARLLCLKGNDTHDGSW 142
           PLKDLRY  K L+GHTWFMSSMYD  E+G VQ+QQFPS +++   R+LCLKGNDTHDGSW
Sbjct: 66  PLKDLRYQDKALQGHTWFMSSMYDTQEEGGVQYQQFPSNSSN--FRILCLKGNDTHDGSW 125

Query: 143 NYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALMPFVAWHQIQGKCEIPERW 202
           N YA+AWPETLP NAT+MKGL+FVSYNHYNYDNIWHGLSA++PFVAWH I+  CE P RW
Sbjct: 126 NSYALAWPETLPSNATLMKGLTFVSYNHYNYDNIWHGLSAIVPFVAWH-IRNGCENPNRW 185

Query: 203 ILYHWGELRLKMGTWVSTIMEVTFGGP-PKIEAFDGISEGQPVCFEKAVVMRHNEGGMSR 262
           +LYHWGELR +MG W+  +   TFGG  P +E F+  ++  P+CFEKAVVMRHNEGGMSR
Sbjct: 186 VLYHWGELRFQMGAWLRNLTGATFGGEEPNVERFEWANKNDPICFEKAVVMRHNEGGMSR 245

Query: 263 QRRMETYDLMRCKARLFCNFTS----PKPSVATVGMTLFMRTGARSFKNETAVVEIFGAE 322
           +RR+ETYDL+RCKAR+ CN +      +  +  +GMTLFMRTG RSFKNE+AV+ IF  E
Sbjct: 246 ERRIETYDLLRCKARVSCNVSLLQRVNEKGLPLIGMTLFMRTGPRSFKNESAVIGIFEKE 305

Query: 323 CTKVVGCRLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMDRNSSVMEFFPKGW 382
           C KV GC+L VA+SNNLTFCEQV LMG TDILVSPHGAQLTNMFLMDRNSSVMEFFPKGW
Sbjct: 306 CAKVEGCKLMVAYSNNLTFCEQVKLMGMTDILVSPHGAQLTNMFLMDRNSSVMEFFPKGW 365

Query: 383 LKLAGIGQFVYQWMASWSGMRHQGAWRDPNGLTCPYNEDDRRCMSIFKGGTIGYNRTYFS 442
           LKLAG+GQFVY W+ASWSGMRHQGAWRDP G  CP+ EDDRRCMS +KGG IG+N TYFS
Sbjct: 366 LKLAGVGQFVYHWIASWSGMRHQGAWRDPTGDHCPFGEDDRRCMSFYKGGKIGFNETYFS 425

Query: 443 EWAKNVLNEVKTRKMDEAAQATANHVHQCSC 459
           EWA+NVL EVKTRK++E    +A     C+C
Sbjct: 426 EWARNVLTEVKTRKLEEVKNNSAASTSSCAC 453

BLAST of CmoCh04G004260.1 vs. TrEMBL
Match: M5VM83_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa1027207mg PE=4 SV=1)

HSP 1 Score: 624.4 bits (1609), Expect = 1.1e-175
Identity = 298/475 (62.74%), Positives = 366/475 (77.05%), Query Frame = 1

Query: 5   SHTSKPHQRTTSILFSPKLFIYLLSISAILFIFFHIQSL------------HRHVPPRPQ 64
           S+ +KPH+ T S  +SPKL  Y+ S    LF+  HI++L            H+H   R  
Sbjct: 16  SNPNKPHRHTLSF-YSPKLSFYIFSACVTLFVVLHIKTLQTDQNPTSSLWFHKHQRQRVT 75

Query: 65  NNPSSSSSSAAKLRRSVTFLPLKDLRYSHKP-LEGHTWFMSSMYDIH-EDGEVQFQQFPS 124
            N  ++S+ A KLR++VTFLPLKDLRY+    L+GHTWFMSSMYD   E+G  Q+QQFPS
Sbjct: 76  TN--TTSTMAEKLRQAVTFLPLKDLRYAGPAALQGHTWFMSSMYDKQDEEGGAQYQQFPS 135

Query: 125 PAADGDARLLCLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGL 184
           P++ G  RLLCLKG D HDGSWN YA+A+PE LP N T+MKGL+FVSYNHYNYDNIWHGL
Sbjct: 136 PSSHG--RLLCLKGRDNHDGSWNSYALAFPEALPHNTTLMKGLTFVSYNHYNYDNIWHGL 195

Query: 185 SALMPFVAWHQIQGKCEIPERWILYHWGELRLKMGTWVSTIMEVTFGGPPKIEAFDGISE 244
           +A++PFV+WH I+  C +P+RWILYHWGE+R +MG W+ ++ME TF G P++E FD + E
Sbjct: 196 TAVVPFVSWH-IKNSCAVPDRWILYHWGEIRARMGVWLGSLMEATFSGAPRVEVFDDVEE 255

Query: 245 GQPVCFEKAVVMRHNEGGMSRQRRMETYDLMRCKARLFCN-------FTSPKPSVATVGM 304
           G+ VCFEKAVVMRHNEGGMSR++R+E +DLMRCKARLFCN       + S +     +G+
Sbjct: 256 GRAVCFEKAVVMRHNEGGMSREKRLEVFDLMRCKARLFCNVSLDEQNYKSTRSVTKRIGV 315

Query: 305 TLFMRTGARSFKNETAVVEIFGAECTKVVGCRLRVAHSNNLTFCEQVSLMGKTDILVSPH 364
           TLFMRTG RSFKN+TAV+ IF  EC KV GCRL VA+SNNLTFC+QV +M  TDILVSPH
Sbjct: 316 TLFMRTGPRSFKNDTAVIGIFERECAKVDGCRLMVAYSNNLTFCDQVKVMSLTDILVSPH 375

Query: 365 GAQLTNMFLMDRNSSVMEFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPNGLTCPY 424
           GAQLTNMFLM+RNSSVMEFFPKGWLKLAG+GQ+V+ W+ASWSGMRH+GAWRDP+G TC Y
Sbjct: 376 GAQLTNMFLMNRNSSVMEFFPKGWLKLAGVGQYVFHWIASWSGMRHKGAWRDPDGDTCQY 435

Query: 425 NEDDRRCMSIFKGGTIGYNRTYFSEWAKNVLNEVKTRKMDEAAQATANHVHQCSC 459
            EDDRRCMSI+K G IG+N TYF+ W +NV++EVKTRKM+EA  A  N  + CSC
Sbjct: 436 GEDDRRCMSIYKHGKIGHNETYFAGWTRNVIDEVKTRKMEEAKMAIPNS-NGCSC 483

BLAST of CmoCh04G004260.1 vs. TAIR10
Match: AT4G33600.1 (AT4G33600.1 unknown protein)

HSP 1 Score: 550.8 bits (1418), Expect = 7.8e-157
Identity = 259/433 (59.82%), Positives = 319/433 (73.67%), Query Frame = 1

Query: 11  HQRTTSILFSPKLFIYLLSISAILFIFFHIQSLHRHVPPRPQNNPSSSSSSAAKLRRSVT 70
           H     IL  P LF YL                 +   P    + + ++    KLR SVT
Sbjct: 40  HITQQPILLPPSLFTYLKE---------------QQQEPEQIKSENETAYLVEKLRESVT 99

Query: 71  FLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAADGDARLLCLKGNDTHDG 130
           FLPLKDLR+S+KPLEGHTWFMSS+YD    GEVQ+Q+FPS ++ G  RLLCLKG D HDG
Sbjct: 100 FLPLKDLRFSNKPLEGHTWFMSSLYDNQTKGEVQYQEFPSESSKG--RLLCLKGVDEHDG 159

Query: 131 SWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALMPFVAWHQIQGKCEIPE 190
           SWNYYA+AWP+ LP NA++ +GL+FVSYNHY+Y N+WHGLSA++PFVAW  ++ +CE P+
Sbjct: 160 SWNYYALAWPQALPVNASLQEGLTFVSYNHYDYGNMWHGLSAMVPFVAW-SLRHQCENPQ 219

Query: 191 RWILYHWGELRLKMGTWVSTIMEVTFGGPPKIEAFDGISEGQPVCFEKAVVMRHNEGGMS 250
           RW+LYHWGELR KMG W++ I+  T+G   +   F    + +PVCFEKAVVMRHNEGGMS
Sbjct: 220 RWVLYHWGELRFKMGNWLNEIITATYGQNTEFLRFR--DKNRPVCFEKAVVMRHNEGGMS 279

Query: 251 RQRRMETYDLMRCKARLFCNFTSPKPSVATVGMTLFMRTGARSFKNETAVVEIFGAECTK 310
           R+RRME +DL+RCKAR +CN +  + S + +GMTL MRTG RSFKNE+AV++IF  EC  
Sbjct: 280 RERRMEVFDLIRCKARHYCNISLSETSKSRIGMTLLMRTGPRSFKNESAVIDIFKRECKN 339

Query: 311 VVGCRLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMDRNSSVMEFFPKGWLKL 370
           V GC L+V++SNNLTFCEQV LM  TD+LVSPHGAQLTN+ LMDRNSSVMEF PKGW KL
Sbjct: 340 VEGCELKVSYSNNLTFCEQVELMRMTDVLVSPHGAQLTNLVLMDRNSSVMEFLPKGWRKL 399

Query: 371 AGIGQFVYQWMASWSGMRHQGAWRDPNGLTCPYNEDDRRCM-SIFKGGTIGYNRTYFSEW 430
           AG+GQ VYQW   WSGMRH+G+W DP+G  C + + DRRCM S++K G IGYN TYF EW
Sbjct: 400 AGVGQLVYQWGTRWSGMRHEGSWHDPDGEICQFPDTDRRCMSSVYKNGRIGYNETYFGEW 452

Query: 431 AKNVLNEVKTRKM 443
           AK+VL + K RKM
Sbjct: 460 AKSVLGKFKERKM 452

BLAST of CmoCh04G004260.1 vs. TAIR10
Match: AT4G33590.1 (AT4G33590.1 unknown protein)

HSP 1 Score: 545.4 bits (1404), Expect = 3.3e-155
Identity = 256/455 (56.26%), Positives = 330/455 (72.53%), Query Frame = 1

Query: 12  QRTTSILFSPKLFIYLLSISAILFIF-----FHIQSLHRHVPP---------RPQNNPSS 71
           +R  + L S K ++ +L +   +F+      F I      +PP           + + + 
Sbjct: 9   RRLVTCLSSLKFYLNVLCLVVTVFVLLQICSFQITQRSLSLPPALLTYLKHNHEEVSENK 68

Query: 72  SSSSAAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAADGDA 131
           ++S   KLR SVTFLPLKD R+S+KPLEGHTWFMSS+YD    GE Q+Q+FPS ++ G  
Sbjct: 69  TASLVEKLRESVTFLPLKDYRFSNKPLEGHTWFMSSLYDNQTKGEAQYQEFPSDSSKG-- 128

Query: 132 RLLCLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALMPFV 191
           RLLCLKG D HDGSWN YA+AWPE LP NA +  GL+FVSYN Y+Y N+WHGL+A++PF+
Sbjct: 129 RLLCLKGVDEHDGSWNSYALAWPEALPTNAILQDGLTFVSYNQYDYGNLWHGLTAVVPFI 188

Query: 192 AWHQIQGKCEIPERWILYHWGELRLKMGTWVSTIMEVTFGGPPKIEAFDGISEGQPVCFE 251
           AW  ++ +CE P++W+LYHWGELR  MG W+S I+  T+G  P    F  + + +PVCFE
Sbjct: 189 AW-SLRNQCEKPQKWVLYHWGELRFGMGHWLSEIVTATYGQEPDFLRF--VDDDKPVCFE 248

Query: 252 KAVVMRHNEGGMSRQRRMETYDLMRCKARLFCNFTSPKPSVATVGMTLFMRTGARSFKNE 311
           KAVVMRHNEGGMSR+RRME +DL+RCKAR +CN +S   S   +GMTL +RTGARSF+NE
Sbjct: 249 KAVVMRHNEGGMSRERRMEAFDLIRCKARNYCNISSSVASKPRIGMTLLLRTGARSFRNE 308

Query: 312 TAVVEIFGAECTKVVGCRLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMDRNS 371
           + V+++F  EC +V GC + V++SNNL+FCEQV LM KTD+LVSPHGAQLTN+FLMD+NS
Sbjct: 309 SMVIDVFKKECKRVDGCEISVSYSNNLSFCEQVELMKKTDVLVSPHGAQLTNLFLMDKNS 368

Query: 372 SVMEFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPNGLTCPYNEDDRRCMSIFKGG 431
           SVMEFFPKGWLKLAG+GQ V+QW A+WSGMRH+G+W DP G  C + + DRRCMSI+K  
Sbjct: 369 SVMEFFPKGWLKLAGVGQLVFQWGANWSGMRHEGSWHDPVGEICQFPDTDRRCMSIYKNA 428

Query: 432 TIGYNRTYFSEWAKNVLNEVKTRKMDEAAQATANH 453
            IGYN TYF EWA+ VL +   R+M E A+   NH
Sbjct: 429 MIGYNETYFGEWARRVLGKFSIREMKELAE--CNH 456

BLAST of CmoCh04G004260.1 vs. NCBI nr
Match: gi|778692014|ref|XP_011653390.1| (PREDICTED: uncharacterized protein LOC101219216 [Cucumis sativus])

HSP 1 Score: 768.5 bits (1983), Expect = 6.8e-219
Identity = 362/462 (78.35%), Positives = 394/462 (85.28%), Query Frame = 1

Query: 1   MVKPSHTSKPHQRTTSI---LFSPKLFIYLLSISAILFIFFHIQSLHRHVPPRPQNNPSS 60
           MVK    SK   RTT     L SPKLF+YLLSISA+LFI FHI SLH HVPP P      
Sbjct: 1   MVKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPP------ 60

Query: 61  SSSSAAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAADGDA 120
           SS  AAKLRRSVTFLPLKDLRYS+K L GHTWFMSS+YDI E+GEVQ+QQFPSP  DGD 
Sbjct: 61  SSIVAAKLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDE 120

Query: 121 RLLCLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALMPFV 180
           R+LCLKG DTHDGSWNYY +AWPE LPENA V KG+SFVSYNHY+Y NIWHGLSALMPFV
Sbjct: 121 RMLCLKGRDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFV 180

Query: 181 AWHQIQGKCEIPERWILYHWGELRLKMGTWVSTIMEVTFGGPPKIEAFDGISEGQPVCFE 240
           AWHQIQGKCE+PERWILYHWGELRL+MG WVST+ME TFG P + EAF+ ISEGQPVCFE
Sbjct: 181 AWHQIQGKCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFE 240

Query: 241 KAVVMRHNEGGMSRQRRMETYDLMRCKARLFCNFTSPKPSVATVGMTLFMRTGARSFKNE 300
           KAVVMRHNEGGMSRQRRMETYD MRCKARLFCN TSP+P  A VGMT+ MRTG RSF+NE
Sbjct: 241 KAVVMRHNEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNE 300

Query: 301 TAVVEIFGAECTKVVGCRLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMDRNS 360
           T VVEIFG EC KV GCRL VA+SNNLTFCEQVSLMGKTDIL+SPHGAQLTNM LM+RNS
Sbjct: 301 TTVVEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNS 360

Query: 361 SVMEFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPNG-LTCPYNEDDRRCMSIFKG 420
           SVMEFFPKGWL+LAGIGQ+VY W+ASWSGMRHQGAWRDPN  L CPY+  DRRCMSI+K 
Sbjct: 361 SVMEFFPKGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKA 420

Query: 421 GTIGYNRTYFSEWAKNVLNEVKTRKMDEAAQATANHVHQCSC 459
           GTIGYNRT+FSEWAK+VLNEVK RKM+EA + T N +H+CSC
Sbjct: 421 GTIGYNRTHFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSC 456

BLAST of CmoCh04G004260.1 vs. NCBI nr
Match: gi|659125841|ref|XP_008462883.1| (PREDICTED: uncharacterized protein LOC103501161 isoform X1 [Cucumis melo])

HSP 1 Score: 713.4 bits (1840), Expect = 2.6e-202
Identity = 324/405 (80.00%), Positives = 356/405 (87.90%), Query Frame = 1

Query: 55  PSSSSSSAAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAAD 114
           PS        LRRSVTFLPLKDLRYS+K L GHTWFMSS+YDI E+GEVQ+QQFPSP  D
Sbjct: 9   PSLXXXXXXXLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVD 68

Query: 115 GDARLLCLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALM 174
           GD R+LCLKG DTHDGSWNYY +AWPE LPENATVMKG+SFVSYNHY+Y NIWHGLSALM
Sbjct: 69  GDERMLCLKGRDTHDGSWNYYGLAWPEGLPENATVMKGVSFVSYNHYDYQNIWHGLSALM 128

Query: 175 PFVAWHQIQGKCEIPERWILYHWGELRLKMGTWVSTIMEVTFGGPPKIEAFDGISEGQPV 234
           PFVAWHQIQGKCE+PERWILYHWGELRL+MG WV+T+ME TFG P +IEAF+GISEGQPV
Sbjct: 129 PFVAWHQIQGKCEVPERWILYHWGELRLRMGKWVNTLMEATFGAPIRIEAFEGISEGQPV 188

Query: 235 CFEKAVVMRHNEGGMSRQRRMETYDLMRCKARLFCNFTSPKPSVATVGMTLFMRTGARSF 294
           CFEKAVVMRHNEGGMSRQRRMETYD MRCKARL CN TSP+P    VGMT+ MRTG RSF
Sbjct: 189 CFEKAVVMRHNEGGMSRQRRMETYDFMRCKARLLCNLTSPEPLSGAVGMTMLMRTGPRSF 248

Query: 295 KNETAVVEIFGAECTKVVGCRLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMD 354
           +NET V EIFG EC KV GCRL VA+SNNLTFCEQVSLMGKTDIL+SPHGAQLTNM LM+
Sbjct: 249 RNETTVAEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMN 308

Query: 355 RNSSVMEFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPNG-LTCPYNEDDRRCMSI 414
           RNSSVMEFFPKGWL+LAGIGQ+VY W+ASWSGM+HQGAWRDPN  L CPY+ +DRRCMS 
Sbjct: 309 RNSSVMEFFPKGWLELAGIGQYVYHWLASWSGMKHQGAWRDPNSTLPCPYSPNDRRCMSF 368

Query: 415 FKGGTIGYNRTYFSEWAKNVLNEVKTRKMDEAAQATANHVHQCSC 459
           +KGGTIGYNRTYFSEWAK+VLNEVK RK++EA + T N VH+CSC
Sbjct: 369 YKGGTIGYNRTYFSEWAKSVLNEVKMRKIEEATKFTTNQVHECSC 413

BLAST of CmoCh04G004260.1 vs. NCBI nr
Match: gi|659125843|ref|XP_008462884.1| (PREDICTED: uncharacterized protein LOC103501161 isoform X2 [Cucumis melo])

HSP 1 Score: 652.9 bits (1683), Expect = 4.1e-184
Identity = 296/367 (80.65%), Positives = 323/367 (88.01%), Query Frame = 1

Query: 55  PSSSSSSAAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAAD 114
           PS        LRRSVTFLPLKDLRYS+K L GHTWFMSS+YDI E+GEVQ+QQFPSP  D
Sbjct: 9   PSLXXXXXXXLRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVD 68

Query: 115 GDARLLCLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALM 174
           GD R+LCLKG DTHDGSWNYY +AWPE LPENATVMKG+SFVSYNHY+Y NIWHGLSALM
Sbjct: 69  GDERMLCLKGRDTHDGSWNYYGLAWPEGLPENATVMKGVSFVSYNHYDYQNIWHGLSALM 128

Query: 175 PFVAWHQIQGKCEIPERWILYHWGELRLKMGTWVSTIMEVTFGGPPKIEAFDGISEGQPV 234
           PFVAWHQIQGKCE+PERWILYHWGELRL+MG WV+T+ME TFG P +IEAF+GISEGQPV
Sbjct: 129 PFVAWHQIQGKCEVPERWILYHWGELRLRMGKWVNTLMEATFGAPIRIEAFEGISEGQPV 188

Query: 235 CFEKAVVMRHNEGGMSRQRRMETYDLMRCKARLFCNFTSPKPSVATVGMTLFMRTGARSF 294
           CFEKAVVMRHNEGGMSRQRRMETYD MRCKARL CN TSP+P    VGMT+ MRTG RSF
Sbjct: 189 CFEKAVVMRHNEGGMSRQRRMETYDFMRCKARLLCNLTSPEPLSGAVGMTMLMRTGPRSF 248

Query: 295 KNETAVVEIFGAECTKVVGCRLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMD 354
           +NET V EIFG EC KV GCRL VA+SNNLTFCEQVSLMGKTDIL+SPHGAQLTNM LM+
Sbjct: 249 RNETTVAEIFGKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMN 308

Query: 355 RNSSVMEFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPNG-LTCPYNEDDRRCMSI 414
           RNSSVMEFFPKGWL+LAGIGQ+VY W+ASWSGM+HQGAWRDPN  L CPY+ +DRRCMS 
Sbjct: 309 RNSSVMEFFPKGWLELAGIGQYVYHWLASWSGMKHQGAWRDPNSTLPCPYSPNDRRCMSF 368

Query: 415 FKGGTIG 421
           +KGGTIG
Sbjct: 369 YKGGTIG 375

BLAST of CmoCh04G004260.1 vs. NCBI nr
Match: gi|1009151277|ref|XP_015893468.1| (PREDICTED: uncharacterized protein LOC107427594 [Ziziphus jujuba])

HSP 1 Score: 645.6 bits (1664), Expect = 6.6e-182
Identity = 315/488 (64.55%), Positives = 369/488 (75.61%), Query Frame = 1

Query: 3   KPSHTSKPHQRTT-SIL--FSPKLFIYLLSISAILFIFFHIQSLHRHVPPRPQNNPSS-- 62
           K  H  K H RT  SIL  +SP+L I++L+    LF+ FHIQSL    PP   + P S  
Sbjct: 18  KAHHHPKQHSRTIFSILPLYSPRLSIFILATCVALFVLFHIQSLQ--TPPSSPSPPWSLM 77

Query: 63  ----------------------SSSSAAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMY 122
                                 +++   KLR SVTFLPLKDLRY+H  L+GHTWFMSSMY
Sbjct: 78  HQYWQRATTTRIFTNCTTNQLANTTVTDKLRDSVTFLPLKDLRYAHAALDGHTWFMSSMY 137

Query: 123 DIHEDGEVQFQQFPSPAADGDARLLCLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSF 182
           D HE+GEVQ+QQFPS ++ G  R+LCLKG DTHDGSWN YA+AW E LP NAT MKGL+F
Sbjct: 138 DTHEEGEVQYQQFPSESSKG--RILCLKGRDTHDGSWNSYALAWQEALPHNATFMKGLTF 197

Query: 183 VSYNHYNYDNIWHGLSALMPFVAWHQIQGKCEIPERWILYHWGELRLKMGTWVSTIMEVT 242
           VSYNHYNY+NIWHGLSA+MPFVAW++  G  ++P+RW+LYHWGELR KMG W+ T+ME T
Sbjct: 198 VSYNHYNYENIWHGLSAVMPFVAWYKKNGCTQLPQRWVLYHWGELRFKMGLWLKTLMEAT 257

Query: 243 FGGPPKIEAFDGIS--EGQPVCFEKAVVMRHNEGGMSRQRRMETYDLMRCKARLFCNFTS 302
           F GP  IE F+ +   E  PVCFE AVVMRHNEGGMSR++RME YDL+RCKAR++CN +S
Sbjct: 258 FDGPQHIEGFEWVENDEFSPVCFETAVVMRHNEGGMSREKRMEVYDLIRCKARIYCNVSS 317

Query: 303 PKPS-VATVGMTLFMRTGARSFKNETAVVEIFGAECTKVVGCRLRVAHSNNLTFCEQVSL 362
            K + V  +GMTLFMR G RSFKNETAV+EIF  EC KV GCRL VA+SNNLT CEQV L
Sbjct: 318 EKKTTVPEIGMTLFMRMGPRSFKNETAVIEIFAKECGKVEGCRLMVAYSNNLTVCEQVKL 377

Query: 363 MGKTDILVSPHGAQLTNMFLMDRNSSVMEFFPKGWLKLAGIGQFVYQWMASWSGMRHQGA 422
           M  TDILVSPHGAQLTNMFLMDRNSSVMEFFPKGWLKLAG+GQ+V+ W+ASWSGM+HQGA
Sbjct: 378 MSSTDILVSPHGAQLTNMFLMDRNSSVMEFFPKGWLKLAGVGQYVHHWLASWSGMKHQGA 437

Query: 423 WRDPNGLTCPYNEDDRRCMSIFKGGTIGYNRTYFSEWAKNVLNEVKTRKMDE-AAQATAN 460
           WRDPNG  CPY+EDDRRCMSI+K G IG+N TYFSEWA+NVLNEVK RKM+E A + T  
Sbjct: 438 WRDPNGDHCPYSEDDRRCMSIYKSGKIGFNNTYFSEWARNVLNEVKARKMEEYALKGTFP 497

BLAST of CmoCh04G004260.1 vs. NCBI nr
Match: gi|743939586|ref|XP_011014243.1| (PREDICTED: uncharacterized protein LOC105118077 [Populus euphratica])

HSP 1 Score: 640.6 bits (1651), Expect = 2.1e-180
Identity = 308/474 (64.98%), Positives = 361/474 (76.16%), Query Frame = 1

Query: 3   KPSHTSKPHQRTTSILFSPKLFIYLLSISAILFIFFHIQSLHRHV---PP--------RP 62
           KPS   K    +   L+SPKL I+LL++   LF  +HIQSLH      PP        R 
Sbjct: 18  KPSFPKKRTTMSVFCLYSPKLSIFLLTLCVSLFTLYHIQSLHARKTSSPPWSFVHKWERV 77

Query: 63  QNNPSSSSSSAAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSP 122
            N      S A KLR+SVTFLPLKDLRY+ K L+GHTWFMSSMYD  E+GEVQ+QQFPS 
Sbjct: 78  TNCTREHESMADKLRQSVTFLPLKDLRYADKALQGHTWFMSSMYDTREEGEVQYQQFPSK 137

Query: 123 AADGDARLLCLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLS 182
           ++    RLLCLKG +THDGS N YA+AWPE LP NAT++KGL+FVSYNHYNYDNIWHGLS
Sbjct: 138 SSK--RRLLCLKGKETHDGSRNSYALAWPEALPFNATLLKGLTFVSYNHYNYDNIWHGLS 197

Query: 183 ALMPFVAWHQIQGKCEIPERWILYHWGELRLKMGTWVSTIMEVTFGGPPKIEAFDGISEG 242
           A++PFVAWH I+  CE P RWILYHWGELR +M  W+ T+   TFGG P  E+F+ +++G
Sbjct: 198 AMVPFVAWH-IRNGCESPSRWILYHWGELRFEMSPWLRTLTGATFGGAPYTESFEEVNDG 257

Query: 243 QPVCFEKAVVMRHNEGGMSRQRRMETYDLMRCKARLFCNFTSP-------KPSVATVGMT 302
           QP+CFEKAVVMRHNEGGMSR RR ETYDLMRC+AR++CN +         K  +  +GMT
Sbjct: 258 QPLCFEKAVVMRHNEGGMSRDRRTETYDLMRCRARMYCNVSLEGRIHEVNKQGLPVIGMT 317

Query: 303 LFMRTGARSFKNETAVVEIFGAECTKVVGCRLRVAHSNNLTFCEQVSLMGKTDILVSPHG 362
           LFMRTG RSF NE+AV+ IF  EC +V GCRL VA+SNNLTFCEQV +M  TDILVSPHG
Sbjct: 318 LFMRTGPRSFTNESAVIAIFEKECARVDGCRLMVAYSNNLTFCEQVKVMSLTDILVSPHG 377

Query: 363 AQLTNMFLMDRNSSVMEFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPNGLTCPYN 422
           AQLTNMFLMD+NSSVMEFFPKGWLKLAG+GQ+V+ W+ASWSGMRHQGAWRDPNG  CPY 
Sbjct: 378 AQLTNMFLMDKNSSVMEFFPKGWLKLAGVGQYVFHWIASWSGMRHQGAWRDPNGDECPYA 437

Query: 423 EDDRRCMSIFKGGTIGYNRTYFSEWAKNVLNEVKTRKMDEAAQATANHVHQCSC 459
           EDDRRCMSI+K G IG+N TYFSEWA++VLNEVK RK++EAA  T      C+C
Sbjct: 438 EDDRRCMSIYKNGKIGFNETYFSEWARDVLNEVKIRKLEEAANKTNASTSACAC 488

The following BLAST results are available for this feature:
Match Name                         E-value     Identity (%)  Description
A0A0A0KXZ9_CUCSA                   4.7e-219    78.35         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G112630 PE=4 SV=1
B9S9E8_RICCO                       4.0e-178    62.53         Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0884460 PE=4 SV=1
A0A061E286_THECC                   9.0e-178    64.38         Uncharacterized protein OS=Theobroma cacao GN=TCM_007404 PE=4 SV=1
A0A067LA02_JATCU                   1.0e-176    66.52         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_24948 PE=4 SV=1
M5VM83_PRUPE                       1.1e-175    62.74         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa1027207mg PE=4 SV=1
Match Name                         E-value     Identity (%)  Description
AT4G33600.1                        7.8e-157    59.82         unknown protein
AT4G33590.1                        3.3e-155    56.26         unknown protein
Match Name                         E-value     Identity (%)  Description
gi|778692014|ref|XP_011653390.1|   6.8e-219    78.35         PREDICTED: uncharacterized protein LOC101219216 [Cucumis sativus]
gi|659125841|ref|XP_008462883.1|   2.6e-202    80.00         PREDICTED: uncharacterized protein LOC103501161 isoform X1 [Cucumis melo]
gi|659125843|ref|XP_008462884.1|   4.1e-184    80.65         PREDICTED: uncharacterized protein LOC103501161 isoform X2 [Cucumis melo]
gi|1009151277|ref|XP_015893468.1|  6.6e-182    64.55         PREDICTED: uncharacterized protein LOC107427594 [Ziziphus jujuba]
gi|743939586|ref|XP_011014243.1|   2.1e-180    64.98         PREDICTED: uncharacterized protein LOC105118077 [Populus euphratica]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term        Definition
IPR007657   Glycosyltransferase_61
Vocabulary: Molecular Function
Term        Definition
GO:0016757  transferase activity, transferring glycosyl groups
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016757 transferase activity, transferring glycosyl groups

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Type
CmoCh04G004260  CmoCh04G004260  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name      Unique Name               Type
CmoCh04G004260.1  CmoCh04G004260.1-protein  polypeptide


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                        Unique Name                         Type
CmoCh04G004260.1.three_prime_UTR.1  CmoCh04G004260.1.three_prime_UTR.1  three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name            Unique Name             Type
CmoCh04G004260.1.CDS.4  CmoCh04G004260.1.CDS.4  CDS
CmoCh04G004260.1.CDS.3  CmoCh04G004260.1.CDS.3  CDS
CmoCh04G004260.1.CDS.2  CmoCh04G004260.1.CDS.2  CDS
CmoCh04G004260.1.CDS.1  CmoCh04G004260.1.CDS.1  CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                       Unique Name                        Type
CmoCh04G004260.1.five_prime_UTR.1  CmoCh04G004260.1.five_prime_UTR.1  five_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
CmoCh04G004260.1.exon.4  CmoCh04G004260.1.exon.4  exon
CmoCh04G004260.1.exon.3  CmoCh04G004260.1.exon.3  exon
CmoCh04G004260.1.exon.2  CmoCh04G004260.1.exon.2  exon
CmoCh04G004260.1.exon.1  CmoCh04G004260.1.exon.1  exon


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                             Source   Source Term     Source Description   Alignment
IPR007657  Glycosyltransferase AER61, uncharacterised  PFAM     PF04577         DUF563               coord: 207..390; score: 1.4
None       No IPR available                            PANTHER  PTHR20961       GLYCOSYLTRANSFERASE  coord: 25..444; score: 7.2E
None       No IPR available                            PANTHER  PTHR20961:SF33  SUBFAMILY NOT NAMED  coord: 25..444; score: 7.2E