CmaCh02G011860 (gene) Cucurbita maxima (Rimu)

Name: CmaCh02G011860
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
Location: Cma_Chr02 : 7026015 .. 7026779 (+)
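As a quick consistency check, the span implied by the location can be computed directly from the coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly; the function name `feature_length` is illustrative only.

```python
import re

def feature_length(location: str) -> int:
    """Return the length in bp of a 'chrom : start .. end (strand)' string.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    match = re.search(r"(\d+)\s*\.\.\s*(\d+)", location)
    if match is None:
        raise ValueError(f"no coordinate range found in {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1

length = feature_length("Cma_Chr02 : 7026015 .. 7026779 (+)")
print(length)      # 765
print(length % 3)  # 0
```

Under that assumption the feature spans 765 bp, a multiple of three, which is consistent with a 254-residue protein plus a stop codon.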
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCGAACGTGTTCACCCCACTGACTCCGATGCTGCTTCTGCACACTCCGGCGATGCTATCCCTCCCAAGAACCCTCCTCCCGGCACCTACGTCATCCAGCTTCCCAAGGACCAAGTCTACCGAGTTCCTCCTCCCGAAAATGCTACCCGCTTCGATCTCTACTCTCGACGGAACACCCGCCCCTCCCTCTGCCGCCGATTTCTCTGCTCCGTTCTCCTCCTCATCACCATACTCCTCCTCCTTTTCGCCATTGTTTCCGGCCTCTTCTTCTTAATCCTCCAACCTCTCTCTCCTCGCTTCTCCATTCTCGCCATCTCCACCAAGGGGATGCAGATCAAGCCGAACGCCTCAATTTCTCCCCAATTTAACGTCACTGTTCGAGCCGAAAACCCTAACAAAAAAATCGGAATCTACTACGAGAGAAACAGCAACATTAGCGTCAATTTCGGGGATGTAATGTTGTGCAAAGGCGCATTGCCGGCGTTGTATCAGCCGTGGAGAAATGTGACGGTTATGGCGGCAAAGCTTAAAGGATCTGGCATCAAATTGTCGAGCAGTGCTGTGAAAGCATTGAAGGATTCAGAGAAGCAGGGGAAAGTGAAATTGAAGGTGGATTTAAGAGCTCCGATTAAGATGAAATTATACTGGATGAAGACATGGACGATTAGGGCAAAAGTATCGTGCGAGATTTGGGTGAAAAAGGTAACGGGGGAGGCGAAGGCGACGGAGAAGAAGTGCGAGCATAGCTTGAAGCTGTGGTAG

mRNA sequence

ATGGCCGAACGTGTTCACCCCACTGACTCCGATGCTGCTTCTGCACACTCCGGCGATGCTATCCCTCCCAAGAACCCTCCTCCCGGCACCTACGTCATCCAGCTTCCCAAGGACCAAGTCTACCGAGTTCCTCCTCCCGAAAATGCTACCCGCTTCGATCTCTACTCTCGACGGAACACCCGCCCCTCCCTCTGCCGCCGATTTCTCTGCTCCGTTCTCCTCCTCATCACCATACTCCTCCTCCTTTTCGCCATTGTTTCCGGCCTCTTCTTCTTAATCCTCCAACCTCTCTCTCCTCGCTTCTCCATTCTCGCCATCTCCACCAAGGGGATGCAGATCAAGCCGAACGCCTCAATTTCTCCCCAATTTAACGTCACTGTTCGAGCCGAAAACCCTAACAAAAAAATCGGAATCTACTACGAGAGAAACAGCAACATTAGCGTCAATTTCGGGGATGTAATGTTGTGCAAAGGCGCATTGCCGGCGTTGTATCAGCCGTGGAGAAATGTGACGGTTATGGCGGCAAAGCTTAAAGGATCTGGCATCAAATTGTCGAGCAGTGCTGTGAAAGCATTGAAGGATTCAGAGAAGCAGGGGAAAGTGAAATTGAAGGTGGATTTAAGAGCTCCGATTAAGATGAAATTATACTGGATGAAGACATGGACGATTAGGGCAAAAGTATCGTGCGAGATTTGGGTGAAAAAGGTAACGGGGGAGGCGAAGGCGACGGAGAAGAAGTGCGAGCATAGCTTGAAGCTGTGGTAG

Coding sequence (CDS)

ATGGCCGAACGTGTTCACCCCACTGACTCCGATGCTGCTTCTGCACACTCCGGCGATGCTATCCCTCCCAAGAACCCTCCTCCCGGCACCTACGTCATCCAGCTTCCCAAGGACCAAGTCTACCGAGTTCCTCCTCCCGAAAATGCTACCCGCTTCGATCTCTACTCTCGACGGAACACCCGCCCCTCCCTCTGCCGCCGATTTCTCTGCTCCGTTCTCCTCCTCATCACCATACTCCTCCTCCTTTTCGCCATTGTTTCCGGCCTCTTCTTCTTAATCCTCCAACCTCTCTCTCCTCGCTTCTCCATTCTCGCCATCTCCACCAAGGGGATGCAGATCAAGCCGAACGCCTCAATTTCTCCCCAATTTAACGTCACTGTTCGAGCCGAAAACCCTAACAAAAAAATCGGAATCTACTACGAGAGAAACAGCAACATTAGCGTCAATTTCGGGGATGTAATGTTGTGCAAAGGCGCATTGCCGGCGTTGTATCAGCCGTGGAGAAATGTGACGGTTATGGCGGCAAAGCTTAAAGGATCTGGCATCAAATTGTCGAGCAGTGCTGTGAAAGCATTGAAGGATTCAGAGAAGCAGGGGAAAGTGAAATTGAAGGTGGATTTAAGAGCTCCGATTAAGATGAAATTATACTGGATGAAGACATGGACGATTAGGGCAAAAGTATCGTGCGAGATTTGGGTGAAAAAGGTAACGGGGGAGGCGAAGGCGACGGAGAAGAAGTGCGAGCATAGCTTGAAGCTGTGGTAG

Protein sequence

MAERVHPTDSDAASAHSGDAIPPKNPPPGTYVIQLPKDQVYRVPPPENATRFDLYSRRNTRPSLCRRFLCSVLLLITILLLLFAIVSGLFFLILQPLSPRFSILAISTKGMQIKPNASISPQFNVTVRAENPNKKIGIYYERNSNISVNFGDVMLCKGALPALYQPWRNVTVMAAKLKGSGIKLSSSAVKALKDSEKQGKVKLKVDLRAPIKMKLYWMKTWTIRAKVSCEIWVKKVTGEAKATEKKCEHSLKLW
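The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) applies; the function name `translate` is illustrative, and only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_start = "ATGGCCGAACGTGTTCACCCCACTGACTCC"  # first 30 nt of the CDS
cds_end = "AAGCTGTGGTAG"                       # last 12 nt of the CDS
print(translate(cds_start))  # MAERVHPTDS
print(translate(cds_end))    # KLW
```

Both fragments agree with the start (MAERVHPTDS) and end (KLW) of the listed protein sequence.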
BLAST of CmaCh02G011860 vs. TrEMBL
Match: A0A0A0L0D0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G269730 PE=4 SV=1)

HSP 1 Score: 298.1 bits (762), Expect = 1.0e-77
Identity = 167/258 (64.73%), Positives = 201/258 (77.91%), Query Frame = 1

Query: 1   MAERVHPT-DSDAASAHSGDAIPPKNPPPGTYVIQLPKDQVYRVPPPENATRFDLYSRRN 60
           MA+RVHPT + D+AS +     P K+PP  TYVIQ+PKDQVYR+PPPENA RF+LY+R +
Sbjct: 1   MADRVHPTLNPDSASNN-----PLKSPPSATYVIQIPKDQVYRIPPPENAARFNLYTRHH 60

Query: 61  TRPSLCRRFLCSVLLLITILLLLFAIVSGLFFLILQPLSPRFSILAISTKGMQIKPNA-S 120
            RPS CRRFLC +LLL    LLL AI S L FLILQP  PRFSILA+S    +IKPN  S
Sbjct: 61  HRPSPCRRFLCFILLL----LLLSAITSALVFLILQPDLPRFSILAVSIS--RIKPNTTS 120

Query: 121 ISPQFNVTVRAENPNKKIGIYYERNSNISVNFGDVMLCKGALPALYQPWRNVTVMAAKLK 180
            SPQFNVT+RAEN NK IGIYYE+NS +S+N  DVMLC+GALP LYQP RNVTVM  K+K
Sbjct: 121 FSPQFNVTIRAENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVKVK 180

Query: 181 GSGIKLSSSAVKALKDSEKQGK-VKLKVDLRAPIKMKLYWMK-TWTIRAKVSCEIWVKKV 240
           GSGI+LSSS  KA +D EK+GK +++KVD+R P+KMKLYWM+  W IRAKV+C+I VKK 
Sbjct: 181 GSGIRLSSSTGKAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKILVKKE 240

Query: 241 TGEAKATEKKCEHSLKLW 255
            G+ K  E+KC+HS+KLW
Sbjct: 241 MGKTKVMEEKCDHSMKLW 247
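The percentages in an HSP statistics line are the match counts divided by the alignment length (258 columns here, gaps included). The sketch below recomputes them from the raw counts; the function name `hsp_percentages` and the returned dictionary keys are illustrative choices, not part of any BLAST tool.

```python
import re

def hsp_percentages(stats_line: str) -> dict:
    """Recompute the percentages from the raw 'matches/length' counts in an HSP line."""
    # Accept both the "Positives" spelling and the "Postives" variant seen in some reports.
    pattern = r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)"
    result = {}
    for label, matches, length in re.findall(pattern, stats_line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matches) / int(length), 2)
    return result

line = "Identity = 167/258 (64.73%), Positives = 201/258 (77.91%), Query Frame = 1"
print(hsp_percentages(line))  # {'identity': 64.73, 'positives': 77.91}
```

The recomputed values match the percentages reported for the Cucumis sativus hit.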

BLAST of CmaCh02G011860 vs. TrEMBL
Match: M5W0N4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010043mg PE=4 SV=1)

HSP 1 Score: 238.0 bits (606), Expect = 1.2e-59
Identity = 126/266 (47.37%), Positives = 179/266 (67.29%), Query Frame = 1

Query: 1   MAERVHPTDS----DAASAHSGDAIPPKN----PPPGTYVIQLPKDQVYRVPPPENATRF 60
           MA+RVHP DS    +          PP +    PPPGTYVIQ+PKDQVYRVPPPENA+R+
Sbjct: 1   MADRVHPRDSPFHTETTPLSLSRPSPPDSEKPVPPPGTYVIQIPKDQVYRVPPPENASRY 60

Query: 61  DLYSRRNTRPSLCRRFLCSVLLLITILLLLFAIVSGLFFLILQPLSPRFSILAISTKGMQ 120
             Y+ R TR S C    C  L L+  ++ L A  +G+F+L+++P +P +S+ +I+ KG  
Sbjct: 61  QSYTHRKTRRSSCHCCCCWFLGLLAAIVFLSAAAAGIFYLVVRPEAPNYSVESIAFKGFN 120

Query: 121 I----KPNASISPQFNVTVRAENPNKKIGIYYERNSNISVNFGDVMLCKGALPALYQPWR 180
           +     P ++ISP+ +VTVRA+NPNKKIGIYYER S++ + + D+ LC G LPA YQP +
Sbjct: 121 LTTTSSPPSAISPEIHVTVRAQNPNKKIGIYYERESSVKLFYSDIKLCDGVLPAFYQPSK 180

Query: 181 NVTVMAAKLKGSGIKLSSSAVKALKDSEKQGKVKLKVDLRAPIKMKLYWMKTWTIRAKVS 240
           NVT     L GSGI+L+S+  K L D++KQGKV L++DLRAP+++K+  +KTWTI  KV+
Sbjct: 181 NVTEFRTALTGSGIELTSAVQKGLVDAQKQGKVPLELDLRAPVRIKVGPIKTWTITVKVA 240

Query: 241 CEIWVKKVTGEAKATEKKCEHSLKLW 255
           C + V K+T +A    + C++S+  W
Sbjct: 241 CHLTVNKLTADANIVSRDCDYSVDPW 266

BLAST of CmaCh02G011860 vs. TrEMBL
Match: A0A061GWS9_THECC (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family, putative isoform 1 OS=Theobroma cacao GN=TCM_042018 PE=4 SV=1)

HSP 1 Score: 231.1 bits (588), Expect = 1.5e-57
Identity = 121/260 (46.54%), Positives = 174/260 (66.92%), Query Frame = 1

Query: 1   MAERVHPTDS-DAASAHSGDAIPPKNPP-----PGTYVIQLPKDQVYRVPPPENATRFDL 60
           MAE+VHP DS +  +  S   + P +P      PGTYVIQ+PKDQ+YRVPPPENA R+  
Sbjct: 1   MAEQVHPGDSPNVTNEQSAPKLAPPSPEKPVPQPGTYVIQIPKDQIYRVPPPENARRYAH 60

Query: 61  YSRRNTRPSLCRRFLCSVLLLITILLLLFAIVSGLFFLILQPLSPRFSILAISTKGMQIK 120
            S+R      CR   C +L +I +LLL  AI + + + + +P SP +S+ +++ KG+ + 
Sbjct: 61  LSKRKASGGTCRSCCCCLLTVILVLLLSAAIAAAVVYFVFKPESPNYSVESVAIKGLNLT 120

Query: 121 PNASISPQFNVTVRAENPNKKIGIYYERNSNISVNFGDVMLCKGALPALYQPWRNVTVMA 180
             + +SP+F+VTVRA NPN KIGIYYE+ S++ V + DV LC GALPA YQP  NVTV  
Sbjct: 121 SASPLSPEFDVTVRAHNPNDKIGIYYEKGSSVKVYYEDVNLCNGALPAFYQPTNNVTVFQ 180

Query: 181 AKLKGSGIKLSSSAVKALKDSEKQGKVKLKVDLRAPIKMKLYWMKTWTIRAKVSCEIWVK 240
             LKGSGI+L+++A++AL D++ +G V   + LRAP+K+K+  +KTW I AKV+C+I V 
Sbjct: 181 TALKGSGIELTNTALRALSDAQNKGTVPFTLKLRAPVKIKVGSIKTWKITAKVTCKITVD 240

Query: 241 KVTGEAKATEKKCEHSLKLW 255
            +T  +K   K C++ + LW
Sbjct: 241 NLTATSKIVSKDCDYGVDLW 260

BLAST of CmaCh02G011860 vs. TrEMBL
Match: W9R547_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_022546 PE=4 SV=1)

HSP 1 Score: 230.3 bits (586), Expect = 2.6e-57
Identity = 129/267 (48.31%), Positives = 178/267 (66.67%), Query Frame = 1

Query: 1   MAERVHPT-DSDAASA-------HSGDAIPPKN----PPPGTYVIQLPKDQVYRVPPPEN 60
           MA+RV+P+ D + AS+       H    IPP      PPPGTYVIQ+PKDQVYR+PPPEN
Sbjct: 1   MADRVYPSGDQETASSTPQNSSEHPLRPIPPSPEKPVPPPGTYVIQIPKDQVYRIPPPEN 60

Query: 61  ATRFDLYSRRNTRPSLCRRFLCSVLLLITILLLLFAIVSGLFFLILQPLSPRFSILAIST 120
           A RF  YSRR +R S C   LCS+L  I +L +L  I + + +L+ +P SP +SI  I+ 
Sbjct: 61  ARRFQNYSRRKSRRSKCGLCLCSLLGSIVVLAILAGISAAVLYLVFRPESPNYSIDDIAI 120

Query: 121 KGMQIKPN-ASISPQFNVTVRAENPNKKIGIYYERNSNISVNFGDVMLCKGALPALYQPW 180
           KG+    + A ISP  +V +RA+NPN  IGI+Y ++S+++V + DV LC GA+PA +QP 
Sbjct: 121 KGINTTASSAEISPAIDVVLRAKNPNDNIGIFYGKDSSVTVYYSDVELCNGAMPAFHQPS 180

Query: 181 RNVTVMAAKLKGSGIKLSSSAVKALKDSEKQGKVKLKVDLRAPIKMKLYWMKTWTIRAKV 240
            NVTV    L G GI+L++S  KAL+DSEK+GKV LK++++AP+K K+  +KTWT+  K 
Sbjct: 181 NNVTVFKTALTGPGIELTASMKKALRDSEKKGKVPLKLNMKAPVKFKVGSVKTWTLAVKF 240

Query: 241 SCEIWVKKVTGEAKATEKKCEHSLKLW 255
            C++ V K+T EAK   K CE+  K W
Sbjct: 241 RCDVTVDKLTAEAKIVSKDCEYGWKFW 267

BLAST of CmaCh02G011860 vs. TrEMBL
Match: A0A067LFL5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23841 PE=4 SV=1)

HSP 1 Score: 229.6 bits (584), Expect = 4.4e-57
Identity = 128/271 (47.23%), Positives = 174/271 (64.21%), Query Frame = 1

Query: 1   MAERVHPTDSDAASAH---SGDAIP-------------PKNPPP-GTYVIQLPKDQVYRV 60
           MAERVHP DS  +S     +  A P             P +PPP GTYVIQ+PKDQVYRV
Sbjct: 1   MAERVHPRDSPPSSTELKKTPSASPDRPLKPAAPLLEKPVSPPPAGTYVIQIPKDQVYRV 60

Query: 61  PPPENATRFDLYSRRNTRPSLCRRFLCSVLLLITILLLLFAIVSGLFFLILQPLSPRFSI 120
           PPPENA R+   SRR  R S C   LC    L+   +LL AI +G+ +L+ +P +P++SI
Sbjct: 61  PPPENAKRYQQLSRRKHRRSTCCCCLCWFFGLLFTFILLAAIAAGVLYLVFRPEAPKYSI 120

Query: 121 LAISTKGMQIKPNASISPQFNVTVRAENPNKKIGIYYERNSNISVNFGDVMLCKGALPAL 180
            ++S KG  +  +A +SP+F+V VRA NPN KIGIYY   S+++V + DV LC G LP  
Sbjct: 121 ESVSIKGFNLTSSAPLSPEFDVAVRAHNPNNKIGIYYRTGSSVNVYYNDVRLCNGKLPVF 180

Query: 181 YQPWRNVTVMAAKLKGSGIKLSSSAVKALKDSEKQGKVKLKVDLRAPIKMKLYWMKTWTI 240
           YQ   NVTV  A LKGSGI+L+S+  KAL   E +G V  K++LRAP+++K+  +KTWTI
Sbjct: 181 YQGTNNVTVFVASLKGSGIELTSAVHKALISGETKGAVPFKLNLRAPVRIKVGSLKTWTI 240

Query: 241 RAKVSCEIWVKKVTGEAKATEKKCEHSLKLW 255
             KV C++ V K+T ++K   K C++ ++LW
Sbjct: 241 TVKVDCDVTVDKLTSKSKLVSKHCDYGVELW 271

BLAST of CmaCh02G011860 vs. TAIR10
Match: AT2G27080.1 (AT2G27080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 208.4 bits (529), Expect = 5.3e-54
Identity = 111/260 (42.69%), Positives = 160/260 (61.54%), Query Frame = 1

Query: 1   MAERVHPTDSDAASAH------SGDAIPPKNPPPGTYVIQLPKDQVYRVPPPENATRFDL 60
           MAERV+P DS   S        SG+      PPP TYVIQ+PKDQ+YR+PPPENA RF+ 
Sbjct: 1   MAERVYPADSPPQSGQFSGNFSSGEFPKKPAPPPSTYVIQVPKDQIYRIPPPENAHRFEQ 60

Query: 61  YSRRNTRPSLCRRFLCSVLLLITILLLLFAIVSGLFFLILQPLSPRFSILAISTKGMQIK 120
            SR+ T  S CR   CS L  + IL++L  I   + +LI +P +P++SI   S  G+ + 
Sbjct: 61  LSRKKTNRSNCRCCFCSFLAAVFILIVLAGISFAVLYLIYRPEAPKYSIEGFSVSGINLN 120

Query: 121 PNASISPQFNVTVRAENPNKKIGIYYERNSNISVNFGDVMLCKGALPALYQPWRNVTVMA 180
             + ISP FNVTVR+ N N KIG+YYE+ S++ V + DV +  G +P  YQP +NVTV+ 
Sbjct: 121 STSPISPSFNVTVRSRNGNGKIGVYYEKESSVDVYYNDVDISNGVMPVFYQPAKNVTVVK 180

Query: 181 AKLKGSGIKLSSSAVKALKDSEKQGKVKLKVDLRAPIKMKLYWMKTWTIRAKVSCEIWVK 240
             L GS I+L+S   K +++   +  V  K+ ++AP+K+K   +KTWT+   V C++ V 
Sbjct: 181 LVLSGSKIQLTSGMRKEMRNEVSKKTVPFKLKIKAPVKIKFGSVKTWTMIVNVDCDVTVD 240

Query: 241 KVTGEAKATEKKCEHSLKLW 255
           K+T  ++   +KC H + LW
Sbjct: 241 KLTAPSRIVSRKCSHDVDLW 260

BLAST of CmaCh02G011860 vs. TAIR10
Match: AT5G21130.1 (AT5G21130.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 188.0 bits (476), Expect = 7.4e-48
Identity = 106/252 (42.06%), Positives = 153/252 (60.71%), Query Frame = 1

Query: 6   HP---TDSDAASAHSGDAIPPK-NPPPGTYVIQLPKDQVYRVPPPENATRFDLYSRRNTR 65
           HP   T+  ++S +S D+   +  PPPGTYVI+LPKDQ+YRVPPPENA R++  SRR T 
Sbjct: 29  HPSLDTNDSSSSRYSVDSQKSRIGPPPGTYVIKLPKDQIYRVPPPENAHRYEYLSRRKTN 88

Query: 66  PSLCRRFLCSVLLLITILLLLFAIVSGLFFLILQPLSPRFSILAISTKGMQIKPNASISP 125
            S CRR LC  L  + I+++L AI  G F+L+ QP  P+FS+  +S  G+ +  ++  SP
Sbjct: 89  KSCCRRCLCYSLSALLIIIVLAAIAFGFFYLVYQPHKPQFSVSGVSVTGINLTSSSPFSP 148

Query: 126 QFNVTVRAENPNKKIGIYYERNSNISVNFGDVMLCKGALPALYQPWRNVTVMAAKLKGSG 185
              + +R++N   K+G+ YE+ +   V F    L  G   A  QP  NVTV+   LKGS 
Sbjct: 149 VIRIKLRSQNVKGKLGLIYEKGNEADVFFNGTKLGNGEFTAFKQPAGNVTVIVTVLKGSS 208

Query: 186 IKLSSSAVKALKDSEKQGKVKLKVDLRAPIKMKLYWMKTWTIRAKVSCEIWVKKVTGEAK 245
           +KL SS+ K L +S+K+GKV   + ++AP+K K+  + TWT+   V C+I V K+T  A 
Sbjct: 209 VKLKSSSRKELTESQKKGKVPFGLRIKAPVKFKVGSVTTWTMTITVDCKITVDKLTASAT 268

Query: 246 ATEKKCEHSLKL 254
              + CE  L L
Sbjct: 269 VKTENCETGLSL 280

BLAST of CmaCh02G011860 vs. TAIR10
Match: AT1G54540.1 (AT1G54540.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 94.7 bits (234), Expect = 8.5e-20
Identity = 70/227 (30.84%), Positives = 116/227 (51.10%), Query Frame = 1

Query: 26  PPPGTYVIQLPKDQVYRVPPPENATRFDLYSRRNTRPSLCRRFLCSVLLLITILLLLFAI 85
           P PG  V+ LP  +   +PPP   ++      RN    +C +  C VL L+ I L+  AI
Sbjct: 23  PAPGKTVL-LPVQRP--IPPPVIPSK-----NRN----MCCKIFCWVLSLLVIALIALAI 82

Query: 86  VSGLFFLILQPLSPRFSILAISTKGMQIKPNASISPQFNVTVRAENPNKKIGIYYERNSN 145
              + + +  P  P + + ++    + I  + S+S +F V + A NPN+KIGIYYE+  +
Sbjct: 83  AVAVVYFVFHPKLPSYEVNSLRVTNLGINLDLSLSAEFKVEITARNPNEKIGIYYEKGGH 142

Query: 146 ISVNFGDVMLCKGALPALYQPWRNVTVMAAKLKGSGIKLSSSAVKALKDSEKQGKVKLKV 205
           I V +    LC+G +P  YQ  RNVT +   L G   +  ++ + AL+  ++ G+V L +
Sbjct: 143 IGVWYDKTKLCEGPIPRFYQGHRNVTKLNVALTGRA-QYGNTVLAALQQQQQTGRVPLDL 202

Query: 206 DLRAPIKMKLYWMKTWTIRAKVSCEIWVKKVTGEAKATEKKCEHSLK 253
            + AP+ +KL  +K   IR   SC++ V  ++       K  + S K
Sbjct: 203 KVNAPVAIKLGNLKMKKIRILGSCKLVVDSLSTNNNINIKASDCSFK 236

BLAST of CmaCh02G011860 vs. TAIR10
Match: AT5G36970.1 (AT5G36970.1 NDR1/HIN1-like 25)

HSP 1 Score: 94.7 bits (234), Expect = 8.5e-20
Identity = 63/253 (24.90%), Positives = 119/253 (47.04%), Query Frame = 1

Query: 3   ERVHPTDSDAASAHSGDAIPPKNPPPGTYVIQLPKDQVYRVPPPENATRFDLYSRRNTRP 62
           +++HP     A  H    + P+      +       Q   + PP          ++ +R 
Sbjct: 5   QKIHPVSDPEAPPHPTAPLVPRGSSRSEHGDPTKTQQAAPLDPPRE--------KKGSRS 64

Query: 63  SLCRRFLCSVLLLITILLLLFAIVSGLFFLILQPLSPRFSILAISTKGMQIKPNASISPQ 122
             CR  +C  LL++ +L+++   + G+ +L+ +P  P ++I  +     Q+  + S+S  
Sbjct: 65  CWCR-CVCYTLLVLFLLIVIVGAIVGILYLVFRPKFPDYNIDRLQLTRFQLNQDLSLSTA 124

Query: 123 FNVTVRAENPNKKIGIYYERNSNISVNFGDVMLCKGALPALYQPWRNVTVMAAKLKGSGI 182
           FNVT+ A+NPN+KIGIYYE  S ISV +    +  G+LP  YQ   N T++  ++ G   
Sbjct: 125 FNVTITAKNPNEKIGIYYEDGSKISVLYMQTRISNGSLPKFYQGHENTTIILVEMTGFTQ 184

Query: 183 KLSSSAVKALKDSEKQGKVKLKVDLRAPIKMKLYWMKTWTIRAKVSCEIWVKKVTGEA-- 242
             +S      +     G + L++ +  P+++KL  +K   +R  V C + V  +   +  
Sbjct: 185 NATSLMTTLQEQQRLTGSIPLRIRVTQPVRIKLGKLKLMKVRFLVRCGVSVDSLAANSVI 244

Query: 243 KATEKKCEHSLKL 254
           +     C++  +L
Sbjct: 245 RVRSSNCKYRFRL 248

BLAST of CmaCh02G011860 vs. TAIR10
Match: AT1G65690.1 (AT1G65690.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 89.7 bits (221), Expect = 2.7e-18
Identity = 65/254 (25.59%), Positives = 120/254 (47.24%), Query Frame = 1

Query: 3   ERVHPT-DSDAASAHSGDAIPPKNPPPGTYVIQLPKDQVYRVPPPENATRFDLYSRRNTR 62
           ++++P  D +AA+A     + P+      +          +VP  +   RF   +    R
Sbjct: 5   QKIYPVQDPEAATARPTAPLVPRGSSRSEH------GDPSKVPLNQRPQRFVPLAPPKKR 64

Query: 63  PSLCRRFLCSVLLLITILLLLFAIVSGLFFLILQPLSPRFSILAISTKGMQIKPNASISP 122
            S C R  C     + +L++      G+ +L+ +P  P +SI  +      +  ++S++ 
Sbjct: 65  RSCCCRCFCYTFCFLLLLVVAVGASIGILYLVFKPKLPDYSIDRLQLTRFALNQDSSLTT 124

Query: 123 QFNVTVRAENPNKKIGIYYERNSNISVNFGDVMLCKGALPALYQPWRNVTVMAAKLKGSG 182
            FNVT+ A+NPN+KIGIYYE  S I+V + +  L  G+LP  YQ   N TV+  ++ G  
Sbjct: 125 AFNVTITAKNPNEKIGIYYEDGSKITVWYMEHQLSNGSLPKFYQGHENTTVIYVEMTGQT 184

Query: 183 IKLSSSAVKALKDSEKQGKVKLKVDLRAPIKMKLYWMKTWTIRAKVSCEIWVKKV--TGE 242
              S       +  ++ G + L++ +  P+++K   +K + +R  V C ++V  +     
Sbjct: 185 QNASGLRTTLEEQQQRTGNIPLRIRVNQPVRVKFGKLKLFEVRFLVRCGVFVDSLATNNV 244

Query: 243 AKATEKKCEHSLKL 254
            K     C+  L+L
Sbjct: 245 IKIQSSSCKFRLRL 252

BLAST of CmaCh02G011860 vs. NCBI nr
Match: gi|659097879|ref|XP_008449864.1| (PREDICTED: uncharacterized protein LOC103491613 [Cucumis melo])

HSP 1 Score: 308.5 bits (789), Expect = 1.1e-80
Identity = 171/257 (66.54%), Positives = 202/257 (78.60%), Query Frame = 1

Query: 1   MAERVHPT-DSDAASAHSGDAIPPKNPPPGTYVIQLPKDQVYRVPPPENATRFDLYSRRN 60
           MA+RVHPT + D+AS +      PK PP  TYVIQ+PKDQVYR+PPPENA RF+LY+R N
Sbjct: 1   MADRVHPTVNPDSASNN------PKGPPSATYVIQIPKDQVYRIPPPENAARFNLYTRHN 60

Query: 61  TRPSLCRRFLCSVLLLITILLLLFAIVSGLFFLILQPLSPRFSILAISTKGMQIKPNA-S 120
            RPS CRRFLC +LLL    L L AI S LFFLILQP  PR+SILA+S    +IK N  S
Sbjct: 61  QRPSSCRRFLCFILLL----LFLSAITSALFFLILQPDLPRYSILAVSIS--RIKSNTTS 120

Query: 121 ISPQFNVTVRAENPNKKIGIYYERNSNISVNFGDVMLCKGALPALYQPWRNVTVMAAKLK 180
           ISPQ NVT+RAEN NKKIGIYYE+NS +SV+  DVMLC+GALP LYQP  NVTV+A K+K
Sbjct: 121 ISPQLNVTIRAENHNKKIGIYYEKNSIVSVSLSDVMLCEGALPLLYQPPSNVTVIAVKMK 180

Query: 181 GSGIKLSSSAVKALKDSEKQGKVKLKVDLRAPIKMKLYWMKT-WTIRAKVSCEIWVKKVT 240
           GSGI+LSSS  KALKD EK+G+V+LKVD+RAPI+MKLYWMK  W IR KV+C+I VKKV 
Sbjct: 181 GSGIRLSSSTGKALKDWEKEGRVRLKVDVRAPIEMKLYWMKVRWRIRGKVTCKILVKKVM 240

Query: 241 GEAKATEKKCEHSLKLW 255
           G+ K  E+KC+HS+KLW
Sbjct: 241 GKTKVMEEKCDHSMKLW 245

BLAST of CmaCh02G011860 vs. NCBI nr
Match: gi|449463891|ref|XP_004149664.1| (PREDICTED: uncharacterized protein LOC101202799 [Cucumis sativus])

HSP 1 Score: 298.1 bits (762), Expect = 1.4e-77
Identity = 167/258 (64.73%), Positives = 201/258 (77.91%), Query Frame = 1

Query: 1   MAERVHPT-DSDAASAHSGDAIPPKNPPPGTYVIQLPKDQVYRVPPPENATRFDLYSRRN 60
           MA+RVHPT + D+AS +     P K+PP  TYVIQ+PKDQVYR+PPPENA RF+LY+R +
Sbjct: 1   MADRVHPTLNPDSASNN-----PLKSPPSATYVIQIPKDQVYRIPPPENAARFNLYTRHH 60

Query: 61  TRPSLCRRFLCSVLLLITILLLLFAIVSGLFFLILQPLSPRFSILAISTKGMQIKPNA-S 120
            RPS CRRFLC +LLL    LLL AI S L FLILQP  PRFSILA+S    +IKPN  S
Sbjct: 61  HRPSPCRRFLCFILLL----LLLSAITSALVFLILQPDLPRFSILAVSIS--RIKPNTTS 120

Query: 121 ISPQFNVTVRAENPNKKIGIYYERNSNISVNFGDVMLCKGALPALYQPWRNVTVMAAKLK 180
            SPQFNVT+RAEN NK IGIYYE+NS +S+N  DVMLC+GALP LYQP RNVTVM  K+K
Sbjct: 121 FSPQFNVTIRAENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVKVK 180

Query: 181 GSGIKLSSSAVKALKDSEKQGK-VKLKVDLRAPIKMKLYWMK-TWTIRAKVSCEIWVKKV 240
           GSGI+LSSS  KA +D EK+GK +++KVD+R P+KMKLYWM+  W IRAKV+C+I VKK 
Sbjct: 181 GSGIRLSSSTGKAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKILVKKE 240

Query: 241 TGEAKATEKKCEHSLKLW 255
            G+ K  E+KC+HS+KLW
Sbjct: 241 MGKTKVMEEKCDHSMKLW 247

BLAST of CmaCh02G011860 vs. NCBI nr
Match: gi|658047026|ref|XP_008359184.1| (PREDICTED: protein NDR1-like [Malus domestica])

HSP 1 Score: 241.1 bits (614), Expect = 2.1e-60
Identity = 131/272 (48.16%), Positives = 184/272 (67.65%), Query Frame = 1

Query: 1   MAERVHPTDSDAASA-----HSGDA------IPPKN----PPPGTYVIQLPKDQVYRVPP 60
           MA+RVHP  S + +      H+          PP++    PPPGTYVIQ+PKDQVYRVPP
Sbjct: 1   MADRVHPRGSPSHNETTQFPHTSSPQSTRTPSPPESEKLVPPPGTYVIQIPKDQVYRVPP 60

Query: 61  PENATRFDLYSRRNTRPSLCRRFLCSVLLLITILLLLFAIVSGLFFLILQPLSPRFSILA 120
           PENATR+  Y R+N R S CR   C +  L+  L+LL A  +G+F+L+++P SP +SI +
Sbjct: 61  PENATRYKNYXRQNPRRSSCRCCFCWLFGLVAALILLSAAAAGIFYLVVRPESPNYSIES 120

Query: 121 ISTKGMQI---KPNASISPQFNVTVRAENPNKKIGIYYERNSNISVNFGDVMLCKGALPA 180
           I+ KG  +    P+++ISP+  VTVR +NPNKKIGIYY + +++ + + D+ LC GALPA
Sbjct: 121 IAFKGFHLTTPSPSSTISPEIQVTVRVQNPNKKIGIYYGKKNSVKLFYSDIKLCDGALPA 180

Query: 181 LYQPWRNVTVMAAKLKGSGIKLSSSAVKALKDSEKQGKVKLKVDLRAPIKMKLYWMKTWT 240
            YQP +NVT     LKGSGIKL+S+  + L D+++QGKV LK+DLR P+++K+  +KTWT
Sbjct: 181 FYQPSKNVTEFRTALKGSGIKLTSTVQQGLVDAQRQGKVPLKLDLRMPVRIKVGAIKTWT 240

Query: 241 IRAKVSCEIWVKKVTGEAKATEKKCEHSLKLW 255
           I  KV C++ V K+T EAK   K C++S+  W
Sbjct: 241 ITVKVGCDLTVDKLTTEAKIVSKDCDYSVDPW 272

BLAST of CmaCh02G011860 vs. NCBI nr
Match: gi|658061062|ref|XP_008366385.1| (PREDICTED: protein NDR1-like [Malus domestica])

HSP 1 Score: 240.0 bits (611), Expect = 4.6e-60
Identity = 131/274 (47.81%), Positives = 182/274 (66.42%), Query Frame = 1

Query: 1   MAERVHPTD-----SDAASAHSGDAIPPKNP----------PPGTYVIQLPKDQVYRVPP 60
           MA+RVHP D       A   ++     P+ P          PPGTYVIQ+PKDQVYRVPP
Sbjct: 1   MADRVHPRDPPLHNETAPFPYTXSPESPRKPSPPESEKPVPPPGTYVIQIPKDQVYRVPP 60

Query: 61  PENATRFDLYSRRNTRPSLCRRFLCSVLLLITILLLLFAIVSGLFFLILQPLSPRFSILA 120
           PENATR+  Y+R+N R S CR   C +  L+  L+ L A  +G+F+L+++P SP +SI +
Sbjct: 61  PENATRYKNYTRQNPRRSSCRCCFCWLFGLVAALIFLSAAAAGIFYLVVRPKSPNYSIDS 120

Query: 121 ISTKGMQI-----KPNASISPQFNVTVRAENPNKKIGIYYERNSNISVNFGDVMLCKGAL 180
           I+ +G  +      P+ +ISP+  VTVRA+NPNKKIGIYY + S++ + + DV LC GAL
Sbjct: 121 IAFRGFNLTAPSPSPSYAISPEIQVTVRAQNPNKKIGIYYGKKSSVXLFYSDVKLCDGAL 180

Query: 181 PALYQPWRNVTVMAAKLKGSGIKLSSSAVKALKDSEKQGKVKLKVDLRAPIKMKLYWMKT 240
           PA YQP +NVTV    LKGSGI+L+S+A + L D++KQGKV L +D+R P+++K+  +KT
Sbjct: 181 PAFYQPLKNVTVFQTALKGSGIELTSAAQQGLVDAQKQGKVPLGLDIRMPVRIKVGPIKT 240

Query: 241 WTIRAKVSCEIWVKKVTGEAKATEKKCEHSLKLW 255
           WTI  KV C++ V K+T EAK   K C++S+  W
Sbjct: 241 WTINVKVGCDLTVDKLTTEAKIVSKDCDYSVDPW 274

BLAST of CmaCh02G011860 vs. NCBI nr
Match: gi|694450781|ref|XP_009350715.1| (PREDICTED: protein YLS9-like [Pyrus x bretschneideri])

HSP 1 Score: 238.8 bits (608), Expect = 1.0e-59
Identity = 132/272 (48.53%), Positives = 182/272 (66.91%), Query Frame = 1

Query: 1   MAERVHPTDSD-----------AASAHSGDAIPPKN----PPPGTYVIQLPKDQVYRVPP 60
           MA+RVHP DS            ++S       PP++    P PGTYVIQ+PKDQVYRVPP
Sbjct: 1   MADRVHPRDSPLHNETAPFPYTSSSESPRKPSPPESEKPVPSPGTYVIQIPKDQVYRVPP 60

Query: 61  PENATRFDLYSRRNTRPSLCRRFLCSVLLLITILLLLFAIVSGLFFLILQPLSPRFSILA 120
           PENATR+  Y+R+N R S CR   C    L+  L+LL A  +G+F+L+++P SP +SI +
Sbjct: 61  PENATRYKNYTRQNPRRSSCRCCFCWFFGLVAALILLSAAAAGIFYLVVRPKSPNYSIDS 120

Query: 121 ISTKGMQI---KPNASISPQFNVTVRAENPNKKIGIYYERNSNISVNFGDVMLCKGALPA 180
           I+ +G  +    P+ +ISP+  VTVRA+NPNKKIGIYY + S++ + + D  LC GALPA
Sbjct: 121 IAFRGFNLTAPSPSYAISPEIQVTVRAQNPNKKIGIYYGKKSSVKLFYSDGKLCDGALPA 180

Query: 181 LYQPWRNVTVMAAKLKGSGIKLSSSAVKALKDSEKQGKVKLKVDLRAPIKMKLYWMKTWT 240
            YQP  NVTV    LKGSGI+L+S+A + L D++KQGKV L +D+R P+++K+  +KTWT
Sbjct: 181 FYQPSSNVTVFRTALKGSGIELTSTAQQGLVDAQKQGKVPLGLDIRMPVRIKVGPIKTWT 240

Query: 241 IRAKVSCEIWVKKVTGEAKATEKKCEHSLKLW 255
           I  KV C++ V K+T EAK   K C++S+  W
Sbjct: 241 INVKVGCDLTVDKLTTEAKIVSKDCDYSVDPW 272

The following BLAST results are available for this feature:
vs. TrEMBL
Match Name          E-value   Identity (%)   Description
A0A0A0L0D0_CUCSA    1.0e-77   64.73          Uncharacterized protein OS=Cucumis sativus GN=Csa_4G269730 PE=4 SV=1
M5W0N4_PRUPE        1.2e-59   47.37          Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010043mg PE=4 SV=1
A0A061GWS9_THECC    1.5e-57   46.54          Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family, putat...
W9R547_9ROSA        2.6e-57   48.31          Uncharacterized protein OS=Morus notabilis GN=L484_022546 PE=4 SV=1
A0A067LFL5_JATCU    4.4e-57   47.23          Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23841 PE=4 SV=1

vs. TAIR10
Match Name          E-value   Identity (%)   Description
AT2G27080.1         5.3e-54   42.69          Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT5G21130.1         7.4e-48   42.06          Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT1G54540.1         8.5e-20   30.84          Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT5G36970.1         8.5e-20   24.90          NDR1/HIN1-like 25
AT1G65690.1         2.7e-18   25.59          Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...

vs. NCBI nr
Match Name                          E-value   Identity (%)   Description
gi|659097879|ref|XP_008449864.1|    1.1e-80   66.54          PREDICTED: uncharacterized protein LOC103491613 [Cucumis melo]
gi|449463891|ref|XP_004149664.1|    1.4e-77   64.73          PREDICTED: uncharacterized protein LOC101202799 [Cucumis sativus]
gi|658047026|ref|XP_008359184.1|    2.1e-60   48.16          PREDICTED: protein NDR1-like [Malus domestica]
gi|658061062|ref|XP_008366385.1|    4.6e-60   47.81          PREDICTED: protein NDR1-like [Malus domestica]
gi|694450781|ref|XP_009350715.1|    1.0e-59   48.53          PREDICTED: protein YLS9-like [Pyrus x bretschneideri]
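A summary like the one above is often filtered by E-value and identity before choosing candidate homologs. The sketch below does this for the TAIR10 rows; the `Hit` record, the `strong_hits` helper, and both cutoffs (1e-40 and 40% identity) are hypothetical choices for illustration, not thresholds used by the database.

```python
from typing import NamedTuple

class Hit(NamedTuple):
    match_name: str
    e_value: float
    identity: float  # percent identity

# TAIR10 rows copied from the BLAST summary.
tair10_hits = [
    Hit("AT2G27080.1", 5.3e-54, 42.69),
    Hit("AT5G21130.1", 7.4e-48, 42.06),
    Hit("AT1G54540.1", 8.5e-20, 30.84),
    Hit("AT5G36970.1", 8.5e-20, 24.90),
    Hit("AT1G65690.1", 2.7e-18, 25.59),
]

def strong_hits(hits, max_e_value=1e-40, min_identity=40.0):
    """Keep hits passing both (illustrative) cutoffs, smallest E-value first."""
    kept = [h for h in hits if h.e_value <= max_e_value and h.identity >= min_identity]
    return sorted(kept, key=lambda h: h.e_value)

print([h.match_name for h in strong_hits(tair10_hits)])
# ['AT2G27080.1', 'AT5G21130.1']
```

With these example cutoffs only the two closest Arabidopsis matches remain; looser or stricter thresholds would change the selection.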
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR004864   LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
cellular_component   GO:0016021       integral component of membrane
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmaCh02G011860.1   CmaCh02G011860.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term    IPR Description                               Source    Source Term     Source Description                                             Alignment
IPR004864   Late embryogenesis abundant protein, LEA-14   PFAM      PF03168         LEA_2                                                          coord: 126..229; score: 4.3
None        No IPR available                              PANTHER   PTHR31852       FAMILY NOT NAMED                                               coord: 23..254; score: 2.3
None        No IPR available                              PANTHER   PTHR31852:SF0   LATE EMBRYOGENESIS ABUNDANT HYDROXYPROLINE-RICH GLYCOPROTEIN   coord: 23..254; score: 2.3
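The alignment coordinates can be used to pull the annotated region out of the protein sequence. The sketch below assumes the `coord` values are 1-based, inclusive positions on the protein listed in this record (the record does not say so explicitly); `slice_region` is an illustrative helper name.

```python
# Protein sequence of CmaCh02G011860, copied from this record.
PROTEIN = (
    "MAERVHPTDSDAASAHSGDAIPPKNPPPGTYVIQLPKDQVYRVPPPENATRFDLYSRRNT"
    "RPSLCRRFLCSVLLLITILLLLFAIVSGLFFLILQPLSPRFSILAISTKGMQIKPNASIS"
    "PQFNVTVRAENPNKKIGIYYERNSNISVNFGDVMLCKGALPALYQPWRNVTVMAAKLKGS"
    "GIKLSSSAVKALKDSEKQGKVKLKVDLRAPIKMKLYWMKTWTIRAKVSCEIWVKKVTGEA"
    "KATEKKCEHSLKLW"
)

def slice_region(protein: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive (assumed)."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the protein")
    return protein[start - 1:end]

lea2_region = slice_region(PROTEIN, 126, 229)    # PFAM PF03168 (LEA_2)
panther_region = slice_region(PROTEIN, 23, 254)  # PANTHER PTHR31852

print(len(PROTEIN))         # 254
print(len(lea2_region))     # 104
print(len(panther_region))  # 232
```

Under that assumption the LEA_2 match covers 104 of the 254 residues, and the PANTHER match runs from residue 23 to the C-terminus.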

The following gene(s) are paralogous to this gene:

None