Cp4.1LG20g03880 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG20g03880
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
Location: Cp4.1LG20 : 2174342 .. 2175013 (+)
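As a quick consistency check, the location can be turned into a feature length. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (a common convention for records of this kind, but not stated on this page); the function name `parse_location` is illustrative only.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cp4.1LG20 : 2174342 .. 2175013 (+)")
length = end - start + 1  # inclusive coordinates (assumption)
print(seqid, strand, length)
```

Under that assumption the feature spans 672 bp, that is 224 codons, which is consistent with the 223-residue protein listed for this gene plus a stop codon.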
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACACTCCCACCCAATTGGCGGAGACCAACACGCCAGAGCAGCGTCCAATCAAGCGCCACCACACGCCGCGCTATTACGCACAGCGTGTCAAGGACAGCCTAACCACGCGCGTCTCCAAGCTCATATGCGCCGTCTTCTTGGGCTTGTTGTTCATTGTGGGTATCATAATGTTTATATTGTGGCTTAGTTTACGACCCCACCGGCCCCGATTTTTCATTCACCATTTTTCAATTTTGGGTTTGGGCCTCGATAACGGCTACAAAAATCCCGAAATTGTCTTCAACGTCACGGCCCGAAACTCCAACCTTAACATTGGGATCTACTACGATTCCATGGTCGGTTCGGTTTATTATAAGAACCAGAAAATCGGGTCCACGCCGCTGCTGGATGAGTATTATGAAGGCCCCAAGACTACCAAGGTGCTGACGGCGGCGCTTAGTGGAGAGACGTTGAACGTCGATAGGCATCGGTGGATGGAGTTTAGTAAAGAGCGGTCTAAGGGAGCCGTCGGTTTCCGGTTGGAGATTTCGTCGACCATCCGGTTTAGAATATCCGCTTGGGATAGTAAGCGCCATGGTATGCACGCTAATTGTGACGTGTCGGTGGGCCGTGATGGGTTGGTTTTGCCTTCTTCCAAGGACGTGAGATGCCCGGTCTACTTTTCATGA

mRNA sequence

ATGAACACTCCCACCCAATTGGCGGAGACCAACACGCCAGAGCAGCGTCCAATCAAGCGCCACCACACGCCGCGCTATTACGCACAGCGTGTCAAGGACAGCCTAACCACGCGCGTCTCCAAGCTCATATGCGCCGTCTTCTTGGGCTTGTTGTTCATTGTGGGTATCATAATGTTTATATTGTGGCTTAGTTTACGACCCCACCGGCCCCGATTTTTCATTCACCATTTTTCAATTTTGGGTTTGGGCCTCGATAACGGCTACAAAAATCCCGAAATTGTCTTCAACGTCACGGCCCGAAACTCCAACCTTAACATTGGGATCTACTACGATTCCATGGTCGGTTCGGTTTATTATAAGAACCAGAAAATCGGGTCCACGCCGCTGCTGGATGAGTATTATGAAGGCCCCAAGACTACCAAGGTGCTGACGGCGGCGCTTAGTGGAGAGACGTTGAACGTCGATAGGCATCGGTGGATGGAGTTTAGTAAAGAGCGGTCTAAGGGAGCCGTCGGTTTCCGGTTGGAGATTTCGTCGACCATCCGGTTTAGAATATCCGCTTGGGATAGTAAGCGCCATGGTATGCACGCTAATTGTGACGTGTCGGTGGGCCGTGATGGGTTGGTTTTGCCTTCTTCCAAGGACGTGAGATGCCCGGTCTACTTTTCATGA

Coding sequence (CDS)

ATGAACACTCCCACCCAATTGGCGGAGACCAACACGCCAGAGCAGCGTCCAATCAAGCGCCACCACACGCCGCGCTATTACGCACAGCGTGTCAAGGACAGCCTAACCACGCGCGTCTCCAAGCTCATATGCGCCGTCTTCTTGGGCTTGTTGTTCATTGTGGGTATCATAATGTTTATATTGTGGCTTAGTTTACGACCCCACCGGCCCCGATTTTTCATTCACCATTTTTCAATTTTGGGTTTGGGCCTCGATAACGGCTACAAAAATCCCGAAATTGTCTTCAACGTCACGGCCCGAAACTCCAACCTTAACATTGGGATCTACTACGATTCCATGGTCGGTTCGGTTTATTATAAGAACCAGAAAATCGGGTCCACGCCGCTGCTGGATGAGTATTATGAAGGCCCCAAGACTACCAAGGTGCTGACGGCGGCGCTTAGTGGAGAGACGTTGAACGTCGATAGGCATCGGTGGATGGAGTTTAGTAAAGAGCGGTCTAAGGGAGCCGTCGGTTTCCGGTTGGAGATTTCGTCGACCATCCGGTTTAGAATATCCGCTTGGGATAGTAAGCGCCATGGTATGCACGCTAATTGTGACGTGTCGGTGGGCCGTGATGGGTTGGTTTTGCCTTCTTCCAAGGACGTGAGATGCCCGGTCTACTTTTCATGA

Protein sequence

MNTPTQLAETNTPEQRPIKRHHTPRYYAQRVKDSLTTRVSKLICAVFLGLLFIVGIIMFILWLSLRPHRPRFFIHHFSILGLGLDNGYKNPEIVFNVTARNSNLNIGIYYDSMVGSVYYKNQKIGSTPLLDEYYEGPKTTKVLTAALSGETLNVDRHRWMEFSKERSKGAVGFRLEISSTIRFRISAWDSKRHGMHANCDVSVGRDGLVLPSSKDVRCPVYFS
BLAST of Cp4.1LG20g03880 vs. Swiss-Prot
Match: NHL12_ARATH (NDR1/HIN1-like protein 12 OS=Arabidopsis thaliana GN=NHL12 PE=2 SV=1)

HSP 1 Score: 75.5 bits (184), Expect = 8.3e-13
Identity = 39/162 (24.07%), Positives = 76/162 (46.91%), Query Frame = 1

Query: 43  ICAVFLGLLFIVGIIMFILWLSLRPHRPRFFIHHFSILGLGLDN-GYKNPEIVFNVTARN 102
           IC V +G + IV I +F++W+ L+P +PRF +   ++    L             + +RN
Sbjct: 21  ICGVIIGFIIIVLITIFLVWIILQPTKPRFILQDATVYAFNLSQPNLLTSNFQITIASRN 80

Query: 103 SNLNIGIYYDSMVGSVYYKNQKIGSTPLLDEYYEGPKTTKVLTAALSGETLNVDRHRWME 162
            N  IGIYYD +     Y+NQ+I     +   Y+G K   V +  + G ++ +     + 
Sbjct: 81  RNSRIGIYYDRLHVYATYRNQQITLRTAIPPTYQGHKEDNVWSPFVYGNSVPIAPFNAVA 140

Query: 163 FSKERSKGAVGFRLEISSTIRFRISAWDSKRHGMHANCDVSV 204
              E+++G V   +     +R+++    + ++ +H  C   +
Sbjct: 141 LGDEQNRGFVTLIIRADGRVRWKVGTLITGKYHLHVRCQAFI 182

BLAST of Cp4.1LG20g03880 vs. Swiss-Prot
Match: NDR1_ARATH (Protein NDR1 OS=Arabidopsis thaliana GN=NDR1 PE=1 SV=1)

HSP 1 Score: 73.6 bits (179), Expect = 3.2e-12
Identity = 53/174 (30.46%), Positives = 81/174 (46.55%), Query Frame = 1

Query: 44  CAVFLGLLFIVGIIMFILWLSLRPHRPRFFIHHFSILGLGLD-NGYKNPEIVFNVTARNS 103
           C   L  +F  G+    LWLSLR  +P+  I +F I  LG D N   N  + F V   N 
Sbjct: 15  CTCCLSFIFTAGLTSLFLWLSLRADKPKCSIQNFFIPALGKDPNSRDNTTLNFMVRCDNP 74

Query: 104 NLNIGIYYDSM-VGSVYYKNQKIGSTPLL-------DEYYEGPKTTKVLTAALSGETLNV 163
           N + GIYYD + +        KI S+ L+        ++Y+G K      A   G+   +
Sbjct: 75  NKDKGIYYDDVHLNFSTINTTKINSSALVLVGNYTVPKFYQGHKKK----AKKWGQVKPL 134

Query: 164 DRHRWMEFSKERSKGAVGFRLEISSTIRFRISAWDSKRHGMHANCDVSVGRDGL 209
           +    +        G+  FRL++ + +RF+I  W +KR+G+    DV V  DG+
Sbjct: 135 NNQTVLR--AVLPNGSAVFRLDLKTQVRFKIVFWKTKRYGVEVGADVEVNGDGV 182

BLAST of Cp4.1LG20g03880 vs. Swiss-Prot
Match: SYP24_ARATH (Putative syntaxin-24 OS=Arabidopsis thaliana GN=SYP24 PE=3 SV=1)

HSP 1 Score: 58.9 bits (141), Expect = 8.1e-08
Identity = 31/90 (34.44%), Positives = 54/90 (60.00%), Query Frame = 1

Query: 96  NVTARNSNLNIGIYYDSMVGSVYYKNQKIGSTPLLDEYYEGPKTTKVLTAALSGETLNVD 155
           N++ RNS  +IGI+YD    +VYY NQ++G+ P +  +Y G K T +L A   G+TL + 
Sbjct: 34  NLSIRNSKSSIGIHYDRFEATVYYMNQRLGAVP-MPLFYLGSKNTMLLRALFEGQTLVLL 93

Query: 156 RHRWMEFSKERSKGAVGFRLEISSTIRFRI 186
           +    +  ++  K  V +R+++  +I FR+
Sbjct: 94  KGNERKKFEDDQKTGV-YRIDVKLSINFRV 121

BLAST of Cp4.1LG20g03880 vs. TrEMBL
Match: A0A0A0LUA7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G064760 PE=4 SV=1)

HSP 1 Score: 357.1 bits (915), Expect = 1.6e-95
Identity = 175/228 (76.75%), Positives = 196/228 (85.96%), Query Frame = 1

Query: 1   MNTPTQLA--ETNTPEQRPI---KRHHTPRYYAQRVKDSLTTRVSKLICAVFLGLLFIVG 60
           MN+P+ L    T+TPEQ P    KRHH+ RYYA RVK+SLTTRVSKLICA+FL LL I+G
Sbjct: 1   MNSPSHLPVHHTDTPEQHPTAPTKRHHSARYYAHRVKESLTTRVSKLICAIFLSLLLIIG 60

Query: 61  IIMFILWLSLRPHRPRFFIHHFSILGLGLDNGYKNPEIVFNVTARNSNLNIGIYYDSMVG 120
           II FILWLSLRPHRPRFFIH F++ GL L+NG+++ +IVFN TARNSNLNIGIYYD+M G
Sbjct: 61  IITFILWLSLRPHRPRFFIHDFTVTGLSLENGFESAQIVFNATARNSNLNIGIYYDAMSG 120

Query: 121 SVYYKNQKIGSTPLLDEYYEGPKTTKVLTAALSGETLNVDRHRWMEFSKERSKGAVGFRL 180
           SVYYK QKIGSTPLLD YYEGPKTTKVLTAALSG TLN+DR RWME S ERSKG V FRL
Sbjct: 121 SVYYKEQKIGSTPLLDSYYEGPKTTKVLTAALSGATLNIDRQRWMEISNERSKGVVVFRL 180

Query: 181 EISSTIRFRISAWDSKRHGMHANCDVSVGRDGLVLPSSKDVRCPVYFS 224
           EI+STIRFRISAWDSKRH MHANC VSVG DG++LPSSKD+RCPVYF+
Sbjct: 181 EITSTIRFRISAWDSKRHVMHANCPVSVGSDGMILPSSKDLRCPVYFT 228

BLAST of Cp4.1LG20g03880 vs. TrEMBL
Match: V4TPA5_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10017613mg PE=4 SV=1)

HSP 1 Score: 300.8 bits (769), Expect = 1.4e-78
Identity = 139/223 (62.33%), Positives = 177/223 (79.37%), Query Frame = 1

Query: 1   MNTPTQLAETNTPEQRPIKRHHTPRYYAQRVKDSLTTRVSKLICAVFLGLLFIVGIIMFI 60
           M++  +L   +TP+ +PIKRH+T RYYA RV++SLTTRVSK +C +FL LL + GII+F+
Sbjct: 1   MHSTDRLPVRSTPQNQPIKRHNTARYYAHRVRESLTTRVSKTLCIIFLSLLLVAGIILFV 60

Query: 61  LWLSLRPHRPRFFIHHFSILGLGLDNGYKNPEIVFNVTARNSNLNIGIYYDSMVGSVYYK 120
           L+LSLRPHRPR FIH FSI  L   NG++N EI+FNVTARNSN ++GIY+DS+ GSVYYK
Sbjct: 61  LYLSLRPHRPRIFIHEFSIPALAQPNGFENAEIIFNVTARNSNQHVGIYFDSVEGSVYYK 120

Query: 121 NQKIGSTPLLDEYYEGPKTTKVLTAALSGETLNVDRHRWMEFSKERSKGAVGFRLEISST 180
           NQ++G+TPL D +++ PKTT +L A LSG TL V+  RWMEF  +R +G VGFRLEI ST
Sbjct: 121 NQRVGATPLADTFFQEPKTTTILHATLSGATLTVNSGRWMEFMHDRGQGKVGFRLEIKST 180

Query: 181 IRFRISAWDSKRHGMHANCDVSVGRDGLVLPSSKDVRCPVYFS 224
           IRF++S WDSKRH MHA CDV VG DG +LP+ KD RCP+YF+
Sbjct: 181 IRFQVSTWDSKRHTMHATCDVFVGTDGFILPAYKDKRCPLYFT 223

BLAST of Cp4.1LG20g03880 vs. TrEMBL
Match: W9QEY6_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_014826 PE=4 SV=1)

HSP 1 Score: 298.5 bits (763), Expect = 6.7e-78
Identity = 140/212 (66.04%), Positives = 171/212 (80.66%), Query Frame = 1

Query: 13  PEQRPIKRHHTPRYYAQRVKDSLTTRVSKLICAVFLGLLFIVGIIMFILWLSLRPHRPRF 72
           PE RP+KRHHTPRYYA RVK+SLTTRVSK+ICA+FLGLLFIVG I FILWLSLRPHRPRF
Sbjct: 16  PESRPLKRHHTPRYYAHRVKESLTTRVSKMICAIFLGLLFIVGFITFILWLSLRPHRPRF 75

Query: 73  FIHHFSILGLGLDNGYKNPEIVFNVTARNSNLNIGIYYDSMVGSVYYKNQKIGSTPLLDE 132
            IH FS+ GLG ++G++N +I  N +ARN N NIGI YDSM GSVYY++QKIG+ PL+ E
Sbjct: 76  HIHEFSVPGLGQESGFENAQITVNASARNPNQNIGIRYDSMEGSVYYRDQKIGTKPLMHE 135

Query: 133 -YYEGPKTTKVLTAALSGETLNVDRHRWMEFSKERSKGAVGFRLEISSTIRFRISAWDSK 192
            + + PK T V+    SG TL V   RWMEF+ +R++G V FRLE++S IRFR+S+W+SK
Sbjct: 136 PFSQEPKNTTVIDGTFSGATLTVSSQRWMEFTNDRARGMVVFRLELTSMIRFRVSSWESK 195

Query: 193 RHGMHANCDVSVGRDGLVLPSSKDVRCPVYFS 224
           RH MHANCDV VG DGL+LP SK+ +CPVYF+
Sbjct: 196 RHRMHANCDVDVGPDGLILPGSKNRKCPVYFA 227

BLAST of Cp4.1LG20g03880 vs. TrEMBL
Match: M5WM48_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015351mg PE=4 SV=1)

HSP 1 Score: 293.1 bits (749), Expect = 2.8e-76
Identity = 141/224 (62.95%), Positives = 176/224 (78.57%), Query Frame = 1

Query: 2   NTPTQLA-ETNTPEQRPIKRHHTPRYYAQRVKDSLTTRVSKLICAVFLGLLFIVGIIMFI 61
           N+P +L  +  TPE + +KR HTPRYYA RV++SLTTRVSKL C +FLGLL I+G+I FI
Sbjct: 3   NSPDRLPIQYPTPEPQRMKRQHTPRYYAHRVRESLTTRVSKLFCTIFLGLLLILGMIAFI 62

Query: 62  LWLSLRPHRPRFFIHHFSILGLGLDNGYKNPEIVFNVTARNSNLNIGIYYDSMVGSVYYK 121
           LWLSLRPHRPRF IH F++ GL  + G++N +I FN TARN+N +IGIYYDSM GS YYK
Sbjct: 63  LWLSLRPHRPRFHIHAFTVPGLNQETGFENAQITFNATARNANHDIGIYYDSMDGSAYYK 122

Query: 122 NQKIGS-TPLLDEYYEGPKTTKVLTAALSGETLNVDRHRWMEFSKERSKGAVGFRLEISS 181
           +Q+IGS + LL  +Y+GPK T ++  + +G TL V+  RWMEF  +RS+G V FRLE S+
Sbjct: 123 DQRIGSISGLLPPFYQGPKNTTIVAGSFTGATLTVNSQRWMEFINDRSRGTVVFRLEFSA 182

Query: 182 TIRFRISAWDSKRHGMHANCDVSVGRDGLVLPSSKDVRCPVYFS 224
           TIRFRI +WDSKRH MHANCDV VG+DGL+LP SKD RCPVYF+
Sbjct: 183 TIRFRIQSWDSKRHRMHANCDVDVGQDGLILPVSKDRRCPVYFT 226

BLAST of Cp4.1LG20g03880 vs. TrEMBL
Match: A0A061GGB2_THECC (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family OS=Theobroma cacao GN=TCM_030152 PE=4 SV=1)

HSP 1 Score: 281.6 bits (719), Expect = 8.5e-73
Identity = 128/211 (60.66%), Positives = 167/211 (79.15%), Query Frame = 1

Query: 13  PEQRPIKRHHTPRYYAQRVKDSLTTRVSKLICAVFLGLLFIVGIIMFILWLSLRPHRPRF 72
           P+ +P+KR+HT + YA+RV+DS TTRV+K +CA+FL LL  VGI+MFILWLSLRPHRPRF
Sbjct: 20  PQPQPMKRNHTAQRYARRVRDSFTTRVTKTLCAIFLSLLLCVGIVMFILWLSLRPHRPRF 79

Query: 73  FIHHFSILGLGLDNGYKNPEIVFNVTARNSNLNIGIYYDSMVGSVYYKNQKIGSTPLLDE 132
            I  F++ GL   +G++N +I FNVTARN N +IGIYYDSMVGS+YYK+Q+IGSTPLLD 
Sbjct: 80  HIMEFTVPGLAQPSGFENAQITFNVTARNPNQHIGIYYDSMVGSIYYKDQRIGSTPLLDP 139

Query: 133 YYEGPKTTKVLTAALSGETLNVDRHRWMEFSKERSKGAVGFRLEISSTIRFRISAWDSKR 192
           +++ PKTT ++       TL V+ +RW EF  +R +G V FRL+I+S IRF++S WDSK 
Sbjct: 140 FFQEPKTTTIVYRTFDTATLTVNSNRWKEFMDDRQQGMVVFRLDITSVIRFKVSTWDSKH 199

Query: 193 HGMHANCDVSVGRDGLVLPSSKDVRCPVYFS 224
           H MH NCDV+VG DG++LP+SKD +CPVYFS
Sbjct: 200 HKMHVNCDVAVGPDGMILPTSKDKKCPVYFS 230

BLAST of Cp4.1LG20g03880 vs. TAIR10
Match: AT4G05220.1 (AT4G05220.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 258.1 bits (658), Expect = 5.1e-69
Identity = 118/221 (53.39%), Positives = 160/221 (72.40%), Query Frame = 1

Query: 3   TPTQLAETNTPEQRPIKRHHTPRYYAQRVKDSLTTRVSKLICAVFLGLLFIVGIIMFILW 62
           T   +  +  P  +P+KRHH+  YYA RV++SL+TR+SK ICA+FL +LF VG+I FILW
Sbjct: 6   TTIPIRTSPVPRAQPMKRHHSASYYAHRVRESLSTRISKFICAMFLLVLFFVGVIAFILW 65

Query: 63  LSLRPHRPRFFIHHFSILGLGLDNGYKNPEIVFNVTARNSNLNIGIYYDSMVGSVYYKNQ 122
           LSLRPHRPRF I  F + GL    G +N  I FNVT  N N ++G+Y+DSM GS+YYK+Q
Sbjct: 66  LSLRPHRPRFHIQDFVVQGLDQPTGVENARIAFNVTILNPNQHMGVYFDSMEGSIYYKDQ 125

Query: 123 KIGSTPLLDEYYEGPKTTKVLTAALSGETLNVDRHRWMEFSKERSKGAVGFRLEISSTIR 182
           ++G  PLL+ +++ P  T ++T  L+G +L V+ +RW EFS +R++G VGFRL+I STIR
Sbjct: 126 RVGLIPLLNPFFQQPTNTTIVTGTLTGASLTVNSNRWTEFSNDRAQGTVGFRLDIVSTIR 185

Query: 183 FRISAWDSKRHGMHANCDVSVGRDGLVLPSSKDVRCPVYFS 224
           F++  W SK H MHANC++ VGRDGL+LP     RCPVYF+
Sbjct: 186 FKLHRWISKHHRMHANCNIVVGRDGLILPKFNHKRCPVYFT 226

BLAST of Cp4.1LG20g03880 vs. TAIR10
Match: AT1G61760.1 (AT1G61760.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 240.7 bits (613), Expect = 8.4e-64
Identity = 114/222 (51.35%), Positives = 153/222 (68.92%), Query Frame = 1

Query: 2   NTPTQLAETNTPEQRPIKRHHTPRYYAQRVKDSLTTRVSKLICAVFLGLLFIVGIIMFIL 61
           N    L   + P  RPI RHH+      RVK+SLTTRVSKLICA+FL LL  +GII FIL
Sbjct: 3   NKVDSLPVRSNPSTRPISRHHSASNIVHRVKESLTTRVSKLICAIFLSLLLCLGIITFIL 62

Query: 62  WLSLRPHRPRFFIHHFSILGLGLDNGYKNPEIVFNVTARNSNLNIGIYYDSMVGSVYYKN 121
           W+SL+PHRPR  I  FSI GL   +G++   I F +TA N N N+GIYYDSM GSVYYK 
Sbjct: 63  WISLQPHRPRVHIRGFSISGLSRPDGFETSHISFKITAHNPNQNVGIYYDSMEGSVYYKE 122

Query: 122 QKIGSTPLLDEYYEGPKTTKVLTAALSGETLNVDRHRWMEFSKERSKGAVGFRLEISSTI 181
           ++IGST L + +Y+ PK T  +  ALS   + V++ RWME  ++R++G + FRL++ S I
Sbjct: 123 KRIGSTKLTNPFYQDPKNTSSIDGALSRPAMAVNKDRWMEMERDRNQGKIMFRLKVRSMI 182

Query: 182 RFRISAWDSKRHGMHANCDVSVGRDGLVLPSSKDVRCPVYFS 224
           RF++  W SK H M+A+C + +G DG++L ++KD RCPVYF+
Sbjct: 183 RFKVYTWHSKSHKMYASCYIEIGWDGMLLSATKDKRCPVYFT 224

BLAST of Cp4.1LG20g03880 vs. TAIR10
Match: AT5G53730.1 (AT5G53730.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 85.1 bits (209), Expect = 5.9e-17
Identity = 53/183 (28.96%), Positives = 89/183 (48.63%), Query Frame = 1

Query: 23  TPRYYAQRVKDSLTTRVSKLI---CAVFLGLLFIVGIIMFILWLSLRPHRPRFFIHHFSI 82
           +P++ A++   ++  R  KL       F GLL I+    F++WL L P RP F +    I
Sbjct: 8   SPKHCAKKGGININNRHKKLFFTFSTFFSGLLLII----FLVWLILHPERPEFSLTEADI 67

Query: 83  LGLGLDNGYK---NPEIVFNVTARNSNLNIGIYYDSMVGSVYYKNQKIGSTPLLDEYYEG 142
             L L        N  +   + ++N N  +GIYYD ++    Y+ Q+I S   L  +Y+ 
Sbjct: 68  YSLNLTTSSTHLLNSSVQLTLFSKNPNKKVGIYYDKLLVYAAYRGQQITSEASLPPFYQS 127

Query: 143 PKTTKVLTAALSGETLNVDRHRWMEFSKERSKGAVGFRLEISSTIRFRISAWDSKRHGMH 200
            +   +LTA L G  L V +    + S+ERS G +   +++   +R++I  W S  +  +
Sbjct: 128 HEEINLLTAFLQGTELPVAQSFGYQISRERSTGKIIIGMKMDGKLRWKIGTWVSGAYRFN 186

BLAST of Cp4.1LG20g03880 vs. TAIR10
Match: AT4G01410.1 (AT4G01410.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 80.5 bits (197), Expect = 1.5e-15
Identity = 57/186 (30.65%), Positives = 89/186 (47.85%), Query Frame = 1

Query: 41  KLICAVFLGLLFIVGIIMFILWLSLRPHRPRFFIHHFSILGLGL-DNGYKNPEIV----- 100
           + IC     +L I+GII  ILWL  RPH+PR      +++G  + D  +  P ++     
Sbjct: 42  RAICGAIFTILVILGIIALILWLVYRPHKPR-----LTVVGAAIYDLNFTAPPLISTSVQ 101

Query: 101 FNVTARNSNLNIGIYYDSMVGSVYYKNQKIGSTPLLDEYYEGPKTTKVLTAALSGETLNV 160
           F+V ARN N  + I+YD +   V YK+Q I     L     G K+T V+   + G  + V
Sbjct: 102 FSVLARNPNRRVSIHYDKLSMYVTYKDQIITPPLPLPPLRLGHKSTVVIAPVMGGNGIPV 161

Query: 161 DRHRWMEFSKERSKGAVGFRLEISSTIRFRISAWDSKRHGMHANCDV-------SVGRDG 214
                     + + G V  R+ I   +R++  A  + R+G +A CDV       S G+  
Sbjct: 162 SPEVANGLKNDEAYGVVLMRVVIFGRLRWKAGAIKTGRYGFYARCDVWLRFNPSSNGQVP 221

BLAST of Cp4.1LG20g03880 vs. TAIR10
Match: AT3G52470.1 (AT3G52470.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 79.0 bits (193), Expect = 4.2e-15
Identity = 41/166 (24.70%), Positives = 78/166 (46.99%), Query Frame = 1

Query: 39  VSKLICAVFLGLLFIVGIIMFILWLSLRPHRPRFFIHHFSILGLGLDN-GYKNPEIVFNV 98
           V + +CA  +  + IV I +F++W+ LRP +PRF +   ++    L             +
Sbjct: 15  VVRKLCAAIIAFIVIVLITIFLVWVILRPTKPRFVLQDATVYAFNLSQPNLLTSNFQVTI 74

Query: 99  TARNSNLNIGIYYDSMVGSVYYKNQKIGSTPLLDEYYEGPKTTKVLTAALSGETLNVDRH 158
            +RN N  IGIYYD +     Y NQ+I     +   Y+G K   V +  + G  + +  +
Sbjct: 75  ASRNPNSKIGIYYDRLHVYATYMNQQITLRTAIPPTYQGHKEVNVWSPFVYGTAVPIAPY 134

Query: 159 RWMEFSKERSKGAVGFRLEISSTIRFRISAWDSKRHGMHANCDVSV 204
             +   +E+ +G VG  +    T+R+++    + ++ +H  C   +
Sbjct: 135 NSVALGEEKDRGFVGLMIRADGTVRWKVRTLITGKYHIHVRCQAFI 180

BLAST of Cp4.1LG20g03880 vs. NCBI nr
Match: gi|659067831|ref|XP_008441513.1| (PREDICTED: protein NDR1 [Cucumis melo])

HSP 1 Score: 366.3 bits (939), Expect = 3.8e-98
Identity = 180/228 (78.95%), Positives = 198/228 (86.84%), Query Frame = 1

Query: 1   MNTPTQLA--ETNTPEQRPI---KRHHTPRYYAQRVKDSLTTRVSKLICAVFLGLLFIVG 60
           MN+P+ L    T+TPEQRP    K HH+ RYYA RVK+SLTTRVSKLICA+FL LL I+G
Sbjct: 1   MNSPSHLPVHHTDTPEQRPTAPTKHHHSARYYAHRVKESLTTRVSKLICAIFLSLLLIIG 60

Query: 61  IIMFILWLSLRPHRPRFFIHHFSILGLGLDNGYKNPEIVFNVTARNSNLNIGIYYDSMVG 120
           II FILWLSLRPHRPRFFIH FS+ G GLDNG++N +IVFN TARNSNLNIGIYYD+M G
Sbjct: 61  IITFILWLSLRPHRPRFFIHDFSVTGFGLDNGFENAQIVFNATARNSNLNIGIYYDAMSG 120

Query: 121 SVYYKNQKIGSTPLLDEYYEGPKTTKVLTAALSGETLNVDRHRWMEFSKERSKGAVGFRL 180
           SVYYK+QKIGSTPLLD YYEGPKTTKVLTAALSG TLNVDR +WME S ERSKG V FRL
Sbjct: 121 SVYYKDQKIGSTPLLDSYYEGPKTTKVLTAALSGATLNVDRQQWMEMSNERSKGVVVFRL 180

Query: 181 EISSTIRFRISAWDSKRHGMHANCDVSVGRDGLVLPSSKDVRCPVYFS 224
           EI+STIRFRISAWDSKRHGMHANC VSVG DG++LPSSKDVRCPVYF+
Sbjct: 181 EITSTIRFRISAWDSKRHGMHANCPVSVGSDGMILPSSKDVRCPVYFT 228

BLAST of Cp4.1LG20g03880 vs. NCBI nr
Match: gi|449471964|ref|XP_004153455.1| (PREDICTED: protein NDR1 [Cucumis sativus])

HSP 1 Score: 357.1 bits (915), Expect = 2.3e-95
Identity = 175/228 (76.75%), Positives = 196/228 (85.96%), Query Frame = 1

Query: 1   MNTPTQLA--ETNTPEQRPI---KRHHTPRYYAQRVKDSLTTRVSKLICAVFLGLLFIVG 60
           MN+P+ L    T+TPEQ P    KRHH+ RYYA RVK+SLTTRVSKLICA+FL LL I+G
Sbjct: 1   MNSPSHLPVHHTDTPEQHPTAPTKRHHSARYYAHRVKESLTTRVSKLICAIFLSLLLIIG 60

Query: 61  IIMFILWLSLRPHRPRFFIHHFSILGLGLDNGYKNPEIVFNVTARNSNLNIGIYYDSMVG 120
           II FILWLSLRPHRPRFFIH F++ GL L+NG+++ +IVFN TARNSNLNIGIYYD+M G
Sbjct: 61  IITFILWLSLRPHRPRFFIHDFTVTGLSLENGFESAQIVFNATARNSNLNIGIYYDAMSG 120

Query: 121 SVYYKNQKIGSTPLLDEYYEGPKTTKVLTAALSGETLNVDRHRWMEFSKERSKGAVGFRL 180
           SVYYK QKIGSTPLLD YYEGPKTTKVLTAALSG TLN+DR RWME S ERSKG V FRL
Sbjct: 121 SVYYKEQKIGSTPLLDSYYEGPKTTKVLTAALSGATLNIDRQRWMEISNERSKGVVVFRL 180

Query: 181 EISSTIRFRISAWDSKRHGMHANCDVSVGRDGLVLPSSKDVRCPVYFS 224
           EI+STIRFRISAWDSKRH MHANC VSVG DG++LPSSKD+RCPVYF+
Sbjct: 181 EITSTIRFRISAWDSKRHVMHANCPVSVGSDGMILPSSKDLRCPVYFT 228

BLAST of Cp4.1LG20g03880 vs. NCBI nr
Match: gi|568827235|ref|XP_006467970.1| (PREDICTED: protein NDR1-like [Citrus sinensis])

HSP 1 Score: 301.2 bits (770), Expect = 1.5e-78
Identity = 139/223 (62.33%), Positives = 177/223 (79.37%), Query Frame = 1

Query: 1   MNTPTQLAETNTPEQRPIKRHHTPRYYAQRVKDSLTTRVSKLICAVFLGLLFIVGIIMFI 60
           M++  +L   +TP+ +PIKRH+T RYYA RV++SLTTRVSK +C +FL LL + GII+F+
Sbjct: 1   MHSTDRLPVRSTPQNQPIKRHNTARYYAHRVRESLTTRVSKTLCIIFLSLLLVAGIILFV 60

Query: 61  LWLSLRPHRPRFFIHHFSILGLGLDNGYKNPEIVFNVTARNSNLNIGIYYDSMVGSVYYK 120
           L+LSLRPHRPR FIH FSI  L   NG++N EI+FNVTARNSN ++GIY+DS+ GSVYYK
Sbjct: 61  LYLSLRPHRPRIFIHEFSIPALAQPNGFENAEIIFNVTARNSNQHVGIYFDSVEGSVYYK 120

Query: 121 NQKIGSTPLLDEYYEGPKTTKVLTAALSGETLNVDRHRWMEFSKERSKGAVGFRLEISST 180
           NQ++G+TPL D +++ PKTT +L A LSG TL V+  RWMEF  +R +G VGFRLEI ST
Sbjct: 121 NQQVGATPLADTFFQEPKTTTILHATLSGATLTVNSRRWMEFMHDRGQGKVGFRLEIKST 180

Query: 181 IRFRISAWDSKRHGMHANCDVSVGRDGLVLPSSKDVRCPVYFS 224
           IRF++S WDSKRH MHA CDV VG DG +LP+ KD RCP+YF+
Sbjct: 181 IRFQVSTWDSKRHTMHATCDVFVGTDGFILPAYKDKRCPLYFT 223

BLAST of Cp4.1LG20g03880 vs. NCBI nr
Match: gi|567913589|ref|XP_006449108.1| (hypothetical protein CICLE_v10017613mg [Citrus clementina])

HSP 1 Score: 300.8 bits (769), Expect = 1.9e-78
Identity = 139/223 (62.33%), Positives = 177/223 (79.37%), Query Frame = 1

Query: 1   MNTPTQLAETNTPEQRPIKRHHTPRYYAQRVKDSLTTRVSKLICAVFLGLLFIVGIIMFI 60
           M++  +L   +TP+ +PIKRH+T RYYA RV++SLTTRVSK +C +FL LL + GII+F+
Sbjct: 1   MHSTDRLPVRSTPQNQPIKRHNTARYYAHRVRESLTTRVSKTLCIIFLSLLLVAGIILFV 60

Query: 61  LWLSLRPHRPRFFIHHFSILGLGLDNGYKNPEIVFNVTARNSNLNIGIYYDSMVGSVYYK 120
           L+LSLRPHRPR FIH FSI  L   NG++N EI+FNVTARNSN ++GIY+DS+ GSVYYK
Sbjct: 61  LYLSLRPHRPRIFIHEFSIPALAQPNGFENAEIIFNVTARNSNQHVGIYFDSVEGSVYYK 120

Query: 121 NQKIGSTPLLDEYYEGPKTTKVLTAALSGETLNVDRHRWMEFSKERSKGAVGFRLEISST 180
           NQ++G+TPL D +++ PKTT +L A LSG TL V+  RWMEF  +R +G VGFRLEI ST
Sbjct: 121 NQRVGATPLADTFFQEPKTTTILHATLSGATLTVNSGRWMEFMHDRGQGKVGFRLEIKST 180

Query: 181 IRFRISAWDSKRHGMHANCDVSVGRDGLVLPSSKDVRCPVYFS 224
           IRF++S WDSKRH MHA CDV VG DG +LP+ KD RCP+YF+
Sbjct: 181 IRFQVSTWDSKRHTMHATCDVFVGTDGFILPAYKDKRCPLYFT 223

BLAST of Cp4.1LG20g03880 vs. NCBI nr
Match: gi|694446527|ref|XP_009349601.1| (PREDICTED: protein YLS9-like [Pyrus x bretschneideri])

HSP 1 Score: 299.3 bits (765), Expect = 5.7e-78
Identity = 144/215 (66.98%), Positives = 173/215 (80.47%), Query Frame = 1

Query: 11  NTPEQ-RPIKRHHTPRYYAQRVKDSLTTRVSKLICAVFLGLLFIVGIIMFILWLSLRPHR 70
           +TPEQ RP+KR HT RYYA RV++SLTTRVSKLIC +FL LL I+G+I FILWLSLRPHR
Sbjct: 12  STPEQQRPMKRQHTTRYYAHRVRESLTTRVSKLICTIFLSLLLILGMITFILWLSLRPHR 71

Query: 71  PRFFIHHFSILGLGLDNGYKNPEIVFNVTARNSNLNIGIYYDSMVGSVYYKNQKIGS-TP 130
           PRF IH FS+ GLG + G+ N EI+FN TARN+N NIGIYYDSM  +VYYK+Q+IGS T 
Sbjct: 72  PRFHIHAFSVPGLGQETGFANAEIMFNATARNANHNIGIYYDSMDATVYYKDQRIGSITG 131

Query: 131 LLDEYYEGPKTTKVLTAALSGETLNVDRHRWMEFSKERSKGAVGFRLEISSTIRFRISAW 190
           LLD +Y+GPK T ++  +  G TL V+  RWMEF+ +R KG V FRLE ++TIRFRI AW
Sbjct: 132 LLDPFYQGPKNTTIVMGSFQGATLTVNNQRWMEFTNDRKKGTVVFRLEFTATIRFRIHAW 191

Query: 191 DSKRHGMHANCDVSVGRDGLVLPSSKDVRCPVYFS 224
           DS+RH MHANCDV VG+DGL+LP SKD RCPVYF+
Sbjct: 192 DSRRHRMHANCDVDVGQDGLILPISKDRRCPVYFT 226

The following BLAST results are available for this feature:

Swiss-Prot
Match Name                        E-value  Identity (%)  Description
NHL12_ARATH                       8.3e-13  24.07         NDR1/HIN1-like protein 12 OS=Arabidopsis thaliana GN=NHL12 PE=2 SV=1
NDR1_ARATH                        3.2e-12  30.46         Protein NDR1 OS=Arabidopsis thaliana GN=NDR1 PE=1 SV=1
SYP24_ARATH                       8.1e-08  34.44         Putative syntaxin-24 OS=Arabidopsis thaliana GN=SYP24 PE=3 SV=1

TrEMBL
Match Name                        E-value  Identity (%)  Description
A0A0A0LUA7_CUCSA                  1.6e-95  76.75         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G064760 PE=4 SV=1
V4TPA5_9ROSI                      1.4e-78  62.33         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10017613mg PE=4 SV=1
W9QEY6_9ROSA                      6.7e-78  66.04         Uncharacterized protein OS=Morus notabilis GN=L484_014826 PE=4 SV=1
M5WM48_PRUPE                      2.8e-76  62.95         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015351mg PE=4 SV=1
A0A061GGB2_THECC                  8.5e-73  60.66         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family OS=The...

TAIR10
Match Name                        E-value  Identity (%)  Description
AT4G05220.1                       5.1e-69  53.39         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT1G61760.1                       8.4e-64  51.35         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT5G53730.1                       5.9e-17  28.96         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT4G01410.1                       1.5e-15  30.65         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT3G52470.1                       4.2e-15  24.70         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...

NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|659067831|ref|XP_008441513.1|  3.8e-98  78.95         PREDICTED: protein NDR1 [Cucumis melo]
gi|449471964|ref|XP_004153455.1|  2.3e-95  76.75         PREDICTED: protein NDR1 [Cucumis sativus]
gi|568827235|ref|XP_006467970.1|  1.5e-78  62.33         PREDICTED: protein NDR1-like [Citrus sinensis]
gi|567913589|ref|XP_006449108.1|  1.9e-78  62.33         hypothetical protein CICLE_v10017613mg [Citrus clementina]
gi|694446527|ref|XP_009349601.1|  5.7e-78  66.98         PREDICTED: protein YLS9-like [Pyrus x bretschneideri]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR004864  LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0006468      protein phosphorylation
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0004672      protein kinase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG20g03880.1  Cp4.1LG20g03880.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                              Source   Source Term    Source Description                                            Alignment
IPR004864  Late embryogenesis abundant protein, LEA-14  PFAM     PF03168        LEA_2                                                         coord: 97..198; score: 1.
None       No IPR available                             PANTHER  PTHR31415      FAMILY NOT NAMED                                              coord: 6..222; score: 2.0E
None       No IPR available                             PANTHER  PTHR31415:SF3  LATE EMBRYOGENESIS ABUNDANT HYDROXYPROLINE-RICH GLYCOPROTEIN  coord: 6..222; score: 2.0E
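
The alignment coordinates in the InterPro table can be mapped back onto the protein sequence. The sketch below is illustrative only: it assumes the `coord` values are 1-based, inclusive residue positions on the protein listed for this gene, and the helper name `extract_region` is hypothetical.

```python
# Protein sequence of Cp4.1LG20g03880 as listed in this record.
protein = (
    "MNTPTQLAETNTPEQRPIKRHHTPRYYAQRVKDSLTTRVSKLICAVFLGLLFIVGIIMFI"
    "LWLSLRPHRPRFFIHHFSILGLGLDNGYKNPEIVFNVTARNSNLNIGIYYDSMVGSVYYK"
    "NQKIGSTPLLDEYYEGPKTTKVLTAALSGETLNVDRHRWMEFSKERSKGAVGFRLEISST"
    "IRFRISAWDSKRHGMHANCDVSVGRDGLVLPSSKDVRCPVYFS"
)

def extract_region(seq, start, end):
    """Return residues start..end, treating both bounds as 1-based and inclusive (assumed)."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]

# PF03168 (LEA_2) match reported at coord: 97..198
lea2 = extract_region(protein, 97, 198)
print(len(protein), len(lea2))
print(lea2)
```

With these assumptions the LEA_2 match covers 102 of the 223 residues, while the PANTHER matches (6..222) span nearly the whole protein.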

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG20g03880  Cp4.1LG09g08860  Cucurbita pepo (Zucchini)  cpecpeB048
Cp4.1LG20g03880  Cp4.1LG06g08970  Cucurbita pepo (Zucchini)  cpecpeB445