Cp4.1LG20g03950 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG20g03950
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionBEST Arabidopsis thaliana protein match is: glycine-rich protein .
LocationCp4.1LG20 : 2240151 .. 2241149 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCTTCTACGATTCCTACTACGATTCTGCTCAAATTGAGCCTCCAATTCCGCAATCCAGCTACGAACCCACTTTCTACAATCTCTTTGACTACCCACCTCCTTGTTATTTCGGTCAGGCTTATGCTCCCTACACATCCAATTTCAATGAATTCCCCCAATTGATCGAGTATCAACCCGTTGACCATGGGGCTTATGGCTATACAATTAGCTACTCGGCCAATGCTTGTTCTGCATCGACTTTCAGTGTCCCAAAAGTGATCGAATACGACCCTGATTTGTATAGCGATGGTTACCAAAAGGTGTCGTCCCAATTTGTGATCTCCTACTCTGTTTCAGAATTCAACGAGACAGAATTTGAAGAGTACGATCCAACCCCTTACGGTGGTGGCTACGACATTCATGAAACCTACGGTAAGCCCCTTCAACCTTCAACTGACATTTGCTACTCACCCTCCTCTTCTTCACCTCCAAAACCCCCACCCACCGCAATTCAGGAGGCACCAAAGGAAAAAATTGAAGAAAAAACAAAGCCGTCGAGCGAAATCAAGCCGACCCAGATCGAGAAAGATAACACGGCATCTGAATCTGAAGAAATTGAGGAAGTTCAAGCGATTCCCTTTGCAGATCCGGGAATAGGGTATGGAAATGGAAGGGAAGTGAACCAATTTCCAAGTGGGTATGGACTGGAAGCGATGGATCTTTGTGAAAGTTTATTTGGGTATTGGCCATGTCTCTCACGGATTAAAAAGCAAACAGGTTGTAGACAACCCAACAACGGCTGTGGGCGTTGCCATGGGCATTGCTATTGCTATGGCAATTACGGCAACCAGTGGCAGACGGCGGCGGACTATCTATTCGGAAGCCATAACCCATATCCAGATGGAAGGAGTGAAGGAGACGGTGTTTATGGGTATCAAACACAGTATCAAACGGAGCCTGTCTATGGCTACGTTTGGTTGAATCAAGACGACTTCGTTCGGTCCGATGACGCTTGA

mRNA sequence

ATGGCCTTCTACGATTCCTACTACGATTCTGCTCAAATTGAGCCTCCAATTCCGCAATCCAGCTACGAACCCACTTTCTACAATCTCTTTGACTACCCACCTCCTTGTTATTTCGGTCAGGCTTATGCTCCCTACACATCCAATTTCAATGAATTCCCCCAATTGATCGAGTATCAACCCGTTGACCATGGGGCTTATGGCTATACAATTAGCTACTCGGCCAATGCTTGTTCTGCATCGACTTTCAGTGTCCCAAAAGTGATCGAATACGACCCTGATTTGTATAGCGATGGTTACCAAAAGGTGTCGTCCCAATTTGTGATCTCCTACTCTGTTTCAGAATTCAACGAGACAGAATTTGAAGAGTACGATCCAACCCCTTACGGTGGTGGCTACGACATTCATGAAACCTACGGTAAGCCCCTTCAACCTTCAACTGACATTTGCTACTCACCCTCCTCTTCTTCACCTCCAAAACCCCCACCCACCGCAATTCAGGAGGCACCAAAGGAAAAAATTGAAGAAAAAACAAAGCCGTCGAGCGAAATCAAGCCGACCCAGATCGAGAAAGATAACACGGCATCTGAATCTGAAGAAATTGAGGAAGTTCAAGCGATTCCCTTTGCAGATCCGGGAATAGGGTATGGAAATGGAAGGGAAGTGAACCAATTTCCAAGTGGGTATGGACTGGAAGCGATGGATCTTTGTGAAAGTTTATTTGGGTATTGGCCATGTCTCTCACGGATTAAAAAGCAAACAGGTTGTAGACAACCCAACAACGGCTGTGGGCGTTGCCATGGGCATTGCTATTGCTATGGCAATTACGGCAACCAGTGGCAGACGGCGGCGGACTATCTATTCGGAAGCCATAACCCATATCCAGATGGAAGGAGTGAAGGAGACGGTGTTTATGGGTATCAAACACAGTATCAAACGGAGCCTGTCTATGGCTACGTTTGGTTGAATCAAGACGACTTCGTTCGGTCCGATGACGCTTGA

Coding sequence (CDS)

ATGGCCTTCTACGATTCCTACTACGATTCTGCTCAAATTGAGCCTCCAATTCCGCAATCCAGCTACGAACCCACTTTCTACAATCTCTTTGACTACCCACCTCCTTGTTATTTCGGTCAGGCTTATGCTCCCTACACATCCAATTTCAATGAATTCCCCCAATTGATCGAGTATCAACCCGTTGACCATGGGGCTTATGGCTATACAATTAGCTACTCGGCCAATGCTTGTTCTGCATCGACTTTCAGTGTCCCAAAAGTGATCGAATACGACCCTGATTTGTATAGCGATGGTTACCAAAAGGTGTCGTCCCAATTTGTGATCTCCTACTCTGTTTCAGAATTCAACGAGACAGAATTTGAAGAGTACGATCCAACCCCTTACGGTGGTGGCTACGACATTCATGAAACCTACGGTAAGCCCCTTCAACCTTCAACTGACATTTGCTACTCACCCTCCTCTTCTTCACCTCCAAAACCCCCACCCACCGCAATTCAGGAGGCACCAAAGGAAAAAATTGAAGAAAAAACAAAGCCGTCGAGCGAAATCAAGCCGACCCAGATCGAGAAAGATAACACGGCATCTGAATCTGAAGAAATTGAGGAAGTTCAAGCGATTCCCTTTGCAGATCCGGGAATAGGGTATGGAAATGGAAGGGAAGTGAACCAATTTCCAAGTGGGTATGGACTGGAAGCGATGGATCTTTGTGAAAGTTTATTTGGGTATTGGCCATGTCTCTCACGGATTAAAAAGCAAACAGGTTGTAGACAACCCAACAACGGCTGTGGGCGTTGCCATGGGCATTGCTATTGCTATGGCAATTACGGCAACCAGTGGCAGACGGCGGCGGACTATCTATTCGGAAGCCATAACCCATATCCAGATGGAAGGAGTGAAGGAGACGGTGTTTATGGGTATCAAACACAGTATCAAACGGAGCCTGTCTATGGCTACGTTTGGTTGAATCAAGACGACTTCGTTCGGTCCGATGACGCTTGA

Protein sequence

MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQPVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPSSEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEGDGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA
BLAST of Cp4.1LG20g03950 vs. TrEMBL
Match: A0A0A0LUY1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G070580 PE=4 SV=1)

HSP 1 Score: 523.1 bits (1346), Expect = 2.5e-145
Identity = 268/372 (72.04%), Postives = 290/372 (77.96%), Query Frame = 1

Query: 1   MAFY-------DSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAY----------A 60
           MAFY       DSYY+ AQIEPPIPQSS EP FYNLFDYPPPCYFGQAY          A
Sbjct: 1   MAFYNSYDFYDDSYYNYAQIEPPIPQSSNEPNFYNLFDYPPPCYFGQAYDYEVGYSANDA 60

Query: 61  PYTSNFNEFPQLIEYQPVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVS 120
           PY SNFNE PQLI+++PVDHG YGY I YSANACSAS+F++PK+ EY+PDLYS+    VS
Sbjct: 61  PYRSNFNELPQLIDHEPVDHGDYGYAIRYSANACSASSFTLPKLCEYNPDLYSE----VS 120

Query: 121 SQFVISYSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSP-----P 180
           +QFVISYSVS+FNETEFEEYDPTPY GGYDI ETYGKPLQPS +ICY PSSSSP     P
Sbjct: 121 TQFVISYSVSQFNETEFEEYDPTPYDGGYDISETYGKPLQPSIEICYPPSSSSPSKSPPP 180

Query: 181 KPPPTA-----------IQEAPKEKIEEKTKPSSEIKPTQIEKDN--------TASESEE 240
            PPPTA           I EAPK KIEE+TKPSSEIKPTQIEK N        T SES E
Sbjct: 181 PPPPTATAIPIITTIPKIDEAPKGKIEEQTKPSSEIKPTQIEKTNNSSSSDSDTTSESGE 240

Query: 241 IEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPN 300
           IEE +AI   DPGIGYGN REVN+FPSG GLEAMDLCESLFGYWPCLSR K+QT  RQP 
Sbjct: 241 IEEDKAIQLGDPGIGYGNAREVNEFPSGCGLEAMDLCESLFGYWPCLSRAKRQTAYRQPK 300

Query: 301 NGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEGDGVYGYQTQYQTEPVYGYV 332
           NGCGRCHGHCYCYGNYGN+WQTAA+YLFGSHNPY DGR EGD VYGYQ Q+Q EPVYGYV
Sbjct: 301 NGCGRCHGHCYCYGNYGNEWQTAAEYLFGSHNPYLDGRREGDVVYGYQRQFQEEPVYGYV 360

BLAST of Cp4.1LG20g03950 vs. TrEMBL
Match: B9HYT2_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0011s03530g PE=4 SV=1)

HSP 1 Score: 192.2 bits (487), Expect = 1.0e-45
Identity = 142/359 (39.55%), Postives = 174/359 (48.47%), Query Frame = 1

Query: 1   MAFYD-SYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQ 60
           MA+Y  SY D  Q E    + S  P  YN    P P +   AY+ Y  N+NE   L    
Sbjct: 1   MAYYSYSYEDDYQGEYYTGEYSITP--YNSSYDPSPDHDSVAYSSY--NYNEHQVLAYDP 60

Query: 61  PVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETE 120
           P  + AY    SYS  A SASTFS P  IEYDP      Y    ++F++SY+VSEFNE  
Sbjct: 61  PSYYAAYDPVSSYSRTAYSASTFSEPVCIEYDPG----HYYNEQTRFIVSYNVSEFNEPA 120

Query: 121 FEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPP------PTAIQEAPKEKI 180
           +EEYDPTPY GGYD+  TYGKPL  S + CY  S+  P           + I    K+++
Sbjct: 121 YEEYDPTPYDGGYDLAATYGKPLPHSAETCYPRSTPDPNVSSLNGFSYGSIIAPYGKDEV 180

Query: 181 EE-KTKPSSEIK------------PTQIEKDNTASESE------EIEEVQAIPFADPG-- 240
            E   KP +E K            P  +E  N    S+      E  E + +   DP   
Sbjct: 181 NEPAAKPQNESKPISPPAIEAAPVPVPLELSNGRGNSQEKLQKGEESEEKGVDHPDPSPG 240

Query: 241 ------------IGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPNN 300
                        GY  G    Q P GYGLEAMDLCESLFGYWPCLSR  +     Q   
Sbjct: 241 YDTGIANGSCGEFGYEYGMPGPQIPPGYGLEAMDLCESLFGYWPCLSRYARNVNDCQEAA 300

Query: 301 GCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPD----GRSEGDGVYGYQTQYQTEPV 316
            C          G+ GNQW+  ADYLFGS NPY +    G S G+ +YGY+  YQ EP+
Sbjct: 301 DC----------GSRGNQWKGTADYLFGSSNPYGERDDGGNSHGNAIYGYERHYQEEPL 341

BLAST of Cp4.1LG20g03950 vs. TrEMBL
Match: A0A067JDK9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21309 PE=4 SV=1)

HSP 1 Score: 187.6 bits (475), Expect = 2.5e-44
Identity = 132/326 (40.49%), Postives = 167/326 (51.23%), Query Frame = 1

Query: 1   MAFYDSYY----DSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLI 60
           MA+Y S Y       +     P + Y   + N +   PP     A A YT N N+     
Sbjct: 1   MAYYGSSYYMEDGGGEYNSSYPLTPY---YNNSYYDSPPIQDSMATAYYTYNSND----- 60

Query: 61  EYQPVDH-GAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEF 120
              P+   G Y    SYS  A + S  S PK +EYDP      Y    ++F+ SYSVS+F
Sbjct: 61  ---PIPFFGTYDSVSSYSRIAYAVSATSEPKHMEYDPV----PYYSAQTRFITSYSVSQF 120

Query: 121 NETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEE 180
           NE +FEEYDPTPYGGGYD   TYGKPL PS   CY  S+     P P  +    KE+++E
Sbjct: 121 NEPDFEEYDPTPYGGGYDQTVTYGKPLPPSDQTCYPRST-----PDPVILPLNEKEELKE 180

Query: 181 KT-KPSSEIKPTQ-----IEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYG 240
            T KP ++ KPT+      E+   +S  EE  + +++    P  G      V+Q P GYG
Sbjct: 181 DTPKPETQTKPTEGAETEQEQQQQSSFQEEESKEKSVDDYYPWSGSTGVPSVSQVPYGYG 240

Query: 241 LEAMDLCESLFGYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGS 300
           LEAMD+CE LFGYWPCLSR +++            C   C   GN   QW  AADYLFGS
Sbjct: 241 LEAMDICEGLFGYWPCLSRYRRKWD---------ECEQDC---GNRSTQWNMAADYLFGS 294

Query: 301 HNPY----PDGRSEGDGVYGYQTQYQ 312
            NPY     DG    +G+Y YQ QYQ
Sbjct: 301 PNPYSQRNDDGSCSWNGMYSYQRQYQ 294

BLAST of Cp4.1LG20g03950 vs. TrEMBL
Match: W9R7U3_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_026029 PE=4 SV=1)

HSP 1 Score: 181.8 bits (460), Expect = 1.4e-42
Identity = 127/327 (38.84%), Postives = 170/327 (51.99%), Query Frame = 1

Query: 20  SSY--EPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQPVDHGAYGYTISYSANAC 79
           SSY  +P+ YNLFDY P  Y    Y  Y  ++++                YT S+S    
Sbjct: 40  SSYNKDPSIYNLFDYDPTPY----YHAYNRSYDQ----------------YTPSWSTTCY 99

Query: 80  SASTFSVPKVIEYDPD-LYSDGY-QKVSSQFVISYSVSEFNETEFEEYDPTPYGGGYDIH 139
           S  T +  K I Y+P+  +   Y QK  +QFV SYSVS FN  +F++YDPTPY GGYDI 
Sbjct: 100 STFTRTESKSIVYEPNSCHVISYDQKPRTQFVTSYSVSAFNVPDFDDYDPTPYDGGYDIA 159

Query: 140 ETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIE-------EKTKPSSEIKPTQI 199
           +TYGKPL P+  ICY  S++       +  ++  +EK         EKTK   E KP + 
Sbjct: 160 QTYGKPLPPTDQICYPRSTAEITNVLTSDDKDKEREKAHDDHESQAEKTK--KESKPEET 219

Query: 200 EKDNTASESEEIEEVQAIPFADPGI-------------GYGNGREVNQFPSGYGLEAMDL 259
            K+    E EE E  + +   + G              GY  G +V+Q PSGYGLEAMDL
Sbjct: 220 VKEEEEEEEEEEEAKKELGHEENGTNYKERTEEVGGGNGYEYGNQVSQIPSGYGLEAMDL 279

Query: 260 CESLFGYWPCLSRIKKQT---GCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNP 316
           CES+FGYWPC +R  ++     C +   GCG  +G+ Y Y      W  AA+YLFGS +P
Sbjct: 280 CESIFGYWPCFARYARRANNGNCHEGEYGCGYGYGYGYNY-----HWNGAAEYLFGSSDP 339

BLAST of Cp4.1LG20g03950 vs. TrEMBL
Match: M5X5K5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025399mg PE=4 SV=1)

HSP 1 Score: 181.8 bits (460), Expect = 1.4e-42
Identity = 125/325 (38.46%), Postives = 161/325 (49.54%), Query Frame = 1

Query: 15  PPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQPVDHGAYGYTISYSA 74
           P I  SSYEP+F N  +Y P  YF   +                            SYSA
Sbjct: 35  PTISYSSYEPSFQNFLEYDPTPYFHAFH----------------------------SYSA 94

Query: 75  NACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEFEEYDPTPYGGGYDI 134
           + C +   S    IEY+P  Y   Y    +QF+ISYSVSEFNE +FEEYDPTPY GG+DI
Sbjct: 95  SPCCSKPIS----IEYNPKFYEQSYD---TQFLISYSVSEFNEPDFEEYDPTPYDGGFDI 154

Query: 135 HETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEK-------------IEEKTKPSS 194
            + YGKPL PS + CY  +  S    P    +E   E+             IEE+     
Sbjct: 155 AQVYGKPLSPSDETCYPLNGGSVSVTPLGGNKEQINEQAAKPINGSQPIPAIEEEQMQHQ 214

Query: 195 EIKPTQIEKDNT--ASESEEIEEVQAIPFAD-----PGIGYGNGREVNQFPSGYGLEAMD 254
           E +  Q  +++T       ++EEV+    +D       + +G  ++ +Q PSGYGLEAMD
Sbjct: 215 ESREDQPSQESTDQGKPDHQVEEVEESKGSDHEHNLGSLSHGYEKQAHQIPSGYGLEAMD 274

Query: 255 LCESLFGYWPCLSR-IKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPY 314
           +CESLFGYWPCLSR  K+     Q  +G GR          YGN W+  ADYLFGS NPY
Sbjct: 275 ICESLFGYWPCLSRDFKRGNDTGQGFSGEGR----------YGNPWEGTADYLFGSSNPY 314

BLAST of Cp4.1LG20g03950 vs. TAIR10
Match: AT1G11440.1 (AT1G11440.1 BEST Arabidopsis thaliana protein match is: glycine-rich protein (TAIR:AT3G29075.1))

HSP 1 Score: 128.3 bits (321), Expect = 9.1e-30
Identity = 127/383 (33.16%), Postives = 169/383 (44.13%), Query Frame = 1

Query: 3   FYDSY---YDSAQIEPPIPQSSY-----------EPTFYNLFDYPPPCYFGQAYAPYTSN 62
           FY++Y   YD  Q+     Q+ Y           EP  YN +                 N
Sbjct: 4   FYENYQSPYDYNQVNNLYDQNHYHYNQQQQQLGFEPMSYNYY-----------------N 63

Query: 63  FNEFPQLIEY-------QPVDHGAYGYTISYSAN-----ACSASTFSVPKVIEYDPDLYS 122
           +NE     EY        P+ +  Y +  S S       A S ST S PK + YDP+LY+
Sbjct: 64  WNESESESEYVAYSGYDDPMSYNCYNWNGSESETTSAYVAYSVSTMSEPKHLFYDPNLYT 123

Query: 123 DGYQKVSSQFVISYSVS---EFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPS 182
             Y+    QF I  SV+   +FNE EF+EYDPTPYGGGYD+  TYGKPL PS + CY P 
Sbjct: 124 T-YES-PPQFSIYCSVASALDFNEPEFDEYDPTPYGGGYDVVATYGKPLPPSVETCY-PC 183

Query: 183 SSSP----PKPP------PTAIQEAPKEKIEEK----TKPSSEIKPTQIEK--------- 242
           S++P    P PP      P  I +  ++ + +K     +P  E+KP +  K         
Sbjct: 184 STAPHAKAPSPPEIIAPVPLGIYDGGQKNVVKKRVSFAEPVEEVKPIETIKEQEQEQDED 243

Query: 243 -----------DNTASESEEIEEVQAIPFADPGIGYGNGR-------EVNQF--PSGYGL 302
                      D+   E EE +E       D    YGN         EV     PSGYGL
Sbjct: 244 YDEESEDEDDGDDDDEEEEEGDEEAKEEEKDHSSSYGNEEYEVVDKGEVKALYVPSGYGL 303

Query: 303 EAMDLCESLF-GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGS 308
           EA DLCE +F GY+PC+ R K++    Q       C        N  + W+T +D+LFG 
Sbjct: 304 EATDLCEVIFGGYFPCVLRNKRRQEDEQDRGAAVSC-----WESNDSDPWKTTSDHLFGD 361

BLAST of Cp4.1LG20g03950 vs. TAIR10
Match: AT3G29075.1 (AT3G29075.1 glycine-rich protein)

HSP 1 Score: 49.7 bits (117), Expect = 4.1e-06
Identity = 23/47 (48.94%), Postives = 30/47 (63.83%), Query Frame = 1

Query: 110 YSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSS 157
           Y+  + +  +F EYDP PY GGYDI  TYG+ + PS + CY  SS S
Sbjct: 4   YTNDDNDVDDFTEYDPMPYSGGYDITVTYGRSIPPSDETCYPLSSLS 50

BLAST of Cp4.1LG20g03950 vs. NCBI nr
Match: gi|659067873|ref|XP_008441695.1| (PREDICTED: uncharacterized protein LOC103485767 [Cucumis melo])

HSP 1 Score: 541.6 bits (1394), Expect = 9.7e-151
Identity = 274/367 (74.66%), Postives = 294/367 (80.11%), Query Frame = 1

Query: 1   MAFYDSY-------YDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAY----------A 60
           MAFYDSY       Y+SAQIEPPI QSS EPTFYNLFDYPPPCYFGQAY          A
Sbjct: 17  MAFYDSYDFYDDSYYNSAQIEPPILQSSNEPTFYNLFDYPPPCYFGQAYDSEVGYFAINA 76

Query: 61  PYTSNFNEFPQLIEYQPVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVS 120
            Y SNF+EFPQLIE++PVDHG YGY I YSANACSAS+F++PKV  YDPDLYS+    VS
Sbjct: 77  AYGSNFSEFPQLIEHEPVDHGDYGYAIRYSANACSASSFTLPKVFGYDPDLYSE----VS 136

Query: 121 SQFVISYSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPT 180
           +QFVISYSVSEFNET+FEEYDPTPY GGYDI+ETYGKPLQPST+ICY PSSSSP KPPP 
Sbjct: 137 TQFVISYSVSEFNETDFEEYDPTPYDGGYDIYETYGKPLQPSTEICYPPSSSSPSKPPPP 196

Query: 181 A-----------IQEAPKEKIEEKTKPSSEIKPTQIEKDN--------TASESEEIEEVQ 240
                       I EAPK KIEE+TKPSSEIKP QIEK N        T SES EIEEV+
Sbjct: 197 TATAIPITTIPKIDEAPKGKIEEQTKPSSEIKPIQIEKTNNSSSSDSDTTSESGEIEEVK 256

Query: 241 AIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPNNGCGR 300
           AI   DPGIGYGNGREVN+FPSGYGLEAMDLCESLFGYWPCLSR K+QT CRQP NGCGR
Sbjct: 257 AIQLGDPGIGYGNGREVNEFPSGYGLEAMDLCESLFGYWPCLSRAKRQTLCRQPKNGCGR 316

Query: 301 CHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEGDGVYGYQTQYQTEPVYGYVWLNQD 332
           CHGHCYCYGNYGNQWQTAA+YLFGSHNPY DGR EGDG YGYQ Q+Q EPVYGYVWLNQ+
Sbjct: 317 CHGHCYCYGNYGNQWQTAAEYLFGSHNPYLDGRGEGDGFYGYQRQFQEEPVYGYVWLNQN 376

BLAST of Cp4.1LG20g03950 vs. NCBI nr
Match: gi|778658573|ref|XP_011652905.1| (PREDICTED: uncharacterized protein At5g39570 [Cucumis sativus])

HSP 1 Score: 523.1 bits (1346), Expect = 3.6e-145
Identity = 268/372 (72.04%), Postives = 290/372 (77.96%), Query Frame = 1

Query: 1   MAFY-------DSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAY----------A 60
           MAFY       DSYY+ AQIEPPIPQSS EP FYNLFDYPPPCYFGQAY          A
Sbjct: 1   MAFYNSYDFYDDSYYNYAQIEPPIPQSSNEPNFYNLFDYPPPCYFGQAYDYEVGYSANDA 60

Query: 61  PYTSNFNEFPQLIEYQPVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVS 120
           PY SNFNE PQLI+++PVDHG YGY I YSANACSAS+F++PK+ EY+PDLYS+    VS
Sbjct: 61  PYRSNFNELPQLIDHEPVDHGDYGYAIRYSANACSASSFTLPKLCEYNPDLYSE----VS 120

Query: 121 SQFVISYSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSP-----P 180
           +QFVISYSVS+FNETEFEEYDPTPY GGYDI ETYGKPLQPS +ICY PSSSSP     P
Sbjct: 121 TQFVISYSVSQFNETEFEEYDPTPYDGGYDISETYGKPLQPSIEICYPPSSSSPSKSPPP 180

Query: 181 KPPPTA-----------IQEAPKEKIEEKTKPSSEIKPTQIEKDN--------TASESEE 240
            PPPTA           I EAPK KIEE+TKPSSEIKPTQIEK N        T SES E
Sbjct: 181 PPPPTATAIPIITTIPKIDEAPKGKIEEQTKPSSEIKPTQIEKTNNSSSSDSDTTSESGE 240

Query: 241 IEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPN 300
           IEE +AI   DPGIGYGN REVN+FPSG GLEAMDLCESLFGYWPCLSR K+QT  RQP 
Sbjct: 241 IEEDKAIQLGDPGIGYGNAREVNEFPSGCGLEAMDLCESLFGYWPCLSRAKRQTAYRQPK 300

Query: 301 NGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEGDGVYGYQTQYQTEPVYGYV 332
           NGCGRCHGHCYCYGNYGN+WQTAA+YLFGSHNPY DGR EGD VYGYQ Q+Q EPVYGYV
Sbjct: 301 NGCGRCHGHCYCYGNYGNEWQTAAEYLFGSHNPYLDGRREGDVVYGYQRQFQEEPVYGYV 360

BLAST of Cp4.1LG20g03950 vs. NCBI nr
Match: gi|743930503|ref|XP_011009503.1| (PREDICTED: uncharacterized protein LOC105114606 [Populus euphratica])

HSP 1 Score: 198.4 bits (503), Expect = 2.0e-47
Identity = 148/370 (40.00%), Postives = 182/370 (49.19%), Query Frame = 1

Query: 1   MAFYD-SYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQ 60
           MA+Y   Y D  Q E    Q S  P  YN    P P +   AY+ Y  N+NE   L    
Sbjct: 1   MAYYSYRYEDDYQGEYYTGQYSITP--YNSSYDPSPDHDSVAYSSY--NYNEHQVLAYDP 60

Query: 61  PVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETE 120
           P  + AY    SYS  A SASTFS P  IEY P      Y    ++F++SY+VSEFNE  
Sbjct: 61  PSYYAAYDPVSSYSRTAYSASTFSEPMCIEYHPG----HYHNEQTRFIVSYNVSEFNEPA 120

Query: 121 FEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQ----EAP--KEKI 180
           +EEYDPTPYGGGYD+  TYGKPL  S + CY  S+  P             +AP  K+++
Sbjct: 121 YEEYDPTPYGGGYDLAATYGKPLPHSAETCYPRSTPDPNVSSLNGFSYGSIKAPYGKDEV 180

Query: 181 EE-KTKPSSEIK------------PTQIEKDNTASES-EEIE-----EVQAIPFADPGIG 240
            E   KP +E K            P  +E  N    S EE++     E + +   DP  G
Sbjct: 181 NEPAAKPQNESKPISPPAIEAAPVPVPLELSNGRGNSREELQKGEESEEKGVDHPDPSPG 240

Query: 241 YGNGREVN--------------QFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPNN 300
           Y  G                  Q P GYGLEAMDLCESLFGYWPCLSR  +     Q   
Sbjct: 241 YDTGIANGSCGEFGYEYGVPGPQIPPGYGLEAMDLCESLFGYWPCLSRYARNVNDCQEAA 300

Query: 301 GCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPD----GRSEGDGVYGYQTQYQTEPVY 327
            C          GN+GNQW+  ADYLFGS NPY +    G S G+ +YGY+  YQ EP+Y
Sbjct: 301 DC----------GNHGNQWKGTADYLFGSSNPYGERDDGGNSYGNAIYGYERHYQEEPLY 352

BLAST of Cp4.1LG20g03950 vs. NCBI nr
Match: gi|224114215|ref|XP_002316699.1| (hypothetical protein POPTR_0011s03530g [Populus trichocarpa])

HSP 1 Score: 192.2 bits (487), Expect = 1.4e-45
Identity = 142/359 (39.55%), Postives = 174/359 (48.47%), Query Frame = 1

Query: 1   MAFYD-SYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQ 60
           MA+Y  SY D  Q E    + S  P  YN    P P +   AY+ Y  N+NE   L    
Sbjct: 1   MAYYSYSYEDDYQGEYYTGEYSITP--YNSSYDPSPDHDSVAYSSY--NYNEHQVLAYDP 60

Query: 61  PVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETE 120
           P  + AY    SYS  A SASTFS P  IEYDP      Y    ++F++SY+VSEFNE  
Sbjct: 61  PSYYAAYDPVSSYSRTAYSASTFSEPVCIEYDPG----HYYNEQTRFIVSYNVSEFNEPA 120

Query: 121 FEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPP------PTAIQEAPKEKI 180
           +EEYDPTPY GGYD+  TYGKPL  S + CY  S+  P           + I    K+++
Sbjct: 121 YEEYDPTPYDGGYDLAATYGKPLPHSAETCYPRSTPDPNVSSLNGFSYGSIIAPYGKDEV 180

Query: 181 EE-KTKPSSEIK------------PTQIEKDNTASESE------EIEEVQAIPFADPG-- 240
            E   KP +E K            P  +E  N    S+      E  E + +   DP   
Sbjct: 181 NEPAAKPQNESKPISPPAIEAAPVPVPLELSNGRGNSQEKLQKGEESEEKGVDHPDPSPG 240

Query: 241 ------------IGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPNN 300
                        GY  G    Q P GYGLEAMDLCESLFGYWPCLSR  +     Q   
Sbjct: 241 YDTGIANGSCGEFGYEYGMPGPQIPPGYGLEAMDLCESLFGYWPCLSRYARNVNDCQEAA 300

Query: 301 GCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPD----GRSEGDGVYGYQTQYQTEPV 316
            C          G+ GNQW+  ADYLFGS NPY +    G S G+ +YGY+  YQ EP+
Sbjct: 301 DC----------GSRGNQWKGTADYLFGSSNPYGERDDGGNSHGNAIYGYERHYQEEPL 341

BLAST of Cp4.1LG20g03950 vs. NCBI nr
Match: gi|802780226|ref|XP_012091440.1| (PREDICTED: uncharacterized protein LOC105649416 [Jatropha curcas])

HSP 1 Score: 187.6 bits (475), Expect = 3.6e-44
Identity = 132/326 (40.49%), Postives = 167/326 (51.23%), Query Frame = 1

Query: 1   MAFYDSYY----DSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLI 60
           MA+Y S Y       +     P + Y   + N +   PP     A A YT N N+     
Sbjct: 1   MAYYGSSYYMEDGGGEYNSSYPLTPY---YNNSYYDSPPIQDSMATAYYTYNSND----- 60

Query: 61  EYQPVDH-GAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEF 120
              P+   G Y    SYS  A + S  S PK +EYDP      Y    ++F+ SYSVS+F
Sbjct: 61  ---PIPFFGTYDSVSSYSRIAYAVSATSEPKHMEYDPV----PYYSAQTRFITSYSVSQF 120

Query: 121 NETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEE 180
           NE +FEEYDPTPYGGGYD   TYGKPL PS   CY  S+     P P  +    KE+++E
Sbjct: 121 NEPDFEEYDPTPYGGGYDQTVTYGKPLPPSDQTCYPRST-----PDPVILPLNEKEELKE 180

Query: 181 KT-KPSSEIKPTQ-----IEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYG 240
            T KP ++ KPT+      E+   +S  EE  + +++    P  G      V+Q P GYG
Sbjct: 181 DTPKPETQTKPTEGAETEQEQQQQSSFQEEESKEKSVDDYYPWSGSTGVPSVSQVPYGYG 240

Query: 241 LEAMDLCESLFGYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGS 300
           LEAMD+CE LFGYWPCLSR +++            C   C   GN   QW  AADYLFGS
Sbjct: 241 LEAMDICEGLFGYWPCLSRYRRKWD---------ECEQDC---GNRSTQWNMAADYLFGS 294

Query: 301 HNPY----PDGRSEGDGVYGYQTQYQ 312
            NPY     DG    +G+Y YQ QYQ
Sbjct: 301 PNPYSQRNDDGSCSWNGMYSYQRQYQ 294

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0LUY1_CUCSA2.5e-14572.04Uncharacterized protein OS=Cucumis sativus GN=Csa_1G070580 PE=4 SV=1[more]
B9HYT2_POPTR1.0e-4539.55Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0011s03530g PE=4 SV=1[more]
A0A067JDK9_JATCU2.5e-4440.49Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21309 PE=4 SV=1[more]
W9R7U3_9ROSA1.4e-4238.84Uncharacterized protein OS=Morus notabilis GN=L484_026029 PE=4 SV=1[more]
M5X5K5_PRUPE1.4e-4238.46Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025399mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G11440.19.1e-3033.16 BEST Arabidopsis thaliana protein match is: glycine-rich protein (TA... [more]
AT3G29075.14.1e-0648.94 glycine-rich protein[more]
Match NameE-valueIdentityDescription
gi|659067873|ref|XP_008441695.1|9.7e-15174.66PREDICTED: uncharacterized protein LOC103485767 [Cucumis melo][more]
gi|778658573|ref|XP_011652905.1|3.6e-14572.04PREDICTED: uncharacterized protein At5g39570 [Cucumis sativus][more]
gi|743930503|ref|XP_011009503.1|2.0e-4740.00PREDICTED: uncharacterized protein LOC105114606 [Populus euphratica][more]
gi|224114215|ref|XP_002316699.1|1.4e-4539.55hypothetical protein POPTR_0011s03530g [Populus trichocarpa][more]
gi|802780226|ref|XP_012091440.1|3.6e-4440.49PREDICTED: uncharacterized protein LOC105649416 [Jatropha curcas][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0070300 phosphatidic acid binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG20g03950.1Cp4.1LG20g03950.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR33971FAMILY NOT NAMEDcoord: 99..314
score: 1.9
NoneNo IPR availablePANTHERPTHR33971:SF3SUBFAMILY NOT NAMEDcoord: 99..314
score: 1.9

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG20g03950Melon (DHL92) v3.6.1cpemedB572
Cp4.1LG20g03950Silver-seed gourdcarcpeB0916
Cp4.1LG20g03950Silver-seed gourdcarcpeB1118
Cp4.1LG20g03950Silver-seed gourdcarcpeB1222
Cp4.1LG20g03950Silver-seed gourdcarcpeB1291
Cp4.1LG20g03950Cucumber (Chinese Long) v3cpecucB0650
Cp4.1LG20g03950Wax gourdcpewgoB0632
Cp4.1LG20g03950Wax gourdcpewgoB0637
Cp4.1LG20g03950Cucurbita pepo (Zucchini)cpecpeB048
Cp4.1LG20g03950Cucurbita pepo (Zucchini)cpecpeB086
Cp4.1LG20g03950Cucurbita pepo (Zucchini)cpecpeB412
Cp4.1LG20g03950Cucurbita pepo (Zucchini)cpecpeB437
Cp4.1LG20g03950Cucurbita pepo (Zucchini)cpecpeB445
Cp4.1LG20g03950Cucumber (Gy14) v1cgycpeB0040
Cp4.1LG20g03950Cucumber (Gy14) v1cgycpeB0394
Cp4.1LG20g03950Cucurbita maxima (Rimu)cmacpeB044
Cp4.1LG20g03950Cucurbita maxima (Rimu)cmacpeB438
Cp4.1LG20g03950Cucurbita maxima (Rimu)cmacpeB533
Cp4.1LG20g03950Cucurbita maxima (Rimu)cmacpeB677
Cp4.1LG20g03950Cucurbita maxima (Rimu)cmacpeB881
Cp4.1LG20g03950Cucurbita moschata (Rifu)cmocpeB201
Cp4.1LG20g03950Cucurbita moschata (Rifu)cmocpeB403
Cp4.1LG20g03950Cucurbita moschata (Rifu)cmocpeB443
Cp4.1LG20g03950Cucurbita moschata (Rifu)cmocpeB631
Cp4.1LG20g03950Cucurbita moschata (Rifu)cmocpeB818
Cp4.1LG20g03950Wild cucumber (PI 183967)cpecpiB527
Cp4.1LG20g03950Cucumber (Chinese Long) v2cpecuB525
Cp4.1LG20g03950Cucumber (Chinese Long) v2cpecuB526
Cp4.1LG20g03950Bottle gourd (USVL1VR-Ls)cpelsiB415
Cp4.1LG20g03950Watermelon (Charleston Gray)cpewcgB450
Cp4.1LG20g03950Watermelon (97103) v1cpewmB513
Cp4.1LG20g03950Melon (DHL92) v3.5.1cpemeB485
Cp4.1LG20g03950Melon (DHL92) v3.5.1cpemeB481