Cp4.1LG20g02290 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG20g02290
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Phosphoglycerate mutase family protein
Location: Cp4.1LG20 : 1340456 .. 1341920 (+)
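
The location field above can be turned into a span length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re


def parse_location(location: str):
    """Split 'landmark : start .. end (strand)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, location)
    if m is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start: int, end: int) -> int:
    # Assumption: 1-based, inclusive coordinates, so both ends are counted.
    return end - start + 1


landmark, start, end, strand = parse_location("Cp4.1LG20 : 1340456 .. 1341920 (+)")
print(landmark, strand, span_length(start, end))  # Cp4.1LG20 + 1465
```
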
The following sequences are available for this feature:

Gene sequence (with intron)

TAATGCGGTGGAATTGACCATTGATCTTCTGATTTCTAAAAATCTAACGAAAATGAAAGATAAGGCTGAGAATGAACAAACCTCCATCGACGATTTTCGACCAACACGCCGTCGCCGCCGGCTAAAATGTACGCATCGTCCTCCGTTTCGATTGCAGCCCTTACCTGCCCACGCTTGCTTCAGCCTCCCTTGAAGCCTGTCGCGTGCTCTCAAGCTTCTTTCAACCGCCGCCATCTGATTACCACTCTGGTCTCTGTCTCTACCCCTTCTTTCATCGATTTCTGTCTGCCCTGTTCTTCTGATTTTGTTGCTCTTCCTGTCGCTGAAGCTCGTGGTCTGTTCCAGATGCCCCCTGTTCGTCTCGTCAATCGGTAACTTCACTAACCCAGATTTTTTCTTCGTTTTATTTCGCATTTGGAGGCCGATAAAAGGGTTTTAATGGGTTTGAGAAGGTACTTTCTGGTGAGGGCCGGAGAGTCAGAATTCGATAGCTTTGGTATAATTAACACAAACCCAGTTGCAAAAACATCAGTTGATAGCGGATTGTCTGAGGAAGGGAAGAAGCAAACTGTTAAAGCGGCTTTCAAATTGAAAGAAATGGGGGCTTGTGACAATGGCTGCTGGATTTGGCCCTCTATTACGCAGAGAGCTTATCAAGCAGCTGAGATTATTGCATCAGTTAATGGTGTGAATCGAAGGTATTCATCTTTGTCTTGAATGTTCAAGTTCTTGCCGCCATGGAATCAGATGTTTTTGTTGGTTTCAGTTATATAGTTCCTGAATATAGTTTTCTTGATGCCCGTGGGCTTGGAGCTTATGAAGGCAAGAGATTGGAGTCTGTTTCAGAAGTATGATTGTTTGAAAGTAATGATTTGCTTGTAGTTTCTGTCCAGAGACAGTGAATTGTTTGATTGTTCTTATGAATTGTGTTTGAAAACAACAGGTATATGCTTCAGATACCATCTCTTCACGGATCAAACCTCCTCCAATTGATGATGGTACCCCAAACGAGAGCGTATCCGATGTGTTCGTTCGTGTAACGCAACTGATGTCGATTCTCGAAACGCAATACTCTGGTGATACTATCATCATTGTGTCGCCAGATTCTGATAATTTGACAGTTCTTCAAGCTGGTTTAATCGGCCTTGACTTGCGAAGGTAGCTGCACATCTGAAATGTTTCTCTCATAAGCATCATCTTGTCATTGTGTTATTTGAATCATTGACAAGTTAAAGCTTCTTTTGTTCTTGAATTCAGGCACCATGATCTTTCCTTTGCACCCGGGGAGGTTCGCTTTGTTGATATAAGTAGTATCCCTTCCTATAAGCAACCAGCTTCAGCTGTTTATAAATGTTTAAACCTTCCTAATTGTAACTGATATAGCTTTCTTTGTATATATTATTTCTCTTCCATGTTTATTTTTACACACTTCGAGGGCCTAATGTGACATAGGGGTTACATCTCG

mRNA sequence

TAATGCGGTGGAATTGACCATTGATCTTCTGATTTCTAAAAATCTAACGAAAATGAAAGATAAGGCTGAGAATGAACAAACCTCCATCGACGATTTTCGACCAACACGCCGTCGCCGCCGGCTAAAATGTACGCATCGTCCTCCGTTTCGATTGCAGCCCTTACCTGCCCACGCTTGCTTCAGCCTCCCTTGAAGCCTGTCGCGTGCTCTCAAGCTTCTTTCAACCGCCGCCATCTGATTACCACTCTGGTCTCTGTCTCTACCCCTTCTTTCATCGATTTCTGTCTGCCCTGTTCTTCTGATTTTGTTGCTCTTCCTGTCGCTGAAGCTCGTGGTCTGTTCCAGATGCCCCCTGTTCGTCTCGTCAATCGGTACTTTCTGGTGAGGGCCGGAGAGTCAGAATTCGATAGCTTTGGTATAATTAACACAAACCCAGTTGCAAAAACATCAGTTGATAGCGGATTGTCTGAGGAAGGGAAGAAGCAAACTGTTAAAGCGGCTTTCAAATTGAAAGAAATGGGGGCTTGTGACAATGGCTGCTGGATTTGGCCCTCTATTACGCAGAGAGCTTATCAAGCAGCTGAGATTATTGCATCAGTTAATGGTGTGAATCGAAGTTATATAGTTCCTGAATATAGTTTTCTTGATGCCCGTGGGCTTGGAGCTTATGAAGGCAAGAGATTGGAGTCTGTTTCAGAAGTATATGCTTCAGATACCATCTCTTCACGGATCAAACCTCCTCCAATTGATGATGGTACCCCAAACGAGAGCGTATCCGATGTGTTCGTTCGTGTAACGCAACTGATGTCGATTCTCGAAACGCAATACTCTGGTGATACTATCATCATTGTGTCGCCAGATTCTGATAATTTGACAGTTCTTCAAGCTGGTTTAATCGGCCTTGACTTGCGAAGGCACCATGATCTTTCCTTTGCACCCGGGGAGGTTCGCTTTGTTGATATAAGTAGTATCCCTTCCTATAAGCAACCAGCTTCAGCTGTTTATAAATGTTTAAACCTTCCTAATTGTAACTGATATAGCTTTCTTTGTATATATTATTTCTCTTCCATGTTTATTTTTACACACTTCGAGGGCCTAATGTGACATAGGGGTTACATCTCG

Coding sequence (CDS)

ATGTACGCATCGTCCTCCGTTTCGATTGCAGCCCTTACCTGCCCACGCTTGCTTCAGCCTCCCTTGAAGCCTGTCGCGTGCTCTCAAGCTTCTTTCAACCGCCGCCATCTGATTACCACTCTGGTCTCTGTCTCTACCCCTTCTTTCATCGATTTCTGTCTGCCCTGTTCTTCTGATTTTGTTGCTCTTCCTGTCGCTGAAGCTCGTGGTCTGTTCCAGATGCCCCCTGTTCGTCTCGTCAATCGGTACTTTCTGGTGAGGGCCGGAGAGTCAGAATTCGATAGCTTTGGTATAATTAACACAAACCCAGTTGCAAAAACATCAGTTGATAGCGGATTGTCTGAGGAAGGGAAGAAGCAAACTGTTAAAGCGGCTTTCAAATTGAAAGAAATGGGGGCTTGTGACAATGGCTGCTGGATTTGGCCCTCTATTACGCAGAGAGCTTATCAAGCAGCTGAGATTATTGCATCAGTTAATGGTGTGAATCGAAGTTATATAGTTCCTGAATATAGTTTTCTTGATGCCCGTGGGCTTGGAGCTTATGAAGGCAAGAGATTGGAGTCTGTTTCAGAAGTATATGCTTCAGATACCATCTCTTCACGGATCAAACCTCCTCCAATTGATGATGGTACCCCAAACGAGAGCGTATCCGATGTGTTCGTTCGTGTAACGCAACTGATGTCGATTCTCGAAACGCAATACTCTGGTGATACTATCATCATTGTGTCGCCAGATTCTGATAATTTGACAGTTCTTCAAGCTGGTTTAATCGGCCTTGACTTGCGAAGGCACCATGATCTTTCCTTTGCACCCGGGGAGGTTCGCTTTGTTGATATAAGTAGTATCCCTTCCTATAAGCAACCAGCTTCAGCTGTTTATAAATGTTTAAACCTTCCTAATTGTAACTGA

Protein sequence

MYASSSVSIAALTCPRLLQPPLKPVACSQASFNRRHLITTLVSVSTPSFIDFCLPCSSDFVALPVAEARGLFQMPPVRLVNRYFLVRAGESEFDSFGIINTNPVAKTSVDSGLSEEGKKQTVKAAFKLKEMGACDNGCWIWPSITQRAYQAAEIIASVNGVNRSYIVPEYSFLDARGLGAYEGKRLESVSEVYASDTISSRIKPPPIDDGTPNESVSDVFVRVTQLMSILETQYSGDTIIIVSPDSDNLTVLQAGLIGLDLRRHHDLSFAPGEVRFVDISSIPSYKQPASAVYKCLNLPNCN
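
The protein sequence can be re-derived from the coding sequence by codon-wise translation. The sketch below assumes the standard genetic code (NCBI translation table 1) applies; `CODON_TABLE` and `translate` are illustrative names, and only the first 30 nt of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGTACGCATCGTCCTCCGTTTCGATTGCA"  # first 30 nt of the CDS
print(translate(cds_start))  # MYASSSVSIA
```

Passing the full CDS to `translate` should reproduce the full protein sequence if the assumption about the genetic code holds.
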
BLAST of Cp4.1LG20g02290 vs. TrEMBL
Match: A0A0A0LTH1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G046020 PE=4 SV=1)

HSP 1 Score: 544.3 bits (1401), Expect = 9.5e-152
Identity = 277/302 (91.72%), Positives = 281/302 (93.05%), Query Frame = 1

Query: 1   MYASSSVSIAALTCPRLLQPPLKPVACSQASFNRRHLITTLVSVSTPSFIDFCLPCSSDF 60
           MYA SSV IAAL C RLLQPPLKP AC Q  FNRRHLITTL+SV TPSFIDF LPCSSD 
Sbjct: 1   MYALSSVPIAALPCSRLLQPPLKPSACFQTFFNRRHLITTLLSVFTPSFIDFTLPCSSDL 60

Query: 61  VALPVAEARGLFQMPPVRLVNRYFLVRAGESEFDSFGIINTNPVAKTSVDSGLSEEGKKQ 120
           VA    EARGLFQMPPVRLVNRYFLVRAGESEFDSFGIINTNPVAKTSVDSGLSEEGKKQ
Sbjct: 61  VA----EARGLFQMPPVRLVNRYFLVRAGESEFDSFGIINTNPVAKTSVDSGLSEEGKKQ 120

Query: 121 TVKAAFKLKEMGACDNGCWIWPSITQRAYQAAEIIASVNGVNRSYIVPEYSFLDARGLGA 180
           TVKAAFKLKEMGAC+NGCWIWPSITQRAYQAAEIIASVNGVNRSYIVPEYSFLDARGLGA
Sbjct: 121 TVKAAFKLKEMGACENGCWIWPSITQRAYQAAEIIASVNGVNRSYIVPEYSFLDARGLGA 180

Query: 181 YEGKRLESVSEVYASDTISSRIKPPPIDDGTPNESVSDVFVRVTQLMSILETQYSGDTII 240
           YEGKRL+S+SEVYASDTISS  KPPP DDGTPNESVSDVFVRVTQLMSILETQYSGDTII
Sbjct: 181 YEGKRLDSMSEVYASDTISSIFKPPPTDDGTPNESVSDVFVRVTQLMSILETQYSGDTII 240

Query: 241 IVSPDSDNLTVLQAGLIGLDLRRHHDLSFAPGEVRFVDISSIPSYKQPASAVYKCLNLPN 300
           IVSPDSDNLTVLQAGLIGLDLRRHHDLSFAPGEVRFVDI SIPSYKQP SAVYKCLN PN
Sbjct: 241 IVSPDSDNLTVLQAGLIGLDLRRHHDLSFAPGEVRFVDIRSIPSYKQPPSAVYKCLNPPN 298

Query: 301 CN 303
           CN
Sbjct: 301 CN 298
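
The percentages in an HSP summary line are simply the match count divided by the alignment length. As a hedged illustration, the sketch below recomputes them from the first hit's summary line; `hsp_percentages` is a hypothetical helper, not a function of any BLAST library.

```python
import re


def hsp_percentages(summary: str) -> dict:
    """Recompute percentages from every 'Label = matches/length' pair."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", summary):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result


line = "Identity = 277/302 (91.72%), Positives = 281/302 (93.05%), Query Frame = 1"
print(hsp_percentages(line))
```

The alignment length (302 here) counts aligned columns including gaps, which is why it can differ from the query length.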

BLAST of Cp4.1LG20g02290 vs. TrEMBL
Match: W9S3T4_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_015003 PE=4 SV=1)

HSP 1 Score: 438.0 bits (1125), Expect = 9.6e-120
Identity = 222/271 (81.92%), Positives = 241/271 (88.93%), Query Frame = 1

Query: 33  NRRHLITTLVSVS-TPSFIDFCLPCSSDFVALPVAEARGLFQMPPVRLVNRYFLVRAGES 92
           NRR+LI+  +++S T  F+DF L      +    A ARGLFQMPPVRL NRYFLVRAGES
Sbjct: 38  NRRNLISAFLTLSSTTHFVDFGLQS----IFSQKAFARGLFQMPPVRLTNRYFLVRAGES 97

Query: 93  EFDSFGIINTNPVAKTSVDSGLSEEGKKQTVKAAFKLKEMGACDNGCWIWPSITQRAYQA 152
           EF+S GIINTNPVAKTSVDSGLSE GKKQTVKAAF+LKEMGAC+ GCWIWPSITQRAYQA
Sbjct: 98  EFESLGIINTNPVAKTSVDSGLSERGKKQTVKAAFELKEMGACEKGCWIWPSITQRAYQA 157

Query: 153 AEIIASVNGVNRSYIVPEYSFLDARGLGAYEGKRLESVSEVYASDTISSRIKPPPIDDGT 212
           AEIIAS NG+ RSYIVPEYSFLDARGLGAYEGK L+SVSEVYASD+IS   KPPPIDDGT
Sbjct: 158 AEIIASFNGIGRSYIVPEYSFLDARGLGAYEGKSLDSVSEVYASDSISPTTKPPPIDDGT 217

Query: 213 PNESVSDVFVRVTQLMSILETQYSGDTIIIVSPDSDNLTVLQAGLIGLDLRRHHDLSFAP 272
           PNESV+DVFVRVTQLMSILETQYSGDT+IIVSPDSDNLTVLQAGL+GLDLRRH +LSFAP
Sbjct: 218 PNESVADVFVRVTQLMSILETQYSGDTVIIVSPDSDNLTVLQAGLLGLDLRRHRELSFAP 277

Query: 273 GEVRFVDISSIPSYKQPASAVYKCLNLPNCN 303
           GEVRFVD SSIP+YKQPASAVYKCLN P+CN
Sbjct: 278 GEVRFVDPSSIPTYKQPASAVYKCLNPPSCN 304

BLAST of Cp4.1LG20g02290 vs. TrEMBL
Match: A0A061FTK9_THECC (Phosphoglycerate mutase family protein isoform 1 OS=Theobroma cacao GN=TCM_012047 PE=4 SV=1)

HSP 1 Score: 436.0 bits (1120), Expect = 3.7e-119
Identity = 221/284 (77.82%), Positives = 246/284 (86.62%), Query Frame = 1

Query: 19  QPPLKPVACSQASFNRRHLITTLVSVSTPSFIDFCLPCSSDFVALPVAEARGLFQMPPVR 78
           Q P   V+C     NRR+L+ T +++S            S  V++PVA ARGL QMPP R
Sbjct: 21  QTPNLHVSCQP--INRRNLLLTSLTLSL-----------SPSVSVPVASARGLLQMPPPR 80

Query: 79  LVNRYFLVRAGESEFDSFGIINTNPVAKTSVDSGLSEEGKKQTVKAAFKLKEMGACDNGC 138
           L NRYFLVRAGESEF+SFGIINTNPVAKTSVDSGLSE+GKKQTV+AA +L+ MGAC+N C
Sbjct: 81  LSNRYFLVRAGESEFESFGIINTNPVAKTSVDSGLSEKGKKQTVRAALELRAMGACENNC 140

Query: 139 WIWPSITQRAYQAAEIIASVNGVNRSYIVPEYSFLDARGLGAYEGKRLESVSEVYASDTI 198
           WIWPSITQRAYQAAEIIA+VNGV+RSYIVPEYSFLDARGLGAYEGK+LE+VSEVY SD+I
Sbjct: 141 WIWPSITQRAYQAAEIIAAVNGVSRSYIVPEYSFLDARGLGAYEGKKLEAVSEVYESDSI 200

Query: 199 SSRIKPPPIDDGTPNESVSDVFVRVTQLMSILETQYSGDTIIIVSPDSDNLTVLQAGLIG 258
           SS IKPPPIDDGTPNESV+DVFVRVTQLMSILETQYS DT+IIVSPDSDNLT+LQAGL+G
Sbjct: 201 SSTIKPPPIDDGTPNESVADVFVRVTQLMSILETQYSEDTVIIVSPDSDNLTILQAGLVG 260

Query: 259 LDLRRHHDLSFAPGEVRFVDISSIPSYKQPASAVYKCLNLPNCN 303
           LDLRRH DLSFAPGEVR+VD SSIP+YKQPASAVYKCLN PNCN
Sbjct: 261 LDLRRHRDLSFAPGEVRYVDPSSIPTYKQPASAVYKCLNPPNCN 291

BLAST of Cp4.1LG20g02290 vs. TrEMBL
Match: B9IEQ4_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0015s08600g PE=4 SV=2)

HSP 1 Score: 432.2 bits (1110), Expect = 5.3e-118
Identity = 217/271 (80.07%), Positives = 238/271 (87.82%), Query Frame = 1

Query: 33  NRRHLITTL-VSVSTPSFIDFCLPCSSDFVALPVAEARGLFQMPPVRLVNRYFLVRAGES 92
           NRRHL+T L +S+ST              +++PVA+ARGLFQMPP RL N+Y+LVRAGES
Sbjct: 35  NRRHLLTALSISIST------------SHLSIPVADARGLFQMPPPRLTNQYYLVRAGES 94

Query: 93  EFDSFGIINTNPVAKTSVDSGLSEEGKKQTVKAAFKLKEMGACDNGCWIWPSITQRAYQA 152
           EF+S GIINTNPVAKTSVDSGLSE+GKKQ VKAA +LKEMGACD GCWIWPSITQRAYQ 
Sbjct: 95  EFESLGIINTNPVAKTSVDSGLSEKGKKQIVKAALQLKEMGACDTGCWIWPSITQRAYQT 154

Query: 153 AEIIASVNGVNRSYIVPEYSFLDARGLGAYEGKRLESVSEVYASDTISSRIKPPPIDDGT 212
           AEIIA+VN ++RSYIVPEYSFLDARGLGAYEGK LE+VSEVYASDTIS R KPPPIDDGT
Sbjct: 155 AEIIAAVNRISRSYIVPEYSFLDARGLGAYEGKNLEAVSEVYASDTISPRNKPPPIDDGT 214

Query: 213 PNESVSDVFVRVTQLMSILETQYSGDTIIIVSPDSDNLTVLQAGLIGLDLRRHHDLSFAP 272
           PNESV+DVFVRVTQLMSILETQYS +TIIIVSPDSDNLT+LQAGL+GLDLRRH DLSFAP
Sbjct: 215 PNESVADVFVRVTQLMSILETQYSEETIIIVSPDSDNLTILQAGLVGLDLRRHRDLSFAP 274

Query: 273 GEVRFVDISSIPSYKQPASAVYKCLNLPNCN 303
           GEVRFVDIS IP+YKQPASAVYKC N P CN
Sbjct: 275 GEVRFVDISRIPTYKQPASAVYKCRNPPICN 293

BLAST of Cp4.1LG20g02290 vs. TrEMBL
Match: M5WY25_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009214mg PE=4 SV=1)

HSP 1 Score: 430.3 bits (1105), Expect = 2.0e-117
Identity = 224/304 (73.68%), Positives = 250/304 (82.24%), Query Frame = 1

Query: 7   VSIAALTCPRLLQPPLKPVACSQ------ASFNRRHLIT--TLVSVSTPSFIDFCLPCSS 66
           V+ +++  P    PP  P A SQ         NRR L+   T+ + +TP    F  P S+
Sbjct: 3   VTSSSIKTPPPPHPP-NPWAASQNPNNSFLPINRRSLLAALTISTTTTPFAHTFVNPMSA 62

Query: 67  DFVALPVAEARGLFQMPPVRLVNRYFLVRAGESEFDSFGIINTNPVAKTSVDSGLSEEGK 126
             VA     ARGLFQMPPVRL NRYFLVRAGESE++S G+INTNPVAKTSVD+GLSE+GK
Sbjct: 63  QEVAF----ARGLFQMPPVRLTNRYFLVRAGESEYESIGVINTNPVAKTSVDNGLSEKGK 122

Query: 127 KQTVKAAFKLKEMGACDNGCWIWPSITQRAYQAAEIIASVNGVNRSYIVPEYSFLDARGL 186
           KQ V++AF LKEMGACD  CWIWPSITQRAYQAAEIIASVNGV+RSYIVPEYSFLDARGL
Sbjct: 123 KQAVRSAFDLKEMGACDKNCWIWPSITQRAYQAAEIIASVNGVSRSYIVPEYSFLDARGL 182

Query: 187 GAYEGKRLESVSEVYASDTISSRIKPPPIDDGTPNESVSDVFVRVTQLMSILETQYSGDT 246
           GAYEGK+LE+VSEVYASDT+S  IKPPPIDDGTPNESVSDVFVRV QLMSILETQYS DT
Sbjct: 183 GAYEGKKLEAVSEVYASDTLSPTIKPPPIDDGTPNESVSDVFVRVIQLMSILETQYSEDT 242

Query: 247 IIIVSPDSDNLTVLQAGLIGLDLRRHHDLSFAPGEVRFVDISSIPSYKQPASAVYKCLNL 303
           +IIVSPDSDNLT+LQAG+IGLDLRRH +LSFAPGEVRFVD SS+P+YKQPASAVYKCL  
Sbjct: 243 VIIVSPDSDNLTILQAGIIGLDLRRHRELSFAPGEVRFVDTSSVPTYKQPASAVYKCLKP 301

BLAST of Cp4.1LG20g02290 vs. TAIR10
Match: AT5G62840.1 (AT5G62840.1 Phosphoglycerate mutase family protein)

HSP 1 Score: 402.5 bits (1033), Expect = 2.3e-112
Identity = 210/303 (69.31%), Positives = 239/303 (78.88%), Query Frame = 1

Query: 8   SIAALTCPRLLQPPLKPVA-------CSQASFNRRHLITTL-VSVSTPSFIDFCLPCSSD 67
           S  A+T    L PP  P          S     RR L  TL V ++TPS         S 
Sbjct: 4   SSPAVTTASHLHPPPSPETYQIPLNLLSSPHITRRDLFKTLSVCIATPSL--------SV 63

Query: 68  FVALPVAEARGLFQMPPVRLVNRYFLVRAGESEFDSFGIINTNPVAKTSVDSGLSEEGKK 127
            +A P A ARGLFQMPP+RL NRY+LVRAGES+++S GIINTNPVAKTSVDSGLSE+GKK
Sbjct: 64  SIAAP-ANARGLFQMPPLRLSNRYYLVRAGESDYESLGIINTNPVAKTSVDSGLSEKGKK 123

Query: 128 QTVKAAFKLKEMGACDNGCWIWPSITQRAYQAAEIIASVNGVNRSYIVPEYSFLDARGLG 187
           QT++AA +LK MGACD  CW+WPSITQRAYQAAEIIA++NG++RSYIVPEYSFLDARGLG
Sbjct: 124 QTLRAALQLKAMGACDRNCWLWPSITQRAYQAAEIIAAINGISRSYIVPEYSFLDARGLG 183

Query: 188 AYEGKRLESVSEVYASDTISSRIKPPPIDDGTPNESVSDVFVRVTQLMSILETQYSGDTI 247
           AYEGK+LES+SEVYA D+IS + KPPPI DGTPNESVSDVFVRVTQLMSILETQYS DTI
Sbjct: 184 AYEGKKLESISEVYALDSISMKTKPPPISDGTPNESVSDVFVRVTQLMSILETQYSEDTI 243

Query: 248 IIVSPDSDNLTVLQAGLIGLDLRRHHDLSFAPGEVRFVDISSIPSYKQPASAVYKCLNLP 303
           +IVSPDSDNL+VLQAG+ GLDLRRH +L F PGEVR +D +SIP YKQPASAVYKC   P
Sbjct: 244 VIVSPDSDNLSVLQAGIQGLDLRRHSELYFGPGEVRLLDANSIPVYKQPASAVYKCKKPP 297

BLAST of Cp4.1LG20g02290 vs. NCBI nr
Match: gi|449469527|ref|XP_004152471.1| (PREDICTED: uncharacterized protein LOC101222124 [Cucumis sativus])

HSP 1 Score: 544.3 bits (1401), Expect = 1.4e-151
Identity = 277/302 (91.72%), Positives = 281/302 (93.05%), Query Frame = 1

Query: 1   MYASSSVSIAALTCPRLLQPPLKPVACSQASFNRRHLITTLVSVSTPSFIDFCLPCSSDF 60
           MYA SSV IAAL C RLLQPPLKP AC Q  FNRRHLITTL+SV TPSFIDF LPCSSD 
Sbjct: 1   MYALSSVPIAALPCSRLLQPPLKPSACFQTFFNRRHLITTLLSVFTPSFIDFTLPCSSDL 60

Query: 61  VALPVAEARGLFQMPPVRLVNRYFLVRAGESEFDSFGIINTNPVAKTSVDSGLSEEGKKQ 120
           VA    EARGLFQMPPVRLVNRYFLVRAGESEFDSFGIINTNPVAKTSVDSGLSEEGKKQ
Sbjct: 61  VA----EARGLFQMPPVRLVNRYFLVRAGESEFDSFGIINTNPVAKTSVDSGLSEEGKKQ 120

Query: 121 TVKAAFKLKEMGACDNGCWIWPSITQRAYQAAEIIASVNGVNRSYIVPEYSFLDARGLGA 180
           TVKAAFKLKEMGAC+NGCWIWPSITQRAYQAAEIIASVNGVNRSYIVPEYSFLDARGLGA
Sbjct: 121 TVKAAFKLKEMGACENGCWIWPSITQRAYQAAEIIASVNGVNRSYIVPEYSFLDARGLGA 180

Query: 181 YEGKRLESVSEVYASDTISSRIKPPPIDDGTPNESVSDVFVRVTQLMSILETQYSGDTII 240
           YEGKRL+S+SEVYASDTISS  KPPP DDGTPNESVSDVFVRVTQLMSILETQYSGDTII
Sbjct: 181 YEGKRLDSMSEVYASDTISSIFKPPPTDDGTPNESVSDVFVRVTQLMSILETQYSGDTII 240

Query: 241 IVSPDSDNLTVLQAGLIGLDLRRHHDLSFAPGEVRFVDISSIPSYKQPASAVYKCLNLPN 300
           IVSPDSDNLTVLQAGLIGLDLRRHHDLSFAPGEVRFVDI SIPSYKQP SAVYKCLN PN
Sbjct: 241 IVSPDSDNLTVLQAGLIGLDLRRHHDLSFAPGEVRFVDIRSIPSYKQPPSAVYKCLNPPN 298

Query: 301 CN 303
           CN
Sbjct: 301 CN 298

BLAST of Cp4.1LG20g02290 vs. NCBI nr
Match: gi|659067221|ref|XP_008438298.1| (PREDICTED: uncharacterized protein LOC103483447 [Cucumis melo])

HSP 1 Score: 543.9 bits (1400), Expect = 1.8e-151
Identity = 277/303 (91.42%), Positives = 284/303 (93.73%), Query Frame = 1

Query: 1   MYASSSVSIAALTCPRLLQ-PPLKPVACSQASFNRRHLITTLVSVSTPSFIDFCLPCSSD 60
           MYAS SV +AAL CPRLLQ PPLKP  C + SFNRRHLITTL+SV  PSFIDF LPCSSD
Sbjct: 1   MYASFSVPVAALPCPRLLQQPPLKPSECFRTSFNRRHLITTLLSVFAPSFIDFTLPCSSD 60

Query: 61  FVALPVAEARGLFQMPPVRLVNRYFLVRAGESEFDSFGIINTNPVAKTSVDSGLSEEGKK 120
            VA    EARGLFQMPPVRLVNRYFLVRAGESEFDSFGIINTNPVAKTSVDSGLSEEGKK
Sbjct: 61  LVA----EARGLFQMPPVRLVNRYFLVRAGESEFDSFGIINTNPVAKTSVDSGLSEEGKK 120

Query: 121 QTVKAAFKLKEMGACDNGCWIWPSITQRAYQAAEIIASVNGVNRSYIVPEYSFLDARGLG 180
           QTVKAAFKLKEMGAC+NGCWIWPSITQRAYQAAEIIASVNGVNRSYIVPEYSFLDARGLG
Sbjct: 121 QTVKAAFKLKEMGACENGCWIWPSITQRAYQAAEIIASVNGVNRSYIVPEYSFLDARGLG 180

Query: 181 AYEGKRLESVSEVYASDTISSRIKPPPIDDGTPNESVSDVFVRVTQLMSILETQYSGDTI 240
           A+EGKRL+S+SEVYASDTISS IKPPPIDDGTPNESVSDVFVRVTQLMSILETQYSGDTI
Sbjct: 181 AFEGKRLDSLSEVYASDTISSSIKPPPIDDGTPNESVSDVFVRVTQLMSILETQYSGDTI 240

Query: 241 IIVSPDSDNLTVLQAGLIGLDLRRHHDLSFAPGEVRFVDISSIPSYKQPASAVYKCLNLP 300
           IIVSPDSDNLTVLQAGLIGLDLRRHHDLSFAPGEVRFVDI SIPSYKQPASAVYKCLN P
Sbjct: 241 IIVSPDSDNLTVLQAGLIGLDLRRHHDLSFAPGEVRFVDIRSIPSYKQPASAVYKCLNPP 299

Query: 301 NCN 303
           NCN
Sbjct: 301 NCN 299

BLAST of Cp4.1LG20g02290 vs. NCBI nr
Match: gi|703116715|ref|XP_010101199.1| (hypothetical protein L484_015003 [Morus notabilis])

HSP 1 Score: 438.0 bits (1125), Expect = 1.4e-119
Identity = 222/271 (81.92%), Positives = 241/271 (88.93%), Query Frame = 1

Query: 33  NRRHLITTLVSVS-TPSFIDFCLPCSSDFVALPVAEARGLFQMPPVRLVNRYFLVRAGES 92
           NRR+LI+  +++S T  F+DF L      +    A ARGLFQMPPVRL NRYFLVRAGES
Sbjct: 38  NRRNLISAFLTLSSTTHFVDFGLQS----IFSQKAFARGLFQMPPVRLTNRYFLVRAGES 97

Query: 93  EFDSFGIINTNPVAKTSVDSGLSEEGKKQTVKAAFKLKEMGACDNGCWIWPSITQRAYQA 152
           EF+S GIINTNPVAKTSVDSGLSE GKKQTVKAAF+LKEMGAC+ GCWIWPSITQRAYQA
Sbjct: 98  EFESLGIINTNPVAKTSVDSGLSERGKKQTVKAAFELKEMGACEKGCWIWPSITQRAYQA 157

Query: 153 AEIIASVNGVNRSYIVPEYSFLDARGLGAYEGKRLESVSEVYASDTISSRIKPPPIDDGT 212
           AEIIAS NG+ RSYIVPEYSFLDARGLGAYEGK L+SVSEVYASD+IS   KPPPIDDGT
Sbjct: 158 AEIIASFNGIGRSYIVPEYSFLDARGLGAYEGKSLDSVSEVYASDSISPTTKPPPIDDGT 217

Query: 213 PNESVSDVFVRVTQLMSILETQYSGDTIIIVSPDSDNLTVLQAGLIGLDLRRHHDLSFAP 272
           PNESV+DVFVRVTQLMSILETQYSGDT+IIVSPDSDNLTVLQAGL+GLDLRRH +LSFAP
Sbjct: 218 PNESVADVFVRVTQLMSILETQYSGDTVIIVSPDSDNLTVLQAGLLGLDLRRHRELSFAP 277

Query: 273 GEVRFVDISSIPSYKQPASAVYKCLNLPNCN 303
           GEVRFVD SSIP+YKQPASAVYKCLN P+CN
Sbjct: 278 GEVRFVDPSSIPTYKQPASAVYKCLNPPSCN 304

BLAST of Cp4.1LG20g02290 vs. NCBI nr
Match: gi|1009176091|ref|XP_015869245.1| (PREDICTED: uncharacterized protein LOC107406618 [Ziziphus jujuba])

HSP 1 Score: 436.0 bits (1120), Expect = 5.2e-119
Identity = 226/297 (76.09%), Positives = 249/297 (83.84%), Query Frame = 1

Query: 10  AALTCPRLLQPPL----KPVACSQASFNRRHLITTLVSVSTPSFIDFCLPCSSDFVALPV 69
           ++ T P    PPL      V  S    NRR+L++ L+++ST  F DF        ++LPV
Sbjct: 7   SSATPPNPSLPPLTITPNNVCNSFIPINRRNLLS-LLTLSTTHFSDFAFKP----ISLPV 66

Query: 70  AEARGLFQMPPVRLVNRYFLVRAGESEFDSFGIINTNPVAKTSVDSGLSEEGKKQTVKAA 129
           A ARGLFQMPPVRL NRYFLVRAGESEF+S GIINTNPVAKTSVDSGLSE+GKKQT+KAA
Sbjct: 67  ASARGLFQMPPVRLTNRYFLVRAGESEFESLGIINTNPVAKTSVDSGLSEKGKKQTLKAA 126

Query: 130 FKLKEMGACDNGCWIWPSITQRAYQAAEIIASVNGVNRSYIVPEYSFLDARGLGAYEGKR 189
            +LK  GAC   CWIWPSITQRAYQAAEIIASVNG++RSYIVPEYSFLDARGLGAYEGK 
Sbjct: 127 LELKTRGACARNCWIWPSITQRAYQAAEIIASVNGISRSYIVPEYSFLDARGLGAYEGKS 186

Query: 190 LESVSEVYASDTISSRIKPPPIDDGTPNESVSDVFVRVTQLMSILETQYSGDTIIIVSPD 249
           LESV EVYASD+IS   KPPPIDDGTPNESV+DVFVRVTQLMSILETQYSGDT++IVSPD
Sbjct: 187 LESVLEVYASDSISPNTKPPPIDDGTPNESVADVFVRVTQLMSILETQYSGDTVVIVSPD 246

Query: 250 SDNLTVLQAGLIGLDLRRHHDLSFAPGEVRFVDISSIPSYKQPASAVYKCLNLPNCN 303
           SDNLT+LQAGLIGLDLRRH +LSFAPGEVRFVD SSIP+YKQPASAVYKC N PNCN
Sbjct: 247 SDNLTILQAGLIGLDLRRHTELSFAPGEVRFVDTSSIPAYKQPASAVYKCFNPPNCN 298

BLAST of Cp4.1LG20g02290 vs. NCBI nr
Match: gi|590663341|ref|XP_007036188.1| (Phosphoglycerate mutase family protein isoform 1 [Theobroma cacao])

HSP 1 Score: 436.0 bits (1120), Expect = 5.2e-119
Identity = 221/284 (77.82%), Positives = 246/284 (86.62%), Query Frame = 1

Query: 19  QPPLKPVACSQASFNRRHLITTLVSVSTPSFIDFCLPCSSDFVALPVAEARGLFQMPPVR 78
           Q P   V+C     NRR+L+ T +++S            S  V++PVA ARGL QMPP R
Sbjct: 21  QTPNLHVSCQP--INRRNLLLTSLTLSL-----------SPSVSVPVASARGLLQMPPPR 80

Query: 79  LVNRYFLVRAGESEFDSFGIINTNPVAKTSVDSGLSEEGKKQTVKAAFKLKEMGACDNGC 138
           L NRYFLVRAGESEF+SFGIINTNPVAKTSVDSGLSE+GKKQTV+AA +L+ MGAC+N C
Sbjct: 81  LSNRYFLVRAGESEFESFGIINTNPVAKTSVDSGLSEKGKKQTVRAALELRAMGACENNC 140

Query: 139 WIWPSITQRAYQAAEIIASVNGVNRSYIVPEYSFLDARGLGAYEGKRLESVSEVYASDTI 198
           WIWPSITQRAYQAAEIIA+VNGV+RSYIVPEYSFLDARGLGAYEGK+LE+VSEVY SD+I
Sbjct: 141 WIWPSITQRAYQAAEIIAAVNGVSRSYIVPEYSFLDARGLGAYEGKKLEAVSEVYESDSI 200

Query: 199 SSRIKPPPIDDGTPNESVSDVFVRVTQLMSILETQYSGDTIIIVSPDSDNLTVLQAGLIG 258
           SS IKPPPIDDGTPNESV+DVFVRVTQLMSILETQYS DT+IIVSPDSDNLT+LQAGL+G
Sbjct: 201 SSTIKPPPIDDGTPNESVADVFVRVTQLMSILETQYSEDTVIIVSPDSDNLTILQAGLVG 260

Query: 259 LDLRRHHDLSFAPGEVRFVDISSIPSYKQPASAVYKCLNLPNCN 303
           LDLRRH DLSFAPGEVR+VD SSIP+YKQPASAVYKCLN PNCN
Sbjct: 261 LDLRRHRDLSFAPGEVRYVDPSSIPTYKQPASAVYKCLNPPNCN 291

The following BLAST results are available for this feature:

TrEMBL:
Match Name        E-value   Identity (%)  Description
A0A0A0LTH1_CUCSA  9.5e-152  91.72         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G046020 PE=4 SV=1
W9S3T4_9ROSA      9.6e-120  81.92         Uncharacterized protein OS=Morus notabilis GN=L484_015003 PE=4 SV=1
A0A061FTK9_THECC  3.7e-119  77.82         Phosphoglycerate mutase family protein isoform 1 OS=Theobroma cacao GN=TCM_01204...
B9IEQ4_POPTR      5.3e-118  80.07         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0015s08600g PE=4 SV=2
M5WY25_PRUPE      2.0e-117  73.68         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009214mg PE=4 SV=1

TAIR10:
Match Name   E-value   Identity (%)  Description
AT5G62840.1  2.3e-112  69.31         Phosphoglycerate mutase family protein

NCBI nr:
Match Name                         E-value   Identity (%)  Description
gi|449469527|ref|XP_004152471.1|   1.4e-151  91.72         PREDICTED: uncharacterized protein LOC101222124 [Cucumis sativus]
gi|659067221|ref|XP_008438298.1|   1.8e-151  91.42         PREDICTED: uncharacterized protein LOC103483447 [Cucumis melo]
gi|703116715|ref|XP_010101199.1|   1.4e-119  81.92         hypothetical protein L484_015003 [Morus notabilis]
gi|1009176091|ref|XP_015869245.1|  5.2e-119  76.09         PREDICTED: uncharacterized protein LOC107406618 [Ziziphus jujuba]
gi|590663341|ref|XP_007036188.1|   5.2e-119  77.82         Phosphoglycerate mutase family protein isoform 1 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR013078  His_Pase_superF_clade-1

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0030154      cell differentiation
biological_process  GO:0009965      leaf morphogenesis
biological_process  GO:0008150      biological_process
biological_process  GO:0045492      xylan biosynthetic process
biological_process  GO:0010182      sugar mediated signaling pathway
biological_process  GO:0042542      response to hydrogen peroxide
biological_process  GO:0009644      response to high light intensity
biological_process  GO:0016567      protein ubiquitination
biological_process  GO:0010413      glucuronoxylan metabolic process
biological_process  GO:0016556      mRNA modification
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0016874      ligase activity
molecular_function  GO:0005515      protein binding
molecular_function  GO:0004842      ubiquitin-protein transferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG20g02290.1  Cp4.1LG20g02290.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                             Source   Source Term     Source Description                      Alignment
IPR013078  Histidine phosphatase superfamily, clade-1  PFAM     PF00300         His_Phos_1                              coord: 84..277; score: 1.6
None       No IPR available                            PANTHER  PTHR23029       PHOSPHOGLYCERATE MUTASE                 coord: 21..302; score: 1.8E
None       No IPR available                            PANTHER  PTHR23029:SF45  PHOSPHOGLYCERATE MUTASE FAMILY PROTEIN  coord: 21..302; score: 1.8E

The following gene(s) are paralogous to this gene:

None