Cp4.1LG14g03740 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG14g03740
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Midasin
Location: Cp4.1LG14 : 2090098 .. 2092302 (+)
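
As a quick consistency check on the location above, the span length can be derived from the coordinates. This is a minimal sketch in Python. It assumes the coordinates are 1-based and inclusive, which is not stated on this page, and the function name is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive genomic interval (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field of this record.
start, end = 2090098, 2092302

length = span_length(start, end)
# Split the span into whole codons; a non-zero remainder would mean the
# span cannot be coding from end to end.
codons, remainder = divmod(length, 3)

print(length, codons, remainder)
```

Under that assumption the span is 2205 bp, a multiple of three. This fits a feature whose gene, mRNA and CDS sequences are the same, with one of the 735 codons being the terminal TGA stop.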
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATTCTGACCGCCATTTTCGTACTACTAGCAGCAACAGTACCAGTTCCGCCGCCGCAGGCTCTAGCGAGCTCTTCATCTGCTTCACTTCTCGCTTCTCATCTTCTTCTTCTTCTGCCATGAAGATCTCTTCCAAGTGCCTTCTTAGCCCTGGCCGTGCCCGTGAAGGCCAAATCATCCTCTCTACTTCACTCAGCCGCCGTCTCAAATCCAGCGGCAGCCTCAAGGTCGGTCAGGCCTCGCCTGCGTTTCCTACCGGTGGGAAGAAGCGAGGATGCGCGTTCGATAATCCGGAGCCGTCGTCGCCGAAGGTCACTTGCATTGGACAGGTTAGGGTTAAGACTAAGAAGCAGGGAAGGAAGATGAGGGCTAGATCGCTGAAGCGGAGGACTAATTCGGAGGCGAGTTTTCGGAAATCGGAGAGTGTTGTTCAATCGTCGCAGATGAATGGTAATGATCAGCAATTCGTTGCGAATCAATCGTCGCGTCCTAATCTTCTTCGTCAGGACAGTATGAGTAATGGCGGAAACGGTTTCCAGCAGGAGCGGCATTCGCACCGGAGCCAGCGGTGGGTGCATTTGCCGTTCACAATTTGCGAGGCGCTTAGGGCTTTTGGTGCTGAACTCAACTGTTTCTTGCCGTGCCATTCGTCGTGTTCCAGCGATAGGGAGAGTAACAAGGAATCGAAGCCGGCGGGGAGGTCGGAGGAGGAGACCGAGAGTTCCTGCGGGAAAGTGTTTGCGCGGTGGTTGGTGGCGGTACAAGACAGAGACGGGAAGGGGAGGAAGATCGAACTAGTAGTTGGAGACGAAGAACCTCAAACGGAGAAGGAAAATGGAAGCCAGAGGCGGCATATTTTGGACGGAATGAATTTCAAAGACGAAAATGAAGTCGTTGAGAAAGAGGAATCGAGGATCAGCCTTTGCATTCCGCCGAAGAACGCTTTGCTGCTAATGAGGTGCAGATCTGATCCGGTGAAGGTGGCGGAGCTCGCGAAACGATTCTGTGAACCTCCTGCGCCACAACTGGAAGAACATGACAAGGAAGAACTAGACGAAGATCATGAAGAGAAGAAAACTACACAAAATGAAGCGAAAAGAGATGAATCTGTGCCTGTAAGTAAGGAAGACGACGAAGAAGAAAGAACAGTGAAGCTAAATCTGAAGCTTGAAAACGAGGAAGAAATCAATGAAGAATCTGTTTCCGATACTGAAAGAGGAGAAGAATCCACAGAAATGGCCACAGAAAACGAAATCGATGAGCAGAAATCAAACATTTCTATGATAAATCATCAGAGTCAAGAAGAAACAGCAGAAGACAGAATCGATCAAGATAATCAGCAAGAAACAATGGCGATCTCAAGAACAATTCCAATTCCGATTCAGAGCCACTGTGAATCTGAATTTGCTCAAGATGCAGAGAACCTGGAATCAGCCGAAGAAGACGAATCTAAGCGCGAACAAGACAACAGAACAGAGCAGAGAGAAGCATTTGAAGAAGACGAAAATGGCGAAAACCCTACTTCGGCGTCATTATCAGTAGAGACAGAACCAGTTTTAGACGAAACCGGAACTGAATTTGATGAGAATTGGGAAGAAGTAACAGAGACGACGACGGCGATTAAAGAGAAAGCGACGGATGAAGGAATCAGATCCGACACTCAAAACGACGACGAAATGATGGGTCCAGAGGCGGAGGACCAGTCAAAGGAGCGAGACACTCCGCCGCCGGAGCCGGAGAGAAAAACACAAACAGAATCACCTGTTCTTCCGGATTGCTTGCTGTTAATGATGTACGAGCCAAAGCTATCAATGGAGGTGTCGAAGGAGACATGGGTCTGCAGCACGGACTTCATAAGGTGCGTTCCAACGAGGGAGAAGAAGCCAGCCGGTCGAAACCCGCCGCCGCCACCGCCACCGAAGAAGCGGGAGAAGAAGCCGGCGGACAACACGCAGACGGCGGTGATCCAACCGGCGAGGTGGTCGTGTTCGTTTCCAGC
GGCGGCGGCTGCGGCAACAATGATAGAACAGAAGAAGCTAGTGAGGGCCAAGGGTTACGAGCCGTTTGTTCTTACGAGGTGCAAGTCGGAGCCGATGAGGTCATCTTCTAACCTGGCGCCGGACGCTTGCTTTTGGAAGGACCGCAAGCTTGAGCCACATCGCCCAGCTACCTTCGGCGTCGGCGCGGCTGACGTTGGATTTTGA

mRNA sequence

ATGGATTCTGACCGCCATTTTCGTACTACTAGCAGCAACAGTACCAGTTCCGCCGCCGCAGGCTCTAGCGAGCTCTTCATCTGCTTCACTTCTCGCTTCTCATCTTCTTCTTCTTCTGCCATGAAGATCTCTTCCAAGTGCCTTCTTAGCCCTGGCCGTGCCCGTGAAGGCCAAATCATCCTCTCTACTTCACTCAGCCGCCGTCTCAAATCCAGCGGCAGCCTCAAGGTCGGTCAGGCCTCGCCTGCGTTTCCTACCGGTGGGAAGAAGCGAGGATGCGCGTTCGATAATCCGGAGCCGTCGTCGCCGAAGGTCACTTGCATTGGACAGGTTAGGGTTAAGACTAAGAAGCAGGGAAGGAAGATGAGGGCTAGATCGCTGAAGCGGAGGACTAATTCGGAGGCGAGTTTTCGGAAATCGGAGAGTGTTGTTCAATCGTCGCAGATGAATGGTAATGATCAGCAATTCGTTGCGAATCAATCGTCGCGTCCTAATCTTCTTCGTCAGGACAGTATGAGTAATGGCGGAAACGGTTTCCAGCAGGAGCGGCATTCGCACCGGAGCCAGCGGTGGGTGCATTTGCCGTTCACAATTTGCGAGGCGCTTAGGGCTTTTGGTGCTGAACTCAACTGTTTCTTGCCGTGCCATTCGTCGTGTTCCAGCGATAGGGAGAGTAACAAGGAATCGAAGCCGGCGGGGAGGTCGGAGGAGGAGACCGAGAGTTCCTGCGGGAAAGTGTTTGCGCGGTGGTTGGTGGCGGTACAAGACAGAGACGGGAAGGGGAGGAAGATCGAACTAGTAGTTGGAGACGAAGAACCTCAAACGGAGAAGGAAAATGGAAGCCAGAGGCGGCATATTTTGGACGGAATGAATTTCAAAGACGAAAATGAAGTCGTTGAGAAAGAGGAATCGAGGATCAGCCTTTGCATTCCGCCGAAGAACGCTTTGCTGCTAATGAGGTGCAGATCTGATCCGGTGAAGGTGGCGGAGCTCGCGAAACGATTCTGTGAACCTCCTGCGCCACAACTGGAAGAACATGACAAGGAAGAACTAGACGAAGATCATGAAGAGAAGAAAACTACACAAAATGAAGCGAAAAGAGATGAATCTGTGCCTGTAAGTAAGGAAGACGACGAAGAAGAAAGAACAGTGAAGCTAAATCTGAAGCTTGAAAACGAGGAAGAAATCAATGAAGAATCTGTTTCCGATACTGAAAGAGGAGAAGAATCCACAGAAATGGCCACAGAAAACGAAATCGATGAGCAGAAATCAAACATTTCTATGATAAATCATCAGAGTCAAGAAGAAACAGCAGAAGACAGAATCGATCAAGATAATCAGCAAGAAACAATGGCGATCTCAAGAACAATTCCAATTCCGATTCAGAGCCACTGTGAATCTGAATTTGCTCAAGATGCAGAGAACCTGGAATCAGCCGAAGAAGACGAATCTAAGCGCGAACAAGACAACAGAACAGAGCAGAGAGAAGCATTTGAAGAAGACGAAAATGGCGAAAACCCTACTTCGGCGTCATTATCAGTAGAGACAGAACCAGTTTTAGACGAAACCGGAACTGAATTTGATGAGAATTGGGAAGAAGTAACAGAGACGACGACGGCGATTAAAGAGAAAGCGACGGATGAAGGAATCAGATCCGACACTCAAAACGACGACGAAATGATGGGTCCAGAGGCGGAGGACCAGTCAAAGGAGCGAGACACTCCGCCGCCGGAGCCGGAGAGAAAAACACAAACAGAATCACCTGTTCTTCCGGATTGCTTGCTGTTAATGATGTACGAGCCAAAGCTATCAATGGAGGTGTCGAAGGAGACATGGGTCTGCAGCACGGACTTCATAAGGTGCGTTCCAACGAGGGAGAAGAAGCCAGCCGGTCGAAACCCGCCGCCGCCACCGCCACCGAAGAAGCGGGAGAAGAAGCCGGCGGACAACACGCAGACGGCGGTGATCCAACCGGCGAGGTGGTCGTGTTCGTTTCCAGC
GGCGGCGGCTGCGGCAACAATGATAGAACAGAAGAAGCTAGTGAGGGCCAAGGGTTACGAGCCGTTTGTTCTTACGAGGTGCAAGTCGGAGCCGATGAGGTCATCTTCTAACCTGGCGCCGGACGCTTGCTTTTGGAAGGACCGCAAGCTTGAGCCACATCGCCCAGCTACCTTCGGCGTCGGCGCGGCTGACGTTGGATTTTGA

Coding sequence (CDS)

ATGGATTCTGACCGCCATTTTCGTACTACTAGCAGCAACAGTACCAGTTCCGCCGCCGCAGGCTCTAGCGAGCTCTTCATCTGCTTCACTTCTCGCTTCTCATCTTCTTCTTCTTCTGCCATGAAGATCTCTTCCAAGTGCCTTCTTAGCCCTGGCCGTGCCCGTGAAGGCCAAATCATCCTCTCTACTTCACTCAGCCGCCGTCTCAAATCCAGCGGCAGCCTCAAGGTCGGTCAGGCCTCGCCTGCGTTTCCTACCGGTGGGAAGAAGCGAGGATGCGCGTTCGATAATCCGGAGCCGTCGTCGCCGAAGGTCACTTGCATTGGACAGGTTAGGGTTAAGACTAAGAAGCAGGGAAGGAAGATGAGGGCTAGATCGCTGAAGCGGAGGACTAATTCGGAGGCGAGTTTTCGGAAATCGGAGAGTGTTGTTCAATCGTCGCAGATGAATGGTAATGATCAGCAATTCGTTGCGAATCAATCGTCGCGTCCTAATCTTCTTCGTCAGGACAGTATGAGTAATGGCGGAAACGGTTTCCAGCAGGAGCGGCATTCGCACCGGAGCCAGCGGTGGGTGCATTTGCCGTTCACAATTTGCGAGGCGCTTAGGGCTTTTGGTGCTGAACTCAACTGTTTCTTGCCGTGCCATTCGTCGTGTTCCAGCGATAGGGAGAGTAACAAGGAATCGAAGCCGGCGGGGAGGTCGGAGGAGGAGACCGAGAGTTCCTGCGGGAAAGTGTTTGCGCGGTGGTTGGTGGCGGTACAAGACAGAGACGGGAAGGGGAGGAAGATCGAACTAGTAGTTGGAGACGAAGAACCTCAAACGGAGAAGGAAAATGGAAGCCAGAGGCGGCATATTTTGGACGGAATGAATTTCAAAGACGAAAATGAAGTCGTTGAGAAAGAGGAATCGAGGATCAGCCTTTGCATTCCGCCGAAGAACGCTTTGCTGCTAATGAGGTGCAGATCTGATCCGGTGAAGGTGGCGGAGCTCGCGAAACGATTCTGTGAACCTCCTGCGCCACAACTGGAAGAACATGACAAGGAAGAACTAGACGAAGATCATGAAGAGAAGAAAACTACACAAAATGAAGCGAAAAGAGATGAATCTGTGCCTGTAAGTAAGGAAGACGACGAAGAAGAAAGAACAGTGAAGCTAAATCTGAAGCTTGAAAACGAGGAAGAAATCAATGAAGAATCTGTTTCCGATACTGAAAGAGGAGAAGAATCCACAGAAATGGCCACAGAAAACGAAATCGATGAGCAGAAATCAAACATTTCTATGATAAATCATCAGAGTCAAGAAGAAACAGCAGAAGACAGAATCGATCAAGATAATCAGCAAGAAACAATGGCGATCTCAAGAACAATTCCAATTCCGATTCAGAGCCACTGTGAATCTGAATTTGCTCAAGATGCAGAGAACCTGGAATCAGCCGAAGAAGACGAATCTAAGCGCGAACAAGACAACAGAACAGAGCAGAGAGAAGCATTTGAAGAAGACGAAAATGGCGAAAACCCTACTTCGGCGTCATTATCAGTAGAGACAGAACCAGTTTTAGACGAAACCGGAACTGAATTTGATGAGAATTGGGAAGAAGTAACAGAGACGACGACGGCGATTAAAGAGAAAGCGACGGATGAAGGAATCAGATCCGACACTCAAAACGACGACGAAATGATGGGTCCAGAGGCGGAGGACCAGTCAAAGGAGCGAGACACTCCGCCGCCGGAGCCGGAGAGAAAAACACAAACAGAATCACCTGTTCTTCCGGATTGCTTGCTGTTAATGATGTACGAGCCAAAGCTATCAATGGAGGTGTCGAAGGAGACATGGGTCTGCAGCACGGACTTCATAAGGTGCGTTCCAACGAGGGAGAAGAAGCCAGCCGGTCGAAACCCGCCGCCGCCACCGCCACCGAAGAAGCGGGAGAAGAAGCCGGCGGACAACACGCAGACGGCGGTGATCCAACCGGCGAGGTGGTCGTGTTCGTTTCCAGC
GGCGGCGGCTGCGGCAACAATGATAGAACAGAAGAAGCTAGTGAGGGCCAAGGGTTACGAGCCGTTTGTTCTTACGAGGTGCAAGTCGGAGCCGATGAGGTCATCTTCTAACCTGGCGCCGGACGCTTGCTTTTGGAAGGACCGCAAGCTTGAGCCACATCGCCCAGCTACCTTCGGCGTCGGCGCGGCTGACGTTGGATTTTGA

Protein sequence

MDSDRHFRTTSSNSTSSAAAGSSELFICFTSRFSSSSSSAMKISSKCLLSPGRAREGQIILSTSLSRRLKSSGSLKVGQASPAFPTGGKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGRKMRARSLKRRTNSEASFRKSESVVQSSQMNGNDQQFVANQSSRPNLLRQDSMSNGGNGFQQERHSHRSQRWVHLPFTICEALRAFGAELNCFLPCHSSCSSDRESNKESKPAGRSEEETESSCGKVFARWLVAVQDRDGKGRKIELVVGDEEPQTEKENGSQRRHILDGMNFKDENEVVEKEESRISLCIPPKNALLLMRCRSDPVKVAELAKRFCEPPAPQLEEHDKEELDEDHEEKKTTQNEAKRDESVPVSKEDDEEERTVKLNLKLENEEEINEESVSDTERGEESTEMATENEIDEQKSNISMINHQSQEETAEDRIDQDNQQETMAISRTIPIPIQSHCESEFAQDAENLESAEEDESKREQDNRTEQREAFEEDENGENPTSASLSVETEPVLDETGTEFDENWEEVTETTTAIKEKATDEGIRSDTQNDDEMMGPEAEDQSKERDTPPPEPERKTQTESPVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKPAGRNPPPPPPPKKREKKPADNTQTAVIQPARWSCSFPAAAAAATMIEQKKLVRAKGYEPFVLTRCKSEPMRSSSNLAPDACFWKDRKLEPHRPATFGVGAADVGF
BLAST of Cp4.1LG14g03740 vs. TrEMBL
Match: A0A0A0L789_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G236550 PE=4 SV=1)

HSP 1 Score: 957.2 bits (2473), Expect = 1.1e-275
Identity = 566/793 (71.37%), Positives = 630/793 (79.45%), Query Frame = 1

Query: 1   MDSDRHFRTTSSNSTSSAAAGSSELFICFTSRFSSSSSSAMKISSKCLLSPGRARE-GQI 60
           MDSD HFRTTS+NSTSS A  SSELFICFTSRFSSSSSS+MKISSK +LSPGR RE  QI
Sbjct: 1   MDSDPHFRTTSTNSTSSTATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPSQI 60

Query: 61  ILSTSLSRRLKSSGSLKVGQASPAFPTGGKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120
            LSTSLSRRLKSSGSLK GQASP FPTG KKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG
Sbjct: 61  SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120

Query: 121 RKMRARSLKRRTNSEASFRKSESVVQSSQMNGNDQQFVANQSSRPNLLRQDSMSNGGNGF 180
           +KMRARS KRRTNSEASFR+SES+VQSSQ NG+DQQF ++ +   +LLRQ+S SN GNGF
Sbjct: 121 KKMRARSQKRRTNSEASFRRSESLVQSSQGNGSDQQFSSHHNH--HLLRQNSNSNAGNGF 180

Query: 181 QQERHSHRSQRWVHLPFTICEALRAFGAELNCFLPCHSSCSSDRESNKESKPAGRSEEET 240
           QQE  SHR+QRWVHLPFTICEALRAFGAELNCFLPCHSSCS +RE+NKESKPA RS E +
Sbjct: 181 QQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKESKPAERSSE-S 240

Query: 241 ESSCGKVFARWLVAVQDRDGKGRKIELVVGDEEPQTEKENGSQRRHILDGMNFKDENEVV 300
           ESSCG VFARWLVAVQD DGKGR+IELVVGDEE +TEKENGSQRRH+ +G++FKD+NE V
Sbjct: 241 ESSCGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDKNEAV 300

Query: 301 EKEESRISLCIPPKNALLLMRCRSDPVKVAELAKRFCEPPAPQLEEHDKEELDEDHEEKK 360
           E+EESRIS+CIPPKNALLLMRCRSDPVK+AELAKRFCEPPAP+++E D+E  DED+E KK
Sbjct: 301 EEEESRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEEDEEGEDEDNEAKK 360

Query: 361 TTQNEAKRDESVPVS------------KEDDEEERTVKLNLKLENEEEINEESVSDTER- 420
             QNE KRD SVPVS            KE+++E +  +L +KLENEEE+NEE VSD ++ 
Sbjct: 361 -RQNEVKRDVSVPVSSIVTVNKEEEEVKEEEDERKVEQLIVKLENEEEMNEECVSDADKE 420

Query: 421 -------------------GEESTEMATENEIDEQKSNISMIN----HQSQEETAEDRID 480
                               EE+ EMATENEIDEQK +I+++N     Q+ EE  ED+ D
Sbjct: 421 KEEANLVLQEEEREEEEDNEEETIEMATENEIDEQK-DITVVNQLNQEQALEEKEEDKTD 480

Query: 481 QDNQQETMAISRTIPIPIQSHCESEFAQDAENLESAEEDESK----REQDNRTEQREAFE 540
           Q NQQETMAI   IP+ IQ+HCE E AQD E LES E++E K     EQD +TE+ E   
Sbjct: 481 QVNQQETMAI--PIPLLIQTHCEPEMAQDVEKLESVEKEEPKLSHESEQDQKTEEDENLR 540

Query: 541 ED------------ENGENPTSASLSVETEPVLDETGTEFDENWEEVTETTTAIKEKATD 600
           ED            ENGE  TS SLSVETEPV DET TE D N EE  E     +EK TD
Sbjct: 541 EDKEEEEEEEGENGENGETTTSPSLSVETEPVSDETETEVDVNREEEEEEE---EEKTTD 600

Query: 601 EGIRSDTQNDDEMMGPEAEDQSKERDTPPPE------PERKTQTESPVLPDCLLLMMYEP 660
           EGI  D +N D ++GPE EDQSKE +TPPPE      PERKTQTE+ VLPDCLLLMMYEP
Sbjct: 601 EGIGPDDEN-DVLVGPEEEDQSKEGETPPPEPESEPKPERKTQTETSVLPDCLLLMMYEP 660

Query: 661 KLSMEVSKETWVCSTDFIRCVPTREKKPAGRNPPPPPPPKKREKKPADNTQTAVIQPARW 720
           KLSMEVSKETWVCS DFIRCVPTREKK  G++PPPPPPPKKRE KP D TQTAV+QPARW
Sbjct: 661 KLSMEVSKETWVCSADFIRCVPTREKKAIGKDPPPPPPPKKRETKPTDTTQTAVVQPARW 720

Query: 721 SCSFPAAAAAATMIEQKKLVRAKGYEPFVLTRCKSEPMRSSSNLAPDACFWKDRKLEPHR 735
           SCSFPAAAAAA MIEQ KLVRAKGYEPFVLTRCKSEPMRSS+ LAPDAC WKDRKLEPHR
Sbjct: 721 SCSFPAAAAAAAMIEQ-KLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACCWKDRKLEPHR 780
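
The percentages on an HSP statistics line can be recomputed from the reported counts. This is a minimal sketch in Python with a hypothetical helper name. It assumes the line layout used in this listing and accepts either spelling of the "Positives" label.

```python
import re

# Layout assumed from the statistics lines in this listing.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract identity/positive counts and recompute their percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": round(100 * int(ident) / int(ident_len), 2),
        "positives": round(100 * int(pos) / int(pos_len), 2),
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }


# Statistics line of the first TrEMBL hit (A0A0A0L789_CUCSA).
stats = parse_hsp_stats(
    "Identity = 566/793 (71.37%), Positives = 630/793 (79.45%), Query Frame = 1"
)
print(stats)
```

The denominator is the alignment length including gap columns, so it can be larger than the query protein.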

BLAST of Cp4.1LG14g03740 vs. TrEMBL
Match: B9I1E3_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0011s09990g PE=4 SV=2)

HSP 1 Score: 485.3 bits (1248), Expect = 1.3e-133
Identity = 357/793 (45.02%), Positives = 477/793 (60.15%), Query Frame = 1

Query: 4   DRHFRTTSSNS--TSSAAAGSSELFICFTSRFSSSSSSAMKISSKCLLSPGRAREG-QII 63
           DR  R+ S+NS  TS+  + +SELFICFTSR SSSS   MK+SSK +LSPGR R+  QI 
Sbjct: 11  DRPHRSNSNNSSSTSNNNSNTSELFICFTSRLSSSS---MKLSSKSILSPGRHRDSSQIS 70

Query: 64  LSTSLSRRLKSSGSLKVGQASPAFPTGGKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGR 123
           LS SLSRRL+SSGS+K GQASP FPT GKKRGCAF+NPEPSSPKVTCIGQVRVKTKKQG+
Sbjct: 71  LSNSLSRRLRSSGSMKGGQASPMFPTNGKKRGCAFENPEPSSPKVTCIGQVRVKTKKQGK 130

Query: 124 KMRARSLKRRTNSEASFRKSESVVQSSQMNGNDQQFVANQSSRPNLLRQDSMSNGGNGFQ 183
           K+R RS +R    E SFR+ +    + + + N    + NQ      L Q          Q
Sbjct: 131 KLRTRSKRR---GEISFRRVDQNSNTFEGSNNHHDLINNQ-----FLNQQQQ-------Q 190

Query: 184 QERHSHRSQRWVHLPFTICEALRAFGAELNCFLPCHSSCSSDRESNKESKPAGRSEEETE 243
           QE  SHR+QRWVH P TICEALRAFGAE NCFLPC SSC +  +  +E+  A  S     
Sbjct: 191 QEGLSHRNQRWVHFPVTICEALRAFGAEFNCFLPCRSSCMASEKEKEENTAAAGSNNNGS 250

Query: 244 SSCGKVFARWLVAVQDRDGKGRKIELVVGDE--EPQTEKENGSQRRHILDGMNFKDE--- 303
           SSCG VFARWLVAVQ+ +GKG++IELVVG+E  E + ++   S RRHI + + FK+E   
Sbjct: 251 SSCGAVFARWLVAVQEGEGKGKEIELVVGEEVVEEERDERRRSYRRHIFEDIEFKEEEGH 310

Query: 304 -----NEVVEKEESRISLCIPPKNALLLMRCRSDPVKVAELAKRFCEPPAPQLEEHDKEE 363
                N  +++EE+R+S+CIPPKNALLLMRCRSDPVK+A LA +F E PAPQ EE ++E+
Sbjct: 311 VFEGGNAGLQEEEARVSICIPPKNALLLMRCRSDPVKMAALANKFWESPAPQDEEDEEED 370

Query: 364 LDEDHEEKKT-------------TQNEAKRDESVPVSKE---DDEEERTVKLNLK----L 423
            +E  +++               ++ +A ++E + V +E   + +++ TV   L     +
Sbjct: 371 NEEGEKDRNLGAEVDKFINIENKSEVKASQEEEIKVEQEIIIEQKQDLTVSDKLAFCETI 430

Query: 424 ENEEEI---NEESVSDTERGEESTEM-ATENEIDEQKSNISMINHQSQEETAEDRIDQDN 483
           E   +I    EES+   E GE+S E+ +T++ ID     ++++    QEE   +     N
Sbjct: 431 EEHYQIIQETEESLVILEAGEDSQEIGSTDDNIDGVLQEVNLVK---QEEEESETPGVMN 490

Query: 484 QQETMAISRTIPI--PIQSHCESEFAQDAENLESAEEDESKREQDNRTEQREAFEEDEN- 543
            Q T +   T+ +     S  + E    A  + +  E +  +E +   ++   F+ ++  
Sbjct: 491 LQPTSSTQETVSLCSDESSSHDQEIVDPAALMNNENEYKVVQENEEDNQEERVFQAEQEQ 550

Query: 544 -----GENPTSASLSVETEPVLDETGTEFDENWEEVTETTTAIKEKATDEGIRSDTQNDD 603
                 ++    S+SV  E    +   +  ++ E  + +   ++ + T+E  +  T+N+ 
Sbjct: 551 VVQGLSDDIEENSVSVRFEQETLQVAVQDLQDQEPESLSVAELQVQETEEE-KETTENET 610

Query: 604 EMMGPEAEDQSKERDTPPPEPERKTQTESPVLPDCLLLMMYEPKLSMEVSKETWVCSTDF 663
           E+   E ED     +       R+     P+LPDCLLLMM EPKLSMEVSKETWVCSTDF
Sbjct: 611 ELAEEEPEDPKTHVNGQTGVKSREGDNSQPLLPDCLLLMMCEPKLSMEVSKETWVCSTDF 670

Query: 664 IRCVPTREKKPAGRNPPPPPPPKKR---EKKPAD-----NTQTAVIQPARWSCSFPAAAA 723
           IR +P   +  +  N    P  KKR   + KPA      N   ++ QP R SCS+PA   
Sbjct: 671 IRWLPEHSRPVSKTNGKDEP--KKRVSIDIKPAQVYNNGNNSNSLQQPRRSSCSYPAKPP 730

Query: 724 A--------ATMIEQKKLVRAKGYEPFVLTRCKSEPMRSSSNLAPDACFWKDRKLEPHRP 735
           A        +TMIEQK LV AK YEPFVLTRCKSEPMRS+S LAP+ACFWK+RKLEPHRP
Sbjct: 731 ARCAGTESMSTMIEQK-LVGAKAYEPFVLTRCKSEPMRSASKLAPEACFWKNRKLEPHRP 778

BLAST of Cp4.1LG14g03740 vs. TrEMBL
Match: A0A061F8W3_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_031658 PE=4 SV=1)

HSP 1 Score: 483.8 bits (1244), Expect = 3.7e-133
Identity = 363/794 (45.72%), Positives = 478/794 (60.20%), Query Frame = 1

Query: 1   MDSDRHFRTTSSNSTSSAA---AGSSELFICFTSRFSSSSSSAMKISSKCLLSPGRAREG 60
           MD +R  R+TS N++SS++   + +SELFICFTSR SSSS   MK+SSK +LSPGR RE 
Sbjct: 1   MDPERPHRSTSINNSSSSSNTTSTTSELFICFTSRLSSSS---MKLSSKSILSPGRTRES 60

Query: 61  -QIILSTSLSRRLKSSGSLKVGQASPAFPTGGKKRGCAFDNPEPSSPKVTCIGQVRVKTK 120
            QI LS+SLSRRLKS+GS+K GQASP FPT GKKRGCAF+NPEPSSPKVTCIGQVRVKTK
Sbjct: 61  SQISLSSSLSRRLKSNGSMKGGQASPMFPTNGKKRGCAFENPEPSSPKVTCIGQVRVKTK 120

Query: 121 KQGRKMRARSLKRRTNSEASFRKSESVVQSSQMNGNDQQFVANQSSRPNLLRQDSMSNGG 180
           KQG+K +A   KRR   E SFRK +        N N+     + SS  +      +SN  
Sbjct: 121 KQGKKFKACRSKRR--GEVSFRKVD------HNNANNGSNSLDTSSCQDYNMGHFLSNNN 180

Query: 181 NGFQQERHSHRSQRWVHLPFTICEALRAFGAELNCFLPCHSSCSSDRESNKESK--PAGR 240
           +  QQ++     ++WVHLP TICEALRAFGAE NCFLPC SSC +++   +E      G 
Sbjct: 181 HHHQQQQQQE-CKKWVHLPLTICEALRAFGAEFNCFLPCRSSCMANQRDKEERTGGSGGS 240

Query: 241 SEEETESSCGKVFARWLVAVQDRDGKGRKIELVVGDEEPQ----TEKENGSQRRHILDGM 300
           +     SSCG VFARWLVAVQ+ +GK R+IELVVG E+ +    +E    SQRRH+ + +
Sbjct: 241 NGNGNGSSCGAVFARWLVAVQEGEGKEREIELVVGGEDDERRESSEMMRSSQRRHVFEDI 300

Query: 301 NFKD-ENEVVEKEESRISLCIPPKNALLLMRCRSDPVKVAELAKRFCEPPAPQLEEHDKE 360
              D  NE V  EE+R+S+CIPPKNALLLMRCRSDPVK+A LA +F E P P+ EE ++E
Sbjct: 301 EINDCGNENVGDEEARVSICIPPKNALLLMRCRSDPVKMAALANKFWETPVPKDEEEEEE 360

Query: 361 ELDEDH--EEKKTTQNEAKRDESVPVSKEDDEEERTVKLNLKLENEE------------- 420
           E +E+   E K   + E + +E+     E + E R VK   ++E++E             
Sbjct: 361 EEEEEEGAENKSEEKEEEEEEENQRDVVEGEREGRRVKFEQEMEHQEVSEVSQMFVSCEA 420

Query: 421 -------EINEESVSDTERGEESTEMATENEIDEQKSNISMINH---QSQEETAEDRIDQ 480
                  E   E+V++TE   ES  +  E E+ E+    S+      + Q++  E+ +++
Sbjct: 421 TEEQEIPEAEAEAVAETEA--ESVFVGDEAELVEETLERSLKEETIIECQDQEQENEVEE 480

Query: 481 DNQQETMAISRTIPIPIQSHCESEFAQDAENLE-SAEEDESKREQDNRTEQREAFEEDEN 540
           D Q  T        +P+  H E    Q  EN++ S +E+E   E + + E+ EA EE+  
Sbjct: 481 DQQASTTNEEFLSEVPL--HLEK--LQREENVQGSDQENEDGLEGEQQEEEVEAEEENVL 540

Query: 541 G-------ENPTSASLSVETEPVLDETGTEFDENWEEVTETTTAIKEKATDEGIRSDTQN 600
           G       EN       VE + + +E   E         E ++ ++EK  +      TQ 
Sbjct: 541 GKVEEECEENENEGGEEVEDQAIAEEAEEE---------EESSTVEEKEAET-----TQE 600

Query: 601 DDEMMGPEAEDQSKERDTPPPEPERKTQTESPVLPDCLLLMMYEPKLSMEVSKETWVCST 660
             E+   EA      R+  P +  ++++++  +LPDCLLLMM EPKLSMEVSKETWVCST
Sbjct: 601 RSELQCLEA------REPDPGDESKESESQQNLLPDCLLLMMCEPKLSMEVSKETWVCST 660

Query: 661 DFIRCVPTREKKPAGRNPPPPPPPKKR---EKKPADNTQTAVIQPARWSCSFPAA----- 720
           DFIR VP ++K+PA +       PK+R   + KPA      ++QP R SCSFPAA     
Sbjct: 661 DFIRWVPEKKKQPAVKQKDGGDEPKRRLCIDSKPAP----MLLQPPRSSCSFPAAPPMAK 720

Query: 721 --------AAAATMIEQKKLVRAKGYEPFVLTRCKSEPMRSSSNLAPDACFWKDRKLEPH 735
                    + ATMIEQK +  +KGYEPFVLTRCKSEPMRSS+ L+PDACFWK+RKLE  
Sbjct: 721 AANGAGGGGSMATMIEQKLVGGSKGYEPFVLTRCKSEPMRSSAKLSPDACFWKNRKLE-- 749

BLAST of Cp4.1LG14g03740 vs. TrEMBL
Match: M5X0H9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001903mg PE=4 SV=1)

HSP 1 Score: 459.1 bits (1180), Expect = 9.8e-126
Identity = 360/792 (45.45%), Positives = 462/792 (58.33%), Query Frame = 1

Query: 1   MDSDRHFRTTSSNSTS-SAAAGSSELFICFTSRFSSSSSSAMKISSKCLLSPGRARE-GQ 60
           M+SDR  RT S++S S + +  +SELFICFT+  S  SSS+MK+SSK +LSPGRARE  Q
Sbjct: 1   MESDRPHRTKSNSSISGTTSTTTSELFICFTT--SRLSSSSMKLSSKSILSPGRAREPSQ 60

Query: 61  IILSTSLSRRLKSSGSLKVGQASPAFPTGG---KKRGCAFDNPEPSSPKVTCIGQVRVKT 120
           I LS+SLSRRL++SGS+K GQASP FP+ G   KKRGCAF+NPEPSSPKVTCIGQVRVKT
Sbjct: 61  ISLSSSLSRRLRTSGSIKGGQASPMFPSNGGTSKKRGCAFENPEPSSPKVTCIGQVRVKT 120

Query: 121 KKQGRKMRARSLKRRTN-SEASFRKSESVVQSSQMNGNDQQFVANQSSRPNLLR----QD 180
           KKQG+KMR  S  +R+  SEASFRK E   QS+    +  Q + N+ +  N  +    Q 
Sbjct: 121 KKQGKKMRIISRSKRSRGSEASFRKPEQNQQSTNNTASQSQELYNRDNSSNNFQGLHFQS 180

Query: 181 SMSNGGNGFQQERHSHRSQRWVHLPFTICEALRAFGAELNCFLPCHSSC-SSDRESNKES 240
              N  N  QQE   HR+QRWVHLP TICEALRAFG+E NC +P  SSC +SD  +NKE 
Sbjct: 181 HQINNNN--QQECLRHRNQRWVHLPLTICEALRAFGSEFNCLIPNRSSCLASDDNNNKEK 240

Query: 241 KP-AGRSEEETESSCGKVFARWLVAVQDRDGKGRKIELVVGDEEPQTEKENGS-----QR 300
           +   G   E   SSCG VFARW VA+QD DGKGR+IEL+VG+++ +TE+   S     QR
Sbjct: 241 EENKGVRSESGGSSCGAVFARWFVALQDGDGKGREIELMVGEDQERTERSTNSSSGHSQR 300

Query: 301 RHILDGMNFKDE--NEVVEKEESR--ISLCIPPKNALLLMRCRSDPVKVAELAKRFCE-P 360
           R + +G+ FK+E  NE V +EE    +S+C+PPKNALLLMRCRSDPVK+A LA RF E P
Sbjct: 301 RQVFEGIEFKEERLNESVMEEEEAGGVSICVPPKNALLLMRCRSDPVKMAALANRFWEMP 360

Query: 361 PAPQLEEHDKEELDEDHE-------------------------EKKTTQNEAKRDESVPV 420
            APQ EE + EE  ED                           E +  + +   ++ V  
Sbjct: 361 AAPQDEEVEDEEEKEDKGLTEKAQDFVEEQGTDEVLEKVQNGLETEVAEGDGVCEKWVCD 420

Query: 421 SKEDDEEERTVKLNLKLENEEEINEESVSDTERGEESTEMATENEIDEQKSNISMINHQS 480
            +E ++ E   KL L     EE  +E     E  E+  ++  E E  E+K+         
Sbjct: 421 GEEHEDLEEVEKLVL-----EEKEDEKEGLDENPEKRQQLYDEVEEIEEKAECQQEAELE 480

Query: 481 QEETAEDRIDQDNQQETMAISRTIPIPIQSHCESEFAQDAENLESAEEDESKREQDNRTE 540
           ++E  E  + Q    E   +   +  P       EF ++    E+ E+++ +RE++   E
Sbjct: 481 EQEEQELDVTQQALSEECCVLDVVADPEML----EFEENEHECEATEQEQEQREEEKEEE 540

Query: 541 QREAFEEDENGENPTSASLSVETEPVLDETGTEF---DENWEEVTETTTAIKEKATDEGI 600
            RE        + P  ++  V++E + +E  TE    DE+ EE TET T  + +      
Sbjct: 541 VREV-------KLPIPSNECVKSEELEEEEKTEAEVADESTEEETETVTQYRPE------ 600

Query: 601 RSDTQNDDEMMGPEAEDQSKERDTPPPEPERKTQTESPVLPDCLLLMMYEPKLSMEVSKE 660
                       P +E+   + D+       K   ++ VLPDCLLLMM EPKLSMEVSKE
Sbjct: 601 ------------PVSENPKNQLDSGS-----KRAVQNSVLPDCLLLMMCEPKLSMEVSKE 660

Query: 661 TWVCSTDFIRCVPTREKKPAGRNPPPPPPPKKR---EKKPADNTQTA-VIQPARWSCSFP 720
           TWVC+TDFIRC+P R  K        P   KKR   +  PA       VIQP R SCSFP
Sbjct: 661 TWVCTTDFIRCLPERHVKKVDA----PDEAKKRVNIDSNPAAAPAAQPVIQPPRSSCSFP 720

Query: 721 AAA---AAATMIEQKKLVRAKGYEPFVLTRCKSEPMRSSSNL-APDACFWKDRKLEPHRP 735
             A   + ATMI QK LV +  YEPFVLTRCKSEPMRS+  L A + CFWK+RK+EPHR 
Sbjct: 721 VQAGPVSMATMIGQK-LVGSTAYEPFVLTRCKSEPMRSAGKLPAAETCFWKNRKMEPHRR 744

BLAST of Cp4.1LG14g03740 vs. TrEMBL
Match: A0A067LF51_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_15297 PE=4 SV=1)

HSP 1 Score: 455.7 bits (1171), Expect = 1.1e-124
Identity = 341/779 (43.77%), Positives = 460/779 (59.05%), Query Frame = 1

Query: 1   MDSDRHFRTTSSNSTSSAAAGSSELFICFTSRFSSSSSSAMKISSKCLLSPGRAREG-QI 60
           MD+DR  R+ SSN++SS+ + +SELFICFTSR SSSSS  MKISSK +LSPGRARE  QI
Sbjct: 1   MDTDRPHRSNSSNNSSSSNSSTSELFICFTSRLSSSSS--MKISSKSILSPGRARESSQI 60

Query: 61  ILSTSLSRRLKSSGSLKVGQASPAFPTG-GKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQ 120
            LS SLSRRL+++GS+K GQASP FPT  GKKRGC F+NPEPSSPKVTCIGQVRVKTKKQ
Sbjct: 61  SLSNSLSRRLRTNGSMKGGQASPMFPTSSGKKRGCTFENPEPSSPKVTCIGQVRVKTKKQ 120

Query: 121 GRKMRARSLKRRTNS--EASFRKSESVVQSSQMNGNDQQFVANQSSRPNLLRQDSMSNGG 180
             K + RS +R ++   E SFR+ +      Q + N+  F AN ++R        ++N  
Sbjct: 121 SHKFKTRSQRRASSGGGEVSFRRVD------QNSTNNNSFDANSNNR-------DLNN-- 180

Query: 181 NGFQQERHSHRSQRWVHLPFTICEALRAFGAELNCFLPCHSSCSSDRESNKESKPAGRSE 240
                E   HR+QRWV+LP TICEALR F    NCFLPC SSC ++ +  +E   A   +
Sbjct: 181 ----PECLPHRNQRWVNLPLTICEALREF----NCFLPCRSSCMANDKGKEEKTAAAAGD 240

Query: 241 EETE-SSCGKVFARWLVAVQDRDGKGRKIELVVGDEEP-------------QTEKENG-- 300
             +  SSC  VFARWLVAVQ+ DGKGR+IELVVG+EE              + E+E    
Sbjct: 241 GSSNGSSCAAVFARWLVAVQEGDGKGREIELVVGEEEEDGKGREIELVVGEEEEEEEDEE 300

Query: 301 -------SQRRHILDGMNFKDENEVVEKEESRISLCIPPKNALLLMRCRSDPVKVAELAK 360
                  S RRHI + + FK+EN  +++EE+R+S+CIPPKNALLLMRCRSDPVK+A LA 
Sbjct: 301 ESMGRRRSYRRHIFEEIEFKEENASLQEEEARVSICIPPKNALLLMRCRSDPVKMAALAN 360

Query: 361 RFCEPPAPQLEEHDKEELDEDHEEKKTTQNEAKRDESVPVSKEDDEEERTVKLNLKLENE 420
           +F E P    +E ++ E D    EK+   N+ +  +     +ED   E+ V      EN+
Sbjct: 361 KFWEAPLLANDEEEEGEEDCGKSEKQEMHNDVREKKH---KEEDLMSEKLVPCETIQENK 420

Query: 421 ----EEINEESVSDTERGEESTEMATENEIDEQKSNISMINHQSQEETAEDRIDQDNQQE 480
               E + E+ VS+T R  E T+     EI+E+K    ++ H  + E  E+ ++++   E
Sbjct: 421 TQESEVLEEQDVSETRR--EDTQ-----EIEERK----LVKHAEETEKQENMLEENEDLE 480

Query: 481 TMAISRTIPIPIQSHCESEFAQDAENLESAEEDESKREQDNRTEQREAFEEDENGENPTS 540
                                 + EN E   ++  + ++D+  +Q +  +E +  +    
Sbjct: 481 NSI------------------NNQENEEKLIQESKESQEDSLIDQEDYKQERDFSDGDIQ 540

Query: 541 ASLSVETEPVLDETGTEFDENWEEVTETTTAIKEKATDEGIRSDTQNDDEMMGPE-AEDQ 600
           +SL +  +P  +  G + ++   +  E      ++  +E    +T+  +E +  E +E +
Sbjct: 541 SSLHMSVQPEEEVVGHDIEDQESKPEELIIQESKQEEEEEEEEETELTEETVTHERSESE 600

Query: 601 SKERDTPPPEPERKTQTESPVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTR--- 660
             +      + + K +   P+LPDCLLLMM EPKLSMEVSKETWVCSTDFIR +P     
Sbjct: 601 DPKTQEGQMDIKSKERESQPMLPDCLLLMMCEPKLSMEVSKETWVCSTDFIRWLPEHSRP 660

Query: 661 -EKKPAGRNPPPPPPPKKR---EKKPADNTQTAVIQPARWSCSFPA-----AAAAATMIE 720
            +KK  G  P      K+R   +  P     + + QP R SCS+PA     AA A +M  
Sbjct: 661 VKKKDGGDEP------KRRVSIDINPPSMHGSKLQQPPRSSCSYPAKPPPRAAGAESMST 716

Query: 721 --QKKLVRAKGYEPFVLTRCKSEPMRSSSNLAPDACFWKDRKLEPHRPATFGVGAADVG 734
             +KKLV  KGYEPFVLTRCKSEPMRS+S LAPDACFWK+RKLEPHRPAT G+ AA VG
Sbjct: 721 AIEKKLVGTKGYEPFVLTRCKSEPMRSASKLAPDACFWKNRKLEPHRPATLGISAAGVG 716

BLAST of Cp4.1LG14g03740 vs. TAIR10
Match: AT3G15095.1 (AT3G15095.1 unknown protein)

HSP 1 Score: 338.2 bits (866), Expect = 1.3e-92
Identity = 305/785 (38.85%), Positives = 417/785 (53.12%), Query Frame = 1

Query: 2   DSDRHFRTTSSNSTSSAAAGSS-ELFICFTSRFSSSSSSAMKISSKCLLSPGRAREGQII 61
           +++R  R++S NS+S+  +GSS +LFICFTSRFSSSSS  M++SSK + SP R+      
Sbjct: 3   ETERPHRSSSINSSSNNNSGSSTDLFICFTSRFSSSSS--MRLSSKSIHSPARSA----C 62

Query: 62  LSTSLSRRLKSSGSLKVGQA----SPAF-PTGGKKR-GCAFDNP--------EPSSPKVT 121
           L+TSLSRRL++SGSLK   A    SP F   GG+KR G  ++N         EPSSPKVT
Sbjct: 63  LTTSLSRRLRTSGSLKNASAGVLNSPMFGANGGRKRSGSGYENSNNNNNNNIEPSSPKVT 122

Query: 122 CIGQVRVKTKKQ-GRKMRARSLKRRTNSEASFRKSESVVQSSQMNGNDQQFVANQSSRPN 181
           CIGQVRVKT+K   +KMRARS  RR   E SFR+S        ++ ND            
Sbjct: 123 CIGQVRVKTRKHVKKKMRARS--RRKGGENSFRRS--------VDQND------------ 182

Query: 182 LLRQDSMSNGGNGFQQERHSHRSQRWVHLPFTICEALRAFGAELNCFLPCHSSCSSDR-- 241
                    GG G    R      R VHLP TICE+LR+FG+ELNCF PC SSC+ +   
Sbjct: 183 ---------GGGGC---RFKASENRLVHLPVTICESLRSFGSELNCFFPCRSSCTENSHG 242

Query: 242 -----ESNKESKPAGRSEEETESSCGKVFARWLVAVQDRDG-KGRKIELVVGDEEPQTEK 301
                ESN +    G       +SCG VF RW VAV++  G K R+IELVVG E+   E 
Sbjct: 243 DGRRAESNNDGCGGGGGGS---NSCGAVFTRWFVAVEETSGGKRREIELVVGGEDEVEED 302

Query: 302 ENGSQRRHILDGMNFKDENEVVEKEE-----SRISLCIPPKNALLLMRCRSDPVKVAELA 361
              S+RRH+ +G++  +     EK+E      R+S+C PPKNALLLMRCRSDPVKVA LA
Sbjct: 303 RRRSRRRHVFEGLDLSEIEMKTEKKERGEEVGRMSICSPPKNALLLMRCRSDPVKVAALA 362

Query: 362 KRFCEPPAPQLEEHDKEELDEDHEEKKTTQNEAKRDESVPVSKEDDEEERTVKLNLKLEN 421
            R                      E++ + N+        V  E++E+ER  +  L++E+
Sbjct: 363 NRV--------------------RERQLSLNDG-------VYTEEEEDERRRRFELEIED 422

Query: 422 EEEIN--EESVSDTERGEESTEMATENEIDEQKSNISMINHQSQEETAEDRIDQDNQQET 481
           ++ I+  E+ +S    GE      T  E +E    ++    +++ E          ++E 
Sbjct: 423 KKRIDLCEKWIS----GE------TTVETEEVSVAVAEAEAEAEAEAPLPSNPATEEEER 482

Query: 482 MAISRTIPIPIQSHCESEFAQDAENLESAEED-ESKREQDNRTEQREAFEEDENGENPTS 541
           + +       ++     E  + ++ L+S EE+ E+   +    E R A EE+E       
Sbjct: 483 VKV-------VEDSIVEEEQEASKILDSFEEEIEATIMKKIEDEIRNAIEEEEK------ 542

Query: 542 ASLSVETEPVLDETGTEFDENWEEVTETTTAIKEKATDEGIRSDTQNDDEMMGPEAEDQS 601
               +E   V+    TE  E  +EV        E+ +++G R    + + +M    ++++
Sbjct: 543 -LAEMEELAVVAVAETEEVEESKEVVPDCIPQNEERSEQGNREPDPSPEVVMRRSLQEET 602

Query: 602 KERDTPPPEPERKTQTESPVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKP 661
            E+       E+ T T   VLPDCLLLMM EPKLSMEVSKETWVCSTDF+RC+P R   P
Sbjct: 603 TEK-------EKTTATPYKVLPDCLLLMMCEPKLSMEVSKETWVCSTDFVRCLPGRP--P 662

Query: 662 AGRNPPPPPPPKKREKKPADNTQTAV-----------------IQPARWSCSFPAAA--- 721
           A + PP          +P     TAV                 +QP R SCS+PAA    
Sbjct: 663 AKKIPPEAVGDNHHHHQPKKRIVTAVDSNASSRRRSIDRPPLHLQPPRSSCSYPAAPPII 684

Query: 722 AAATMIEQKKLVRAKGYEPFVLTRCKSEPMRSSSNLAPDACFWKDRKLEPHRPATFGVGA 735
            AA  + ++++  A   +P VL RCKSEP +S+S LAP+ACFWK+RKLEPH PAT GVG 
Sbjct: 723 TAAAAVGEQRVAGANKVQPPVLPRCKSEPRKSASKLAPEACFWKNRKLEPHPPATVGVGG 684

BLAST of Cp4.1LG14g03740 vs. NCBI nr
Match: gi|659111998|ref|XP_008456014.1| (PREDICTED: glutamic acid-rich protein isoform X1 [Cucumis melo])

HSP 1 Score: 959.1 bits (2478), Expect = 4.3e-276
Identity = 565/792 (71.34%), Positives = 628/792 (79.29%), Query Frame = 1

Query: 1   MDSDRHFRTTSSNSTSSAAAGSSELFICFTSRFSSSSSSAMKISSKCLLSPGRARE-GQI 60
           MD DRHFRTTS+NSTSS A  SSELFICFTSRFSSSSS  MKISSK +LSPGR RE  QI
Sbjct: 1   MDPDRHFRTTSTNSTSSTATPSSELFICFTSRFSSSSS--MKISSKSILSPGRHREPSQI 60

Query: 61  ILSTSLSRRLKSSGSLKVGQASPAFPTGGKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120
            LSTSLSRRLKSSGSLK GQASP FPTG KKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG
Sbjct: 61  SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120

Query: 121 RKMRARSLKRRTNSEASFRKSESVVQSSQMNGNDQQFVANQSSRPNLLRQDSMSNGGNGF 180
           +KMRARS KRRTNSEASFR+SESVVQSSQ+N NDQQF ++ +   +LLRQ+S SN GNGF
Sbjct: 121 KKMRARSQKRRTNSEASFRRSESVVQSSQVNSNDQQFSSHHNH--HLLRQNSNSNAGNGF 180

Query: 181 QQERHSHRSQRWVHLPFTICEALRAFGAELNCFLPCHSSCSSDRESNKESKPAGRSEEET 240
           QQE  SHR+QRWVHLPFTICEALRAFGAELNCFLPCHSSCS +RE+NKE KPA RS E +
Sbjct: 181 QQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKEPKPAERSSE-S 240

Query: 241 ESSCGKVFARWLVAVQDRDGKGRKIELVVGDEEPQTEKENGSQRRHILDGMNFKDENEVV 300
           ESSCG VFARWLVAVQD DGKGR+IELVVGDEE +TEKENGSQRRH+ +G++FKD+NE V
Sbjct: 241 ESSCGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDKNEAV 300

Query: 301 EKEESRISLCIPPKNALLLMRCRSDPVKVAELAKRFCEPPAPQLEEHDKEELDEDHEEKK 360
           E+EESRIS+CIPPKNALLLMRCRSDPVK+AELAKRFCEPPAP+++E D+EE +++  E K
Sbjct: 301 EEEESRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEEDEEEGEDEDNEAK 360

Query: 361 TTQNEAKRDESVPVS------KEDDEEERTVK---------LNLKLENEEEINEESVSDT 420
             +NE KRD SVPVS      KE++EEE   K           +KLENEEE+NEESVSD 
Sbjct: 361 KRKNEVKRDVSVPVSSIITVNKEEEEEEEEEKEEDERKVEQFVVKLENEEEVNEESVSDE 420

Query: 421 ER-------------------GEESTEMATENEIDEQKSNISMIN----HQSQEETAEDR 480
           ++                    EE+ EMATEN+ DEQK +I+++N     Q+ EE  ED+
Sbjct: 421 DKEKEEANLVLQEEQREEKDNEEETIEMATEND-DEQKQDITVVNQLNQEQALEEKEEDK 480

Query: 481 IDQDNQQETMAISRTIPIPIQSHCESEFAQDAENLESAEEDESK----REQDNRTEQREA 540
            DQ NQQETMA    IPIPIQ+HCE E AQDAE LES E++ESK     EQD +TE+ E 
Sbjct: 481 TDQVNQQETMA----IPIPIQTHCEPEMAQDAEKLESVEKEESKLSHESEQDQKTEEDEI 540

Query: 541 F-----------EEDENGENPTSASLSVETEPVLDETGTEFDENWEEVTETTTAIKEKAT 600
                       EE ENGENPTS SLSVET+PVLDET TE D   EE  E     +EKAT
Sbjct: 541 LREEKEEEEEEEEEGENGENPTSPSLSVETKPVLDETETEVDGKREEEEEEEEE-EEKAT 600

Query: 601 DEGIRSDTQNDDEMMGPEAEDQSKERDTPP----PEPERKTQTESPVLPDCLLLMMYEPK 660
           DEGI  D +N+  ++GPE EDQSKER+TPP    PEPE KTQTE+ VLPDCLLLMMYEPK
Sbjct: 601 DEGIGPDDENNGALVGPEEEDQSKERETPPPEPEPEPEGKTQTETSVLPDCLLLMMYEPK 660

Query: 661 LSMEVSKETWVCSTDFIRCVPTREKKPAGRNPPPPPPPKKREKKPADNTQTAVIQPARWS 720
           LSMEVSKETWVCS DFIRCVPTREKK  GR+PPPPPPPKKRE KP D  QT V+QPARWS
Sbjct: 661 LSMEVSKETWVCSADFIRCVPTREKKTVGRDPPPPPPPKKRETKPTDTMQTTVVQPARWS 720

Query: 721 CSFPAAAAAATMIEQKKLVRAKGYEPFVLTRCKSEPMRSSSNLAPDACFWKDRKLEPHRP 735
           CSFPAAAAAA MIEQ KL RAKGYEPFVLTRCKSEPMRSS+ LAPDACFWKDRKLEPHRP
Sbjct: 721 CSFPAAAAAAAMIEQ-KLARAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRP 780

BLAST of Cp4.1LG14g03740 vs. NCBI nr
Match: gi|778680362|ref|XP_004146243.2| (PREDICTED: glutamic acid-rich protein [Cucumis sativus])

HSP 1 Score: 957.2 bits (2473), Expect = 1.6e-275
Identity = 566/793 (71.37%), Positives = 630/793 (79.45%), Query Frame = 1

Query: 1   MDSDRHFRTTSSNSTSSAAAGSSELFICFTSRFSSSSSSAMKISSKCLLSPGRARE-GQI 60
           MDSD HFRTTS+NSTSS A  SSELFICFTSRFSSSSSS+MKISSK +LSPGR RE  QI
Sbjct: 1   MDSDPHFRTTSTNSTSSTATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPSQI 60

Query: 61  ILSTSLSRRLKSSGSLKVGQASPAFPTGGKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120
            LSTSLSRRLKSSGSLK GQASP FPTG KKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG
Sbjct: 61  SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120

Query: 121 RKMRARSLKRRTNSEASFRKSESVVQSSQMNGNDQQFVANQSSRPNLLRQDSMSNGGNGF 180
           +KMRARS KRRTNSEASFR+SES+VQSSQ NG+DQQF ++ +   +LLRQ+S SN GNGF
Sbjct: 121 KKMRARSQKRRTNSEASFRRSESLVQSSQGNGSDQQFSSHHNH--HLLRQNSNSNAGNGF 180

Query: 181 QQERHSHRSQRWVHLPFTICEALRAFGAELNCFLPCHSSCSSDRESNKESKPAGRSEEET 240
           QQE  SHR+QRWVHLPFTICEALRAFGAELNCFLPCHSSCS +RE+NKESKPA RS E +
Sbjct: 181 QQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKESKPAERSSE-S 240

Query: 241 ESSCGKVFARWLVAVQDRDGKGRKIELVVGDEEPQTEKENGSQRRHILDGMNFKDENEVV 300
           ESSCG VFARWLVAVQD DGKGR+IELVVGDEE +TEKENGSQRRH+ +G++FKD+NE V
Sbjct: 241 ESSCGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDKNEAV 300

Query: 301 EKEESRISLCIPPKNALLLMRCRSDPVKVAELAKRFCEPPAPQLEEHDKEELDEDHEEKK 360
           E+EESRIS+CIPPKNALLLMRCRSDPVK+AELAKRFCEPPAP+++E D+E  DED+E KK
Sbjct: 301 EEEESRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEEDEEGEDEDNEAKK 360

Query: 361 TTQNEAKRDESVPVS------------KEDDEEERTVKLNLKLENEEEINEESVSDTER- 420
             QNE KRD SVPVS            KE+++E +  +L +KLENEEE+NEE VSD ++ 
Sbjct: 361 -RQNEVKRDVSVPVSSIVTVNKEEEEVKEEEDERKVEQLIVKLENEEEMNEECVSDADKE 420

Query: 421 -------------------GEESTEMATENEIDEQKSNISMIN----HQSQEETAEDRID 480
                               EE+ EMATENEIDEQK +I+++N     Q+ EE  ED+ D
Sbjct: 421 KEEANLVLQEEEREEEEDNEEETIEMATENEIDEQK-DITVVNQLNQEQALEEKEEDKTD 480

Query: 481 QDNQQETMAISRTIPIPIQSHCESEFAQDAENLESAEEDESK----REQDNRTEQREAFE 540
           Q NQQETMAI   IP+ IQ+HCE E AQD E LES E++E K     EQD +TE+ E   
Sbjct: 481 QVNQQETMAI--PIPLLIQTHCEPEMAQDVEKLESVEKEEPKLSHESEQDQKTEEDENLR 540

Query: 541 ED------------ENGENPTSASLSVETEPVLDETGTEFDENWEEVTETTTAIKEKATD 600
           ED            ENGE  TS SLSVETEPV DET TE D N EE  E     +EK TD
Sbjct: 541 EDKEEEEEEEGENGENGETTTSPSLSVETEPVSDETETEVDVNREEEEEEE---EEKTTD 600

Query: 601 EGIRSDTQNDDEMMGPEAEDQSKERDTPPPE------PERKTQTESPVLPDCLLLMMYEP 660
           EGI  D +N D ++GPE EDQSKE +TPPPE      PERKTQTE+ VLPDCLLLMMYEP
Sbjct: 601 EGIGPDDEN-DVLVGPEEEDQSKEGETPPPEPESEPKPERKTQTETSVLPDCLLLMMYEP 660

Query: 661 KLSMEVSKETWVCSTDFIRCVPTREKKPAGRNPPPPPPPKKREKKPADNTQTAVIQPARW 720
           KLSMEVSKETWVCS DFIRCVPTREKK  G++PPPPPPPKKRE KP D TQTAV+QPARW
Sbjct: 661 KLSMEVSKETWVCSADFIRCVPTREKKAIGKDPPPPPPPKKRETKPTDTTQTAVVQPARW 720

Query: 721 SCSFPAAAAAATMIEQKKLVRAKGYEPFVLTRCKSEPMRSSSNLAPDACFWKDRKLEPHR 735
           SCSFPAAAAAA MIEQ KLVRAKGYEPFVLTRCKSEPMRSS+ LAPDAC WKDRKLEPHR
Sbjct: 721 SCSFPAAAAAAAMIEQ-KLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACCWKDRKLEPHR 780
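
The percentages in an HSP header follow directly from the match counts and the alignment length (for the hit above, 566 of 793 aligned positions are identical). As a minimal sketch, assuming the header is available as a single text line in the layout shown on this page, the counts can be extracted and the percentages recomputed. The names `HSP_STATS`, `parse_hsp_stats` and `percent` are hypothetical helpers introduced for illustration; they are not part of any BLAST tool.

```python
import re

# Pattern for the identity/positives part of an HSP header line.
# "Posi?tives" also accepts the shortened spelling that some report pages emit.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return (matches, alignment_length, reported_percent) for both statistics."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def percent(matches, length):
    """Percentage of aligned positions, rounded to two decimals as in the report."""
    return round(100.0 * matches / length, 2)


# Header values copied from the Cucumis sativus hit above.
line = "Identity = 566/793 (71.37%), Positives = 630/793 (79.45%), Query Frame = 1"
stats = parse_hsp_stats(line)
identity_pct = percent(stats["identity"][0], stats["identity"][1])
positives_pct = percent(stats["positives"][0], stats["positives"][1])
```

Recomputing the values this way is a quick consistency check when the headers are copied by hand or scraped from a page.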

BLAST of Cp4.1LG14g03740 vs. NCBI nr
Match: gi|659112000|ref|XP_008456015.1| (PREDICTED: glutamic acid-rich protein isoform X2 [Cucumis melo])

HSP 1 Score: 733.8 bits (1893), Expect = 3.0e-208
Identity = 454/665 (68.27%), Positives = 513/665 (77.14%), Query Frame = 1

Query: 1   MDSDRHFRTTSSNSTSSAAAGSSELFICFTSRFSSSSSSAMKISSKCLLSPGRARE-GQI 60
           MD DRHFRTTS+NSTSS A  SSELFICFTSRFSSSSS  MKISSK +LSPGR RE  QI
Sbjct: 1   MDPDRHFRTTSTNSTSSTATPSSELFICFTSRFSSSSS--MKISSKSILSPGRHREPSQI 60

Query: 61  ILSTSLSRRLKSSGSLKVGQASPAFPTGGKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120
            LSTSLSRRLKSSGSLK GQASP FPTG KKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG
Sbjct: 61  SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120

Query: 121 RKMRARSLKRRTNSEASFRKSESVVQSSQMNGNDQQFVANQSSRPNLLRQDSMSNGGNGF 180
           +KMRARS KRRTNSEASFR+SESVVQSSQ+N NDQQF ++ +   +LLRQ+S SN GNGF
Sbjct: 121 KKMRARSQKRRTNSEASFRRSESVVQSSQVNSNDQQFSSHHNH--HLLRQNSNSNAGNGF 180

Query: 181 QQERHSHRSQRWVHLPFTICEALRAFGAELNCFLPCHSSCSSDRESNKESKPAGRSEEET 240
           QQE  SHR+QRWVHLPFTICEALRAFGAELNCFLPCHSSCS +RE+NKE KPA RS E +
Sbjct: 181 QQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKEPKPAERSSE-S 240

Query: 241 ESSCGKVFARWLVAVQDRDGKGRKIELVVGDEEPQTEKENGSQRRHILDGMNFKDENEVV 300
           ESSCG VFARWLVAVQD DGKGR+IELVVGDEE +TEKENGSQRRH+ +G++FKD+NE V
Sbjct: 241 ESSCGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDKNEAV 300

Query: 301 EKEESRISLCIPPKNALLLMRCRSDPVKVAELAKRFCEPPAPQLEEHDKEELDEDHEEKK 360
           E+EESRIS+CIPPKNALLLMRCRSDPVK+AELAKRFCEPPAP+++E D+EE +++  E K
Sbjct: 301 EEEESRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEEDEEEGEDEDNEAK 360

Query: 361 TTQNEAKRDESVPVS------KEDDEEERTVK---------LNLKLENEEEINEESVSDT 420
             +NE KRD SVPVS      KE++EEE   K           +KLENEEE+NEESVSD 
Sbjct: 361 KRKNEVKRDVSVPVSSIITVNKEEEEEEEEEKEEDERKVEQFVVKLENEEEVNEESVSDE 420

Query: 421 ER-------------------GEESTEMATENEIDEQKSNISMIN----HQSQEETAEDR 480
           ++                    EE+ EMATEN+ DEQK +I+++N     Q+ EE  ED+
Sbjct: 421 DKEKEEANLVLQEEQREEKDNEEETIEMATEND-DEQKQDITVVNQLNQEQALEEKEEDK 480

Query: 481 IDQDNQQETMAISRTIPIPIQSHCESEFAQDAENLESAEEDESK----REQDNRTEQREA 540
            DQ NQQETMA    IPIPIQ+HCE E AQDAE LES E++ESK     EQD +TE+ E 
Sbjct: 481 TDQVNQQETMA----IPIPIQTHCEPEMAQDAEKLESVEKEESKLSHESEQDQKTEEDEI 540

Query: 541 F-----------EEDENGENPTSASLSVETEPVLDETGTEFDENWEEVTETTTAIKEKAT 600
                       EE ENGENPTS SLSVET+PVLDET TE D   EE  E     +EKAT
Sbjct: 541 LREEKEEEEEEEEEGENGENPTSPSLSVETKPVLDETETEVDGKREEEEEEEEE-EEKAT 600

Query: 601 DEGIRSDTQNDDEMMGPEAEDQSKERDTPP----PEPERKTQTESPVLPDCLLLMMYEPK 608
           DEGI  D +N+  ++GPE EDQSKER+TPP    PEPE KTQTE+ VLPDCLLLMMYEPK
Sbjct: 601 DEGIGPDDENNGALVGPEEEDQSKERETPPPEPEPEPEGKTQTETSVLPDCLLLMMYEPK 654

BLAST of Cp4.1LG14g03740 vs. NCBI nr
Match: gi|1009162037|ref|XP_015899218.1| (PREDICTED: trichohyalin [Ziziphus jujuba])

HSP 1 Score: 515.0 bits (1325), Expect = 2.2e-142
Identity = 388/808 (48.02%), Positives = 487/808 (60.27%), Query Frame = 1

Query: 1   MDSDRHFRTTSSNSTSSAAAGSSELFICFTSRFSSSSSSAMKISSKCLLSPGRARE-GQI 60
           M+ +R  RTTSSNS SS +  +SELFICFTSR SSSS   MKISSK +LSPGRARE  QI
Sbjct: 1   MELERPHRTTSSNSNSSGS--TSELFICFTSRLSSSS---MKISSKSILSPGRAREPSQI 60

Query: 61  ILSTSLSRRLKSSGSLKVGQASPAFPTGGKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120
            LS+SLSRRLK+SGS+K GQASP FPTGGKK+GCAFDNPEPSSPKVTCIGQVRVKTKKQG
Sbjct: 61  SLSSSLSRRLKNSGSIKGGQASPMFPTGGKKKGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120

Query: 121 RKMRARSLKRRTNSEASFRKSESVVQSSQMNGNDQQFVANQSSRPNLLRQDSMSNGGNGF 180
           +KMR RS  +R +SE SFRKSE   QS  ++G  Q+   NQ+            N   GF
Sbjct: 121 KKMRIRS--KRRSSEPSFRKSEQNSQS--ISGKQQE---NQNHE----------NNHQGF 180

Query: 181 QQERHS-------HRSQRWVHLPFTICEALRAFGAELNCFLPCHSSCSSDRESNKESKPA 240
           Q +          HR+QRWVHLP TICEALRAFGAE NCFLPC SSC +  E +KE K  
Sbjct: 181 QFQNLQQPECLPHHRNQRWVHLPLTICEALRAFGAEFNCFLPCRSSCMTS-EKDKEEKGG 240

Query: 241 GRS-EEETESSCGKVFARWLVAVQDRDGKGRKIELVVGDEEPQTEKENGSQRRHILDGMN 300
            RS  +E  SSCG VFARWLVA+QD DGKGR+IELVVG+EE  TE+ + S+RR +L+G+ 
Sbjct: 241 ERSVADENGSSCGAVFARWLVALQDGDGKGREIELVVGEEERTTERRSSSRRRQMLEGIE 300

Query: 301 FKDENEV--VEKEESRISLCIPPKNALLLMRCRSDPVKVAELAKRFCE-PPAPQLEEHDK 360
            K+E++   +E EE R+S+CIPPKNALLLMRCRSDPVK+A LA RF E P AP+ EE + 
Sbjct: 301 IKEESKESGMEDEEGRVSICIPPKNALLLMRCRSDPVKMAALANRFWESPAAPKNEEGED 360

Query: 361 EELDEDHEEKKTTQNEAKRDESVPVSKE--DDEEERTVKLNLKLENEEEINEESVSDTER 420
           EE D D E+ +  +   + DE   V  E  D +E    + NL +E EEE  EE   + E 
Sbjct: 361 EEEDGDDEDGRRNKEAVQDDERGLVKAEVVDRKEVAEQQANLAMEEEEE-EEEVAEEAEE 420

Query: 421 GEESTEM----------------ATENEIDEQKSNISMINHQSQEETAEDRIDQDNQQET 480
             ES+E+                 +E E++E+K      + + +EE  E  ++ + Q   
Sbjct: 421 NPESSELDEQQQEIVSSGEDPSEESEGEVEEEKPKGKKEDEEEEEEEEEQELEANQQ--- 480

Query: 481 MAISRTIPIPIQSHCESEFAQDAENLESAEEDESKR-EQDNRTEQREAFEEDE-----NG 540
           +A+ +     +     SE   DAEN+   EE+ ++  E++N  EQ+   +E+E       
Sbjct: 481 LAVEKESVFDVSL---SEILVDAENVLDEEENAAQLVEKENSYEQKPPHQEEEKERELKE 540

Query: 541 ENPTSAS-------LSVETEPV-----------LDETGTEFDENWEEVTETTTAIKEKAT 600
           E   S S        SVE EPV            D+   E +  + E   +T   KE   
Sbjct: 541 ERRASVSKYSSEPVKSVEQEPVEKDNIHGVEEGEDKAENEIEAKFAEAAGSTKEEKETRE 600

Query: 601 DEGIRSDTQNDDEMMGPEAEDQSKERDTPPPEPERKTQTESPVLPDCLLLMMYEPKLSME 660
           +  I   ++       PE  + + E         +  +  SPVLPDCL LMM EPKLSME
Sbjct: 601 ETVIHQKSK-------PEYPNNTHEEVEVAGSEPKVERESSPVLPDCLRLMMCEPKLSME 660

Query: 661 VSKETWVCSTDFIRCVPTREKKPAGRNPPPPPPPKKREKKP-----------------AD 720
           VSKETWVCSTDF+R +P  E+K   RN    P   +  ++                  + 
Sbjct: 661 VSKETWVCSTDFVRWLP--ERKVNQRNGLDQPKKHQHHQQQQQLSIINNEDSNSVRPCSK 720

Query: 721 NTQTAVIQPARWSCSFPAAAAAATM--IEQKKLVRAKGYEPFVLTRCKSEPMRSSSNL-A 735
                ++QP R SCSFPA AA A+M  + ++KLV +KGYEPFVLTRCKSEPMRSS+    
Sbjct: 721 QLANQLMQPPRSSCSFPAPAAPASMATVVEEKLVGSKGYEPFVLTRCKSEPMRSSAKFPP 769

BLAST of Cp4.1LG14g03740 vs. NCBI nr
Match: gi|566194675|ref|XP_002316801.2| (hypothetical protein POPTR_0011s09990g [Populus trichocarpa])

HSP 1 Score: 485.3 bits (1248), Expect = 1.8e-133
Identity = 357/793 (45.02%), Positives = 477/793 (60.15%), Query Frame = 1

Query: 4   DRHFRTTSSNS--TSSAAAGSSELFICFTSRFSSSSSSAMKISSKCLLSPGRAREG-QII 63
           DR  R+ S+NS  TS+  + +SELFICFTSR SSSS   MK+SSK +LSPGR R+  QI 
Sbjct: 11  DRPHRSNSNNSSSTSNNNSNTSELFICFTSRLSSSS---MKLSSKSILSPGRHRDSSQIS 70

Query: 64  LSTSLSRRLKSSGSLKVGQASPAFPTGGKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGR 123
           LS SLSRRL+SSGS+K GQASP FPT GKKRGCAF+NPEPSSPKVTCIGQVRVKTKKQG+
Sbjct: 71  LSNSLSRRLRSSGSMKGGQASPMFPTNGKKRGCAFENPEPSSPKVTCIGQVRVKTKKQGK 130

Query: 124 KMRARSLKRRTNSEASFRKSESVVQSSQMNGNDQQFVANQSSRPNLLRQDSMSNGGNGFQ 183
           K+R RS +R    E SFR+ +    + + + N    + NQ      L Q          Q
Sbjct: 131 KLRTRSKRR---GEISFRRVDQNSNTFEGSNNHHDLINNQ-----FLNQQQQ-------Q 190

Query: 184 QERHSHRSQRWVHLPFTICEALRAFGAELNCFLPCHSSCSSDRESNKESKPAGRSEEETE 243
           QE  SHR+QRWVH P TICEALRAFGAE NCFLPC SSC +  +  +E+  A  S     
Sbjct: 191 QEGLSHRNQRWVHFPVTICEALRAFGAEFNCFLPCRSSCMASEKEKEENTAAAGSNNNGS 250

Query: 244 SSCGKVFARWLVAVQDRDGKGRKIELVVGDE--EPQTEKENGSQRRHILDGMNFKDE--- 303
           SSCG VFARWLVAVQ+ +GKG++IELVVG+E  E + ++   S RRHI + + FK+E   
Sbjct: 251 SSCGAVFARWLVAVQEGEGKGKEIELVVGEEVVEEERDERRRSYRRHIFEDIEFKEEEGH 310

Query: 304 -----NEVVEKEESRISLCIPPKNALLLMRCRSDPVKVAELAKRFCEPPAPQLEEHDKEE 363
                N  +++EE+R+S+CIPPKNALLLMRCRSDPVK+A LA +F E PAPQ EE ++E+
Sbjct: 311 VFEGGNAGLQEEEARVSICIPPKNALLLMRCRSDPVKMAALANKFWESPAPQDEEDEEED 370

Query: 364 LDEDHEEKKT-------------TQNEAKRDESVPVSKE---DDEEERTVKLNLK----L 423
            +E  +++               ++ +A ++E + V +E   + +++ TV   L     +
Sbjct: 371 NEEGEKDRNLGAEVDKFINIENKSEVKASQEEEIKVEQEIIIEQKQDLTVSDKLAFCETI 430

Query: 424 ENEEEI---NEESVSDTERGEESTEM-ATENEIDEQKSNISMINHQSQEETAEDRIDQDN 483
           E   +I    EES+   E GE+S E+ +T++ ID     ++++    QEE   +     N
Sbjct: 431 EEHYQIIQETEESLVILEAGEDSQEIGSTDDNIDGVLQEVNLVK---QEEEESETPGVMN 490

Query: 484 QQETMAISRTIPI--PIQSHCESEFAQDAENLESAEEDESKREQDNRTEQREAFEEDEN- 543
            Q T +   T+ +     S  + E    A  + +  E +  +E +   ++   F+ ++  
Sbjct: 491 LQPTSSTQETVSLCSDESSSHDQEIVDPAALMNNENEYKVVQENEEDNQEERVFQAEQEQ 550

Query: 544 -----GENPTSASLSVETEPVLDETGTEFDENWEEVTETTTAIKEKATDEGIRSDTQNDD 603
                 ++    S+SV  E    +   +  ++ E  + +   ++ + T+E  +  T+N+ 
Sbjct: 551 VVQGLSDDIEENSVSVRFEQETLQVAVQDLQDQEPESLSVAELQVQETEEE-KETTENET 610

Query: 604 EMMGPEAEDQSKERDTPPPEPERKTQTESPVLPDCLLLMMYEPKLSMEVSKETWVCSTDF 663
           E+   E ED     +       R+     P+LPDCLLLMM EPKLSMEVSKETWVCSTDF
Sbjct: 611 ELAEEEPEDPKTHVNGQTGVKSREGDNSQPLLPDCLLLMMCEPKLSMEVSKETWVCSTDF 670

Query: 664 IRCVPTREKKPAGRNPPPPPPPKKR---EKKPAD-----NTQTAVIQPARWSCSFPAAAA 723
           IR +P   +  +  N    P  KKR   + KPA      N   ++ QP R SCS+PA   
Sbjct: 671 IRWLPEHSRPVSKTNGKDEP--KKRVSIDIKPAQVYNNGNNSNSLQQPRRSSCSYPAKPP 730

Query: 724 A--------ATMIEQKKLVRAKGYEPFVLTRCKSEPMRSSSNLAPDACFWKDRKLEPHRP 735
           A        +TMIEQK LV AK YEPFVLTRCKSEPMRS+S LAP+ACFWK+RKLEPHRP
Sbjct: 731 ARCAGTESMSTMIEQK-LVGAKAYEPFVLTRCKSEPMRSASKLAPEACFWKNRKLEPHRP 778

The following BLAST results are available for this feature:
Match Name          E-value     Identity (%)   Description
A0A0A0L789_CUCSA    1.1e-275    71.37          Uncharacterized protein OS=Cucumis sativus GN=Csa_3G236550 PE=4 SV=1
B9I1E3_POPTR        1.3e-133    45.02          Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0011s09990g PE=4 SV=2
A0A061F8W3_THECC    3.7e-133    45.72          Uncharacterized protein OS=Theobroma cacao GN=TCM_031658 PE=4 SV=1
M5X0H9_PRUPE        9.8e-126    45.45          Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001903mg PE=4 SV=1
A0A067LF51_JATCU    1.1e-124    43.77          Uncharacterized protein OS=Jatropha curcas GN=JCGZ_15297 PE=4 SV=1

Match Name          E-value     Identity (%)   Description
AT3G15095.1         1.3e-92     38.85          unknown protein

Match Name                           E-value     Identity (%)   Description
gi|659111998|ref|XP_008456014.1|     4.3e-276    71.34          PREDICTED: glutamic acid-rich protein isoform X1 [Cucumis melo]
gi|778680362|ref|XP_004146243.2|     1.6e-275    71.37          PREDICTED: glutamic acid-rich protein [Cucumis sativus]
gi|659112000|ref|XP_008456015.1|     3.0e-208    68.27          PREDICTED: glutamic acid-rich protein isoform X2 [Cucumis melo]
gi|1009162037|ref|XP_015899218.1|    2.2e-142    48.02          PREDICTED: trichohyalin [Ziziphus jujuba]
gi|566194675|ref|XP_002316801.2|     1.8e-133    45.02          hypothetical protein POPTR_0011s09990g [Populus trichocarpa]

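
The NCBI nr match names above pack several fields into one pipe-delimited string: a GI number, a source database tag (`ref` for RefSeq) and a versioned accession. As a rough sketch, assuming every identifier follows the `gi|<number>|<db>|<accession.version>|` layout seen in this table, the fields can be split apart as follows. The function name `parse_ncbi_id` is hypothetical and is not taken from any NCBI library.

```python
def parse_ncbi_id(identifier):
    """Split a 'gi|<number>|<db>|<accession.version>|' string into its fields."""
    fields = identifier.strip("|").split("|")
    if len(fields) != 4 or fields[0] != "gi":
        raise ValueError("unexpected identifier layout: %r" % identifier)
    gi_number, db_tag, versioned = fields[1], fields[2], fields[3]
    # The accession and its version are separated by the first dot, if any.
    accession, _, version = versioned.partition(".")
    return {
        "gi": int(gi_number),
        "db": db_tag,
        "accession": accession,
        "version": int(version) if version else None,
    }


# Identifier copied from the Cucumis sativus hit in the table above.
hit = parse_ncbi_id("gi|778680362|ref|XP_004146243.2|")
```

Keeping the accession and version separate makes it easier to match hits across releases in which only the version number changes.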
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
biological_process    GO:0010207        photosystem II assembly
cellular_component    GO:0005575        cellular_component
cellular_component    GO:0009507        chloroplast
molecular_function    GO:0003674        molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG14g03740.1    Cp4.1LG14g03740.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description     Source     Source Term      Source Description            Alignment
None        No IPR available    unknown    Coil             Coil                          coord: 470..496, score: -; coord: 348..368, scor…
None        No IPR available    PANTHER    PTHR33448        FAMILY NOT NAMED              coord: 1..154, score: 7.0E-227; coord: 179..734, score: 7.0E…
None        No IPR available    PANTHER    PTHR33448:SF1    CHLOROPLAST PROTEIN HCF243    coord: 179..734, score: 7.0E-227; coord: 1..154, score: 7.0E…

The following gene(s) are paralogous to this gene:
Gene               Paralogue          Organism                     Block
Cp4.1LG14g03740    Cp4.1LG01g01650    Cucurbita pepo (Zucchini)    cpecpeB234