Cp4.1LG02g08520 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG02g08520
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Hydroxyproline-rich glycoprotein family protein, putative
Location: Cp4.1LG02 : 6731 .. 9230 (-)
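
The location above can be read as a coordinate range on linkage group Cp4.1LG02, with "(-)" marking the minus strand. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive (the record does not say so), and the helper names `span_length` and `reverse_complement` are hypothetical, not part of any database API. Whether the sequences shown on this page are already in transcript orientation is not stated here.

```python
# Minimal sketch; assumes 1-based, inclusive coordinates (an assumption,
# not confirmed by the record itself).

def span_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive start..end range."""
    return end - start + 1

# Translation table mapping each base to its complement.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string, as needed when reading a
    minus-strand feature from the plus-strand reference."""
    return seq.translate(_COMPLEMENT)[::-1]

print(span_length(6731, 9230))         # 2500
print(reverse_complement("GAAAAGTT"))  # AACTTTTC
```
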
The following sequences are available for this feature:

Gene sequence (with intron)

GAAAAGTTTTTGTTCGTGGTTATTCTCTATTTGAGTGTCTTTTATATCATTAATTTTTTTCTTTTTCACTCCCTCTGGGATTTTGTATCCTTTGAACTTTTAATCCTTTTCATTATATCTATGAGAATTTTCTTGTTAAAAAAACCTAGTAAATTTTTATTTATGGAACAATAGCATTTGATTTTTCCTTCTTTTTGTTTTTTGGTTTTTATTTTAATGATGTTTGATTAAATATTTGTATTTATTGCGGTTGATATTTGATTTATTGATCCCCACATTCAGGCTGCTCCTTTGAATCGCTTAAGCGACTTACATCCATTAACAACTTGCTCAAATCTAAAAGGCACTTTAACTAATGCTAAAACTGGATATTGGACATTTTTTATGGATAATTTTGAAATGGCCATCTCTGCTAGCAAGAATTTGTACGAAAATTGTGAAAAGGGATGGTTCTAAACTTTTTGGTTCTTCATTGACTATTCTTTAAGGAGCTGTTGTGATAACCTGTTAAGTTCTGGTACTTTCTTCTGTACCAGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAAGGAGAAAAGCGCATTGTGCCTGCATCTCGTTTACCTGAGGGCAATGTCGTGACAACCCAGCCAAATGGACCTCATGCAGCAGGAATAGCCAATCAGGCTACTGTGATAGCTCCATCCCTTTTAGCCCCACCTTCTTCACCAGCATCCTTTACAAATTCTGCACTCCCTTCAACAGCCCAATCACCTAGCTGTTTCTTGTCGTTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACAATGTTCGCTACAGGGCCATATGCGCATGAAACACAACCGGTTTCTCCTCCTGTTTTCTCAGCTTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCACCCGAACTAGCTCACCTAACCACTCCTTCTTCCCCTGATGTGCCTTTTGCTCAGTTCCTATCCTCATCGGTGGATCTCAAAGGAACTGGAAAGGCAAATTACGTTGCTTCCAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCCGGCGATTGCTTATCATCTTCTTTTCCTGAGAGGGACTTTCCACCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACATGAGAAAACTGGCACACCTCTTGCTTCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTTTACGCTTCTGGTGGGAATGGATACCAAAACCGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCTTTTGGTTTCAGTGCCGATGAAATTATAAGTACCACACAATACGTGGAGATATCTGATGTAATGGAAGATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCTGCAGAAGAAAGTATTCAACCTCCATTAGTGGGTGAAAAACTGAAATCCACGCAGGCAACTTTACAGAGTCAAAGAAGTATTAAATCAGCATCTGACGTTGTTGAAAAAGAAACCTGCTCTGAAGTCCTGGCATTATGCAATGGCTGTAAAGGTGAGCCACTGTTCCCTGTTACAAGCTCACACTTGTTTCATGAGAAATCAAAAGAAAATAAAGACAAATACATTTTGGGGAACTTTTTCAAAATGTTGCAACTTTTCTTTCGGTTTCATTATTTCCTTTTTATATACACTGACTCACAATTCTGATGCTAGTAGTCAGGTTAAATTACATTTTTGGGATGTGGTTTTAGGAAAGCTACAATTTAGTTATGGTTACAATTGTCACCTGTTTGTCACCTGTTTGATGCAGACGATAAATTGCAAAGACAACCTGGT
AACTTGCCAGGATCAAGTACTTCCCAAGGTGAAACAGAAGACCTATTCTCAAGAATAGGGTCGTCCAAAAATAGCCGCAAGTATAATCATGCTTTATCCTGCTCTGATGCAGAAGTTGATTACAGAAGAGGAAGGAGCCTGAGGGGGGAGGTCAAGGGAGATTTTTTATGGCATGACTAAGAGCCATTTCTGAAATAGTTTGTAGTATGTGTATCTGTTCTTTGCTTTGCGGGTTTCCATGGAATGTCTAACCTATGACCTACCTGTTCTCTGAGTTGACTTGATCAGTGGATGGATGCATATAGTTTGTGTTCTTTATCGTTCCAGTTTAGAAATGGAATGGGTCAAATCTCTGTATTTGGTAGACGAAGTGTCTTTTTTCCAATGAATTATATAATAGCATGTACAATCTTGCCCATCTTGTGCATACGTCATCATTATGCATATCCAAACTCCCGTTAATAGTGGTCATTGATTAGGTTTTCTTTAGAATCTAAGGT

mRNA sequence

GAAAAGTTTTTGTTCGTGGTTATTCTCTATTTGAGTGTCTTTTATATCATTAATTTTTTTCTTTTTCACTCCCTCTGGGATTTTGTATCCTTTGAACTTTTAATCCTTTTCATTATATCTATGAGAATTTTCTTGTTAAAAAAACCTAGTAAATTTTTATTTATGGAACAATAGCATTTGATTTTTCCTTCTTTTTGTTTTTTGGTTTTTATTTTAATGATGTTTGATTAAATATTTGTATTTATTGCGGTTGATATTTGATTTATTGATCCCCACATTCAGGCTGCTCCTTTGAATCGCTTAAGCGACTTACATCCATTAACAACTTGCTCAAATCTAAAAGGCACTTTAACTAATGCTAAAACTGGATATTGGACATTTTTTATGGATAATTTTGAAATGGCCATCTCTGCTAGCAAGAATTTGTACGAAAATTGTGAAAAGGGATGGTTCTAAACTTTTTGGTTCTTCATTGACTATTCTTTAAGGAGCTGTTGTGATAACCTGTTAAGTTCTGGTACTTTCTTCTGTACCAGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAAGGAGAAAAGCGCATTGTGCCTGCATCTCGTTTACCTGAGGGCAATGTCGTGACAACCCAGCCAAATGGACCTCATGCAGCAGGAATAGCCAATCAGGCTACTGTGATAGCTCCATCCCTTTTAGCCCCACCTTCTTCACCAGCATCCTTTACAAATTCTGCACTCCCTTCAACAGCCCAATCACCTAGCTGTTTCTTGTCGTTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACAATGTTCGCTACAGGGCCATATGCGCATGAAACACAACCGGTTTCTCCTCCTGTTTTCTCAGCTTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCACCCGAACTAGCTCACCTAACCACTCCTTCTTCCCCTGATGTGCCTTTTGCTCAGTTCCTATCCTCATCGGTGGATCTCAAAGGAACTGGAAAGGCAAATTACGTTGCTTCCAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCCGGCGATTGCTTATCATCTTCTTTTCCTGAGAGGGACTTTCCACCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACATGAGAAAACTGGCACACCTCTTGCTTCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTTTACGCTTCTGGTGGGAATGGATACCAAAACCGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCTTTTGGTTTCAGTGCCGATGAAATTATAAGTACCACACAATACGTGGAGATATCTGATGTAATGGAAGATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCTGCAGAAGAAAGTATTCAACCTCCATTAGTGGGTGAAAAACTGAAATCCACGCAGGCAACTTTACAGAGTCAAAGAAGTATTAAATCAGCATCTGACGTTGTTGAAAAAGAAACCTGCTCTGAAGTCCTGGCATTATGCAATGGCTGTAAAGACGATAAATTGCAAAGACAACCTGGTAACTTGCCAGGATCAAGTACTTCCCAAGGTGAAACAGAAGACCTATTCTCAAGAATAGGGTCGTCCAAAAATAGCCGCAAGTATAATCATGCTTTATCCTGCTCTGATGCAGAAGTTGATTACAGAAGAGGAAGGAGCCTGAGGGGGGAGGTCAAGGGAGATTTTTTATGGCATGACTAAGAGCCATTTCTGAAATAGTTTGTAGTATGTTTGACTTGATCAGTGGATGGATGCATATAGTTTGTGTTCTTTATCGTT
CCAGTTTAGAAATGGAATGGGTCAAATCTCTGTATTTGGTAGACGAAGTGTCTTTTTTCCAATGAATTATATAATAGCATGTACAATCTTGCCCATCTTGTGCATACGTCATCATTATGCATATCCAAACTCCCGTTAATAGTGGTCATTGATTAGGTTTTCTTTAGAATCTAAGGT

Coding sequence (CDS)

ATGTTCGCTACAGGGCCATATGCGCATGAAACACAACCGGTTTCTCCTCCTGTTTTCTCAGCTTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCACCCGAACTAGCTCACCTAACCACTCCTTCTTCCCCTGATGTGCCTTTTGCTCAGTTCCTATCCTCATCGGTGGATCTCAAAGGAACTGGAAAGGCAAATTACGTTGCTTCCAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCCGGCGATTGCTTATCATCTTCTTTTCCTGAGAGGGACTTTCCACCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACATGAGAAAACTGGCACACCTCTTGCTTCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTTTACGCTTCTGGTGGGAATGGATACCAAAACCGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCTTTTGGTTTCAGTGCCGATGAAATTATAAGTACCACACAATACGTGGAGATATCTGATGTAATGGAAGATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCTGCAGAAGAAAGTATTCAACCTCCATTAGTGGGTGAAAAACTGAAATCCACGCAGGCAACTTTACAGAGTCAAAGAAGTATTAAATCAGCATCTGACGTTGTTGAAAAAGAAACCTGCTCTGAAGTCCTGGCATTATGCAATGGCTGTAAAGACGATAAATTGCAAAGACAACCTGGTAACTTGCCAGGATCAAGTACTTCCCAAGGTGAAACAGAAGACCTATTCTCAAGAATAGGGTCGTCCAAAAATAGCCGCAAGTATAATCATGCTTTATCCTGCTCTGATGCAGAAGTTGATTACAGAAGAGGAAGGAGCCTGAGGGGGGAGGTCAAGGGAGATTTTTTATGGCATGACTAA

Protein sequence

MFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIISTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVVEKETCSEVLALCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
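
The protein sequence above appears to be the translation of the CDS under the standard genetic code. As a hedged illustration, the sketch below builds the standard codon table and translates a short prefix and the final codons of the CDS; the function name `translate` is a hypothetical helper, not a library call.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.
    Codons containing unexpected characters become 'X'."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS shown above.
print(translate("ATGTTCGCTACAGGGCCATATGCGCATGAA"))  # MFATGPYAHE
# Last four codons of the CDS, ending in the TAA stop codon.
print(translate("TGGCATGACTAA"))                    # WHD
```

Both results agree with the start (MFATGPYAHE) and end (WHD) of the protein sequence listed above.
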
BLAST of Cp4.1LG02g08520 vs. Swiss-Prot
Match: Y1666_ARATH (Uncharacterized protein At1g76660 OS=Arabidopsis thaliana GN=At1g76660 PE=2 SV=1)

HSP 1 Score: 326.6 bits (836), Expect = 3.4e-88
Identity = 198/356 (55.62%), Positives = 236/356 (66.29%), Query Frame = 1

Query: 1   MFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDL 60
           M+ATGPYAHETQ VSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DL
Sbjct: 108 MYATGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDL 167

Query: 61  KGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASL 120
           K +GK +Y   NDLQA YSLYPGSPAS+L SPISR SGD L S                 
Sbjct: 168 KNSGKGHY---NDLQATYSLYPGSPASALRSPISRASGDGLLSP---------------- 227

Query: 121 QDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDS 180
           Q+GK  RS SG  FG++  G     Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDS
Sbjct: 228 QNGKCSRSDSGNTFGYDTNGVSTPLQESNFFCPETFAKFYLDHDPSVPQNGGRLSVSKDS 287

Query: 181 DVYASG--GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIISTTQYVEISDVMEDSFTM 240
           DVY +   GNG QNR ++SPKQD+EE+EAYRASFGFSADEII+T+QYVEI+DVM+ SF  
Sbjct: 288 DVYPTNGYGNGNQNRQNRSPKQDMEELEAYRASFGFSADEIITTSQYVEITDVMDGSFNT 347

Query: 241 RPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVVEKETCSEVLALCNGCKD 300
             ++            P  G+KL   +A L SQ S KS +D+  +    +     N  KD
Sbjct: 348 SAYS------------PSDGQKLLRREANLLSQTSPKSEADLDSQVVDFQSPKSSNSYKD 407

Query: 301 DKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLR 354
            K + +            + E L SR+GS K SR Y+  +S SDAEV+YRRGRSLR
Sbjct: 408 HKQRNR---------IHADEEALLSRVGSVKGSRSYH--ISSSDAEVEYRRGRSLR 421
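
The percentages in the HSP header are the match counts divided by the alignment length, which here includes gap columns. A small sketch of that arithmetic follows, using the values from the Swiss-Prot hit above; the names `percent` and `parse_identity` are hypothetical helpers written for this illustration.

```python
import re

def percent(matches: int, aligned_length: int) -> float:
    """Percentage of alignment columns that match, rounded to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

def parse_identity(hsp_line: str) -> tuple[int, int]:
    """Extract (identical, aligned_length) from an 'Identity = a/b' field."""
    m = re.search(r"Identity = (\d+)/(\d+)", hsp_line)
    if m is None:
        raise ValueError("no Identity field found")
    return int(m.group(1)), int(m.group(2))

identical, length = parse_identity("Identity = 198/356 (55.62%)")
print(percent(identical, length))  # 55.62
print(percent(236, length))        # 66.29 (conservative substitutions included)
```
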

BLAST of Cp4.1LG02g08520 vs. TrEMBL
Match: A0A0A0L1G3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G665140 PE=4 SV=1)

HSP 1 Score: 604.4 bits (1557), Expect = 9.3e-170
Identity = 314/364 (86.26%), Positives = 333/364 (91.48%), Query Frame = 1

Query: 1   MFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDL 60
           M+ATGPYAH+TQ VSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DL
Sbjct: 111 MYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDL 170

Query: 61  KGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASL 120
           KGTGKANY+ASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDF PQWN SASL
Sbjct: 171 KGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFRPQWNSSASL 230

Query: 121 QDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSD 180
           QDGKYPRSGSGRLFG+EK GT LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSD
Sbjct: 231 QDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSD 290

Query: 181 VYASGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIISTTQYVEISDVMEDSFTMRPF 240
           VY+S GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEII+TTQYVEISDVMEDSFTMRPF
Sbjct: 291 VYSSCGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPF 350

Query: 241 TSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVVEKETCSEVLALCNGCKDDKL 300
           TSTSLSAEES +PPL+GEKLKS+  TLQSQRSIKSA +    ETC+E+ ALCNG KD+KL
Sbjct: 351 TSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPALCNGYKDNKL 410

Query: 301 QRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDF 360
           QRQPG++ GSSTS    +D+FSRIGSSKNSRKY+  LSCSDAEVDYRRGRSLR E KG+ 
Sbjct: 411 QRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLR-EAKGNG 469

Query: 361 LWHD 365
            WHD
Sbjct: 471 SWHD 469

BLAST of Cp4.1LG02g08520 vs. TrEMBL
Match: M5WAN0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006213mg PE=4 SV=1)

HSP 1 Score: 499.6 bits (1285), Expect = 3.2e-138
Identity = 269/372 (72.31%), Positives = 303/372 (81.45%), Query Frame = 1

Query: 1   MFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDL 60
           M+ATGPYA+ETQ VSPPVFS FTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FLSSSVD+
Sbjct: 54  MYATGPYANETQLVSPPVFSTFTTEPSTAPLTPPPELAHLTTPSSPDVPFARFLSSSVDI 113

Query: 61  KGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASL 120
           K T K NY+A+NDLQA YSLYPGSPASSL SPISR S DC SSSFPERDFP QW+PS S 
Sbjct: 114 KTTDKTNYIAANDLQATYSLYPGSPASSLRSPISRASNDC-SSSFPERDFPRQWDPSVSP 173

Query: 121 QDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSD 180
           Q+G YPRSGS RLFG++ TG   ASQDSNFFCPATFAQFYLDNPPFPH GGRLSVSKDSD
Sbjct: 174 QNGTYPRSGSARLFGYDTTGASAASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKDSD 233

Query: 181 VYASGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIISTTQYVEISDVMEDSFTMRPF 240
           VY++GGNG QNRH++SPKQDVEE+EAYRASFGFSADEII+TTQYVEISDVM+DSFTM PF
Sbjct: 234 VYSTGGNGSQNRHNRSPKQDVEELEAYRASFGFSADEIITTTQYVEISDVMDDSFTMTPF 293

Query: 241 TSTSLSAEESIQPPLVGEKLKS--TQATLQSQRSIKSASDVVEKETCSEVLALCNGCKDD 300
           TS  L  EE I+P  V E LK+  T+  LQSQ + KS SD+ E  + S++   CNG +D 
Sbjct: 294 TSHKLPTEEHIEPKSVTEGLKAQKTKTILQSQDTTKSESDLDEGGS-SDLPISCNGYEDH 353

Query: 301 KLQRQPGNLPGSSTS------QGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSL 360
           K  RQPG++  SST         + ED+FS++GSSK SRKY   LS SDAE+DYRRGRSL
Sbjct: 354 KSWRQPGDVSRSSTPGPGVRVLADEEDIFSKMGSSKLSRKYQLGLSSSDAEIDYRRGRSL 413

Query: 361 RGEVKGDFLWHD 365
           R E KG+F WHD
Sbjct: 414 R-ERKGEFAWHD 422

BLAST of Cp4.1LG02g08520 vs. TrEMBL
Match: A0A067HC38_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g013739mg PE=4 SV=1)

HSP 1 Score: 495.4 bits (1274), Expect = 6.1e-137
Identity = 254/368 (69.02%), Positives = 304/368 (82.61%), Query Frame = 1

Query: 1   MFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDL 60
           MFATGPYAHETQ VSPPVFS FTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FL+SS+DL
Sbjct: 70  MFATGPYAHETQLVSPPVFSTFTTEPSTAPLTPPPELAHLTTPSSPDVPFARFLTSSMDL 129

Query: 61  KGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASL 120
            GT KANY+A+NDLQA YSLYPGSP SSL+SPISRTSG+CLSSSFPER+FPPQW+P+ S 
Sbjct: 130 NGTDKANYIAANDLQATYSLYPGSPPSSLISPISRTSGECLSSSFPEREFPPQWDPTVSP 189

Query: 121 QDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDS 180
           Q+GKY RSGSGRL+ H+ TG    SQD+NFFCPATFAQFYLD + PFPHTGGRLSVSKDS
Sbjct: 190 QNGKYSRSGSGRLYTHDTTGGSRVSQDTNFFCPATFAQFYLDHDSPFPHTGGRLSVSKDS 249

Query: 181 DVYASGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIISTTQYVEISDVMEDSFTMRP 240
           DVY +G NG QNRH+KSPKQDVEE+EAYRASFGFSADEII+T QYVEI+DVM+DSFTM P
Sbjct: 250 DVYPNGANGNQNRHTKSPKQDVEELEAYRASFGFSADEIITTPQYVEITDVMDDSFTMMP 309

Query: 241 FTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVVEKETCSEVLALCNGCKDDK 300
           FTS   + EES+   + G+K +  ++ L + +++KS SD++      E+    +GC+D+K
Sbjct: 310 FTSDKPAFEESLPASMDGQKPQGRESNLLNPKNLKSDSDLMNGGIHHELTESSDGCEDNK 369

Query: 301 LQRQPGNLPGSSTSQGET----EDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGE 360
            +RQ G++ G+ST   +     ED+FS++ +S+NSRKY+  LSCSDAE+DYRRGRSLR E
Sbjct: 370 PKRQSGDVSGASTPGNQVLTDEEDIFSKMRTSRNSRKYHQGLSCSDAEIDYRRGRSLR-E 429

Query: 361 VKGDFLWH 364
            KGDF WH
Sbjct: 430 GKGDFSWH 436

BLAST of Cp4.1LG02g08520 vs. TrEMBL
Match: V4TGC4_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10000992mg PE=4 SV=1)

HSP 1 Score: 495.4 bits (1274), Expect = 6.1e-137
Identity = 254/368 (69.02%), Positives = 304/368 (82.61%), Query Frame = 1

Query: 1   MFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDL 60
           MFATGPYAHETQ VSPPVFS FTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FL+SS+DL
Sbjct: 113 MFATGPYAHETQLVSPPVFSTFTTEPSTAPLTPPPELAHLTTPSSPDVPFARFLTSSMDL 172

Query: 61  KGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASL 120
            GT KANY+A+NDLQA YSLYPGSP SSL+SPISRTSG+CLSSSFPER+FPPQW+P+ S 
Sbjct: 173 NGTDKANYIAANDLQATYSLYPGSPPSSLISPISRTSGECLSSSFPEREFPPQWDPTVSP 232

Query: 121 QDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDS 180
           Q+GKY RSGSGRL+ H+ TG    SQD+NFFCPATFAQFYLD + PFPHTGGRLSVSKDS
Sbjct: 233 QNGKYSRSGSGRLYTHDTTGGSRVSQDTNFFCPATFAQFYLDHDSPFPHTGGRLSVSKDS 292

Query: 181 DVYASGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIISTTQYVEISDVMEDSFTMRP 240
           DVY +G NG QNRH+KSPKQDVEE+EAYRASFGFSADEII+T QYVEI+DVM+DSFTM P
Sbjct: 293 DVYPNGANGNQNRHTKSPKQDVEELEAYRASFGFSADEIITTPQYVEITDVMDDSFTMMP 352

Query: 241 FTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVVEKETCSEVLALCNGCKDDK 300
           FTS   + EES+   + G+K +  ++ L + +++KS SD++      E+    +GC+D+K
Sbjct: 353 FTSDKPAFEESLPASMDGQKPQGRESNLLNPKNLKSDSDLMNGGIHHELTESSDGCEDNK 412

Query: 301 LQRQPGNLPGSSTSQGET----EDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGE 360
            +RQ G++ G+ST   +     ED+FS++ +S+NSRKY+  LSCSDAE+DYRRGRSLR E
Sbjct: 413 PKRQSGDVSGASTPGNQVLTDEEDIFSKMRTSRNSRKYHQGLSCSDAEIDYRRGRSLR-E 472

Query: 361 VKGDFLWH 364
            KGDF WH
Sbjct: 473 GKGDFSWH 479

BLAST of Cp4.1LG02g08520 vs. TrEMBL
Match: A0A067KKH5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_08002 PE=4 SV=1)

HSP 1 Score: 489.2 bits (1258), Expect = 4.4e-135
Identity = 258/369 (69.92%), Positives = 295/369 (79.95%), Query Frame = 1

Query: 1   MFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDL 60
           MFATGPYAHETQ VSPPVFS FTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DL
Sbjct: 111 MFATGPYAHETQLVSPPVFSTFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDL 170

Query: 61  KGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASL 120
           K T K NY+A+ DLQ  YSLYPGSPASSL+SPISRTSGDCLSSSFPERDFPPQW+PS S 
Sbjct: 171 KSTEKTNYIAAGDLQTTYSLYPGSPASSLISPISRTSGDCLSSSFPERDFPPQWDPSVSP 230

Query: 121 QDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDS 180
           Q+GKY R+GSGRLFGH+ TG  + SQD+NFFCPATFA+FYLD NPPFPHTGGRLSVSKDS
Sbjct: 231 QNGKYSRNGSGRLFGHDTTGASMVSQDTNFFCPATFARFYLDHNPPFPHTGGRLSVSKDS 290

Query: 181 DVYASGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIISTTQYVEISDVMEDSFTMRP 240
           DVY +GGNG+Q+RH+++PKQDVEEIEAYRASFGFSADEII+T QYVEISDVM+DSFTM P
Sbjct: 291 DVYPAGGNGHQSRHNRNPKQDVEEIEAYRASFGFSADEIITTQQYVEISDVMDDSFTMTP 350

Query: 241 FTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVVEKETCSEVLALCNGCKDDK 300
           FTS   + E S +       L  +Q    +  ++K  SD V    C E    C+  +D K
Sbjct: 351 FTSNKPTIEGSTE----AASLSDSQKAQTNLPTLKLKSDRV----CGEAPVSCDRYEDSK 410

Query: 301 LQRQPGNLPGSST----SQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGE 360
            +RQ G++ GSST    +  + +D+FS++ SSK SRKYN   SCSDAE+DYRRGRSL GE
Sbjct: 411 SRRQTGDVSGSSTPGIHALTDDDDIFSKMTSSKISRKYNLGSSCSDAEIDYRRGRSL-GE 470

Query: 361 VKGDFLWHD 365
            K DF WHD
Sbjct: 471 GKADFAWHD 470

BLAST of Cp4.1LG02g08520 vs. TAIR10
Match: AT1G76660.1 (AT1G76660.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 326.6 bits (836), Expect = 1.9e-89
Identity = 198/356 (55.62%), Positives = 236/356 (66.29%), Query Frame = 1

Query: 1   MFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDL 60
           M+ATGPYAHETQ VSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DL
Sbjct: 108 MYATGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDL 167

Query: 61  KGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASL 120
           K +GK +Y   NDLQA YSLYPGSPAS+L SPISR SGD L S                 
Sbjct: 168 KNSGKGHY---NDLQATYSLYPGSPASALRSPISRASGDGLLSP---------------- 227

Query: 121 QDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDS 180
           Q+GK  RS SG  FG++  G     Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDS
Sbjct: 228 QNGKCSRSDSGNTFGYDTNGVSTPLQESNFFCPETFAKFYLDHDPSVPQNGGRLSVSKDS 287

Query: 181 DVYASG--GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIISTTQYVEISDVMEDSFTM 240
           DVY +   GNG QNR ++SPKQD+EE+EAYRASFGFSADEII+T+QYVEI+DVM+ SF  
Sbjct: 288 DVYPTNGYGNGNQNRQNRSPKQDMEELEAYRASFGFSADEIITTSQYVEITDVMDGSFNT 347

Query: 241 RPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVVEKETCSEVLALCNGCKD 300
             ++            P  G+KL   +A L SQ S KS +D+  +    +     N  KD
Sbjct: 348 SAYS------------PSDGQKLLRREANLLSQTSPKSEADLDSQVVDFQSPKSSNSYKD 407

Query: 301 DKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLR 354
            K + +            + E L SR+GS K SR Y+  +S SDAEV+YRRGRSLR
Sbjct: 408 HKQRNR---------IHADEEALLSRVGSVKGSRSYH--ISSSDAEVEYRRGRSLR 421

BLAST of Cp4.1LG02g08520 vs. TAIR10
Match: AT5G52430.1 (AT5G52430.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 95.5 bits (236), Expect = 7.1e-20
Identity = 65/152 (42.76%), Positives = 87/152 (57.24%), Query Frame = 1

Query: 1   MFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSVD 60
           +F  GPYA+ETQPV+PPVFSAF TEPSTAP TPPPE + H+TTPSSP+VPFAQ L+SS++
Sbjct: 128 VFTVGPYANETQPVTPPVFSAFITEPSTAPYTPPPESSVHITTPSSPEVPFAQLLTSSLE 187

Query: 61  LKGTGKANYVASNDLQAAY-----SLYPGSP-ASSLVSPISRTSGDCLSSSFPERDFPPQ 120
           L      + +      + Y      + PGSP   +L+SP S  S    SS +P +     
Sbjct: 188 LTRRDSTSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPGK----- 247

Query: 121 WNPSASLQDGKYPRSGSGRLFGHEKTGTPLAS 146
            +P    + G+ P+      F   K G+   S
Sbjct: 248 -SPMVEFRIGEPPKFLGFEHFTARKWGSRFGS 273

BLAST of Cp4.1LG02g08520 vs. TAIR10
Match: AT4G25620.1 (AT4G25620.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 95.5 bits (236), Expect = 7.1e-20
Identity = 56/101 (55.45%), Positives = 69/101 (68.32%), Query Frame = 1

Query: 2   FATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLK 61
           F  GPYAHETQPV+PPVFSAFTTEPSTAP TPPPE     +PSSP+VPFAQ L+SS++  
Sbjct: 126 FTIGPYAHETQPVTPPVFSAFTTEPSTAPFTPPPE-----SPSSPEVPFAQLLTSSLERA 185

Query: 62  -----GTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTS 98
                G     + A++    +  +YPGSP  +L+SP S TS
Sbjct: 186 RRNSGGGMNQKFSAAHYEFKSCQVYPGSPGGNLISPGSGTS 221

BLAST of Cp4.1LG02g08520 vs. TAIR10
Match: AT1G63720.1 (AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1))

HSP 1 Score: 85.9 bits (211), Expect = 5.7e-17
Identity = 54/111 (48.65%), Positives = 71/111 (63.96%), Query Frame = 1

Query: 1   MFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSS 60
           +FA GPYAHETQ VSPPVFS +TTEPS+AP+TPP + + +    TTPSSP+VPFAQ  +S
Sbjct: 131 IFAIGPYAHETQLVSPPVFSTYTTEPSSAPITPPLDDSSIYLTTTTPSSPEVPFAQLFNS 190

Query: 61  SVDLKGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPE 108
           +      G    ++S+     Y L PGSP   L+SP   + G   +S FP+
Sbjct: 191 NHQTGSYGYKFPMSSSYEFQFYQLPPGSPLGQLISP---SPGSGPTSPFPD 238

BLAST of Cp4.1LG02g08520 vs. NCBI nr
Match: gi|659105232|ref|XP_008453041.1| (PREDICTED: uncharacterized protein At1g76660 [Cucumis melo])

HSP 1 Score: 620.9 bits (1600), Expect = 1.4e-174
Identity = 320/364 (87.91%), Positives = 339/364 (93.13%), Query Frame = 1

Query: 1   MFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDL 60
           ++ATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DL
Sbjct: 111 IYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDL 170

Query: 61  KGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASL 120
           KGTGKANY+ASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDF PQWN SASL
Sbjct: 171 KGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFRPQWNSSASL 230

Query: 121 QDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSD 180
           QDGKYPRSGSGRLFG+EK GT LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSD
Sbjct: 231 QDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSD 290

Query: 181 VYASGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIISTTQYVEISDVMEDSFTMRPF 240
           VY+S GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEII+TTQYVEISDVMEDSFTMRPF
Sbjct: 291 VYSSCGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPF 350

Query: 241 TSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVVEKETCSEVLALCNGCKDDKL 300
           TSTSLSAEES +PPL+GEKLKS+  TLQ+QRSIKSA +VVEKETC+EV ALCNG KD+KL
Sbjct: 351 TSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPALCNGYKDNKL 410

Query: 301 QRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDF 360
           QRQPG++ GSSTS    +D+FSRIGSSKNSRKY+  LSCSDAEVDYRRGRSLR E KG+ 
Sbjct: 411 QRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLR-EAKGNG 470

Query: 361 LWHD 365
            WHD
Sbjct: 471 SWHD 473

BLAST of Cp4.1LG02g08520 vs. NCBI nr
Match: gi|449466510|ref|XP_004150969.1| (PREDICTED: uncharacterized protein At1g76660 [Cucumis sativus])

HSP 1 Score: 604.4 bits (1557), Expect = 1.3e-169
Identity = 314/364 (86.26%), Positives = 333/364 (91.48%), Query Frame = 1

Query: 1   MFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDL 60
           M+ATGPYAH+TQ VSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DL
Sbjct: 111 MYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDL 170

Query: 61  KGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASL 120
           KGTGKANY+ASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDF PQWN SASL
Sbjct: 171 KGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFRPQWNSSASL 230

Query: 121 QDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSD 180
           QDGKYPRSGSGRLFG+EK GT LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSD
Sbjct: 231 QDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSD 290

Query: 181 VYASGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIISTTQYVEISDVMEDSFTMRPF 240
           VY+S GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEII+TTQYVEISDVMEDSFTMRPF
Sbjct: 291 VYSSCGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPF 350

Query: 241 TSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVVEKETCSEVLALCNGCKDDKL 300
           TSTSLSAEES +PPL+GEKLKS+  TLQSQRSIKSA +    ETC+E+ ALCNG KD+KL
Sbjct: 351 TSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPALCNGYKDNKL 410

Query: 301 QRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDF 360
           QRQPG++ GSSTS    +D+FSRIGSSKNSRKY+  LSCSDAEVDYRRGRSLR E KG+ 
Sbjct: 411 QRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLR-EAKGNG 469

Query: 361 LWHD 365
            WHD
Sbjct: 471 SWHD 469

BLAST of Cp4.1LG02g08520 vs. NCBI nr
Match: gi|1009126185|ref|XP_015880014.1| (PREDICTED: uncharacterized protein At1g76660 [Ziziphus jujuba])

HSP 1 Score: 507.7 bits (1306), Expect = 1.7e-140
Identity = 259/369 (70.19%), Positives = 303/369 (82.11%), Query Frame = 1

Query: 1   MFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDL 60
           MFATGPYAHETQ VSPPVFS FTTEPSTAPLTPPPELAHLTTPSSPDVPFA FLSSSVDL
Sbjct: 113 MFATGPYAHETQLVSPPVFSTFTTEPSTAPLTPPPELAHLTTPSSPDVPFAHFLSSSVDL 172

Query: 61  KGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASL 120
           K T K+NY+A NDL + YSLYPGSP SS++SPISRTS +C SSSFPER+FP QW+ S S 
Sbjct: 173 KSTDKSNYIAVNDLHSTYSLYPGSPPSSIISPISRTSNECSSSSFPEREFPTQWDSSVSP 232

Query: 121 QDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSD 180
           ++GKYPR+ SGRLF H+ TG P+ SQDSNFFCPATFAQFY+DNPPFPH GGRLSVSKDSD
Sbjct: 233 KNGKYPRNDSGRLFEHDATGGPMTSQDSNFFCPATFAQFYVDNPPFPHAGGRLSVSKDSD 292

Query: 181 VYASGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIISTTQYVEISDVMEDSFTMRPF 240
            Y++GGNG+QNRHSKSPKQDVEEIEAYRASFGFSADEII+T+QYVEISDVMEDSFTM PF
Sbjct: 293 AYSTGGNGHQNRHSKSPKQDVEEIEAYRASFGFSADEIITTSQYVEISDVMEDSFTMTPF 352

Query: 241 TSTSLSAEESIQPPLV-GEKLKSTQATLQSQRSIKSASDVVEKETCSEVLALCNGCKDDK 300
           TS  L  +ESI+P  + G K   TQ + QSQ+S++S  D+++   C EV AL NG +D K
Sbjct: 353 TSNKLPMDESIEPASISGLKAIKTQTSAQSQKSLESELDLIDGGRCCEVPALSNGFEDHK 412

Query: 301 LQRQPGNLPGSSTSQG----ETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGE 360
             + PG++ GSST       + +D+FS++GSS+ S+KY   LSCSDAE+DYR GRS++ E
Sbjct: 413 SWKPPGDISGSSTPGNRILTDEDDIFSKVGSSRMSKKYQLGLSCSDAEIDYRSGRSVK-E 472

Query: 361 VKGDFLWHD 365
            KGDF WH+
Sbjct: 473 GKGDFKWHE 480

BLAST of Cp4.1LG02g08520 vs. NCBI nr
Match: gi|645255247|ref|XP_008233411.1| (PREDICTED: uncharacterized protein At1g76660 isoform X1 [Prunus mume])

HSP 1 Score: 500.0 bits (1286), Expect = 3.6e-138
Identity = 270/372 (72.58%), Positives = 303/372 (81.45%), Query Frame = 1

Query: 1   MFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDL 60
           M+ATGPYA+ETQ VSPPVFS FTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FLSSSVD+
Sbjct: 111 MYATGPYANETQLVSPPVFSTFTTEPSTAPLTPPPELAHLTTPSSPDVPFARFLSSSVDI 170

Query: 61  KGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASL 120
           K T K NY+A+NDLQA YSLYPGSPASSL SPISR S DC SSSFPERDFP QW+PS S 
Sbjct: 171 KTTDKTNYIAANDLQATYSLYPGSPASSLRSPISRASNDC-SSSFPERDFPRQWDPSVSP 230

Query: 121 QDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSD 180
           Q+G YPRSGS RLFG++ TG   ASQDSNFFCPATFAQFYLDNPPFPH GGRLSVSKDSD
Sbjct: 231 QNGTYPRSGSARLFGYDTTGASAASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKDSD 290

Query: 181 VYASGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIISTTQYVEISDVMEDSFTMRPF 240
           VY++GGNG QNRH++SPKQDVEE+EAYRASFGFSADEII+TTQYVEISDVM+DSFTM PF
Sbjct: 291 VYSTGGNGSQNRHNRSPKQDVEELEAYRASFGFSADEIITTTQYVEISDVMDDSFTMTPF 350

Query: 241 TSTSLSAEESIQPPLVGEKLKS--TQATLQSQRSIKSASDVVEKETCSEVLALCNGCKDD 300
           TS  L  EE I+P  V E LK+  T+  LQSQ + KS SD+ E  + S++   CNG +D 
Sbjct: 351 TSHKLPTEEHIEPISVTEGLKAQKTKTILQSQDTTKSESDLDEGGS-SDLPISCNGYEDH 410

Query: 301 KLQRQPGNLPGSSTS------QGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSL 360
           K  RQPG++  SST         + ED+FS+IGSSK SRKY   LS SDAE+DYRRGRSL
Sbjct: 411 KSWRQPGDVSRSSTPGPGIRVLADEEDIFSKIGSSKLSRKYQLGLSSSDAEIDYRRGRSL 470

Query: 361 RGEVKGDFLWHD 365
           R E KG+F WHD
Sbjct: 471 R-ERKGEFAWHD 479

BLAST of Cp4.1LG02g08520 vs. NCBI nr
Match: gi|595846660|ref|XP_007209172.1| (hypothetical protein PRUPE_ppa006213mg [Prunus persica])

HSP 1 Score: 499.6 bits (1285), Expect = 4.7e-138
Identity = 269/372 (72.31%), Positives = 303/372 (81.45%), Query Frame = 1

Query: 1   MFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDL 60
           M+ATGPYA+ETQ VSPPVFS FTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FLSSSVD+
Sbjct: 54  MYATGPYANETQLVSPPVFSTFTTEPSTAPLTPPPELAHLTTPSSPDVPFARFLSSSVDI 113

Query: 61  KGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASL 120
           K T K NY+A+NDLQA YSLYPGSPASSL SPISR S DC SSSFPERDFP QW+PS S 
Sbjct: 114 KTTDKTNYIAANDLQATYSLYPGSPASSLRSPISRASNDC-SSSFPERDFPRQWDPSVSP 173

Query: 121 QDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSD 180
           Q+G YPRSGS RLFG++ TG   ASQDSNFFCPATFAQFYLDNPPFPH GGRLSVSKDSD
Sbjct: 174 QNGTYPRSGSARLFGYDTTGASAASQDSNFFCPATFAQFYLDNPPFPHAGGRLSVSKDSD 233

Query: 181 VYASGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIISTTQYVEISDVMEDSFTMRPF 240
           VY++GGNG QNRH++SPKQDVEE+EAYRASFGFSADEII+TTQYVEISDVM+DSFTM PF
Sbjct: 234 VYSTGGNGSQNRHNRSPKQDVEELEAYRASFGFSADEIITTTQYVEISDVMDDSFTMTPF 293

Query: 241 TSTSLSAEESIQPPLVGEKLKS--TQATLQSQRSIKSASDVVEKETCSEVLALCNGCKDD 300
           TS  L  EE I+P  V E LK+  T+  LQSQ + KS SD+ E  + S++   CNG +D 
Sbjct: 294 TSHKLPTEEHIEPKSVTEGLKAQKTKTILQSQDTTKSESDLDEGGS-SDLPISCNGYEDH 353

Query: 301 KLQRQPGNLPGSSTS------QGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSL 360
           K  RQPG++  SST         + ED+FS++GSSK SRKY   LS SDAE+DYRRGRSL
Sbjct: 354 KSWRQPGDVSRSSTPGPGVRVLADEEDIFSKMGSSKLSRKYQLGLSSSDAEIDYRRGRSL 413

Query: 361 RGEVKGDFLWHD 365
           R E KG+F WHD
Sbjct: 414 R-ERKGEFAWHD 422

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Y1666_ARATH	3.4e-88	55.62	Uncharacterized protein At1g76660 OS=Arabidopsis thaliana GN=At1g76660 PE=2 SV=1

Match Name	E-value	Identity	Description
A0A0A0L1G3_CUCSA	9.3e-170	86.26	Uncharacterized protein OS=Cucumis sativus GN=Csa_4G665140 PE=4 SV=1
M5WAN0_PRUPE	3.2e-138	72.31	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006213mg PE=4 SV=1
A0A067HC38_CITSI	6.1e-137	69.02	Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g013739mg PE=4 SV=1
V4TGC4_9ROSI	6.1e-137	69.02	Uncharacterized protein OS=Citrus clementina GN=CICLE_v10000992mg PE=4 SV=1
A0A067KKH5_JATCU	4.4e-135	69.92	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_08002 PE=4 SV=1

Match Name	E-value	Identity	Description
AT1G76660.1	1.9e-89	55.62	FUNCTIONS IN: molecular_function unknown
AT5G52430.1	7.1e-20	42.76	hydroxyproline-rich glycoprotein family protein
AT4G25620.1	7.1e-20	55.45	hydroxyproline-rich glycoprotein family protein
AT1G63720.1	5.7e-17	48.65	BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glyc...

Match Name	E-value	Identity	Description
gi|659105232|ref|XP_008453041.1|	1.4e-174	87.91	PREDICTED: uncharacterized protein At1g76660 [Cucumis melo]
gi|449466510|ref|XP_004150969.1|	1.3e-169	86.26	PREDICTED: uncharacterized protein At1g76660 [Cucumis sativus]
gi|1009126185|ref|XP_015880014.1|	1.7e-140	70.19	PREDICTED: uncharacterized protein At1g76660 [Ziziphus jujuba]
gi|645255247|ref|XP_008233411.1|	3.6e-138	72.58	PREDICTED: uncharacterized protein At1g76660 isoform X1 [Prunus mume]
gi|595846660|ref|XP_007209172.1|	4.7e-138	72.31	hypothetical protein PRUPE_ppa006213mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
cellular_component	GO:0005886	plasma membrane
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG02g08520.1	Cp4.1LG02g08520.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	PANTHER	PTHR31798	FAMILY NOT NAMED	coord: 1..364; score: 2.9E
None	No IPR available	PANTHER	PTHR31798:SF3	SUBFAMILY NOT NAMED	coord: 1..364; score: 2.9E

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:

None