Cp4.1LG14g08780 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG14g08780
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Glycine-rich family protein
Location: Cp4.1LG14 : 7254075 .. 7259782 (+)
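
The location field packs the sequence identifier, start, end, and strand into one string. A minimal sketch of how it might be unpacked follows; the function name `parse_location` is hypothetical, and the length calculation assumes 1-based inclusive coordinates (the GFF-style convention), which the record itself does not state.

```python
import re

# Assumed layout of the location string: "<seqid> : <start> .. <end> (<strand>)"
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into seqid, start, end, strand and span length."""
    m = LOCATION_RE.fullmatch(text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: coordinates are 1-based and inclusive at both ends.
        "length": end - start + 1,
    }


loc = parse_location("Cp4.1LG14 : 7254075 .. 7259782 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cp4.1LG14 + 5708
```

Under that assumption the gene spans 5708 bp on the plus strand of Cp4.1LG14.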
The following sequences are available for this feature:

Gene sequence (with intron)

TGTTAGCGAGATGAAGCTTCCCATGCCTTGTGACTTCTTTAATATATATATATATGTATATACATATATATGTATGTATATGTATATATATATACACACACACATTTCTTCCTTGCAATTCCTCTCAGATATTTCTCTCTCTCTCTCTCTCTCTCATCCTCTTTGTTGTTTGGTCTTCTGATCAGTTCTAACACCGATTTCTTTACACTCCCACTCCAAATCTCGCATGGTCAAGGAATTTTAAGGTAATGTTTTCTATATTCATTAGTTTAGTTTTCGATCTACGGTTACTTGATGATTATTGTGATTAGAATCGATAGAGAATTGGAAAAAAGACGAGTCAAACTTAAAATCGTGGTGGTGATGATTATGATGATGATGAGGCTGATGATGATGATCTTTATTTTCTCAGTTACTAGAATTTGAGGTTCGATTTGGACTAGTTTCTGAGAAAATTGGAAGTGTTCTTGTAGAATTGGTTTTTATCTGTTCTTAATGGATATTTTTTCATTGAAGGATGAACTTGTTCGACTATTGAGTAATATTAACGGTTATCGATGTGATTAATGAGGTCGAGGGAGACTTATGGTTGACGATATAATTTCTTGACTAGTTGACAATTATCTTAATGATTGATGAATTATATGTAATAATTCAACGATGGAAGGTAGTTAAGAATTTTTTTCTGAAACGGTTTTCATTCTTAGGTTTTTTCTGCAAAATGGAGAAGAATCAAGAGCTCGAGTGGGCAGAAGCACAACGGATTGAAATAGGTGTTGACCTGGTAGCTGCTGCTAAGCGTCAGCTTCAGTTTCTCTCGGCTGTCGATAGGAATCGATTCCTTTATGAAGGCCCTAGCCTTGAAAGGGCTATTTATCGGTAAGAAATCTAGAGATCGAGTTTTGCTCTAACTGATTTCAAATGTTTTTTTTGTTACTCCTCAAGTGTACTTGTTCATCAGTAATGCGTTTGGCTAAAATTTTTATTCTTAAAGCTTTGTAAATTTATGGAACTATTCTCTTGGATTACTACTTCGTTCTGAATTGATAACAGAGCATTAGATATCAGATTTCTGAAGTGTAAAATTCTACTTTTGTAGATACAACGCTTACTGGCTTCCATTGCTTGCCAAACATTCTGAATCTCCACTATTTGAAGGGCCTTTAGCTGTCCCCTTTGACTGTGAATGGATTTGGCATTGTCACAGATTGAATCCTGTACCTTCTAGACTTTGATGTATATAGTTCATTGGTAGACTGGTTTATGTGGTTGAGTGGCTAACAAAATTTTACTTAATTTGTGTAGGTGCGGTACAAATCTGAGTGTGAGGAACTTTATGGAAAGATACTTGACAATTCCAATGTTGTATCGACCCTAGGAAGTTCTTGTCTAAGAGAAACCGAAGAAGTTTGGAATGAATTATACCCTGAAGAGCCTTTCAACTTCAACTTCAACCCAACCGGAGAGTATCAAGAGGATGCCCTGAAAGTGCTCTCGGGACTTCAAAAATATACCAAATATGATCTTGTTTCAGCTGTTAAAAGACAGAGCCCCTTCTTTTATCAGGTACTGATACTATGATAATTTTATTTGCAAGTGTGCGCAATGAACTTGACTTTGTGCTTAAACTGTCTTAGTCCCCCAAAAAAGAAGGTGATTTAAAAGATAGAAAATGTTCTCAATTTTTATATTCAATCACTGGACGATTTGATCTAACAAGTTTGAACTTTTTCTGTATGATCTTTCTTTTAGTTTCCCTCTAGGGAACTTCTTGTGTTCTTTGTTTATTTTCAGCATCTTTGCCAATGATGACATTAGCATTTTTAGTAAAAATAAAATGTATCATTTTACACCAATATTATTCAGATGATTGAACTTCCCGTCATTCATCATATGTTCTGTCCTTCAATTGCTGTGGACAGGTGTCCCGACCTCATATGGACAATGAAGTTTTCCTTGAAGAAGCTGTGGCTAGATACAAAGGATTTCTATATTTAATC
AAAAGCAACAGGGAGAGGTCTATAAAACGCTTTTGTGTTCCAACATATGATATTGATCTAATTTGGCATACTCATCAATTGCACCCTATTTCATATTGTAAAGACTTGAAAAACTTACTTGGTATGGTATTGGAGCATGATGATATGGATTCAGACCGAACAAAAGGGAAGAAATTGGATACTGGGTTCTCTGGAACTACAAAACAGTGGGAAGATACATTTGGTACTAGGTACTGGAAGGCAGGGGCAATGTATAGAGGCAATAGCCCATCTCCTCTCCTGCTAAATTCGTATTCAGGTTCAACTAACTTGATTAAAGACGACATGGTTTCATCCCAAGAGTGTCAAAACACAGTTCATCTTCCAGAATTAAAGACGGTTGAGGTAGCTCATTTTTCTTTATTTATTCTCATATTCCATTCCCGTTTTTCCCATGAATTTTTTGTGAAAATCTTAAGGTAACTGACACAAATGTATAAATATGGAAATGAACTTATCCTACTTCCACTTGTTATATATTACACGTTTCTATGTTCAGGTCCTATTGGAGTTTGTGGAAGTTAAAAACAAACCAGAAGGGTTGAAAGGAAATTTTTTTGTGCAATTTATGAAAAGTCAACCAGATGCAATATTCAGTGCCAAACGGAAACTAAGCATATTATCTGAGACAGGGGTGAAACAGGTAGCTTCTTTTCAGTGTGAACCTAAAGGAGATCTGCAATTTGAGCTCATCTGCAATAGGGCTTCCAGCATACCGATAACACGGACCAGCTTAACATTGGGCTCCATTTCACTCCCTCTACATGACATTCTTGTTCCAACCTCTAAGCTCTCAATGGAGAGATGGTTGGAGTTGAAGCCAGTTTCTGACCATGTAAGCTCAAAGCCAATTTCCCTCAGAGTGGCTATCTCATTCACCGTTCCCCACCCTGCACCACGGGAACTTCATATGTTTTTTTCTCGAGAACTATCCAGATGGACTTCTTTTCTCCCTTCCTGCACAAAGATGCAGCACTCAAAGGGTTGGATGCAAGTCACTGATGAAGCAGGCAATGAGGTCATCAGCCTTCAATTGAGGTAAGGTGATTCTTCCTTTCTAGTAGCTCAGGAGTTGATGCTATATTTATGGTTACAAGTACATACTCACTGTCTAAGAAGCATGGCGGAATATGAGACACAAATATGACACGACATGGATATGGAGACACATTCATTAAAATCTAACAAACCGACATGGAAAAGATACGTTTAATAAAATACACATTTATTTATTATTATTATTTTCTTATACTAGAGGAAAAATCAAAGTAAATAAGTCTATGCATTTATATTATGCAAAAAAAAAAAAAAAAGTTTGATATATTTCGCTTACAAAATTGGGGTTTAATACATGTCAAACATGTTGGACTCGGTTGATTTTGTATAACTATTGTCCAACACATAGCTATTGTGCTAACAAGTGTTCAACTAGTATGAGAGTGTTCAAGTGTCAGACACGGACACACTAGCCACACTAAAGTGTTTGTGCTTCTTAACTCACCATGTTTCCTCTTATGTGTTTGCTTTATAATAATAATGTTGCAATTGCAGTTGATATAAATGCGGTCTTTGTAGATTAATCATCTTTGTAGACAATTATCCAAAGTCTTCAATGAGCAATCTTTGAACTCGTTTGACATCCCATTTGTGAAGTTCATAAAGTCAAGTTCCTCTATACCTCTAAAGCATAGAGATGCTTTTATTCGACAGATTAGAGTCTTGGATACCACTTTCCTACATTCCCAATGTCAATGACAATTTATATCATAGTTTTATAACTTGTTTACAAATATTATCATATAATTACACTGCAGTTATGCGGTTCCTAATTTTCTTTGTTTTCATTCTATTTTTATAAGGGATTCTTTGAAGGAAAAGGTGGGGAAGAATTCCATTCCAACAAGTAAGGAGGTGATTGGCATAAAAATGTCCGGTGAATCAAGCCTTCTTGCTGAGTTTGT
AAAGACAGGGTGGTCTCTGATTGATGGTCAATGGTTTCTTGATTTTCAAAAAAAATCCAATGAAGATGATCATCTTTTCAAGCTTGTGGGCAAGCGGCTGGTATGTATGGTCCCTGTTGTGGTAATTTGATTCTTATTATTTCATGATGCTGGAGCTCTTAAAAGAAAAAACACTCTGCTTGTCTATTATCATATCGAAAAACTAGATAGTATGCTGCTATTTTCCTTATTTGCCAGAAAAACTAGACATAGTAACACAATGAAATTTTCGTATCATATACAAGTCGATATAATTATTTGAGCATGCAATGCGTTCATCTACTAACTTTTCTCTAACAAACATTTGCACCCGAAATTCAAGCATACTAGTTGAATTTCCTAAGCTGATTTGGCCTTGAATGTTTAGGAGACCTTCATGTTTAAAAATAACATGTCAATGGAATAACTTTTTCAGCATGTAACTTTTCATCTTGATGACTTTAAATGTTTCCTATTTTCTGGCAAACATTTGCTTGTATAGGTGAAGTTTTTCCAGGGCAGTAAGCTGGATTATGAGCCCAAAAATTGTGGTAAACATAACCATGATTCGGACTTCGTGACTGCTATCGAATTCTCTGCAGAATATCCGTATGGAAGAGCAGTGGCATTGTTTGACTTGAAATTTGGAGTTATTAAGGTACTACTTCCATTCACTCTTGCAAATTGAATGTTTGAGAATGATACTGACCGACTCTATATTGTGATAAAAAGATAAAAGAAGAATGGATGGTGGTGCCCGGAATCATGACAGCTTTTCTTCTCCTTCATGCACGGAAGAAAAAAGGGTATAATGGCCTAACTGTAAGTGAAGAAAACTTGGAGGTTGCCCCTGTTCCTGAGAGTGTTCATACGTCTGGCAAGGAAGAAAATAATATGAATTTAATCAATGTATCCTTGTCAAGTACTGATTTAAAAGTCAATGTTTCTGAAGGTGTTTCTAACGAGAATGTAACAATACCTTTGAAAGAGGATGAGCTTAGTAGTCATTGTGGCCGATGTGATGCTGGAGGTTACACTGCAGGGTCCGGAAACATGGTGAAGAGCGGTAGATGTGGTGGATGCGGTGCTGGCGGTTGTGGTGGTGGGTGTGGAAACATGGTGAAGAGCGGTGGCTGTGGGGGTTGCGGTGCTGGCGGTTGTGGTGGTGGATGCGGAAATATTATAAACAGTGGTGGCTGTGGTGGATGCGGCGCTGGCGGCTGTGGTGGTGGGTGCGGAAATATTGTAAACAGTGGCGGATGTGGTGGATGCGGTGCTGGAGGTTGTGGTGGTGGGTGCGGAAATATTGTAAATAGTGGTAGTTCTGTTGGTGGAATCGTGGCAAAGAGTGGTGGGTGTGGCGGCGGATGCGGCGGTTTTGGCCATAAAACTGCCCAACCAAATGAAGGCAACCAGAGTGATGCAATTATTGAATGAGGTATTACGGAATGCCTAATAAAACAAAGTGTTTGTATTGTATTACTTTCCTTCCATATAAAGAAAGTATAATACCCAAAGCCTAAAAACCTATCCAATAATAAACACGATTGTATGCCTGTGTGAATGTCATGGTTTGTGTCTGTCTTATTATCTGTAAAGTGTTTGTCTGTATATATTCACAAATAAATTCGCACGACCTTTTTATTTCTTTTCAATTTTGAACTTATATATTATTAAGATTTGTTCGAGT

mRNA sequence

TGTTAGCGAGATGAAGCTTCCCATGCCTTGTGACTTCTTTAATATATATATATATGTATATACATATATATGTATGTATATGTATATATATATACACACACACATTTCTTCCTTGCAATTCCTCTCAGATATTTCTCTCTCTCTCTCTCTCTCTCATCCTCTTTGTTGTTTGGTCTTCTGATCAGTTCTAACACCGATTTCTTTACACTCCCACTCCAAATCTCGCATGGTCAAGGAATTTTAAGGTTTTTTCTGCAAAATGGAGAAGAATCAAGAGCTCGAGTGGGCAGAAGCACAACGGATTGAAATAGGTGTTGACCTGGTAGCTGCTGCTAAGCGTCAGCTTCAGTTTCTCTCGGCTGTCGATAGGAATCGATTCCTTTATGAAGGCCCTAGCCTTGAAAGGGCTATTTATCGATACAACGCTTACTGGCTTCCATTGCTTGCCAAACATTCTGAATCTCCACTATTTGAAGGGCCTTTAGCTGTCCCCTTTGACTGTGAATGGATTTGGCATTGTCACAGATTGAATCCTGTGCGGTACAAATCTGAGTGTGAGGAACTTTATGGAAAGATACTTGACAATTCCAATGTTGTATCGACCCTAGGAAGTTCTTGTCTAAGAGAAACCGAAGAAGTTTGGAATGAATTATACCCTGAAGAGCCTTTCAACTTCAACTTCAACCCAACCGGAGAGTATCAAGAGGATGCCCTGAAAGTGCTCTCGGGACTTCAAAAATATACCAAATATGATCTTGTTTCAGCTGTTAAAAGACAGAGCCCCTTCTTTTATCAGGTGTCCCGACCTCATATGGACAATGAAGTTTTCCTTGAAGAAGCTGTGGCTAGATACAAAGGATTTCTATATTTAATCAAAAGCAACAGGGAGAGGTCTATAAAACGCTTTTGTGTTCCAACATATGATATTGATCTAATTTGGCATACTCATCAATTGCACCCTATTTCATATTGTAAAGACTTGAAAAACTTACTTGGTATGGTATTGGAGCATGATGATATGGATTCAGACCGAACAAAAGGGAAGAAATTGGATACTGGGTTCTCTGGAACTACAAAACAGTGGGAAGATACATTTGGTACTAGGTACTGGAAGGCAGGGGCAATGTATAGAGGCAATAGCCCATCTCCTCTCCTGCTAAATTCGTATTCAGGTTCAACTAACTTGATTAAAGACGACATGGTTTCATCCCAAGAGTGTCAAAACACAGTTCATCTTCCAGAATTAAAGACGGTTGAGGTCCTATTGGAGTTTGTGGAAGTTAAAAACAAACCAGAAGGGTTGAAAGGAAATTTTTTTGTGCAATTTATGAAAAGTCAACCAGATGCAATATTCAGTGCCAAACGGAAACTAAGCATATTATCTGAGACAGGGGTGAAACAGGTAGCTTCTTTTCAGTGTGAACCTAAAGGAGATCTGCAATTTGAGCTCATCTGCAATAGGGCTTCCAGCATACCGATAACACGGACCAGCTTAACATTGGGCTCCATTTCACTCCCTCTACATGACATTCTTGTTCCAACCTCTAAGCTCTCAATGGAGAGATGGTTGGAGTTGAAGCCAGTTTCTGACCATGTAAGCTCAAAGCCAATTTCCCTCAGAGTGGCTATCTCATTCACCGTTCCCCACCCTGCACCACGGGAACTTCATATGTTTTTTTCTCGAGAACTATCCAGATGGACTTCTTTTCTCCCTTCCTGCACAAAGATGCAGCACTCAAAGGGTTGGATGCAAGTCACTGATGAAGCAGGCAATGAGGTCATCAGCCTTCAATTGAGGGATTCTTTGAAGGAAAAGGTGGGGAAGAATTCCATTCCAACAAGTAAGGAGGTGATTGGCATAAAAATGTCCGGTGAATCAAGCCTTCTTGCTGAGTTTGTAAAGACAGGGTGGTCTCTGATTGATGGTCAATGGTTTCTTGATTTTCAAAAAAAATCCAATGAAGATGATCATCTTTTCAAGCTTGTGGGCAAGCGGCTGG
TATGTATGGTCCCTGTTGTGGTGAAGTTTTTCCAGGGCAGTAAGCTGGATTATGAGCCCAAAAATTGTGGTAAACATAACCATGATTCGGACTTCGTGACTGCTATCGAATTCTCTGCAGAATATCCGTATGGAAGAGCAGTGGCATTGTTTGACTTGAAATTTGGAGTTATTAAGATAAAAGAAGAATGGATGGTGGTGCCCGGAATCATGACAGCTTTTCTTCTCCTTCATGCACGGAAGAAAAAAGGGTATAATGGCCTAACTGTAAGTGAAGAAAACTTGGAGGTTGCCCCTGTTCCTGAGAGTGTTCATACGTCTGGCAAGGAAGAAAATAATATGAATTTAATCAATGTATCCTTGTCAAGTACTGATTTAAAAGTCAATGTTTCTGAAGGTGTTTCTAACGAGAATGTAACAATACCTTTGAAAGAGGATGAGCTTAGTAGTCATTGTGGCCGATGTGATGCTGGAGGTTACACTGCAGGGTCCGGAAACATGGTGAAGAGCGGTAGATGTGGTGGATGCGGTGCTGGCGGTTGTGGTGGTGGGTGTGGAAACATGGTGAAGAGCGGTGGCTGTGGGGGTTGCGGTGCTGGCGGTTGTGGTGGTGGATGCGGAAATATTATAAACAGTGGTGGCTGTGGTGGATGCGGCGCTGGCGGCTGTGGTGGTGGGTGCGGAAATATTGTAAACAGTGGCGGATGTGGTGGATGCGGTGCTGGAGGTTGTGGTGGTGGGTGCGGAAATATTGTAAATAGTGGTAGTTCTGTTGGTGGAATCGTGGCAAAGAGTGGTGGGTGTGGCGGCGGATGCGGCGGTTTTGGCCATAAAACTGCCCAACCAAATGAAGGCAACCAGAGTGATGCAATTATTGAATGAGGTATTACGGAATGCCTAATAAAACAAAGTGTTTGTATTGTATTACTTTCCTTCCATATAAAGAAAGTATAATACCCAAAGCCTAAAAACCTATCCAATAATAAACACGATTGTATGCCTGTGTGAATGTCATGGTTTGTGTCTGTCTTATTATCTGTAAAGTGTTTGTCTGTATATATTCACAAATAAATTCGCACGACCTTTTTATTTCTTTTCAATTTTGAACTTATATATTATTAAGATTTGTTCGAGT

Coding sequence (CDS)

ATGGAGAAGAATCAAGAGCTCGAGTGGGCAGAAGCACAACGGATTGAAATAGGTGTTGACCTGGTAGCTGCTGCTAAGCGTCAGCTTCAGTTTCTCTCGGCTGTCGATAGGAATCGATTCCTTTATGAAGGCCCTAGCCTTGAAAGGGCTATTTATCGATACAACGCTTACTGGCTTCCATTGCTTGCCAAACATTCTGAATCTCCACTATTTGAAGGGCCTTTAGCTGTCCCCTTTGACTGTGAATGGATTTGGCATTGTCACAGATTGAATCCTGTGCGGTACAAATCTGAGTGTGAGGAACTTTATGGAAAGATACTTGACAATTCCAATGTTGTATCGACCCTAGGAAGTTCTTGTCTAAGAGAAACCGAAGAAGTTTGGAATGAATTATACCCTGAAGAGCCTTTCAACTTCAACTTCAACCCAACCGGAGAGTATCAAGAGGATGCCCTGAAAGTGCTCTCGGGACTTCAAAAATATACCAAATATGATCTTGTTTCAGCTGTTAAAAGACAGAGCCCCTTCTTTTATCAGGTGTCCCGACCTCATATGGACAATGAAGTTTTCCTTGAAGAAGCTGTGGCTAGATACAAAGGATTTCTATATTTAATCAAAAGCAACAGGGAGAGGTCTATAAAACGCTTTTGTGTTCCAACATATGATATTGATCTAATTTGGCATACTCATCAATTGCACCCTATTTCATATTGTAAAGACTTGAAAAACTTACTTGGTATGGTATTGGAGCATGATGATATGGATTCAGACCGAACAAAAGGGAAGAAATTGGATACTGGGTTCTCTGGAACTACAAAACAGTGGGAAGATACATTTGGTACTAGGTACTGGAAGGCAGGGGCAATGTATAGAGGCAATAGCCCATCTCCTCTCCTGCTAAATTCGTATTCAGGTTCAACTAACTTGATTAAAGACGACATGGTTTCATCCCAAGAGTGTCAAAACACAGTTCATCTTCCAGAATTAAAGACGGTTGAGGTCCTATTGGAGTTTGTGGAAGTTAAAAACAAACCAGAAGGGTTGAAAGGAAATTTTTTTGTGCAATTTATGAAAAGTCAACCAGATGCAATATTCAGTGCCAAACGGAAACTAAGCATATTATCTGAGACAGGGGTGAAACAGGTAGCTTCTTTTCAGTGTGAACCTAAAGGAGATCTGCAATTTGAGCTCATCTGCAATAGGGCTTCCAGCATACCGATAACACGGACCAGCTTAACATTGGGCTCCATTTCACTCCCTCTACATGACATTCTTGTTCCAACCTCTAAGCTCTCAATGGAGAGATGGTTGGAGTTGAAGCCAGTTTCTGACCATGTAAGCTCAAAGCCAATTTCCCTCAGAGTGGCTATCTCATTCACCGTTCCCCACCCTGCACCACGGGAACTTCATATGTTTTTTTCTCGAGAACTATCCAGATGGACTTCTTTTCTCCCTTCCTGCACAAAGATGCAGCACTCAAAGGGTTGGATGCAAGTCACTGATGAAGCAGGCAATGAGGTCATCAGCCTTCAATTGAGGGATTCTTTGAAGGAAAAGGTGGGGAAGAATTCCATTCCAACAAGTAAGGAGGTGATTGGCATAAAAATGTCCGGTGAATCAAGCCTTCTTGCTGAGTTTGTAAAGACAGGGTGGTCTCTGATTGATGGTCAATGGTTTCTTGATTTTCAAAAAAAATCCAATGAAGATGATCATCTTTTCAAGCTTGTGGGCAAGCGGCTGGTATGTATGGTCCCTGTTGTGGTGAAGTTTTTCCAGGGCAGTAAGCTGGATTATGAGCCCAAAAATTGTGGTAAACATAACCATGATTCGGACTTCGTGACTGCTATCGAATTCTCTGCAGAATATCCGTATGGAAGAGCAGTGGCATTGTTTGACTTGAAATTTGGAGTTATTAAGATAAAAGAAGAATGGATGGTGGTGCCCGGAATCATGACAGCTTTTCTTCTCCTTCATGCACGGAAGAAAAAAGGGTATAATGG
CCTAACTGTAAGTGAAGAAAACTTGGAGGTTGCCCCTGTTCCTGAGAGTGTTCATACGTCTGGCAAGGAAGAAAATAATATGAATTTAATCAATGTATCCTTGTCAAGTACTGATTTAAAAGTCAATGTTTCTGAAGGTGTTTCTAACGAGAATGTAACAATACCTTTGAAAGAGGATGAGCTTAGTAGTCATTGTGGCCGATGTGATGCTGGAGGTTACACTGCAGGGTCCGGAAACATGGTGAAGAGCGGTAGATGTGGTGGATGCGGTGCTGGCGGTTGTGGTGGTGGGTGTGGAAACATGGTGAAGAGCGGTGGCTGTGGGGGTTGCGGTGCTGGCGGTTGTGGTGGTGGATGCGGAAATATTATAAACAGTGGTGGCTGTGGTGGATGCGGCGCTGGCGGCTGTGGTGGTGGGTGCGGAAATATTGTAAACAGTGGCGGATGTGGTGGATGCGGTGCTGGAGGTTGTGGTGGTGGGTGCGGAAATATTGTAAATAGTGGTAGTTCTGTTGGTGGAATCGTGGCAAAGAGTGGTGGGTGTGGCGGCGGATGCGGCGGTTTTGGCCATAAAACTGCCCAACCAAATGAAGGCAACCAGAGTGATGCAATTATTGAATGA

Protein sequence

MEKNQELEWAEAQRIEIGVDLVAAAKRQLQFLSAVDRNRFLYEGPSLERAIYRYNAYWLPLLAKHSESPLFEGPLAVPFDCEWIWHCHRLNPVRYKSECEELYGKILDNSNVVSTLGSSCLRETEEVWNELYPEEPFNFNFNPTGEYQEDALKVLSGLQKYTKYDLVSAVKRQSPFFYQVSRPHMDNEVFLEEAVARYKGFLYLIKSNRERSIKRFCVPTYDIDLIWHTHQLHPISYCKDLKNLLGMVLEHDDMDSDRTKGKKLDTGFSGTTKQWEDTFGTRYWKAGAMYRGNSPSPLLLNSYSGSTNLIKDDMVSSQECQNTVHLPELKTVEVLLEFVEVKNKPEGLKGNFFVQFMKSQPDAIFSAKRKLSILSETGVKQVASFQCEPKGDLQFELICNRASSIPITRTSLTLGSISLPLHDILVPTSKLSMERWLELKPVSDHVSSKPISLRVAISFTVPHPAPRELHMFFSRELSRWTSFLPSCTKMQHSKGWMQVTDEAGNEVISLQLRDSLKEKVGKNSIPTSKEVIGIKMSGESSLLAEFVKTGWSLIDGQWFLDFQKKSNEDDHLFKLVGKRLVCMVPVVVKFFQGSKLDYEPKNCGKHNHDSDFVTAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMVVPGIMTAFLLLHARKKKGYNGLTVSEENLEVAPVPESVHTSGKEENNMNLINVSLSSTDLKVNVSEGVSNENVTIPLKEDELSSHCGRCDAGGYTAGSGNMVKSGRCGGCGAGGCGGGCGNMVKSGGCGGCGAGGCGGGCGNIINSGGCGGCGAGGCGGGCGNIVNSGGCGGCGAGGCGGGCGNIVNSGSSVGGIVAKSGGCGGGCGGFGHKTAQPNEGNQSDAIIE
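
The protein sequence corresponds to the coding sequence read codon by codon. As an illustrative sketch only, assuming the standard nuclear genetic code (the record does not say which table was used), the first ten codons of the CDS can be translated as below; `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGGAGAAGAATCAAGAGCTCGAGTGGGCA"))  # MEKNQELEWA
```

The output matches the first ten residues of the protein sequence, and the final codons of the CDS (`GAT GCA ATT ATT GAA TGA`) give `DAIIE` followed by a stop, matching its last five residues.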
BLAST of Cp4.1LG14g08780 vs. Swiss-Prot
Match: GRDP1_ARATH (Glycine-rich domain-containing protein 1 OS=Arabidopsis thaliana GN=GRDP1 PE=2 SV=1)

HSP 1 Score: 766.9 bits (1979), Expect = 2.3e-220
Identity = 432/859 (50.29%), Positives = 570/859 (66.36%), Query Frame = 1

Query: 2   EKNQELEWAEAQRIEIGVDLVAAAKRQLQFLSAVDRNRFLYEGPSLERAIYRYNAYWLPL 61
           EK+ E+EW EAQ+IEI VDL+AAAK+ L FL  VDRNR+LY+GP+LE+AIYRYNA WLPL
Sbjct: 4   EKDHEVEWLEAQKIEISVDLLAAAKQHLLFLETVDRNRWLYDGPALEKAIYRYNACWLPL 63

Query: 62  LAKHSESP-LFEGPLAVPFDCEWIWHCHRLNPVRYKSECEELYGKILDNSNVVSTLGSSC 121
           L K+SES  + EG L  P DCEWIWHCHRLNPVRY S+CE+ YG++LDNS V+S++  +C
Sbjct: 64  LVKYSESSSVSEGSLVPPLDCEWIWHCHRLNPVRYNSDCEQFYGRVLDNSGVLSSVDGNC 123

Query: 122 LRETEEVWNELYPEEPFNFNFNPTGEYQEDALKVLSGLQKYTKYDLVSAVKRQSPFFYQV 181
             +TE++W  LYP+EP+  + +      ED  +  S L+K TKYDLVSAVKRQSPF+YQV
Sbjct: 124 KLKTEDLWKRLYPDEPYELDLDNID--LEDISEKSSALEKCTKYDLVSAVKRQSPFYYQV 183

Query: 182 SRPHMDNEVFLEEAVARYKGFLYLIKSNRERSIKRFCVPTYDIDLIWHTHQLHPISYCKD 241
           SR H+++++FL+EAVARYKGFLYLIK NRERS+KRFCVPTYD+DLIWHTHQLHP+SYC D
Sbjct: 184 SRSHVNSDIFLQEAVARYKGFLYLIKMNRERSLKRFCVPTYDVDLIWHTHQLHPVSYCDD 243

Query: 242 LKNLLGMVLEHDDMDSDRTKGKKLDTGFSGTTKQWEDTFGTRYWKAGAMYRGNSPSPLLL 301
           +  L+G VLEHDD DSDR KGKKLDTGFS TT QWE+TFGTRYWKAGAM+RG +P P+  
Sbjct: 244 MVKLIGKVLEHDDTDSDRGKGKKLDTGFSKTTAQWEETFGTRYWKAGAMHRGKTPVPVTN 303

Query: 302 NSYSGSTNLIKDDMVSSQECQNTVHLPELKTVEVLLEFVEVKNKPEGLKGNFFVQFMKSQ 361
           + Y+    L+KD   +  + QN +  PE++ VEVLLE + V+N P+G KG   V F K+Q
Sbjct: 304 SPYASDV-LVKDP-TAKDDFQNLIQFPEVEVVEVLLEIIGVRNLPDGHKGKVSVMFSKTQ 363

Query: 362 PDAIFSAKRKLSILSETGVKQVASFQCEPKGDLQFELICNRASSIPITRTSLTLGSISLP 421
           PD++F+A+R+L+ILSE G KQVA+FQCEP G+L F+LI    S IP++R    LG  SL 
Sbjct: 364 PDSLFNAERRLTILSEVGEKQVATFQCEPTGELVFKLISCSPSKIPVSREPKNLGFASLS 423

Query: 422 LHDILVPT-SKLSMERWLELKPVS-DHVSSKPISLRVAISFTVPHPAPRELHMFFSRELS 481
           L + L P  ++LS+E+WLEL P       +KPISLRVA+SFT P  +P  LHM  SR   
Sbjct: 424 LKEFLFPVITQLSVEKWLELTPSKGSQTDTKPISLRVAVSFTPPVRSPSVLHMVQSRPSC 483

Query: 482 RWTSFLPSCTKMQHSKGWMQVTDEAGNEVISLQLRDSLKEKVGKNSIPTSKEVIGIKMSG 541
           + + F P   K + +K    + DE   EVI+LQ+R+S    + K+     ++V+G+  SG
Sbjct: 484 KGSCFFPIIGKSRLAKSSTHIVDETQTEVITLQIRNSADGGILKDD---QRQVMGVTDSG 543

Query: 542 ESSLLAEFVKTGWSLIDGQWFLDFQKKSNEDDHLFKLVGKRLVCMVPVVVKFFQGSKLDY 601
           E+ +LA +  + WSL+D +W L     S  D+ LF+++G R       VVK F G KLDY
Sbjct: 544 ETRVLAVYTGSFWSLLDSKWSLKQINASTADNPLFEILGPR-------VVKIFSGRKLDY 603

Query: 602 EPKNCGKHNHDSDFVTAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMVVPGIMTAFLLLH 661
           EPK+C     D DF+T +EFS ++PYG+ V L D++FG I+ KE W+++PGI++AF+L  
Sbjct: 604 EPKHCANLRSDLDFMTLVEFSKQHPYGKTVGLVDMRFGSIEAKENWLLLPGIVSAFILHT 663

Query: 662 ARKKKGYNGLTVSEENLEVAPVPESVHTS--GKEENNMNLINVSLSSTDLKVNVSEGVSN 721
             KK G  G  V+ ++++     ES  T      ENN+N      +ST+++   +     
Sbjct: 664 VLKKGGSEGFNVTTKDIK----EESKQTKLVAATENNVNA-----NSTNVETQTA----- 723

Query: 722 ENVTIPLKEDELSSHCGRCDAGGYTAGSGNMVKSGRCGGCGAGGCGGGCGNMVKSGG-CG 781
             +T P K     S CG    GG +   GNMVK+    GCG+  C G CG+MVKS     
Sbjct: 724 --ITAPKK----GSGCG----GGCSGECGNMVKAANASGCGSS-CSGECGDMVKSAANAS 783

Query: 782 GCGAGGCGGGCGNIINSGGCGGCGAGGCGGGCGNIVNSGGCGGCGAGGCGGGCGNIVNSG 841
           GCG+G C G CGN++ +    G   GG G  C     + GCGG   GGCGGGCG++V S 
Sbjct: 784 GCGSG-CSGECGNMVKAANASG---GGYGARC-KAAKASGCGGGCGGGCGGGCGDMVKS- 810

Query: 842 SSVGGIVAKSGGCGGGCGG 855
                    + GCGGGC G
Sbjct: 844 -------VNASGCGGGCNG 810
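
The percentages in an HSP summary line are the match counts divided by the alignment length. A small sketch of that arithmetic, using a hypothetical helper `hsp_percentages` that reads every `label = matched/length` pair from such a line:

```python
import re

# Matches "<label> = <matched>/<length>"; fields without a slash are ignored.
PAIR_RE = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(summary_line):
    """Return {label: percentage} for every matched/length pair in the line."""
    result = {}
    for label, matched, length in PAIR_RE.findall(summary_line):
        result[label] = round(100.0 * int(matched) / int(length), 2)
    return result


# Identity figures from the GRDP1_ARATH hit above.
print(hsp_percentages("Identity = 432/859 (50.29%), Query Frame = 1"))
# {'Identity': 50.29}
```

Both counts share the same denominator, the alignment length including gap columns (859 for this HSP), so the figures are not fractions of the full query or subject protein length.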

BLAST of Cp4.1LG14g08780 vs. Swiss-Prot
Match: GRDP2_ARATH (Glycine-rich domain-containing protein 2 OS=Arabidopsis thaliana GN=GRDP2 PE=2 SV=1)

HSP 1 Score: 731.9 bits (1888), Expect = 8.4e-210
Identity = 419/846 (49.53%), Positives = 553/846 (65.37%), Query Frame = 1

Query: 2   EKNQELEWAEAQRIEIGVDLVAAAKRQLQFLSAVDRNRFLYEGPSLERAIYRYNAYWLPL 61
           EK Q LEW EAQ+I+I VDL+AAAK+ L FL AVDRNR LY+GP+L+RAIYRYNAYWLPL
Sbjct: 4   EKEQTLEWNEAQKIDISVDLLAAAKKHLLFLGAVDRNRCLYDGPALQRAIYRYNAYWLPL 63

Query: 62  LAKHSESP-LFEGPLAVPFDCEWIWHCHRLNPVRYKSECEELYGKILDNSNVVSTLGSSC 121
           LA+++ES  + +GPL  P DCEW+WHCHRLNPVRYK++CE+ YG++LDNS VVS++  +C
Sbjct: 64  LAQYTESSSICQGPLVPPLDCEWVWHCHRLNPVRYKTDCEQFYGRVLDNSGVVSSVNGNC 123

Query: 122 LRETEEVWNELYPEEPFNFNFNPTGEYQEDALKVLSGLQKYTKYDLVSAVKRQSPFFYQV 181
             +TE +W  LYP EP++ +F        D    +S L+K T YDLV AVKRQSPFFYQV
Sbjct: 124 KSQTETLWKRLYPTEPYDLDFANAISEPAD----VSALEKCTTYDLVLAVKRQSPFFYQV 183

Query: 182 SRPHMDNEVFLEEAVARYKGFLYLIKSNRERSIKRFCVPTYDIDLIWHTHQLHPISYCKD 241
           SR H+DN+VFL+EAVARYK FLYLIK NRERSIK FCVPTYDIDLIWHTHQLH ISYC D
Sbjct: 184 SRAHVDNDVFLQEAVARYKAFLYLIKGNRERSIKLFCVPTYDIDLIWHTHQLHAISYCND 243

Query: 242 LKNLLGMVLEHDDMDSDRTKGKKLDTGFSGTTKQWEDTFGTRYWKAGAMYRGNSPSPLLL 301
           L  ++G VLEHDD DSDR+KGKKLDTGFSGTT QWE+TFG RYWKAGAM RGN+P P+  
Sbjct: 244 LTKMIGKVLEHDDTDSDRSKGKKLDTGFSGTTAQWEETFGRRYWKAGAMNRGNTPKPVTT 303

Query: 302 NSY--SGSTNLIKDDMVSSQECQNTVHLPELKTVEVLLEFVEVKNKPEGLKGNFFVQFMK 361
           + Y  SG  ++ K+     +E QN +  PE+K +EV+LE V VKN P+  KG  FV F K
Sbjct: 304 SPYVCSGKKSIAKE-----EESQNVIQYPEVKVIEVILEIVGVKNLPDAHKGKVFVLFSK 363

Query: 362 SQPDAIFSAKRKLSILSET-GVKQVASFQCEPKGDLQFELICNRASSIPITRTSLTLGSI 421
           +QPD++F+A+R+L++LSE+ G KQVA FQCEP G+L F+L+ +++ S         LG  
Sbjct: 364 TQPDSLFNAERRLTVLSESCGEKQVALFQCEPTGELSFQLMSSKSKS---------LGFT 423

Query: 422 SLPLHDILVPTSKLSMERWLELKPVSDHVSS--KPISLRVAISFTVPHPAPRELHMFFSR 481
           SL   + L P +KLS+E+WLEL P     +    PISLRVA+SFT P  +P  LH+  +R
Sbjct: 424 SLSFSEFLSPVTKLSVEKWLELTPTKRGKADDPNPISLRVAVSFTPPTRSPTVLHLVQAR 483

Query: 482 ELSRWTSFLPSCTKMQHSKGWMQVTDEAGNEVISLQLRDSLKEKVGKNSIPTSKEVIGIK 541
              + + FLP   K++ +K + +V DE   EVI+LQ+R+S  +   K      ++VIG+K
Sbjct: 484 PSLKGSCFLPMLRKVRLAKSFTRVVDETETEVINLQMRNS-NDAAPKGD---RRQVIGVK 543

Query: 542 MSGESSLLAEFVKTGWSLIDGQWFLDFQKKSNEDDHLFKLVGKRLVCMVPVVVKFFQGSK 601
             GE+ +LAE+  T WSL+D +W L        D  LF+L G R+       VK + G K
Sbjct: 544 ECGETYVLAEYDGTFWSLLDSKWSLKQTCNPATDGPLFELSGTRM-------VKVYSGRK 603

Query: 602 LDYEPKNCGKHNHDSDFVTAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMVVPGIMTAFL 661
           L+YEPK+C K   + DF+TA+EFS ++PYG+AV L DLKFG I+  E+W+V+PG++++F+
Sbjct: 604 LEYEPKHCSKLRSEQDFMTAVEFSKQHPYGKAVGLLDLKFGSIEANEKWLVLPGMVSSFI 663

Query: 662 LLHARKKKGYNGLTVSEENLEVAPVPESVHTSGKEENNMNLINVSLSSTDLKVNVSEGVS 721
           L    KK+G++           A   ++V  +G  E +  +  +S    +      E + 
Sbjct: 664 LSDLLKKEGFS-----------AAAKDTVKANGITEESTEIDVLSQEKLE-----EETMM 723

Query: 722 NENVTIPLK-EDELSSHCGRCDAGGYTAGSGNMV--KSGRCGGC-GAGGCGGGCGNMVKS 781
           + + T P+    E  +   RC +      SGNM+  + G CGGC G GGCGGG       
Sbjct: 724 DVDTTTPVAVAAEKINGGARCFSKEL---SGNMIEEEGGHCGGCGGCGGCGGG------- 774

Query: 782 GGCGGCGAGGCGGGCGNIINSGGCGGCGAGGCGGGCGNIVNSGGCGGCGAGGCGGGCGNI 838
           GGCGG      GG CG +    GCGG   G C GG      S GC     G CGGGCGN+
Sbjct: 784 GGCGG------GGRCGGMTKIEGCGG---GSCTGG------STGC-----GNCGGGCGNM 774

BLAST of Cp4.1LG14g08780 vs. TrEMBL
Match: A0A0A0L5L1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G033730 PE=4 SV=1)

HSP 1 Score: 1345.1 bits (3480), Expect = 0.0e+00
Identity = 695/884 (78.62%), Positives = 748/884 (84.62%), Query Frame = 1

Query: 1   MEKNQELEWAEAQRIEIGVDLVAAAKRQLQFLSAVDRNRFLYEGPSLERAIYRYNAYWLP 60
           MEKNQELEW EAQ+IEIGVDLVAAAKRQLQFLSAVDR+RFLYE PSLERAIYRYNAYWLP
Sbjct: 1   MEKNQELEWVEAQQIEIGVDLVAAAKRQLQFLSAVDRSRFLYESPSLERAIYRYNAYWLP 60

Query: 61  LLAKHSESPLFEGPLAVPFDCEWIWHCHRLNPVRYKSECEELYGKILDNSNVVSTLGSSC 120
           LLAKHSESPL +GPL VPFDCEWIWHCHRLNPVRYKS+CEELYGKILDNSNV ST+GSSC
Sbjct: 61  LLAKHSESPLLDGPLVVPFDCEWIWHCHRLNPVRYKSDCEELYGKILDNSNVKSTIGSSC 120

Query: 121 LRETEEVWNELYPEEPFNFNFNPTGEYQEDALKVLSGLQKYTKYDLVSAVKRQSPFFYQV 180
            RETEEVWNELYPEEPFNFN   T E QED  KVLSGL+KYTKYDLVSAVKRQ PFFYQV
Sbjct: 121 SRETEEVWNELYPEEPFNFN--STSESQEDVSKVLSGLEKYTKYDLVSAVKRQGPFFYQV 180

Query: 181 SRPHMDNEVFLEEAVARYKGFLYLIKSNRERSIKRFCVPTYDIDLIWHTHQLHPISYCKD 240
           SRPHM NE+FL+EAVARYKGFLYLIKSNRE+S+KRFCVPTYDIDLIWH+HQLHP+SYCKD
Sbjct: 181 SRPHMGNEIFLQEAVARYKGFLYLIKSNREKSLKRFCVPTYDIDLIWHSHQLHPLSYCKD 240

Query: 241 LKNLLGMVLEHDDMDSDRTKGKKLDTGFSGTTKQWEDTFGTRYWKAGAMYRGNSPSPLLL 300
           LK +LG+VLEHDD DSDRTKGKKLD GFSGTTKQWEDTFGTRYW+AG MYRGN PSPL+L
Sbjct: 241 LKKILGVVLEHDDTDSDRTKGKKLDNGFSGTTKQWEDTFGTRYWRAGVMYRGNCPSPLVL 300

Query: 301 NSYSGSTNLIKDDMVSSQECQNTVHLPELKTVEVLLEFVEVKNKPEGLKGNFFVQFMKSQ 360
           N YS STN I+DD+VSSQ+CQN VHLPELKTVEVLLEFVEVKN PEGLKGN FVQFMKSQ
Sbjct: 301 NPYSASTNTIRDDVVSSQDCQNIVHLPELKTVEVLLEFVEVKNIPEGLKGNLFVQFMKSQ 360

Query: 361 PDAIFSAKRKLSILSETGVKQVASFQCEPKGDLQFELICNRASSIPITRTSLTLGSISLP 420
           PDAIF++K KLSILSETGVKQVASFQCEPKGDL+ ELIC R+S+IPITRT LTLGS+SLP
Sbjct: 361 PDAIFNSKWKLSILSETGVKQVASFQCEPKGDLKLELICCRSSNIPITRTPLTLGSVSLP 420

Query: 421 --LHDILVPTSKLSMERWLELKPVSDHVSSKPISLRVAISFTVPHPAPRELHMFFSRELS 480
             L DILVP+SKLSMERWLELKPVSDHVSSKPISLRVAISFTVPHPA RELHMF SRELS
Sbjct: 421 LGLDDILVPSSKLSMERWLELKPVSDHVSSKPISLRVAISFTVPHPAQRELHMFSSRELS 480

Query: 481 RWTSFLPSCTKMQHSKGWMQVTDEAGNEVISLQLRDSLKEKVGKNSIPTSKEVIGIKMSG 540
           RWTSFLPSCT+MQ SKGW QVTDEAGN+VI+LQLRDSLK KVGKN+IPTSKEVIGIKMSG
Sbjct: 481 RWTSFLPSCTRMQRSKGWTQVTDEAGNDVINLQLRDSLKAKVGKNNIPTSKEVIGIKMSG 540

Query: 541 ESSLLAEFVKTGWSLIDGQWFLDFQKKSNEDDHLFKLVGKRLVCMVPVVVKFFQGSKLDY 600
           ES  LAEFVKTGWSLIDGQW LD Q+KS+EDDHLFKLVGKRL       V+F+QG KLDY
Sbjct: 541 ESCHLAEFVKTGWSLIDGQWLLDLQQKSSEDDHLFKLVGKRL-------VRFYQGRKLDY 600

Query: 601 EPKNCGKHNHDSDFVTAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMVVPGIMTAFLLLH 660
           EPKNC KHN + DF++AIEFSAEYPYGRAVALFDLKFGVIKIKEEWM+VPGI+TAFLLLH
Sbjct: 601 EPKNCEKHNREQDFMSAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMLVPGILTAFLLLH 660

Query: 661 ARKKKGYNGLTVSEENLEVAPVPESVHTSGKEENNMNLINVSLSSTDLKVNVSEGV---- 720
             KKKGYN LTV+EE LE     E V  SGKEE  MNL N+S SSTDLK NVSEG+    
Sbjct: 661 TWKKKGYNSLTVNEEKLEADTDHERVQKSGKEEMTMNLTNLSSSSTDLKANVSEGIAVVP 720

Query: 721 -----SNENVTIPLKEDELSSHCGRCDAGGYTAGSGNMVKSGRCGGCGAGGCGGGCGNMV 780
                S EN+T+ L +D+LSSHC +      + G GNMVKSG CGGCGAGGCG  CGNMV
Sbjct: 721 IKEEDSKENITMSLNQDKLSSHCDQNTV--KSGGRGNMVKSGGCGGCGAGGCGSECGNMV 780

Query: 781 KSGGCGGCGAGGCGGGCGNIINSGGCGGCGAGGCGGGCGNIV-NSGGCGGCGAGGCGGGC 840
           KSGGCG    GGCGGGCGNI+NSGGCGGCG        G I+  SGGCG  G+GGC    
Sbjct: 781 KSGGCG----GGCGGGCGNIVNSGGCGGCG--------GEILAKSGGCG--GSGGC---- 838

Query: 841 GNIVNSGSSVGGIVAKSGGCGGGCGGFGHKTAQPNEGNQSDAII 873
                            GGCGGGCG FG+KTAQPNEG Q+D  I
Sbjct: 841 -----------------GGCGGGCGSFGYKTAQPNEGKQTDGSI 838

BLAST of Cp4.1LG14g08780 vs. TrEMBL
Match: A0A061DIX2_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_000919 PE=4 SV=1)

HSP 1 Score: 932.2 bits (2408), Expect = 4.7e-268
Identity = 504/861 (58.54%), Positives = 616/861 (71.54%), Query Frame = 1

Query: 1   MEKNQELEWAEAQRIEIGVDLVAAAKRQLQFLSAVDRNRFLYEGPSLERAIYRYNAYWLP 60
           +EK QELEW EAQ+ EI +DLVAAAK+QL+FL+AVDRNR+LY+GP+L+RAIYRYNAYWLP
Sbjct: 3   LEKEQELEWIEAQKTEISLDLVAAAKKQLEFLAAVDRNRWLYDGPTLQRAIYRYNAYWLP 62

Query: 61  LLAKHSESPLFEGPLAVPFDCEWIWHCHRLNPVRYKSECEELYGKILDNSNVVSTLGSSC 120
           LLAK+ +    EGPL VP DCEWIWHCHRLNPVRYKS+CEELYG+ILDNSNVVS+L  +C
Sbjct: 63  LLAKYHKEEFSEGPLVVPLDCEWIWHCHRLNPVRYKSDCEELYGRILDNSNVVSSLQCTC 122

Query: 121 LRETEEVWNELYPEEPFNFNFNPTGEYQEDALKVLSGLQKYTKYDLVSAVKRQSPFFYQV 180
            R+TEE+WN LYP+EP++F+   T    E+A + LSGL+K+TKYDL+SAVKRQSPFFYQV
Sbjct: 123 KRQTEEIWNRLYPDEPYDFDL--TKALSENASQTLSGLEKHTKYDLISAVKRQSPFFYQV 182

Query: 181 SRPHMDNEVFLEEAVARYKGFLYLIKSNRERSIKRFCVPTYDIDLIWHTHQLHPISYCKD 240
           SR HM N++F+E AVARYKGFL+LIK NRERSIKRFCVPTYDIDLIWHTHQLHP+SYCKD
Sbjct: 183 SRAHMHNDIFIEGAVARYKGFLHLIKRNRERSIKRFCVPTYDIDLIWHTHQLHPVSYCKD 242

Query: 241 LKNLLGMVLEHDDMDSDRTKGKKLDTGFSGTTKQWEDTFGTRYWKAGAMYRGNSPSPLLL 300
           L   +G +LEHDD DSDRTKGKKLD GFSGTTKQWE+TFG RYWKAGAMYRG+SPSPL  
Sbjct: 243 LNKAVGKILEHDDTDSDRTKGKKLDVGFSGTTKQWEETFGIRYWKAGAMYRGSSPSPLTA 302

Query: 301 NSYSGSTNLIKDDMVSSQECQNTVHLPELKTVEVLLEFVEVKNKPEGLKGNFFVQFMKSQ 360
                 T  +  ++ ++  CQ  + LPE+K VEVLLEFV VKN P+  KGN FV F K+Q
Sbjct: 303 IPCMPDT--LSKEVDATNACQKIIKLPEMKVVEVLLEFVGVKNLPDEKKGNLFVLFSKTQ 362

Query: 361 PDAIFSAKRKLSILSETGVKQVASFQCEPKGDLQFELICNRASSIPITRTSLTLGSISLP 420
           PD  F AK+KL+ILS++G KQVASFQCEP G+L FEL+ + AS++P T+T  TLG+ SL 
Sbjct: 363 PDVFFKAKQKLTILSKSGQKQVASFQCEPNGELLFELVSHSASNLPGTKTCKTLGTASLS 422

Query: 421 LHDILVPTSKLSMERWLELKPVSDHVSSKPISLRVAISFTVPHPAPRELHMFFSRELSRW 480
           L + LVP SKL++E+WL+L P S + SSKPI LRVA+SFTVP  AP  LHM  SR  S+ 
Sbjct: 423 LREFLVPVSKLAVEKWLDLMPSSGNGSSKPIGLRVAVSFTVPAIAPHMLHMVRSRPFSKG 482

Query: 481 TSF-LPSCTKMQHSKGWMQVTDEAGNEVISLQLRDSLKEKVGKNSIPTSKEVIGIKMSGE 540
           + F LP   ++Q  KG  +V DE   EVI LQ+ +S K K+ K S  + K+VIG    GE
Sbjct: 483 SCFQLPLAGRVQAGKGCTRVIDETQAEVIRLQMSESGKAKM-KGSCLSRKQVIGTTKHGE 542

Query: 541 SSLLAEFVKTGWSLIDGQWFLDFQKKSNEDDHLFKLVGKRLVCMVPVVVKFFQGSKLDYE 600
           +  LAEFV T WSL+D QW L   ++ +E  HLF L G R+       VK F G KLDYE
Sbjct: 543 THALAEFVGTRWSLMDSQWVLQHSEEVSEHGHLFDLKGNRM-------VKVFLGRKLDYE 602

Query: 601 PKNCGKHNHDSDFVTAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMVVPGIMTAFLLLHA 660
           PK+C K  ++ DF+TA+EFSAE+PYG AVAL DLK G +K KE+W V+PG+++AF+L H 
Sbjct: 603 PKHCEKKRNEGDFMTAVEFSAEHPYGTAVALLDLKSGCLKAKEKWFVLPGLISAFILSHI 662

Query: 661 RKKKGYNGLTVSEENLEVAPVPESVHTSGKEENNMNLINVSLSSTDLKVNVSEGVSNENV 720
            K+KG+ GLT+  +N +       V     E +++N      +S + +VN+   V+ EN 
Sbjct: 663 LKRKGHIGLTIDVKNTKEVDSATEV-----ENDHVN----PTASIETEVNLDGDVTLENA 722

Query: 721 TIPLKEDELSSHCGRCDAGGYTAGSGNMVKSGRCGGCGAGGCGGGCGNMVKSGGCGGCGA 780
            IP K+         C+ G Y    GN VKSG CGGCGA      CGNMVKSGGCGGC A
Sbjct: 723 MIPKKDS--------CN-GDYGGEKGNEVKSGGCGGCGA-----ECGNMVKSGGCGGCSA 782

Query: 781 G-------GCGGGCGNIINSGGCGGCGAGGCGGGCGNIVNSGGCGGCGAGGCGGGCGNIV 840
           G       GCGGGCG+++NS GCG     GCGGGCG+ VNS GCGGC  GGCG GCG+ V
Sbjct: 783 GCSGGCGSGCGGGCGSMVNSSGCG----AGCGGGCGSRVNSSGCGGC--GGCGAGCGSRV 814

Query: 841 NSGSSVGGIVAKSGGCGGGCG 854
            S           GGC   CG
Sbjct: 843 KS--------TGCGGCSLSCG 814

BLAST of Cp4.1LG14g08780 vs. TrEMBL
Match: A9YWR4_MEDTR (DNA-binding protein, putative OS=Medicago truncatula GN=MTR_5g030890 PE=4 SV=1)

HSP 1 Score: 914.8 bits (2363), Expect = 7.8e-263
Identity = 496/859 (57.74%), Positives = 601/859 (69.97%), Query Frame = 1

Query: 1   MEKNQELEWAEAQRIEIGVDLVAAAKRQLQFLSAVDRNRFLYEGPSLERAIYRYNAYWLP 60
           ME  QE  W EAQ+I + VDLV  AK+QLQFL+AVDRNR LY+GP+L+RAIYRYNA WLP
Sbjct: 1   MEAEQEHAWNEAQKIGMSVDLVDVAKKQLQFLAAVDRNRHLYDGPALDRAIYRYNACWLP 60

Query: 61  LLAKHSESPLFEGPLAVPFDCEWIWHCHRLNPVRYKSECEELYGKILDNSNVVSTLGSSC 120
           LLAKHSES +FEGPL VP DCEWIWHCHRLNPVRYK +CEELYG +LDN +VVST+   C
Sbjct: 61  LLAKHSESRIFEGPLVVPLDCEWIWHCHRLNPVRYKLDCEELYGLVLDNFDVVSTVEGIC 120

Query: 121 LRETEEVWNELYPEEPFN---FNFNPTGEYQEDALKVLSGLQKYTKYDLVSAVKRQSPFF 180
            R+TEE+WN+LYP+EP+N    N +P     ED  K  + L KYTKYDL+SAVKRQSPFF
Sbjct: 121 GRQTEEIWNKLYPDEPYNSDLINLDP-----EDISKRTTSLAKYTKYDLISAVKRQSPFF 180

Query: 181 YQVSRPHMDNEVFLEEAVARYKGFLYLIKSNRERSIKRFCVPTYDIDLIWHTHQLHPISY 240
           YQVSRP++ +++F++EA ARYKGFLYLIK N+E+ I RFCVPTYDIDL+WH+HQLHP++Y
Sbjct: 181 YQVSRPYIKDDLFIKEAEARYKGFLYLIKKNKEKGINRFCVPTYDIDLMWHSHQLHPVAY 240

Query: 241 CKDLKNLLGMVLEHDDMDSDRTKGKKLDTGFSGTTKQWEDTFGTRYWKAGAMYRGNSPSP 300
            KDL   LG +LEHDD DSDRTKGKKLD GFSGTTKQWEDTFGTRYWKAGAMY+GN+PSP
Sbjct: 241 SKDLNEALGKILEHDDTDSDRTKGKKLDVGFSGTTKQWEDTFGTRYWKAGAMYKGNAPSP 300

Query: 301 LLLNSYSGSTNLIKDDMVSSQECQNTVHLPELKTVEVLLEFVEVKNKPEGLKGNFFVQFM 360
           +  + +S S N  K  +VSS+E  +   L + K VEV LEFV+VKN P+G +G+ FV F 
Sbjct: 301 ITSSPFSSSKNCKK--VVSSKEQLHDNLLQDRKVVEVFLEFVDVKNLPDGQEGSLFVLFS 360

Query: 361 KSQPDAIFSAKRKLSILSETGVKQVASFQCEPKGDLQFELICNRASSIPITRTSLTLGSI 420
           KSQPDA F AKR+LSILS+T  KQVASFQCEP G+L FEL+ + +S + + ++   LGS 
Sbjct: 361 KSQPDAFFEAKRRLSILSKTKEKQVASFQCEPTGELLFELMSHSSSKLSLRKSPKALGSA 420

Query: 421 SLPLHDILVPTSKLSMERWLELKPVSDHVSSKPISLRVAISFTVPHPAPRELHMFFSREL 480
           ++P+ D L P SKL +E+WLEL P S  +S+KPI LRVAISFT P PAP    +  SR +
Sbjct: 421 AIPMQDYLDPVSKLYIEKWLELVPSSGVMSTKPILLRVAISFTAPIPAPYTFQLAQSRPV 480

Query: 481 SRWTSFLPSCTKMQHSKGWMQVTDEAGNEVISLQLRDSLKEKVGKNSIPTSKEVIGIKMS 540
           S+ T F     K Q +K W   TDE G  +ISLQ+RD    K  KN     KEV G+  S
Sbjct: 481 SKNTCFFNLPVKPQQAKSWTHATDENGTRIISLQMRDL---KNAKNVENLGKEVAGLMES 540

Query: 541 GESSLLAEFVKTGWSLIDGQWFLDFQKKSNEDDHLFKLVGKRLVCMVPVVVKFFQGSKLD 600
           GE+  LAE+++ GWS +D  W L    KS  D H+F+L G +        +K F G K +
Sbjct: 541 GETRTLAEYMENGWSFMDNLWLLHRPSKSKNDGHIFELTGTK-------TIKIFSGRKGE 600

Query: 601 YEPKNCGKHNHDSDFVTAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMVVPGIMTAFLLL 660
           YE +   K  ++ DF+TA+EFS E PYG+AVAL DLK  ++  KE+WMV+PGI+ AFL  
Sbjct: 601 YELRYHLKQGNEMDFLTAVEFSIEDPYGKAVALLDLKSNLVSAKEKWMVLPGIILAFLAS 660

Query: 661 HARKKKGYNGLTVSEENLEVAPVPESVHTSGKEENNMNLINVSLS-STDLKVNVSEGVSN 720
              KK+GY G+    ++LEV    E +     E N++N   +S       KV +S G   
Sbjct: 661 DIMKKEGYEGIIAKSKDLEVVDTYEEI-----ERNDLNGAELSRDVGITKKVVLSSGGCG 720

Query: 721 ENVTIPLKEDELSSHCGRCDAGGYTAGSGNMVKSGRCG-GCGAGGCGGGCGNMVKSGGCG 780
                       S  CG C AGG   G GNM+KSG CG GCG+ GCGGGCGNM+KSGGCG
Sbjct: 721 SGCGSGCGNAVRSGGCGGCGAGGCGGGCGNMIKSGGCGSGCGS-GCGGGCGNMIKSGGCG 780

Query: 781 GCGAGGCGGGCGNIINSGGCGGCGAGGCGGGCGNIVNSGGCGGCGAGGCGGGCGNIVNSG 840
           G   GGCGGGCGNII SGGCGG   GGCGGGCGNI+ SGGCGG   GGCGGGCGNIV SG
Sbjct: 781 GGCGGGCGGGCGNIIKSGGCGGGCDGGCGGGCGNIIKSGGCGGGCGGGCGGGCGNIVESG 833

Query: 841 SSVGGIVAKSGGCGGGCGG 855
              GG     GGCGGGCGG
Sbjct: 841 GCGGGC---GGGCGGGCGG 833

BLAST of Cp4.1LG14g08780 vs. TrEMBL
Match: V4SFT2_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10024875mg PE=4 SV=1)

HSP 1 Score: 912.1 bits (2356), Expect = 5.0e-262
Identity = 506/870 (58.16%), Positives = 620/870 (71.26%), Query Frame = 1

Query: 1   MEKNQELEWAEAQRIEIGVD-LVAAAKRQLQFLSAVDRNRFLYEGPSLERAIYRYNAYWL 60
           M K QE EWAEAQ IEI VD LVAAAK+QLQFL+AVDRNR+LYEGP+L+RAIYRYNA WL
Sbjct: 3   MAKEQEFEWAEAQEIEISVDDLVAAAKQQLQFLAAVDRNRWLYEGPTLQRAIYRYNACWL 62

Query: 61  PLLAKHSESPLFEGPLAVPFDCEWIWHCHRLNPVRYKSECEELYGKILDNSNVVSTLGSS 120
           PLLAKHSES + +G L VP DCEWIWHCHRLNPV+YKS+CEELYGK LDNS VVS++  +
Sbjct: 63  PLLAKHSESHISKGCLVVPLDCEWIWHCHRLNPVQYKSDCEELYGKNLDNSYVVSSIQGT 122

Query: 121 CLRETEEVWNELYPEEPFNFNFNPTGEYQEDALKVLSGLQKYTKYDLVSAVKRQSPFFYQ 180
           C +ETEE+WN LYPEEP+  +        ED    LSGL+K+TKYDLVSAVKRQSPFFYQ
Sbjct: 123 CRKETEEIWNRLYPEEPYELDLAKISS--EDFSAELSGLEKFTKYDLVSAVKRQSPFFYQ 182

Query: 181 VSRPHMDNEVFLEEAVARYKGFLYLIKSNRERSIKRFCVPTYDIDLIWHTHQLHPISYCK 240
           VSR H +N+VFLEEAVARYKGFL+LIK NRERSIKRFCVPTYDIDLIWHTHQLHP SYCK
Sbjct: 183 VSRSHFNNDVFLEEAVARYKGFLHLIKKNRERSIKRFCVPTYDIDLIWHTHQLHPDSYCK 242

Query: 241 DLKNLLGMVLEHDDMDSDRTKGKKLDTGFSGTTKQWEDTFGTRYWKAGAMYRGNSPSPLL 300
           D+   LG VLEHDDMD DRTKGKKLDTGFSGTTKQWE+TFG+RY KAGAMYRG +PSPL 
Sbjct: 243 DMNKTLGKVLEHDDMDQDRTKGKKLDTGFSGTTKQWEETFGSRYPKAGAMYRGTAPSPLT 302

Query: 301 LNSYSGSTNLIKDDMVSSQECQNTVHLPELKTVEVLLEFVEVKNKPEGLKGNFFVQFMKS 360
              +S  ++++  ++VSS+ECQ  +H+ +LK  EV +E V VKN PE  KG+ FV F KS
Sbjct: 303 TIPFS--SDIVSKEVVSSKECQKIIHILDLKIAEVFVEIVAVKNLPEDHKGDLFVFFSKS 362

Query: 361 QPDAIFSAKRKLSILSETGVKQVASFQCEPKGDLQFELICNRASSIPITRTSLTLGSISL 420
           QPD  F+AK+KL+ILS++G+KQVASFQCE  G+L FEL+ +  S IP+T  S T+G+ SL
Sbjct: 363 QPDIFFNAKQKLTILSKSGMKQVASFQCEAAGELLFELVSHSTSKIPMTGASKTMGTASL 422

Query: 421 PLHDILVPTSKLSMERWLELKPVSDHVSSKPISLRVAISFTVPHPAPRELHMFFSRELSR 480
            L + + P SKL++E+W +L P S +VSSKPISLR+A+SFT+P  AP  L M  SR LS+
Sbjct: 423 SLQNFISPISKLAVEQWFDLVPRSGNVSSKPISLRIAVSFTIPTLAPHLLRMVRSRPLSK 482

Query: 481 WTSFLPSCTKMQHSKGWMQVTDEAGNEVISLQLRDSLKEKVGKNSIPTSKEVIGIKMSGE 540
            + F P   ++Q +K W +V DE  +EVISLQ+RD  KEK G N     K+VIG+  SGE
Sbjct: 483 GSCFFPLPGRIQPAKSWTRVIDETQSEVISLQMRDPKKEKGGDNCT-LKKQVIGVTESGE 542

Query: 541 SSLLAEFVKTGWSLIDGQWFLDFQKKSNEDDHLFKLVGKRLVCMVPVVVKFFQGSKLDYE 600
           +  LAE V+TGWS++D  W L  +KKS+++ HLF+L+G R++ + P       G KLDYE
Sbjct: 543 TITLAEMVETGWSVMDCCWSL--KKKSSKEGHLFELLGNRMINLFP-------GRKLDYE 602

Query: 601 PKNCGKHNHDSDFVTAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMVVPGIMTAFLLLHA 660
            K+C K   + DFVTAIEFS   PYG+A+AL DLK GVIK+KEEW ++ GI++AF+L  A
Sbjct: 603 HKHCQKQRSEEDFVTAIEFSPADPYGKAIALLDLKSGVIKVKEEWFLLLGIISAFILSDA 662

Query: 661 RKKKGYNGLTVSEENLEVAPVPESVHTSGKEENNMNLINVSLSSTDLKVNVSEGVSNENV 720
            K+ GY+G T ++E ++     E    S + E    L    + +  +     E   N+N+
Sbjct: 663 LKE-GYDGFTANDEIMK-----EMKSASDRVEG---LREEGICTKMIPPVEDEPELNKNM 722

Query: 721 TIPLKEDELSSHCGRCDAGGYTAGSGNM--VKSGRCGGCGAGGCGGGCGNMVKSGGCGGC 780
           T  L     S  CG C +G    G G +  VKS  CGGCG GG  GGCGNMV  GGCGGC
Sbjct: 723 TNELN----SGGCGGCGSG---CGGGRVASVKSSGCGGCGGGG--GGCGNMVNGGGCGGC 782

Query: 781 GAG-------GCGGGCGNIINSGGC------GGCGAGGCGGGCGNIVNSGGCGGCGAGGC 840
           G G       GCGGGC  ++ S GC      GGCG+GGCG GCGN+V +   GGCG+GGC
Sbjct: 783 GGGCGGGCGGGCGGGCAALVKSSGCGGGECGGGCGSGGCGAGCGNMVKT---GGCGSGGC 829

Query: 841 GGGCGNIVNSGSSVGGIVAKSGGCGGGCGG 855
           G GCGN V +G          GGCGGGCGG
Sbjct: 843 GAGCGNTVKTGG--------CGGCGGGCGG 829

BLAST of Cp4.1LG14g08780 vs. TrEMBL
Match: I1J4X2_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_01G019800 PE=4 SV=1)

HSP 1 Score: 909.4 bits (2349), Expect = 3.3e-261
Identity = 495/869 (56.96%), Positives = 609/869 (70.08%), Query Frame = 1

Query: 1   MEKNQELEWAEAQRIEIGVDLVAAAKRQLQFLSAVDRNRFLYEGPSLERAIYRYNAYWLP 60
           ME  QE+EW EAQ+I I VDL   AK+QLQFL+ VD+NR LY+GP+L+RAIYRYNA W+P
Sbjct: 1   MEPQQEMEWNEAQKIPISVDLEVVAKKQLQFLATVDKNRHLYDGPALDRAIYRYNACWIP 60

Query: 61  LLAKHSESPLFEGPLAVPFDCEWIWHCHRLNPVRYKSECEELYGKILDNSNVVSTLGSSC 120
           LLAKHSESP+FEGPL VP DCEWIWHCHRLNPVRYK++CEELYG++LDN  V +T+   C
Sbjct: 61  LLAKHSESPIFEGPLVVPLDCEWIWHCHRLNPVRYKTDCEELYGRVLDNFGVATTVEGIC 120

Query: 121 LRETEEVWNELYPEEPFNFNFNPTGEYQEDALKVLSGLQKYTKYDLVSAVKRQSPFFYQV 180
             +TEE+WN+LYP+EP+N +        ED  K +S L+KYTKYDL+SA KRQSPFFYQV
Sbjct: 121 GWQTEEIWNKLYPDEPYNADL--VNLLPEDISKRISKLEKYTKYDLISAAKRQSPFFYQV 180

Query: 181 SRPHMDNEVFLEEAVARYKGFLYLIKSNRERSIKRFCVPTYDIDLIWHTHQLHPISYCKD 240
           SR HM N++F++EAVARYKGFL+LIK N+E+ IKRFCVPTYDIDLIWH+HQLHP++YCKD
Sbjct: 181 SRTHMKNDLFIKEAVARYKGFLHLIKRNKEKGIKRFCVPTYDIDLIWHSHQLHPVAYCKD 240

Query: 241 LKNLLGMVLEHDDMDSDRTKGKKLDTGFSGTTKQWEDTFGTRYWKAGAMYRGNSPSPLLL 300
           L   LG VLEHDD DSDRTKGKKLD GFSGTT+QWE TFGTRYWKAGAMYRGN+PSP+  
Sbjct: 241 LNEALGKVLEHDDTDSDRTKGKKLDLGFSGTTRQWEVTFGTRYWKAGAMYRGNAPSPITS 300

Query: 301 NSYSGSTNLIKDDMVSSQECQNTVHLPELKTVEVLLEFVEVKNKPEGLKGNFFVQFMKSQ 360
           N +  S    K  +VSS E    + LP+ K +EVLLEF+ VKN PEG +G+  V F KSQ
Sbjct: 301 NPFPSSITCKK--VVSSNEYPQEISLPDRKVMEVLLEFIGVKNLPEGQEGDLCVLFSKSQ 360

Query: 361 PDAIFSAKRKLSILSETGVKQVASFQCEPKGDLQFELICNRASSIPITRTSLTLGSISLP 420
           PDA F AKR+LSILS +  KQVASF+CEP G+L FEL+ + +S + I +++ TLGS S  
Sbjct: 361 PDAFFDAKRRLSILSVSREKQVASFRCEPTGELLFELMSSSSSKLSIRKSTKTLGSASFS 420

Query: 421 LHDILVPTSKLSMERWLELKPVSDHVSSKPISLRVAISFTVPHPAPRELHMFFSRELSRW 480
           + D L P SKL +E+WLEL P S  +SSKPI LRVAISFTVP  AP  L M  SR  S+ 
Sbjct: 421 MKDYLDPVSKLYVEKWLELVPGSGTMSSKPILLRVAISFTVPVLAPYTLEMTQSRPFSKN 480

Query: 481 TSFLPSCTKMQHSKGWMQVTDEAGNEVISLQLRDSLKEKVGKNSIPTSKEVIGIKMSGES 540
           T       + QH+K W  VTDE G  +ISLQ+RD    K  KN     KEV+G+  SGE+
Sbjct: 481 TCLFNLPVRPQHAKSWTHVTDENGTRIISLQMRDL---KNAKNIGNPGKEVVGLMKSGET 540

Query: 541 SLLAEFVKTGWSLIDGQWFLDFQKKSNEDDHLFKLVG--KRLVCMVPVVVKFFQGSKLDY 600
             LAEF++ GWS+++  W      KS  D HLF+L G  KR        V+ F G KLDY
Sbjct: 541 RTLAEFMENGWSILENLWLFHLPNKSTNDGHLFELTGANKR--------VRIFPGRKLDY 600

Query: 601 EPKNCGKHNHDSDFVTAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMVVPGIMTAFLLLH 660
           E ++ GK  ++ +F+TA+EFS E PYG+AVAL DL+   +  KE+WMV+PGI+  F+  +
Sbjct: 601 ELRHNGKRGNEMNFLTAVEFSIEEPYGKAVALLDLRSRHVTAKEKWMVLPGIILTFIASN 660

Query: 661 ARKKKGYNGLTVSEENLEVAPVPESVHTSGKEENNMNLIN-VSLSSTDLKVNVSEGVSNE 720
             KK+GY G+    ++L+V            EEN   ++N   LSST++         NE
Sbjct: 661 IMKKEGYEGIIAKSKDLKV--------NGPNEENEKTVLNGTGLSSTNM--------CNE 720

Query: 721 NVTIPLKEDELSSHCGRC-DAGGYTAGSG----NMVKSGRCGGCGAGGCGGGCGNMVKSG 780
           +  I  K +     CG   ++GG  AG G    N++KSG CGGCGA GCGGGCGN++KSG
Sbjct: 721 DEGITYKSEISIGGCGNAVESGGCGAGCGGGCGNLIKSGGCGGCGA-GCGGGCGNLIKSG 780

Query: 781 GCGGCGA--GGCGGGCGNIINSGGCGGCGA---GGCGGGCGNIVNSGGCGGC--GAGGCG 840
           GCGGCGA  GGCGGGCGN+I SGGCGGCGA   GGCGGGCG+++ SGGCGGC  G GGCG
Sbjct: 781 GCGGCGAGCGGCGGGCGNLIKSGGCGGCGAGCGGGCGGGCGSMLKSGGCGGCGGGCGGCG 825

Query: 841 GGCGNIVNSGSSVGGIVAKSGGCGGGCGG 855
           GGCGN +           +S GC GGCGG
Sbjct: 841 GGCGNRL-----------ESSGC-GGCGG 825

BLAST of Cp4.1LG14g08780 vs. TAIR10
Match: AT2G22660.2 (AT2G22660.2 Protein of unknown function (duplicated DUF1399))

HSP 1 Score: 766.9 bits (1979), Expect = 1.3e-221
Identity = 432/859 (50.29%), Positives = 570/859 (66.36%), Query Frame = 1

Query: 2   EKNQELEWAEAQRIEIGVDLVAAAKRQLQFLSAVDRNRFLYEGPSLERAIYRYNAYWLPL 61
           EK+ E+EW EAQ+IEI VDL+AAAK+ L FL  VDRNR+LY+GP+LE+AIYRYNA WLPL
Sbjct: 4   EKDHEVEWLEAQKIEISVDLLAAAKQHLLFLETVDRNRWLYDGPALEKAIYRYNACWLPL 63

Query: 62  LAKHSESP-LFEGPLAVPFDCEWIWHCHRLNPVRYKSECEELYGKILDNSNVVSTLGSSC 121
           L K+SES  + EG L  P DCEWIWHCHRLNPVRY S+CE+ YG++LDNS V+S++  +C
Sbjct: 64  LVKYSESSSVSEGSLVPPLDCEWIWHCHRLNPVRYNSDCEQFYGRVLDNSGVLSSVDGNC 123

Query: 122 LRETEEVWNELYPEEPFNFNFNPTGEYQEDALKVLSGLQKYTKYDLVSAVKRQSPFFYQV 181
             +TE++W  LYP+EP+  + +      ED  +  S L+K TKYDLVSAVKRQSPF+YQV
Sbjct: 124 KLKTEDLWKRLYPDEPYELDLDNID--LEDISEKSSALEKCTKYDLVSAVKRQSPFYYQV 183

Query: 182 SRPHMDNEVFLEEAVARYKGFLYLIKSNRERSIKRFCVPTYDIDLIWHTHQLHPISYCKD 241
           SR H+++++FL+EAVARYKGFLYLIK NRERS+KRFCVPTYD+DLIWHTHQLHP+SYC D
Sbjct: 184 SRSHVNSDIFLQEAVARYKGFLYLIKMNRERSLKRFCVPTYDVDLIWHTHQLHPVSYCDD 243

Query: 242 LKNLLGMVLEHDDMDSDRTKGKKLDTGFSGTTKQWEDTFGTRYWKAGAMYRGNSPSPLLL 301
           +  L+G VLEHDD DSDR KGKKLDTGFS TT QWE+TFGTRYWKAGAM+RG +P P+  
Sbjct: 244 MVKLIGKVLEHDDTDSDRGKGKKLDTGFSKTTAQWEETFGTRYWKAGAMHRGKTPVPVTN 303

Query: 302 NSYSGSTNLIKDDMVSSQECQNTVHLPELKTVEVLLEFVEVKNKPEGLKGNFFVQFMKSQ 361
           + Y+    L+KD   +  + QN +  PE++ VEVLLE + V+N P+G KG   V F K+Q
Sbjct: 304 SPYASDV-LVKDP-TAKDDFQNLIQFPEVEVVEVLLEIIGVRNLPDGHKGKVSVMFSKTQ 363

Query: 362 PDAIFSAKRKLSILSETGVKQVASFQCEPKGDLQFELICNRASSIPITRTSLTLGSISLP 421
           PD++F+A+R+L+ILSE G KQVA+FQCEP G+L F+LI    S IP++R    LG  SL 
Sbjct: 364 PDSLFNAERRLTILSEVGEKQVATFQCEPTGELVFKLISCSPSKIPVSREPKNLGFASLS 423

Query: 422 LHDILVPT-SKLSMERWLELKPVS-DHVSSKPISLRVAISFTVPHPAPRELHMFFSRELS 481
           L + L P  ++LS+E+WLEL P       +KPISLRVA+SFT P  +P  LHM  SR   
Sbjct: 424 LKEFLFPVITQLSVEKWLELTPSKGSQTDTKPISLRVAVSFTPPVRSPSVLHMVQSRPSC 483

Query: 482 RWTSFLPSCTKMQHSKGWMQVTDEAGNEVISLQLRDSLKEKVGKNSIPTSKEVIGIKMSG 541
           + + F P   K + +K    + DE   EVI+LQ+R+S    + K+     ++V+G+  SG
Sbjct: 484 KGSCFFPIIGKSRLAKSSTHIVDETQTEVITLQIRNSADGGILKDD---QRQVMGVTDSG 543

Query: 542 ESSLLAEFVKTGWSLIDGQWFLDFQKKSNEDDHLFKLVGKRLVCMVPVVVKFFQGSKLDY 601
           E+ +LA +  + WSL+D +W L     S  D+ LF+++G R       VVK F G KLDY
Sbjct: 544 ETRVLAVYTGSFWSLLDSKWSLKQINASTADNPLFEILGPR-------VVKIFSGRKLDY 603

Query: 602 EPKNCGKHNHDSDFVTAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMVVPGIMTAFLLLH 661
           EPK+C     D DF+T +EFS ++PYG+ V L D++FG I+ KE W+++PGI++AF+L  
Sbjct: 604 EPKHCANLRSDLDFMTLVEFSKQHPYGKTVGLVDMRFGSIEAKENWLLLPGIVSAFILHT 663

Query: 662 ARKKKGYNGLTVSEENLEVAPVPESVHTS--GKEENNMNLINVSLSSTDLKVNVSEGVSN 721
             KK G  G  V+ ++++     ES  T      ENN+N      +ST+++   +     
Sbjct: 664 VLKKGGSEGFNVTTKDIK----EESKQTKLVAATENNVNA-----NSTNVETQTA----- 723

Query: 722 ENVTIPLKEDELSSHCGRCDAGGYTAGSGNMVKSGRCGGCGAGGCGGGCGNMVKSGG-CG 781
             +T P K     S CG    GG +   GNMVK+    GCG+  C G CG+MVKS     
Sbjct: 724 --ITAPKK----GSGCG----GGCSGECGNMVKAANASGCGSS-CSGECGDMVKSAANAS 783

Query: 782 GCGAGGCGGGCGNIINSGGCGGCGAGGCGGGCGNIVNSGGCGGCGAGGCGGGCGNIVNSG 841
           GCG+G C G CGN++ +    G   GG G  C     + GCGG   GGCGGGCG++V S 
Sbjct: 784 GCGSG-CSGECGNMVKAANASG---GGYGARC-KAAKASGCGGGCGGGCGGGCGDMVKS- 810

Query: 842 SSVGGIVAKSGGCGGGCGG 855
                    + GCGGGC G
Sbjct: 844 -------VNASGCGGGCNG 810

BLAST of Cp4.1LG14g08780 vs. TAIR10
Match: AT4G37900.1 (AT4G37900.1 Protein of unknown function (duplicated DUF1399))

HSP 1 Score: 731.9 bits (1888), Expect = 4.7e-211
Identity = 419/846 (49.53%), Positives = 553/846 (65.37%), Query Frame = 1

Query: 2   EKNQELEWAEAQRIEIGVDLVAAAKRQLQFLSAVDRNRFLYEGPSLERAIYRYNAYWLPL 61
           EK Q LEW EAQ+I+I VDL+AAAK+ L FL AVDRNR LY+GP+L+RAIYRYNAYWLPL
Sbjct: 4   EKEQTLEWNEAQKIDISVDLLAAAKKHLLFLGAVDRNRCLYDGPALQRAIYRYNAYWLPL 63

Query: 62  LAKHSESP-LFEGPLAVPFDCEWIWHCHRLNPVRYKSECEELYGKILDNSNVVSTLGSSC 121
           LA+++ES  + +GPL  P DCEW+WHCHRLNPVRYK++CE+ YG++LDNS VVS++  +C
Sbjct: 64  LAQYTESSSICQGPLVPPLDCEWVWHCHRLNPVRYKTDCEQFYGRVLDNSGVVSSVNGNC 123

Query: 122 LRETEEVWNELYPEEPFNFNFNPTGEYQEDALKVLSGLQKYTKYDLVSAVKRQSPFFYQV 181
             +TE +W  LYP EP++ +F        D    +S L+K T YDLV AVKRQSPFFYQV
Sbjct: 124 KSQTETLWKRLYPTEPYDLDFANAISEPAD----VSALEKCTTYDLVLAVKRQSPFFYQV 183

Query: 182 SRPHMDNEVFLEEAVARYKGFLYLIKSNRERSIKRFCVPTYDIDLIWHTHQLHPISYCKD 241
           SR H+DN+VFL+EAVARYK FLYLIK NRERSIK FCVPTYDIDLIWHTHQLH ISYC D
Sbjct: 184 SRAHVDNDVFLQEAVARYKAFLYLIKGNRERSIKLFCVPTYDIDLIWHTHQLHAISYCND 243

Query: 242 LKNLLGMVLEHDDMDSDRTKGKKLDTGFSGTTKQWEDTFGTRYWKAGAMYRGNSPSPLLL 301
           L  ++G VLEHDD DSDR+KGKKLDTGFSGTT QWE+TFG RYWKAGAM RGN+P P+  
Sbjct: 244 LTKMIGKVLEHDDTDSDRSKGKKLDTGFSGTTAQWEETFGRRYWKAGAMNRGNTPKPVTT 303

Query: 302 NSY--SGSTNLIKDDMVSSQECQNTVHLPELKTVEVLLEFVEVKNKPEGLKGNFFVQFMK 361
           + Y  SG  ++ K+     +E QN +  PE+K +EV+LE V VKN P+  KG  FV F K
Sbjct: 304 SPYVCSGKKSIAKE-----EESQNVIQYPEVKVIEVILEIVGVKNLPDAHKGKVFVLFSK 363

Query: 362 SQPDAIFSAKRKLSILSET-GVKQVASFQCEPKGDLQFELICNRASSIPITRTSLTLGSI 421
           +QPD++F+A+R+L++LSE+ G KQVA FQCEP G+L F+L+ +++ S         LG  
Sbjct: 364 TQPDSLFNAERRLTVLSESCGEKQVALFQCEPTGELSFQLMSSKSKS---------LGFT 423

Query: 422 SLPLHDILVPTSKLSMERWLELKPVSDHVSS--KPISLRVAISFTVPHPAPRELHMFFSR 481
           SL   + L P +KLS+E+WLEL P     +    PISLRVA+SFT P  +P  LH+  +R
Sbjct: 424 SLSFSEFLSPVTKLSVEKWLELTPTKRGKADDPNPISLRVAVSFTPPTRSPTVLHLVQAR 483

Query: 482 ELSRWTSFLPSCTKMQHSKGWMQVTDEAGNEVISLQLRDSLKEKVGKNSIPTSKEVIGIK 541
              + + FLP   K++ +K + +V DE   EVI+LQ+R+S  +   K      ++VIG+K
Sbjct: 484 PSLKGSCFLPMLRKVRLAKSFTRVVDETETEVINLQMRNS-NDAAPKGD---RRQVIGVK 543

Query: 542 MSGESSLLAEFVKTGWSLIDGQWFLDFQKKSNEDDHLFKLVGKRLVCMVPVVVKFFQGSK 601
             GE+ +LAE+  T WSL+D +W L        D  LF+L G R+       VK + G K
Sbjct: 544 ECGETYVLAEYDGTFWSLLDSKWSLKQTCNPATDGPLFELSGTRM-------VKVYSGRK 603

Query: 602 LDYEPKNCGKHNHDSDFVTAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMVVPGIMTAFL 661
           L+YEPK+C K   + DF+TA+EFS ++PYG+AV L DLKFG I+  E+W+V+PG++++F+
Sbjct: 604 LEYEPKHCSKLRSEQDFMTAVEFSKQHPYGKAVGLLDLKFGSIEANEKWLVLPGMVSSFI 663

Query: 662 LLHARKKKGYNGLTVSEENLEVAPVPESVHTSGKEENNMNLINVSLSSTDLKVNVSEGVS 721
           L    KK+G++           A   ++V  +G  E +  +  +S    +      E + 
Sbjct: 664 LSDLLKKEGFS-----------AAAKDTVKANGITEESTEIDVLSQEKLE-----EETMM 723

Query: 722 NENVTIPLK-EDELSSHCGRCDAGGYTAGSGNMV--KSGRCGGC-GAGGCGGGCGNMVKS 781
           + + T P+    E  +   RC +      SGNM+  + G CGGC G GGCGGG       
Sbjct: 724 DVDTTTPVAVAAEKINGGARCFSKEL---SGNMIEEEGGHCGGCGGCGGCGGG------- 774

Query: 782 GGCGGCGAGGCGGGCGNIINSGGCGGCGAGGCGGGCGNIVNSGGCGGCGAGGCGGGCGNI 838
           GGCGG      GG CG +    GCGG   G C GG      S GC     G CGGGCGN+
Sbjct: 784 GGCGG------GGRCGGMTKIEGCGG---GSCTGG------STGC-----GNCGGGCGNM 774

BLAST of Cp4.1LG14g08780 vs. TAIR10
Match: AT1G56230.1 (AT1G56230.1 Protein of unknown function (DUF1399))

HSP 1 Score: 135.2 bits (339), Expect = 2.0e-31
Identity = 142/562 (25.27%), Positives = 239/562 (42.53%), Query Frame = 1

Query: 8   EWAEAQRIEIGVDLVAAAKRQLQFLSAVDRNRFLYEGPSLERAIYRYNAYWLPLLAKHSE 67
           E +E   + IG D++++A+R +  L +V   ++L+  P +  AI RY+  W+PL++  + 
Sbjct: 19  EISEVDAVRIGGDIISSARRLIALLRSVGDCQWLHHPPVIAEAIRRYDELWMPLISDLTV 78

Query: 68  SPLFEGPLAVPFDCEWIWHCHRLNPVRYKSECEELYGKILDNSNVVSTLGSS-CLRETEE 127
             L    +  P D EW+W CH LNPV Y   CE  + K++    +         + + E+
Sbjct: 79  G-LKPPMILPPLDVEWVWFCHCLNPVSYSDYCERRFSKLIGKPAIYDEENEDYAVLQCEK 138

Query: 128 VWNELYPEEPFNFNFNPTGEYQEDALKVLSGLQKYTKYDLVSAVKRQSPFFYQVSRPHMD 187
           +W+  YP E F    +P      D+L+ +S + +    D+ S VK+Q   + + S P+M 
Sbjct: 139 IWSLRYPLESFENRADP------DSLETVSLVNE----DIKSLVKKQMFLWEKFSAPYMS 198

Query: 188 NEVFLEEAVARYKGFLYLIKSNRERSIKRFCVPTYDIDLIWHTHQLHPISYCKDLKNLLG 247
             V+L  A  RYKGFL ++   ++       +P  DI L+W THQ +P  Y  D+  +L 
Sbjct: 199 ETVYLIAARLRYKGFLLILHKFKDEVSS--LIPASDILLMWLTHQSYPTVYKDDVDEMLE 258

Query: 248 MVLEHDDMDSDRTKGKKLDTGFSGTTKQWEDTFGTRYWKAGAMYRGNSPSPLLLNSYSGS 307
            +        ++ +  +++T    T + W+  F   Y KAG          ++ N    S
Sbjct: 259 EMTRKVVQVGEKVEKTEVET----TKELWDRYFNQPYEKAG------GELSIIANESGLS 318

Query: 308 TNLIKDDMVSSQECQNTVHLPELKTVEVLLEFVEVKNKPEGLKG--NFFVQFMKSQPDAI 367
            N +    VS  +          + V  L  F+ +  K E  +     F++   ++    
Sbjct: 319 NNTMFYWPVSDMDVNTAYKSIRPRFVLELCIFLRLNPKAEQNESIDRSFLRLRVARCHRK 378

Query: 368 FSAKRKLSIL-SETGVKQVASFQCEPKGDLQF--ELICNRASSIPITRTSLTLGSISLPL 427
               +K++ L SE   ++     CE  G L F  E  C+R+  I   ++    G I  P 
Sbjct: 379 LQLDKKMTDLSSEASWQKAWHLYCE-FGTLGFILESHCDRSRGI-CFKSGKPEGMIEFPW 438

Query: 428 HDILVPTSKLSMERWLELKPVSDHVSSKPISLRVAISFTVPHPAPRELHMFFSRELSRWT 487
           +D+L   S L+  R+L           K +S  V  S T P  AP  L     R      
Sbjct: 439 NDLLRAHS-LASGRFL----------GKQVS--VFASVTPPVQAPYLLRFVPDRVTDDSG 498

Query: 488 SFLPSCTKMQHS-----KGWM--QVTDEAGNEVISLQLRDSLKEKVGKNSIPTSKEVIGI 547
           + +    +  ++       W+   V D AG E   +++R      VGK       EV   
Sbjct: 499 AMISDSVQRTNNFRPQEGRWLTRTVLDHAGRECFVIRIR------VGKGVFKRGGEVPSP 534

Query: 548 KMSGESSLLAEFVKTGWSLIDG 557
             S E   + E     WS ++G
Sbjct: 559 VKSEER--ITEVRVGSWSYVEG 534

BLAST of Cp4.1LG14g08780 vs. TAIR10
Match: AT4G37682.1 (AT4G37682.1 Protein of unknown function (DUF1399))

HSP 1 Score: 59.7 bits (143), Expect = 1.0e-08
Identity = 29/53 (54.72%), Positives = 39/53 (73.58%), Query Frame = 1

Query: 167 VSAVKRQSPFFYQVSRPHMDN-EVFLEEAVARYKGFLYLIKSNRERSIKRFCV 219
           +SAVKRQ PF+YQVSR H+DN +VFL+EA+ARYK FL ++  +    +  F V
Sbjct: 21  LSAVKRQGPFYYQVSRAHVDNDDVFLQEALARYKAFLIILVFSIREEVVVFVV 73

BLAST of Cp4.1LG14g08780 vs. NCBI nr
Match: gi|659096009|ref|XP_008448876.1| (PREDICTED: uncharacterized protein LOC103490909 [Cucumis melo])

HSP 1 Score: 1384.0 bits (3581), Expect = 0.0e+00
Identity = 713/895 (79.66%), Positives = 765/895 (85.47%), Query Frame = 1

Query: 1   MEKNQELEWAEAQRIEIGVDLVAAAKRQLQFLSAVDRNRFLYEGPSLERAIYRYNAYWLP 60
           MEKNQELEW EAQ+IEIGVDLVAAAKRQLQFLSAV+RNRFLYE PSLERAIYRYNAYWLP
Sbjct: 1   MEKNQELEWVEAQQIEIGVDLVAAAKRQLQFLSAVERNRFLYESPSLERAIYRYNAYWLP 60

Query: 61  LLAKHSESPLFEGPLAVPFDCEWIWHCHRLNPVRYKSECEELYGKILDNSNVVSTLGSSC 120
           LLAKHSESPLF+GPL VPFDCEWIWHCHRLNPVRYKS+CEELYGKILDNSNV+ST+GSSC
Sbjct: 61  LLAKHSESPLFDGPLVVPFDCEWIWHCHRLNPVRYKSDCEELYGKILDNSNVISTIGSSC 120

Query: 121 LRETEEVWNELYPEEPFNFNFN--PTGEYQEDALKVLSGLQKYTKYDLVSAVKRQSPFFY 180
            RETE+VWNELYPEEPFNFNFN   T + QED  +VLSGLQKYTKYDLVSAVKRQSPFFY
Sbjct: 121 SRETEKVWNELYPEEPFNFNFNFDSTRDSQEDISEVLSGLQKYTKYDLVSAVKRQSPFFY 180

Query: 181 QVSRPHMDNEVFLEEAVARYKGFLYLIKSNRERSIKRFCVPTYDIDLIWHTHQLHPISYC 240
           QVSRPHM NE+FL+EAVARYKGFLYLIKSNRE+SIKRFCVPTYDIDLIWH+HQLHP+SYC
Sbjct: 181 QVSRPHMGNEIFLQEAVARYKGFLYLIKSNREKSIKRFCVPTYDIDLIWHSHQLHPLSYC 240

Query: 241 KDLKNLLGMVLEHDDMDSDRTKGKKLDTGFSGTTKQWEDTFGTRYWKAGAMYRGNSPSPL 300
           KDLK +LG VLEHDD DSDRTKGKKLD GFSGTTKQWEDTFGTRYWKAGAMYRGN PSPL
Sbjct: 241 KDLKKILGTVLEHDDTDSDRTKGKKLDNGFSGTTKQWEDTFGTRYWKAGAMYRGNCPSPL 300

Query: 301 LLNSYSGSTNLIKDDMVSSQECQNTVHLPELKTVEVLLEFVEVKNKPEGLKGNFFVQFMK 360
           +LN YS STN IKDD+VSSQ+CQN VHLPELKTVEVLLEFVEVKN PEGLKGN FVQFMK
Sbjct: 301 VLNPYSASTNTIKDDVVSSQDCQNIVHLPELKTVEVLLEFVEVKNIPEGLKGNLFVQFMK 360

Query: 361 SQPDAIFSAKRKLSILSETGVKQVASFQCEPKGDLQFELICNRASSIPITRTSLTLGSIS 420
           SQPDAIF++K KLSILSETG+KQVASFQCEPKGDLQ ELIC R+S+IPITRT+LTLGS+S
Sbjct: 361 SQPDAIFNSKWKLSILSETGLKQVASFQCEPKGDLQLELICCRSSNIPITRTTLTLGSVS 420

Query: 421 LPLHDILVPTSKLSMERWLELKPVSDHVSSKPISLRVAISFTVPHPAPRELHMFFSRELS 480
           LPL DILVP+SKLSMERWLELKPVSDHVSSKPISLRVA+SFTVPHPA RELHMF SRELS
Sbjct: 421 LPLDDILVPSSKLSMERWLELKPVSDHVSSKPISLRVAVSFTVPHPAQRELHMFSSRELS 480

Query: 481 RWTSFLPSCTKMQHSKGWMQVTDEAGNEVISLQLRDSLKEKVGKNSIPTSKEVIGIKMSG 540
           RWTSFLPSCT+MQ SKGW QVTDEAGNEVI+LQLRDSLKEKVGKN+IPTSKEVIGIKMSG
Sbjct: 481 RWTSFLPSCTRMQRSKGWTQVTDEAGNEVINLQLRDSLKEKVGKNTIPTSKEVIGIKMSG 540

Query: 541 ESSLLAEFVKTGWSLIDGQWFLDFQKKSNEDDHLFKLVGKRLVCMVPVVVKFFQGSKLDY 600
           ES  LAEFVKTGWSLIDGQW LD Q+KS+EDDHLFKLVGKR       +V+F+QG KLDY
Sbjct: 541 ESCHLAEFVKTGWSLIDGQWLLDLQQKSSEDDHLFKLVGKR------KLVRFYQGRKLDY 600

Query: 601 EPKNCGKHNHDSDFVTAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMVVPGIMTAFLLLH 660
           EPKNC KHN + DF++AIEFSAEYPYGRAVALFDLKFGV KIKEEWM+VPGI+TAFLLLH
Sbjct: 601 EPKNCEKHNREQDFMSAIEFSAEYPYGRAVALFDLKFGVSKIKEEWMLVPGILTAFLLLH 660

Query: 661 ARKKKGYNGLTVSEENLEVAPVPESVHTSGKEENNMNLINVSLSSTDLKVNVSEGV---- 720
             KKKGYN LTVSEE LE    PE V  S KEE  MN  N+S SSTDLK NVSEG+    
Sbjct: 661 TWKKKGYNSLTVSEEKLEADTDPERVQKSRKEEKTMNQTNLSFSSTDLKANVSEGIAVVP 720

Query: 721 -----SNENVTIPLKEDELSSHCGRCDAGGYTAGSGNMVKSGRCGGCGAGGCGGGCGNMV 780
                S EN T+ L +D+LSSHCG+           N VKSG CGGCG      GCGNMV
Sbjct: 721 IKEEDSKENTTMSLNQDKLSSHCGQ-----------NTVKSGGCGGCGT-----GCGNMV 780

Query: 781 KSGGCGGCGAGGCGGGCGNIINSGGCGGCGAGGCGGGCGNIVNSGGCGGC--GAGGCGGG 840
           KSGGCGGCGAGGCG  CGN++ S    GCGAGGCG GCGNIVNSGGCGGC  G GGCGGG
Sbjct: 781 KSGGCGGCGAGGCGSECGNMVKS---SGCGAGGCGAGCGNIVNSGGCGGCGGGCGGCGGG 840

Query: 841 CGNIVNSGSSVGGIVAKSGGCG----------GGCGGFGHKTAQPNEGNQSDAII 873
           CG         GGI+AKSGGCG          GGCG FG+KTA+PNEG Q+DA I
Sbjct: 841 CG---------GGILAKSGGCGGCGGGGCGGCGGCGSFGYKTAEPNEGKQTDASI 861

BLAST of Cp4.1LG14g08780 vs. NCBI nr
Match: gi|449465866|ref|XP_004150648.1| (PREDICTED: uncharacterized protein LOC101219844 [Cucumis sativus])

HSP 1 Score: 1345.1 bits (3480), Expect = 0.0e+00
Identity = 695/884 (78.62%), Positives = 748/884 (84.62%), Query Frame = 1

Query: 1   MEKNQELEWAEAQRIEIGVDLVAAAKRQLQFLSAVDRNRFLYEGPSLERAIYRYNAYWLP 60
           MEKNQELEW EAQ+IEIGVDLVAAAKRQLQFLSAVDR+RFLYE PSLERAIYRYNAYWLP
Sbjct: 1   MEKNQELEWVEAQQIEIGVDLVAAAKRQLQFLSAVDRSRFLYESPSLERAIYRYNAYWLP 60

Query: 61  LLAKHSESPLFEGPLAVPFDCEWIWHCHRLNPVRYKSECEELYGKILDNSNVVSTLGSSC 120
           LLAKHSESPL +GPL VPFDCEWIWHCHRLNPVRYKS+CEELYGKILDNSNV ST+GSSC
Sbjct: 61  LLAKHSESPLLDGPLVVPFDCEWIWHCHRLNPVRYKSDCEELYGKILDNSNVKSTIGSSC 120

Query: 121 LRETEEVWNELYPEEPFNFNFNPTGEYQEDALKVLSGLQKYTKYDLVSAVKRQSPFFYQV 180
            RETEEVWNELYPEEPFNFN   T E QED  KVLSGL+KYTKYDLVSAVKRQ PFFYQV
Sbjct: 121 SRETEEVWNELYPEEPFNFN--STSESQEDVSKVLSGLEKYTKYDLVSAVKRQGPFFYQV 180

Query: 181 SRPHMDNEVFLEEAVARYKGFLYLIKSNRERSIKRFCVPTYDIDLIWHTHQLHPISYCKD 240
           SRPHM NE+FL+EAVARYKGFLYLIKSNRE+S+KRFCVPTYDIDLIWH+HQLHP+SYCKD
Sbjct: 181 SRPHMGNEIFLQEAVARYKGFLYLIKSNREKSLKRFCVPTYDIDLIWHSHQLHPLSYCKD 240

Query: 241 LKNLLGMVLEHDDMDSDRTKGKKLDTGFSGTTKQWEDTFGTRYWKAGAMYRGNSPSPLLL 300
           LK +LG+VLEHDD DSDRTKGKKLD GFSGTTKQWEDTFGTRYW+AG MYRGN PSPL+L
Sbjct: 241 LKKILGVVLEHDDTDSDRTKGKKLDNGFSGTTKQWEDTFGTRYWRAGVMYRGNCPSPLVL 300

Query: 301 NSYSGSTNLIKDDMVSSQECQNTVHLPELKTVEVLLEFVEVKNKPEGLKGNFFVQFMKSQ 360
           N YS STN I+DD+VSSQ+CQN VHLPELKTVEVLLEFVEVKN PEGLKGN FVQFMKSQ
Sbjct: 301 NPYSASTNTIRDDVVSSQDCQNIVHLPELKTVEVLLEFVEVKNIPEGLKGNLFVQFMKSQ 360

Query: 361 PDAIFSAKRKLSILSETGVKQVASFQCEPKGDLQFELICNRASSIPITRTSLTLGSISLP 420
           PDAIF++K KLSILSETGVKQVASFQCEPKGDL+ ELIC R+S+IPITRT LTLGS+SLP
Sbjct: 361 PDAIFNSKWKLSILSETGVKQVASFQCEPKGDLKLELICCRSSNIPITRTPLTLGSVSLP 420

Query: 421 --LHDILVPTSKLSMERWLELKPVSDHVSSKPISLRVAISFTVPHPAPRELHMFFSRELS 480
             L DILVP+SKLSMERWLELKPVSDHVSSKPISLRVAISFTVPHPA RELHMF SRELS
Sbjct: 421 LGLDDILVPSSKLSMERWLELKPVSDHVSSKPISLRVAISFTVPHPAQRELHMFSSRELS 480

Query: 481 RWTSFLPSCTKMQHSKGWMQVTDEAGNEVISLQLRDSLKEKVGKNSIPTSKEVIGIKMSG 540
           RWTSFLPSCT+MQ SKGW QVTDEAGN+VI+LQLRDSLK KVGKN+IPTSKEVIGIKMSG
Sbjct: 481 RWTSFLPSCTRMQRSKGWTQVTDEAGNDVINLQLRDSLKAKVGKNNIPTSKEVIGIKMSG 540

Query: 541 ESSLLAEFVKTGWSLIDGQWFLDFQKKSNEDDHLFKLVGKRLVCMVPVVVKFFQGSKLDY 600
           ES  LAEFVKTGWSLIDGQW LD Q+KS+EDDHLFKLVGKRL       V+F+QG KLDY
Sbjct: 541 ESCHLAEFVKTGWSLIDGQWLLDLQQKSSEDDHLFKLVGKRL-------VRFYQGRKLDY 600

Query: 601 EPKNCGKHNHDSDFVTAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMVVPGIMTAFLLLH 660
           EPKNC KHN + DF++AIEFSAEYPYGRAVALFDLKFGVIKIKEEWM+VPGI+TAFLLLH
Sbjct: 601 EPKNCEKHNREQDFMSAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMLVPGILTAFLLLH 660

Query: 661 ARKKKGYNGLTVSEENLEVAPVPESVHTSGKEENNMNLINVSLSSTDLKVNVSEGV---- 720
             KKKGYN LTV+EE LE     E V  SGKEE  MNL N+S SSTDLK NVSEG+    
Sbjct: 661 TWKKKGYNSLTVNEEKLEADTDHERVQKSGKEEMTMNLTNLSSSSTDLKANVSEGIAVVP 720

Query: 721 -----SNENVTIPLKEDELSSHCGRCDAGGYTAGSGNMVKSGRCGGCGAGGCGGGCGNMV 780
                S EN+T+ L +D+LSSHC +      + G GNMVKSG CGGCGAGGCG  CGNMV
Sbjct: 721 IKEEDSKENITMSLNQDKLSSHCDQNTV--KSGGRGNMVKSGGCGGCGAGGCGSECGNMV 780

Query: 781 KSGGCGGCGAGGCGGGCGNIINSGGCGGCGAGGCGGGCGNIV-NSGGCGGCGAGGCGGGC 840
           KSGGCG    GGCGGGCGNI+NSGGCGGCG        G I+  SGGCG  G+GGC    
Sbjct: 781 KSGGCG----GGCGGGCGNIVNSGGCGGCG--------GEILAKSGGCG--GSGGC---- 838

Query: 841 GNIVNSGSSVGGIVAKSGGCGGGCGGFGHKTAQPNEGNQSDAII 873
                            GGCGGGCG FG+KTAQPNEG Q+D  I
Sbjct: 841 -----------------GGCGGGCGSFGYKTAQPNEGKQTDGSI 838

BLAST of Cp4.1LG14g08780 vs. NCBI nr
Match: gi|590706353|ref|XP_007047699.1| (Uncharacterized protein TCM_000919 [Theobroma cacao])

HSP 1 Score: 932.2 bits (2408), Expect = 6.7e-268
Identity = 504/861 (58.54%), Positives = 616/861 (71.54%), Query Frame = 1

Query: 1   MEKNQELEWAEAQRIEIGVDLVAAAKRQLQFLSAVDRNRFLYEGPSLERAIYRYNAYWLP 60
           +EK QELEW EAQ+ EI +DLVAAAK+QL+FL+AVDRNR+LY+GP+L+RAIYRYNAYWLP
Sbjct: 3   LEKEQELEWIEAQKTEISLDLVAAAKKQLEFLAAVDRNRWLYDGPTLQRAIYRYNAYWLP 62

Query: 61  LLAKHSESPLFEGPLAVPFDCEWIWHCHRLNPVRYKSECEELYGKILDNSNVVSTLGSSC 120
           LLAK+ +    EGPL VP DCEWIWHCHRLNPVRYKS+CEELYG+ILDNSNVVS+L  +C
Sbjct: 63  LLAKYHKEEFSEGPLVVPLDCEWIWHCHRLNPVRYKSDCEELYGRILDNSNVVSSLQCTC 122

Query: 121 LRETEEVWNELYPEEPFNFNFNPTGEYQEDALKVLSGLQKYTKYDLVSAVKRQSPFFYQV 180
            R+TEE+WN LYP+EP++F+   T    E+A + LSGL+K+TKYDL+SAVKRQSPFFYQV
Sbjct: 123 KRQTEEIWNRLYPDEPYDFDL--TKALSENASQTLSGLEKHTKYDLISAVKRQSPFFYQV 182

Query: 181 SRPHMDNEVFLEEAVARYKGFLYLIKSNRERSIKRFCVPTYDIDLIWHTHQLHPISYCKD 240
           SR HM N++F+E AVARYKGFL+LIK NRERSIKRFCVPTYDIDLIWHTHQLHP+SYCKD
Sbjct: 183 SRAHMHNDIFIEGAVARYKGFLHLIKRNRERSIKRFCVPTYDIDLIWHTHQLHPVSYCKD 242

Query: 241 LKNLLGMVLEHDDMDSDRTKGKKLDTGFSGTTKQWEDTFGTRYWKAGAMYRGNSPSPLLL 300
           L   +G +LEHDD DSDRTKGKKLD GFSGTTKQWE+TFG RYWKAGAMYRG+SPSPL  
Sbjct: 243 LNKAVGKILEHDDTDSDRTKGKKLDVGFSGTTKQWEETFGIRYWKAGAMYRGSSPSPLTA 302

Query: 301 NSYSGSTNLIKDDMVSSQECQNTVHLPELKTVEVLLEFVEVKNKPEGLKGNFFVQFMKSQ 360
                 T  +  ++ ++  CQ  + LPE+K VEVLLEFV VKN P+  KGN FV F K+Q
Sbjct: 303 IPCMPDT--LSKEVDATNACQKIIKLPEMKVVEVLLEFVGVKNLPDEKKGNLFVLFSKTQ 362

Query: 361 PDAIFSAKRKLSILSETGVKQVASFQCEPKGDLQFELICNRASSIPITRTSLTLGSISLP 420
           PD  F AK+KL+ILS++G KQVASFQCEP G+L FEL+ + AS++P T+T  TLG+ SL 
Sbjct: 363 PDVFFKAKQKLTILSKSGQKQVASFQCEPNGELLFELVSHSASNLPGTKTCKTLGTASLS 422

Query: 421 LHDILVPTSKLSMERWLELKPVSDHVSSKPISLRVAISFTVPHPAPRELHMFFSRELSRW 480
           L + LVP SKL++E+WL+L P S + SSKPI LRVA+SFTVP  AP  LHM  SR  S+ 
Sbjct: 423 LREFLVPVSKLAVEKWLDLMPSSGNGSSKPIGLRVAVSFTVPAIAPHMLHMVRSRPFSKG 482

Query: 481 TSF-LPSCTKMQHSKGWMQVTDEAGNEVISLQLRDSLKEKVGKNSIPTSKEVIGIKMSGE 540
           + F LP   ++Q  KG  +V DE   EVI LQ+ +S K K+ K S  + K+VIG    GE
Sbjct: 483 SCFQLPLAGRVQAGKGCTRVIDETQAEVIRLQMSESGKAKM-KGSCLSRKQVIGTTKHGE 542

Query: 541 SSLLAEFVKTGWSLIDGQWFLDFQKKSNEDDHLFKLVGKRLVCMVPVVVKFFQGSKLDYE 600
           +  LAEFV T WSL+D QW L   ++ +E  HLF L G R+       VK F G KLDYE
Sbjct: 543 THALAEFVGTRWSLMDSQWVLQHSEEVSEHGHLFDLKGNRM-------VKVFLGRKLDYE 602

Query: 601 PKNCGKHNHDSDFVTAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMVVPGIMTAFLLLHA 660
           PK+C K  ++ DF+TA+EFSAE+PYG AVAL DLK G +K KE+W V+PG+++AF+L H 
Sbjct: 603 PKHCEKKRNEGDFMTAVEFSAEHPYGTAVALLDLKSGCLKAKEKWFVLPGLISAFILSHI 662

Query: 661 RKKKGYNGLTVSEENLEVAPVPESVHTSGKEENNMNLINVSLSSTDLKVNVSEGVSNENV 720
            K+KG+ GLT+  +N +       V     E +++N      +S + +VN+   V+ EN 
Sbjct: 663 LKRKGHIGLTIDVKNTKEVDSATEV-----ENDHVN----PTASIETEVNLDGDVTLENA 722

Query: 721 TIPLKEDELSSHCGRCDAGGYTAGSGNMVKSGRCGGCGAGGCGGGCGNMVKSGGCGGCGA 780
            IP K+         C+ G Y    GN VKSG CGGCGA      CGNMVKSGGCGGC A
Sbjct: 723 MIPKKDS--------CN-GDYGGEKGNEVKSGGCGGCGA-----ECGNMVKSGGCGGCSA 782

Query: 781 G-------GCGGGCGNIINSGGCGGCGAGGCGGGCGNIVNSGGCGGCGAGGCGGGCGNIV 840
           G       GCGGGCG+++NS GCG     GCGGGCG+ VNS GCGGC  GGCG GCG+ V
Sbjct: 783 GCSGGCGSGCGGGCGSMVNSSGCG----AGCGGGCGSRVNSSGCGGC--GGCGAGCGSRV 814

Query: 841 NSGSSVGGIVAKSGGCGGGCG 854
            S           GGC   CG
Sbjct: 843 KS--------TGCGGCSLSCG 814

BLAST of Cp4.1LG14g08780 vs. NCBI nr
Match: gi|470103796|ref|XP_004288315.1| (PREDICTED: uncharacterized protein LOC101307152 isoform X1 [Fragaria vesca subsp. vesca])

HSP 1 Score: 929.1 bits (2400), Expect = 5.7e-267
Identity = 517/873 (59.22%), Positives = 610/873 (69.87%), Query Frame = 1

Query: 1   MEKNQELEWAEAQRIEIGVDLVAAAKRQLQFLSAVDRNRFLYEGPSLERAIYRYNAYWLP 60
           MEK QELEW++AQ I I VDLVAAAK+QLQFL+AVDRNR+LYEG +L+RAIYRYNA WLP
Sbjct: 5   MEKEQELEWSKAQGIGISVDLVAAAKQQLQFLAAVDRNRYLYEGKALQRAIYRYNACWLP 64

Query: 61  LLAKHSESPLFEGPLAVPFDCEWIWHCHRLNPVRYKSECEELYGKILDNSNVVSTLGSSC 120
           LLAKHSES +FEGPL VP DCEWIWHCHRLNPVRYK++CEELYGKILDNSNVVS++  SC
Sbjct: 65  LLAKHSESQVFEGPLVVPLDCEWIWHCHRLNPVRYKTDCEELYGKILDNSNVVSSVQGSC 124

Query: 121 LRETEEVWNELYPEEPFNFNFNPTGEYQEDALKVLSGLQKYTKYDLVSAVKRQSPFFYQV 180
             +TEE+WN LYPEEP+NFN        ED  +  S L   TKYDLVSAVKRQ PFFYQV
Sbjct: 125 KSKTEEIWNCLYPEEPYNFNLQKA--LSEDISERNSKLDNCTKYDLVSAVKRQYPFFYQV 184

Query: 181 SRPHMDNEVFLEEAVARYKGFLYLIKSNRERSIKRFCVPTYDIDLIWHTHQLHPISYCKD 240
           S PHM+++++LE AV+RYKGFL+LIKSN E+S++RFCVPTYDIDLIWHTHQLHP+SYCKD
Sbjct: 185 SSPHMNHDLYLEAAVSRYKGFLHLIKSNNEKSLRRFCVPTYDIDLIWHTHQLHPVSYCKD 244

Query: 241 LKNLLGMVLEHDDMDSDRTKGKKLDTGFSGTTKQWEDTFGTRYWKAGAMYRGNSPSPLLL 300
           L  LLG VLEHDDMDSDRTKGKKLDTGFSGTTKQWE+ FGTRYW+AGAMYRG++PSPL  
Sbjct: 245 LHELLGKVLEHDDMDSDRTKGKKLDTGFSGTTKQWEEAFGTRYWRAGAMYRGSAPSPLTT 304

Query: 301 NSYSGSTNLIKDDMVSSQECQNTVHLPELKTVEVLLEFVEVKNKPEGLKGNFFVQFMKSQ 360
             Y   +N+I  D+ +  + Q  + LPE+K VEVLLEF+EVKN PEG KG  F  F K+ 
Sbjct: 305 TPY--QSNVISKDVTAHPDLQKVIELPEVKAVEVLLEFLEVKNLPEGHKGMLFASFNKTT 364

Query: 361 PDAIFSAKRKLSILSETGVKQVASFQCEPKGDLQFELICNRASSIPITRTSLTLGSISLP 420
            D  F+AKR+LSI SE G KQVASFQCEP G+L FEL+ +  S IP+ RT  TLGS S  
Sbjct: 365 QDIFFNAKRRLSIFSEFGEKQVASFQCEPTGELLFELMSHSPSQIPMKRTYKTLGSASFS 424

Query: 421 LHDILVPTSKLSMERWLELKPVSDHVSSKPISLRVAISFTVPHPAPRELHMFFSRELSRW 480
           L D L+P SKL +E+WLE+ P S+  + KPI LR A+SFT+P  A   LH+  SR  S+ 
Sbjct: 425 LQDCLLPPSKLYVEKWLEMVPSSEVGNLKPIYLRFAMSFTIPAIAQHTLHIVRSRLFSKS 484

Query: 481 TSFLPSCTKMQHSKGWMQVTDEAGNEVISLQLRDSLKEKVGKNSIPTSKEVIGIKMSGES 540
           + F P   K Q +K W QV DE G EV+ LQ+RD+  EKV   S+P  KEV+GI  SG+ 
Sbjct: 485 SCFFPFAGKNQDAKSWTQVIDETGTEVLRLQMRDAEMEKVKGISVP-KKEVVGITKSGKI 544

Query: 541 SLLAEFVKTGWSLIDGQWFLDFQKKSNEDDHLFKLVGKRLVCMVPVVVKFFQGSKLDYEP 600
             LAE V TGWSLID  W L   ++ N + HLF L GK++       VKFF G KLDYEP
Sbjct: 545 CTLAECVGTGWSLIDSHWSL--HREKNSEGHLFLLKGKKM-------VKFFPGRKLDYEP 604

Query: 601 KNCGK----HNHDSDFVTAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMVVPGIMTAFLL 660
           K C K    +     F+T +EFSAE PYG+AVAL DLK G IK+KEE + VPGI+ AF+L
Sbjct: 605 KQCEKLTSENKTQQHFMTLVEFSAEDPYGKAVALLDLKSGCIKVKEESITVPGIIMAFML 664

Query: 661 LHARKKKGYNGLTVSEENLEVAPVPESVHTSGKEENNMNLINVSLSSTDLKVNVSEGVSN 720
            +  KK+ Y G  V+    E   V E ++ + +E    NL +   S   LK  V EG   
Sbjct: 665 SNKLKKERYGGFAVNA--AEKGSVEEEINENPEEGKETNLSSSGASEVKLKSEVVEG--- 724

Query: 721 ENVTIPLKEDELSSHCGR-CDAGGYTAGSGNMVKSGRCGGCGAGGCGGGCGNMVKSGGCG 780
            NV    K       CG  C     TAGS         GGCG+ GCGGGCG M KS GCG
Sbjct: 725 -NVVTSQKGGGCGGACGSGCGNATRTAGSAG------SGGCGS-GCGGGCGTMEKSSGCG 784

Query: 781 GCGAGGCGGGCGNIINSGGCGGCGAGGCGGGCGNIVNSGGCGGCGAGGCGGGCGNIVNSG 840
                GCGGGCGN++ SGGCGGCGA GCGGGCGNI+ SGGCGGCG GGCGGGCGN++ SG
Sbjct: 785 ----SGCGGGCGNLVQSGGCGGCGA-GCGGGCGNILKSGGCGGCG-GGCGGGCGNMLKSG 839

Query: 841 SSVGGIVAKSGGCGGGCGGF--GHKTAQPNEGN 867
              G     SGGCGGGCG    G+   +   GN
Sbjct: 845 GCGG-----SGGCGGGCGSLFKGNGLYENTSGN 839

BLAST of Cp4.1LG14g08780 vs. NCBI nr
Match: gi|764517300|ref|XP_011466594.1| (PREDICTED: uncharacterized protein LOC101307152 isoform X2 [Fragaria vesca subsp. vesca])

HSP 1 Score: 922.5 bits (2383), Expect = 5.3e-265
Identity = 517/879 (58.82%), Positives = 610/879 (69.40%), Query Frame = 1

Query: 1   MEKNQELEWAEAQRIEIGVDLVAAAKRQLQFLSAVDRNRFLYEGPSLERAIYRYNAYWLP 60
           MEK QELEW++AQ I I VDLVAAAK+QLQFL+AVDRNR+LYEG +L+RAIYRYNA WLP
Sbjct: 5   MEKEQELEWSKAQGIGISVDLVAAAKQQLQFLAAVDRNRYLYEGKALQRAIYRYNACWLP 64

Query: 61  LLAKHSESPLFEGPLAVPFDCEWIWHCHRLNPVRYKSECEELYGKILDNSNVVSTLGSSC 120
           LLAKHSES +FEGPL VP DCEWIWHCHRLNPVRYK++CEELYGKILDNSNVVS++  SC
Sbjct: 65  LLAKHSESQVFEGPLVVPLDCEWIWHCHRLNPVRYKTDCEELYGKILDNSNVVSSVQGSC 124

Query: 121 LRETEEVWNELYPEEPFNFNFNPTGEYQEDALKVLSGLQKYTKYDLVSAVKRQSPFFYQV 180
             +TEE+WN LYPEEP+NFN        ED  +  S L   TKYDLVSAVKRQ PFFYQV
Sbjct: 125 KSKTEEIWNCLYPEEPYNFNLQKA--LSEDISERNSKLDNCTKYDLVSAVKRQYPFFYQV 184

Query: 181 SRPHMDNEVFLEEAVARYKGFLYLIKSNRERSIKRFCVPTYDIDLIWHTHQLHPISYCKD 240
           S PHM+++++LE AV+RYKGFL+LIKSN E+S++RFCVPTYDIDLIWHTHQLHP+SYCKD
Sbjct: 185 SSPHMNHDLYLEAAVSRYKGFLHLIKSNNEKSLRRFCVPTYDIDLIWHTHQLHPVSYCKD 244

Query: 241 LKNLLGMVLEHDDMDSDRTKGKKLDTGFSGTTKQWEDTFGTRYWKAGAMYRGNSPSPLLL 300
           L  LLG VLEHDDMDSDRTKGKKLDTGFSGTTKQWE+ FGTRYW+AGAMYRG++PSPL  
Sbjct: 245 LHELLGKVLEHDDMDSDRTKGKKLDTGFSGTTKQWEEAFGTRYWRAGAMYRGSAPSPLTT 304

Query: 301 NSYSGSTNLIKDDMVSSQECQNTVHLPELKTVEVLLEFVEVKNKPEGLKGNFFVQFMKSQ 360
             Y   +N+I  D+ +  + Q  + LPE+K VEVLLEF+EVKN PEG KG  F  F K+ 
Sbjct: 305 TPY--QSNVISKDVTAHPDLQKVIELPEVKAVEVLLEFLEVKNLPEGHKGMLFASFNKTT 364

Query: 361 PDAIFSAKRKLSILSETGVKQVASFQCEPKGDLQFELICNRASSIPITRTSLTLGSISLP 420
            D  F+AKR+LSI SE G KQVASFQCEP G+L FEL+ +  S IP+ RT  TLGS S  
Sbjct: 365 QDIFFNAKRRLSIFSEFGEKQVASFQCEPTGELLFELMSHSPSQIPMKRTYKTLGSASFS 424

Query: 421 LHDILVPTSKLSMERWLELKPVSDHVSSKPISLRVAISFTVPHPAPRELHMFFSRELSRW 480
           L D L+P SKL +E+WLE+ P S+  + KPI LR A+SFT+P  A   LH+  SR  S+ 
Sbjct: 425 LQDCLLPPSKLYVEKWLEMVPSSEVGNLKPIYLRFAMSFTIPAIAQHTLHIVRSRLFSKS 484

Query: 481 TSFLPSCTKMQHSKGWMQVTDEAGNEVISLQL------RDSLKEKVGKNSIPTSKEVIGI 540
           + F P   K Q +K W QV DE G EV+ LQ+      RD+  EKV   S+P  KEV+GI
Sbjct: 485 SCFFPFAGKNQDAKSWTQVIDETGTEVLRLQMSLNSIYRDAEMEKVKGISVP-KKEVVGI 544

Query: 541 KMSGESSLLAEFVKTGWSLIDGQWFLDFQKKSNEDDHLFKLVGKRLVCMVPVVVKFFQGS 600
             SG+   LAE V TGWSLID  W L   ++ N + HLF L GK++       VKFF G 
Sbjct: 545 TKSGKICTLAECVGTGWSLIDSHWSL--HREKNSEGHLFLLKGKKM-------VKFFPGR 604

Query: 601 KLDYEPKNCGK----HNHDSDFVTAIEFSAEYPYGRAVALFDLKFGVIKIKEEWMVVPGI 660
           KLDYEPK C K    +     F+T +EFSAE PYG+AVAL DLK G IK+KEE + VPGI
Sbjct: 605 KLDYEPKQCEKLTSENKTQQHFMTLVEFSAEDPYGKAVALLDLKSGCIKVKEESITVPGI 664

Query: 661 MTAFLLLHARKKKGYNGLTVSEENLEVAPVPESVHTSGKEENNMNLINVSLSSTDLKVNV 720
           + AF+L +  KK+ Y G  V+    E   V E ++ + +E    NL +   S   LK  V
Sbjct: 665 IMAFMLSNKLKKERYGGFAVNA--AEKGSVEEEINENPEEGKETNLSSSGASEVKLKSEV 724

Query: 721 SEGVSNENVTIPLKEDELSSHCGR-CDAGGYTAGSGNMVKSGRCGGCGAGGCGGGCGNMV 780
            EG    NV    K       CG  C     TAGS         GGCG+ GCGGGCG M 
Sbjct: 725 VEG----NVVTSQKGGGCGGACGSGCGNATRTAGSAG------SGGCGS-GCGGGCGTME 784

Query: 781 KSGGCGGCGAGGCGGGCGNIINSGGCGGCGAGGCGGGCGNIVNSGGCGGCGAGGCGGGCG 840
           KS GCG     GCGGGCGN++ SGGCGGCGA GCGGGCGNI+ SGGCGGCG GGCGGGCG
Sbjct: 785 KSSGCG----SGCGGGCGNLVQSGGCGGCGA-GCGGGCGNILKSGGCGGCG-GGCGGGCG 844

Query: 841 NIVNSGSSVGGIVAKSGGCGGGCGGF--GHKTAQPNEGN 867
           N++ SG   G     SGGCGGGCG    G+   +   GN
Sbjct: 845 NMLKSGGCGG-----SGGCGGGCGSLFKGNGLYENTSGN 845
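
Each alignment block above has three rows: the query, a match line (a letter for an identical residue, "+" for a conservative substitution, a space otherwise), and the subject. As a rough sketch, the per-block counts can be recomputed from those rows as shown here. The function name `alignment_stats` and the toy input are hypothetical, and dividing identities by the number of alignment columns is an assumption; this page does not state how its Identity values were calculated.

```python
def alignment_stats(query: str, midline: str, sbjct: str) -> dict:
    """Summarise one BLAST-style alignment block (three equal-length rows).

    Assumption: percent identity = identical columns / total columns * 100.
    """
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("query, midline and subject rows must have equal length")

    length = len(query)
    # A column is a gap if either sequence shows '-' at that position.
    gaps = sum(1 for q, s in zip(query, sbjct) if q == "-" or s == "-")
    # A column is an identity if both residues match and neither is a gap.
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    # Every non-blank midline character marks an identity or a '+' positive.
    positives = sum(1 for m in midline if m != " ")

    percent_identity = round(100.0 * identities / length, 2) if length else 0.0
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "gaps": gaps,
        "percent_identity": percent_identity,
    }


# Toy rows (not taken from the alignment above), already stripped of the
# "Query:" / "Sbjct:" labels and the coordinates.
stats = alignment_stats("ACD-FK", "AC  F+", "ACEGFR")
print(stats)
```

The rows must be trimmed to the aligned residues only and must keep their original column positions, because the match line is interpreted by position.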

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
GRDP1_ARATH  2.3e-220  50.29     Glycine-rich domain-containing protein 1 OS=Arabidopsis thaliana GN=GRDP1 PE=2 S...
GRDP2_ARATH  8.4e-210  49.53     Glycine-rich domain-containing protein 2 OS=Arabidopsis thaliana GN=GRDP2 PE=2 S...
Match Name        E-value   Identity  Description
A0A0A0L5L1_CUCSA  0.0e+00   78.62     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G033730 PE=4 SV=1
A0A061DIX2_THECC  4.7e-268  58.54     Uncharacterized protein OS=Theobroma cacao GN=TCM_000919 PE=4 SV=1
A9YWR4_MEDTR      7.8e-263  57.74     DNA-binding protein, putative OS=Medicago truncatula GN=MTR_5g030890 PE=4 SV=1
V4SFT2_9ROSI      5.0e-262  58.16     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10024875mg PE=4 SV=1
I1J4X2_SOYBN      3.3e-261  56.96     Uncharacterized protein OS=Glycine max GN=GLYMA_01G019800 PE=4 SV=1
Match Name   E-value   Identity  Description
AT2G22660.2  1.3e-221  50.29     Protein of unknown function (duplicated DUF1399)
AT4G37900.1  4.7e-211  49.53     Protein of unknown function (duplicated DUF1399)
AT1G56230.1  2.0e-31   25.27     Protein of unknown function (DUF1399)
AT4G37682.1  1.0e-08   54.72     Protein of unknown function (DUF1399)
Match Name                           E-value   Identity  Description
gi|659096009|ref|XP_008448876.1|     0.0e+00   79.66     PREDICTED: uncharacterized protein LOC103490909 [Cucumis melo]
gi|449465866|ref|XP_004150648.1|     0.0e+00   78.62     PREDICTED: uncharacterized protein LOC101219844 [Cucumis sativus]
gi|590706353|ref|XP_007047699.1|     6.7e-268  58.54     Uncharacterized protein TCM_000919 [Theobroma cacao]
gi|470103796|ref|XP_004288315.1|     5.7e-267  59.22     PREDICTED: uncharacterized protein LOC101307152 isoform X1 [Fragaria vesca subsp...
gi|764517300|ref|XP_011466594.1|     5.3e-265  58.82     PREDICTED: uncharacterized protein LOC101307152 isoform X2 [Fragaria vesca subsp...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR009836  GRDP-like
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG14g08780.1  Cp4.1LG14g08780.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                              Source   Source Term    Source Description   Alignment
IPR009836  Glycine-rich domain-containing protein-like  PFAM     PF07173        DUF1399              coord: 11..96, score: 7.2E-10; coord: 94..240, score: 3.7
None       No IPR available                             PANTHER  PTHR34365      FAMILY NOT NAMED     coord: 1..853, score: 1.1E
None       No IPR available                             PANTHER  PTHR34365:SF2  SUBFAMILY NOT NAMED  coord: 1..853, score: 1.1E

The following gene(s) are paralogous to this gene:

None