Cp4.1LG01g18370 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g18370
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Zein-binding domain-containing protein
Location: Cp4.1LG01 : 15665425 .. 15670092 (+)
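
The location field can be read programmatically to get the genomic span of the feature. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this record does not state explicitly.

```python
import re


def parse_location(text):
    """Split a 'scaffold : start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Cp4.1LG01 : 15665425 .. 15670092 (+)")
length = span_length(start, end)
print(chrom, strand, length)  # Cp4.1LG01 + 4668
```

Under that assumption the gene spans 4,668 bp on the forward strand of Cp4.1LG01.
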
The following sequences are available for this feature:

Gene sequence (with intron)

TAGTCTAAATAAAAAACAACTTCCCTTACTTCTTCCTTCCCCTCTTCATCTTCTTCTGCTTGTCTCTCTCTCCTCTCCGTTTTCCACCATTTTTCTGCGATAAACATCTATTTTTTGCCCAAATCCTCCATTTTTGGCTCCTCTGTACATACCCACCAAACAACAAACACAAAACTCCACCATCGTCCATGGCTGCCAACAGATTCGCCACCCTCTTACACAGAAACTCCAACAAGATCACCCTTATTTTAGTCTACGCCCTTCTCGAATGGGTTCTCATCTTTCTTCTTCTTCTTCAAGCTCTTTTTTCTTACCTGATTATTAAATTTGCTGAGTTGTTTGGTTTGAAGCGCCCCTGTTTGTGGTGTTCTAGAGTCGACCATGTGTTCGAGCCTGGGAAGAAGTTTTCTTACAGAGATCTTCTTTGTGAACCTCATGCTATGGAGATTTCTAATCTGGGTTACTGTTCGAATCATCGCAAACTTACTGAGGTTCGAGATTTGTGCGAGGATTGCTTCTCCTCTTCTAATCCCAACGAGTTCTATCAGATTCCTAAGAACTTTGCCTTTTTTGGTGATGAGAAGGAGGATTTTAGGTGCTGTTCTTGCTGTGGGGAGAGCTTGAAGAATCGGTTGTTTTCCCCTTGTATTTTGATTAAGCCCAATTGGGGGGATTTGGATTATGCCCACAAGGGGAATTTGATTTCTGAAGCGGACATTGATGTTCAATCTGATGAAATCCATGCTTCCCCCACGGAAGATATCATCGGAAATAGGGAAATCTCCATTGTTTCCGGTGGGGAACAGGCTGAGAAGAACTCCGGTTGCTCTGTTTGTGGTTGTTGCTGTAAAGTTTCGGCGGTTCATGAGGAGGAGAAGGAGGAGGAGGATAAAGCTAAAATGGGTGGTGAAAAGGACGGAGATTTTCTTGAACTGGCTGAAGATCTGAGCTCTCTTAATCACAAAACTGTTCAACTCGGTTGTGTGAGAGAGAATGAATCGGCTGAGACTGCCCCTCATCATCTTGAGTTTTACATTGACCGGGGCAATGATCGGCGGTTGATTCCAGTTGATTTGATCGATTTTTCAGCCTCCGATCACAACAACAACGAAAGCAATATCCTAAGTTCAGTGAAAGATGAGGAACAAGAACAAGAACCAGAACCAGAACCAGAACAAGAACAAGAACAAGAACAAGAACAAGATCAAGAACAACAACAAGAGGATTGTGGGAATGAAGATGTTGTTCTGGATTTTGGTTCCAACTTTGAGAAGCAGGGACAAGATGTGACAGAAGATTGGGAAGTTATTTCAGGAGAGAGATTGGCAGAGTTTCTCTCTGTTTCTCTCCATGAGAGCAAGCAGAAAGTTGCAGAGGTGGAAGCCATGAAGGTGGAGGAGGGTAGCACAAGGGCCTCTGGGTTGGGTTCAGATGAAGATCCATCAATGGAAGTAGAAGAGCAAGAAATAGAAGAACAAGAGCAAGAAATAGAAGAACAAGAGCAACAAATAGAAGAACAACAGCAAGAAATAGAAGAACAAGAGCAAGAAATAGAAGAAGCTGAAGCTTCCATTGGTGAAGCAATTCAAGCTCCAGCCATTGATGATGCTCACGAAGAAGACCTTGCAGAATTGGTGCTAGATTCAGATCTTCATCAAGGTACTATTCTAGACTAGACACTACATTTCAGTATATGATGTTATGTTCTTTTGAAGATCAAATGTGAAAATACCTGAGAAATTTTGTGCTTATTTTTGTTCTTGTTTGATCTTGTTTGATCTCTTTCAATAAATTAGATATTCACGAGTGGAATGATGAACATGAAGTAGAGATTTCAATCGGGACGGATATTCCCGATCACGAACCGATCGATGAGATTCAAACTCAAAACGACATTCCTTCACATCCCAATGTTCAAGAAGATCCTTCCCCAACTTCAACATTGGTAGTTGATGACAATATGCAAGGTAGCTACCATTCTTTCCTTTGCTTTCCCACAT
CCATATAAGGAGTTTAGCTTAATCTGATCATTCGTGTTTTGCTGCATTCCACATTTAGATTATAACAAAGCTGAGAAATCCGAGGAAGCTGAGGACACTAAGGAAGAGGTAGAGTTCAAGATATTGTCCGTGGAAACGAGTTCTCAACCGTCAGACAATCACAAACCGTCGAGGTCTGAGTTCAATGAGAATGAGGAAGAAGATAAAGTTCCTGATACACCAACTTCAATGGATAGTCTCCACCAGCTACACAAGAAGCTGCTATTACTAGACAGAAAAGAATCTGGAGCCGAAGAGTCGTTGGATGGGAGCGTGATTAGCGAGACTGAAAGCGGGGATGGAGTATTGACGATTGAGAAATTGAAGTCGGCGTTGAGAACCGAACGAAAGGTTTTGAATGCCTTATATTCAGAGCTAGAAGAAGAGAGAAGTGCTTCTGCCATAGCAGCCAACCAGACAATGGCAATGATAAATAGGCTTCAAGAGGAGAAAGCAAGCATGCAAATGGAAGCTTTACAGTACCAAAGAATGATGGAAGAACAATCTGAATATGACCAGGAAGCTTTACAGCTTTTGAACGAGCTCGTGGTGAAGAGAGAAAAGGAAAAGCAAGAGCTCGAGAAAGGAATCGAAGTTTACCGAAAAAAGCTTCAAGATTATGAAGCCAAAGAGAAAATGGCATTGTTAAGGAGCAGGAAAGAAGGGAGCATCCAAAGTAGAAATTCCTCGGTTTCTTGTAGCAATGCCGATGATAGCGATGGGCTATCGATCGATTTGAACACCGAGGCAAAGAAAGATGAAGATTTGTTTTGTAACCAAGAAACAAACAATCAAAACACCCCAGCTGAGGCAGTTCTTTATTTGGAAGAAACATTAGCAAACTTTGAGGAAGAAAGACTGTCCATTCTAGAGGAGCTGAAGATGTTGGAAGAGAAGCTCTTTACATTGAGTGATGAAGAACAACAATTTGATGACATTGAGCATTACAGTGAACATAATGGCAATGGCTACCATAAGAACTCGGATTCTGTTTCTGAAACAAATGGATTCGAAAACGGTCATCATGTCAAGGAAATGAATGGAAACCATCATCCAGGGAAGAGAACGATGAGCACGAAAGCCAAAAGACTTCTTCCTCTTTTCGACGATGCAGTCGATACAGACGTTGAAGATGTAACCATTGGAGAAGAACAAGGGTTCGACTCTGTTTCAATGCAGAAGTCCTTAGACAACAAATTCGACACGGAGTTCAGGAGGGTTGCTGTTGAGGAAGAAGTGGATCACGTGTATGAGAGATTACAAGCACTTGAAGCAGATAGGGAGTTTCTAAAGCATTGCATTGGCTCCCTAAGAAAAGGAGACAAAGGCTTAGAGCTTCTCCAAGAAATCTTACAGCATCTCCGAGATCTAAGGAGCGTCGATATCCAGTTGAAAAATATGGGAGACGGTGTCGTAGCATGATCGTGATCACATCCCCCAAACAAAATCCAAAACAGGTAAAAGCCTTCTGCATCATCCTTTGTTTATTTCATTTTAAACTCTGAAGATGCATCGTTATTCTCTATGGAAAACAAACTGAAAACTTGGTCGTTTGGAATTTGAACTGATGCAACAATCAATGATACAGAAACCCAAAACCCGTCAACCTCGCCGTGTCATCGTCATCGTCACCGTCGTCGACAGAATCAGAAAGGCTCCATATGGTGGAGCCTCCTGATAAGAAAACAAAAGTCCCCCCTCCCCCCTCCCCCCTCCCCCCTTCAAAGTTGGCCCAAATTTTTTGTGGGTTGTGTTCATGATGATCATCACCTTATGAGGGAAGGTGAATGAAGGTGTAGGTAGTCAGTCAGTGAGTCAGTGTATGATTTGTTTCATGTTTTTGCGAGGAAAGAGGCTGGCTTGCAAAGCTGGAGACACCACAGGCAAACACTCCAGTGCCAGCAACATATAGCTTAGAGTATCATTTAGACAGTTCTAGATTTAAGCACAATCTTTTCTTC
TTTTATTTTTTTTTTGCTGGGAAAGGGGAAGAAATTGGGTAAATAACCCTTCTTTTTTCCCTCCCTCCATCATTTGCATCATTTCATCATCAAATGGGGTTTGTTTGTTCTTTCTTTTCTTTTTTTATTCTCTTTTTCCTTACTATTTGAAAGAGTTTCTGTACACTACATCTTTATGTTTGTAAGTATATAAAAGTTGAGTGGTTTGTCTAAAACTGTTGTCATTCTGGGTTGTTCATAAAATTTTGAATCTTCTTGTTTCCTTCCGATCTCATATTTGTCGATAATTTCGTGTTTGTTGGGTTTGTAATGAAAAGTATAATGTTTTGAGAAAATCTTTCTTGATATATAATTGTAGAAGCTTTATATGAAGGGGAACAATGTAATGTAAAGCAAAGTTTGCTTTGACGGTTTATTTTGCTTCAAGTACAAAACAAGTCCAAAATGGCAAATAGGCACAGTCCAACCAAAAAANAAAATAAAGGGGCTATTTTGAAAAAAAAAAAAAGAAAAAAGAACAAAGTGGTTGGGGGGTGGGGACGTAAAATAAAATAAAGTGGCCAAATATGATTGTGGAAGAAAGCTTGCAAATATTTAGTGGCCAAATGCATTTATTCTTTTGGGTATTAATAGTTTAAGGCCTCTCATGTATTTACCAATTAACGTTC

mRNA sequence

TAGTCTAAATAAAAAACAACTTCCCTTACTTCTTCCTTCCCCTCTTCATCTTCTTCTGCTTGTCTCTCTCTCCTCTCCGTTTTCCACCATTTTTCTGCGATAAACATCTATTTTTTGCCCAAATCCTCCATTTTTGGCTCCTCTGTACATACCCACCAAACAACAAACACAAAACTCCACCATCGTCCATGGCTGCCAACAGATTCGCCACCCTCTTACACAGAAACTCCAACAAGATCACCCTTATTTTAGTCTACGCCCTTCTCGAATGGGTTCTCATCTTTCTTCTTCTTCTTCAAGCTCTTTTTTCTTACCTGATTATTAAATTTGCTGAGTTGTTTGGTTTGAAGCGCCCCTGTTTGTGGTGTTCTAGAGTCGACCATGTGTTCGAGCCTGGGAAGAAGTTTTCTTACAGAGATCTTCTTTGTGAACCTCATGCTATGGAGATTTCTAATCTGGGTTACTGTTCGAATCATCGCAAACTTACTGAGGTTCGAGATTTGTGCGAGGATTGCTTCTCCTCTTCTAATCCCAACGAGTTCTATCAGATTCCTAAGAACTTTGCCTTTTTTGGTGATGAGAAGGAGGATTTTAGGTGCTGTTCTTGCTGTGGGGAGAGCTTGAAGAATCGGTTGTTTTCCCCTTGTATTTTGATTAAGCCCAATTGGGGGGATTTGGATTATGCCCACAAGGGGAATTTGATTTCTGAAGCGGACATTGATGTTCAATCTGATGAAATCCATGCTTCCCCCACGGAAGATATCATCGGAAATAGGGAAATCTCCATTGTTTCCGGTGGGGAACAGGCTGAGAAGAACTCCGGTTGCTCTGTTTGTGGTTGTTGCTGTAAAGTTTCGGCGGTTCATGAGGAGGAGAAGGAGGAGGAGGATAAAGCTAAAATGGGTGGTGAAAAGGACGGAGATTTTCTTGAACTGGCTGAAGATCTGAGCTCTCTTAATCACAAAACTGTTCAACTCGGTTGTGTGAGAGAGAATGAATCGGCTGAGACTGCCCCTCATCATCTTGAGTTTTACATTGACCGGGGCAATGATCGGCGGTTGATTCCAGTTGATTTGATCGATTTTTCAGCCTCCGATCACAACAACAACGAAAGCAATATCCTAAGTTCAGTGAAAGATGAGGAACAAGAACAAGAACCAGAACCAGAACCAGAACAAGAACAAGAACAAGAACAAGAACAAGATCAAGAACAACAACAAGAGGATTGTGGGAATGAAGATGTTGTTCTGGATTTTGGTTCCAACTTTGAGAAGCAGGGACAAGATGTGACAGAAGATTGGGAAGTTATTTCAGGAGAGAGATTGGCAGAGTTTCTCTCTGTTTCTCTCCATGAGAGCAAGCAGAAAGTTGCAGAGGTGGAAGCCATGAAGGTGGAGGAGGGTAGCACAAGGGCCTCTGGGTTGGGTTCAGATGAAGATCCATCAATGGAAGTAGAAGAGCAAGAAATAGAAGAACAAGAGCAAGAAATAGAAGAACAAGAGCAACAAATAGAAGAACAACAGCAAGAAATAGAAGAACAAGAGCAAGAAATAGAAGAAGCTGAAGCTTCCATTGGTGAAGCAATTCAAGCTCCAGCCATTGATGATGCTCACGAAGAAGACCTTGCAGAATTGGTGCTAGATTCAGATCTTCATCAAGATATTCACGAGTGGAATGATGAACATGAAGTAGAGATTTCAATCGGGACGGATATTCCCGATCACGAACCGATCGATGAGATTCAAACTCAAAACGACATTCCTTCACATCCCAATGTTCAAGAAGATCCTTCCCCAACTTCAACATTGGTAGTTGATGACAATATGCAAGATTATAACAAAGCTGAGAAATCCGAGGAAGCTGAGGACACTAAGGAAGAGGTAGAGTTCAAGATATTGTCCGTGGAAACGAGTTCTCAACCGTCAGACAATCACAAACCGTCGAGGTCTGAGTTCAATGAGAATGAGGAAGAAGATAAAGTTCCTGATACACCAACTTCA
ATGGATAGTCTCCACCAGCTACACAAGAAGCTGCTATTACTAGACAGAAAAGAATCTGGAGCCGAAGAGTCGTTGGATGGGAGCGTGATTAGCGAGACTGAAAGCGGGGATGGAGTATTGACGATTGAGAAATTGAAGTCGGCGTTGAGAACCGAACGAAAGGTTTTGAATGCCTTATATTCAGAGCTAGAAGAAGAGAGAAGTGCTTCTGCCATAGCAGCCAACCAGACAATGGCAATGATAAATAGGCTTCAAGAGGAGAAAGCAAGCATGCAAATGGAAGCTTTACAGTACCAAAGAATGATGGAAGAACAATCTGAATATGACCAGGAAGCTTTACAGCTTTTGAACGAGCTCGTGGTGAAGAGAGAAAAGGAAAAGCAAGAGCTCGAGAAAGGAATCGAAGTTTACCGAAAAAAGCTTCAAGATTATGAAGCCAAAGAGAAAATGGCATTGTTAAGGAGCAGGAAAGAAGGGAGCATCCAAAGTAGAAATTCCTCGGTTTCTTGTAGCAATGCCGATGATAGCGATGGGCTATCGATCGATTTGAACACCGAGGCAAAGAAAGATGAAGATTTGTTTTGTAACCAAGAAACAAACAATCAAAACACCCCAGCTGAGGCAGTTCTTTATTTGGAAGAAACATTAGCAAACTTTGAGGAAGAAAGACTGTCCATTCTAGAGGAGCTGAAGATGTTGGAAGAGAAGCTCTTTACATTGAGTGATGAAGAACAACAATTTGATGACATTGAGCATTACAGTGAACATAATGGCAATGGCTACCATAAGAACTCGGATTCTGTTTCTGAAACAAATGGATTCGAAAACGGTCATCATGTCAAGGAAATGAATGGAAACCATCATCCAGGGAAGAGAACGATGAGCACGAAAGCCAAAAGACTTCTTCCTCTTTTCGACGATGCAGTCGATACAGACGTTGAAGATGTAACCATTGGAGAAGAACAAGGGTTCGACTCTGTTTCAATGCAGAAGTCCTTAGACAACAAATTCGACACGGAGTTCAGGAGGGTTGCTGTTGAGGAAGAAGTGGATCACGTGTATGAGAGATTACAAGCACTTGAAGCAGATAGGGAGTTTCTAAAGCATTGCATTGGCTCCCTAAGAAAAGGAGACAAAGGCTTAGAGCTTCTCCAAGAAATCTTACAGCATCTCCGAGATCTAAGGAGCGTCGATATCCAGTTGAAAAATATGGGAGACGGTGTCGTAGCATGATCGTGATCACATCCCCCAAACAAAATCCAAAACAGAGGCTGGCTTGCAAAGCTGGAGACACCACAGGCAAACACTCCAGTGCCAGCAACATATAGCTTAGAGTATCATTTAGACAGTTCTAGATTTAAGCACAATCTTTTCTTCTTTTATTTTTTTTTTGCTGGGAAAGGGGAAGAAATTGGGTAAATAACCCTTCTTTTTTCCCTCCCTCCATCATTTGCATCATTTCATCATCAAATGGGGTTTGTTTGTTCTTTCTTTTCTTTTTTTATTCTCTTTTTCCTTACTATTTGAAAGAGTTTCTGTACACTACATCTTTATGTTTGTAAGTATATAAAAGTTGAGTGGTTTGTCTAAAACTGTTGTCATTCTGGGTTGTTCATAAAATTTTGAATCTTCTTGTTTCCTTCCGATCTCATATTTGTCGATAATTTCGTGTTTGTTGGGTTTGTAATGAAAAGTATAATGTTTTGAGAAAATCTTTCTTGATATATAATTGTAGAAGCTTTATATGAAGGGGAACAATGTAATGTAAAGCAAAGTTTGCTTTGACGGTTTATTTTGCTTCAAGTACAAAACAAGTCCAAAATGGCAAATAGGCACAGTCCAACCAAAAAANAAAATAAAGGGGCTATTTTGAAAAAAAAAAAAAGAAAAAAGAACAAAGTGGTTGGGGGGTGGGGACGTAAAATAAAATAAAGTGGCCAAATATGATTGTGGAAGAAAGCTTGCAAATATTTAGTGGCCAAATGCATTTATTCTTTTGG
GTATTAATAGTTTAAGGCCTCTCATGTATTTACCAATTAACGTTC

Coding sequence (CDS)

ATGGCTGCCAACAGATTCGCCACCCTCTTACACAGAAACTCCAACAAGATCACCCTTATTTTAGTCTACGCCCTTCTCGAATGGGTTCTCATCTTTCTTCTTCTTCTTCAAGCTCTTTTTTCTTACCTGATTATTAAATTTGCTGAGTTGTTTGGTTTGAAGCGCCCCTGTTTGTGGTGTTCTAGAGTCGACCATGTGTTCGAGCCTGGGAAGAAGTTTTCTTACAGAGATCTTCTTTGTGAACCTCATGCTATGGAGATTTCTAATCTGGGTTACTGTTCGAATCATCGCAAACTTACTGAGGTTCGAGATTTGTGCGAGGATTGCTTCTCCTCTTCTAATCCCAACGAGTTCTATCAGATTCCTAAGAACTTTGCCTTTTTTGGTGATGAGAAGGAGGATTTTAGGTGCTGTTCTTGCTGTGGGGAGAGCTTGAAGAATCGGTTGTTTTCCCCTTGTATTTTGATTAAGCCCAATTGGGGGGATTTGGATTATGCCCACAAGGGGAATTTGATTTCTGAAGCGGACATTGATGTTCAATCTGATGAAATCCATGCTTCCCCCACGGAAGATATCATCGGAAATAGGGAAATCTCCATTGTTTCCGGTGGGGAACAGGCTGAGAAGAACTCCGGTTGCTCTGTTTGTGGTTGTTGCTGTAAAGTTTCGGCGGTTCATGAGGAGGAGAAGGAGGAGGAGGATAAAGCTAAAATGGGTGGTGAAAAGGACGGAGATTTTCTTGAACTGGCTGAAGATCTGAGCTCTCTTAATCACAAAACTGTTCAACTCGGTTGTGTGAGAGAGAATGAATCGGCTGAGACTGCCCCTCATCATCTTGAGTTTTACATTGACCGGGGCAATGATCGGCGGTTGATTCCAGTTGATTTGATCGATTTTTCAGCCTCCGATCACAACAACAACGAAAGCAATATCCTAAGTTCAGTGAAAGATGAGGAACAAGAACAAGAACCAGAACCAGAACCAGAACAAGAACAAGAACAAGAACAAGAACAAGATCAAGAACAACAACAAGAGGATTGTGGGAATGAAGATGTTGTTCTGGATTTTGGTTCCAACTTTGAGAAGCAGGGACAAGATGTGACAGAAGATTGGGAAGTTATTTCAGGAGAGAGATTGGCAGAGTTTCTCTCTGTTTCTCTCCATGAGAGCAAGCAGAAAGTTGCAGAGGTGGAAGCCATGAAGGTGGAGGAGGGTAGCACAAGGGCCTCTGGGTTGGGTTCAGATGAAGATCCATCAATGGAAGTAGAAGAGCAAGAAATAGAAGAACAAGAGCAAGAAATAGAAGAACAAGAGCAACAAATAGAAGAACAACAGCAAGAAATAGAAGAACAAGAGCAAGAAATAGAAGAAGCTGAAGCTTCCATTGGTGAAGCAATTCAAGCTCCAGCCATTGATGATGCTCACGAAGAAGACCTTGCAGAATTGGTGCTAGATTCAGATCTTCATCAAGATATTCACGAGTGGAATGATGAACATGAAGTAGAGATTTCAATCGGGACGGATATTCCCGATCACGAACCGATCGATGAGATTCAAACTCAAAACGACATTCCTTCACATCCCAATGTTCAAGAAGATCCTTCCCCAACTTCAACATTGGTAGTTGATGACAATATGCAAGATTATAACAAAGCTGAGAAATCCGAGGAAGCTGAGGACACTAAGGAAGAGGTAGAGTTCAAGATATTGTCCGTGGAAACGAGTTCTCAACCGTCAGACAATCACAAACCGTCGAGGTCTGAGTTCAATGAGAATGAGGAAGAAGATAAAGTTCCTGATACACCAACTTCAATGGATAGTCTCCACCAGCTACACAAGAAGCTGCTATTACTAGACAGAAAAGAATCTGGAGCCGAAGAGTCGTTGGATGGGAGCGTGATTAGCGAGACTGAAAGCGGGGATGGAGTATTGACGATTGAGAAATTGAAGTCGGCGTTGAGAACCGAACGAAAGGTTTTGAATGCCTTATATTCAGAGCT
AGAAGAAGAGAGAAGTGCTTCTGCCATAGCAGCCAACCAGACAATGGCAATGATAAATAGGCTTCAAGAGGAGAAAGCAAGCATGCAAATGGAAGCTTTACAGTACCAAAGAATGATGGAAGAACAATCTGAATATGACCAGGAAGCTTTACAGCTTTTGAACGAGCTCGTGGTGAAGAGAGAAAAGGAAAAGCAAGAGCTCGAGAAAGGAATCGAAGTTTACCGAAAAAAGCTTCAAGATTATGAAGCCAAAGAGAAAATGGCATTGTTAAGGAGCAGGAAAGAAGGGAGCATCCAAAGTAGAAATTCCTCGGTTTCTTGTAGCAATGCCGATGATAGCGATGGGCTATCGATCGATTTGAACACCGAGGCAAAGAAAGATGAAGATTTGTTTTGTAACCAAGAAACAAACAATCAAAACACCCCAGCTGAGGCAGTTCTTTATTTGGAAGAAACATTAGCAAACTTTGAGGAAGAAAGACTGTCCATTCTAGAGGAGCTGAAGATGTTGGAAGAGAAGCTCTTTACATTGAGTGATGAAGAACAACAATTTGATGACATTGAGCATTACAGTGAACATAATGGCAATGGCTACCATAAGAACTCGGATTCTGTTTCTGAAACAAATGGATTCGAAAACGGTCATCATGTCAAGGAAATGAATGGAAACCATCATCCAGGGAAGAGAACGATGAGCACGAAAGCCAAAAGACTTCTTCCTCTTTTCGACGATGCAGTCGATACAGACGTTGAAGATGTAACCATTGGAGAAGAACAAGGGTTCGACTCTGTTTCAATGCAGAAGTCCTTAGACAACAAATTCGACACGGAGTTCAGGAGGGTTGCTGTTGAGGAAGAAGTGGATCACGTGTATGAGAGATTACAAGCACTTGAAGCAGATAGGGAGTTTCTAAAGCATTGCATTGGCTCCCTAAGAAAAGGAGACAAAGGCTTAGAGCTTCTCCAAGAAATCTTACAGCATCTCCGAGATCTAAGGAGCGTCGATATCCAGTTGAAAAATATGGGAGACGGTGTCGTAGCATGA

Protein sequence

MAANRFATLLHRNSNKITLILVYALLEWVLIFLLLLQALFSYLIIKFAELFGLKRPCLWCSRVDHVFEPGKKFSYRDLLCEPHAMEISNLGYCSNHRKLTEVRDLCEDCFSSSNPNEFYQIPKNFAFFGDEKEDFRCCSCCGESLKNRLFSPCILIKPNWGDLDYAHKGNLISEADIDVQSDEIHASPTEDIIGNREISIVSGGEQAEKNSGCSVCGCCCKVSAVHEEEKEEEDKAKMGGEKDGDFLELAEDLSSLNHKTVQLGCVRENESAETAPHHLEFYIDRGNDRRLIPVDLIDFSASDHNNNESNILSSVKDEEQEQEPEPEPEQEQEQEQEQDQEQQQEDCGNEDVVLDFGSNFEKQGQDVTEDWEVISGERLAEFLSVSLHESKQKVAEVEAMKVEEGSTRASGLGSDEDPSMEVEEQEIEEQEQEIEEQEQQIEEQQQEIEEQEQEIEEAEASIGEAIQAPAIDDAHEEDLAELVLDSDLHQDIHEWNDEHEVEISIGTDIPDHEPIDEIQTQNDIPSHPNVQEDPSPTSTLVVDDNMQDYNKAEKSEEAEDTKEEVEFKILSVETSSQPSDNHKPSRSEFNENEEEDKVPDTPTSMDSLHQLHKKLLLLDRKESGAEESLDGSVISETESGDGVLTIEKLKSALRTERKVLNALYSELEEERSASAIAANQTMAMINRLQEEKASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQELEKGIEVYRKKLQDYEAKEKMALLRSRKEGSIQSRNSSVSCSNADDSDGLSIDLNTEAKKDEDLFCNQETNNQNTPAEAVLYLEETLANFEEERLSILEELKMLEEKLFTLSDEEQQFDDIEHYSEHNGNGYHKNSDSVSETNGFENGHHVKEMNGNHHPGKRTMSTKAKRLLPLFDDAVDTDVEDVTIGEEQGFDSVSMQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEADREFLKHCIGSLRKGDKGLELLQEILQHLRDLRSVDIQLKNMGDGVVA
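
The protein sequence above should be the conceptual translation of the CDS. A minimal way to spot-check this, assuming the standard genetic code applies (the usual case for a nuclear plant gene, but not stated in this record), is to translate the first and last codons of the CDS and compare them with the two ends of the protein. The function name `translate` and the two short fragments are chosen here for illustration; the fragments are copied from the start and end of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_at_stop=True):
    """Translate a DNA coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCTGCCAACAGATTCGCCACCCTCTTACACAGAAACTCCAACAAG"  # first 48 nt of the CDS
cds_end = "ATGGGAGACGGTGTCGTAGCATGA"  # last 24 nt of the CDS, including the stop codon

print(translate(cds_start))  # MAANRFATLLHRNSNK
print(translate(cds_end))    # MGDGVVA
```

Both results match the corresponding ends of the protein sequence (`MAANRFATLLHRNSNK...` and `...MGDGVVA`). Passing the full CDS to the same function would check the interior as well.
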
BLAST of Cp4.1LG01g18370 vs. Swiss-Prot
Match: MYOB2_ARATH (Myosin-binding protein 2 OS=Arabidopsis thaliana GN=MYOB2 PE=1 SV=1)

HSP 1 Score: 276.2 bits (705), Expect = 1.5e-72
Identity = 260/678 (38.35%), Positives = 364/678 (53.69%), Query Frame = 1

Query: 358  SNFEKQGQDVTEDWEVISGERLAEFLSVSLHESKQKVAEVEAMKVEEGSTRASGLGSDED 417
            S  E + + V +  E +  + + E  S  +     K  E+   K EE            D
Sbjct: 168  SQEETEEKKVPQSHEKLEDDDVDEEFSCYVSSFDCKNKEIATEKEEENRV---------D 227

Query: 418  PSMEVEEQEIEEQEQEIEEQEQQIEEQQQEIEEQEQEIEEAEASIGEAIQAPAIDD---- 477
              +EVE  E   +  E    E+       E  +  +E+ E     G+ I    ++     
Sbjct: 228  LPIEVETAESAPKNLEFYIDEEDCHLIPVEFYKPSEEVREISDINGDFILDFGVEHDFTA 287

Query: 478  -AHEEDLAELVLDSDLHQDIHEWN----------DEHEVEISIGTDIPDHEPIDEIQTQN 537
             A  E++++     +   +  E N          +E + E+SIGT+IPDHE I +I +  
Sbjct: 288  AAETEEISDFASPGESKPEDAETNLVASEMENDDEETDAEVSIGTEIPDHEQIGDIPSHQ 347

Query: 538  DIPSHPNVQEDPSPTSTLVVDDNMQDYNKAEKSEEAEDTKEEVEFKILSVETSSQPSDNH 597
             IP H               DD+ ++              E +EFK +++ET        
Sbjct: 348  LIPHHD--------------DDDHEE--------------ETLEFKTVTIETKMPVL--- 407

Query: 598  KPSRSEFNENEEEDKVPDTPTSMDSLHQ-LHKKLLLLDRKESGAEESLDGSVISETESGD 657
                     N  E+++ +   SM+S H  LH  +  L+++ S     +DG      E  +
Sbjct: 408  ---------NINEERILEAQGSMESSHSSLHNAMFHLEQRVS-----VDG-----IECPE 467

Query: 658  GVLTIEKLKSALRTERKVLNALYSELEEERSASAIAANQTMAMINRLQEEKASMQMEALQ 717
            GVLT++KLK  L+ ERK L+ALY ELE ER+ASA+AA++TMAMINRL EEKA+MQMEALQ
Sbjct: 468  GVLTVDKLKFELQEERKALHALYEELEVERNASAVAASETMAMINRLHEEKAAMQMEALQ 527

Query: 718  YQRMMEEQSEYDQEALQLLNELVVKREKEKQELEKGIEVYRKKLQDYEAKEKMALLRSR- 777
            YQRMMEEQ+E+DQEALQLLNEL+V REKE  ELEK +EVYRK++++YEAKEKM +LR R 
Sbjct: 528  YQRMMEEQAEFDQEALQLLNELMVNREKENAELEKELEVYRKRMEEYEAKEKMGMLRRRL 587

Query: 778  KEGSIQS-RNSSVSCSNADDSDGLSIDLNTEAKKDEDLFCNQETNNQNTPAEAVLYLEET 837
            ++ S+ S RN+  S  N   S+G     N E   D   +  +E   +NTP + VL L+E 
Sbjct: 588  RDSSVDSYRNNGDSDEN---SNGELQFKNVEGVTD---WKYRENEMENTPVDVVLRLDEC 647

Query: 838  LANFEEERLSILEELKMLEEKLFTLSDEEQQFDDIEHYSEHNGNGYHKNSDSVSETNGFE 897
            L +++ ERLSIL  LK LEEKL  L++EE   ++ + +                E+NG  
Sbjct: 648  LDDYDGERLSILGRLKFLEEKLTDLNNEEDDEEEAKTF----------------ESNGSI 707

Query: 898  NGH---HVKEMNGNHHPGKRTMSTKAKRLLPLFDDAVDTDVEDVTIG---EEQGFDSVSM 957
            NG+   H KE NG H         K+KRLLPLFD AVD ++E+        E GFD    
Sbjct: 708  NGNEHIHGKETNGKHRV------IKSKRLLPLFD-AVDGEMENGLSNGNHHENGFDD--- 746

Query: 958  QKSLDNKFDTEFRRVAVEEEVDHVYERLQALEADREFLKHCIGSLRKGDKGLELLQEILQ 1011
                      +   V +EEEVD +YERL+ALEADREFL+HC+GSL+KGDKG+ LL EILQ
Sbjct: 768  --------SEKGENVTIEEEVDELYERLEALEADREFLRHCVGSLKKGDKGVHLLHEILQ 746
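
The percentages on each HSP summary line follow directly from the counts: identity is identical positions divided by alignment length, and positives is identical plus conservatively substituted positions divided by the same length. The sketch below parses such a line and recomputes both values. The name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern is written only for the line layout used in this record (it also tolerates the "Postives" spelling that some renderings of the report use).

```python
import re

# Matches e.g. "Identity = 260/678 (38.35%), Positives = 364/678 (53.69%)".
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Extract the counts from an HSP summary line and recompute percentages."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identities, length, positives, pos_length = map(int, m.groups())
    if length != pos_length:
        raise ValueError("identity and positives use different alignment lengths")
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        "pct_identity": round(100 * identities / length, 2),
        "pct_positives": round(100 * positives / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 260/678 (38.35%), Positives = 364/678 (53.69%), Query Frame = 1"
)
print(stats["pct_identity"], stats["pct_positives"])  # 38.35 53.69
```

For the MYOB2_ARATH hit this reproduces the reported 38.35% identity and 53.69% positives over a 678-column alignment.
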

BLAST of Cp4.1LG01g18370 vs. Swiss-Prot
Match: MYOB3_ARATH (Myosin-binding protein 3 OS=Arabidopsis thaliana GN=MYOB3 PE=1 SV=1)

HSP 1 Score: 253.4 bits (646), Expect = 1.0e-65
Identity = 195/468 (41.67%), Positives = 278/468 (59.40%), Query Frame = 1

Query: 545  NMQDYNKAEKSEEAEDTKEEVEFKILSVETSSQPSDNHKPSRSEFNENEEEDKV----PD 604
            N+  Y + + S   E+ +EE     L  +     S N   S+ E  E + E+      P+
Sbjct: 254  NVATYGEDQISGRVEEKEEETGVADLLYDQFE--SKNFTGSQIEEEEEDREETTKELDPE 313

Query: 605  TPTSMDSLHQLHKKLLLLDRKE-SGAEESLDGSV-ISETESGDGVLTIEKLKSALRTERK 664
            TPTS+ +L   +KKL  L R E + AE++ DG+V +SE + GD + TIE+L+  +R E++
Sbjct: 314  TPTSVSTL--FNKKLHFLARNEYAAAEDAGDGNVLVSEMDGGDPLRTIERLRETVRAEQE 373

Query: 665  VLNALYSELEEERSASAIAANQTMAMINRLQEEKASMQMEALQYQRMMEEQSEYDQEALQ 724
             L  LY+ELEEERSASAI+ANQTMAMI RLQEEKA +QMEALQYQRMMEEQ+EYDQEALQ
Sbjct: 374  ALRDLYAELEEERSASAISANQTMAMITRLQEEKAKVQMEALQYQRMMEEQAEYDQEALQ 433

Query: 725  LLNELVVKREKEKQELEKGIEVYRKKLQDYEAKEKMALLRSRKEGSIQSRNSSVSCSNAD 784
            LLN L+VKREKEK++L++ +EVYR K+ +YE+K K  ++    +              AD
Sbjct: 434  LLNHLMVKREKEKEQLQRELEVYRAKVLEYESKAKNKIIVVEND------------CEAD 493

Query: 785  DSDGLSIDLNTEAKKDEDLFCNQETNNQNTPAEAVLY---LEETLANFEEERLSILEELK 844
            D D        E  ++ED     + + +    + V +   L E+L+ FEEERL IL++LK
Sbjct: 494  DDD------KEEENREEDNSSEMDVDLEKITLDCVQHMSMLGESLSEFEEERLVILDQLK 553

Query: 845  MLEEKLFTLSDEEQQFDDIEHYSEHNGNGYHKNSDSVSETNGFENGHHVKEMNGNHHPGK 904
            +LE++L T+ D+E   D  E       N Y + S          NGH           G 
Sbjct: 554  VLEDRLVTMQDKESAEDPGEF-----SNSYEEAS----------NGH-----------GG 613

Query: 905  RTMSTKAKRLLPLFDDAVDTDVEDVTIGEEQGFDSVSMQKSLDNKFDTEFRRVAVEEEVD 964
             TM++ AK LLPL  DA + + ED          S  + +S +  F ++  ++ + ++VD
Sbjct: 614  LTMASMAKSLLPLL-DAAENESED---------GSQGLPESDEKNFGSDSEKLEIIKQVD 663

Query: 965  HVYERLQALEADREFLKHCIGSLRKGDKGLELLQEILQHLRDLRSVDI 1004
             VYERLQ LE D EFLK+C+ S +KGDKG ++L++ILQHLRDLR++++
Sbjct: 674  SVYERLQELETDGEFLKNCMSSAKKGDKGTDILKDILQHLRDLRTIEL 663

BLAST of Cp4.1LG01g18370 vs. Swiss-Prot
Match: MYOB6_ARATH (Probable myosin-binding protein 6 OS=Arabidopsis thaliana GN=MYOB6 PE=2 SV=1)

HSP 1 Score: 110.9 bits (276), Expect = 8.1e-23
Identity = 103/281 (36.65%), Positives = 160/281 (56.94%), Query Frame = 1

Query: 578 PSDNHKPSRSEFNENEEEDKVPDTPTSMDSLHQLHK--KLLLLDRKESGAE---ESLDGS 637
           P+ + + S ++ +ENE E K  D   +   +   +K   + L D  ++       SL  S
Sbjct: 223 PAPSPRVSHNKLSENESEFKDMDVDRTPSFVRGGNKFFGIPLSDSAQNSPRWSVRSLKKS 282

Query: 638 VISETESGD------GVLTIEKLKSALRTERKVLNALYSELEEERSASAIAANQTMAMIN 697
           V+++TE+        G   + +LK  +R ++K L  LY EL+EERSASA+AAN+ MAMI 
Sbjct: 283 VLNKTENASDTTDPTGESILNQLKKEVRLDKKSLIDLYMELDEERSASAVAANEAMAMIT 342

Query: 698 RLQEEKASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQELEKGIEVYRKKLQ 757
           RLQ EKA++QMEALQYQRMM+EQ+EYDQEALQ ++  + KRE+E +ELE   EVYR+K  
Sbjct: 343 RLQAEKAAVQMEALQYQRMMDEQAEYDQEALQSMSSELAKREEEMKELEAEFEVYREKYG 402

Query: 758 DYEAKEKMALLRSRKEGSIQSRNSSV--SCSNADDSDGLSIDLNTEAKKDEDLFCN-QET 817
               +E      +R+E   Q+ N+S    C        L++  + + +  E++  N Q  
Sbjct: 403 CLTDQED-----AREEFHKQNGNASAYDDCQETKPVSDLAVSSSNQQENGENIDQNGQSK 462

Query: 818 NNQNTPAEAVLYLEETLANFEEERLSILEELKMLEEKLFTL 845
            ++ + AE V+  +E   +  E +  I++EL  + E+L TL
Sbjct: 463 RSEESTAENVVSADEEKGS--ESKEGIVKELSEITERLSTL 496

BLAST of Cp4.1LG01g18370 vs. Swiss-Prot
Match: MYOB5_ARATH (Probable myosin-binding protein 5 OS=Arabidopsis thaliana GN=MYOB5 PE=2 SV=1)

HSP 1 Score: 106.3 bits (264), Expect = 2.0e-21
Identity = 93/231 (40.26%), Positives = 130/231 (56.28%), Query Frame = 1

Query: 636 ETESGDGVLTIEKLKSALRTERKVLNALYSELEEERSASAIAANQTMAMINRLQEEKASM 695
           E+E  DG   ++ L   +R +RK L  LY EL+EERSASA+AAN  MAMI RLQ EKA++
Sbjct: 291 ESEVLDGDSILQHLNRQVRLDRKSLMDLYMELDEERSASAVAANNAMAMITRLQAEKAAV 350

Query: 696 QMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQELEKGIEVYRKKLQDYEAKEKMA 755
           QMEALQYQRMM+EQ+EYDQEALQ +N L+VKRE+E +ELE GIEVYR          +  
Sbjct: 351 QMEALQYQRMMDEQAEYDQEALQSMNGLLVKREEEMKELEAGIEVYRL---------RYG 410

Query: 756 LLRSRK---EGSIQSRNSSVSCSNADDSDGLSIDLNT-EAKKDEDLFCNQETNNQNTPAE 815
           LLR  +   E  +      VS            DL    +  +EDL   +++   +    
Sbjct: 411 LLREERGEAEEFLDEETKPVS------------DLPVCSSNHEEDLEQMKDSAEDSIGNN 470

Query: 816 AVLYLEETLANFEEERLSILEELKMLEEKLFTLSDEE---QQFDDIEHYSE 860
            V+ +EE   N   + + +++E+  + E+L  +  +    QQ  D+   SE
Sbjct: 471 GVMIIEEEKENGSRKDM-LVKEISEITERLNAIESKGELLQQISDVLDVSE 499

BLAST of Cp4.1LG01g18370 vs. Swiss-Prot
Match: MYOB1_ARATH (Myosin-binding protein 1 OS=Arabidopsis thaliana GN=MYOB1 PE=1 SV=1)

HSP 1 Score: 88.6 bits (218), Expect = 4.3e-16
Identity = 55/158 (34.81%), Positives = 86/158 (54.43%), Query Frame = 1

Query: 14  SNKITLILVYALLEWVLIFLLLLQALFSYLIIKFAELFGLKRPCLWCSRVDHVFEPGK-- 73
           S   T  L  A  EW+L+F+L + ++FSY+I +FA+   L+ PCL CS +DH+    K  
Sbjct: 3   SRSFTRALALAFNEWLLMFMLFVNSIFSYVIARFADYSELQSPCLMCSNLDHILRRTKDL 62

Query: 74  KFSYRDLLCEPHAMEISNLGYCSNHRKLTEVRDLCEDC---FSSSNPN--EFYQI----- 133
           K ++ D++C  H  EIS+L YC  H KL +VR +CE C   F+++N +  E Y++     
Sbjct: 63  KKTHWDIICSKHKSEISSLVYCHAHGKLVDVRGMCETCLFSFATTNKSNAETYRLLVGKL 122

Query: 134 --PKNFAFFGDEKEDFRC-----CSCCGESLKNRLFSP 153
               +F    D  +   C     C+CC     N+L++P
Sbjct: 123 GEDSHFGSKSDRSKYPNCSKLTDCTCC-----NQLWTP 155

BLAST of Cp4.1LG01g18370 vs. TrEMBL
Match: A0A0A0KRI5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G568820 PE=4 SV=1)

HSP 1 Score: 1281.2 bits (3314), Expect = 0.0e+00
Identity = 744/1043 (71.33%), Positives = 825/1043 (79.10%), Query Frame = 1

Query: 1    MAANRFATLLHRNSNKITLILVYALLEWVLIFLLLLQALFSYLIIKFAELFGLKRPCLWC 60
            MAAN+FAT+LHRNSNKITLILVYALLEWVLIFLLLL  LFSYLI+KFAE FGLKRPCLWC
Sbjct: 1    MAANKFATILHRNSNKITLILVYALLEWVLIFLLLLHGLFSYLIVKFAEWFGLKRPCLWC 60

Query: 61   SRVDHVFEPGKKFSYRDLLCEPHAMEISNLGYCSNHRKLTEVRDLCEDCFSSSNPNEFYQ 120
            SRVDHVFEP +K SYRDLLCE HAMEISNLGYCSNHRKL+E RDLCEDC SSS  NEFYQ
Sbjct: 61   SRVDHVFEPQRKQSYRDLLCEGHAMEISNLGYCSNHRKLSEFRDLCEDCSSSSKSNEFYQ 120

Query: 121  IPKNFAFFGDEKEDFRCCSCCGESLKNRLFSPCILIKPNWGDLDYAHKGNLISEADIDVQ 180
            I K+F FF DEKEDFR CSCCGE+LK RLFSPCILIKPNWGDLDY  KGNLISE +    
Sbjct: 121  ISKSFPFFDDEKEDFRTCSCCGETLKGRLFSPCILIKPNWGDLDYTQKGNLISETE---- 180

Query: 181  SDEIHASPTEDIIGNREISIVSGGEQAEKNSGCSVCGCCCKVSAVHEEEKEEEDKAKMGG 240
            +DEIH S +ED+ GNR ISIVSGGE+ EKNS CSVCGC CK SAVHE++  ++D+A +  
Sbjct: 181  TDEIHVSQSEDVSGNRGISIVSGGEEGEKNSTCSVCGCGCKDSAVHEDD--DDDRADISA 240

Query: 241  EKDGDFLELAEDLSSLNHKTVQLGCVRENESAETAPHHLEFYIDRGNDRRLIPVDLIDFS 300
            +KDG FLELAEDL+  N +TV++GC +E+E  ET P+HLEFYIDRG+DRRLIPVDLIDFS
Sbjct: 241  QKDGGFLELAEDLTICNQETVEVGCEKEDELPETVPNHLEFYIDRGDDRRLIPVDLIDFS 300

Query: 301  ASDHNNNESNILSSVKDEEQEQEPEPEPEQEQEQEQEQDQEQQQEDCGNEDVVLDFGSNF 360
            A D +N+ SNILS VKDEEQEQE                      DCGNEDVVLDF SNF
Sbjct: 301  APDDDNSTSNILSQVKDEEQEQE----------------------DCGNEDVVLDFASNF 360

Query: 361  EKQGQDVTEDWEVISGERLAEFLSVSLHESKQKVAEVEAMKVEEGSTRASGLGSDEDPSM 420
            E +   V+E WEVISGERLAEFLS SLHE+KQ+V EVEAM VEE      G+G +E    
Sbjct: 361  ENRRHGVSEAWEVISGERLAEFLSASLHENKQRVEEVEAMDVEEDPL--VGVGKEE---- 420

Query: 421  EVEEQEIEEQEQEIEEQEQQ--IEEQQQEIEEQEQEIEEAEASIGEAIQAPAIDDAH--- 480
            E EE+E EE +  I+E  Q    +  ++E+EE      + ++ + E        D H   
Sbjct: 421  EKEEEEEEEADASIDESSQAPASDAHKEELEELVVATRQPDSDLHE--------DFHMWS 480

Query: 481  EEDLAELVLDSDLHQDIHEWNDEHEVEISIGTDIPDHEPIDEIQTQN-----DIPSHPNV 540
            +E   E+ + +D+    HE  DE + +I    D+P H  + E  + +     D    PN+
Sbjct: 481  DELEVEISIGTDIPD--HEPIDEIQTQI----DLPPHPDLQEDPSPSSSLDVDNMQDPNI 540

Query: 541  QEDPSPTSTLVVDDNMQ-----------DYNKAEKSEEAEDTKEEV-------EFKILSV 600
             E+      ++ ++  +           D +K   SE  ED +E+        EFKILSV
Sbjct: 541  VEEVEEAEEVMEEEKFKIFSMETSSQPSDNHKPSSSEVNEDEEEDKVPGTEVEEFKILSV 600

Query: 601  ETSSQPSDNHKPSRSEFNENEEEDKVPDTPTSMDSLHQLHKKLLLLDRKESGAEESLDGS 660
            ETSS PSDNHK S SE NENEEEDKVPDTPTSMDSLHQLHKKLLLLDRKESG EESLDGS
Sbjct: 601  ETSSHPSDNHKSSSSEVNENEEEDKVPDTPTSMDSLHQLHKKLLLLDRKESGTEESLDGS 660

Query: 661  VISETESGDGVLTIEKLKSALRTERKVLNALYSELEEERSASAIAANQTMAMINRLQEEK 720
            VISETE GDGVLT+EKLKSALRTERK LNALY+ELEEERSASAIAANQTMAMINRLQEEK
Sbjct: 661  VISETEGGDGVLTLEKLKSALRTERKALNALYAELEEERSASAIAANQTMAMINRLQEEK 720

Query: 721  ASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQELEKGIEVYRKKLQDYEAKE 780
            ASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQELEK IE+YRKKLQDYEAKE
Sbjct: 721  ASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQELEKEIEIYRKKLQDYEAKE 780

Query: 781  KMALLRSRKEGSIQSRNSSVSCSNADDSDGLSIDLNTEAKKDEDLFCNQETNNQNTPAEA 840
            K+ALLR RKEGSI+SRNSSVSCSNADDSDGLSIDLNTEAKKDEDLF NQET NQNTPAEA
Sbjct: 781  KIALLRIRKEGSIRSRNSSVSCSNADDSDGLSIDLNTEAKKDEDLFSNQETENQNTPAEA 840

Query: 841  VLYLEETLANFEEERLSILEELKMLEEKLFTLSDEEQQFDDIEHYSEHNGNGYHKNSDSV 900
            VLYLEETLANFEEERLSILEELKMLEEKLFTLSDEEQQF+DI+HY E NGNGY KNSD  
Sbjct: 841  VLYLEETLANFEEERLSILEELKMLEEKLFTLSDEEQQFEDIDHYCERNGNGYDKNSDYS 900

Query: 901  SETNGFENGHHVKEMNGNHHPGKRTMSTKAKRLLPLFDDAVDTD-VEDVTIGEEQGFDSV 960
              TNGFENGH+ KEMNG H+P +R MSTKAKRLLPLFDD VD D VEDVT GEEQGFDS+
Sbjct: 901  PGTNGFENGHNAKEMNGKHYPERRAMSTKAKRLLPLFDDVVDADVVEDVTNGEEQGFDSI 960

Query: 961  SMQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEADREFLKHCIGSLRKGDKGLELLQEI 1015
            S+QKSLDNKFDTEFRRVAVEEEVDHVYERLQALEADREFLKHCIGSLRKGDKGLELLQEI
Sbjct: 961  SIQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEADREFLKHCIGSLRKGDKGLELLQEI 995

BLAST of Cp4.1LG01g18370 vs. TrEMBL
Match: B9SP67_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0629030 PE=4 SV=1)

HSP 1 Score: 795.8 bits (2054), Expect = 6.1e-227
Identity = 529/1066 (49.62%), Positives = 697/1066 (65.38%), Query Frame = 1

Query: 1    MAANRFATLLHRNSNKITLILVYALLEWVLIFLLLLQALFSYLIIKFAELFGLKRPCLWC 60
            MAAN+FAT+LH+N+NK+TLILVYA+LEWVLI LLLL +LFSYLIIKFA+ FGLKRPCLWC
Sbjct: 1    MAANKFATMLHKNTNKLTLILVYAMLEWVLIILLLLNSLFSYLIIKFADYFGLKRPCLWC 60

Query: 61   SRVDHVFEPGK-KFSYRDLLCEPHAMEISNLGYCSNHRKLTEVRDLCEDCFSSSNPNEFY 120
            SR+DH FEP K + SYR L+CE HA+EIS L YCS+HRKLTE +D+CEDC SSS+P    
Sbjct: 61   SRLDHFFEPSKFQNSYRSLICETHALEISKLSYCSSHRKLTESQDMCEDCLSSSSPQS-- 120

Query: 121  QIPKNFAFF--------------GDEK----EDFRCCSCCGESLKNRLFSPC-ILIKPNW 180
            ++ K FAFF              GD+     E    CSCCG SL+ +LF P    IKP+W
Sbjct: 121  ELSKKFAFFPWIKKLGVLQDCCAGDKVCENVEIISNCSCCGVSLETKLFCPDDYAIKPSW 180

Query: 181  GDLDYAHKGNLISEADIDVQSDEIH---------ASPTEDIIGNREISIVSGGEQAEKNS 240
            GD +   KG+L+ E +IDV+                  + I+ N  +  +   E+ E+N 
Sbjct: 181  GDSENTQKGDLVWEEEIDVKDHSDRNMSGFVCDRCGEEQRIVENTGVEDIKTEEKTEENF 240

Query: 241  GCSVCGCCCKVSAVHEEEKEEEDKAK-MGGEKDGDFLELAEDLSSLNHKTVQLGCVRENE 300
             C V    CK   V++ +KE+    K     K+ DF    ++ S      VQ  C+++  
Sbjct: 241  SCFVSSVDCKEMVVNDSDKEDISTEKEQESTKEDDFNVSVDEPSCDQAVMVQADCIKDM- 300

Query: 301  SAETAPHHLEFYIDRGNDRRLIPVDLIDFSASDHNNNESNILSSVKDEEQEQEPEPEPEQ 360
            S +  P HLEFYID+ +D  LIP++L++ S+                             
Sbjct: 301  SKDIQPQHLEFYIDQ-DDCHLIPIELLNSSS----------------------------- 360

Query: 361  EQEQEQEQDQEQQQEDCGNEDVVLDFGS-NFEKQGQDVTEDWEVISGERLAEFLSVSLHE 420
             ++Q  ++ ++ + E+CG+ED VL+F + +   Q + V ED    + E     L +   E
Sbjct: 361  -EKQISDKKEKGEVENCGSEDFVLEFDNKHVGPQYELVVEDR--CNFEEKLPLLPIQECE 420

Query: 421  SKQKVAEVEAMKVEEGSTRASGLGSDEDPSMEVEEQEIEE---------------QEQEI 480
             +  V E+E   + E     +     +   ME E +++                 +  +I
Sbjct: 421  EENMVDELEPRDLNENENENASAVYADYELMEEESEQVSIAQPIGTITSNGDDVLENSQI 480

Query: 481  EEQEQQIEEQQQEIEEQEQEIEEAEASIGEAIQAPAIDDAHEEDLAELVLDSDLHQDIHE 540
             ++  +++  Q   E  + ++ E EA +    + P  +   E    EL   S   + +  
Sbjct: 481  SDEGMELDNNQVSEEVLQMQVNEIEADVSMGTEIPDHEPIQEIQTPEL--HSLCVEVLQM 540

Query: 541  WNDEHEVEISIGTDIPDHEPIDEIQTQNDIPSHPNVQEDPSPTS--TLVVDDNMQDYNKA 600
              DE E  +SIG +IPDHEPI+EIQT++   S   V+EDPS ++     +DD+   YN+A
Sbjct: 541  QVDEIEAYVSIGAEIPDHEPIEEIQTESFPSSCLCVEEDPSTSNGDNHALDDH--GYNQA 600

Query: 601  EKSEEAEDTKEEVEFKILSVETSSQPSDNHKPSRSEFNENEEEDKVPDTPTSMDSLHQLH 660
            E+        +EVEF+ +++ETS     +H     E N+ EE DK PDTPTS+DSLH LH
Sbjct: 601  EE--------DEVEFRAMTIETSEPVIKSHLSLCLESNDIEE-DKTPDTPTSVDSLHHLH 660

Query: 661  KKLLLLDRKESGAEESLDGSVISETESGDGVLTIEKLKSALRTERKVLNALYSELEEERS 720
            KKLLLL+R+ES AEESLDGSVIS+ E+GDGVLT+EKLKSALR+ERK LNALY+ELEEERS
Sbjct: 661  KKLLLLERRESNAEESLDGSVISDIEAGDGVLTVEKLKSALRSERKALNALYAELEEERS 720

Query: 721  ASAIAANQTMAMINRLQEEKASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQ 780
            ASA+AANQTMAMINRLQEEKA+MQMEALQYQRMMEEQSEYDQEALQLLNEL++KREKE+ 
Sbjct: 721  ASAVAANQTMAMINRLQEEKAAMQMEALQYQRMMEEQSEYDQEALQLLNELMIKREKERT 780

Query: 781  ELEKGIEVYRKKLQDYEAKEKMALLRSRKEGSIQSRNSSVSCSNADDSDGLSIDLNTEAK 840
            ELEK +E+YRKK+QDYE KEK+ +LR RKE SI+S  SS S SNA+DSDGLS+DLN E K
Sbjct: 781  ELEKELELYRKKVQDYETKEKLMMLRRRKESSIRSGTSSASYSNAEDSDGLSVDLNHEVK 840

Query: 841  KDEDLFCNQETNNQNTPAEAVLYLEETLANFEEERLSILEELKMLEEKLFTLSDE-EQQF 900
            ++     + E++NQNTP +AV+YLEE+L NFEEERLSILE+LK+LEEKLFTLSDE E  F
Sbjct: 841  EEVGFDNHLESSNQNTPVDAVVYLEESLNNFEEERLSILEQLKVLEEKLFTLSDEDEHHF 900

Query: 901  DD---IEHYSEHNGNGYHKNSDSVSETNGFENGHHVKEMNGNHHPGKRTMSTKAKRLLPL 960
            +D   IEH  E NGNGY+++ D  SE NG  NGH+ KEMNG H+  ++ +  KAKRLLPL
Sbjct: 901  EDIKPIEHLYEENGNGYNEDFDHSSEANGVANGHY-KEMNGKHYQERKIIGAKAKRLLPL 960

Query: 961  FDDAVDTDVEDVTI-GEEQGFDSVSMQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEAD 1014
            F DA+D++ ED  + G E+G DS+ + KS+ NKFD + +++A+EEEVDHVYERLQALEAD
Sbjct: 961  F-DAIDSEAEDGMLNGHEEGVDSIVLLKSI-NKFDIDSKKLAIEEEVDHVYERLQALEAD 1014

BLAST of Cp4.1LG01g18370 vs. TrEMBL
Match: A0A061E7U6_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_010975 PE=4 SV=1)

HSP 1 Score: 795.0 bits (2052), Expect = 1.0e-226
Identity = 547/1062 (51.51%), Positives = 690/1062 (64.97%), Query Frame = 1

Query: 1    MAANRFATLLHRNSNKITLILVYALLEWVLIFLLLLQALFSYLIIKFAELFGLKRPCLWC 60
            MAAN+FAT+LH+N+N+ITLILVY LLEW+LI LLLL +LFSYLIIKFA+ FGLKRPCLWC
Sbjct: 1    MAANKFATMLHKNTNRITLILVYTLLEWILIILLLLNSLFSYLIIKFADYFGLKRPCLWC 60

Query: 61   SRVDHVFEPGK-KFSYRDLLCEPHAMEISNLGYCSNHRKLTEVRDLCEDCFSSSNPNEFY 120
            +R+DH+FEP K   S RDL+C+ HA EIS LGYCSNHRKL E +D+CEDC SSS  ++F 
Sbjct: 61   TRLDHIFEPSKYNNSCRDLVCDDHANEISKLGYCSNHRKLAESQDMCEDCLSSS-WSDFS 120

Query: 121  QIPKNFAFF------------GDE-----KEDFRCCSCCGESLKNRLFSPCILIKPNWGD 180
             + K  AFF            GD+      E+F+ CSCCG  L+ +   P +LIKP+W  
Sbjct: 121  DLSKKLAFFPWMKQVGLIQDGGDKVIENGDENFK-CSCCGVMLEKKWNFPYLLIKPSWEV 180

Query: 181  LDYAHKGNLISEA-DIDVQSDEIHASP----------TEDIIG---NREISIVSGGE--- 240
            LDY  KGNLI+EA  +D  +DE +AS            ED  G   N  I I+S G+   
Sbjct: 181  LDYTQKGNLITEAGGVDGIADEGNASDGIRSDFVANYQEDEQGVEENNRIEIISVGDDEA 240

Query: 241  ------QAEKNSGCSVCGCCCKVSAVHEEEKEEE--DKAKMGGEKDGDFLELAEDLSSLN 300
                  + E++  C +    C   A +E++K +   +K ++  E++G+      ++S   
Sbjct: 241  DKGREMEKEEDFSCFISSFDCNQMAANEDDKHDVVIEKDQIPMEEEGNL-----NVSMDG 300

Query: 301  HKTVQLGCVRENESAETAPHHLEFYIDRGNDRRLIPVDLIDFSASDHNNNESNILSSVKD 360
                Q+ C +E ES E  P HLEFYI+ G+D  LIPV+LID +A      ES  +   ++
Sbjct: 301  KVVTQVACSKE-ESPEFLPKHLEFYIE-GDDCHLIPVELIDSTAV-----ESGRIYKFRE 360

Query: 361  EEQEQEPEPEPEQEQEQEQEQDQEQQQEDCGNEDVVLDFGSNFEKQGQDVTEDWEVISGE 420
            E+Q                            N DV+LDF        + V E+ +  SGE
Sbjct: 361  EDQGIS------------------------DNGDVILDFDLRPGTPVELVVEN-KCSSGE 420

Query: 421  RLAEFLSVSLHESKQKVAEVEAMKVEEGSTRASGLGSDEDPSMEVEEQEIEEQEQEIEEQ 480
            ++   LS    E +  VA VE+                      VE  E +E   E   +
Sbjct: 421  KVT-LLSAQESEDESSVAVVES----------------------VESNEKKESFSEHAGE 480

Query: 481  EQQIEEQQQEIEEQEQEIEEAEASIGEAIQAPAIDDAH-EEDLAELVLDSDLHQDIHEWN 540
            E  +EE+    +EQ    +  +  + EA      DDA     + E   D D +Q   E N
Sbjct: 481  EDLMEEE----DEQVATTQATQTPLNEA------DDAQGSAAIREGETDVDGNQVSDEQN 540

Query: 541  DEHEVEISIGTDIPDHEPIDEIQTQNDIPSHPNVQEDPSPTSTLVVDDNMQDYNKAEKSE 600
            DE E EISIGTDIPDHEPI++IQ Q+    +   QEDPS +S  +  D+      AE   
Sbjct: 541  DEIEAEISIGTDIPDHEPIEDIQMQH---LYECTQEDPSSSSAQLHADDDHGSKNAE--- 600

Query: 601  EAEDTKEEVEFKILSVETSSQPSDNHKPSRSEFNENEEEDKVPDTPTSMDSLHQLHKKLL 660
                 +E ++FK ++VET  Q   NH    SE NE  EEDKVPDTPTS+DSLH LHKKLL
Sbjct: 601  -----EETIQFKTITVETCDQAIKNHLSLSSELNE-VEEDKVPDTPTSIDSLHLLHKKLL 660

Query: 661  LLDRKESGAEESLDGSVISETESGDGVLTIEKLKSALRTERKVLNALYSELEEERSASAI 720
            LLDRKESG E+SLDGSV S+ E  DGVLT+EKLKSAL+ ERK LNALY+ELEEERSASA+
Sbjct: 661  LLDRKESGTEDSLDGSVFSDIEVADGVLTVEKLKSALKAERKALNALYTELEEERSASAV 720

Query: 721  AANQTMAMINRLQEEKASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQELEK 780
            AANQTMAMINRLQEEKA+MQMEALQYQRMMEEQSEYDQEALQLLNEL+VKREKEK ELEK
Sbjct: 721  AANQTMAMINRLQEEKAAMQMEALQYQRMMEEQSEYDQEALQLLNELMVKREKEKAELEK 780

Query: 781  GIEVYRKKLQDYEAKEKMALLRSRKEGSIQSRNSSVSCSNADDSDGLSIDLNTEAKKDED 840
             +EVYR+K+QDYEA+EKM +LR RKE S +S  +S SCSNA+DSDGLS+DLN E K+++ 
Sbjct: 781  ELEVYRRKVQDYEAREKMIMLRRRKEDSTRSA-TSASCSNAEDSDGLSVDLNHEPKEEDS 840

Query: 841  LFCNQETNNQNTPAEAVLYLEETLANFEEERLSILEELKMLEEKLFTLSDEEQQ-FDDI- 900
               +QE +NQNTPA+AVLYLEE+LANFEEERLSILE+LK+LEEKL +L+DEE+Q F+DI 
Sbjct: 841  FDNHQEDSNQNTPADAVLYLEESLANFEEERLSILEQLKVLEEKLVSLNDEEEQHFEDIK 900

Query: 901  --EHYSEHNGNGYHKNSDSVSETNGFENGHHVKEMNGNHHPGKRTMSTKAKRLLPLFDDA 960
              E+  E NGNG+H++SD   ETNG  NG H   +NG HH  K+ M+ KAKRLLPLF DA
Sbjct: 901  SVEYLYEENGNGFHESSDFSYETNGVANG-HFNGVNGKHHQEKKLMAAKAKRLLPLF-DA 960

Query: 961  VDTDVED-VTIGEEQGFDSVSMQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEADREFL 1014
             D ++ED +  G E GFDSV +Q       + E +++A+EEEVDHVYERLQALEADREFL
Sbjct: 961  TDAEIEDGILNGHENGFDSVVLQHFSPPNSELESKKLAIEEEVDHVYERLQALEADREFL 975

BLAST of Cp4.1LG01g18370 vs. TrEMBL
Match: M5Y1T3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000840mg PE=4 SV=1)

HSP 1 Score: 776.5 bits (2004), Expect = 3.8e-221
Identity = 536/1064 (50.38%), Positives = 685/1064 (64.38%), Query Frame = 1

Query: 1    MAANRFATLLHRNSNKITLILVYALLEWVLIFLLLLQALFSYLIIKFAELFGLKRPCLWC 60
            MAAN+FAT+LHRN+NKITLILVY LLEW+LI LLLL +LFS+LIIKFA+ FGLK PCLWC
Sbjct: 1    MAANKFATMLHRNTNKITLILVYTLLEWILIILLLLNSLFSFLIIKFADYFGLKTPCLWC 60

Query: 61   SRVDHVFEPGK-KFSYRDLLCEPHAMEISNLGYCSNHRKLTEVRDLCEDCFSSSNPNEFY 120
            SR+DH+ EPGK K S+RDL+CE HA EIS LGYCSNH+KL E +D+CEDC S  +  E+ 
Sbjct: 61   SRLDHLLEPGKNKNSHRDLVCETHANEISKLGYCSNHQKLAESQDMCEDCSSQPDSEEW- 120

Query: 121  QIPKNFAFF-----------GDEK-----EDFRCCSCCGESLKNRLFSPCILIKPNWGDL 180
               K FAFF           GDEK     ++   CSCCG  L N+ + PCILIKP+W  L
Sbjct: 121  --SKKFAFFPWMKQIGVIQGGDEKVIQNGDENLNCSCCGMKL-NKFYPPCILIKPSWEVL 180

Query: 181  DYAHKGNLISEADIDVQSDE---IHASPTEDIIGNRE-------------ISIVSGG--- 240
            DY  K +L  EA +D Q++E      S ++ II   E             I  V GG   
Sbjct: 181  DYTQKQSLTMEAGVDAQTEEGDHSDQSRSDFIIDQHEDEEAIEVNRKDNTIFDVDGGCKR 240

Query: 241  --EQAEKNSGCSVCGCCCKVSAVHEEEKEEEDKAKMGGEKDGDFLELAEDLSSLNHKTVQ 300
              ++AE++S CSVC   CK    +E++K +    +    K+ + L ++ D    +H+T  
Sbjct: 241  REDEAEEHSACSVCDYGCKEIVANEDDKVDRVIEEQEPIKEAN-LNVSMDDQPRDHQTFI 300

Query: 301  LGCVRENESAETAPHHLEFYIDRGNDRRLIPVDLIDFSASDHNNNESNILSSVKDEEQEQ 360
                    S E  P HLEFYID+ +D RL+ VDLID                        
Sbjct: 301  QASCDNGLSPEILPQHLEFYIDQ-DDCRLVLVDLID------------------------ 360

Query: 361  EPEPEPEQEQEQEQEQDQEQQQEDCGNEDVVLDFGSNFEKQGQDVTEDWEVISGERLAEF 420
                 P   + Q  ++ + + Q +   EDV+LDFG  FE Q + V E W   S E     
Sbjct: 361  ----SPTTTELQSHKKYKVEDQGNSSYEDVILDFGMCFEAQAKPVVESWR--SSEESVTL 420

Query: 421  LSVSLHESKQKVAEVEAMKVEEGSTRASGLGSDEDPSMEVEEQEIEEQEQEIEEQEQQIE 480
            LS   HESK           EEG  RAS L S++           + +E  I ++E +  
Sbjct: 421  LS--FHESK-----------EEG--RASVLDSEDLGENRSSSSVFQGEEGGIAKEENEPV 480

Query: 481  EQQQEIEEQEQEIEEAEASIGEAIQAPAIDDAHEEDLAELVLDSDLHQ----DIHEWNDE 540
               Q  +   QE ++ +   G++  A A DD          +DSD+HQ    D++  NDE
Sbjct: 481  ATTQATQTSSQEDDDDDDDDGQSNAAIARDD----------IDSDVHQAFEDDVYMHNDE 540

Query: 541  HEVEISIGTDIPDHEPIDEIQTQNDI--PSHPNVQEDPSPTSTLVVDDNMQDYNKAEKSE 600
             + E+SIGT+IPD EPIDE+Q   +    S+P  QEDPS   T   + +  D++ ++++E
Sbjct: 541  IDAEVSIGTEIPDQEPIDEMQLAQEFLHSSYPCAQEDPS---TSCANLHACDHHGSKQAE 600

Query: 601  EAEDTKEEVEFKILSVETSSQPSDNHKPSRSEFNENEEEDKVPDTPTSMDSLHQLHKKLL 660
            E     E ++FK  S ET  +  +NH    SEFNE EEE KVPDTPTS+DSLHQLHK+LL
Sbjct: 601  E-----ELLKFKTFSAETGEEAKENHFSLGSEFNEIEEE-KVPDTPTSIDSLHQLHKELL 660

Query: 661  LLDRKESGAEESLDGSVISETESGDGVLTIEKLKSALRTERKVLNALYSELEEERSASAI 720
            L +R+E G EESLDGSV+S+ E GDGV+TIEKLK+ LR ERK LN LY+ELEEERSASA+
Sbjct: 661  LFERREVGTEESLDGSVLSDIEGGDGVMTIEKLKTVLRAERKALNELYAELEEERSASAV 720

Query: 721  AANQTMAMINRLQEEKASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQELEK 780
            AA+QTMAMINRLQEEKA+MQMEALQYQRMMEEQSEYDQEA+QLLNEL+VKREKEKQE+E+
Sbjct: 721  AASQTMAMINRLQEEKAAMQMEALQYQRMMEEQSEYDQEAMQLLNELMVKREKEKQEVER 780

Query: 781  GIEVYRKKLQDYEAKEKMALLRSRKEGSIQSRNSSVSCSNADDSDGLSIDLNTEAKKDED 840
             +E+ RKK+QDYEAKE+M +LR  K+GS +SR+SS  CSNA+DSDGLSIDLN E+K+++ 
Sbjct: 781  ELEICRKKVQDYEAKERMMILRRMKDGSTRSRSSSGPCSNAEDSDGLSIDLNNESKEEDS 840

Query: 841  LFCNQETNNQNTPAEAVLYLEETLANFEEERLSILEELKMLEEKLFTLSDEE----QQFD 900
                +E +NQNTP +AVLYLEE+LA+FEEE+LSIL++LK LEEKL TLSDEE    Q   
Sbjct: 841  ---REEGSNQNTPTDAVLYLEESLASFEEEKLSILDQLKELEEKLLTLSDEEEEHFQNMK 900

Query: 901  DIEHYSEHNGNGYHKNSDSVSETNGFENGHHVKEMNGNHHPGKRTMSTKAKRLLPLFDDA 960
             I+++   NGNGYH+  D  SE NG  NGH  KEMNG H+       +K KRLLPLF DA
Sbjct: 901  PIKYFLSENGNGYHEKLDVSSEVNGVANGHS-KEMNGKHN----IKGSKGKRLLPLF-DA 960

Query: 961  VDTDVEDVTI---GEEQGFDSVSMQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEADRE 1014
            ++ + ED  +   G+  G+DS + Q  +  KF+ E ++ A+EEEV HVYERLQALEADRE
Sbjct: 961  IEAEAEDGELELNGDTGGYDSFASQDFV-IKFEEENKKFAIEEEVGHVYERLQALEADRE 984

BLAST of Cp4.1LG01g18370 vs. TrEMBL
Match: V7CPF2_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_002G183000g PE=4 SV=1)

HSP 1 Score: 765.0 bits (1974), Expect = 1.2e-217
Identity = 514/1060 (48.49%), Positives = 680/1060 (64.15%), Query Frame = 1

Query: 1    MAANRFATLLHRNSNKITLILVYALLEWVLIFLLLLQALFSYLIIKFAELFGLKRPCLWC 60
            MAAN+FAT+LHRN+NKITL+LVYA+LEW+LI LLLL +LFSYLIIKFA+ FGLKRPC+WC
Sbjct: 1    MAANKFATMLHRNTNKITLVLVYAILEWILIILLLLNSLFSYLIIKFADYFGLKRPCIWC 60

Query: 61   SRVDHVFEPGK-KFSYRDLLCEPHAMEISNLGYCSNHRKLTEVRDLCEDCFSSSNPNEFY 120
            +R+DH+ E GK K S RDL+CE HA EIS LG+CS H+KL E + +CEDC SSS P ++ 
Sbjct: 61   TRIDHIIESGKNKTSCRDLVCEAHASEISKLGFCSIHQKLAESQAMCEDCSSSSQP-DYV 120

Query: 121  QIPKNFAFF----------------GDEK----EDFRCCSCCGESLKNRLFSPCILIKPN 180
            ++ +NF FF                GD+     E+   CSCCG +   R + PCI IKP+
Sbjct: 121  KLSRNFGFFPWMKQIGMIQDESADAGDKAIVKVEEAMRCSCCGVNFDKRFYPPCIFIKPS 180

Query: 181  WGDLDYAHKGNLISEADIDVQSDEIHASPT------EDIIGNRE-------ISIVSG--- 240
               L+Y  K NL++E  + V+ DE H          ED  GN E       + +  G   
Sbjct: 181  LNVLEYDQKQNLVTERGVGVEIDEDHTRSDIVLDHHEDGQGNGENKESHMVVEVDQGLDR 240

Query: 241  -GEQAEKNSGCSVCGCC--------CKVSAVHEEEKEEEDKAKMGGEKDGDFLELAEDLS 300
              E+AEK+  CSVC           CK+    E+ KE  ++  +   K  D  + A+D  
Sbjct: 241  KDEEAEKSCDCSVCDASVDILCDEICKLDLGVEKGKETIEEESLNASKSMD--DDADDDQ 300

Query: 301  SLNHKTVQLGCVRENESAETAPHHLEFYIDRGNDRRLIPVDLIDFSASDHNNNESNILSS 360
            +      Q+ C RE  + ET P HLEF+I  G+D RLIPV+L+D  A+++  +   ++  
Sbjct: 301  ACEKSAAQVDCTREI-TVETPPKHLEFFI-HGDDCRLIPVELVDSPATENRTHSRYMVGG 360

Query: 361  VKDEEQEQEPEPEPEQEQEQEQEQDQEQQQEDCGNEDVVLDFGSNFEKQGQDVTEDWEVI 420
                                         +    NED +LDF  + + + + + E+W  I
Sbjct: 361  -----------------------------EGLNSNEDFILDFDMSADAEAEPLVENWH-I 420

Query: 421  SGERLAEFLSVSLHESKQKVAEVEAMKVEEGSTRASGLGSDEDPSMEVEEQEIEEQEQEI 480
            SG+ +AEF   S  E+     E  A  V+  +T  S L S      ++EE+ + +  +++
Sbjct: 421  SGDIVAEF---SCQEN-----ENAAKSVQLRTTGQSPLLS------QLEEENLVQNCEDM 480

Query: 481  EEQEQQIEEQQQEIEEQEQEIEEAEASIGEAIQAPAIDDAHEEDLAELVLDSDLHQDIHE 540
               +   +  + E  E   E  +AE     ++ +        ED +++            
Sbjct: 481  RFFQPADDFTKDENVEANMESRDAEQCSDVSLAS--------EDASQMQ----------- 540

Query: 541  WNDEHEVEISIGTDIPDHEPIDEIQTQNDI-PSHPNVQEDPSPTSTLVVDDNMQDYNKAE 600
              +E+E E+SIGT+IPD E +DE Q+Q+ +  ++  ++EDPS   T  V  N+QD +  +
Sbjct: 541  -GEEYEAEVSIGTEIPDQEQVDEYQSQDVLLDTNQQIEEDPS---TSAVRFNVQDESGDD 600

Query: 601  KSEEAEDTKEEVEFKILSVETSSQPSDNHKPSRSEFNENEEEDKVPDTPTSMDSLHQLHK 660
            K EE       VEFK LS+E      +NH PS    NENEEE KVPDTPTS++SLHQLHK
Sbjct: 601  KGEEF------VEFKTLSIEVRMPTVNNHLPSLLVLNENEEE-KVPDTPTSVESLHQLHK 660

Query: 661  KLLLLDRKESGAEESLDGSVISETESGDGVLTIEKLKSALRTERKVLNALYSELEEERSA 720
            KLLLL+RKESG EESLDGSVIS+ E G+  +T+EKLK+AL++ERK L+ LY+ELEEERSA
Sbjct: 661  KLLLLERKESGTEESLDGSVISDIECGE--VTMEKLKAALKSERKALSTLYAELEEERSA 720

Query: 721  SAIAANQTMAMINRLQEEKASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQE 780
            SAIAANQTMAMINRLQEEKA+MQMEALQYQRMMEEQSEYDQEALQLLNEL++KREKEKQE
Sbjct: 721  SAIAANQTMAMINRLQEEKAAMQMEALQYQRMMEEQSEYDQEALQLLNELMMKREKEKQE 780

Query: 781  LEKGIEVYRKKLQDYEAKEKMALLRSRKEGSIQSRNSSVSCSNADDSDGLSIDLNTEAKK 840
            LEK +E++RKK+ DYE +EKM +  SR++GS++SR SS SCSNA+DSDGLSIDLN EAK+
Sbjct: 781  LEKELEIFRKKVHDYEVREKMVM--SRRDGSMRSRTSSPSCSNAEDSDGLSIDLNHEAKE 840

Query: 841  DEDLFCNQETNNQNTPAEAVLYLEETLANFEEERLSILEELKMLEEKLFTLSDEEQQFDD 900
            +   + +QE +NQNTP +AVLYLEE+LANFEEERL ILE+LK+LEEKL  L+ EE+   D
Sbjct: 841  ENGFYSHQECSNQNTPVDAVLYLEESLANFEEERLQILEQLKVLEEKLVILNYEEEHCSD 900

Query: 901  ---IEHYSEHNGNGYHKNSDSVSETNGFENGHHVKEMNGNHHPGKRTMSTKAKRLLPLFD 960
                   SE NGNGYH + D   + NGF NG H KE+NG HH G++ M  KAKRLLPLF 
Sbjct: 901  DAKSVELSEENGNGYHDDDDHEGQVNGFANG-HAKEINGKHHKGRKIMGAKAKRLLPLF- 960

Query: 961  DAVDTDVEDVTIGEEQGFDSVSMQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEADREF 1011
            DA+ ++ EDV +  ++  D   +Q +   K +   ++ A+EEEVD+VYERLQ LEADREF
Sbjct: 961  DAMSSEAEDVELSGDE-LDLPHLQDNSVEKVNMVKKKFALEEEVDNVYERLQVLEADREF 974

BLAST of Cp4.1LG01g18370 vs. TAIR10
Match: AT1G70750.1 (AT1G70750.1 Protein of unknown function, DUF593)

HSP 1 Score: 276.2 bits (705), Expect = 8.2e-74
Identity = 260/678 (38.35%), Positives = 364/678 (53.69%), Query Frame = 1

Query: 358  SNFEKQGQDVTEDWEVISGERLAEFLSVSLHESKQKVAEVEAMKVEEGSTRASGLGSDED 417
            S  E + + V +  E +  + + E  S  +     K  E+   K EE            D
Sbjct: 168  SQEETEEKKVPQSHEKLEDDDVDEEFSCYVSSFDCKNKEIATEKEEENRV---------D 227

Query: 418  PSMEVEEQEIEEQEQEIEEQEQQIEEQQQEIEEQEQEIEEAEASIGEAIQAPAIDD---- 477
              +EVE  E   +  E    E+       E  +  +E+ E     G+ I    ++     
Sbjct: 228  LPIEVETAESAPKNLEFYIDEEDCHLIPVEFYKPSEEVREISDINGDFILDFGVEHDFTA 287

Query: 478  -AHEEDLAELVLDSDLHQDIHEWN----------DEHEVEISIGTDIPDHEPIDEIQTQN 537
             A  E++++     +   +  E N          +E + E+SIGT+IPDHE I +I +  
Sbjct: 288  AAETEEISDFASPGESKPEDAETNLVASEMENDDEETDAEVSIGTEIPDHEQIGDIPSHQ 347

Query: 538  DIPSHPNVQEDPSPTSTLVVDDNMQDYNKAEKSEEAEDTKEEVEFKILSVETSSQPSDNH 597
             IP H               DD+ ++              E +EFK +++ET        
Sbjct: 348  LIPHHD--------------DDDHEE--------------ETLEFKTVTIETKMPVL--- 407

Query: 598  KPSRSEFNENEEEDKVPDTPTSMDSLHQ-LHKKLLLLDRKESGAEESLDGSVISETESGD 657
                     N  E+++ +   SM+S H  LH  +  L+++ S     +DG      E  +
Sbjct: 408  ---------NINEERILEAQGSMESSHSSLHNAMFHLEQRVS-----VDG-----IECPE 467

Query: 658  GVLTIEKLKSALRTERKVLNALYSELEEERSASAIAANQTMAMINRLQEEKASMQMEALQ 717
            GVLT++KLK  L+ ERK L+ALY ELE ER+ASA+AA++TMAMINRL EEKA+MQMEALQ
Sbjct: 468  GVLTVDKLKFELQEERKALHALYEELEVERNASAVAASETMAMINRLHEEKAAMQMEALQ 527

Query: 718  YQRMMEEQSEYDQEALQLLNELVVKREKEKQELEKGIEVYRKKLQDYEAKEKMALLRSR- 777
            YQRMMEEQ+E+DQEALQLLNEL+V REKE  ELEK +EVYRK++++YEAKEKM +LR R 
Sbjct: 528  YQRMMEEQAEFDQEALQLLNELMVNREKENAELEKELEVYRKRMEEYEAKEKMGMLRRRL 587

Query: 778  KEGSIQS-RNSSVSCSNADDSDGLSIDLNTEAKKDEDLFCNQETNNQNTPAEAVLYLEET 837
            ++ S+ S RN+  S  N   S+G     N E   D   +  +E   +NTP + VL L+E 
Sbjct: 588  RDSSVDSYRNNGDSDEN---SNGELQFKNVEGVTD---WKYRENEMENTPVDVVLRLDEC 647

Query: 838  LANFEEERLSILEELKMLEEKLFTLSDEEQQFDDIEHYSEHNGNGYHKNSDSVSETNGFE 897
            L +++ ERLSIL  LK LEEKL  L++EE   ++ + +                E+NG  
Sbjct: 648  LDDYDGERLSILGRLKFLEEKLTDLNNEEDDEEEAKTF----------------ESNGSI 707

Query: 898  NGH---HVKEMNGNHHPGKRTMSTKAKRLLPLFDDAVDTDVEDVTIG---EEQGFDSVSM 957
            NG+   H KE NG H         K+KRLLPLFD AVD ++E+        E GFD    
Sbjct: 708  NGNEHIHGKETNGKHRV------IKSKRLLPLFD-AVDGEMENGLSNGNHHENGFDD--- 746

Query: 958  QKSLDNKFDTEFRRVAVEEEVDHVYERLQALEADREFLKHCIGSLRKGDKGLELLQEILQ 1011
                      +   V +EEEVD +YERL+ALEADREFL+HC+GSL+KGDKG+ LL EILQ
Sbjct: 768  --------SEKGENVTIEEEVDELYERLEALEADREFLRHCVGSLKKGDKGVHLLHEILQ 746

BLAST of Cp4.1LG01g18370 vs. TAIR10
Match: AT5G16720.1 (AT5G16720.1 Protein of unknown function, DUF593)

HSP 1 Score: 253.4 bits (646), Expect = 5.7e-67
Identity = 195/468 (41.67%), Positives = 278/468 (59.40%), Query Frame = 1

Query: 545  NMQDYNKAEKSEEAEDTKEEVEFKILSVETSSQPSDNHKPSRSEFNENEEEDKV----PD 604
            N+  Y + + S   E+ +EE     L  +     S N   S+ E  E + E+      P+
Sbjct: 254  NVATYGEDQISGRVEEKEEETGVADLLYDQFE--SKNFTGSQIEEEEEDREETTKELDPE 313

Query: 605  TPTSMDSLHQLHKKLLLLDRKE-SGAEESLDGSV-ISETESGDGVLTIEKLKSALRTERK 664
            TPTS+ +L   +KKL  L R E + AE++ DG+V +SE + GD + TIE+L+  +R E++
Sbjct: 314  TPTSVSTL--FNKKLHFLARNEYAAAEDAGDGNVLVSEMDGGDPLRTIERLRETVRAEQE 373

Query: 665  VLNALYSELEEERSASAIAANQTMAMINRLQEEKASMQMEALQYQRMMEEQSEYDQEALQ 724
             L  LY+ELEEERSASAI+ANQTMAMI RLQEEKA +QMEALQYQRMMEEQ+EYDQEALQ
Sbjct: 374  ALRDLYAELEEERSASAISANQTMAMITRLQEEKAKVQMEALQYQRMMEEQAEYDQEALQ 433

Query: 725  LLNELVVKREKEKQELEKGIEVYRKKLQDYEAKEKMALLRSRKEGSIQSRNSSVSCSNAD 784
            LLN L+VKREKEK++L++ +EVYR K+ +YE+K K  ++    +              AD
Sbjct: 434  LLNHLMVKREKEKEQLQRELEVYRAKVLEYESKAKNKIIVVEND------------CEAD 493

Query: 785  DSDGLSIDLNTEAKKDEDLFCNQETNNQNTPAEAVLY---LEETLANFEEERLSILEELK 844
            D D        E  ++ED     + + +    + V +   L E+L+ FEEERL IL++LK
Sbjct: 494  DDD------KEEENREEDNSSEMDVDLEKITLDCVQHMSMLGESLSEFEEERLVILDQLK 553

Query: 845  MLEEKLFTLSDEEQQFDDIEHYSEHNGNGYHKNSDSVSETNGFENGHHVKEMNGNHHPGK 904
            +LE++L T+ D+E   D  E       N Y + S          NGH           G 
Sbjct: 554  VLEDRLVTMQDKESAEDPGEF-----SNSYEEAS----------NGH-----------GG 613

Query: 905  RTMSTKAKRLLPLFDDAVDTDVEDVTIGEEQGFDSVSMQKSLDNKFDTEFRRVAVEEEVD 964
             TM++ AK LLPL  DA + + ED          S  + +S +  F ++  ++ + ++VD
Sbjct: 614  LTMASMAKSLLPLL-DAAENESED---------GSQGLPESDEKNFGSDSEKLEIIKQVD 663

Query: 965  HVYERLQALEADREFLKHCIGSLRKGDKGLELLQEILQHLRDLRSVDI 1004
             VYERLQ LE D EFLK+C+ S +KGDKG ++L++ILQHLRDLR++++
Sbjct: 674  SVYERLQELETDGEFLKNCMSSAKKGDKGTDILKDILQHLRDLRTIEL 663

BLAST of Cp4.1LG01g18370 vs. TAIR10
Match: AT1G74830.1 (AT1G74830.1 Protein of unknown function, DUF593)

HSP 1 Score: 110.9 bits (276), Expect = 4.6e-24
Identity = 103/281 (36.65%), Positives = 160/281 (56.94%), Query Frame = 1

Query: 578 PSDNHKPSRSEFNENEEEDKVPDTPTSMDSLHQLHK--KLLLLDRKESGAE---ESLDGS 637
           P+ + + S ++ +ENE E K  D   +   +   +K   + L D  ++       SL  S
Sbjct: 223 PAPSPRVSHNKLSENESEFKDMDVDRTPSFVRGGNKFFGIPLSDSAQNSPRWSVRSLKKS 282

Query: 638 VISETESGD------GVLTIEKLKSALRTERKVLNALYSELEEERSASAIAANQTMAMIN 697
           V+++TE+        G   + +LK  +R ++K L  LY EL+EERSASA+AAN+ MAMI 
Sbjct: 283 VLNKTENASDTTDPTGESILNQLKKEVRLDKKSLIDLYMELDEERSASAVAANEAMAMIT 342

Query: 698 RLQEEKASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQELEKGIEVYRKKLQ 757
           RLQ EKA++QMEALQYQRMM+EQ+EYDQEALQ ++  + KRE+E +ELE   EVYR+K  
Sbjct: 343 RLQAEKAAVQMEALQYQRMMDEQAEYDQEALQSMSSELAKREEEMKELEAEFEVYREKYG 402

Query: 758 DYEAKEKMALLRSRKEGSIQSRNSSV--SCSNADDSDGLSIDLNTEAKKDEDLFCN-QET 817
               +E      +R+E   Q+ N+S    C        L++  + + +  E++  N Q  
Sbjct: 403 CLTDQED-----AREEFHKQNGNASAYDDCQETKPVSDLAVSSSNQQENGENIDQNGQSK 462

Query: 818 NNQNTPAEAVLYLEETLANFEEERLSILEELKMLEEKLFTL 845
            ++ + AE V+  +E   +  E +  I++EL  + E+L TL
Sbjct: 463 RSEESTAENVVSADEEKGS--ESKEGIVKELSEITERLSTL 496

BLAST of Cp4.1LG01g18370 vs. TAIR10
Match: AT1G18990.1 (AT1G18990.1 Protein of unknown function, DUF593)

HSP 1 Score: 106.3 bits (264), Expect = 1.1e-22
Identity = 93/231 (40.26%), Positives = 130/231 (56.28%), Query Frame = 1

Query: 636 ETESGDGVLTIEKLKSALRTERKVLNALYSELEEERSASAIAANQTMAMINRLQEEKASM 695
           E+E  DG   ++ L   +R +RK L  LY EL+EERSASA+AAN  MAMI RLQ EKA++
Sbjct: 291 ESEVLDGDSILQHLNRQVRLDRKSLMDLYMELDEERSASAVAANNAMAMITRLQAEKAAV 350

Query: 696 QMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQELEKGIEVYRKKLQDYEAKEKMA 755
           QMEALQYQRMM+EQ+EYDQEALQ +N L+VKRE+E +ELE GIEVYR          +  
Sbjct: 351 QMEALQYQRMMDEQAEYDQEALQSMNGLLVKREEEMKELEAGIEVYRL---------RYG 410

Query: 756 LLRSRK---EGSIQSRNSSVSCSNADDSDGLSIDLNT-EAKKDEDLFCNQETNNQNTPAE 815
           LLR  +   E  +      VS            DL    +  +EDL   +++   +    
Sbjct: 411 LLREERGEAEEFLDEETKPVS------------DLPVCSSNHEEDLEQMKDSAEDSIGNN 470

Query: 816 AVLYLEETLANFEEERLSILEELKMLEEKLFTLSDEE---QQFDDIEHYSE 860
            V+ +EE   N   + + +++E+  + E+L  +  +    QQ  D+   SE
Sbjct: 471 GVMIIEEEKENGSRKDM-LVKEISEITERLNAIESKGELLQQISDVLDVSE 499

BLAST of Cp4.1LG01g18370 vs. TAIR10
Match: AT1G08800.1 (AT1G08800.1 Protein of unknown function, DUF593)

HSP 1 Score: 88.6 bits (218), Expect = 2.4e-17
Identity = 55/158 (34.81%), Positives = 86/158 (54.43%), Query Frame = 1

Query: 14  SNKITLILVYALLEWVLIFLLLLQALFSYLIIKFAELFGLKRPCLWCSRVDHVFEPGK-- 73
           S   T  L  A  EW+L+F+L + ++FSY+I +FA+   L+ PCL CS +DH+    K  
Sbjct: 3   SRSFTRALALAFNEWLLMFMLFVNSIFSYVIARFADYSELQSPCLMCSNLDHILRRTKDL 62

Query: 74  KFSYRDLLCEPHAMEISNLGYCSNHRKLTEVRDLCEDC---FSSSNPN--EFYQI----- 133
           K ++ D++C  H  EIS+L YC  H KL +VR +CE C   F+++N +  E Y++     
Sbjct: 63  KKTHWDIICSKHKSEISSLVYCHAHGKLVDVRGMCETCLFSFATTNKSNAETYRLLVGKL 122

Query: 134 --PKNFAFFGDEKEDFRC-----CSCCGESLKNRLFSP 153
               +F    D  +   C     C+CC     N+L++P
Sbjct: 123 GEDSHFGSKSDRSKYPNCSKLTDCTCC-----NQLWTP 155

BLAST of Cp4.1LG01g18370 vs. NCBI nr
Match: gi|778703967|ref|XP_011655455.1| (PREDICTED: myosin-binding protein 3 isoform X1 [Cucumis sativus])

HSP 1 Score: 1282.7 bits (3318), Expect = 0.0e+00
Identity = 748/1046 (71.51%), Positives = 827/1046 (79.06%), Query Frame = 1

Query: 1    MAANRFATLLHRNSNKITLILVYALLEWVLIFLLLLQALFSYLIIKFAELFGLKRPCLWC 60
            MAAN+FAT+LHRNSNKITLILVYALLEWVLIFLLLL  LFSYLI+KFAE FGLKRPCLWC
Sbjct: 1    MAANKFATILHRNSNKITLILVYALLEWVLIFLLLLHGLFSYLIVKFAEWFGLKRPCLWC 60

Query: 61   SRVDHVFEPGKKFSYRDLLCEPHAMEISNLGYCSNHRKLTEVRDLCEDCFSSSNPNEFYQ 120
            SRVDHVFEP +K SYRDLLCE HAMEISNLGYCSNHRKL+E RDLCEDC SSS  NEFYQ
Sbjct: 61   SRVDHVFEPQRKQSYRDLLCEGHAMEISNLGYCSNHRKLSEFRDLCEDCSSSSKSNEFYQ 120

Query: 121  IPKNFAFFGDEKEDFRCCSCCGESLKNRLFSPCILIKPNWGDLDYAHKGNLISEADIDVQ 180
            I K+F FF DEKEDFR CSCCGE+LK RLFSPCILIKPNWGDLDY  KGNLISE +    
Sbjct: 121  ISKSFPFFDDEKEDFRTCSCCGETLKGRLFSPCILIKPNWGDLDYTQKGNLISETE---- 180

Query: 181  SDEIHASPTEDIIGNREISIVSGGEQAEKNSGCSVCGCCCKVSAVHEEEKEEEDKAKMGG 240
            +DEIH S +ED+ GNR ISIVSGGE+ EKNS CSVCGC CK SAVHE++  ++D+A +  
Sbjct: 181  TDEIHVSQSEDVSGNRGISIVSGGEEGEKNSTCSVCGCGCKDSAVHEDD--DDDRADISA 240

Query: 241  EKDGDFLELAEDLSSLNHKTVQLGCVRENESAETAPHHLEFYIDRGNDRRLIPVDLIDFS 300
            +KDG FLELAEDL+  N +TV++GC +E+E  ET P+HLEFYIDRG+DRRLIPVDLIDFS
Sbjct: 241  QKDGGFLELAEDLTICNQETVEVGCEKEDELPETVPNHLEFYIDRGDDRRLIPVDLIDFS 300

Query: 301  ASDHNNNESNILSSVKDEEQEQEPEPEPEQEQEQEQEQDQEQQQEDCGNEDVVLDFGSNF 360
            A D +N+ SNILS VKDEEQEQE                      DCGNEDVVLDF SNF
Sbjct: 301  APDDDNSTSNILSQVKDEEQEQE----------------------DCGNEDVVLDFASNF 360

Query: 361  EKQGQDVTEDWEVISGERLAEFLSVSLHESKQKVAEVEAMKVEEGSTRASGLGSDEDPSM 420
            E +   V+E WEVISGERLAEFLS SLHE+KQ+V EVEAM VEE            DP +
Sbjct: 361  ENRRHGVSEAWEVISGERLAEFLSASLHENKQRVEEVEAMDVEE------------DPLV 420

Query: 421  EVEEQEIEEQEQEIEEQEQQIEEQQQE--IEEQEQEIEEAEASIGEAIQAPAIDDAHEED 480
             V ++E +E+E+E EE +  I+E  Q    +  ++E+EE       A + P   D HE D
Sbjct: 421  GVGKEEEKEEEEE-EEADASIDESSQAPASDAHKEELEELVV----ATRQPD-SDLHEVD 480

Query: 481  LA----ELVLDSDLHQDI--HEWNDEHEVEISIGTDIPDHEPIDEIQTQN-----DIPSH 540
                  EL ++  +  DI  HE  DE + +I    D+P H  + E  + +     D    
Sbjct: 481  FHMWSDELEVEISIGTDIPDHEPIDEIQTQI----DLPPHPDLQEDPSPSSSLDVDNMQD 540

Query: 541  PNVQEDPSPTSTLVVDDNMQ-----------DYNKAEKSEEAEDTKEEV-------EFKI 600
            PN+ E+      ++ ++  +           D +K   SE  ED +E+        EFKI
Sbjct: 541  PNIVEEVEEAEEVMEEEKFKIFSMETSSQPSDNHKPSSSEVNEDEEEDKVPGTEVEEFKI 600

Query: 601  LSVETSSQPSDNHKPSRSEFNENEEEDKVPDTPTSMDSLHQLHKKLLLLDRKESGAEESL 660
            LSVETSS PSDNHK S SE NENEEEDKVPDTPTSMDSLHQLHKKLLLLDRKESG EESL
Sbjct: 601  LSVETSSHPSDNHKSSSSEVNENEEEDKVPDTPTSMDSLHQLHKKLLLLDRKESGTEESL 660

Query: 661  DGSVISETESGDGVLTIEKLKSALRTERKVLNALYSELEEERSASAIAANQTMAMINRLQ 720
            DGSVISETE GDGVLT+EKLKSALRTERK LNALY+ELEEERSASAIAANQTMAMINRLQ
Sbjct: 661  DGSVISETEGGDGVLTLEKLKSALRTERKALNALYAELEEERSASAIAANQTMAMINRLQ 720

Query: 721  EEKASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQELEKGIEVYRKKLQDYE 780
            EEKASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQELEK IE+YRKKLQDYE
Sbjct: 721  EEKASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQELEKEIEIYRKKLQDYE 780

Query: 781  AKEKMALLRSRKEGSIQSRNSSVSCSNADDSDGLSIDLNTEAKKDEDLFCNQETNNQNTP 840
            AKEK+ALLR RKEGSI+SRNSSVSCSNADDSDGLSIDLNTEAKKDEDLF NQET NQNTP
Sbjct: 781  AKEKIALLRIRKEGSIRSRNSSVSCSNADDSDGLSIDLNTEAKKDEDLFSNQETENQNTP 840

Query: 841  AEAVLYLEETLANFEEERLSILEELKMLEEKLFTLSDEEQQFDDIEHYSEHNGNGYHKNS 900
            AEAVLYLEETLANFEEERLSILEELKMLEEKLFTLSDEEQQF+DI+HY E NGNGY KNS
Sbjct: 841  AEAVLYLEETLANFEEERLSILEELKMLEEKLFTLSDEEQQFEDIDHYCERNGNGYDKNS 900

Query: 901  DSVSETNGFENGHHVKEMNGNHHPGKRTMSTKAKRLLPLFDDAVDTD-VEDVTIGEEQGF 960
            D    TNGFENGH+ KEMNG H+P +R MSTKAKRLLPLFDD VD D VEDVT GEEQGF
Sbjct: 901  DYSPGTNGFENGHNAKEMNGKHYPERRAMSTKAKRLLPLFDDVVDADVVEDVTNGEEQGF 960

Query: 961  DSVSMQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEADREFLKHCIGSLRKGDKGLELL 1015
            DS+S+QKSLDNKFDTEFRRVAVEEEVDHVYERLQALEADREFLKHCIGSLRKGDKGLELL
Sbjct: 961  DSISIQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEADREFLKHCIGSLRKGDKGLELL 996

BLAST of Cp4.1LG01g18370 vs. NCBI nr
Match: gi|778703971|ref|XP_011655456.1| (PREDICTED: myosin-binding protein 2 isoform X2 [Cucumis sativus])

HSP 1 Score: 1281.2 bits (3314), Expect = 0.0e+00
Identity = 744/1043 (71.33%), Positives = 825/1043 (79.10%), Query Frame = 1

Query: 1    MAANRFATLLHRNSNKITLILVYALLEWVLIFLLLLQALFSYLIIKFAELFGLKRPCLWC 60
            MAAN+FAT+LHRNSNKITLILVYALLEWVLIFLLLL  LFSYLI+KFAE FGLKRPCLWC
Sbjct: 1    MAANKFATILHRNSNKITLILVYALLEWVLIFLLLLHGLFSYLIVKFAEWFGLKRPCLWC 60

Query: 61   SRVDHVFEPGKKFSYRDLLCEPHAMEISNLGYCSNHRKLTEVRDLCEDCFSSSNPNEFYQ 120
            SRVDHVFEP +K SYRDLLCE HAMEISNLGYCSNHRKL+E RDLCEDC SSS  NEFYQ
Sbjct: 61   SRVDHVFEPQRKQSYRDLLCEGHAMEISNLGYCSNHRKLSEFRDLCEDCSSSSKSNEFYQ 120

Query: 121  IPKNFAFFGDEKEDFRCCSCCGESLKNRLFSPCILIKPNWGDLDYAHKGNLISEADIDVQ 180
            I K+F FF DEKEDFR CSCCGE+LK RLFSPCILIKPNWGDLDY  KGNLISE +    
Sbjct: 121  ISKSFPFFDDEKEDFRTCSCCGETLKGRLFSPCILIKPNWGDLDYTQKGNLISETE---- 180

Query: 181  SDEIHASPTEDIIGNREISIVSGGEQAEKNSGCSVCGCCCKVSAVHEEEKEEEDKAKMGG 240
            +DEIH S +ED+ GNR ISIVSGGE+ EKNS CSVCGC CK SAVHE++  ++D+A +  
Sbjct: 181  TDEIHVSQSEDVSGNRGISIVSGGEEGEKNSTCSVCGCGCKDSAVHEDD--DDDRADISA 240

Query: 241  EKDGDFLELAEDLSSLNHKTVQLGCVRENESAETAPHHLEFYIDRGNDRRLIPVDLIDFS 300
            +KDG FLELAEDL+  N +TV++GC +E+E  ET P+HLEFYIDRG+DRRLIPVDLIDFS
Sbjct: 241  QKDGGFLELAEDLTICNQETVEVGCEKEDELPETVPNHLEFYIDRGDDRRLIPVDLIDFS 300

Query: 301  ASDHNNNESNILSSVKDEEQEQEPEPEPEQEQEQEQEQDQEQQQEDCGNEDVVLDFGSNF 360
            A D +N+ SNILS VKDEEQEQE                      DCGNEDVVLDF SNF
Sbjct: 301  APDDDNSTSNILSQVKDEEQEQE----------------------DCGNEDVVLDFASNF 360

Query: 361  EKQGQDVTEDWEVISGERLAEFLSVSLHESKQKVAEVEAMKVEEGSTRASGLGSDEDPSM 420
            E +   V+E WEVISGERLAEFLS SLHE+KQ+V EVEAM VEE      G+G +E    
Sbjct: 361  ENRRHGVSEAWEVISGERLAEFLSASLHENKQRVEEVEAMDVEEDPL--VGVGKEE---- 420

Query: 421  EVEEQEIEEQEQEIEEQEQQ--IEEQQQEIEEQEQEIEEAEASIGEAIQAPAIDDAH--- 480
            E EE+E EE +  I+E  Q    +  ++E+EE      + ++ + E        D H   
Sbjct: 421  EKEEEEEEEADASIDESSQAPASDAHKEELEELVVATRQPDSDLHE--------DFHMWS 480

Query: 481  EEDLAELVLDSDLHQDIHEWNDEHEVEISIGTDIPDHEPIDEIQTQN-----DIPSHPNV 540
            +E   E+ + +D+    HE  DE + +I    D+P H  + E  + +     D    PN+
Sbjct: 481  DELEVEISIGTDIPD--HEPIDEIQTQI----DLPPHPDLQEDPSPSSSLDVDNMQDPNI 540

Query: 541  QEDPSPTSTLVVDDNMQ-----------DYNKAEKSEEAEDTKEEV-------EFKILSV 600
             E+      ++ ++  +           D +K   SE  ED +E+        EFKILSV
Sbjct: 541  VEEVEEAEEVMEEEKFKIFSMETSSQPSDNHKPSSSEVNEDEEEDKVPGTEVEEFKILSV 600

Query: 601  ETSSQPSDNHKPSRSEFNENEEEDKVPDTPTSMDSLHQLHKKLLLLDRKESGAEESLDGS 660
            ETSS PSDNHK S SE NENEEEDKVPDTPTSMDSLHQLHKKLLLLDRKESG EESLDGS
Sbjct: 601  ETSSHPSDNHKSSSSEVNENEEEDKVPDTPTSMDSLHQLHKKLLLLDRKESGTEESLDGS 660

Query: 661  VISETESGDGVLTIEKLKSALRTERKVLNALYSELEEERSASAIAANQTMAMINRLQEEK 720
            VISETE GDGVLT+EKLKSALRTERK LNALY+ELEEERSASAIAANQTMAMINRLQEEK
Sbjct: 661  VISETEGGDGVLTLEKLKSALRTERKALNALYAELEEERSASAIAANQTMAMINRLQEEK 720

Query: 721  ASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQELEKGIEVYRKKLQDYEAKE 780
            ASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQELEK IE+YRKKLQDYEAKE
Sbjct: 721  ASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQELEKEIEIYRKKLQDYEAKE 780

Query: 781  KMALLRSRKEGSIQSRNSSVSCSNADDSDGLSIDLNTEAKKDEDLFCNQETNNQNTPAEA 840
            K+ALLR RKEGSI+SRNSSVSCSNADDSDGLSIDLNTEAKKDEDLF NQET NQNTPAEA
Sbjct: 781  KIALLRIRKEGSIRSRNSSVSCSNADDSDGLSIDLNTEAKKDEDLFSNQETENQNTPAEA 840

Query: 841  VLYLEETLANFEEERLSILEELKMLEEKLFTLSDEEQQFDDIEHYSEHNGNGYHKNSDSV 900
            VLYLEETLANFEEERLSILEELKMLEEKLFTLSDEEQQF+DI+HY E NGNGY KNSD  
Sbjct: 841  VLYLEETLANFEEERLSILEELKMLEEKLFTLSDEEQQFEDIDHYCERNGNGYDKNSDYS 900

Query: 901  SETNGFENGHHVKEMNGNHHPGKRTMSTKAKRLLPLFDDAVDTD-VEDVTIGEEQGFDSV 960
              TNGFENGH+ KEMNG H+P +R MSTKAKRLLPLFDD VD D VEDVT GEEQGFDS+
Sbjct: 901  PGTNGFENGHNAKEMNGKHYPERRAMSTKAKRLLPLFDDVVDADVVEDVTNGEEQGFDSI 960

Query: 961  SMQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEADREFLKHCIGSLRKGDKGLELLQEI 1015
            S+QKSLDNKFDTEFRRVAVEEEVDHVYERLQALEADREFLKHCIGSLRKGDKGLELLQEI
Sbjct: 961  SIQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEADREFLKHCIGSLRKGDKGLELLQEI 995

BLAST of Cp4.1LG01g18370 vs. NCBI nr
Match: gi|659072837|ref|XP_008467120.1| (PREDICTED: protein SGM1 isoform X2 [Cucumis melo])

HSP 1 Score: 1159.1 bits (2997), Expect = 0.0e+00
Identity = 683/1067 (64.01%), Positives = 772/1067 (72.35%), Query Frame = 1

Query: 1    MAANRFATLLHRNSNKITLILVYALLEWVLIFLLLLQALFSYLIIKFAELFGLKRPCLWC 60
            MAAN+FAT+LHRNSNKITLILVYALLEWVLIFLLLL  LFSYLI+KFAE FGLKRPCLWC
Sbjct: 1    MAANKFATILHRNSNKITLILVYALLEWVLIFLLLLHGLFSYLIVKFAEWFGLKRPCLWC 60

Query: 61   SRVDHVFEPGKKFSYRDLLCEPHAMEISNLGYCSNHRKLTEVRDLCEDCFSSSNPNEFYQ 120
            SRVDHVFEP +K SYRDLLCE HAMEISNLGYCSNHRKL+E RDLCEDC SSS  NEFYQ
Sbjct: 61   SRVDHVFEPERKHSYRDLLCEGHAMEISNLGYCSNHRKLSEFRDLCEDCSSSSKSNEFYQ 120

Query: 121  IPKNFAFFGDEKEDFRCCSCCGESLKNRLFSPCILIKPNWGDLDYAHKGNLISEADIDVQ 180
            I K+F FF DEKEDF+ CSCCGE+LK+RLFSPCILIKPNWGDLDY  KGN ISE     +
Sbjct: 121  ISKSFPFFDDEKEDFKSCSCCGETLKSRLFSPCILIKPNWGDLDYTQKGNFISE----TE 180

Query: 181  SDEIHASPTEDIIGNREISIVSGGEQAEKNSGCSVCGCCCKVSAVHEEEKEEEDKAKMGG 240
            +DEIH S +ED+ GNR ISIVSGGE+ EKNS CSVCGC CK SAVHE+  +++D+A +  
Sbjct: 181  TDEIHVSQSEDVSGNRGISIVSGGEEGEKNSTCSVCGCGCKDSAVHED--DDDDRADISA 240

Query: 241  EKDGDFLELAEDLSSLNHKTVQLGCVRENESAETAPHHLEFYIDRGNDRRLIPVDLIDFS 300
            EKDGDFLELAEDL+  N KTV++GC +E+E  ET P+HLEFYIDRG+DRRLIPVDLIDFS
Sbjct: 241  EKDGDFLELAEDLTICNQKTVEVGCEKEDELPETVPNHLEFYIDRGDDRRLIPVDLIDFS 300

Query: 301  ASDHNNNESNILSSVKDEEQEQEPEPEPEQEQEQEQEQDQEQQQEDCGNEDVVLDFGSNF 360
            A D +NN SNILS VKDEEQE                      QEDCGNEDVVLDFGSNF
Sbjct: 301  APDDDNNTSNILSQVKDEEQE----------------------QEDCGNEDVVLDFGSNF 360

Query: 361  EKQGQDVTEDWEVISGERLAEFLSVSLHESKQKVAEVEAMKVEEGSTRASGLGSDEDPSM 420
            E Q   V EDWEVISGERLAEFLSVSLHE+KQ+V EVEAM VE            EDP M
Sbjct: 361  ENQRHGVNEDWEVISGERLAEFLSVSLHENKQRVEEVEAMDVE------------EDPLM 420

Query: 421  EVEEQEIEEQEQEIEEQEQQIEEQQQEIEEQEQEIEEAEASIGEAIQAPAIDDAHEEDLA 480
             V ++E +E +  I+                            EA QAPA  DA +E+L 
Sbjct: 421  GVGKEEEKEADASID----------------------------EASQAPA-SDALKEELE 480

Query: 481  ELVL-----DSDLHQDIHEWNDEHEVEISIGTDIPDHEPIDEIQTQNDIPSHPNVQEDPS 540
            ELV+     DSDLH+D H WNDE EVEISIGTDIPDHEPIDEIQTQ D+P HP++QE+PS
Sbjct: 481  ELVVATRQPDSDLHEDFHMWNDELEVEISIGTDIPDHEPIDEIQTQIDLPPHPDLQEEPS 540

Query: 541  PTSTLVVDDNMQDYNKAEKSEEAEDTKEEVEFKILSVETSSQPSDNHKPSRSEFNENEEE 600
            P+S+L VD        + + EE  + KEE EFKI SVETSSQPSD HKPS SE NE+EEE
Sbjct: 541  PSSSLDVD--------SMQVEETVEVKEEEEFKIFSVETSSQPSDYHKPSSSEVNEDEEE 600

Query: 601  DKVPDTPTSMDSLHQLHKKLLLLDRKESGAEESLDGSVISETESGDGV------------ 660
            DKVP T      L  +       D  +  + E      ++E E  D V            
Sbjct: 601  DKVPGTEVEEFKLLSVETCSHPSDNHKPSSSE------VNENEEEDKVPDTPTSMDSLHQ 660

Query: 661  -----LTIEKLKSALRTERKVLNALYSE---------LEEERSASAIAANQTMAMINRLQ 720
                 L +++ +S   TE  +  ++ SE         LE+ +SA         A+   L+
Sbjct: 661  LHKKLLLLDRKESG--TEESLDGSVISETEGGDGVLTLEKLKSALRTERKALNALYAELE 720

Query: 721  EEKASMQMEALQ----YQRMMEEQSEYDQEALQLLNELVVKR-----------------E 780
            EE+++  + A Q      R+ EE++    EALQ    +  +                  E
Sbjct: 721  EERSASAIAANQTMAMINRLQEEKASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKRE 780

Query: 781  KEKQELEKGIEVYRKKLQDYEAKEKMALLRSRKEGSIQSRNSSVSCSNADDSDGLSIDLN 840
            KEKQELEK IE+YRKKLQDYEAKEK+ALLR+RKEGSI+SRNSSVSCSNADDSDGLSIDLN
Sbjct: 781  KEKQELEKEIEIYRKKLQDYEAKEKIALLRNRKEGSIRSRNSSVSCSNADDSDGLSIDLN 840

Query: 841  TEAKKDEDLFCNQETNNQNTPAEAVLYLEETLANFEEERLSILEELKMLEEKLFTLSDEE 900
             EAKKDED F NQET NQNTPAEAVLYLEETLANFEEERLSILEELKMLEEKLFTLSDEE
Sbjct: 841  AEAKKDEDFFSNQETENQNTPAEAVLYLEETLANFEEERLSILEELKMLEEKLFTLSDEE 900

Query: 901  QQFDDIEHYSEHNGNGYHKNSDSVSETNGFENGHHVKEMNGNHHPGKRTMSTKAKRLLPL 960
            QQF+DI+HY E NGNGYHKNSD  + TNGFENGH+ KEMNG H+P +R MSTKAKRLLPL
Sbjct: 901  QQFEDIDHYCERNGNGYHKNSDYSTGTNGFENGHNAKEMNGKHYPERRAMSTKAKRLLPL 960

Query: 961  FDDAVDTD-VEDVTIGEEQGFDSVSMQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEAD 1015
            FDD VD D VEDVT G+EQGFDS+SMQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEAD
Sbjct: 961  FDDVVDADVVEDVTNGDEQGFDSISMQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEAD 982

BLAST of Cp4.1LG01g18370 vs. NCBI nr
Match: gi|659072835|ref|XP_008467119.1| (PREDICTED: intracellular protein transport protein USO1 isoform X1 [Cucumis melo])

HSP 1 Score: 1154.4 bits (2985), Expect = 0.0e+00
Identity = 683/1068 (63.95%), Positives = 772/1068 (72.28%), Query Frame = 1

Query: 1    MAANRFATLLHRNSNKITLILVYALLEWVLIFLLLLQALFSYLIIKFAELFGLKRPCLWC 60
            MAAN+FAT+LHRNSNKITLILVYALLEWVLIFLLLL  LFSYLI+KFAE FGLKRPCLWC
Sbjct: 1    MAANKFATILHRNSNKITLILVYALLEWVLIFLLLLHGLFSYLIVKFAEWFGLKRPCLWC 60

Query: 61   SRVDHVFEPGKKFSYRDLLCEPHAMEISNLGYCSNHRKLTEVRDLCEDCFSSSNPNEFYQ 120
            SRVDHVFEP +K SYRDLLCE HAMEISNLGYCSNHRKL+E RDLCEDC SSS  NEFYQ
Sbjct: 61   SRVDHVFEPERKHSYRDLLCEGHAMEISNLGYCSNHRKLSEFRDLCEDCSSSSKSNEFYQ 120

Query: 121  IPKNFAFFGDEKEDFRCCSCCGESLKNRLFSPCILIKPNWGDLDYAHKGNLISEADIDVQ 180
            I K+F FF DEKEDF+ CSCCGE+LK+RLFSPCILIKPNWGDLDY  KGN ISE     +
Sbjct: 121  ISKSFPFFDDEKEDFKSCSCCGETLKSRLFSPCILIKPNWGDLDYTQKGNFISE----TE 180

Query: 181  SDEIHASPTEDIIGNREISIVSGGEQAEKNSGCSVCGCCCKVSAVHEEEKEEEDKAKMGG 240
            +DEIH S +ED+ GNR ISIVSGGE+ EKNS CSVCGC CK SAVHE+  +++D+A +  
Sbjct: 181  TDEIHVSQSEDVSGNRGISIVSGGEEGEKNSTCSVCGCGCKDSAVHED--DDDDRADISA 240

Query: 241  EKDGDFLELAEDLSSLNHKTVQLGCVRENESAETAPHHLEFYIDRGNDRRLIPVDLIDFS 300
            EKDGDFLELAEDL+  N KTV++GC +E+E  ET P+HLEFYIDRG+DRRLIPVDLIDFS
Sbjct: 241  EKDGDFLELAEDLTICNQKTVEVGCEKEDELPETVPNHLEFYIDRGDDRRLIPVDLIDFS 300

Query: 301  ASDHNNNESNILSSVKDEEQEQEPEPEPEQEQEQEQEQDQEQQQEDCGNEDVVLDFGSNF 360
            A D +NN SNILS VKDEEQE                      QEDCGNEDVVLDFGSNF
Sbjct: 301  APDDDNNTSNILSQVKDEEQE----------------------QEDCGNEDVVLDFGSNF 360

Query: 361  EKQGQDVTEDWEVISGERLAEFLSVSLHESKQKVAEVEAMKVEEGSTRASGLGSDEDPSM 420
            E Q   V EDWEVISGERLAEFLSVSLHE+KQ+V EVEAM VE            EDP M
Sbjct: 361  ENQRHGVNEDWEVISGERLAEFLSVSLHENKQRVEEVEAMDVE------------EDPLM 420

Query: 421  EVEEQEIEEQEQEIEEQEQQIEEQQQEIEEQEQEIEEAEASIGEAIQAPAIDDAHEEDLA 480
             V ++E +E +  I+                            EA QAPA  DA +E+L 
Sbjct: 421  GVGKEEEKEADASID----------------------------EASQAPA-SDALKEELE 480

Query: 481  ELVL-----DSDLHQ-DIHEWNDEHEVEISIGTDIPDHEPIDEIQTQNDIPSHPNVQEDP 540
            ELV+     DSDLH+ D H WNDE EVEISIGTDIPDHEPIDEIQTQ D+P HP++QE+P
Sbjct: 481  ELVVATRQPDSDLHEVDFHMWNDELEVEISIGTDIPDHEPIDEIQTQIDLPPHPDLQEEP 540

Query: 541  SPTSTLVVDDNMQDYNKAEKSEEAEDTKEEVEFKILSVETSSQPSDNHKPSRSEFNENEE 600
            SP+S+L VD        + + EE  + KEE EFKI SVETSSQPSD HKPS SE NE+EE
Sbjct: 541  SPSSSLDVD--------SMQVEETVEVKEEEEFKIFSVETSSQPSDYHKPSSSEVNEDEE 600

Query: 601  EDKVPDTPTSMDSLHQLHKKLLLLDRKESGAEESLDGSVISETESGDGV----------- 660
            EDKVP T      L  +       D  +  + E      ++E E  D V           
Sbjct: 601  EDKVPGTEVEEFKLLSVETCSHPSDNHKPSSSE------VNENEEEDKVPDTPTSMDSLH 660

Query: 661  ------LTIEKLKSALRTERKVLNALYSE---------LEEERSASAIAANQTMAMINRL 720
                  L +++ +S   TE  +  ++ SE         LE+ +SA         A+   L
Sbjct: 661  QLHKKLLLLDRKESG--TEESLDGSVISETEGGDGVLTLEKLKSALRTERKALNALYAEL 720

Query: 721  QEEKASMQMEALQ----YQRMMEEQSEYDQEALQLLNELVVKR----------------- 780
            +EE+++  + A Q      R+ EE++    EALQ    +  +                  
Sbjct: 721  EEERSASAIAANQTMAMINRLQEEKASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKR 780

Query: 781  EKEKQELEKGIEVYRKKLQDYEAKEKMALLRSRKEGSIQSRNSSVSCSNADDSDGLSIDL 840
            EKEKQELEK IE+YRKKLQDYEAKEK+ALLR+RKEGSI+SRNSSVSCSNADDSDGLSIDL
Sbjct: 781  EKEKQELEKEIEIYRKKLQDYEAKEKIALLRNRKEGSIRSRNSSVSCSNADDSDGLSIDL 840

Query: 841  NTEAKKDEDLFCNQETNNQNTPAEAVLYLEETLANFEEERLSILEELKMLEEKLFTLSDE 900
            N EAKKDED F NQET NQNTPAEAVLYLEETLANFEEERLSILEELKMLEEKLFTLSDE
Sbjct: 841  NAEAKKDEDFFSNQETENQNTPAEAVLYLEETLANFEEERLSILEELKMLEEKLFTLSDE 900

Query: 901  EQQFDDIEHYSEHNGNGYHKNSDSVSETNGFENGHHVKEMNGNHHPGKRTMSTKAKRLLP 960
            EQQF+DI+HY E NGNGYHKNSD  + TNGFENGH+ KEMNG H+P +R MSTKAKRLLP
Sbjct: 901  EQQFEDIDHYCERNGNGYHKNSDYSTGTNGFENGHNAKEMNGKHYPERRAMSTKAKRLLP 960

Query: 961  LFDDAVDTD-VEDVTIGEEQGFDSVSMQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEA 1015
            LFDD VD D VEDVT G+EQGFDS+SMQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEA
Sbjct: 961  LFDDVVDADVVEDVTNGDEQGFDSISMQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEA 983
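
The Identity and Positives figures in an HSP header are tallies over the alignment midline: a letter marks an identical residue, a `+` marks a substitution that scores positively, and a space marks a mismatch or gap. A minimal Python sketch of that tally for a single block follows. The helper name `count_midline` is hypothetical and not part of any BLAST tool, and the code assumes the midline has already been separated from its Query and Sbjct lines.

```python
def count_midline(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one BLAST midline string.

    Letters count as identities; letters and '+' both count as positives.
    Spaces (mismatches or gaps) count towards neither.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives


# Midline of the first block of the alignment above (query residues 1..60).
midline = "MAAN+FAT+LHRNSNKITLILVYALLEWVLIFLLLL  LFSYLI+KFAE FGLKRPCLWC"
identities, positives = count_midline(midline)
print(identities, positives, len(midline))
```

Summing these per-block counts over every block of an HSP should give the numerators reported in the header, provided the midlines are copied with their spacing intact.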

BLAST of Cp4.1LG01g18370 vs. NCBI nr
Match: gi|255573730|ref|XP_002527786.1| (PREDICTED: myosin-binding protein 2 [Ricinus communis])

HSP 1 Score: 795.8 bits (2054), Expect = 8.8e-227
Identity = 529/1066 (49.62%), Positives = 697/1066 (65.38%), Query Frame = 1

Query: 1    MAANRFATLLHRNSNKITLILVYALLEWVLIFLLLLQALFSYLIIKFAELFGLKRPCLWC 60
            MAAN+FAT+LH+N+NK+TLILVYA+LEWVLI LLLL +LFSYLIIKFA+ FGLKRPCLWC
Sbjct: 1    MAANKFATMLHKNTNKLTLILVYAMLEWVLIILLLLNSLFSYLIIKFADYFGLKRPCLWC 60

Query: 61   SRVDHVFEPGK-KFSYRDLLCEPHAMEISNLGYCSNHRKLTEVRDLCEDCFSSSNPNEFY 120
            SR+DH FEP K + SYR L+CE HA+EIS L YCS+HRKLTE +D+CEDC SSS+P    
Sbjct: 61   SRLDHFFEPSKFQNSYRSLICETHALEISKLSYCSSHRKLTESQDMCEDCLSSSSPQS-- 120

Query: 121  QIPKNFAFF--------------GDEK----EDFRCCSCCGESLKNRLFSPC-ILIKPNW 180
            ++ K FAFF              GD+     E    CSCCG SL+ +LF P    IKP+W
Sbjct: 121  ELSKKFAFFPWIKKLGVLQDCCAGDKVCENVEIISNCSCCGVSLETKLFCPDDYAIKPSW 180

Query: 181  GDLDYAHKGNLISEADIDVQSDEIH---------ASPTEDIIGNREISIVSGGEQAEKNS 240
            GD +   KG+L+ E +IDV+                  + I+ N  +  +   E+ E+N 
Sbjct: 181  GDSENTQKGDLVWEEEIDVKDHSDRNMSGFVCDRCGEEQRIVENTGVEDIKTEEKTEENF 240

Query: 241  GCSVCGCCCKVSAVHEEEKEEEDKAK-MGGEKDGDFLELAEDLSSLNHKTVQLGCVRENE 300
             C V    CK   V++ +KE+    K     K+ DF    ++ S      VQ  C+++  
Sbjct: 241  SCFVSSVDCKEMVVNDSDKEDISTEKEQESTKEDDFNVSVDEPSCDQAVMVQADCIKDM- 300

Query: 301  SAETAPHHLEFYIDRGNDRRLIPVDLIDFSASDHNNNESNILSSVKDEEQEQEPEPEPEQ 360
            S +  P HLEFYID+ +D  LIP++L++ S+                             
Sbjct: 301  SKDIQPQHLEFYIDQ-DDCHLIPIELLNSSS----------------------------- 360

Query: 361  EQEQEQEQDQEQQQEDCGNEDVVLDFGS-NFEKQGQDVTEDWEVISGERLAEFLSVSLHE 420
             ++Q  ++ ++ + E+CG+ED VL+F + +   Q + V ED    + E     L +   E
Sbjct: 361  -EKQISDKKEKGEVENCGSEDFVLEFDNKHVGPQYELVVEDR--CNFEEKLPLLPIQECE 420

Query: 421  SKQKVAEVEAMKVEEGSTRASGLGSDEDPSMEVEEQEIEE---------------QEQEI 480
             +  V E+E   + E     +     +   ME E +++                 +  +I
Sbjct: 421  EENMVDELEPRDLNENENENASAVYADYELMEEESEQVSIAQPIGTITSNGDDVLENSQI 480

Query: 481  EEQEQQIEEQQQEIEEQEQEIEEAEASIGEAIQAPAIDDAHEEDLAELVLDSDLHQDIHE 540
             ++  +++  Q   E  + ++ E EA +    + P  +   E    EL   S   + +  
Sbjct: 481  SDEGMELDNNQVSEEVLQMQVNEIEADVSMGTEIPDHEPIQEIQTPEL--HSLCVEVLQM 540

Query: 541  WNDEHEVEISIGTDIPDHEPIDEIQTQNDIPSHPNVQEDPSPTS--TLVVDDNMQDYNKA 600
              DE E  +SIG +IPDHEPI+EIQT++   S   V+EDPS ++     +DD+   YN+A
Sbjct: 541  QVDEIEAYVSIGAEIPDHEPIEEIQTESFPSSCLCVEEDPSTSNGDNHALDDH--GYNQA 600

Query: 601  EKSEEAEDTKEEVEFKILSVETSSQPSDNHKPSRSEFNENEEEDKVPDTPTSMDSLHQLH 660
            E+        +EVEF+ +++ETS     +H     E N+ EE DK PDTPTS+DSLH LH
Sbjct: 601  EE--------DEVEFRAMTIETSEPVIKSHLSLCLESNDIEE-DKTPDTPTSVDSLHHLH 660

Query: 661  KKLLLLDRKESGAEESLDGSVISETESGDGVLTIEKLKSALRTERKVLNALYSELEEERS 720
            KKLLLL+R+ES AEESLDGSVIS+ E+GDGVLT+EKLKSALR+ERK LNALY+ELEEERS
Sbjct: 661  KKLLLLERRESNAEESLDGSVISDIEAGDGVLTVEKLKSALRSERKALNALYAELEEERS 720

Query: 721  ASAIAANQTMAMINRLQEEKASMQMEALQYQRMMEEQSEYDQEALQLLNELVVKREKEKQ 780
            ASA+AANQTMAMINRLQEEKA+MQMEALQYQRMMEEQSEYDQEALQLLNEL++KREKE+ 
Sbjct: 721  ASAVAANQTMAMINRLQEEKAAMQMEALQYQRMMEEQSEYDQEALQLLNELMIKREKERT 780

Query: 781  ELEKGIEVYRKKLQDYEAKEKMALLRSRKEGSIQSRNSSVSCSNADDSDGLSIDLNTEAK 840
            ELEK +E+YRKK+QDYE KEK+ +LR RKE SI+S  SS S SNA+DSDGLS+DLN E K
Sbjct: 781  ELEKELELYRKKVQDYETKEKLMMLRRRKESSIRSGTSSASYSNAEDSDGLSVDLNHEVK 840

Query: 841  KDEDLFCNQETNNQNTPAEAVLYLEETLANFEEERLSILEELKMLEEKLFTLSDE-EQQF 900
            ++     + E++NQNTP +AV+YLEE+L NFEEERLSILE+LK+LEEKLFTLSDE E  F
Sbjct: 841  EEVGFDNHLESSNQNTPVDAVVYLEESLNNFEEERLSILEQLKVLEEKLFTLSDEDEHHF 900

Query: 901  DD---IEHYSEHNGNGYHKNSDSVSETNGFENGHHVKEMNGNHHPGKRTMSTKAKRLLPL 960
            +D   IEH  E NGNGY+++ D  SE NG  NGH+ KEMNG H+  ++ +  KAKRLLPL
Sbjct: 901  EDIKPIEHLYEENGNGYNEDFDHSSEANGVANGHY-KEMNGKHYQERKIIGAKAKRLLPL 960

Query: 961  FDDAVDTDVEDVTI-GEEQGFDSVSMQKSLDNKFDTEFRRVAVEEEVDHVYERLQALEAD 1014
            F DA+D++ ED  + G E+G DS+ + KS+ NKFD + +++A+EEEVDHVYERLQALEAD
Sbjct: 961  F-DAIDSEAEDGMLNGHEEGVDSIVLLKSI-NKFDIDSKKLAIEEEVDHVYERLQALEAD 1014
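
Each HSP header above carries the bit score, raw score, E-value, and the identity and positive counts over the aligned length. The following Python sketch pulls those fields out of the two header lines and recomputes the identity percentage from the counts. The function name `parse_hsp` and the dictionary keys are illustrative assumptions; the patterns are written only against the line layout shown on this page and tolerate the misspelling `Postives` that some exported pages contain.

```python
import re

# Patterns for the two HSP summary lines in the layout used on this page.
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
IDENT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Parse the score line and the identity line of one HSP header."""
    score = SCORE_RE.search(score_line)
    ident = IDENT_RE.search(identity_line)
    if score is None or ident is None:
        raise ValueError("unrecognised HSP summary lines")
    n_ident, length, pct_ident, n_pos, _, pct_pos = ident.groups()
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "expect": float(score.group(3)),
        "identities": int(n_ident),
        "positives": int(n_pos),
        "aligned_length": int(length),
        "pct_identity": float(pct_ident),
        "pct_positives": float(pct_pos),
    }


hsp = parse_hsp(
    "HSP 1 Score: 795.8 bits (2054), Expect = 8.8e-227",
    "Identity = 529/1066 (49.62%), Positives = 697/1066 (65.38%), Query Frame = 1",
)
# Recompute the percentage from the counts and compare with the reported one.
recomputed = round(100 * hsp["identities"] / hsp["aligned_length"], 2)
print(hsp["pct_identity"], recomputed)
```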

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
MYOB2_ARATH   1.5e-72   38.35     Myosin-binding protein 2 OS=Arabidopsis thaliana GN=MYOB2 PE=1 SV=1
MYOB3_ARATH   1.0e-65   41.67     Myosin-binding protein 3 OS=Arabidopsis thaliana GN=MYOB3 PE=1 SV=1
MYOB6_ARATH   8.1e-23   36.65     Probable myosin-binding protein 6 OS=Arabidopsis thaliana GN=MYOB6 PE=2 SV=1
MYOB5_ARATH   2.0e-21   40.26     Probable myosin-binding protein 5 OS=Arabidopsis thaliana GN=MYOB5 PE=2 SV=1
MYOB1_ARATH   4.3e-16   34.81     Myosin-binding protein 1 OS=Arabidopsis thaliana GN=MYOB1 PE=1 SV=1
Match Name        E-value   Identity  Description
A0A0A0KRI5_CUCSA  0.0e+00   71.33     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G568820 PE=4 SV=1
B9SP67_RICCO      6.1e-227  49.62     Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0629030 PE=4 SV=1
A0A061E7U6_THECC  1.0e-226  51.51     Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_010975 PE=4 SV=1
M5Y1T3_PRUPE      3.8e-221  50.38     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000840mg PE=4 SV=1
V7CPF2_PHAVU      1.2e-217  48.49     Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_002G183000g PE=4 SV=1
Match Name   E-value  Identity  Description
AT1G70750.1  8.2e-74  38.35     Protein of unknown function, DUF593
AT5G16720.1  5.7e-67  41.67     Protein of unknown function, DUF593
AT1G74830.1  4.6e-24  36.65     Protein of unknown function, DUF593
AT1G18990.1  1.1e-22  40.26     Protein of unknown function, DUF593
AT1G08800.1  2.4e-17  34.81     Protein of unknown function, DUF593
Match Name                        E-value   Identity  Description
gi|778703967|ref|XP_011655455.1|  0.0e+00   71.51     PREDICTED: myosin-binding protein 3 isoform X1 [Cucumis sativus]
gi|778703971|ref|XP_011655456.1|  0.0e+00   71.33     PREDICTED: myosin-binding protein 2 isoform X2 [Cucumis sativus]
gi|659072837|ref|XP_008467120.1|  0.0e+00   64.01     PREDICTED: protein SGM1 isoform X2 [Cucumis melo]
gi|659072835|ref|XP_008467119.1|  0.0e+00   63.95     PREDICTED: intracellular protein transport protein USO1 isoform X1 [Cucumis melo...
gi|255573730|ref|XP_002527786.1|  8.8e-227  49.62     PREDICTED: myosin-binding protein 2 [Ricinus communis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR007656  GTD-bd
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0016459      myosin complex
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0017022      myosin binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g18370.1  Cp4.1LG01g18370.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description      Source   Source Term    Source Description   Alignment
IPR007656  Zein-binding domain  PFAM     PF04576        Zein-binding         coord: 648..736, score: 8.6
None       No IPR available     unknown  Coil           Coil                 coord: 947..967, score: -; coord: 420..468, score: -; coord: 820..840, score: -; coord: 650..670, score: -; coord: 678..751, scor
None       No IPR available     PANTHER  PTHR31448      FAMILY NOT NAMED     coord: 2..306, score: 0.0; coord: 336..465, score: 0.0; coord: 565..1010, score:
None       No IPR available     PANTHER  PTHR31448:SF3  IFA-BINDING PROTEIN  coord: 2..306, score: 0.0; coord: 336..465, score: 0.0; coord: 565..1010, score:
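
The `coord:` ranges in the InterPro rows above can be compared directly, for example to see which predicted coiled-coil segments fall inside the Zein-binding (PF04576) hit. The sketch below is illustrative only: `parse_coords` and `span_length` are hypothetical helper names, and it assumes the coordinates are 1-based, inclusive residue positions on the protein.

```python
import re

# Matches ranges written as "coord: START..END".
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text: str) -> list[tuple[int, int]]:
    """Extract every (start, end) pair from a string of coord entries."""
    return [(int(a), int(b)) for a, b in COORD_RE.findall(text)]


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based residue range (an assumption here)."""
    return end - start + 1


# Coordinates copied from the Coil and PFAM rows of the table above.
coil = parse_coords(
    "coord: 947..967 coord: 420..468 coord: 820..840 "
    "coord: 650..670 coord: 678..751"
)
pfam_start, pfam_end = parse_coords("coord: 648..736")[0]

# Two inclusive ranges overlap when each starts before the other ends.
overlapping = sorted(
    (s, e) for s, e in coil if s <= pfam_end and pfam_start <= e
)
print(span_length(pfam_start, pfam_end), overlapping)
```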

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG01g18370  Cp4.1LG13g03620  Cucurbita pepo (Zucchini)  cpecpeB200