Cp4.1LG01g16800 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g16800
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Myosin heavy chain, putative
Location: Cp4.1LG01 : 10631556 .. 10637578 (-)
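
The location record packs a sequence identifier, a start, an end, and a strand sign into one string. The following is a minimal sketch of how it could be split apart and how the span length follows from it. The helper name `parse_location` and the regular expression are illustrative assumptions rather than part of any database tooling, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cp4.1LG01 : 10631556 .. 10637578 (-)")

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span_length = end - start + 1
print(seqid, strand, span_length)
```

The `(-)` sign indicates that the feature is annotated on the reverse strand of Cp4.1LG01.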
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
CAAAAGAAGACTGGTTGGTAAATGGTAGCCATCGTCCTTTTCTTCTATGCTCCGTCCTTTGGTCGTCCTTTTTTCTTTCTCCATTGAATTTAGATTCACCGACGGAAATCCTTTTCATCCAAGTTTTAGCTTCATTCTCAAGTAATCGCAATCCAAATCCCTTTTCATTTGCAACTTCGGCTACCCATTGGTCTCCGGCGTCACCCATCCACGCACGCGCGAGTTCCATCCACCTATTGCGCGCCAACTCCGTCCCTCGTCAAACGGCGACGACGAGTTTTCCTCCCTCGGGGGATCTTCTCTGCCGGTGCTTCTGTTGGCGGCGACGCTCCTTCTGCGATGACAACACCCAGATTCGCGACGGCGCGAACTACTTTTTGGCAGCTCGATGACTTTTGGCGGGGACGCACAACCCGAACCCATACGAATACAATTCAATAAGGACCCAATTTCTTTTCGGCAAGATTTAGGCGGTAAACCCTTTTGAATTTGATCGAAGTGTTTTAGAGTAGTTTCGAACCCTTGGAATTCCTCTCTATTTAGGTTTGGACTAGTTTCGGATTGATTGAACATAGTTTATGGTCCACGTTGGCAACGCCTCCTCATAACCATTTTGACTTTAATCAAGAATTTGGAACTGAGTTCGAACCTCGAAATCTTGTCTTTCTTGTTTCAGGAACCGTTAGAATAGGCTAGGAATACTTTACATTTGTATGTAGTTTTTGTTGGTTGAGTCTGTTTTCCCTATGCCTTATATGGTGTATGTCTTGTTTGATTTTTTTGATAATTATGATTTATATAATGCCATTCCTGAACTCTTAGATATTTGGATGTGTTTGAACATGCTGAGGTGTTAGTTATAATATTGTAATGTTGAGCATGCTGTGGTGTTGTGATGTTGAGATTGACTGGAAAAGCATGATGAATGTGTTCATGTGGTCTTTCGGCTAAGATCATGGTCTTGGAAGCATGATATAGGATTGTCCATTCTTGTGAGACTGTGAGTCTTATTAGTGAGAAAAGGGTTAGAGGACTTTAAGTTTTGAGAAAAGGAAGTAAGGTATATTTTTTTGGATGGGAAATAATTTCATTGATGATATAAACTTACAAGATATAGAAAATGGGAAACCCCCCAAGGATAGATCTATAACAGGCTTCCCTAATTAGAAAGGAGGGAAGCAAAGCTGTAGTGATGGAAAAGGAGAGAGCATTTACACCAAGATACAACCATATTGACCATAGCATAAAAAAAATCTACAGAAAGATCATTCCTTATCTTGAAAGATTCTTCTATTTCATTCTTGCCATATAATCCAACAAAAGGTTCTGATAAAGTTCTCCCTAGGAGTCTCCTTTCCCTTTTGAAGGGTGACCTCCTGGAGTGTAACTTGGAAGCAACTTGGGATCATTTGGTAGGATCCTTTGTCAACAAGAGAGGGTAAGGTGATATTCCAGAAGTTCACACCAAAATTGCAAGAGAGAATCAAATGACTTTGTGTTTTTGAACTTTTCTTGTTGATTTCACACCATTGTGGGTAGAGAACCAGGTAAGGCATCCTTCTTTGTAACTTGTCATCTTTAATGATAGCTTTGTGGGAAAGTTCCTGACCTTTCCAAATCAACTCGTAAGGCTTTGGTACTTTGTCTTTGCTACAGGGTAAAGATCACGAATGAGGGAGCTAGTAGTAAAAAACCAATTGGATTTTTGTTCCCATATTCTCTATCAATTTCGGTTGAAAGAACCACTAAAGATAGGCAATGTAAAAGGGAGGCATATTCATCAATTTCAGCATCTCTTTTGGTTTCTTCTCAACTTGAGATTCCAAATATCCTTAGCATGCACCTGTAAGTCATGCACCACCACTACTTTACCAGGGGAGAGGTTATAAATAAGGGGGAATCTTCTATAAATAGGCCCATTTCCAAGCCAAAATTCGGTCCATAAATCAACCTTCAATCCATCCCCAAGCTTGTAGTTCATCTATGTCCTATATCAAAGT
TGTAAAGTTCAAGATAGATTTCCATGGGCCTCTTGCTGATTCTACAAATGTTTGCCTAGTTTTTTTTTTTAGAGTTGTGGTGTCATATATTGCATCAGCTACTTTTCTCCCAAAGGCTGTTCCTCACCTTGGTATTGCCAAATCCATTTTGCCAAGAGAACTATATTTACAAATGGTTGAGGCTTTTAAGTTGGTGAATATTTGTGCATCTTGGGTAAAAATGGATGAGGGATGATACAAGGCTTTGGTACAATGGTCAATGATCGATGTGATTAGTCCATAGAGCAAGTTTTGTGGTGACCGTGGAATACTAATATGTTGCTAAGTCATCATTGCGTCATCAATACAAACAATCATGGAGCAATGTAACAGCTCGAGGTCTTGAATACAAATGGTCAAGGGTCAAACGTTGAGTACTGTAAACTACAGAGAAGCATTGAGGCCGTAGGTACTAATGGTCAGAGGTTGATGCGACTCGAAGGAGTCAATCGTGAAGAGCTTGTGAATGCGAAGTAAAGAGAGTTATGGGAAATAGAGGTGTTTGAAAGAGCTTTGAACAGGAACATTATGCATAATATGAAGTCATATGTTGAGCATGTTTCTTATGTTACTCGCTTTCCATATAGTACGTGATTAGCTTGTATGTAAGATACATGATATCGTAATGTACATTGTATGTTGTTTGGTGGCTCCTCAGGTGGGAGCTATGTACTGATTACACTTGTACGTATCCCTCCATTTCCCATTGGTGATGTAAAAGAGGAGGTGTCCCCAAAAACAACTCGGAGCCTAAACTATGATTTACTTATATGGTTGAATTTAGTTCTTTGAATATGTTGCTTGATCATTCTTTTACTAATTTTTCTTTAAATGAGTTTGTTGTTTTACTTGGGATCAAGTTTGAAAATCTAGTAATGTGCCTTAATGTAGGCTAGAATTTTGGGGTGTTACACAGATGGCGAGTAGTGTTGACGAAGACGTAGATGCTGTGCTCAGTGATGTTGAGGGTGATGAACATCCTACTGTGATTCAGAATCCTTCTACTGAAGAAATCTCTGTTGAGAGGTTTAGAGAGATTCTTGCGGAGCGTGATCGTGAGCGGAAAGCTCGAGAAGCAACAGAGAATTCAAAATCAGAATTGCTAGTGTCGTTTAATCGCTTAAAGGCGCTTGCCCATGAGGCGATTAAGAAGCGCGATGAGTGTGGGAGGCAGCGGGATGAGGCATTAAGGGAGAAGGAAGAAGCCTTGAAATTGAATGAAAAGGTTTCTGCAGAGTTGGCTGTGGCAAATCAGCAGAGAGATGAAGTCTCAAAGCTTAGGGATGAGATTACCAAGGAATTTGATGAGATCCTCAAGGAAAGGGATGCTCTGAGATCAGAAATTGGTAATGCATCCCATATGCTAGTGACTGGGATTGATAAGATATCTGCAAAAGTGAGCAATTTCAAGAATTTTACAGCAGGTGGGTTGCCTAGGTCTCAAAAATATACTGGATTACCTGCAGTTGCCTATGGAGTTATCAAGAGAACAAACGAAATTGTTGAAGAGCTTGTTAGACAGATTGACACCACGGCGAAGGAAAGAAACGAAACCAGGGAACAAATGGAGCTTAGGAATTATGAAATTGCCATTGAGGTTTCTCAGCTTGAAGCTACGATTAGTGGGCTGAGGGATGAGGTTTCGAAGAAAACTTCTGTTATTGAAGACTTGGAAAACACTCTCACCAAAAGGGATAAAAAGATATCTGAAATTGAAGCAGATTTGTGTGGTAAATTAACTAGGGCTGAGGATGAAGCTTCTGAACTGAGGCAGGTTGCGCAGGAGTACGATGATAAGTTAAGGAATTTGGAGTCAAAAGTAGAGTCCCAAAGGCCTTTACTTATGGATCAGTTGGGTTTCATTTCAAAAATTCACGACCAAATTTATGATATTATTAAGATAGTTGATGCCAGTGATATAGACCATTCTGAATTTTCAGAGTCATTGTTCCTCCCTCAGG
AAACAGACATAGAGGAGAATGTCCGGGCGTCGTTGGCTGGAATGGAATCTATATATGCATTAGCAGTACTTGTCATGGATAAGACAAGGAGTTCAACTCAGGAAAAGATTCGTGAAATTAAGAATTTAAATGAAACAGTTGCGCAGTTGCTCAAGGAGAAAGAACATATTGGAAATTTGCTAAGGAGTGCATTATCTAAGAGGATAACATCTGATCCCTCAAAAGCAAATCAATTATTTGAAGTTGCGGAGAATGGTTTAAGGGAGGCTGGGATTGATTTCAAATTCAGCAAGCTTCTTGGAGACGAGATTTTTTCAACATCTAGGGACAATGGTAAAGCAGTAGATGCAGAGGAGGATGAAATATTCACCCTGGTAAGTGCATTTGTCTTTTATTATAAGCTTTTCCTTTAAATGATATCAAGTTTTAAATTTCTTTTATGTTCTCTTGCAAAATGCAGGCTGGTGCTCTGGAGAATATCGTGAAGGCATCTCAGATTGAAATCATTGAGCTACGGCATTCACTGGAGGAATTAAGGTACGAGAACATATATTCAGCAGTCCCTGAGTTGCTGATTTTCTGATTAAAACAAACCCTTGGACTTTGTATTTGTGATTTACTTCAGGTATTGGAGTCTCGCAGAAATGACCATATTTATAGAATTATACTTGTGCATATGTAATCTCTGCTGATTTTGACCTGTAATTTCTTTGCATTNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTATCTTACATGTGTTAAGGGCAGAGTCAGTTGTACTTAAAGAGCGTCTAGAATCCCAATCCAAGGAGCTTAAACTTAGATCGCTTCAAATTATGGAACTTGAAGAAAAGGAGAGAGTTGCCAATGAAAGTGTATGTAATTAATTGTTCATTGATTTAAGTTAATTGTTCATTCGTTTCTGTTCATAATTATTGGGTTTTGGGTTTCTTCGTGATTTAGGTTGAAGGGTTAATGATGGACATTACAGCTGCGGAAGAAGAAATCATGAGATGGAAGGTAGCTGCAGAGCAAGAAGCGGCTGCTGGCAAAGCTGTAGAGCAAGAGTTTCTGGCACAGGTATGCTTGCTTCTAACTTGTCTGGCACAGGGTTCGGAGTATTGTTTATAATCTAAATTTTCATGAACAAAAAAGTTACCCAAGGGACCTTATTTATATCAATTTGCAAGTATTCTTGTTATCTTTCCCATTGCATGGTCTGGACCTGATAACTTGGTTACTAGTTCATACAAAGCTACTTCTTGTGCAGCCTATAATCAAGGGCTTTTATTCTAATTTGTAATTATGATTACTGAAAGTATTAGTCTTTTCCAGCTAAATACTTTATGATAAGAGACTTGAATTTGGCATTGGCAGATCTCAGCTGTTAAAGAGGAAGTTGAAGAAGCAAGGAAGGTGATGTTGGATTCGGATAATAAACTCAAATTCAAAGAACAGACAGTAAACGCTGCCATGGCAGCCAGAGATGCGGCGGAGAAATCATTGAGACTGGCGGACGTGAGGGCGTCCAGGTTGAGGGAGAGGGTGGAAGAGCTTACCCGACAGCTTGAAGAGCTCGATAAACGAGAAGATTCGAGAAGAGGATTAAATGGGAGTAGATACGTTTGTTGGCCATGGCAGTGGCTTGGACTCGACTTTGTTGGCTCTCGCCGTTCTGAAACACAACAACAAGAGAGTTCAAACGAGATGGAACTTTCTGAGCCACTTCTCTGAGATATTTTTTTCATTTTTCTTTCCTTTTATTTGGGTGTGGGTGGATGGGTTGGTGATTAGTTCTTGTATTTATTGTGAAAATGAAATGTTAGAAGATTATAGAAAAATACATCACGTGCCTTGTTTGGTATTGAATGCTCTCTCGGGTGATCGATTGTTTTTCATATAACATTGGTTTGAGCAAATGTGAATATAATATTAAAACGCTATTCTTTCACAACAATGTTCTCTCCCAT
CTGAGCTTTGCACATCATCTGAT

mRNA sequence

CAAAAGAAGACTGGTTGGTAAATGGTAGCCATCGTCCTTTTCTTCTATGCTCCGTCCTTTGGTCGTCCTTTTTTCTTTCTCCATTGAATTTAGATTCACCGACGGAAATCCTTTTCATCCAAGTTTTAGCTTCATTCTCAAGTAATCGCAATCCAAATCCCTTTTCATTTGCAACTTCGGCTACCCATTGGTCTCCGGCGTCACCCATCCACGCACGCGCGAGTTCCATCCACCTATTGCGCGCCAACTCCGTCCCTCGTCAAACGGCGACGACGAGTTTTCCTCCCTCGGGGGATCTTCTCTGCCGGTGCTTCTGTTGGCGGCGACGCTCCTTCTGCGATGACAACACCCAGATTCGCGACGGCGCGAACTACTTTTTGGCAGCTCGATGACTTTTGGCGGGGACGCACAACCCGAACCCATACGAATACAATTCAATAAGGACCCAATTTCTTTTCGGCAAGATTTAGGCGGCTAGAATTTTGGGGTGTTACACAGATGGCGAGTAGTGTTGACGAAGACGTAGATGCTGTGCTCAGTGATGTTGAGGGTGATGAACATCCTACTGTGATTCAGAATCCTTCTACTGAAGAAATCTCTGTTGAGAGGTTTAGAGAGATTCTTGCGGAGCGTGATCGTGAGCGGAAAGCTCGAGAAGCAACAGAGAATTCAAAATCAGAATTGCTAGTGTCGTTTAATCGCTTAAAGGCGCTTGCCCATGAGGCGATTAAGAAGCGCGATGAGTGTGGGAGGCAGCGGGATGAGGCATTAAGGGAGAAGGAAGAAGCCTTGAAATTGAATGAAAAGGTTTCTGCAGAGTTGGCTGTGGCAAATCAGCAGAGAGATGAAGTCTCAAAGCTTAGGGATGAGATTACCAAGGAATTTGATGAGATCCTCAAGGAAAGGGATGCTCTGAGATCAGAAATTGGTAATGCATCCCATATGCTAGTGACTGGGATTGATAAGATATCTGCAAAAGTGAGCAATTTCAAGAATTTTACAGCAGGTGGGTTGCCTAGGTCTCAAAAATATACTGGATTACCTGCAGTTGCCTATGGAGTTATCAAGAGAACAAACGAAATTGTTGAAGAGCTTGTTAGACAGATTGACACCACGGCGAAGGAAAGAAACGAAACCAGGGAACAAATGGAGCTTAGGAATTATGAAATTGCCATTGAGGTTTCTCAGCTTGAAGCTACGATTAGTGGGCTGAGGGATGAGGTTTCGAAGAAAACTTCTGTTATTGAAGACTTGGAAAACACTCTCACCAAAAGGGATAAAAAGATATCTGAAATTGAAGCAGATTTGTGTGGTAAATTAACTAGGGCTGAGGATGAAGCTTCTGAACTGAGGCAGGTTGCGCAGGAGTACGATGATAAGTTAAGGAATTTGGAGTCAAAAGTAGAGTCCCAAAGGCCTTTACTTATGGATCAGTTGGGTTTCATTTCAAAAATTCACGACCAAATTTATGATATTATTAAGATAGTTGATGCCAGTGATATAGACCATTCTGAATTTTCAGAGTCATTGTTCCTCCCTCAGGAAACAGACATAGAGGAGAATGTCCGGGCGTCGTTGGCTGGAATGGAATCTATATATGCATTAGCAGTACTTGTCATGGATAAGACAAGGAGTTCAACTCAGGAAAAGATTCGTGAAATTAAGAATTTAAATGAAACAGTTGCGCAGTTGCTCAAGGAGAAAGAACATATTGGAAATTTGCTAAGGAGTGCATTATCTAAGAGGATAACATCTGATCCCTCAAAAGCAAATCAATTATTTGAAGTTGCGGAGAATGGTTTAAGGGAGGCTGGGATTGATTTCAAATTCAGCAAGCTTCTTGGAGACGAGATTTTTTCAACATCTAGGGACAATGGTAAAGCAGTAGATGCAGAGGAGGATGAAATATTCACCCTGGCTGGTGCTCTGGAGAATATCGTGAAGGCATCTCAGATTGAAATCATTGAGCTACGGCATTCACTGGAGGAATTAAGGGCAGA
GTCAGTTGTACTTAAAGAGCGTCTAGAATCCCAATCCAAGGAGCTTAAACTTAGATCGCTTCAAATTATGGAACTTGAAGAAAAGGAGAGAGTTGCCAATGAAAGTGTTGAAGGGTTAATGATGGACATTACAGCTGCGGAAGAAGAAATCATGAGATGGAAGGTAGCTGCAGAGCAAGAAGCGGCTGCTGGCAAAGCTGTAGAGCAAGAGTTTCTGGCACAGATCTCAGCTGTTAAAGAGGAAGTTGAAGAAGCAAGGAAGGTGATGTTGGATTCGGATAATAAACTCAAATTCAAAGAACAGACAGTAAACGCTGCCATGGCAGCCAGAGATGCGGCGGAGAAATCATTGAGACTGGCGGACGTGAGGGCGTCCAGGTTGAGGGAGAGGGTGGAAGAGCTTACCCGACAGCTTGAAGAGCTCGATAAACGAGAAGATTCGAGAAGAGGATTAAATGGGAGTAGATACGTTTGTTGGCCATGGCAGTGGCTTGGACTCGACTTTGTTGGCTCTCGCCGTTCTGAAACACAACAACAAGAGAGTTCAAACGAGATGGAACTTTCTGAGCCACTTCTCTGAGATATTTTTTTCATTTTTCTTTCCTTTTATTTGGGTGTGGGTGGATGGGTTGGTGATTAGTTCTTGTATTTATTGTGAAAATGAAATGTTAGAAGATTATAGAAAAATACATCACGTGCCTTATCGATTGTTTTTCATATAACATTGGTTTGAGCAAATGTGAATATAATATTAAAACGCTATTCTTTCACAACAATGTTCTCTCCCATCTGAGCTTTGCACATCATCTGAT

Coding sequence (CDS)

ATGGCGAGTAGTGTTGACGAAGACGTAGATGCTGTGCTCAGTGATGTTGAGGGTGATGAACATCCTACTGTGATTCAGAATCCTTCTACTGAAGAAATCTCTGTTGAGAGGTTTAGAGAGATTCTTGCGGAGCGTGATCGTGAGCGGAAAGCTCGAGAAGCAACAGAGAATTCAAAATCAGAATTGCTAGTGTCGTTTAATCGCTTAAAGGCGCTTGCCCATGAGGCGATTAAGAAGCGCGATGAGTGTGGGAGGCAGCGGGATGAGGCATTAAGGGAGAAGGAAGAAGCCTTGAAATTGAATGAAAAGGTTTCTGCAGAGTTGGCTGTGGCAAATCAGCAGAGAGATGAAGTCTCAAAGCTTAGGGATGAGATTACCAAGGAATTTGATGAGATCCTCAAGGAAAGGGATGCTCTGAGATCAGAAATTGGTAATGCATCCCATATGCTAGTGACTGGGATTGATAAGATATCTGCAAAAGTGAGCAATTTCAAGAATTTTACAGCAGGTGGGTTGCCTAGGTCTCAAAAATATACTGGATTACCTGCAGTTGCCTATGGAGTTATCAAGAGAACAAACGAAATTGTTGAAGAGCTTGTTAGACAGATTGACACCACGGCGAAGGAAAGAAACGAAACCAGGGAACAAATGGAGCTTAGGAATTATGAAATTGCCATTGAGGTTTCTCAGCTTGAAGCTACGATTAGTGGGCTGAGGGATGAGGTTTCGAAGAAAACTTCTGTTATTGAAGACTTGGAAAACACTCTCACCAAAAGGGATAAAAAGATATCTGAAATTGAAGCAGATTTGTGTGGTAAATTAACTAGGGCTGAGGATGAAGCTTCTGAACTGAGGCAGGTTGCGCAGGAGTACGATGATAAGTTAAGGAATTTGGAGTCAAAAGTAGAGTCCCAAAGGCCTTTACTTATGGATCAGTTGGGTTTCATTTCAAAAATTCACGACCAAATTTATGATATTATTAAGATAGTTGATGCCAGTGATATAGACCATTCTGAATTTTCAGAGTCATTGTTCCTCCCTCAGGAAACAGACATAGAGGAGAATGTCCGGGCGTCGTTGGCTGGAATGGAATCTATATATGCATTAGCAGTACTTGTCATGGATAAGACAAGGAGTTCAACTCAGGAAAAGATTCGTGAAATTAAGAATTTAAATGAAACAGTTGCGCAGTTGCTCAAGGAGAAAGAACATATTGGAAATTTGCTAAGGAGTGCATTATCTAAGAGGATAACATCTGATCCCTCAAAAGCAAATCAATTATTTGAAGTTGCGGAGAATGGTTTAAGGGAGGCTGGGATTGATTTCAAATTCAGCAAGCTTCTTGGAGACGAGATTTTTTCAACATCTAGGGACAATGGTAAAGCAGTAGATGCAGAGGAGGATGAAATATTCACCCTGGCTGGTGCTCTGGAGAATATCGTGAAGGCATCTCAGATTGAAATCATTGAGCTACGGCATTCACTGGAGGAATTAAGGGCAGAGTCAGTTGTACTTAAAGAGCGTCTAGAATCCCAATCCAAGGAGCTTAAACTTAGATCGCTTCAAATTATGGAACTTGAAGAAAAGGAGAGAGTTGCCAATGAAAGTGTTGAAGGGTTAATGATGGACATTACAGCTGCGGAAGAAGAAATCATGAGATGGAAGGTAGCTGCAGAGCAAGAAGCGGCTGCTGGCAAAGCTGTAGAGCAAGAGTTTCTGGCACAGATCTCAGCTGTTAAAGAGGAAGTTGAAGAAGCAAGGAAGGTGATGTTGGATTCGGATAATAAACTCAAATTCAAAGAACAGACAGTAAACGCTGCCATGGCAGCCAGAGATGCGGCGGAGAAATCATTGAGACTGGCGGACGTGAGGGCGTCCAGGTTGAGGGAGAGGGTGGAAGAGCTTACCCGACAGCTTGAAGAGCTCGATAAACGAGAAGATTCGAGAAGAGGATTAAATGGGAGTAGATACGTTTGTTGGCCATGGCAGTGGCTTGGACT
CGACTTTGTTGGCTCTCGCCGTTCTGAAACACAACAACAAGAGAGTTCAAACGAGATGGAACTTTCTGAGCCACTTCTCTGA

Protein sequence

MASSVDEDVDAVLSDVEGDEHPTVIQNPSTEEISVERFREILAERDRERKAREATENSKSELLVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSAELAVANQQRDEVSKLRDEITKEFDEILKERDALRSEIGNASHMLVTGIDKISAKVSNFKNFTAGGLPRSQKYTGLPAVAYGVIKRTNEIVEELVRQIDTTAKERNETREQMELRNYEIAIEVSQLEATISGLRDEVSKKTSVIEDLENTLTKRDKKISEIEADLCGKLTRAEDEASELRQVAQEYDDKLRNLESKVESQRPLLMDQLGFISKIHDQIYDIIKIVDASDIDHSEFSESLFLPQETDIEENVRASLAGMESIYALAVLVMDKTRSSTQEKIREIKNLNETVAQLLKEKEHIGNLLRSALSKRITSDPSKANQLFEVAENGLREAGIDFKFSKLLGDEIFSTSRDNGKAVDAEEDEIFTLAGALENIVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQIMELEEKERVANESVEGLMMDITAAEEEIMRWKVAAEQEAAAGKAVEQEFLAQISAVKEEVEEARKVMLDSDNKLKFKEQTVNAAMAARDAAEKSLRLADVRASRLRERVEELTRQLEELDKREDSRRGLNGSRYVCWPWQWLGLDFVGSRRSETQQQESSNEMELSEPLL
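
The protein sequence corresponds to a codon-by-codon translation of the coding sequence. The sketch below illustrates this with the standard genetic code. The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first and last few codons of the CDS are used, to keep the example short.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence; unknown codons become 'X', stops '*'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First seven codons and last five codons of the CDS shown above.
print(translate("ATGGCGAGTAGTGTTGACGAA"))
print(translate("GAGCCACTTCTCTGA"))
```

The first call reproduces the N-terminal residues `MASSVDE`, and the second reproduces the C-terminal `EPLL` followed by the stop codon.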
BLAST of Cp4.1LG01g16800 vs. Swiss-Prot
Match: Y3905_ARATH (Uncharacterized protein At3g49055 OS=Arabidopsis thaliana GN=At3g49055 PE=2 SV=1)

HSP 1 Score: 131.7 bits (330), Expect = 3.0e-29
Identity = 138/452 (30.53%), Positives = 237/452 (52.43%), Query Frame = 1

Query: 229 SQLEATISGLRDEVS----KKTSVIEDLENTLTKRD---KKISEIEADLCGKLTRAEDEA 288
           S L++    LR ++     ++T +I        +RD   ++ SE+EA +  ++   E+  
Sbjct: 28  SSLDSNFLSLRSQIFEASYRRTDLIRVNHELFHERDALRRRNSELEAGILEEVMIREEMK 87

Query: 289 SELRQVAQEYDDKLRNLESKVESQRPLLMDQLGFISKIHDQIYDIIKIVDASDIDHSEFS 348
            +L +V++E    +  LE + + +  LL D   ++  + D++  +I+ ++  ++   E  
Sbjct: 88  RDL-EVSKE---TVSELEGEAKEKTKLLSDIADYVRSMEDRLSKLIRCLNEENVPEEERG 147

Query: 349 ESLFLPQETDIEENVRASLAGMESIYALAVLVMDKTRSSTQEKIREIKNLNETVAQLLKE 408
             L    ET  E N ++ L  ++ +    V  ++  + ST++K  E   L+ +V  L +E
Sbjct: 148 RKL----ETK-EYNSKSILELVKEV----VTKLETFQESTKKKKME---LSRSVEFLEEE 207

Query: 409 KEHIGNLLRSALSKRITSDPS-------KANQLFEVAENGLREAGIDFKFSKLLGDEIFS 468
              I  LLR+AL ++ T++         K   L ++A  GL+  G  F     LG+ +  
Sbjct: 208 NRDINVLLRAALFEKQTAEKQLKEMNDQKGLALLQIAGRGLQRIGFGFG----LGESVEE 267

Query: 469 TSRDNGKAVDAEEDEIFTLAGALENIVKASQIEIIELRHSLEELRAESVVLKERLESQSK 528
           +S     A + EE+ +     A+E  +K  + E+ +L+ SLEE R E V L++  E Q++
Sbjct: 268 SSETGNIANEEEENGVVI---AIEKTMKKLRQEVSQLKISLEESRLEEVGLRKVTEEQAQ 327

Query: 529 ELKLRSLQIMELEEKERVANESVEGLMMDITAAEEEIMRWKVAAEQEAAAGKAVEQEFLA 588
           +L   ++ I +L+ +E+   ++VE L+  I  AE E+ RW+ A E E  AG+   +    
Sbjct: 328 KLAENTVYINKLQNQEKFLAQNVEELVKAIREAESEVSRWREACELEVEAGQREVEVRDQ 387

Query: 589 QISAVKEEVEEARKVMLDSDNKLKFKEQTVNAAMAARDAAEKSLRLADVRASRLRERVEE 648
            I+ +K EVE+ R  +  S+ KLK KE+   AAM A +AAEKSLRLA+ R ++L  R+E 
Sbjct: 388 LIAVLKSEVEKLRSALARSEGKLKLKEELAKAAMVAEEAAEKSLRLAERRIAQLLSRIEH 447

Query: 649 LTRQLEELDKREDSRRGLNGSRYV-CWP-WQW 665
           L RQLEE +  E  RRG    RYV CWP W++
Sbjct: 448 LYRQLEEAESTE-RRRG--KFRYVWCWPMWRF 453
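
The percentages reported for this HSP are the match counts divided by the alignment length. A minimal sketch of that arithmetic follows. The function name `percent` is an assumption, and the counts are taken from the HSP summary for Y3905_ARATH.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

aligned_length = 452                        # alignment columns, gaps included
identity = percent(138, aligned_length)     # identical residues
positives = percent(237, aligned_length)    # identical or conservative substitutions
print(identity, positives)
```

Because the denominator is the alignment length, gap columns are counted, and the alignment here covers only query positions 229 to 665.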

BLAST of Cp4.1LG01g16800 vs. TrEMBL
Match: A0A0A0KIP2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G011700 PE=4 SV=1)

HSP 1 Score: 1113.6 bits (2879), Expect = 0.0e+00
Identity = 616/695 (88.63%), Positives = 654/695 (94.10%), Query Frame = 1

Query: 1   MASSVDEDVDAVLSDVEGDEHPTVIQNPSTEEISVERFREILAERDRERKAREATENSKS 60
           MAS +DED D VLSDVEGDEHP  IQNPS EEI+VERFREILAERDRER++REA ENSKS
Sbjct: 1   MASGLDEDADVVLSDVEGDEHPITIQNPSPEEITVERFREILAERDRERQSREAAENSKS 60

Query: 61  ELLVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSAELAVANQQRDEVSK 120
           EL VSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSAELA AN+QRDE  K
Sbjct: 61  ELQVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSAELAEANRQRDEALK 120

Query: 121 LRDEITKEFDEILKERDALRSEIGNASHMLVTGIDKISAKVSNFKNFTAGGLPRSQKYTG 180
           LRDEITKEFDEILK+RD LRSEIGNASHMLVTGIDKISAKVS+FKNFTAGGLPRSQKYTG
Sbjct: 121 LRDEITKEFDEILKDRDTLRSEIGNASHMLVTGIDKISAKVSSFKNFTAGGLPRSQKYTG 180

Query: 181 LPAVAYGVIKRTNEIVEELVRQIDTTAKERNETREQMELRNYEIAIEVSQLEATISGLRD 240
           LPAVAYGVIKRTNEI+EELVRQIDTT K RNETREQMELRNYEIAIEVSQLEATISGL+D
Sbjct: 181 LPAVAYGVIKRTNEIIEELVRQIDTTTKSRNETREQMELRNYEIAIEVSQLEATISGLKD 240

Query: 241 EVSKKTSVIEDLENTLTKRDKKISEIEADLCGKLTRAEDEASELRQVAQEYDDKLRNLES 300
           EVSKKTSVIEDLENT+ ++DKKI E E DL GKL RAEDEAS+LRQ+ QEYDDKLR+LES
Sbjct: 241 EVSKKTSVIEDLENTIIEKDKKICENEVDLVGKLRRAEDEASDLRQLVQEYDDKLRDLES 300

Query: 301 KVESQRPLLMDQLGFISKIHDQIYDIIKIVDASDIDHSEFSESLFLPQETDIEENVRASL 360
           K+ESQRPLL+DQLG ISKIHDQIYDIIKIVD SD+DHSEFSESLFLP+ETD+EENVRASL
Sbjct: 301 KMESQRPLLVDQLGLISKIHDQIYDIIKIVDVSDVDHSEFSESLFLPRETDMEENVRASL 360

Query: 361 AGMESIYALAVLVMDKTRSSTQEKIREIKNLNETVAQLLKEKEHIGNLLRSALSKRITSD 420
           AGMESIYALA LVMDKTR+  +EKIRE KNLNETVAQLLKEKEHIG LLR+ALSKR+TSD
Sbjct: 361 AGMESIYALAKLVMDKTRNLIEEKIRESKNLNETVAQLLKEKEHIGYLLRTALSKRMTSD 420

Query: 421 P-SKANQLFEVAENGLREAGIDFKFSKLLGDEIFSTSRDNGKAVDAEEDEIFTLAGALEN 480
           P SKANQLFEVAENGLREAGIDFKFSKLLG+E FST+RDN KA+DA EDEIFTLAGALEN
Sbjct: 421 PSSKANQLFEVAENGLREAGIDFKFSKLLGEEKFSTTRDNRKALDA-EDEIFTLAGALEN 480

Query: 481 IVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQIMELEEKERVANESVEG 540
           IVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQI ELEEKERVANESVEG
Sbjct: 481 IVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQIQELEEKERVANESVEG 540

Query: 541 LMMDITAAEEEIMRWKVAAEQEAAAGKAVEQEFLAQISAVKEEVEEARKVMLDSDNKLKF 600
           LMMD+TAAEEEI+RWKVAAEQEAAAGKAVEQEFLAQIS VK+E+EEAR+V+LDSD KLKF
Sbjct: 541 LMMDVTAAEEEIIRWKVAAEQEAAAGKAVEQEFLAQISGVKQELEEARQVILDSDKKLKF 600

Query: 601 KEQTVNAAMAARDAAEKSLRLADVRASRLRERVEELTRQLEELDKREDSRRG-LNGSRYV 660
           KE+TVNAAMAARDAAEKSLRLADVRASRLRERVE+LTRQLE+LD RE+SR G  NG RYV
Sbjct: 601 KEETVNAAMAARDAAEKSLRLADVRASRLRERVEDLTRQLEQLDNREESRIGSSNGHRYV 660

Query: 661 CWPWQWLGLDFVGSRRSETQQQESSNEMELSEPLL 694
           CWPWQWLGLDFVGSR SETQQQESSNEMELSEPL+
Sbjct: 661 CWPWQWLGLDFVGSRHSETQQQESSNEMELSEPLI 694

BLAST of Cp4.1LG01g16800 vs. TrEMBL
Match: A0A061E8N7_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_010623 PE=4 SV=1)

HSP 1 Score: 887.1 bits (2291), Expect = 1.4e-254
Identity = 490/694 (70.61%), Positives = 590/694 (85.01%), Query Frame = 1

Query: 1   MASSVDEDVDAVLSDVEGDEH-PTVIQNPSTEEISVERFREILAERDRERKAREATENSK 60
           M+++ DE+ DAVLSDVE DE  P VI+ PS +++SVE+FREILAE +RE++AREATENSK
Sbjct: 1   MSTAADEEADAVLSDVESDEPIPIVIKEPSRDDVSVEKFREILAELEREKQAREATENSK 60

Query: 61  SELLVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSAELAVANQQRDEVS 120
           SEL VSFNRLKALAHEAI+KRDEC RQRDEALREKEEAL+ NE V A+LA AN+ +D+V+
Sbjct: 61  SELQVSFNRLKALAHEAIRKRDECARQRDEALREKEEALRSNENVLAQLAEANKIKDDVT 120

Query: 121 KLRDEITKEFDEILKERDALRSEIGNASHMLVTGIDKISAKVSNFKNFTAGGLPRSQKYT 180
           K R+++ K+ +E  K +D LRSEI  ++HMLV+GI+KIS KVSNFKNF AGGLPRSQKYT
Sbjct: 121 KQREDLAKQLEEATKGKDGLRSEIETSAHMLVSGIEKISGKVSNFKNFAAGGLPRSQKYT 180

Query: 181 GLPAVAYGVIKRTNEIVEELVRQIDTTAKERNETREQMELRNYEIAIEVSQLEATISGLR 240
           GLP+VAYGVIKRTNEIVEELV+Q++TTAK RNE REQME RNYEIAIEVSQLEATISGLR
Sbjct: 181 GLPSVAYGVIKRTNEIVEELVKQMETTAKSRNEAREQMEQRNYEIAIEVSQLEATISGLR 240

Query: 241 DEVSKKTSVIEDLENTLTKRDKKISEIEADLCGKLTRAEDEASELRQVAQEYDDKLRNLE 300
           +EV+KK+++ E+LE  + ++D K  EIE ++  K+  AE+E+ ELR +A EYDDKL++LE
Sbjct: 241 EEVAKKSNLTENLEKNIAEKDGKFVEIEKEMSEKINWAENESMELRNLASEYDDKLKSLE 300

Query: 301 SKVESQRPLLMDQLGFISKIHDQIYDIIKIVDASDIDHSEFSESLFLPQETDIEENVRAS 360
           SK+E QRPLL+DQL F+SKIH+ IYD IKIVDA ++D S+ SES FLPQETD+EEN+RA 
Sbjct: 301 SKMELQRPLLVDQLNFVSKIHESIYDAIKIVDADNMDQSDVSESFFLPQETDLEENIRAC 360

Query: 361 LAGMESIYALAVLVMDKTRSSTQEKIREIKNLNETVAQLLKEKEHIGNLLRSALSKRITS 420
           LAGMESIY L  +++ KT+   +EK  E+K+LNETV +L+KEKEHIG+LLRSALSKR+TS
Sbjct: 361 LAGMESIYELTRILVGKTKDLVEEKNHEVKSLNETVGRLIKEKEHIGSLLRSALSKRMTS 420

Query: 421 D-PSKANQLFEVAENGLREAGIDFKFSKLLGDEIFSTSRDNGKAVDAEEDEIFTLAGALE 480
           +  SK N+LF+ AENGLREAGIDFKFSKL+GD       +  +A D E+DEI+TLAGALE
Sbjct: 421 ENKSKTNELFQTAENGLREAGIDFKFSKLIGD------GNKAEAQDTEQDEIYTLAGALE 480

Query: 481 NIVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQIMELEEKERVANESVE 540
           NIVK SQ+EIIEL+HS+EELRAES VLKE +E+Q+KE+  R  +I ELEEKERVANESVE
Sbjct: 481 NIVKTSQLEIIELQHSVEELRAESSVLKEHVEAQAKEINQRMRRIEELEEKERVANESVE 540

Query: 541 GLMMDITAAEEEIMRWKVAAEQEAAAGKAVEQEFLAQISAVKEEVEEARKVMLDSDNKLK 600
           GLMMDI AAEEEI RWK AAEQEAAAG+AVEQEFL Q+SAVK+E+EEA++ ML+S+ KLK
Sbjct: 541 GLMMDIAAAEEEISRWKSAAEQEAAAGRAVEQEFLTQLSAVKQELEEAKQAMLESEKKLK 600

Query: 601 FKEQTVNAAMAARDAAEKSLRLADVRASRLRERVEELTRQLEELDKREDSRRGLNGSRYV 660
           FKE+T  AAM ARDAAEKSLRLAD+RASRLR+RVEEL+RQLEE + REDS RG NGSRYV
Sbjct: 601 FKEETAAAAMGARDAAEKSLRLADMRASRLRDRVEELSRQLEEFETREDS-RGRNGSRYV 660

Query: 661 CWPWQWLGLDFVGSRRSETQQQESSNEMELSEPL 693
           CWPWQWLGLDFVG R+ E QQQ SSNEMELSEPL
Sbjct: 661 CWPWQWLGLDFVGFRKPEMQQQ-SSNEMELSEPL 686

BLAST of Cp4.1LG01g16800 vs. TrEMBL
Match: A0A0B0MQ58_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_23910 PE=4 SV=1)

HSP 1 Score: 874.0 bits (2257), Expect = 1.2e-250
Identity = 489/696 (70.26%), Positives = 585/696 (84.05%), Query Frame = 1

Query: 1   MASSVDEDVDAVLSDVEGDEH-PTVIQNPSTEEISVERFREILAERDRERKAREATENSK 60
           M S+ +E+VDAVLSDVE DE  P VI++PS E++SVE+FREILAE DRE++AREA ENSK
Sbjct: 1   MTSAGNEEVDAVLSDVESDEPVPIVIKDPSREDVSVEKFREILAELDREKQAREAAENSK 60

Query: 61  SELLVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSAELAVANQQRDEVS 120
           SEL VSFNRLKALAHEAIKKRDECGRQRDEALREKEEAL+ N+ ++A+LA AN+ +DEV+
Sbjct: 61  SELQVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALRSNDNLTAQLAEANKIKDEVT 120

Query: 121 KLRDEITKEFDEILKERDALRSEIGNASHMLVTGIDKISAKVSNFKNFTAGGLPRSQKYT 180
           K R+++ K+ +E  K +D LRSEI  ++HMLV+GI+KIS KV+NFKNF+AGGLPRSQKYT
Sbjct: 121 KQREDLAKQLEEASKGKDGLRSEIETSAHMLVSGIEKISGKVNNFKNFSAGGLPRSQKYT 180

Query: 181 GLPAVAYGVIKRTNEIVEELVRQIDTTAKERNETREQMELRNYEIAIEVSQLEATISGLR 240
           GLP+VAYGVIKRTNEIVEELV+QI+TT K RNE REQ+E RNYEIAIEVSQLEATISGLR
Sbjct: 181 GLPSVAYGVIKRTNEIVEELVKQIETTTKSRNEAREQIEQRNYEIAIEVSQLEATISGLR 240

Query: 241 DEVSKKTSVIEDLENTLTKRDKKISEIEADLCGKLTRAEDEASELRQVAQEYDDKLRNLE 300
           DEV+ KT++IE+LE  + ++D KI EIE ++  K+  A+DE  ELR +  EYDDKL+  +
Sbjct: 241 DEVANKTNIIENLEKNIAEKDGKIGEIEKEMSEKINLAQDELMELRNLTSEYDDKLKIWQ 300

Query: 301 SKVESQRPLLMDQLGFISKIHDQIYDIIKIVDASDIDHSEFSESLFLPQETDIEENVRAS 360
            ++E QRPLL+DQL F+S+IH+ IYD+IKIVDA ++D S+ SES FLPQETD  EN+RA 
Sbjct: 301 MRMELQRPLLVDQLNFVSRIHEIIYDVIKIVDADNMDQSDVSESFFLPQETDSVENIRAC 360

Query: 361 LAGMESIYALAVLVMDKTRSSTQEKIREIKNLNETVAQLLKEKEHIGNLLRSALSKRITS 420
           LAGMESIY L  ++  KT+    EK RE+K+LNETVA+L+KEKEHIG+LLRSALS+R+ S
Sbjct: 361 LAGMESIYELTGILAGKTKDLVDEKNREVKSLNETVARLIKEKEHIGSLLRSALSRRMAS 420

Query: 421 D-PSKANQLFEVAENGLREAGIDFKFSKLLGDEIFSTSRDNGKAVD--AEEDEIFTLAGA 480
           +  SK N+LF+ AENGLREAGIDFKFS L+G        D  KA D  +++DEI+TLAGA
Sbjct: 421 ENKSKTNELFQTAENGLREAGIDFKFSNLIG--------DGNKAEDPGSDQDEIYTLAGA 480

Query: 481 LENIVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQIMELEEKERVANES 540
           LENIVK SQ+EIIEL+HS+EELRAES VLKE +E+Q+KEL  R  +I ELEEKERVANES
Sbjct: 481 LENIVKTSQLEIIELQHSVEELRAESSVLKEHVEAQAKELNQRMHRIEELEEKERVANES 540

Query: 541 VEGLMMDITAAEEEIMRWKVAAEQEAAAGKAVEQEFLAQISAVKEEVEEARKVMLDSDNK 600
           VEGLMMDI AAEEEI RWK AAEQEAAAG+AVEQEFLAQ+SAVK E+EEA++ ML+S+ K
Sbjct: 541 VEGLMMDIAAAEEEITRWKSAAEQEAAAGRAVEQEFLAQLSAVKLELEEAKQAMLESEKK 600

Query: 601 LKFKEQTVNAAMAARDAAEKSLRLADVRASRLRERVEELTRQLEELDKREDSRRGLNGSR 660
           LKFKE+T  AAMAARDAAEKSL+LAD+RASRLRERVEELTRQLEE + REDS RG NG R
Sbjct: 601 LKFKEETAAAAMAARDAAEKSLKLADMRASRLRERVEELTRQLEEFETREDS-RGRNGPR 660

Query: 661 YVCWPWQWLGLDFVGSRRSETQQQESSNEMELSEPL 693
           YVCWPWQWLGLDFVG  + ETQQQ SSNEMELSEPL
Sbjct: 661 YVCWPWQWLGLDFVGFHKPETQQQ-SSNEMELSEPL 686

BLAST of Cp4.1LG01g16800 vs. TrEMBL
Match: A0A0D2SIY3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G175900 PE=4 SV=1)

HSP 1 Score: 874.0 bits (2257), Expect = 1.2e-250
Identity = 489/696 (70.26%), Positives = 586/696 (84.20%), Query Frame = 1

Query: 1   MASSVDEDVDAVLSDVEGDEH-PTVIQNPSTEEISVERFREILAERDRERKAREATENSK 60
           M S+ DE+VDAVLSDVE DE  P VI++PS E++SVE+FREILAE DRE++AREA ENSK
Sbjct: 1   MTSAGDEEVDAVLSDVESDEPVPIVIKDPSREDVSVEKFREILAELDREKQAREAAENSK 60

Query: 61  SELLVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSAELAVANQQRDEVS 120
           SEL VSFNRLKALAHEAIKKRDECGRQRDEALREKEEAL+ N+ ++A+L  AN+ +DEV+
Sbjct: 61  SELQVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALRSNDNLTAQLTEANKIKDEVT 120

Query: 121 KLRDEITKEFDEILKERDALRSEIGNASHMLVTGIDKISAKVSNFKNFTAGGLPRSQKYT 180
           K R+++ K+ +E  K +D LRSEI  ++HMLV+GI+KIS KV+NFKNF+AGGLPRSQKYT
Sbjct: 121 KQREDLAKQLEEASKGKDGLRSEIETSAHMLVSGIEKISGKVNNFKNFSAGGLPRSQKYT 180

Query: 181 GLPAVAYGVIKRTNEIVEELVRQIDTTAKERNETREQMELRNYEIAIEVSQLEATISGLR 240
           GLP+VAYGVIKRTNEIVEELV+QI+TT K RNE REQ+E RNYEIAIEVSQLEATISGLR
Sbjct: 181 GLPSVAYGVIKRTNEIVEELVKQIETTTKSRNEAREQIEQRNYEIAIEVSQLEATISGLR 240

Query: 241 DEVSKKTSVIEDLENTLTKRDKKISEIEADLCGKLTRAEDEASELRQVAQEYDDKLRNLE 300
           DEV+KKT++IE+LE  + ++D KI EIE ++  K+  AEDE  ELR ++ EYDDKL+  +
Sbjct: 241 DEVAKKTNIIENLEKNIAEKDGKIGEIEKEMSEKINLAEDELMELRNLSSEYDDKLKIWQ 300

Query: 301 SKVESQRPLLMDQLGFISKIHDQIYDIIKIVDASDIDHSEFSESLFLPQETDIEENVRAS 360
            ++E QRPLL+DQL F+S+IH+ IYD+IKIVDA ++D S+ SES FLPQETD EEN+RA 
Sbjct: 301 MRMELQRPLLVDQLNFVSRIHEIIYDVIKIVDADNMDQSDVSESFFLPQETDSEENIRAC 360

Query: 361 LAGMESIYALAVLVMDKTRSSTQEKIREIKNLNETVAQLLKEKEHIGNLLRSALSKRITS 420
           LAGMESIY L  ++  KT+   +EK RE+K+LNETVA+L+KEKEHIG+LLRSALS+R+ S
Sbjct: 361 LAGMESIYELTGILAVKTKDLVEEKNREVKSLNETVARLIKEKEHIGSLLRSALSRRMVS 420

Query: 421 D-PSKANQLFEVAENGLREAGIDFKFSKLLGDEIFSTSRDNGKAVD--AEEDEIFTLAGA 480
           +  SK N+LF+ AENGLREAGIDFKF  L+G        D  KA D  +++DEI+TLAGA
Sbjct: 421 ENKSKTNELFQTAENGLREAGIDFKFRNLIG--------DGNKAEDPGSDQDEIYTLAGA 480

Query: 481 LENIVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQIMELEEKERVANES 540
           LENIVK SQ+EIIEL+HS+EELRAES VLKE +E+Q+KEL  R  +I ELEEKERVANES
Sbjct: 481 LENIVKTSQLEIIELQHSVEELRAESSVLKEHVEAQAKELNQRMHRIEELEEKERVANES 540

Query: 541 VEGLMMDITAAEEEIMRWKVAAEQEAAAGKAVEQEFLAQISAVKEEVEEARKVMLDSDNK 600
           VEGLMMDI AAEEEI RWK AAEQEAAAG+AVE+EFLAQ+SAVK E+EEA++ ML+S+ K
Sbjct: 541 VEGLMMDIAAAEEEITRWKSAAEQEAAAGRAVEREFLAQLSAVKLELEEAKQAMLESEKK 600

Query: 601 LKFKEQTVNAAMAARDAAEKSLRLADVRASRLRERVEELTRQLEELDKREDSRRGLNGSR 660
           LKFKE+T  AAMAARDAAEKSL+LAD+RASRLRERVEELT QLEE + REDS RG NG R
Sbjct: 601 LKFKEETAAAAMAARDAAEKSLKLADMRASRLRERVEELTCQLEEFETREDS-RGRNGPR 660

Query: 661 YVCWPWQWLGLDFVGSRRSETQQQESSNEMELSEPL 693
           YVCWPWQWLGLDFVG  + ETQQQ SSNEMELSEPL
Sbjct: 661 YVCWPWQWLGLDFVGFHKPETQQQ-SSNEMELSEPL 686

BLAST of Cp4.1LG01g16800 vs. TrEMBL
Match: A0A067JV23_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17831 PE=4 SV=1)

HSP 1 Score: 870.9 bits (2249), Expect = 1.0e-249
Identity = 482/690 (69.86%), Positives = 582/690 (84.35%), Query Frame = 1

Query: 6   DEDVDAVLSDVEGDEH-PTVIQNPSTEEISVERFREILAERDRERKAREATENSKSELLV 65
           DEDV AVLSDVEGD+  P V+++P  E++SVE++RE+LAE DRER AREA E SKSEL V
Sbjct: 7   DEDV-AVLSDVEGDDPVPIVVRSPRLEDVSVEKYRELLAELDRERAAREAAETSKSELQV 66

Query: 66  SFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSAELAVANQQRDEVSKLRDE 125
           SFNRLKALAHEAI+KRDEC RQRDE+++EKEEALK  E++S EL   N+ ++E  K +DE
Sbjct: 67  SFNRLKALAHEAIRKRDECARQRDESVKEKEEALKEKERISVELIEVNKLKEEAVKQKDE 126

Query: 126 ITKEFDEILKERDALRSEIGNASHMLVTGIDKISAKVSNFKNFTAGGLPRSQKYTGLPAV 185
           I K+F+E +K RD L+SEI N+ HMLV+GI+KIS KVSN KNF A GLPRSQKYTGLPAV
Sbjct: 127 IGKQFEEAVKARDGLQSEIENSRHMLVSGIEKISGKVSNVKNFAAAGLPRSQKYTGLPAV 186

Query: 186 AYGVIKRTNEIVEELVRQIDTTAKERNETREQMELRNYEIAIEVSQLEATISGLRDEVSK 245
           AYGVIKRTNEIVEELVRQID TAK RNE REQM++RNYEIAIEVSQLEATISGLRDEV+K
Sbjct: 187 AYGVIKRTNEIVEELVRQIDATAKSRNEAREQMDMRNYEIAIEVSQLEATISGLRDEVAK 246

Query: 246 KTSVIEDLENTLTKRDKKISEIEADLCGKLTRAEDEASELRQVAQEYDDKLRNLESKVES 305
           KTS+IE+LE  + ++++K+SEIE ++  K    E+EA ELR++  EYDDKLRNLESK+E 
Sbjct: 247 KTSLIENLEKNVVEKEEKVSEIEREMFEKTHSVENEAFELRELVVEYDDKLRNLESKLEL 306

Query: 306 QRPLLMDQLGFISKIHDQIYDIIKIVDASDIDHSEFSESLFLPQETDIEENVRASLAGME 365
           QRPLL+DQL  ++KIHD++YD+IK+VD + +D S+ SESLFLPQ+TD+EEN+RASLAGME
Sbjct: 307 QRPLLIDQLNLVAKIHDRLYDVIKLVDTNHLD-SDLSESLFLPQQTDMEENIRASLAGME 366

Query: 366 SIYALAVLVMDKTRSSTQEKIREIKNLNETVAQLLKEKEHIGNLLRSALSKRITSD-PSK 425
           SIY L  +V++KT+   ++K  E+K LNETV +L+KEKE IG+LLRSALSKR+  D  SK
Sbjct: 367 SIYELTRIVVEKTKDLLEKKSHEVKGLNETVGRLVKEKEQIGSLLRSALSKRMRLDQSSK 426

Query: 426 ANQLFEVAENGLREAGIDFKFSKLLGDEIFSTSRDNGKAVDAEEDEIFTLAGALENIVKA 485
            N+LF+ AENGLREAGID KFSK+LGD     S+D G+ +D EEDEI+ LAGALENIVKA
Sbjct: 427 TNELFQAAENGLREAGIDIKFSKILGDNKVPASQDKGRPLDMEEDEIYNLAGALENIVKA 486

Query: 486 SQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQIMELEEKERVANESVEGLMMD 545
           SQ+EIIEL+HS++ELRAE+ +LKE +E+Q+KEL  R  +I ELEEKERVANESVEGLMMD
Sbjct: 487 SQLEIIELQHSVDELRAEASLLKEHIEAQAKELDQRMRRIEELEEKERVANESVEGLMMD 546

Query: 546 ITAAEEEIMRWKVAAEQEAAAGKAVEQEFLAQISAVKEEVEEARKVMLDSDNKLKFKEQT 605
           I AAEEEI RWKVAAEQEAAAG+++EQEF+AQ+SA+K+E+EEAR  M +S+ KLKFKE+T
Sbjct: 547 IAAAEEEITRWKVAAEQEAAAGRSIEQEFVAQLSALKQELEEARHAMFESEKKLKFKEET 606

Query: 606 VNAAMAARDAAEKSLRLADVRASRLRERVEELTRQLEELDKREDSRRGLNGSRYVCWPWQ 665
             AAMAAR+AAEKSLRLAD+RASRLR+RVEEL+ QLEE + REDS  G NG RYVCWPWQ
Sbjct: 607 AAAAMAAREAAEKSLRLADMRASRLRDRVEELSHQLEEFETREDS-GGRNGPRYVCWPWQ 666

Query: 666 WLGLDFVGSRRSETQQQESSNEMELSEPLL 694
           WLGLDFVG RR ET QQ SSNEMELSEPLL
Sbjct: 667 WLGLDFVGVRRPET-QQPSSNEMELSEPLL 692

BLAST of Cp4.1LG01g16800 vs. TAIR10
Match: AT1G24560.1 (AT1G24560.1 unknown protein)

HSP 1 Score: 730.7 bits (1885), Expect = 8.3e-211
Identity = 418/696 (60.06%), Positives = 537/696 (77.16%), Query Frame = 1

Query: 1   MASSVDEDVDAVLSDVEGDEH-PTVIQNPSTEEISVERFREILAERDRERKAREATENSK 60
           MA+  DED  AVLSDVE DE  P V+++   EE S ER  E++AE DRE+KAREA E+SK
Sbjct: 1   MANGADED--AVLSDVESDEPAPVVLKDSPREEASDERITELIAELDREKKAREAAESSK 60

Query: 61  SELLVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSAELAVANQQRDEVS 120
           SEL VSFNRLKALA EAIKKRDE  R+RDEAL+EKE                  + + V+
Sbjct: 61  SELQVSFNRLKALAVEAIKKRDESKRERDEALKEKENL--------------TNELENVN 120

Query: 121 KLRDEITKEFDEILKERDALRSEIGNASHMLVTGIDKISAKVSNFKNFTAGGLPRSQKYT 180
           K +DE++K+ DE L+ RD L++EI N+SHMLV+GI+KIS KVS+FKNF+ GGLP+SQKYT
Sbjct: 121 KGKDEMSKKLDEALRSRDGLKAEIENSSHMLVSGIEKISGKVSSFKNFSNGGLPKSQKYT 180

Query: 181 GLPAVAYGVIKRTNEIVEELVRQIDTTAKERNETREQMELRNYEIAIEVSQLEATISGLR 240
           GL +VAYGVIKRTNEIVEELVRQIDTTAK RNE REQM+ RNYEIAIEVSQLE+ IS LR
Sbjct: 181 GLTSVAYGVIKRTNEIVEELVRQIDTTAKSRNEAREQMDQRNYEIAIEVSQLESAISNLR 240

Query: 241 DEVSKKTSVIEDLENTLTKRDKKISEIEADLCGKLTRAEDEASELRQVAQEYDDKLRNLE 300
            EV++K S+++DLE  +++++K+I+E+E     K++  E E  EL+Q+  EYD KL+ +E
Sbjct: 241 LEVAEKASIVDDLERGVSEKEKRIAELEKGNLEKVSLLEGEVVELKQLVDEYDGKLKTME 300

Query: 301 SKVESQRPLLMDQLGFISKIHDQIYDIIKIVDASDIDHSEFSESLFLPQETDIEENVRAS 360
            K+ +QRPLLMDQL  +S+IHDQ+Y++++IVD +  + S+ SES F+PQET++EEN+RAS
Sbjct: 301 LKMVAQRPLLMDQLNLVSRIHDQLYEVVRIVDGNSSEQSDLSESFFMPQETEMEENIRAS 360

Query: 361 LAGMESIYALAVLVMDKTRSSTQEKIREIKNLNETVAQLLKEKEHIGNLLRSALSKRITS 420
           LAGMESI+ L  +V  K +S  +EK  E+KNLNETV  L+KEKEHIG LLRSALSKR+  
Sbjct: 361 LAGMESIFELTKVVSGKAQSLVEEKSHELKNLNETVGLLVKEKEHIGTLLRSALSKRVIG 420

Query: 421 D-PSKANQLFEVAENGLREAGIDFKFSKLLGDEIFSTSR-DNGKAVDAEEDEIFTLAGAL 480
           + PS+  +LF+ AENGLR+ G D KF+KLL D     SR DN      E++EI++LA  L
Sbjct: 421 EQPSQKRELFQAAENGLRDGGTDSKFAKLLKDGKVQDSRSDNTHDHSKEDNEIYSLASTL 480

Query: 481 ENIVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQIMELEEKERVANESV 540
           ENIVKASQ+EI+EL+H LE  R E+  L+++L++Q+KEL  R  QI EL+EKER+ANE+V
Sbjct: 481 ENIVKASQLEIVELQHLLEASREETSSLRKQLDTQTKELNQRMRQIEELKEKERIANENV 540

Query: 541 EGLMMDITAAEEEIMRWKVAAEQEAAAGKAVEQEFLAQISAVKEEVEEARKVMLDSDNKL 600
           EGLM DI AAEEEI RWKVAAEQEAAAG AVEQ+F +Q+  +KEE+EEA++ +++S+ KL
Sbjct: 541 EGLMTDIAAAEEEITRWKVAAEQEAAAGGAVEQDFTSQLYVLKEELEEAKQAIIESEKKL 600

Query: 601 KFKEQTVNAAMAARDAAEKSLRLADVRASRLRERVEELTRQLEELDKREDSRRGLNGSRY 660
           KFKE+T  AAM ARDAAE+SLRLAD RA++LRER++EL R++EEL+   D     N +RY
Sbjct: 601 KFKEETAAAAMGARDAAERSLRLADNRATKLRERIQELNRKVEELETHRDMNTS-NRARY 660

Query: 661 VCWPWQWLGLDFVGSRRSETQQQESSNEMELSEPLL 694
            CWPWQ LG+DFVGSRR E+  QES+NEMEL+EPLL
Sbjct: 661 ACWPWQLLGIDFVGSRRIES-GQESANEMELAEPLL 678

BLAST of Cp4.1LG01g16800 vs. TAIR10
Match: AT3G49055.1 (AT3G49055.1 unknown protein)

HSP 1 Score: 131.7 bits (330), Expect = 1.7e-30
Identity = 138/452 (30.53%), Positives = 237/452 (52.43%), Query Frame = 1

Query: 229 SQLEATISGLRDEVS----KKTSVIEDLENTLTKRD---KKISEIEADLCGKLTRAEDEA 288
           S L++    LR ++     ++T +I        +RD   ++ SE+EA +  ++   E+  
Sbjct: 28  SSLDSNFLSLRSQIFEASYRRTDLIRVNHELFHERDALRRRNSELEAGILEEVMIREEMK 87

Query: 289 SELRQVAQEYDDKLRNLESKVESQRPLLMDQLGFISKIHDQIYDIIKIVDASDIDHSEFS 348
            +L +V++E    +  LE + + +  LL D   ++  + D++  +I+ ++  ++   E  
Sbjct: 88  RDL-EVSKE---TVSELEGEAKEKTKLLSDIADYVRSMEDRLSKLIRCLNEENVPEEERG 147

Query: 349 ESLFLPQETDIEENVRASLAGMESIYALAVLVMDKTRSSTQEKIREIKNLNETVAQLLKE 408
             L    ET  E N ++ L  ++ +    V  ++  + ST++K  E   L+ +V  L +E
Sbjct: 148 RKL----ETK-EYNSKSILELVKEV----VTKLETFQESTKKKKME---LSRSVEFLEEE 207

Query: 409 KEHIGNLLRSALSKRITSDPS-------KANQLFEVAENGLREAGIDFKFSKLLGDEIFS 468
              I  LLR+AL ++ T++         K   L ++A  GL+  G  F     LG+ +  
Sbjct: 208 NRDINVLLRAALFEKQTAEKQLKEMNDQKGLALLQIAGRGLQRIGFGFG----LGESVEE 267

Query: 469 TSRDNGKAVDAEEDEIFTLAGALENIVKASQIEIIELRHSLEELRAESVVLKERLESQSK 528
           +S     A + EE+ +     A+E  +K  + E+ +L+ SLEE R E V L++  E Q++
Sbjct: 268 SSETGNIANEEEENGVVI---AIEKTMKKLRQEVSQLKISLEESRLEEVGLRKVTEEQAQ 327

Query: 529 ELKLRSLQIMELEEKERVANESVEGLMMDITAAEEEIMRWKVAAEQEAAAGKAVEQEFLA 588
           +L   ++ I +L+ +E+   ++VE L+  I  AE E+ RW+ A E E  AG+   +    
Sbjct: 328 KLAENTVYINKLQNQEKFLAQNVEELVKAIREAESEVSRWREACELEVEAGQREVEVRDQ 387

Query: 589 QISAVKEEVEEARKVMLDSDNKLKFKEQTVNAAMAARDAAEKSLRLADVRASRLRERVEE 648
            I+ +K EVE+ R  +  S+ KLK KE+   AAM A +AAEKSLRLA+ R ++L  R+E 
Sbjct: 388 LIAVLKSEVEKLRSALARSEGKLKLKEELAKAAMVAEEAAEKSLRLAERRIAQLLSRIEH 447

Query: 649 LTRQLEELDKREDSRRGLNGSRYV-CWP-WQW 665
           L RQLEE +  E  RRG    RYV CWP W++
Sbjct: 448 LYRQLEEAESTE-RRRG--KFRYVWCWPMWRF 453
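
The Identity and Positives percentages in an HSP header are the match counts divided by the alignment length. The sketch below recomputes them from a header line as a sanity check. The function name `check_hsp_stats`, the regular expression, and the tolerance are illustrative assumptions and are not part of any BLAST tool.

```python
import re

# Hypothetical helper: matches the "Identity = a/b (p%), Positives = c/d (q%)"
# fields of an HSP header line. The optional "i" also accepts the
# "Postives" spelling that some report pages emit.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def check_hsp_stats(line, tol=0.01):
    """Recompute identity/positives percentages and compare with the reported ones."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("no Identity/Positives fields found")
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = m.groups()
    identity = 100.0 * int(id_n) / int(id_len)
    positives = 100.0 * int(pos_n) / int(pos_len)
    return {
        "identity": round(identity, 2),
        "positives": round(positives, 2),
        # True when both reported percentages agree with the recomputed ones.
        "consistent": abs(identity - float(id_pct)) <= tol
        and abs(positives - float(pos_pct)) <= tol,
    }


if __name__ == "__main__":
    header = "Identity = 138/452 (30.53%), Positives = 237/452 (52.43%), Query Frame = 1"
    print(check_hsp_stats(header))
```

For the AT3G49055.1 hit this gives 138/452 and 237/452, which round to the reported 30.53% and 52.43%.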

BLAST of Cp4.1LG01g16800 vs. NCBI nr
Match: gi|659130256|ref|XP_008465072.1| (PREDICTED: uncharacterized protein At3g49055 isoform X2 [Cucumis melo])

HSP 1 Score: 1118.2 bits (2891), Expect = 0.0e+00
Identity = 620/695 (89.21%), Positives = 654/695 (94.10%), Query Frame = 1

Query: 1   MASSVDEDVDAVLSDVEGDEHPTVIQNPSTEEISVERFREILAERDRERKAREATENSKS 60
           MAS +DEDVD VLSDVEGDEHP  IQNPS EEI+VERFREILAERDRER++REA ENSKS
Sbjct: 1   MASGLDEDVDVVLSDVEGDEHPITIQNPSPEEITVERFREILAERDRERQSREAAENSKS 60

Query: 61  ELLVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSAELAVANQQRDEVSK 120
           EL VSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVS+ELA  N+QRDEV K
Sbjct: 61  ELQVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSSELAEVNRQRDEVLK 120

Query: 121 LRDEITKEFDEILKERDALRSEIGNASHMLVTGIDKISAKVSNFKNFTAGGLPRSQKYTG 180
           LRDEITKEFDEILKERD LRSEIGNASHMLVTGIDKISAKVS+FKNFTAGGLPRSQKYTG
Sbjct: 121 LRDEITKEFDEILKERDTLRSEIGNASHMLVTGIDKISAKVSSFKNFTAGGLPRSQKYTG 180

Query: 181 LPAVAYGVIKRTNEIVEELVRQIDTTAKERNETREQMELRNYEIAIEVSQLEATISGLRD 240
           LPAVAYGVIKRTNEI+EELVRQIDTT K RNETREQMELRNYEIAIEVSQLEATISGL+D
Sbjct: 181 LPAVAYGVIKRTNEIIEELVRQIDTTTKSRNETREQMELRNYEIAIEVSQLEATISGLKD 240

Query: 241 EVSKKTSVIEDLENTLTKRDKKISEIEADLCGKLTRAEDEASELRQVAQEYDDKLRNLES 300
           EVSKKTSVIEDLENT+  +DKKISEIE D+ GKL+RAEDEASELRQ+ QEYDDKLR+LE 
Sbjct: 241 EVSKKTSVIEDLENTIIGKDKKISEIEEDVGGKLSRAEDEASELRQLVQEYDDKLRDLEL 300

Query: 301 KVESQRPLLMDQLGFISKIHDQIYDIIKIVDASDIDHSEFSESLFLPQETDIEENVRASL 360
           K+ESQRPLL+DQLG ISKIHDQIYDIIKIVD SD+DHSEFSESLFLP+ETD+EEN+RASL
Sbjct: 301 KMESQRPLLVDQLGLISKIHDQIYDIIKIVDVSDVDHSEFSESLFLPRETDMEENLRASL 360

Query: 361 AGMESIYALAVLVMDKTRSSTQEKIREIKNLNETVAQLLKEKEHIGNLLRSALSKRITSD 420
           AGMESIYALA LVMDKTRS   EKIRE KNLNETVAQLLKEKEHIG LLR+ALSKR+TSD
Sbjct: 361 AGMESIYALAKLVMDKTRSLIDEKIRETKNLNETVAQLLKEKEHIGYLLRTALSKRMTSD 420

Query: 421 P-SKANQLFEVAENGLREAGIDFKFSKLLGDEIFSTSRDNGKAVDAEEDEIFTLAGALEN 480
           P SKANQLFEVAENGLREAGIDFKFSKLLG+E F T+RDN KA+DA EDEIFTLAGALEN
Sbjct: 421 PSSKANQLFEVAENGLREAGIDFKFSKLLGEEKFPTTRDNRKALDA-EDEIFTLAGALEN 480

Query: 481 IVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQIMELEEKERVANESVEG 540
           IVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQI ELEEKERVANESVEG
Sbjct: 481 IVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQIQELEEKERVANESVEG 540

Query: 541 LMMDITAAEEEIMRWKVAAEQEAAAGKAVEQEFLAQISAVKEEVEEARKVMLDSDNKLKF 600
           LMMD+TAAEEEIMRWKVAAEQEAAAGKAVEQEFLAQIS VK+E+EEAR+V+LDSD KLKF
Sbjct: 541 LMMDVTAAEEEIMRWKVAAEQEAAAGKAVEQEFLAQISGVKQELEEARQVILDSDKKLKF 600

Query: 601 KEQTVNAAMAARDAAEKSLRLADVRASRLRERVEELTRQLEELDKREDSRRG-LNGSRYV 660
           KE+TVNAAMAARDAAEKSLRLADVRASRLRERVEELTRQLE+LD RE+SRRG  NG RYV
Sbjct: 601 KEETVNAAMAARDAAEKSLRLADVRASRLRERVEELTRQLEQLDNREESRRGSSNGHRYV 660

Query: 661 CWPWQWLGLDFVGSRRSETQQQESSNEMELSEPLL 694
           CWPWQWLGLDFVGSR SETQ QESSNEMELSEPLL
Sbjct: 661 CWPWQWLGLDFVGSRHSETQHQESSNEMELSEPLL 694

BLAST of Cp4.1LG01g16800 vs. NCBI nr
Match: gi|449463814|ref|XP_004149626.1| (PREDICTED: myosin heavy chain, non-muscle [Cucumis sativus])

HSP 1 Score: 1113.6 bits (2879), Expect = 0.0e+00
Identity = 616/695 (88.63%), Positives = 654/695 (94.10%), Query Frame = 1

Query: 1   MASSVDEDVDAVLSDVEGDEHPTVIQNPSTEEISVERFREILAERDRERKAREATENSKS 60
           MAS +DED D VLSDVEGDEHP  IQNPS EEI+VERFREILAERDRER++REA ENSKS
Sbjct: 1   MASGLDEDADVVLSDVEGDEHPITIQNPSPEEITVERFREILAERDRERQSREAAENSKS 60

Query: 61  ELLVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSAELAVANQQRDEVSK 120
           EL VSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSAELA AN+QRDE  K
Sbjct: 61  ELQVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSAELAEANRQRDEALK 120

Query: 121 LRDEITKEFDEILKERDALRSEIGNASHMLVTGIDKISAKVSNFKNFTAGGLPRSQKYTG 180
           LRDEITKEFDEILK+RD LRSEIGNASHMLVTGIDKISAKVS+FKNFTAGGLPRSQKYTG
Sbjct: 121 LRDEITKEFDEILKDRDTLRSEIGNASHMLVTGIDKISAKVSSFKNFTAGGLPRSQKYTG 180

Query: 181 LPAVAYGVIKRTNEIVEELVRQIDTTAKERNETREQMELRNYEIAIEVSQLEATISGLRD 240
           LPAVAYGVIKRTNEI+EELVRQIDTT K RNETREQMELRNYEIAIEVSQLEATISGL+D
Sbjct: 181 LPAVAYGVIKRTNEIIEELVRQIDTTTKSRNETREQMELRNYEIAIEVSQLEATISGLKD 240

Query: 241 EVSKKTSVIEDLENTLTKRDKKISEIEADLCGKLTRAEDEASELRQVAQEYDDKLRNLES 300
           EVSKKTSVIEDLENT+ ++DKKI E E DL GKL RAEDEAS+LRQ+ QEYDDKLR+LES
Sbjct: 241 EVSKKTSVIEDLENTIIEKDKKICENEVDLVGKLRRAEDEASDLRQLVQEYDDKLRDLES 300

Query: 301 KVESQRPLLMDQLGFISKIHDQIYDIIKIVDASDIDHSEFSESLFLPQETDIEENVRASL 360
           K+ESQRPLL+DQLG ISKIHDQIYDIIKIVD SD+DHSEFSESLFLP+ETD+EENVRASL
Sbjct: 301 KMESQRPLLVDQLGLISKIHDQIYDIIKIVDVSDVDHSEFSESLFLPRETDMEENVRASL 360

Query: 361 AGMESIYALAVLVMDKTRSSTQEKIREIKNLNETVAQLLKEKEHIGNLLRSALSKRITSD 420
           AGMESIYALA LVMDKTR+  +EKIRE KNLNETVAQLLKEKEHIG LLR+ALSKR+TSD
Sbjct: 361 AGMESIYALAKLVMDKTRNLIEEKIRESKNLNETVAQLLKEKEHIGYLLRTALSKRMTSD 420

Query: 421 P-SKANQLFEVAENGLREAGIDFKFSKLLGDEIFSTSRDNGKAVDAEEDEIFTLAGALEN 480
           P SKANQLFEVAENGLREAGIDFKFSKLLG+E FST+RDN KA+DA EDEIFTLAGALEN
Sbjct: 421 PSSKANQLFEVAENGLREAGIDFKFSKLLGEEKFSTTRDNRKALDA-EDEIFTLAGALEN 480

Query: 481 IVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQIMELEEKERVANESVEG 540
           IVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQI ELEEKERVANESVEG
Sbjct: 481 IVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQIQELEEKERVANESVEG 540

Query: 541 LMMDITAAEEEIMRWKVAAEQEAAAGKAVEQEFLAQISAVKEEVEEARKVMLDSDNKLKF 600
           LMMD+TAAEEEI+RWKVAAEQEAAAGKAVEQEFLAQIS VK+E+EEAR+V+LDSD KLKF
Sbjct: 541 LMMDVTAAEEEIIRWKVAAEQEAAAGKAVEQEFLAQISGVKQELEEARQVILDSDKKLKF 600

Query: 601 KEQTVNAAMAARDAAEKSLRLADVRASRLRERVEELTRQLEELDKREDSRRG-LNGSRYV 660
           KE+TVNAAMAARDAAEKSLRLADVRASRLRERVE+LTRQLE+LD RE+SR G  NG RYV
Sbjct: 601 KEETVNAAMAARDAAEKSLRLADVRASRLRERVEDLTRQLEQLDNREESRIGSSNGHRYV 660

Query: 661 CWPWQWLGLDFVGSRRSETQQQESSNEMELSEPLL 694
           CWPWQWLGLDFVGSR SETQQQESSNEMELSEPL+
Sbjct: 661 CWPWQWLGLDFVGSRHSETQQQESSNEMELSEPLI 694

BLAST of Cp4.1LG01g16800 vs. NCBI nr
Match: gi|659130246|ref|XP_008465067.1| (PREDICTED: uncharacterized protein At3g49055 isoform X1 [Cucumis melo])

HSP 1 Score: 1113.2 bits (2878), Expect = 0.0e+00
Identity = 620/697 (88.95%), Positives = 654/697 (93.83%), Query Frame = 1

Query: 1   MASSVDEDVDAVLSDVEGDEHPTVIQNPSTEEISVERFREILAERDRERKAREATENSKS 60
           MAS +DEDVD VLSDVEGDEHP  IQNPS EEI+VERFREILAERDRER++REA ENSKS
Sbjct: 1   MASGLDEDVDVVLSDVEGDEHPITIQNPSPEEITVERFREILAERDRERQSREAAENSKS 60

Query: 61  ELLVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSAELAVANQQRDEVSK 120
           EL VSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVS+ELA  N+QRDEV K
Sbjct: 61  ELQVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSSELAEVNRQRDEVLK 120

Query: 121 LRDEITKEFDEILKERDALRSEIGNASHMLVTGIDKISAKVSNFKNFTAGGLPRSQKYTG 180
           LRDEITKEFDEILKERD LRSEIGNASHMLVTGIDKISAKVS+FKNFTAGGLPRSQKYTG
Sbjct: 121 LRDEITKEFDEILKERDTLRSEIGNASHMLVTGIDKISAKVSSFKNFTAGGLPRSQKYTG 180

Query: 181 LPAVAYGVIKRTNEIVEELVRQIDTTAKERNETREQMELRNYEIAIEVSQLEATISGLRD 240
           LPAVAYGVIKRTNEI+EELVRQIDTT K RNETREQMELRNYEIAIEVSQLEATISGL+D
Sbjct: 181 LPAVAYGVIKRTNEIIEELVRQIDTTTKSRNETREQMELRNYEIAIEVSQLEATISGLKD 240

Query: 241 EVSKKTSVIEDLENTLTKRDKKISEIEADLCGKLTRAEDEASELRQVAQEYDDKLRNLES 300
           EVSKKTSVIEDLENT+  +DKKISEIE D+ GKL+RAEDEASELRQ+ QEYDDKLR+LE 
Sbjct: 241 EVSKKTSVIEDLENTIIGKDKKISEIEEDVGGKLSRAEDEASELRQLVQEYDDKLRDLEL 300

Query: 301 KVESQRPLLMDQLGFISKIHDQIYDIIKIVDASDIDHSEFSESLFLPQETDIEENVRASL 360
           K+ESQRPLL+DQLG ISKIHDQIYDIIKIVD SD+DHSEFSESLFLP+ETD+EEN+RASL
Sbjct: 301 KMESQRPLLVDQLGLISKIHDQIYDIIKIVDVSDVDHSEFSESLFLPRETDMEENLRASL 360

Query: 361 AGMESIYALAVLVMDKTRSSTQEKIREIKNLNETVAQLLKEKEHIGNLLRSALSKRITSD 420
           AGMESIYALA LVMDKTRS   EKIRE KNLNETVAQLLKEKEHIG LLR+ALSKR+TSD
Sbjct: 361 AGMESIYALAKLVMDKTRSLIDEKIRETKNLNETVAQLLKEKEHIGYLLRTALSKRMTSD 420

Query: 421 P-SKANQLFEVAENGLREAGIDFKFSKLLGDEIFSTSRDNGKAVDAEEDEIFTLAGALEN 480
           P SKANQLFEVAENGLREAGIDFKFSKLLG+E F T+RDN KA+DA EDEIFTLAGALEN
Sbjct: 421 PSSKANQLFEVAENGLREAGIDFKFSKLLGEEKFPTTRDNRKALDA-EDEIFTLAGALEN 480

Query: 481 IVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQIMELEEKERVANESVEG 540
           IVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQI ELEEKERVANESVEG
Sbjct: 481 IVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQIQELEEKERVANESVEG 540

Query: 541 LMMDITAAEEEIMRWKVAAEQEAAAGKAVEQEFLAQ--ISAVKEEVEEARKVMLDSDNKL 600
           LMMD+TAAEEEIMRWKVAAEQEAAAGKAVEQEFLAQ  IS VK+E+EEAR+V+LDSD KL
Sbjct: 541 LMMDVTAAEEEIMRWKVAAEQEAAAGKAVEQEFLAQVWISGVKQELEEARQVILDSDKKL 600

Query: 601 KFKEQTVNAAMAARDAAEKSLRLADVRASRLRERVEELTRQLEELDKREDSRRG-LNGSR 660
           KFKE+TVNAAMAARDAAEKSLRLADVRASRLRERVEELTRQLE+LD RE+SRRG  NG R
Sbjct: 601 KFKEETVNAAMAARDAAEKSLRLADVRASRLRERVEELTRQLEQLDNREESRRGSSNGHR 660

Query: 661 YVCWPWQWLGLDFVGSRRSETQQQESSNEMELSEPLL 694
           YVCWPWQWLGLDFVGSR SETQ QESSNEMELSEPLL
Sbjct: 661 YVCWPWQWLGLDFVGSRHSETQHQESSNEMELSEPLL 696

BLAST of Cp4.1LG01g16800 vs. NCBI nr
Match: gi|645230440|ref|XP_008221937.1| (PREDICTED: uncharacterized protein At3g49055 [Prunus mume])

HSP 1 Score: 903.7 bits (2334), Expect = 2.0e-259
Identity = 496/696 (71.26%), Positives = 600/696 (86.21%), Query Frame = 1

Query: 1   MASSVDEDVDAVLSDVEGDEH-PTVIQNPSTEEISVERFREILAERDRERKAREATENSK 60
           MAS+ DED DAVLSDVEGD+  P  I+ PS +EIS ERFRE++AE DRER+AREA ENSK
Sbjct: 1   MASAGDEDNDAVLSDVEGDDSVPVAIKTPSPDEISAERFRELVAELDRERQAREAVENSK 60

Query: 61  SELLVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSAELAVANQQRDEVS 120
           S+L + FNRLKALAHEAIKKRDE GRQRDEALREKEEA K NEKVS+ELA +N+ +DE  
Sbjct: 61  SDLQIQFNRLKALAHEAIKKRDEWGRQRDEALREKEEASKTNEKVSSELAESNRAKDEAL 120

Query: 121 KLRDEITKEFDEILKERDALRSEIGNASHMLVTGIDKISAKVSNFKNFTAGGLPRSQKYT 180
           + RDEI K+ DE++KERD LRS+IGN++HML++GIDKIS KVSNFKNF  GGLPRSQKYT
Sbjct: 121 QQRDEIAKQLDEVVKERDGLRSDIGNSTHMLMSGIDKISGKVSNFKNFGVGGLPRSQKYT 180

Query: 181 -GLPAVAYGVIKRTNEIVEELVRQIDTTAKERNETREQMELRNYEIAIEVSQLEATISGL 240
            GLPAVAYGVIKRTNEIVEELVRQID+TAK RNETREQM+ RNYEIAIE+SQLEATI  L
Sbjct: 181 TGLPAVAYGVIKRTNEIVEELVRQIDSTAKSRNETREQMDQRNYEIAIEISQLEATIGSL 240

Query: 241 RDEVSKKTSVIEDLENTLTKRDKKISEIEADLCGKLTRAEDEASELRQVAQEYDDKLRNL 300
           R+EV+KKTS++E LE ++ +++ K+SEIE ++  KL++AE E SEL+Q+  EYDDKL NL
Sbjct: 241 REEVAKKTSIVEKLEKSMAEKNGKVSEIEREMEEKLSKAESEVSELKQLVGEYDDKLTNL 300

Query: 301 ESKVESQRPLLMDQLGFISKIHDQIYDIIKIVDASDIDHSEFSESLFLPQETDIEENVRA 360
           +SK+E+QRPLL DQL  +SKIHD++Y +++IVDA+++D SEFSESLFLPQETD+EEN+RA
Sbjct: 301 DSKMEAQRPLLFDQLDLVSKIHDRLYHVMRIVDANNLDQSEFSESLFLPQETDMEENIRA 360

Query: 361 SLAGMESIYALAVLVMDKTRSSTQEKIREIKNLNETVAQLLKEKEHIGNLLRSALSKRIT 420
           +LAGMESI+ L  +V++KTR  T+EK REIK+L+ETV++L+KEKE IG+LLRSALSKRIT
Sbjct: 361 TLAGMESIHELTRIVIEKTRDLTEEKNREIKSLDETVSRLVKEKEQIGSLLRSALSKRIT 420

Query: 421 SDPS-KANQLFEVAENGLREAGIDFKFSKLLGDEIFSTSRDNGKAVDAEEDEIFTLAGAL 480
           S PS K ++LF+VAENGLREAGI+FKFSK +GD    T       ++ EEDEI+ LAGAL
Sbjct: 421 SSPSSKTSELFQVAENGLREAGIEFKFSKHVGDGEVDT-------LETEEDEIYALAGAL 480

Query: 481 ENIVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQIMELEEKERVANESV 540
           ENIVKASQ+EII+L+HS+EELRAE  +LK+ +E+Q+KEL  R  +I ELEEKERVANESV
Sbjct: 481 ENIVKASQLEIIDLQHSVEELRAELSLLKQHVEAQAKELDYRLRRIEELEEKERVANESV 540

Query: 541 EGLMMDITAAEEEIMRWKVAAEQEAAAGKAVEQEFLAQISAVKEEVEEARKVMLDSDNKL 600
           EGLMMDI AAEEEI RWK AAEQEAAAG  VEQEF+AQ+SA+K E+EEA++ +++S+ KL
Sbjct: 541 EGLMMDIVAAEEEIARWKAAAEQEAAAGTGVEQEFVAQLSALKLELEEAKQAIVESEKKL 600

Query: 601 KFKEQTVNAAMAARDAAEKSLRLADVRASRLRERVEELTRQLEELDKREDSRRGLNGSRY 660
           KFKE+T +AAMAARDAAEKSL+LAD+RASRLR+RVEELTRQLEE + REDSRRGL+G RY
Sbjct: 601 KFKEETADAAMAARDAAEKSLKLADLRASRLRDRVEELTRQLEEFESREDSRRGLSGPRY 660

Query: 661 VCWPWQWLGLDFVGSRRSETQQQESSNEMELSEPLL 694
           VCWPWQWLGLDFVG  RS+TQQ+ SSNEMELSEPLL
Sbjct: 661 VCWPWQWLGLDFVGVSRSDTQQESSSNEMELSEPLL 689

BLAST of Cp4.1LG01g16800 vs. NCBI nr
Match: gi|590695331|ref|XP_007044860.1| (Uncharacterized protein isoform 1 [Theobroma cacao])

HSP 1 Score: 887.1 bits (2291), Expect = 2.0e-254
Identity = 490/694 (70.61%), Positives = 590/694 (85.01%), Query Frame = 1

Query: 1   MASSVDEDVDAVLSDVEGDEH-PTVIQNPSTEEISVERFREILAERDRERKAREATENSK 60
           M+++ DE+ DAVLSDVE DE  P VI+ PS +++SVE+FREILAE +RE++AREATENSK
Sbjct: 1   MSTAADEEADAVLSDVESDEPIPIVIKEPSRDDVSVEKFREILAELEREKQAREATENSK 60

Query: 61  SELLVSFNRLKALAHEAIKKRDECGRQRDEALREKEEALKLNEKVSAELAVANQQRDEVS 120
           SEL VSFNRLKALAHEAI+KRDEC RQRDEALREKEEAL+ NE V A+LA AN+ +D+V+
Sbjct: 61  SELQVSFNRLKALAHEAIRKRDECARQRDEALREKEEALRSNENVLAQLAEANKIKDDVT 120

Query: 121 KLRDEITKEFDEILKERDALRSEIGNASHMLVTGIDKISAKVSNFKNFTAGGLPRSQKYT 180
           K R+++ K+ +E  K +D LRSEI  ++HMLV+GI+KIS KVSNFKNF AGGLPRSQKYT
Sbjct: 121 KQREDLAKQLEEATKGKDGLRSEIETSAHMLVSGIEKISGKVSNFKNFAAGGLPRSQKYT 180

Query: 181 GLPAVAYGVIKRTNEIVEELVRQIDTTAKERNETREQMELRNYEIAIEVSQLEATISGLR 240
           GLP+VAYGVIKRTNEIVEELV+Q++TTAK RNE REQME RNYEIAIEVSQLEATISGLR
Sbjct: 181 GLPSVAYGVIKRTNEIVEELVKQMETTAKSRNEAREQMEQRNYEIAIEVSQLEATISGLR 240

Query: 241 DEVSKKTSVIEDLENTLTKRDKKISEIEADLCGKLTRAEDEASELRQVAQEYDDKLRNLE 300
           +EV+KK+++ E+LE  + ++D K  EIE ++  K+  AE+E+ ELR +A EYDDKL++LE
Sbjct: 241 EEVAKKSNLTENLEKNIAEKDGKFVEIEKEMSEKINWAENESMELRNLASEYDDKLKSLE 300

Query: 301 SKVESQRPLLMDQLGFISKIHDQIYDIIKIVDASDIDHSEFSESLFLPQETDIEENVRAS 360
           SK+E QRPLL+DQL F+SKIH+ IYD IKIVDA ++D S+ SES FLPQETD+EEN+RA 
Sbjct: 301 SKMELQRPLLVDQLNFVSKIHESIYDAIKIVDADNMDQSDVSESFFLPQETDLEENIRAC 360

Query: 361 LAGMESIYALAVLVMDKTRSSTQEKIREIKNLNETVAQLLKEKEHIGNLLRSALSKRITS 420
           LAGMESIY L  +++ KT+   +EK  E+K+LNETV +L+KEKEHIG+LLRSALSKR+TS
Sbjct: 361 LAGMESIYELTRILVGKTKDLVEEKNHEVKSLNETVGRLIKEKEHIGSLLRSALSKRMTS 420

Query: 421 D-PSKANQLFEVAENGLREAGIDFKFSKLLGDEIFSTSRDNGKAVDAEEDEIFTLAGALE 480
           +  SK N+LF+ AENGLREAGIDFKFSKL+GD       +  +A D E+DEI+TLAGALE
Sbjct: 421 ENKSKTNELFQTAENGLREAGIDFKFSKLIGD------GNKAEAQDTEQDEIYTLAGALE 480

Query: 481 NIVKASQIEIIELRHSLEELRAESVVLKERLESQSKELKLRSLQIMELEEKERVANESVE 540
           NIVK SQ+EIIEL+HS+EELRAES VLKE +E+Q+KE+  R  +I ELEEKERVANESVE
Sbjct: 481 NIVKTSQLEIIELQHSVEELRAESSVLKEHVEAQAKEINQRMRRIEELEEKERVANESVE 540

Query: 541 GLMMDITAAEEEIMRWKVAAEQEAAAGKAVEQEFLAQISAVKEEVEEARKVMLDSDNKLK 600
           GLMMDI AAEEEI RWK AAEQEAAAG+AVEQEFL Q+SAVK+E+EEA++ ML+S+ KLK
Sbjct: 541 GLMMDIAAAEEEISRWKSAAEQEAAAGRAVEQEFLTQLSAVKQELEEAKQAMLESEKKLK 600

Query: 601 FKEQTVNAAMAARDAAEKSLRLADVRASRLRERVEELTRQLEELDKREDSRRGLNGSRYV 660
           FKE+T  AAM ARDAAEKSLRLAD+RASRLR+RVEEL+RQLEE + REDS RG NGSRYV
Sbjct: 601 FKEETAAAAMGARDAAEKSLRLADMRASRLRDRVEELSRQLEEFETREDS-RGRNGSRYV 660

Query: 661 CWPWQWLGLDFVGSRRSETQQQESSNEMELSEPL 693
           CWPWQWLGLDFVG R+ E QQQ SSNEMELSEPL
Sbjct: 661 CWPWQWLGLDFVGFRKPEMQQQ-SSNEMELSEPL 686

The following BLAST results are available for this feature:
Match Name                        E-value   Identity  Description
Y3905_ARATH                       3.0e-29   30.53     Uncharacterized protein At3g49055 OS=Arabidopsis thaliana GN=At3g49055 PE=2 SV=1

Match Name                        E-value   Identity  Description
A0A0A0KIP2_CUCSA                  0.0e+00   88.63     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G011700 PE=4 SV=1
A0A061E8N7_THECC                  1.4e-254  70.61     Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_010623 PE=4 SV=1
A0A0B0MQ58_GOSAR                  1.2e-250  70.26     Uncharacterized protein OS=Gossypium arboreum GN=F383_23910 PE=4 SV=1
A0A0D2SIY3_GOSRA                  1.2e-250  70.26     Uncharacterized protein OS=Gossypium raimondii GN=B456_005G175900 PE=4 SV=1
A0A067JV23_JATCU                  1.0e-249  69.86     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17831 PE=4 SV=1

Match Name                        E-value   Identity  Description
AT1G24560.1                       8.3e-211  60.06     unknown protein
AT3G49055.1                       1.7e-30   30.53     unknown protein

Match Name                        E-value   Identity  Description
gi|659130256|ref|XP_008465072.1|  0.0e+00   89.21     PREDICTED: uncharacterized protein At3g49055 isoform X2 [Cucumis melo]
gi|449463814|ref|XP_004149626.1|  0.0e+00   88.63     PREDICTED: myosin heavy chain, non-muscle [Cucumis sativus]
gi|659130246|ref|XP_008465067.1|  0.0e+00   88.95     PREDICTED: uncharacterized protein At3g49055 isoform X1 [Cucumis melo]
gi|645230440|ref|XP_008221937.1|  2.0e-259  71.26     PREDICTED: uncharacterized protein At3g49055 [Prunus mume]
gi|590695331|ref|XP_007044860.1|  2.0e-254  70.61     Uncharacterized protein isoform 1 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0006816      calcium ion transport
biological_process  GO:0009630      gravitropism
biological_process  GO:0006486      protein glycosylation
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g16800.1  Cp4.1LG01g16800.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term  IPR Description   Source   Source Term    Source Description          Alignment
None      No IPR available  unknown  Coil           Coil                        coord: 87..142, score: -; coord: 189..216, score: -; coord: 235..269, score: -; coord: 274..301, score: -; coord: 481..529, score: -; coord: 618..645, score: -; coord: 381..408, score:
None      No IPR available  PANTHER  PTHR34937      FAMILY NOT NAMED            coord: 115..693, score: 0.0; coord: 2..98, score:
None      No IPR available  PANTHER  PTHR34937:SF1  MYOSIN HEAVY CHAIN-RELATED  coord: 2..98, score: 0.0; coord: 115..693, score:

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                      Block
Cp4.1LG01g16800  Wax gourd                     cpewgoB0559
Cp4.1LG01g16800  Cucurbita maxima (Rimu)       cmacpeB720
Cp4.1LG01g16800  Cucurbita moschata (Rifu)     cmocpeB673
Cp4.1LG01g16800  Watermelon (Charleston Gray)  cpewcgB369