CmaCh20G001420 (gene) Cucurbita maxima (Rimu)

Name: CmaCh20G001420
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Receptor-like kinase
Location: Cma_Chr20 : 694599 .. 697755 (+)
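
The location string can be turned into a span length with a small parser. This is a minimal sketch: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which is the usual genome-browser convention but is not stated on this page.

```python
import re

def parse_location(text):
    """Parse a string such as 'Cma_Chr20 : 694599 .. 697755 (+)'.

    Assumes 1-based, inclusive coordinates (not confirmed by the page).
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Inclusive coordinates: both endpoints count toward the length.
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr20 : 694599 .. 697755 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 3157 bases on the forward strand of Cma_Chr20.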
The following sequences are available for this feature:

Gene sequence (with intron)

TCAGCCGATAAAGCGTGCGATTTGCCCAACCGTCTATAAGAGAAACACCCACGACAAAATGGAATGTAGCAATATTCATTATGCTGGAGAAGTTTCATCATTAATTTTTCCCCACCAGGGTTCTCAATTTTGCCTGCCATGGATCCCTGGATTCACAGGTTCAGGCATCGCCTCCTCGCATGGTTTCATCGATCTCGTTCTGGTACTTCACTTCTGCGCTTCGATTTCGCCCTTTTTTTGTTTGTTTATATTTGTAATGTGTTTCGCATATTTGATTTTGTTGGCACTGGTGAATTTCAAGCAAATATGATATTTGGTCAATTGGTCCATTGGATTCCGTGAGGCTTTGTGACGATTGTTGATTGTTGTGGTTGAATTATGATGCTTCTCTGTAGCTTGGCTCTGAACTGTATCGGAATTTGGAAATTGGACGTTGACTGTGCTTTCACGTGATATTTTTACGTGCAAAATTGTTAAAGAACGCATTTCACATGAGGAATTCAACGTTTAAATGATTTTAAATCTTGCGTTATAATCTACAGGATTTTCCTTACCTTTATTCTCGTCCTGTTATGAACTTTTCGAATTTCAACACCTCATTTTCTAAAGGCGTCGACGAGCTGCTCTTGCTCATATTAATGGCACCTAGTATCGATTTGAGCGATATTATACCCAATCATACACTTAGCGGCATTATACATACGTCAGGGGCTTTTTCTTTATGTTTTGTGCTGATATGTTTGTCTGTTTCTGCATTTTTCATTTTCAGCGATCTAAATCCTATGGATTTAGATCACATCACCTATTTCCTGCAATAATGTTCTTGTACTTTTATATGTCTGGAAACAACTACCTAAATATAACTCAATTGACTTTCTATTTCAGTGCCAATATTATACGTGAGGCGTTTTTCGTACAGGGAAATAAAAAGGGCAACTGGTGGGTTCAACAGAGTTGTCTACACCAATCCTCAGTGTGCTGCTTATAATGCTAGATTTCAAGATGGCAGTGTTGCATTGGTAAAAGAACTGAGAGCTTTGAATGACGATGTCTTCAAGACGGAAGTGCAGCTCCTTGGACGCTTGCACCATCGTCATTTACTTACATTGAGAGGCTTCTCCACAGGGCATAAGAGGTTTGATGATTCTTATCTTGACTGCCAATCGAATGAGTATAGTCTTCGCTTCATCTTTGTTTGTCGGATACTTTACTCTAACATGCATAATATGAGGATATCGCTCTCTTGTATTGTTCATATGGCGCTATACACTGACTATGTTGTGAAATGTCAACTTGCATTCCCGTGGAGTCTCTACAGTTTGGGCAGAAATGTCGATTATAGTATCGCTTATCCTGGATGCAGCATAGAAATGTTTCAAATGGCTAATAATCTTTAACACTTTGTCATATATAGTTGTTTTTCGGTGCCTGTCATATTACTTCATGCATTCTCAAATGCAATTAAGTAGAATAAGCCTCTATGACATAGATCTACCATCATATGGCTGGATTATTTGAATAACATATTGGGATACTTTGTTCTACGAGGATATAGGCTTTTACTCGTTGAGAGATTGTTATGTTTCAGAGATATTGTGATTAGGTGAAAGAATTTGAGTATAGGTTTGAAATGAGATAGGTTTAGCGGAGTAGATCAAATGGGAAGTCTAACTATAATAGTTTCAAGCCCACCGCTAGCAGATATTGTCCACTTTGGGGTTTCTCAAGGATTTTAAAATGCGTATGCTAGGGAGAGGTTTCCACACCCTTGTAAAGAATGTTTCGTTCTCCTCTCCAACCGACGTGGGATCTACCATGAGGAAGAAGAGAGAACCCCACGTTAAGAACTTTTCAAATGCACAAATAAAATTTGATCTGATCGAAACATCTCTGCATACGGGAAGCAATTTTTATTTCGAATTTGATCACAGCAGTTGCTTTTGACTTGTTCAATATCAACTATTATGTCCATTGATTAGTTGCTGTCGTTTGGCATTTTC
ACATTTCCATTTTATCCTTTGATTCTATGCAGATTGCTCGTGTTTGACGAGATTGGCAATGGAAGCTTGAGGGATCATCTTAATGGTATCATCGAGAACTGAGTTATTTGAATGCATAGTTTCGTTTGACATAGATGATTGCATCACAATGGTCCTTCTGTGGCAGATCCTCTCAGGACTCCCTTGAATTGGAGGGCGAGGCTGCAAATAGCCGCAAGTGTGGCTGCTGCCTTGGTAAGTTCTCTAAACTTAACGACACTGTTGTCACTTGATATATCATAAGTGAGTTCCCATGACCTCCTCGAATAACATTCGAAACAAAGGGCTTAAAGCATCCATAGTTAAATTATCTTTGATGCAGGAATACTTGCTTCTTTTCACCGACCCCCCAATGTGTCATGTCTCCATTTCCTCAAACACTATAATGTTAGATGAAAATTTCACTGCAAAGGTACCGAAATAATCTTTGGAGTTTCCTCGCTCGAATTTTGCGAAAAAACATGTTAAAATTCTTCCTTTTTACCTGCTCAGATATCTGATGTTGGCTTTCTCTACTCCCCGGTAAGCAGCGGTGATCAGGCAGATGCATCAAAGGCAGATGGTTAGTTTTTCAAATAAAGAAATGGGAGAGATCTGCAGTATCATTCTCCCCATGTTTAAGATTTAAGATTGAAGTTTTCGTTGTCTTCTGATAACCACTGCAGATTTCATGGATGAGAAATGTGGAAACATAATTTACCAGCTGGGAATCCTAATCCTGGAGCTCATAACGGGACAATCATCAGATGGGACAAGCGGCGATCTCATCAAATGGATCCAAGGGAAAAACTTTGCGAGGTCAATGAACAAGATGATCGACCCTGATCTGGGAAATAGTTTTGACAACAAAGAAGTGAGAAATCTTCTATCAATAGCAAAGCTGTGTGTAAAGTCCAGAGAAAAGCCAAGATTCTCCATTCCACAGATTTTCCGGTATCTGCAGAGTAAAGTAGATGTTACTAGCACCTACTAGTATTTTCTTCTTCATTTTTTATTTTTATTTTTATTTTTATTTTTTGTTTCTGTTTTGTTCACGACTTATTTTCATACCCACGTAAATTAAATTATTTTATTAGGAAAAAGAAAAAAAATCTATTATTCTGTTCCTTCTTCCTTCA

mRNA sequence

TCAGCCGATAAAGCGTGCGATTTGCCCAACCGTCTATAAGAGAAACACCCACGACAAAATGGAATGTAGCAATATTCATTATGCTGGAGAAGTTTCATCATTAATTTTTCCCCACCAGGGTTCTCAATTTTGCCTGCCATGGATCCCTGGATTCACAGGTTCAGGCATCGCCTCCTCGCATGGTTTCATCGATCTCGTTCTGTGCCAATATTATACGTGAGGCGTTTTTCGTACAGGGAAATAAAAAGGGCAACTGGTGGGTTCAACAGAGTTGTCTACACCAATCCTCAGTGTGCTGCTTATAATGCTAGATTTCAAGATGGCAGTGTTGCATTGGTAAAAGAACTGAGAGCTTTGAATGACGATGTCTTCAAGACGGAAGTGCAGCTCCTTGGACGCTTGCACCATCGTCATTTACTTACATTGAGAGGCTTCTCCACAGGGCATAAGAGATTGCTCGTGTTTGACGAGATTGGCAATGGAAGCTTGAGGGATCATCTTAATGATCCTCTCAGGACTCCCTTGAATTGGAGGGCGAGGCTGCAAATAGCCGCAAGTGTGGCTGCTGCCTTGGAATACTTGCTTCTTTTCACCGACCCCCCAATGTGTCATGTCTCCATTTCCTCAAACACTATAATGTTAGATGAAAATTTCACTGCAAAGATATCTGATGTTGGCTTTCTCTACTCCCCGGTAAGCAGCGGTGATCAGGCAGATGCATCAAAGGCAGATGATTTCATGGATGAGAAATGTGGAAACATAATTTACCAGCTGGGAATCCTAATCCTGGAGCTCATAACGGGACAATCATCAGATGGGACAAGCGGCGATCTCATCAAATGGATCCAAGGGAAAAACTTTGCGAGGTCAATGAACAAGATGATCGACCCTGATCTGGGAAATAGTTTTGACAACAAAGAAGTGAGAAATCTTCTATCAATAGCAAAGCTGTGTGTAAAGTCCAGAGAAAAGCCAAGATTCTCCATTCCACAGATTTTCCGGTATCTGCAGAGTAAAGTAGATGTTACTAGCACCTACTAGTATTTTCTTCTTCATTTTTTATTTTTATTTTTATTTTTATTTTTTGTTTCTGTTTTGTTCACGACTTATTTTCATACCCACGTAAATTAAATTATTTTATTAGGAAAAAGAAAAAAAATCTATTATTCTGTTCCTTCTTCCTTCA

Coding sequence (CDS)

ATGGATCCCTGGATTCACAGGTTCAGGCATCGCCTCCTCGCATGGTTTCATCGATCTCGTTCTGTGCCAATATTATACGTGAGGCGTTTTTCGTACAGGGAAATAAAAAGGGCAACTGGTGGGTTCAACAGAGTTGTCTACACCAATCCTCAGTGTGCTGCTTATAATGCTAGATTTCAAGATGGCAGTGTTGCATTGGTAAAAGAACTGAGAGCTTTGAATGACGATGTCTTCAAGACGGAAGTGCAGCTCCTTGGACGCTTGCACCATCGTCATTTACTTACATTGAGAGGCTTCTCCACAGGGCATAAGAGATTGCTCGTGTTTGACGAGATTGGCAATGGAAGCTTGAGGGATCATCTTAATGATCCTCTCAGGACTCCCTTGAATTGGAGGGCGAGGCTGCAAATAGCCGCAAGTGTGGCTGCTGCCTTGGAATACTTGCTTCTTTTCACCGACCCCCCAATGTGTCATGTCTCCATTTCCTCAAACACTATAATGTTAGATGAAAATTTCACTGCAAAGATATCTGATGTTGGCTTTCTCTACTCCCCGGTAAGCAGCGGTGATCAGGCAGATGCATCAAAGGCAGATGATTTCATGGATGAGAAATGTGGAAACATAATTTACCAGCTGGGAATCCTAATCCTGGAGCTCATAACGGGACAATCATCAGATGGGACAAGCGGCGATCTCATCAAATGGATCCAAGGGAAAAACTTTGCGAGGTCAATGAACAAGATGATCGACCCTGATCTGGGAAATAGTTTTGACAACAAAGAAGTGAGAAATCTTCTATCAATAGCAAAGCTGTGTGTAAAGTCCAGAGAAAAGCCAAGATTCTCCATTCCACAGATTTTCCGGTATCTGCAGAGTAAAGTAGATGTTACTAGCACCTACTAG

Protein sequence

MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSYREIKRATGGFNRVVYTNPQCAAYNARFQDGSVALVKELRALNDDVFKTEVQLLGRLHHRHLLTLRGFSTGHKRLLVFDEIGNGSLRDHLNDPLRTPLNWRARLQIAASVAAALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISDVGFLYSPVSSGDQADASKADDFMDEKCGNIIYQLGILILELITGQSSDGTSGDLIKWIQGKNFARSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKVDVTSTY
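
The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. The sketch below is illustrative only: `translate` is a hypothetical helper, the standard code (NCBI translation table 1) is assumed to apply, and only the first 21 and last 15 nucleotides of the CDS above are used, on the assumption that both fragments are in frame.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGATCCCTGGATTCACAGG"  # first 21 nt of the CDS above
cds_end = "ACTAGCACCTACTAG"          # last 15 nt of the CDS above

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MDPWIHR and TSTY, matching the first seven and last four residues of the protein sequence listed above.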
BLAST of CmaCh20G001420 vs. Swiss-Prot
Match: Y1497_ARATH (Probable receptor-like protein kinase At1g49730 OS=Arabidopsis thaliana GN=At1g49730 PE=2 SV=1)

HSP 1 Score: 171.0 bits (432), Expect = 2.0e-41
Identity = 103/291 (35.40%), Positives = 165/291 (56.70%), Query Frame = 1

Query: 28  RRFSYREIKRATGGFNRVVYTNPQCAAYNARFQDGSVALVKELRALNDDV---FKTEVQL 87
           R+FSY+E+  AT  FN V+        Y A F DG +A VK++  +++     F  E+ L
Sbjct: 315 RKFSYKEMTNATNDFNTVIGQGGFGTVYKAEFNDGLIAAVKKMNKVSEQAEQDFCREIGL 374

Query: 88  LGRLHHRHLLTLRGFSTGHK-RLLVFDEIGNGSLRDHLNDPLRTPLNWRARLQIAASVAA 147
           L +LHHR+L+ L+GF    K R LV+D + NGSL+DHL+   + P +W  R++IA  VA 
Sbjct: 375 LAKLHHRNLVALKGFCINKKERFLVYDYMKNGSLKDHLHAIGKPPPSWGTRMKIAIDVAN 434

Query: 148 ALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISDVGFLYS---------PVSSGDQADA 207
           ALEYL  + DPP+CH  I S+ I+LDENF AK+SD G  +S         PV++  +   
Sbjct: 435 ALEYLHFYCDPPLCHRDIKSSNILLDENFVAKLSDFGLAHSSRDGSVCFEPVNTDIRGTP 494

Query: 208 SKAD------DFMDEKCGNIIYQLGILILELITGQSSDGTSGDLIKWIQGKNFARSMN-K 267
              D        + EK  + +Y  G+++LELITG+ +     +L++  Q    A+S + +
Sbjct: 495 GYVDPEYVVTQELTEK--SDVYSYGVVLLELITGRRAVDEGRNLVEMSQRFLLAKSKHLE 554

Query: 268 MIDPDLGNSFDN---KEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKVD 296
           ++DP + +S ++   K++  ++++ +LC +   + R SI Q+ R L    D
Sbjct: 555 LVDPRIKDSINDAGGKQLDAVVTVVRLCTEKEGRSRPSIKQVLRLLCESCD 603
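
The percentages in an HSP header follow directly from the counts: identities are columns where the query and subject residues are the same, positives also include conservative substitutions (shown as `+` in the midline), and the denominator is the alignment length including gaps. A minimal sketch using the counts reported for the Y1497_ARATH hit (the helper name `percent` is hypothetical):

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the HSP header of the Y1497_ARATH alignment.
identity = percent(103, 291)
positives = percent(165, 291)

print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

This reproduces the 35.40% identity and 56.70% positives shown in the header.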

BLAST of CmaCh20G001420 vs. Swiss-Prot
Match: Y4245_ARATH (Probable LRR receptor-like serine/threonine-protein kinase At4g20450 OS=Arabidopsis thaliana GN=At4g20450 PE=2 SV=1)

HSP 1 Score: 131.0 bits (328), Expect = 2.2e-29
Identity = 90/294 (30.61%), Positives = 140/294 (47.62%), Query Frame = 1

Query: 19  SRSVPILYVRRFSYREIKRATGGFNRVVYTNPQCAAYNARFQDGSVALVK---ELRALND 78
           SRS  +   R ++Y E+   T  F R +        Y+    D     VK   E  A   
Sbjct: 570 SRSSMVANKRSYTYEEVAVITNNFERPLGEGGFGVVYHGNVNDNEQVAVKVLSESSAQGY 629

Query: 79  DVFKTEVQLLGRLHHRHLLTLRGF-STGHKRLLVFDEIGNGSLRDHLN-DPLRTPLNWRA 138
             FK EV LL R+HH +L+TL G+   G   +L+++ + NG+L+ HL+ +  R+PL+W  
Sbjct: 630 KQFKAEVDLLLRVHHINLVTLVGYCDEGQHLVLIYEYMSNGNLKQHLSGENSRSPLSWEN 689

Query: 139 RLQIAASVAAALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISDVGFLYS-PVSSGDQA 198
           RL+IAA  A  LEYL +   PPM H  I S  I+LD NF AK+ D G   S PV S    
Sbjct: 690 RLRIAAETAQGLEYLHIGCKPPMIHRDIKSMNILLDNNFQAKLGDFGLSRSFPVGSETHV 749

Query: 199 DASKA------------DDFMDEKCGNIIYQLGILILELITGQ---SSDGTSGDLIKWIQ 258
             + A             +++ EK  + ++  G+++LE+IT Q           + +W+ 
Sbjct: 750 STNVAGSPGYLDPEYYRTNWLTEK--SDVFSFGVVLLEIITSQPVIDQTREKSHIGEWVG 809

Query: 259 GKNFARSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQ 292
            K     +  ++DP +   +D+  +   L +A  CV      R ++ Q+   LQ
Sbjct: 810 FKLTNGDIKNIVDPSMNGDYDSSSLWKALELAMSCVSPSSSGRPNMSQVANELQ 861

BLAST of CmaCh20G001420 vs. Swiss-Prot
Match: NCRK_ARATH (Receptor-like serine/threonine-protein kinase NCRK OS=Arabidopsis thaliana GN=NCRK PE=1 SV=1)

HSP 1 Score: 129.8 bits (325), Expect = 5.0e-29
Identity = 91/305 (29.84%), Positives = 152/305 (49.84%), Query Frame = 1

Query: 29  RFSYREIKRATGGF--NRVVYTNPQCAAYNARFQDGSVALVKELRALNDD----VFKTEV 88
           +FSY E+++AT  F  N V+        Y  + +DG  A +K L     D    +F TEV
Sbjct: 197 QFSYTELEQATNKFSSNSVIGHGGSSCVYRGQLKDGKTAAIKRLNTPKGDDTDTLFSTEV 256

Query: 89  QLLGRLHHRHLLTLRGFSTGH-----KRLLVFDEIGNGSLRDHLNDPLRTPLNWRARLQI 148
           +LL RLHH H++ L G+ +       +RLLVF+ +  GSLRD L+  L   + W  R+ +
Sbjct: 257 ELLSRLHHYHVVPLIGYCSEFHGKHAERLLVFEYMSYGSLRDCLDGELGEKMTWNIRISV 316

Query: 149 AASVAAALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISDVGFLYSPVSSGDQADASK- 208
           A   A  LEYL     P + H  + S  I+LDEN+ AKI+D+G      S G Q+ +S  
Sbjct: 317 ALGAARGLEYLHEAAAPRILHRDVKSTNILLDENWHAKITDLGMAKCLSSDGLQSGSSSP 376

Query: 209 -----------ADDFMDEKCG---NIIYQLGILILELITGQ------SSDGTSGDLIKWI 268
                      A ++    C    + ++  G+++LELITG+      S++     L+ W 
Sbjct: 377 TTGLQGTFGYFAPEYAIAGCASQMSDVFSFGVVLLELITGRKPIQKPSNNKGEESLVIWA 436

Query: 269 --QGKNFARSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKV 300
             + ++  R + ++ DP L   F  +E++ +  +AK C+    + R ++ ++ + L +  
Sbjct: 437 VPRLQDSKRVIEELPDPRLNGKFAEEEMQIMAYLAKECLLLDPESRPTMREVVQILSTIT 496

BLAST of CmaCh20G001420 vs. Swiss-Prot
Match: PERK9_ARATH (Proline-rich receptor-like protein kinase PERK9 OS=Arabidopsis thaliana GN=PERK9 PE=2 SV=1)

HSP 1 Score: 125.2 bits (313), Expect = 1.2e-27
Identity = 90/290 (31.03%), Positives = 144/290 (49.66%), Query Frame = 1

Query: 30  FSYREIKRATGGFNR--VVYTNPQCAAYNARFQDGSVALVKELR---ALNDDVFKTEVQL 89
           FSY E+ +AT GF++  ++        Y     DG V  VK+L+      D  FK EV+ 
Sbjct: 365 FSYEELVKATNGFSQENLLGEGGFGCVYKGILPDGRVVAVKQLKIGGGQGDREFKAEVET 424

Query: 90  LGRLHHRHLLTLRGFS-TGHKRLLVFDEIGNGSLRDHLNDPLRTPLNWRARLQIAASVAA 149
           L R+HHRHL+++ G   +G +RLL++D + N  L  HL+   ++ L+W  R++IAA  A 
Sbjct: 425 LSRIHHRHLVSIVGHCISGDRRLLIYDYVSNNDLYFHLHGE-KSVLDWATRVKIAAGAAR 484

Query: 150 ALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISDVGFLYSPV------------SSGDQ 209
            L YL     P + H  I S+ I+L++NF A++SD G     +            + G  
Sbjct: 485 GLAYLHEDCHPRIIHRDIKSSNILLEDNFDARVSDFGLARLALDCNTHITTRVIGTFGYM 544

Query: 210 ADASKADDFMDEKCGNIIYQLGILILELITGQSSDGTS---GD--LIKW----IQGKNFA 269
           A    +   + EK  + ++  G+++LELITG+    TS   GD  L++W    I      
Sbjct: 545 APEYASSGKLTEK--SDVFSFGVVLLELITGRKPVDTSQPLGDESLVEWARPLISHAIET 604

Query: 270 RSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQS 293
              + + DP LG ++   E+  ++  A  CV+     R  + QI R  +S
Sbjct: 605 EEFDSLADPKLGGNYVESEMFRMIEAAGACVRHLATKRPRMGQIVRAFES 651

BLAST of CmaCh20G001420 vs. Swiss-Prot
Match: RLK7_ARATH (Receptor-like protein kinase At5g59670 OS=Arabidopsis thaliana GN=At5g59670 PE=1 SV=1)

HSP 1 Score: 121.3 bits (303), Expect = 1.8e-26
Identity = 86/291 (29.55%), Positives = 143/291 (49.14%), Query Frame = 1

Query: 28  RRFSYREIKRATGGFNRVVYTNPQCAAYNARFQDGSVALVKELRALNDD---VFKTEVQL 87
           +RF+Y E+ + T  F RV+        Y+   +      VK L   +      FK EV L
Sbjct: 552 KRFTYSEVVQVTKNFQRVLGKGGFGMVYHGTVKGSEQVAVKVLSQSSTQGSKEFKAEVDL 611

Query: 88  LGRLHHRHLLTLRGFST-GHKRLLVFDEIGNGSLRDHLNDPL-RTPLNWRARLQIAASVA 147
           L R+HH +L++L G+   G    LV++ + NG L+ HL+     + +NW  RL+IA   A
Sbjct: 612 LLRVHHTNLVSLVGYCCEGDYLALVYEFLPNGDLKQHLSGKGGNSIINWSIRLRIALEAA 671

Query: 148 AALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISDVGFLYSPVSSGDQADASKAD---D 207
             LEYL +   PPM H  + +  I+LDENF AK++D G   S    G+  +++       
Sbjct: 672 LGLEYLHIGCTPPMVHRDVKTANILLDENFKAKLADFGLSRSFQGEGESQESTTIAGTLG 731

Query: 208 FMDEKC---GNI-----IYQLGILILELITGQS-SDGTSGD--LIKWIQGKNFARSMNKM 267
           ++D +C   G +     +Y  GI++LE+IT Q   + TSGD  + +W+  +     + ++
Sbjct: 732 YLDPECYHSGRLGEKSDVYSFGIVLLEMITNQPVINQTSGDSHITQWVGFQMNRGDILEI 791

Query: 268 IDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKVDVTST 300
           +DP+L   ++       L +A  C       R S+ Q+   L+  +   +T
Sbjct: 792 MDPNLRKDYNINSAWRALELAMSCAYPSSSKRPSMSQVIHELKECIACENT 842

BLAST of CmaCh20G001420 vs. TrEMBL
Match: A0A0A0KJY4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G426910 PE=4 SV=1)

HSP 1 Score: 526.9 bits (1356), Expect = 1.6e-146
Identity = 262/299 (87.63%), Positives = 279/299 (93.31%), Query Frame = 1

Query: 1   MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSYREIKRATGGFNRVVYTNPQCAAYNARFQ 60
           MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSY+EIKRATGGFNRVVYTNP+ AAYNA+FQ
Sbjct: 29  MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSYKEIKRATGGFNRVVYTNPRSAAYNAKFQ 88

Query: 61  DGSVALVKELRALNDDVFKTEVQLLGRLHHRHLLTLRGFST-GHKRLLVFDEIGNGSLRD 120
           DG VALVKE RALND++F TEVQLLGRLHHRHLLTLRGFST GHKRLLVFDEIGNGSLRD
Sbjct: 89  DGRVALVKEQRALNDNLFYTEVQLLGRLHHRHLLTLRGFSTAGHKRLLVFDEIGNGSLRD 148

Query: 121 HLNDPLRTPLNWRARLQIAASVAAALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISDV 180
            LNDPLRTPLNWR RLQIAA VAAALEYLLLFT PPMCHVSISS+TIMLDENFTAKISDV
Sbjct: 149 LLNDPLRTPLNWRMRLQIAAGVAAALEYLLLFTHPPMCHVSISSSTIMLDENFTAKISDV 208

Query: 181 GFLYSPVSSGDQADASKADDFMDEKCGNIIYQLGILILELITGQSSDGTSGDLIKWIQGK 240
           GFL SPV+    +DA+K+DDF+DEK GNIIYQLG+LILELITGQSSDGT  DLIKWIQG 
Sbjct: 209 GFLCSPVNITGYSDATKSDDFVDEKSGNIIYQLGVLILELITGQSSDGTGADLIKWIQGT 268

Query: 241 NFARSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKVDVTS 299
           NFARSMNKMIDPDLGNSFD K+VRNLLS+AKLC+KSREKPRFSI QIFRYLQSKVD++S
Sbjct: 269 NFARSMNKMIDPDLGNSFDYKDVRNLLSVAKLCIKSREKPRFSIAQIFRYLQSKVDLSS 327

BLAST of CmaCh20G001420 vs. TrEMBL
Match: A0A140G4K3_9ROSI (LRR-RLK OS=Vernicia montana PE=2 SV=1)

HSP 1 Score: 404.8 bits (1039), Expect = 9.0e-110
Identity = 198/298 (66.44%), Positives = 245/298 (82.21%), Query Frame = 1

Query: 1   MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSYREIKRATGGFNRVVYTNPQCAAYNARFQ 60
           MD  I + R  LLAW HRSRS PI +VRRFSY++IKRAT GF+R++Y++ Q  AY A+FQ
Sbjct: 1   MDRLIRKIRPHLLAWLHRSRSGPISFVRRFSYKDIKRATDGFHRILYSDSQGTAYRAKFQ 60

Query: 61  DGSVALVKELRALND--DVFKTEVQLLGRLHHRHLLTLRGFSTGHKRLLVFDEIGNGSLR 120
           DG +ALVKE++ L+   DVF  +VQLLGRLHHRHLL L+GFSTGHKRLLVFD I NGSL+
Sbjct: 61  DGDIALVKEVKDLSQEKDVFIRQVQLLGRLHHRHLLALKGFSTGHKRLLVFDNIENGSLK 120

Query: 121 DHLNDPLRTPLNWRARLQIAASVAAALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISD 180
           +HLNDPLRTPLNW+ RL+IA  V AALEYLLLF++PPM HVSISS+ IMLDENFTAK+S+
Sbjct: 121 EHLNDPLRTPLNWKTRLRIAIGVVAALEYLLLFSNPPMYHVSISSSNIMLDENFTAKLSN 180

Query: 181 VGFLYSPVSSGDQADASKADDFMDEKCGNIIYQLGILILELITGQSSDGTSGDLIKWIQG 240
           VG L S  +      AS A+D M++ CGNII+QLG+LILELITGQSS+  S DL++WIQG
Sbjct: 181 VGLLSSIENYVTMPQASCAEDCMNQNCGNIIFQLGVLILELITGQSSEKGSTDLVQWIQG 240

Query: 241 KNFARSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKVDV 297
             F  S+ KMIDPDLGNS+D++E++NLL++A+LC+KS +KP+FSIPQIFRYLQ KVD+
Sbjct: 241 SRFGSSIQKMIDPDLGNSYDSRELKNLLAVARLCIKSGDKPKFSIPQIFRYLQKKVDI 298

BLAST of CmaCh20G001420 vs. TrEMBL
Match: A0A061GVS0_THECC (Kinase superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_041210 PE=4 SV=1)

HSP 1 Score: 391.7 bits (1005), Expect = 7.8e-106
Identity = 187/298 (62.75%), Positives = 241/298 (80.87%), Query Frame = 1

Query: 1   MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSYREIKRATGGFNRVVYTNPQCAAYNARFQ 60
           MDP I + R RLLAW HRSRS PI ++++FSY+++KRAT GF+R++Y+N + AAY A+F+
Sbjct: 1   MDPVIRKLRFRLLAWLHRSRSGPISFMKKFSYKDVKRATDGFHRIIYSNSRGAAYKAKFE 60

Query: 61  DGSVALVKELRALND--DVFKTEVQLLGRLHHRHLLTLRGFSTGHKRLLVFDEIGNGSLR 120
            G VALVKE RA ++  D F  EVQ LGRLHHRHLL LRGFSTGHKRLLVFD I NGSL+
Sbjct: 61  GGEVALVKEARAFDEGIDNFYREVQFLGRLHHRHLLALRGFSTGHKRLLVFDNIENGSLK 120

Query: 121 DHLNDPLRTPLNWRARLQIAASVAAALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISD 180
           +H NDPLRTPLNW+ARLQIA  VAAALEYLLLF++PP+ HVSISS+ IM DENFTAK+SD
Sbjct: 121 EHFNDPLRTPLNWKARLQIAVGVAAALEYLLLFSNPPVYHVSISSSNIMFDENFTAKLSD 180

Query: 181 VGFLYSPVSSGDQADASKADDFMDEKCGNIIYQLGILILELITGQSSDGTSGDLIKWIQG 240
           VG L S  +  +    S +++ + ++CGNII+QLG+LILELITGQSS+    DLI+W+QG
Sbjct: 181 VGLLSSVGTYVEMPHPSCSEECLGQECGNIIFQLGVLILELITGQSSEQGGTDLIQWVQG 240

Query: 241 KNFARSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKVDV 297
              + S++ MIDPDLGN++D++E++ LLS+A+LC+KS+  P+F IPQ+FRYLQ KVD+
Sbjct: 241 SRLSSSIHMMIDPDLGNNYDSRELKKLLSVARLCIKSKNNPKFPIPQVFRYLQKKVDI 298

BLAST of CmaCh20G001420 vs. TrEMBL
Match: A0A0D2QPP3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_001G126300 PE=4 SV=1)

HSP 1 Score: 384.8 bits (987), Expect = 9.6e-104
Identity = 186/298 (62.42%), Positives = 237/298 (79.53%), Query Frame = 1

Query: 1   MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSYREIKRATGGFNRVVYTNPQCAAYNARFQ 60
           MDP I + R RLLAW HRSRS PIL+V++FSY+++KRAT GF+R+VY+N   AAY A F+
Sbjct: 1   MDPMIRKLRFRLLAWLHRSRSGPILFVKKFSYKDVKRATDGFHRIVYSNSHGAAYKANFE 60

Query: 61  DGSVALVKELRALND--DVFKTEVQLLGRLHHRHLLTLRGFSTGHKRLLVFDEIGNGSLR 120
            G VALVKE RA ++  + F  EVQ LGRLHHRHLL+LRGFSTG KRLLVFD I NGSL+
Sbjct: 61  GGEVALVKEARAFDEGKESFYREVQFLGRLHHRHLLSLRGFSTGQKRLLVFDNIENGSLK 120

Query: 121 DHLNDPLRTPLNWRARLQIAASVAAALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISD 180
           +H NDPLRTPLNW+ARLQIA  VAAALEYLLLF++PP+ HVSISS+ IMLDENFTAK+SD
Sbjct: 121 EHFNDPLRTPLNWKARLQIAVGVAAALEYLLLFSNPPVYHVSISSSNIMLDENFTAKLSD 180

Query: 181 VGFLYSPVSSGDQADASKADDFMDEKCGNIIYQLGILILELITGQSSDGTSGDLIKWIQG 240
           VG L S  +      +S +++ MD++CGNI+YQLG+LILELITGQSS+    DLI+W+QG
Sbjct: 181 VGLLSSIGTYVQMPHSSCSEECMDQECGNIVYQLGVLILELITGQSSEKGGTDLIQWVQG 240

Query: 241 KNFARSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKVDV 297
              + S++ MIDPDLGN++D  E++ LL +A+LC+KS+  P+F + Q+FR+LQ KV +
Sbjct: 241 SRLSSSIHMMIDPDLGNNYDAGELKKLLVVARLCIKSKSNPKFPVSQVFRFLQKKVHI 298

BLAST of CmaCh20G001420 vs. TrEMBL
Match: A0A067L6U5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23910 PE=4 SV=1)

HSP 1 Score: 384.4 bits (986), Expect = 1.3e-103
Identity = 190/300 (63.33%), Positives = 234/300 (78.00%), Query Frame = 1

Query: 1   MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSYREIKRATGGFNRVVYTNPQCAAYNARFQ 60
           MD  I + R  LLAW HRSR  P+ +VRRFSY +IKRAT GF+R++Y++    AY A+F+
Sbjct: 1   MDRLIRKIRPHLLAWLHRSRFGPVSFVRRFSYEDIKRATDGFHRILYSDSNGTAYRAKFK 60

Query: 61  DGSVALVKELRALND--DVFKTEVQLLGRLHHRHLLTLRGFSTGHKRLLVFDEIGNGSLR 120
           DG +ALVKE++ L+   DVF  +VQLLGRLHHRHLL L+GFS G KRLLVFD I NGSL+
Sbjct: 61  DGDIALVKEVKDLSQEKDVFYRQVQLLGRLHHRHLLALKGFSNGRKRLLVFDNIENGSLK 120

Query: 121 DHLNDPLRTPLNWRARLQIAASVAAALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISD 180
           +HLNDPLRTPLNW+ RL+IA  V AALEYLLLF++PPM HVSISS+ IMLDENFTAK+S 
Sbjct: 121 EHLNDPLRTPLNWKTRLRIAIGVVAALEYLLLFSNPPMYHVSISSSNIMLDENFTAKLSS 180

Query: 181 VGFLYSPVSSGDQADASKADDFMDEKCGNIIYQLGILILELITGQSSDGTSGDLIKWIQG 240
           V  L S  +      AS   D M++ CGNII+QLG+LILELITGQSS+  S DLI+WIQG
Sbjct: 181 VSLLSSVENYVTTPHASCTGDCMNQSCGNIIFQLGVLILELITGQSSEKGSTDLIQWIQG 240

Query: 241 KNFARSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKVDVTS 299
             F  S+  MIDPDLGNS+D+KE++NLL++A+LC+KS  KP+FSIPQ+FRYLQ K D+ S
Sbjct: 241 SRFRSSIQNMIDPDLGNSYDSKELKNLLAVARLCIKSGNKPKFSIPQMFRYLQKKFDILS 300

BLAST of CmaCh20G001420 vs. TAIR10
Match: AT5G22050.2 (AT5G22050.2 Protein kinase superfamily protein)

HSP 1 Score: 325.9 bits (834), Expect = 2.7e-89
Identity = 169/298 (56.71%), Positives = 218/298 (73.15%), Query Frame = 1

Query: 7   RFRHRLLAWFHRSRSVPILYVRRFSYREIKRATGGFNRVVYTNPQCAAYNARFQDGSVAL 66
           R R  LLAW  RSRS  I ++RRF Y+EI +AT GF +V+YTN   +AY A+F+ G VAL
Sbjct: 10  RIRFLLLAWLRRSRSGRIEFIRRFGYKEIIKATEGFRKVIYTNYHGSAYRAKFKGGEVAL 69

Query: 67  VKELRALN--DDVFKTEVQLLGRLHHRHLLTLRGFSTGHKRLLVFDEIGNGSLRDHLNDP 126
           VKEL AL+   + F  EVQLLGRL HRHLLTLRGF  G KRLLVFD I NGSL++HLNDP
Sbjct: 70  VKELTALDLGRERFDEEVQLLGRLRHRHLLTLRGFCIGRKRLLVFDNIENGSLKEHLNDP 129

Query: 127 LRTPLNWRARLQIAASVAAALEYLLLFT--DPPMCHVSISSNTIMLDENFTAKISDVGFL 186
           L+TPLNW+ R+QIA  VAAALEYLL+F+  D  +  VS++S  IMLDENFT KISD+   
Sbjct: 130 LKTPLNWKTRIQIAIGVAAALEYLLVFSSNDAQIYDVSVNSCNIMLDENFTPKISDIRVN 189

Query: 187 YSPVSSGDQA-DASKADDFMDEKCGNIIYQLGILILELITGQSSDGTSGDLIKWIQGKNF 246
             P +      D+       DE+CGN+I+QLG+L+LELITGQSSD    DLI+W+Q    
Sbjct: 190 RHPKNHPKATHDSCSEGSCADEECGNVIFQLGVLMLELITGQSSDRQGNDLIEWVQDSCI 249

Query: 247 ARSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSR-EKPRFSIPQIFRYLQSKVDVTS 299
           A S++KMIDPDLGN++ ++E++ +L++A+LC+K+R E P FSI  ++RYLQ K+DV +
Sbjct: 250 ANSIDKMIDPDLGNNYSSRELQKVLAVARLCIKTRYEPPSFSITHVYRYLQKKIDVAT 307

BLAST of CmaCh20G001420 vs. TAIR10
Match: AT3G19300.1 (AT3G19300.1 Protein kinase superfamily protein)

HSP 1 Score: 174.9 bits (442), Expect = 7.6e-44
Identity = 99/288 (34.38%), Positives = 165/288 (57.29%), Query Frame = 1

Query: 28  RRFSYREIKRATGGFNRVVYTNPQCAAYNARFQDGSVALVKELRALND---DVFKTEVQL 87
           R+FSY+EI++AT  FN V+        Y A F +G VA VK++   ++   D F  E++L
Sbjct: 314 RKFSYKEIRKATEDFNAVIGRGGFGTVYKAEFSNGLVAAVKKMNKSSEQAEDEFCREIEL 373

Query: 88  LGRLHHRHLLTLRGF-STGHKRLLVFDEIGNGSLRDHLNDPLRTPLNWRARLQIAASVAA 147
           L RLHHRHL+ L+GF +  ++R LV++ + NGSL+DHL+   ++PL+W +R++IA  VA 
Sbjct: 374 LARLHHRHLVALKGFCNKKNERFLVYEYMENGSLKDHLHSTEKSPLSWESRMKIAIDVAN 433

Query: 148 ALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISDVG---------FLYSPVSSGDQADA 207
           ALEYL  + DPP+CH  I S+ I+LDE+F AK++D G           + PV++  +   
Sbjct: 434 ALEYLHFYCDPPLCHRDIKSSNILLDEHFVAKLADFGLAHASRDGSICFEPVNTDIRGTP 493

Query: 208 SKAD------DFMDEKCGNIIYQLGILILELITGQSSDGTSGDLIKWIQGKNFARSMN-K 267
              D        + EK  + +Y  G+++LE+ITG+ +     +L++  Q    + S    
Sbjct: 494 GYVDPEYVVTHELTEK--SDVYSYGVVLLEIITGKRAVDEGRNLVELSQPLLVSESRRID 553

Query: 268 MIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKVD 296
           ++DP + +  D +++  ++++ + C +     R SI Q+ R L    D
Sbjct: 554 LVDPRIKDCIDGEQLETVVAVVRWCTEKEGVARPSIKQVLRLLYESCD 599

BLAST of CmaCh20G001420 vs. TAIR10
Match: AT1G49730.1 (AT1G49730.1 Protein kinase superfamily protein)

HSP 1 Score: 171.0 bits (432), Expect = 1.1e-42
Identity = 103/291 (35.40%), Positives = 165/291 (56.70%), Query Frame = 1

Query: 28  RRFSYREIKRATGGFNRVVYTNPQCAAYNARFQDGSVALVKELRALNDDV---FKTEVQL 87
           R+FSY+E+  AT  FN V+        Y A F DG +A VK++  +++     F  E+ L
Sbjct: 345 RKFSYKEMTNATNDFNTVIGQGGFGTVYKAEFNDGLIAAVKKMNKVSEQAEQDFCREIGL 404

Query: 88  LGRLHHRHLLTLRGFSTGHK-RLLVFDEIGNGSLRDHLNDPLRTPLNWRARLQIAASVAA 147
           L +LHHR+L+ L+GF    K R LV+D + NGSL+DHL+   + P +W  R++IA  VA 
Sbjct: 405 LAKLHHRNLVALKGFCINKKERFLVYDYMKNGSLKDHLHAIGKPPPSWGTRMKIAIDVAN 464

Query: 148 ALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISDVGFLYS---------PVSSGDQADA 207
           ALEYL  + DPP+CH  I S+ I+LDENF AK+SD G  +S         PV++  +   
Sbjct: 465 ALEYLHFYCDPPLCHRDIKSSNILLDENFVAKLSDFGLAHSSRDGSVCFEPVNTDIRGTP 524

Query: 208 SKAD------DFMDEKCGNIIYQLGILILELITGQSSDGTSGDLIKWIQGKNFARSMN-K 267
              D        + EK  + +Y  G+++LELITG+ +     +L++  Q    A+S + +
Sbjct: 525 GYVDPEYVVTQELTEK--SDVYSYGVVLLELITGRRAVDEGRNLVEMSQRFLLAKSKHLE 584

Query: 268 MIDPDLGNSFDN---KEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKVD 296
           ++DP + +S ++   K++  ++++ +LC +   + R SI Q+ R L    D
Sbjct: 585 LVDPRIKDSINDAGGKQLDAVVTVVRLCTEKEGRSRPSIKQVLRLLCESCD 633

BLAST of CmaCh20G001420 vs. TAIR10
Match: AT4G20450.1 (AT4G20450.1 Leucine-rich repeat protein kinase family protein)

HSP 1 Score: 131.0 bits (328), Expect = 1.3e-30
Identity = 90/294 (30.61%), Positives = 140/294 (47.62%), Query Frame = 1

Query: 19  SRSVPILYVRRFSYREIKRATGGFNRVVYTNPQCAAYNARFQDGSVALVK---ELRALND 78
           SRS  +   R ++Y E+   T  F R +        Y+    D     VK   E  A   
Sbjct: 570 SRSSMVANKRSYTYEEVAVITNNFERPLGEGGFGVVYHGNVNDNEQVAVKVLSESSAQGY 629

Query: 79  DVFKTEVQLLGRLHHRHLLTLRGF-STGHKRLLVFDEIGNGSLRDHLN-DPLRTPLNWRA 138
             FK EV LL R+HH +L+TL G+   G   +L+++ + NG+L+ HL+ +  R+PL+W  
Sbjct: 630 KQFKAEVDLLLRVHHINLVTLVGYCDEGQHLVLIYEYMSNGNLKQHLSGENSRSPLSWEN 689

Query: 139 RLQIAASVAAALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISDVGFLYS-PVSSGDQA 198
           RL+IAA  A  LEYL +   PPM H  I S  I+LD NF AK+ D G   S PV S    
Sbjct: 690 RLRIAAETAQGLEYLHIGCKPPMIHRDIKSMNILLDNNFQAKLGDFGLSRSFPVGSETHV 749

Query: 199 DASKA------------DDFMDEKCGNIIYQLGILILELITGQ---SSDGTSGDLIKWIQ 258
             + A             +++ EK  + ++  G+++LE+IT Q           + +W+ 
Sbjct: 750 STNVAGSPGYLDPEYYRTNWLTEK--SDVFSFGVVLLEIITSQPVIDQTREKSHIGEWVG 809

Query: 259 GKNFARSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQ 292
            K     +  ++DP +   +D+  +   L +A  CV      R ++ Q+   LQ
Sbjct: 810 FKLTNGDIKNIVDPSMNGDYDSSSLWKALELAMSCVSPSSSGRPNMSQVANELQ 861

BLAST of CmaCh20G001420 vs. TAIR10
Match: AT2G28250.1 (AT2G28250.1 Protein kinase superfamily protein)

HSP 1 Score: 129.8 bits (325), Expect = 2.8e-30
Identity = 91/305 (29.84%), Positives = 152/305 (49.84%), Query Frame = 1

Query: 29  RFSYREIKRATGGF--NRVVYTNPQCAAYNARFQDGSVALVKELRALNDD----VFKTEV 88
           +FSY E+++AT  F  N V+        Y  + +DG  A +K L     D    +F TEV
Sbjct: 197 QFSYTELEQATNKFSSNSVIGHGGSSCVYRGQLKDGKTAAIKRLNTPKGDDTDTLFSTEV 256

Query: 89  QLLGRLHHRHLLTLRGFSTGH-----KRLLVFDEIGNGSLRDHLNDPLRTPLNWRARLQI 148
           +LL RLHH H++ L G+ +       +RLLVF+ +  GSLRD L+  L   + W  R+ +
Sbjct: 257 ELLSRLHHYHVVPLIGYCSEFHGKHAERLLVFEYMSYGSLRDCLDGELGEKMTWNIRISV 316

Query: 149 AASVAAALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISDVGFLYSPVSSGDQADASK- 208
           A   A  LEYL     P + H  + S  I+LDEN+ AKI+D+G      S G Q+ +S  
Sbjct: 317 ALGAARGLEYLHEAAAPRILHRDVKSTNILLDENWHAKITDLGMAKCLSSDGLQSGSSSP 376

Query: 209 -----------ADDFMDEKCG---NIIYQLGILILELITGQ------SSDGTSGDLIKWI 268
                      A ++    C    + ++  G+++LELITG+      S++     L+ W 
Sbjct: 377 TTGLQGTFGYFAPEYAIAGCASQMSDVFSFGVVLLELITGRKPIQKPSNNKGEESLVIWA 436

Query: 269 --QGKNFARSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKV 300
             + ++  R + ++ DP L   F  +E++ +  +AK C+    + R ++ ++ + L +  
Sbjct: 437 VPRLQDSKRVIEELPDPRLNGKFAEEEMQIMAYLAKECLLLDPESRPTMREVVQILSTIT 496

BLAST of CmaCh20G001420 vs. NCBI nr
Match: gi|659097396|ref|XP_008449602.1| (PREDICTED: probable receptor-like protein kinase At1g49730 [Cucumis melo])

HSP 1 Score: 528.1 bits (1359), Expect = 1.0e-146
Identity = 260/299 (86.96%), Positives = 277/299 (92.64%), Query Frame = 1

Query: 1   MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSYREIKRATGGFNRVVYTNPQCAAYNARFQ 60
           MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSY+E+KRATGGFNRVVYTNP+ AAYNA+FQ
Sbjct: 1   MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSYKEVKRATGGFNRVVYTNPRSAAYNAKFQ 60

Query: 61  DGSVALVKELRALNDDVFKTEVQLLGRLHHRHLLTLRGFST-GHKRLLVFDEIGNGSLRD 120
           DG VALVKELRALND++F TEVQLLGRLHHRHLLTLRGFST GHKRLLVFDEIGNGSLRD
Sbjct: 61  DGRVALVKELRALNDNLFYTEVQLLGRLHHRHLLTLRGFSTAGHKRLLVFDEIGNGSLRD 120

Query: 121 HLNDPLRTPLNWRARLQIAASVAAALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISDV 180
            LNDPLRTPLNWR RLQIAA VAAALEYL LFT PPMCHVSISS+TIMLDENFTAKISD+
Sbjct: 121 LLNDPLRTPLNWRMRLQIAAGVAAALEYLFLFTHPPMCHVSISSSTIMLDENFTAKISDI 180

Query: 181 GFLYSPVSSGDQADASKADDFMDEKCGNIIYQLGILILELITGQSSDGTSGDLIKWIQGK 240
           GFL SPV+     DA K+DDF DEKCGNIIYQLG+LILELITGQSSDGT  DLI+WIQG 
Sbjct: 181 GFLCSPVNITGYPDAQKSDDFRDEKCGNIIYQLGVLILELITGQSSDGTGADLIRWIQGT 240

Query: 241 NFARSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKVDVTS 299
           NFARSMNKMIDPDLGNSFD K+VRNLLSIA+LC+KSREKPRFS+ QIFRYLQSKVD+TS
Sbjct: 241 NFARSMNKMIDPDLGNSFDYKDVRNLLSIARLCIKSREKPRFSVAQIFRYLQSKVDLTS 299

BLAST of CmaCh20G001420 vs. NCBI nr
Match: gi|449444823|ref|XP_004140173.1| (PREDICTED: probable receptor-like protein kinase At1g49730 [Cucumis sativus])

HSP 1 Score: 526.9 bits (1356), Expect = 2.2e-146
Identity = 262/299 (87.63%), Positives = 279/299 (93.31%), Query Frame = 1

Query: 1   MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSYREIKRATGGFNRVVYTNPQCAAYNARFQ 60
           MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSY+EIKRATGGFNRVVYTNP+ AAYNA+FQ
Sbjct: 29  MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSYKEIKRATGGFNRVVYTNPRSAAYNAKFQ 88

Query: 61  DGSVALVKELRALNDDVFKTEVQLLGRLHHRHLLTLRGFST-GHKRLLVFDEIGNGSLRD 120
           DG VALVKE RALND++F TEVQLLGRLHHRHLLTLRGFST GHKRLLVFDEIGNGSLRD
Sbjct: 89  DGRVALVKEQRALNDNLFYTEVQLLGRLHHRHLLTLRGFSTAGHKRLLVFDEIGNGSLRD 148

Query: 121 HLNDPLRTPLNWRARLQIAASVAAALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISDV 180
            LNDPLRTPLNWR RLQIAA VAAALEYLLLFT PPMCHVSISS+TIMLDENFTAKISDV
Sbjct: 149 LLNDPLRTPLNWRMRLQIAAGVAAALEYLLLFTHPPMCHVSISSSTIMLDENFTAKISDV 208

Query: 181 GFLYSPVSSGDQADASKADDFMDEKCGNIIYQLGILILELITGQSSDGTSGDLIKWIQGK 240
           GFL SPV+    +DA+K+DDF+DEK GNIIYQLG+LILELITGQSSDGT  DLIKWIQG 
Sbjct: 209 GFLCSPVNITGYSDATKSDDFVDEKSGNIIYQLGVLILELITGQSSDGTGADLIKWIQGT 268

Query: 241 NFARSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKVDVTS 299
           NFARSMNKMIDPDLGNSFD K+VRNLLS+AKLC+KSREKPRFSI QIFRYLQSKVD++S
Sbjct: 269 NFARSMNKMIDPDLGNSFDYKDVRNLLSVAKLCIKSREKPRFSIAQIFRYLQSKVDLSS 327

BLAST of CmaCh20G001420 vs. NCBI nr
Match: gi|1001910041|gb|AMM42975.1| (LRR-RLK [Vernicia montana])

HSP 1 Score: 404.8 bits (1039), Expect = 1.3e-109
Identity = 198/298 (66.44%), Positives = 245/298 (82.21%), Query Frame = 1

Query: 1   MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSYREIKRATGGFNRVVYTNPQCAAYNARFQ 60
           MD  I + R  LLAW HRSRS PI +VRRFSY++IKRAT GF+R++Y++ Q  AY A+FQ
Sbjct: 1   MDRLIRKIRPHLLAWLHRSRSGPISFVRRFSYKDIKRATDGFHRILYSDSQGTAYRAKFQ 60

Query: 61  DGSVALVKELRALND--DVFKTEVQLLGRLHHRHLLTLRGFSTGHKRLLVFDEIGNGSLR 120
           DG +ALVKE++ L+   DVF  +VQLLGRLHHRHLL L+GFSTGHKRLLVFD I NGSL+
Sbjct: 61  DGDIALVKEVKDLSQEKDVFIRQVQLLGRLHHRHLLALKGFSTGHKRLLVFDNIENGSLK 120

Query: 121 DHLNDPLRTPLNWRARLQIAASVAAALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISD 180
           +HLNDPLRTPLNW+ RL+IA  V AALEYLLLF++PPM HVSISS+ IMLDENFTAK+S+
Sbjct: 121 EHLNDPLRTPLNWKTRLRIAIGVVAALEYLLLFSNPPMYHVSISSSNIMLDENFTAKLSN 180

Query: 181 VGFLYSPVSSGDQADASKADDFMDEKCGNIIYQLGILILELITGQSSDGTSGDLIKWIQG 240
           VG L S  +      AS A+D M++ CGNII+QLG+LILELITGQSS+  S DL++WIQG
Sbjct: 181 VGLLSSIENYVTMPQASCAEDCMNQNCGNIIFQLGVLILELITGQSSEKGSTDLVQWIQG 240

Query: 241 KNFARSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKVDV 297
             F  S+ KMIDPDLGNS+D++E++NLL++A+LC+KS +KP+FSIPQIFRYLQ KVD+
Sbjct: 241 SRFGSSIQKMIDPDLGNSYDSRELKNLLAVARLCIKSGDKPKFSIPQIFRYLQKKVDI 298

BLAST of CmaCh20G001420 vs. NCBI nr
Match: gi|590586198|ref|XP_007015638.1| (Kinase superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 391.7 bits (1005), Expect = 1.1e-105
Identity = 187/298 (62.75%), Positives = 241/298 (80.87%), Query Frame = 1

Query: 1   MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSYREIKRATGGFNRVVYTNPQCAAYNARFQ 60
           MDP I + R RLLAW HRSRS PI ++++FSY+++KRAT GF+R++Y+N + AAY A+F+
Sbjct: 1   MDPVIRKLRFRLLAWLHRSRSGPISFMKKFSYKDVKRATDGFHRIIYSNSRGAAYKAKFE 60

Query: 61  DGSVALVKELRALND--DVFKTEVQLLGRLHHRHLLTLRGFSTGHKRLLVFDEIGNGSLR 120
            G VALVKE RA ++  D F  EVQ LGRLHHRHLL LRGFSTGHKRLLVFD I NGSL+
Sbjct: 61  GGEVALVKEARAFDEGIDNFYREVQFLGRLHHRHLLALRGFSTGHKRLLVFDNIENGSLK 120

Query: 121 DHLNDPLRTPLNWRARLQIAASVAAALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISD 180
           +H NDPLRTPLNW+ARLQIA  VAAALEYLLLF++PP+ HVSISS+ IM DENFTAK+SD
Sbjct: 121 EHFNDPLRTPLNWKARLQIAVGVAAALEYLLLFSNPPVYHVSISSSNIMFDENFTAKLSD 180

Query: 181 VGFLYSPVSSGDQADASKADDFMDEKCGNIIYQLGILILELITGQSSDGTSGDLIKWIQG 240
           VG L S  +  +    S +++ + ++CGNII+QLG+LILELITGQSS+    DLI+W+QG
Sbjct: 181 VGLLSSVGTYVEMPHPSCSEECLGQECGNIIFQLGVLILELITGQSSEQGGTDLIQWVQG 240

Query: 241 KNFARSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKVDV 297
              + S++ MIDPDLGN++D++E++ LLS+A+LC+KS+  P+F IPQ+FRYLQ KVD+
Sbjct: 241 SRLSSSIHMMIDPDLGNNYDSRELKKLLSVARLCIKSKNNPKFPIPQVFRYLQKKVDI 298

BLAST of CmaCh20G001420 vs. NCBI nr
Match: gi|743929248|ref|XP_011008851.1| (PREDICTED: probable receptor-like protein kinase At1g49730 isoform X2 [Populus euphratica])

HSP 1 Score: 385.2 bits (988), Expect = 1.1e-103
Identity = 190/299 (63.55%), Positives = 238/299 (79.60%), Query Frame = 1

Query: 1   MDPWIHRFRHRLLAWFHRSRSVPILYVRRFSYREIKRATGGFNRVVYTNPQCAAYNARFQ 60
           MD  I + R  LLAW HRSRS P   VRRFSY++IKRAT GF+R++Y+N   AAY ARFQ
Sbjct: 1   MDRLIRKIRPYLLAWHHRSRSSPESSVRRFSYKDIKRATDGFHRIIYSNSHGAAYRARFQ 60

Query: 61  DGSVALVKELRALND--DVFKTEVQLLGRLHHRHLLTLRGFSTGHKRLLVFDEIGNGSLR 120
           +G VALVKE++ LN   D F  EVQLLGRLHHRHLL L+GFSTGHKRLLV+D I  GSL+
Sbjct: 61  NGEVALVKEVKDLNQGKDNFLKEVQLLGRLHHRHLLALKGFSTGHKRLLVYDNIEMGSLK 120

Query: 121 DHLNDPLRTPLNWRARLQIAASVAAALEYLLLFTDPPMCHVSISSNTIMLDENFTAKISD 180
           +HLNDPL+TPLNWR RLQIA  VAAALEYLLLF++PP+ HVS+S++ IMLDEN+ AKISD
Sbjct: 121 EHLNDPLKTPLNWRTRLQIAIGVAAALEYLLLFSNPPIYHVSVSASNIMLDENYIAKISD 180

Query: 181 VGFLYSPVSSGDQADASKADDFMDEKCGNIIYQLGILILELITGQSSDGTSGDLIKWIQG 240
           VG + S  ++     +S ++D MD  CGN+ +QLG+LILELITGQSS+  S DLI+WIQ 
Sbjct: 181 VGLINSVGANVTVPHSSNSEDCMDHPCGNLTFQLGVLILELITGQSSENGSTDLIQWIQE 240

Query: 241 KNFARSMNKMIDPDLGNSFDNKEVRNLLSIAKLCVKSREKPRFSIPQIFRYLQSKVDVT 298
             +  S+ KMIDPDLGN++D++E++NLL++A+LC+KS +KP+FSIPQIFRYLQ K + T
Sbjct: 241 SRYRSSIQKMIDPDLGNNYDSRELKNLLAVARLCIKSGDKPKFSIPQIFRYLQKKAENT 299
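
The percentages in the HSP headers above follow directly from the residue counts (matches divided by alignment length). As a minimal sketch, assuming two-decimal rounding as in the reported values, the arithmetic can be reproduced in Python; the function name `percent` is a hypothetical helper for illustration, not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the two HSP headers above: (identities, positives, length).
hsps = {
    "XP_007015638.1": (187, 241, 298),
    "XP_011008851.1": (190, 238, 299),
}

for accession, (identities, positives, length) in hsps.items():
    print(accession, percent(identities, length), percent(positives, length))
```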

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
Y1497_ARATH  2.0e-41  35.40     Probable receptor-like protein kinase At1g49730 OS=Arabidopsis thaliana GN=At1g4...
Y4245_ARATH  2.2e-29  30.61     Probable LRR receptor-like serine/threonine-protein kinase At4g20450 OS=Arabidop...
NCRK_ARATH   5.0e-29  29.84     Receptor-like serine/threonine-protein kinase NCRK OS=Arabidopsis thaliana GN=NC...
PERK9_ARATH  1.2e-27  31.03     Proline-rich receptor-like protein kinase PERK9 OS=Arabidopsis thaliana GN=PERK9...
RLK7_ARATH   1.8e-26  29.55     Receptor-like protein kinase At5g59670 OS=Arabidopsis thaliana GN=At5g59670 PE=1...

Match Name        E-value   Identity  Description
A0A0A0KJY4_CUCSA  1.6e-146  87.63     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G426910 PE=4 SV=1
A0A140G4K3_9ROSI  9.0e-110  66.44     LRR-RLK OS=Vernicia montana PE=2 SV=1
A0A061GVS0_THECC  7.8e-106  62.75     Kinase superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_041210 PE=4 SV=1
A0A0D2QPP3_GOSRA  9.6e-104  62.42     Uncharacterized protein OS=Gossypium raimondii GN=B456_001G126300 PE=4 SV=1
A0A067L6U5_JATCU  1.3e-103  63.33     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23910 PE=4 SV=1

Match Name   E-value  Identity  Description
AT5G22050.2  2.7e-89  56.71     Protein kinase superfamily protein
AT3G19300.1  7.6e-44  34.38     Protein kinase superfamily protein
AT1G49730.1  1.1e-42  35.40     Protein kinase superfamily protein
AT4G20450.1  1.3e-30  30.61     Leucine-rich repeat protein kinase family protein
AT2G28250.1  2.8e-30  29.84     Protein kinase superfamily protein

Match Name                        E-value   Identity  Description
gi|659097396|ref|XP_008449602.1|  1.0e-146  86.96     PREDICTED: probable receptor-like protein kinase At1g49730 [Cucumis melo]
gi|449444823|ref|XP_004140173.1|  2.2e-146  87.63     PREDICTED: probable receptor-like protein kinase At1g49730 [Cucumis sativus]
gi|1001910041|gb|AMM42975.1|      1.3e-109  66.44     LRR-RLK [Vernicia montana]
gi|590586198|ref|XP_007015638.1|  1.1e-105  62.75     Kinase superfamily protein isoform 1 [Theobroma cacao]
gi|743929248|ref|XP_011008851.1|  1.1e-103  63.55     PREDICTED: probable receptor-like protein kinase At1g49730 isoform X2 [Populus e...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000719  Prot_kinase_dom
IPR001245  Ser-Thr/Tyr_kinase_cat_dom
IPR011009  Kinase-like_dom_sf

Vocabulary: Molecular Function
Term        Definition
GO:0004672  protein kinase activity
GO:0005524  ATP binding

Vocabulary: Biological Process
Term        Definition
GO:0006468  protein phosphorylation

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006468 protein phosphorylation
biological_process GO:0018108 peptidyl-tyrosine phosphorylation
cellular_component GO:0005886 plasma membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0005524 ATP binding
molecular_function GO:0004672 protein kinase activity
molecular_function GO:0004713 protein tyrosine kinase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh20G001420.1  CmaCh20G001420.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                                            Source   Source Term        Source Description             Alignment
IPR000719  Protein kinase domain                                      PROFILE  PS50011            PROTEIN_KINASE_DOM             coord: 32..300; score: 11
IPR001245  Serine-threonine/tyrosine-protein kinase catalytic domain  PFAM     PF07714            Pkinase_Tyr                    coord: 66..183; score: 1.8
IPR011009  Protein kinase-like domain                                 unknown  SSF56112           Protein kinase-like (PK-like)  coord: 22..290; score: 2.85
None       No IPR available                                           GENE3D   G3DSA:1.10.510.10  -                              coord: 119..290; score: 5.7
None       No IPR available                                           GENE3D   G3DSA:3.30.200.20  -                              coord: 55..118; score: 1.
None       No IPR available                                           PANTHER  PTHR27001          FAMILY NOT NAMED               coord: 1..296; score: 8.5E
None       No IPR available                                           PANTHER  PTHR27001:SF20     PROTEIN KINASE FAMILY PROTEIN  coord: 1..296; score: 8.5E

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                 Block
CmaCh20G001420  CmaCh02G013000  Cucurbita maxima (Rimu)  cmacmaB470