CmaCh20G005400 (gene) Cucurbita maxima (Rimu)

Name: CmaCh20G005400
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: ARM repeat superfamily protein, putative isoform 2
Location: Cma_Chr20 : 2601636 .. 2606935 (+)
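
The location field above can be turned into a feature length with a few lines of Python. This is a minimal sketch: the function names `parse_location` and `feature_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state explicitly).

```python
import re

# Pattern for strings such as "Cma_Chr20 : 2601636 .. 2606935 (+)".
LOCATION = re.compile(r"\s*(\S+?)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    m = LOCATION.fullmatch(text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def feature_length(start, end):
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cma_Chr20 : 2601636 .. 2606935 (+)")
print(seqid, strand, feature_length(start, end))
```

Under the inclusive-coordinate assumption the gene would span 5,300 bp on the plus strand of Cma_Chr20.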
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTATCTGGTCGAGTGCCGGAGAGGGAGAGGCGGAGTAGCTGGGGAAATTTCAGGCCACACGAATACGACGCTTCTGTCGTAGAAAATTTTGGTTGAAGACGACAGCGACACTATCAGCTAATCGTGTTCTTGAAACCTTGTTATGGAGGTGGGCTCAGATTCAGACCCTATTGAAGCGGAACTGGACCCAGAACTCGAATCTGTAGAAGGCGGTAAAGGACCTGCTCATCACCCTTCCGCTCCATTTGATGAGGTTTCTATTCAAACTTCTGAGCTTGTTTAATATTTTAATTAGCTTTTTATGGCTTCGTTTATTGCTCTGAAAATTTCTCTTGAAACTTGAGCTAGTCACTGTATTTGTTCTTTCAGCTGTGCTAACCTGGATTTTCTTTGGTACTTGACTTTACGGAGTAGCATTATACGATGAATTGGTATTACGATGGTGATAAGTTTGGTTGTGTTGTCATAGAAATTTTGTTATTGAGATAGCTAAATAAATCTCGAGACCGAGACAGTAGATTGAATTTCTTATCGACATCACTTTGCAGTTATTTGACATCTCAACGACGGTTGATCCTAGCTATATTATCTCTCTCATACGGAAACTTCTGCCATCCAATGCAAGTAACCTGCGCAATTCTTATGGAATTAGAGATGACGACGGTAACGCCTCAGTAACCAACATGGATGAAAGTGATGCCTATTTATCTGGCGACCAAGTATTAAGTTCTTCAGGAACAGTGAATGAATGCCAGGGCATTGAAATTGCGGATGGTTCTGACAAACTTGCTGATCGAGAAGGTGAGGATGAAGGTGCATGTCCTAGATCGGAGCAACGTATTTCATCATCAGAAGAAAATGTCTGGGAAGAGTATGGTTGCATTCTGTGGGATCTTTCTGCGAGTAAATCTCATGCAGAATTAATGGTACTTTAGTTTCCTTCCTACCCACACCCACACCCACACACTCTGTTCGTCCCACTTGTCTTGATAAGAGAAGAAGTGGGCTGATTAAATTATAAAACGTGTTCTTATTAGCACTGTAGTCATCATTATTGACCTTGCCAAGTCTCGTATTATACTCTTTCTTCCCTTGTTTCCGGTACCCTCCATATGTAGTTTAGCATTGACTTGCAAATAATATGATGCTTTTCAAATGGAACTGGTGACACTATGTTTTGAGTATTCTAGCAATAGTGTCACTATAGCCACCTGCAATTAAAACCGGTTTCATTTACGTAATCTTCATGTTTACATGTTTTCCAGGTTCAGAACCTCGTTCTTGAAGTTCTTTCTGCGAACCTGATGGTCTCACAATCTGTGCGTGTTATGGTATGTGACAACGTGATGTGTAATTTTAATAATCATTTTCCATTATGCGTTAGAAAGTGCATCATTATCTTTTATTTTCTTCAAAGAGATGGATAAACAATGTTGCTCTATTTTGAGGATATCAGTTGAGTACTGCCTCAGATCCATATTCAATCGATTACTTGCTGCAGGAGATTGTCCTTGGAATTATTGGAAACCTGGCCTGCCATGAAGTTCCCATGAAACATATAGTCGACAAGAGTGGATTGATTACAACCATTGTGAACCAGCTGTTTCTAGATGATGCTCAATGTTTATGTGAAGTTTGCAGGTAGCGTGAAAAAATACGCCTTTCTAATCAATTTATATAATGTCCAATTGCAAATTATATGCAACTTTCTTTTCCCAATATTTCATTTGGTTAGGTTGGCATTCATACCTGCCATACTATGTTTTAAGACTCATTAGGACAATTTTTGTTTACCATATTGGCCATTTATTGGTACCCCTTCTTCAGTTTTTGTGGCATTCAATTGAAATATGGTTTAGTGACAATTTTATTCAATCCTTCTACTTGCTCTGTAATATGCACATTTTGTTATGGTTATTACGACTAACAGGTTGTTAGATGCGGGTCTTCAAAGTAGCGAATGTGCCATATGGGCTGGGGCATTGAATTCTGAGCA
TGTTCTCTCTCGTATTCTATGGGTTTCTGAGAATACCTTAAATCCACAACTTATAGAAAAGGTATCAAAATCTATATCCGTGGAAGCAAAGAACATAATAGGTTACTGTAAATTATGGCATAATATAATTGCAAGCCAAAATATTTAGCTGGTCTGATGGCTTAAATAGTTTCTTTCTTGATCAGAGTGTTGGGTTATTATCAACCATCATTGAAAGTCAGCAAGAAGTTGTGCATGTTCTTCTCCCATGTTTGATGAAGCTGGGTTTGTCGAGTGCTTTGTTCAACCTTTTTTCTTTTGAGATGAAAATATTAACAAATGAAAGATCAGCTGAAAGGTATTAGTCTCCTTTGTCTATAGGTAATGTAGATTTGTGTGTAATATAAGTGCATTTAGATAAAAATATTAGTAATTATCTCTTAATTATTTTAAGATATACTTTCCTTTTATAATTAGCCTCCCAAGTCGGGGAGTTCATTCACCCAATGTATTGTATCAGGAAAGAAAATGTTATTTTAATTTGTATGCATCGACTGATAATCAATGAGTTTAGTGGTAGAGAAATTTGTAGGTTCGATTCATTATATTCATTAATCTAAGGTCTGGGTTCTTTGGTGGAAGTGCATTTAAGTTGGTTTAGGAATAATGGGTCATCAAAAAAGAAGGAAAAGGAGGAGAAAAAGGTAGTAACAATATGAAGAATCTATAACGGATCATGTAGAGTTTTGTATGTTCGTAGTCATGGCAGCTCCAACCTGCATTGTATGACGTGGTTTATGATGCTTGTAAATATGAATAAAAAATGATGATTAGCGAAGCTTTTTTGAGTCCATCAGTCCTCTCACAAAATTCAATTTCAGGTATTCAATTTTGGACGCGATTCTTCGTGCAGTTGAAGCACTCTCTGGAATTGAAGAGCATTCTCAAGAATTTTGTTCAAACAAAAAACTTTTTCAGCTTGTTTGTGAACTAGTCAAATTGCCAGATGCATTTGAGGTACAGTTCAGGTACCTAAAGTAAGAGTAGAGCTGTGACCTTCTAATAATAGGAATTGGCGTATGTATTAACATTTCCATTTATTACCTCAGAATTATTTAATTTACTTTTATCAGCGTGATAAATTGCATAACTTGTTAAATGTGTCCATTTGCCACCCGACAAATTATGTAGTCCTCAAATGTGGATATGGCTGTATATCATGTGATCTACCACCTTTCCTGTCGTTTAATTTGTTGCTAGAAGATGCTGCAGTTTTCTTTCTTCGGCTGTTTTTTGTTGTATATTTTAAACTACTTCTATGTTCTGTGATTGACGGCATGCTACAAATATATTTTTCTTTCACGATCAAGAAACCCTTCCGTCTCTGAAATGGACATATTTCTTTCCATAGGTTTCAAGTTCTTGTATCAGTGCTGTGATTTTGATTGCAAATATTCTGTCAGATCTACCTGATCTAGCCTTTGACATGAGTCAGGGTAAGTTAAGTTCATTATATTTCTTATTGCCAAGTTTCATTCATCACCTTATGTGTACTTAATACTTCTGGTCTATCCAGATTTGTCTTTCCTACAAGGTCTACTTGATATATTCTCTTTTGCTGGGGATGACCTAGAAGCCCGTGATGCTGTTTGGAGCATCATTGCCAGGATATTGGTGCATGTTGAAGAAAAGGCGATGAGCAGACCAAGGATGTTTGAGTGCGTGTCGTTACTAGTGAGTAAGACTGATCTCATCGAGGATGATCTTCTAGACCACCGTATGACTGAATTGAATAAAAAAGAGGATGGATTGACCTCTGCCTGCACAAAATCAAACTCTAGATGTATATCCGTATCCTGTTCTTTGAGTTTGTTTTACTAGCCTACTGTTATTTATTGCTCAATGTAGGCAATTCATTTTCAATTTTTGATGGGTCATTTTCTTTCATAAATGAACTTTGTGTTGTCCTTTACTCATCCCTTAGTTAGGAAGGATCATTGCTATCTTAAATTGTTGGGCTG
CTTCAAAGGATGAAGGGACAGATGTTAGAGACGAATATCGTGCAGAAGATATCGATGTGAATAGATTGTTGAGTTGTTGCTGTAAACATTCTGAGTAAGATCTATTTCAATTTCTATGTTCATTATGAACTCTCTTCTTCTGCAATGCCTGATTTAAGTGCTACGTTTTTTCTCCTAACTCAGATGGCTGGGAAGTTAGCAGAGACGGAAATCTACACTTCATCACCCATAGCGTTGCTGTTAACCACTTTCAGTACGTACACTAAAGTGGTAATCGATCTTTTCTGCGGAATTTCAATCATGATTGTTTTCGCTAATCACTGTAGATTTAGTACTGATTCGGTGCTTTTGTTTATCTTTTTGGTTTCAGCATCACCATTTTGCATGCAGTCCTGGATATTATTTTGAATCATTATCAACTGAAAGAAAGGTCAATGCATATATTTACTGAATGGAGCATCGTTTGTGGCTGTACTATGCAACCGGGGTCAAGTCTGTTTGCTGGATGATGTTTTATTGGTCTCAGGCTCCACATAGTCCCCACCAACTTCCAAAATCTTGCAAAAATGTTTGGTTGTATACTACTTGCCTTATAAATGGTTTGAAGATGGTTTTAGGACCAAAATTTAGCTGTGATCGGCTATATCTATAAAATCACATATTATTCAACGATCACTACATTGGCACATCGTGCTCAATTAATTGCCTGCTAACTTCATTGGTTCCTTCCCACCCATCTTTTTGAGCAATTTTCATGATCTAAATTTAGGTGTTGTCAGGTTAGGCCTATTACTAATTTGCTCTTTTCATACCTGCGTTACGTCTCATTTTCACTTTGATGCATAATTTCTGGTGTCTGCGCCGTCCCCTTTATATAACAACATTGGCTCTTCTTCTCACACAGTTTAGTTGCGTTTGCATATGTATGCTTGAAGGAGTGTTCATTTTCCCCTCTAAATTTGACCGAAAATGTGGTCAATGTCAGAAAAAGCAGGTATCAAGTGGGCAATGTGTTTGCCAACGTCATTAGTGATGGCTCTCAGGATTGCTTACCTTTTGTTTTCGGCAACAATTGTTTTCCTTTCGTGTCTTTTCTTGCACCAAATCATCCAATGATTCATCAGCAGGTTGTCTCATAACTTCATCTCATTGTCAAGTAAGTAATTAATCGTTATGAAATCAAGCCTTGTATATTGGCTGATTTTCTTATTTAAATTTCCTGATCTTTTATATTGGAAATCTTTAATGAATAAGATGCATTATTGTCTAGATTTGGATCGTGGTCAGGTTTAGAAAGTAG

mRNA sequence

ATGGGTATCTGGTCGAGTGCCGGAGAGGGAGAGGCGGAGTAGCTGGGGAAATTTCAGGCCACACGAATACGACGCTTCTGTCGTAGAAAATTTTGGTTGAAGACGACAGCGACACTATCAGCTAATCGTGTTCTTGAAACCTTGTTATGGAGGTGGGCTCAGATTCAGACCCTATTGAAGCGGAACTGGACCCAGAACTCGAATCTGTAGAAGGCGGTAAAGGACCTGCTCATCACCCTTCCGCTCCATTTGATGAGTTATTTGACATCTCAACGACGGTTGATCCTAGCTATATTATCTCTCTCATACGGAAACTTCTGCCATCCAATGCAAGTAACCTGCGCAATTCTTATGGAATTAGAGATGACGACGGTAACGCCTCAGTAACCAACATGGATGAAAGTGATGCCTATTTATCTGGCGACCAAGTATTAAGTTCTTCAGGAACAGTGAATGAATGCCAGGGCATTGAAATTGCGGATGGTTCTGACAAACTTGCTGATCGAGAAGGTGAGGATGAAGGTGCATGTCCTAGATCGGAGCAACGTATTTCATCATCAGAAGAAAATGTCTGGGAAGAGTATGGTTGCATTCTGTGGGATCTTTCTGCGAGTAAATCTCATGCAGAATTAATGGTTCAGAACCTCGTTCTTGAAGTTCTTTCTGCGAACCTGATGGTCTCACAATCTGTGCGTGTTATGGAGATTGTCCTTGGAATTATTGGAAACCTGGCCTGCCATGAAGTTCCCATGAAACATATAGTCGACAAGAGTGGATTGATTACAACCATTGTGAACCAGCTGTTTCTAGATGATGCTCAATGTTTATGTGAAGTTTGCAGGTTGTTAGATGCGGGTCTTCAAAGTAGCGAATGTGCCATATGGGCTGGGGCATTGAATTCTGAGCATGTTCTCTCTCGTATTCTATGGGTTTCTGAGAATACCTTAAATCCACAACTTATAGAAAAGAGTGTTGGGTTATTATCAACCATCATTGAAAGTCAGCAAGAAGTTGTGCATGTTCTTCTCCCATGTTTGATGAAGCTGGGTTTGTCGAGTGCTTTGTTCAACCTTTTTTCTTTTGAGATGAAAATATTAACAAATGAAAGATCAGCTGAAAGGTATTCAATTTTGGACGCGATTCTTCGTGCAGTTGAAGCACTCTCTGGAATTGAAGAGCATTCTCAAGAATTTTGTTCAAACAAAAAACTTTTTCAGCTTGTTTGTGAACTAGTCAAATTGCCAGATGCATTTGAGGTTTCAAGTTCTTGTATCAGTGCTGTGATTTTGATTGCAAATATTCTGTCAGATCTACCTGATCTAGCCTTTGACATGAGTCAGGATTTGTCTTTCCTACAAGGTCTACTTGATATATTCTCTTTTGCTGGGGATGACCTAGAAGCCCGTGATGCTGTTTGGAGCATCATTGCCAGGATATTGGTGCATGTTGAAGAAAAGGCGATGAGCAGACCAAGGATGTTTGAGTGCGTGTCGTTACTAGTGAGTAAGACTGATCTCATCGAGGATGATCTTCTAGACCACCGTATGACTGAATTGAATAAAAAAGAGGATGGATTGACCTCTGCCTGCACAAAATCAAACTCTAGATGTATATCCTTAGGAAGGATCATTGCTATCTTAAATTGTTGGGCTGCTTCAAAGGATGAAGGGACAGATGTTAGAGACGAATATCGTGCAGAAGATATCGATGTGAATAGATTGTTGAGTTGTTGCTGTAAACATTCTGAATGGCTGGGAATACGTACACTAAAGTGCATCACCATTTTGCATGCAGTCCTGGATATTATTTTGAATCATTATCAACTGAAAGAAAGGTCAATGCATATATTTACTGAATGGAGCATCGTTTGTGGCTGTACTATGCAACCGGGGTCAAGATTGCTTACCTTTTGTTTTCGGCAACAATTGTTTTCCTTTCGTGTCTTTTCTTGCACCAAATCATCCAATGATTCATCAGCAGGTTGTCTCATAACTTCATCT
CATTGTCAAATTTGGATCGTGGTCAGGTTTAGAAAGTAG

Coding sequence (CDS)

ATGGAGGTGGGCTCAGATTCAGACCCTATTGAAGCGGAACTGGACCCAGAACTCGAATCTGTAGAAGGCGGTAAAGGACCTGCTCATCACCCTTCCGCTCCATTTGATGAGTTATTTGACATCTCAACGACGGTTGATCCTAGCTATATTATCTCTCTCATACGGAAACTTCTGCCATCCAATGCAAGTAACCTGCGCAATTCTTATGGAATTAGAGATGACGACGGTAACGCCTCAGTAACCAACATGGATGAAAGTGATGCCTATTTATCTGGCGACCAAGTATTAAGTTCTTCAGGAACAGTGAATGAATGCCAGGGCATTGAAATTGCGGATGGTTCTGACAAACTTGCTGATCGAGAAGGTGAGGATGAAGGTGCATGTCCTAGATCGGAGCAACGTATTTCATCATCAGAAGAAAATGTCTGGGAAGAGTATGGTTGCATTCTGTGGGATCTTTCTGCGAGTAAATCTCATGCAGAATTAATGGTTCAGAACCTCGTTCTTGAAGTTCTTTCTGCGAACCTGATGGTCTCACAATCTGTGCGTGTTATGGAGATTGTCCTTGGAATTATTGGAAACCTGGCCTGCCATGAAGTTCCCATGAAACATATAGTCGACAAGAGTGGATTGATTACAACCATTGTGAACCAGCTGTTTCTAGATGATGCTCAATGTTTATGTGAAGTTTGCAGGTTGTTAGATGCGGGTCTTCAAAGTAGCGAATGTGCCATATGGGCTGGGGCATTGAATTCTGAGCATGTTCTCTCTCGTATTCTATGGGTTTCTGAGAATACCTTAAATCCACAACTTATAGAAAAGAGTGTTGGGTTATTATCAACCATCATTGAAAGTCAGCAAGAAGTTGTGCATGTTCTTCTCCCATGTTTGATGAAGCTGGGTTTGTCGAGTGCTTTGTTCAACCTTTTTTCTTTTGAGATGAAAATATTAACAAATGAAAGATCAGCTGAAAGGTATTCAATTTTGGACGCGATTCTTCGTGCAGTTGAAGCACTCTCTGGAATTGAAGAGCATTCTCAAGAATTTTGTTCAAACAAAAAACTTTTTCAGCTTGTTTGTGAACTAGTCAAATTGCCAGATGCATTTGAGGTTTCAAGTTCTTGTATCAGTGCTGTGATTTTGATTGCAAATATTCTGTCAGATCTACCTGATCTAGCCTTTGACATGAGTCAGGATTTGTCTTTCCTACAAGGTCTACTTGATATATTCTCTTTTGCTGGGGATGACCTAGAAGCCCGTGATGCTGTTTGGAGCATCATTGCCAGGATATTGGTGCATGTTGAAGAAAAGGCGATGAGCAGACCAAGGATGTTTGAGTGCGTGTCGTTACTAGTGAGTAAGACTGATCTCATCGAGGATGATCTTCTAGACCACCGTATGACTGAATTGAATAAAAAAGAGGATGGATTGACCTCTGCCTGCACAAAATCAAACTCTAGATGTATATCCTTAGGAAGGATCATTGCTATCTTAAATTGTTGGGCTGCTTCAAAGGATGAAGGGACAGATGTTAGAGACGAATATCGTGCAGAAGATATCGATGTGAATAGATTGTTGAGTTGTTGCTGTAAACATTCTGAATGGCTGGGAATACGTACACTAAAGTGCATCACCATTTTGCATGCAGTCCTGGATATTATTTTGAATCATTATCAACTGAAAGAAAGGTCAATGCATATATTTACTGAATGGAGCATCGTTTGTGGCTGTACTATGCAACCGGGGTCAAGATTGCTTACCTTTTGTTTTCGGCAACAATTGTTTTCCTTTCGTGTCTTTTCTTGCACCAAATCATCCAATGATTCATCAGCAGGTTGTCTCATAACTTCATCTCATTGTCAAATTTGGATCGTGGTCAGGTTTAGAAAGTAG
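
The relationship between the mRNA and CDS sequences above can be checked programmatically. The sketch below assumes the CDS occurs as one contiguous substring of the spliced mRNA; whatever precedes it is treated as the 5' UTR and whatever follows as the 3' UTR. The function name `split_utrs` is hypothetical, and the sequences in the usage lines are toy strings, not the full sequences from this page.

```python
def split_utrs(mrna, cds):
    """Return (five_prime_utr, three_prime_utr) flanking the CDS in the mRNA.

    Assumes the CDS is a single contiguous substring of the mRNA.
    """
    mrna, cds = mrna.upper(), cds.upper()
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found in mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[end:]


# Toy example: 3 nt of 5' UTR, a 9 nt CDS, 2 nt of 3' UTR.
five, three = split_utrs("GGGATGAAATAGCC", "ATGAAATAG")
print(len(five), len(three))
```

Passing the full mRNA and CDS strings from this page to the same function would give the UTR lengths for this transcript.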

Protein sequence

MEVGSDSDPIEAELDPELESVEGGKGPAHHPSAPFDELFDISTTVDPSYIISLIRKLLPSNASNLRNSYGIRDDDGNASVTNMDESDAYLSGDQVLSSSGTVNECQGIEIADGSDKLADREGEDEGACPRSEQRISSSEENVWEEYGCILWDLSASKSHAELMVQNLVLEVLSANLMVSQSVRVMEIVLGIIGNLACHEVPMKHIVDKSGLITTIVNQLFLDDAQCLCEVCRLLDAGLQSSECAIWAGALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKLGLSSALFNLFSFEMKILTNERSAERYSILDAILRAVEALSGIEEHSQEFCSNKKLFQLVCELVKLPDAFEVSSSCISAVILIANILSDLPDLAFDMSQDLSFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVEEKAMSRPRMFECVSLLVSKTDLIEDDLLDHRMTELNKKEDGLTSACTKSNSRCISLGRIIAILNCWAASKDEGTDVRDEYRAEDIDVNRLLSCCCKHSEWLGIRTLKCITILHAVLDIILNHYQLKERSMHIFTEWSIVCGCTMQPGSRLLTFCFRQQLFSFRVFSCTKSSNDSSAGCLITSSHCQIWIVVRFRK
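
The protein sequence above corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and the page does not say which tool produced its protein sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing unexpected characters are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 30 nt of the CDS listed on this page.
print(translate("ATGGAGGTGGGCTCAGATTCAGAC"))        # MEVGSDSD
print(translate("ATTTGGATCGTGGTCAGGTTTAGAAAGTAG"))  # IWIVVRFRK
```

Both fragments reproduce the start and end of the protein sequence shown above, which is consistent with a frame-1 translation that ends at the terminal TAG codon.
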
BLAST of CmaCh20G005400 vs. TrEMBL
Match: A0A0A0KDI1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G149940 PE=4 SV=1)

HSP 1 Score: 867.1 bits (2239), Expect = 1.3e-248
Identity = 455/536 (84.89%), Positives = 484/536 (90.30%), Query Frame = 1

Query: 1   MEVGSDSDPIEAELDPELESVEGGKGPAHHPSAPFDELFDISTTVDPSYIISLIRKLLPS 60
           MEVGSD DPIEAELD +LE V+   GPAHHPSAPFDE+FDISTTVDPSYIISLIRKLLP 
Sbjct: 1   MEVGSDLDPIEAELDADLEPVKDCNGPAHHPSAPFDEVFDISTTVDPSYIISLIRKLLPL 60

Query: 61  NASNLRNSYGIRDDDGNASVTNMDESDAYLSGDQVLSSSGTVNECQGIEIADGSDKLADR 120
           NASN RNS G   D G+ SV  MDE D Y+SGDQ+ SSSGTV++C GIEI D S KLAD+
Sbjct: 61  NASNTRNSCGNGHDGGDTSVNKMDEGDGYVSGDQLFSSSGTVSKCLGIEIEDDSGKLADK 120

Query: 121 EGEDEGACPRSEQRISSSEENVWEEYGCILWDLSASKSHAELMVQNLVLEVLSANLMVSQ 180
           EGEDEGACP+SEQ ISSSEE VWEEYGCILWDLSAS+S AELMVQNLVLEVLSANLMVSQ
Sbjct: 121 EGEDEGACPKSEQLISSSEEKVWEEYGCILWDLSASRSQAELMVQNLVLEVLSANLMVSQ 180

Query: 181 SVRVMEIVLGIIGNLACHEVPMKHIVDKSGLITTIVNQLFLDDAQCLCEVCRLLDAGLQS 240
           SVRVMEI LGIIGNLACHEVPMKHIV KSGLITTIV+QLFLDDAQCLCEVCRLL+ GLQS
Sbjct: 181 SVRVMEISLGIIGNLACHEVPMKHIVAKSGLITTIVSQLFLDDAQCLCEVCRLLNTGLQS 240

Query: 241 SECAIWAGALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL 300
           SEC IWA ALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQE+VHVLL CLMKL
Sbjct: 241 SECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEIVHVLLSCLMKL 300

Query: 301 GLSSALFNLFSFEMKILTNERSAERYSILDAILRAVEALSGIEEHSQEFCSNKKLFQLVC 360
           GLSS LFNLFSFEMKILTNERSAER+SILD ILRAVEALSG EEHS+E CSNK+LFQLV 
Sbjct: 301 GLSSVLFNLFSFEMKILTNERSAERHSILDVILRAVEALSGNEEHSRELCSNKELFQLVR 360

Query: 361 ELVKLPDAFEVSSSCISAVILIANILSDLPDLAFDMSQDLSFLQGLLDIFSFAGDDLEAR 420
           +LVKLPDAFEVSSSCISAV+LIANILSD+PDLAF+MSQDLSFLQGLLDIFSF GDD EAR
Sbjct: 361 DLVKLPDAFEVSSSCISAVVLIANILSDVPDLAFEMSQDLSFLQGLLDIFSFVGDDFEAR 420

Query: 421 DAVWSIIARILVHVEEKAMSRPRMFECVSLLVSKTDLIEDDLLDHRMTELNKKEDGLTSA 480
           DAVWSIIARILV V+E  MSRP++FE VSLLVSKTDLIEDDLLDH MTE NK+EDG+TSA
Sbjct: 421 DAVWSIIARILVRVQENVMSRPKLFEYVSLLVSKTDLIEDDLLDHCMTESNKEEDGMTSA 480

Query: 481 CTKSNSRCISLGRIIAILNCWAASKDEGTDVRDEYRAEDIDVNRLLSCCCKHSEWL 537
           CTKSNSRCISL RII+ILN W ASKDEGTDVRDEY  ED+DVNRLL+CC KHSE L
Sbjct: 481 CTKSNSRCISLRRIISILNHWTASKDEGTDVRDEYCLEDVDVNRLLTCCSKHSEEL 536
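
The percentages on each HSP statistics line are ratios of matching columns to alignment length. The sketch below recomputes them from the raw counts; the function name `hsp_percentages` is hypothetical, and the regular expression only assumes the `label = n/m` layout seen in the reports on this page.

```python
import re

# Matches fragments such as "Identity = 455/536".
RATIO = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(stat_line):
    """Return {label: percentage string} for every 'label = n/m' ratio."""
    return {
        label: f"{100 * int(n) / int(m):.2f}"
        for label, n, m in RATIO.findall(stat_line)
    }


line = "Identity = 455/536 (84.89%), Positives = 484/536 (90.30%), Query Frame = 1"
print(hsp_percentages(line))
```

For the first Cucumis sativus hit this gives 84.89 for identity and 90.30 for positives, matching the reported values. The `Query Frame = 1` field is skipped because it is not a ratio.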

BLAST of CmaCh20G005400 vs. TrEMBL
Match: M5W7F7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004180mg PE=4 SV=1)

HSP 1 Score: 562.0 bits (1447), Expect = 9.2e-157
Identity = 322/543 (59.30%), Positives = 394/543 (72.56%), Query Frame = 1

Query: 1   MEVGSDSDPIEAELDPELESVEGGKGPAHHPSAPFDELFDISTTVDPSYIISLIRKLLPS 60
           M V + S P+E + + E + V+    PAH+PSAP DE FDISTTVDPSY+ISLIRKLLP+
Sbjct: 1   MAVDAKSVPLEDQEEQERQ-VQRHDAPAHNPSAPPDEFFDISTTVDPSYVISLIRKLLPA 60

Query: 61  NASNLRNSYGIRDDDGNASVTNM-----DESDAYLSGDQVLSSSGTVNECQGIEIADGSD 120
           NASN  NS+G   D   A V  +     D++   LSGD++L  S   +E   +EIAD   
Sbjct: 61  NASNNHNSHG---DVFYAHVQELETDHTDKTAPTLSGDRLLHVSNDGSE--SMEIADDFH 120

Query: 121 KLADREGEDEGACPRSEQRISSSE--ENVWEEYGCILWDLSASKSHAELMVQNLVLEVLS 180
           K A  E ++ G+   +EQ   S    E  WEEYGCILWDL+ASK+HAELMVQNL+LEVL 
Sbjct: 121 KSAPEERQNNGSYDGAEQCGHSVPVGEEAWEEYGCILWDLAASKTHAELMVQNLILEVLL 180

Query: 181 ANLMVSQSVRVMEIVLGIIGNLACHEVPMKHIVDKSGLITTIVNQLFLDDAQCLCEVCRL 240
           ANL+VSQS+R MEI LGIIGNLACHEVPMKHIV   GLI T+V+QLF +DAQCLCE CRL
Sbjct: 181 ANLVVSQSLRAMEITLGIIGNLACHEVPMKHIVSTIGLIGTVVDQLFSEDAQCLCEACRL 240

Query: 241 LDAGLQSSECAIWAGALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVL 300
           L  GLQSSEC  WA  L SEH+LSRILW++EN+LNPQLIEKSV +L   IES +EVV +L
Sbjct: 241 LTVGLQSSECISWAKELQSEHILSRILWIAENSLNPQLIEKSVEVLLATIESSEEVVLIL 300

Query: 301 LPCLMKLGLSSALFNLFSFEMKILTNERSAERYSILDAILRAVEALSGIEEHSQEFCSNK 360
           LP LMKLGL+S L NL  FEM  L +ER  ERY +LD ILR++EALS I+ HSQE CSNK
Sbjct: 301 LPPLMKLGLASLLINLLDFEMSQLLSERVPERYPVLDVILRSIEALSVIDGHSQEICSNK 360

Query: 361 KLFQLVCELVKLPDAFEVSSSCISAVILIANILSDLPDLAFDMSQDLSFLQGLLDIFSFA 420
            LF+LVC+LVKLPD  EV++SCI+A +LIANILSD P LA ++SQDL FLQGLLDIF F+
Sbjct: 361 DLFRLVCDLVKLPDKVEVANSCITAGVLIANILSDEPHLASEISQDLPFLQGLLDIFPFS 420

Query: 421 GDDLEARDAVWSIIARILVHVEEKAMSRPRMFECVSLLVSKTDLIEDDLLDHRMTELNKK 480
            +DLEAR A+W+IIAR+LV V+E  MSR  + + VS+LVSK+D IEDDLLD ++ ELN K
Sbjct: 421 SEDLEARSALWNIIARLLVRVQENEMSRSALQQYVSVLVSKSDAIEDDLLDFQLDELNSK 480

Query: 481 EDGLTSACTKSNSRCISLGRIIAILNCWAASKDEG--TDVRDEYRAEDIDVNRLLSCCCK 535
                       +R  SL RII++LN W ASKD+    ++      +DI+++RLL CCCK
Sbjct: 481 ------------ARTTSLRRIISLLNQWTASKDDDKENEMMGNRYEDDINIDRLLDCCCK 525

BLAST of CmaCh20G005400 vs. TrEMBL
Match: A0A061GW25_THECC (ARM repeat superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_041542 PE=4 SV=1)

HSP 1 Score: 544.7 bits (1402), Expect = 1.5e-151
Identity = 307/511 (60.08%), Positives = 382/511 (74.76%), Query Frame = 1

Query: 27  PAHHPSAPFDELFDISTTVDPSYIISLIRKLLPSNASNLRNSYGIRDDDGNASVTNMDES 86
           P+HHPSAP DELFDISTTVDPSY+ISLIRKLLP +A N        DD+     +N +  
Sbjct: 27  PSHHPSAPPDELFDISTTVDPSYVISLIRKLLPLDARN--------DDNTEIRGSNCN-- 86

Query: 87  DAYLSGDQVLSSSGTVNECQGIEIADGSDKLADREGEDEGACPRSEQ--RISSSEENVWE 146
                 D+V+SSS   ++C+G+EI D   K +D +GEDE    R  +  R+S+ EE VWE
Sbjct: 87  ------DEVVSSSN--DKCKGMEIVDDFSK-SDFQGEDEEDSGRGGENARVSAGEE-VWE 146

Query: 147 EYGCILWDLSASKSHAELMVQNLVLEVLSANLMVSQSVRVMEIVLGIIGNLACHEVPMKH 206
           E GC+LWDL+A+++HAELMVQNL+LEVL ANLMV+QSVRV EI LGI+GNLACHEVPMKH
Sbjct: 147 ECGCVLWDLAANQTHAELMVQNLILEVLLANLMVTQSVRVTEICLGIMGNLACHEVPMKH 206

Query: 207 IVDKSGLITTIVNQLFLDDAQCLCEVCRLLDAGLQSSECAIWAGALNSEHVLSRILWVSE 266
           +V  +GLI+ IV+QLFLDD QCL E CRLL  GLQ SEC IWA AL SEH+LSRILWV+E
Sbjct: 207 MVSTNGLISVIVDQLFLDDTQCLGEACRLLSLGLQGSECRIWAEALQSEHILSRILWVTE 266

Query: 267 NTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKLGLSSALFNLFSFEMKILTNERSAE 326
           NTLNPQLIEKSVGLL  ++ESQ+EV H+LL  LMKLGL++ L NL +FEM  LTNER  E
Sbjct: 267 NTLNPQLIEKSVGLLLAMLESQKEVEHILLLPLMKLGLATVLVNLLAFEMSKLTNERIPE 326

Query: 327 RYSILDAILRAVEALSGIEEHSQEFCSNKKLFQLVCELVKLPDAFEVSSSCISAVILIAN 386
           RYS+LD ILRA+EAL  ++ +SQE CSNK+ FQLVC+L+K PD  EVS+SC++A ++IAN
Sbjct: 327 RYSVLDVILRALEALCVLDGYSQEICSNKEFFQLVCDLIKFPDKVEVSNSCVTAGVIIAN 386

Query: 387 ILSDLPDLAFDMSQDLSFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVEEKAMSRPRM 446
           ILSD+ DLA D+SQDL FLQGL DIF F  D+LEAR A+WSIIAR+LV V+E  MS   +
Sbjct: 387 ILSDVSDLASDLSQDLPFLQGLFDIFPFTSDELEARCALWSIIARLLVRVQEDEMSASSL 446

Query: 447 FECVSLLVSKTDLIEDDLLDHRMTELNKKEDGLTSACTKSNSRCISLGRIIAILNCWAAS 506
            + V +L SK DLIEDDL DH+  E NK+ + L +    SN+R  +L RII+ILN W + 
Sbjct: 447 RQYVFILSSKADLIEDDLFDHQFDE-NKENESLATCGRISNARTFALRRIISILNKWNSL 506

Query: 507 KD--EGTDVRDEYRAEDIDVNRLLSCCCKHS 534
           KD  E   V +E+ A D +++RLL CC K++
Sbjct: 507 KDSVEEKHVMEEH-ANDENIHRLLDCCHKYT 515

BLAST of CmaCh20G005400 vs. TrEMBL
Match: A0A061GVV3_THECC (ARM repeat superfamily protein, putative isoform 2 OS=Theobroma cacao GN=TCM_041542 PE=4 SV=1)

HSP 1 Score: 544.7 bits (1402), Expect = 1.5e-151
Identity = 307/511 (60.08%), Positives = 382/511 (74.76%), Query Frame = 1

Query: 27  PAHHPSAPFDELFDISTTVDPSYIISLIRKLLPSNASNLRNSYGIRDDDGNASVTNMDES 86
           P+HHPSAP DELFDISTTVDPSY+ISLIRKLLP +A N        DD+     +N +  
Sbjct: 27  PSHHPSAPPDELFDISTTVDPSYVISLIRKLLPLDARN--------DDNTEIRGSNCN-- 86

Query: 87  DAYLSGDQVLSSSGTVNECQGIEIADGSDKLADREGEDEGACPRSEQ--RISSSEENVWE 146
                 D+V+SSS   ++C+G+EI D   K +D +GEDE    R  +  R+S+ EE VWE
Sbjct: 87  ------DEVVSSSN--DKCKGMEIVDDFSK-SDFQGEDEEDSGRGGENARVSAGEE-VWE 146

Query: 147 EYGCILWDLSASKSHAELMVQNLVLEVLSANLMVSQSVRVMEIVLGIIGNLACHEVPMKH 206
           E GC+LWDL+A+++HAELMVQNL+LEVL ANLMV+QSVRV EI LGI+GNLACHEVPMKH
Sbjct: 147 ECGCVLWDLAANQTHAELMVQNLILEVLLANLMVTQSVRVTEICLGIMGNLACHEVPMKH 206

Query: 207 IVDKSGLITTIVNQLFLDDAQCLCEVCRLLDAGLQSSECAIWAGALNSEHVLSRILWVSE 266
           +V  +GLI+ IV+QLFLDD QCL E CRLL  GLQ SEC IWA AL SEH+LSRILWV+E
Sbjct: 207 MVSTNGLISVIVDQLFLDDTQCLGEACRLLSLGLQGSECRIWAEALQSEHILSRILWVTE 266

Query: 267 NTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKLGLSSALFNLFSFEMKILTNERSAE 326
           NTLNPQLIEKSVGLL  ++ESQ+EV H+LL  LMKLGL++ L NL +FEM  LTNER  E
Sbjct: 267 NTLNPQLIEKSVGLLLAMLESQKEVEHILLLPLMKLGLATVLVNLLAFEMSKLTNERIPE 326

Query: 327 RYSILDAILRAVEALSGIEEHSQEFCSNKKLFQLVCELVKLPDAFEVSSSCISAVILIAN 386
           RYS+LD ILRA+EAL  ++ +SQE CSNK+ FQLVC+L+K PD  EVS+SC++A ++IAN
Sbjct: 327 RYSVLDVILRALEALCVLDGYSQEICSNKEFFQLVCDLIKFPDKVEVSNSCVTAGVIIAN 386

Query: 387 ILSDLPDLAFDMSQDLSFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVEEKAMSRPRM 446
           ILSD+ DLA D+SQDL FLQGL DIF F  D+LEAR A+WSIIAR+LV V+E  MS   +
Sbjct: 387 ILSDVSDLASDLSQDLPFLQGLFDIFPFTSDELEARCALWSIIARLLVRVQEDEMSASSL 446

Query: 447 FECVSLLVSKTDLIEDDLLDHRMTELNKKEDGLTSACTKSNSRCISLGRIIAILNCWAAS 506
            + V +L SK DLIEDDL DH+  E NK+ + L +    SN+R  +L RII+ILN W + 
Sbjct: 447 RQYVFILSSKADLIEDDLFDHQFDE-NKENESLATCGRISNARTFALRRIISILNKWNSL 506

Query: 507 KD--EGTDVRDEYRAEDIDVNRLLSCCCKHS 534
           KD  E   V +E+ A D +++RLL CC K++
Sbjct: 507 KDSVEEKHVMEEH-ANDENIHRLLDCCHKYT 515

BLAST of CmaCh20G005400 vs. TrEMBL
Match: A0A067KE10_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_11230 PE=4 SV=1)

HSP 1 Score: 542.7 bits (1397), Expect = 5.8e-151
Identity = 298/530 (56.23%), Positives = 379/530 (71.51%), Query Frame = 1

Query: 5   SDSDPIEAELDPELESVEGGKGPAHHPSAPFDELFDISTTVDPSYIISLIRKLLPSNASN 64
           S+S+P E E   + E       PAHHPSAP  ELFDISTTVDPSYIISLIRKL+P +  N
Sbjct: 3   SESNPPEEEEQYQREQEAAHDAPAHHPSAPAHELFDISTTVDPSYIISLIRKLIPPSVEN 62

Query: 65  LRNSYGIRDDDGNASVTNMDESDAYLSGDQVLSSSGTVNECQGIEIADGSDKLADREGED 124
             N+ G+     NA    M+E  A  S D++  +   VN  + + + D   K A R+G+D
Sbjct: 63  NHNAKGVDCKGSNADY--MEEHGASPSRDRIPDT--LVNRSENMNVVDDFKKSACRDGKD 122

Query: 125 EGACPRSEQRISSSEENVWEEYGCILWDLSASKSHAELMVQNLVLEVLSANLMVSQSVRV 184
           + + P S+Q    +EE  WEEYGCILWDL+AS++HAELMV+NL+LEVL A+L VSQSVR+
Sbjct: 123 QDSSP-SKQPGVLAEEETWEEYGCILWDLAASRTHAELMVENLILEVLLAHLRVSQSVRI 182

Query: 185 MEIVLGIIGNLACHEVPMKHIVDKSGLITTIVNQLFLDDAQCLCEVCRLLDAGLQSSECA 244
           MEI LGIIGNLACHEVPMKH+V  +GLI  IV QLFLDD QCLCE CRLL  GLQ   C 
Sbjct: 183 MEICLGIIGNLACHEVPMKHVVSTNGLIEIIVYQLFLDDTQCLCEACRLLTLGLQGDMCN 242

Query: 245 IWAGALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKLGLSS 304
            W  AL SE++L R++WV+ENTLNPQL+EK V LLS I+ES++ V  +LLP LMKLGL++
Sbjct: 243 TWVEALQSENILGRVMWVAENTLNPQLLEKVVELLSAILESEK-VSSILLPSLMKLGLTN 302

Query: 305 ALFNLFSFEMKILTNERSAERYSILDAILRAVEALSGIEEHSQEFCSNKKLFQLVCELVK 364
            L NL + EM  LT ER  ERY +LD ILRA+E +S ++ HSQE CSNK+LFQLVC+LVK
Sbjct: 303 LLINLLASEMSTLTGERIPERYVVLDVILRAIEVISTLDGHSQEICSNKELFQLVCDLVK 362

Query: 365 LPDAFEVSSSCISAVILIANILSDLPDLAFDMSQDLSFLQGLLDIFSFAGDDLEARDAVW 424
            PD  EV++SC +  +L+ANILSD+PDLA ++S DL+FLQGLLDIF FA DD EAR A+W
Sbjct: 363 FPDKVEVANSCATVSVLVANILSDVPDLALEISHDLAFLQGLLDIFPFASDDCEARSALW 422

Query: 425 SIIARILVHVEEKAMSRPRMFECVSLLVSKTDLIEDDLLDHRMTELNKKEDGLTSACTKS 484
           SI AR+LV V+E  +    + + V +LV+KTDLIEDDLLD ++ + +K+     S+  KS
Sbjct: 423 SIFARLLVRVKENELDLSTLCQYVLVLVTKTDLIEDDLLDQQLDDASKETKISISSDIKS 482

Query: 485 NSRCISLGRIIAILNCWAASKD--EGTDVRDEYRAEDIDVNRLLSCCCKH 533
           N+R  +L RI++ILN W A KD  +  DV +E+ A ++DV RLL CC KH
Sbjct: 483 NTRNTALQRIVSILNRWTALKDSHKVEDVMEEHYAIEVDVGRLLDCCRKH 526

BLAST of CmaCh20G005400 vs. TAIR10
Match: AT5G22820.2 (AT5G22820.2 ARM repeat superfamily protein)

HSP 1 Score: 442.6 bits (1137), Expect = 4.1e-124
Identity = 247/509 (48.53%), Positives = 339/509 (66.60%), Query Frame = 1

Query: 27  PAHHPSAPFDELFDISTTVDPSYIISLIRKLLP-SNASNLRNSYGIRDDDGNASVTNMDE 86
           P+HHP  P DELFDISTTVDPSY+ISLIRKLLP  + S+ R++  +  D       N+ +
Sbjct: 32  PSHHPPPPPDELFDISTTVDPSYLISLIRKLLPIDSGSDERHNDHMNTD-------NVVQ 91

Query: 87  SDAYLSGDQVLSSSGTVNECQGIEIADGSDKLADREGEDEGACPRSEQRISSSEENVWEE 146
               +SG+ V+ +S    + + ++I D  D+     GE   +CP    +  SS    WE+
Sbjct: 92  GVVAISGNGVVETSN--GDPESMDIGDNHDESTYEVGETVSSCPAPGMQDGSS---AWED 151

Query: 147 YGCILWDLSASKSHAELMVQNLVLEVLSANLMVSQSVRVMEIVLGIIGNLACHEVPMKHI 206
           +GC+LWDL+AS++HAELMVQNL+LEVL ANLMVS+S R+ EI LGII NLACHE  +KHI
Sbjct: 152 HGCVLWDLAASRTHAELMVQNLILEVLHANLMVSKSPRIREICLGIIRNLACHEGLLKHI 211

Query: 207 VDKSGLITTIVNQLFLDDAQCLCEVCRLLDAGLQSSECAIWAGALNSEHVLSRILWVSEN 266
              +G++ T+V QLFLDD QCL EVCR+L  GL  + C  WA  L S+ +L  ILW++EN
Sbjct: 212 ESTAGIVNTLVGQLFLDDTQCLSEVCRILTTGLSGAGCTSWAHCLESDDILRHILWIAEN 271

Query: 267 TLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKLGLSSALFNLFSFEMKILTNERSAER 326
           TLNP LIEKSVGLL  IIE Q EV  +L+P LM LGL+S L NL SFEM  LT ER  ER
Sbjct: 272 TLNPHLIEKSVGLLLGIIEGQPEVEQLLIPPLMNLGLTSLLINLLSFEMSKLTKERIPER 331

Query: 327 YSILDAILRAVEALSGIEEHSQEFCSNKKLFQLVCELVKLPDAFEVSSSCISAVILIANI 386
           Y +L+ ILRA+EALS  + +S+E CS+K+LFQLVC+L+KL D  EV++SC++  +LIAN+
Sbjct: 332 YPVLEIILRAIEALSASDSYSKEICSSKELFQLVCDLMKLQDKAEVATSCVTTGVLIANM 391

Query: 387 LSDLPDLAFDMSQDLSFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVEEKAMSRPRMF 446
           LS+  D   ++ +D SFL+GL     FA DD+EAR A+W++IAR+L  V E  ++   + 
Sbjct: 392 LSERVDFIPEVLEDFSFLEGLFSTLPFASDDVEARRALWNVIARLLARVNESEINTFLLS 451

Query: 447 ECVSLLVSKTDLIEDDLLDHRMTELNKKEDGLTSACTKSNSRCISLGRIIAILNCWAASK 506
           + + +L+S  D+IEDD LD ++ + N+  +   S   KS++R I++ +I +ILN W A K
Sbjct: 452 QYILVLLSNADIIEDDFLDTQLEDSNESRNSFPSQ-IKSSARTIAIQKIESILNNWNARK 511

Query: 507 D--EGTDVRDEYRAEDIDVNRLLSCCCKH 533
           +  +   V         DV RL  CC ++
Sbjct: 512 ENLQEETVNGNCSINLADVKRLFDCCHRY 527

BLAST of CmaCh20G005400 vs. NCBI nr
Match: gi|449456176|ref|XP_004145826.1| (PREDICTED: uncharacterized protein LOC101215373 [Cucumis sativus])

HSP 1 Score: 867.1 bits (2239), Expect = 1.9e-248
Identity = 455/536 (84.89%), Positives = 484/536 (90.30%), Query Frame = 1

Query: 1   MEVGSDSDPIEAELDPELESVEGGKGPAHHPSAPFDELFDISTTVDPSYIISLIRKLLPS 60
           MEVGSD DPIEAELD +LE V+   GPAHHPSAPFDE+FDISTTVDPSYIISLIRKLLP 
Sbjct: 1   MEVGSDLDPIEAELDADLEPVKDCNGPAHHPSAPFDEVFDISTTVDPSYIISLIRKLLPL 60

Query: 61  NASNLRNSYGIRDDDGNASVTNMDESDAYLSGDQVLSSSGTVNECQGIEIADGSDKLADR 120
           NASN RNS G   D G+ SV  MDE D Y+SGDQ+ SSSGTV++C GIEI D S KLAD+
Sbjct: 61  NASNTRNSCGNGHDGGDTSVNKMDEGDGYVSGDQLFSSSGTVSKCLGIEIEDDSGKLADK 120

Query: 121 EGEDEGACPRSEQRISSSEENVWEEYGCILWDLSASKSHAELMVQNLVLEVLSANLMVSQ 180
           EGEDEGACP+SEQ ISSSEE VWEEYGCILWDLSAS+S AELMVQNLVLEVLSANLMVSQ
Sbjct: 121 EGEDEGACPKSEQLISSSEEKVWEEYGCILWDLSASRSQAELMVQNLVLEVLSANLMVSQ 180

Query: 181 SVRVMEIVLGIIGNLACHEVPMKHIVDKSGLITTIVNQLFLDDAQCLCEVCRLLDAGLQS 240
           SVRVMEI LGIIGNLACHEVPMKHIV KSGLITTIV+QLFLDDAQCLCEVCRLL+ GLQS
Sbjct: 181 SVRVMEISLGIIGNLACHEVPMKHIVAKSGLITTIVSQLFLDDAQCLCEVCRLLNTGLQS 240

Query: 241 SECAIWAGALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL 300
           SEC IWA ALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQE+VHVLL CLMKL
Sbjct: 241 SECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEIVHVLLSCLMKL 300

Query: 301 GLSSALFNLFSFEMKILTNERSAERYSILDAILRAVEALSGIEEHSQEFCSNKKLFQLVC 360
           GLSS LFNLFSFEMKILTNERSAER+SILD ILRAVEALSG EEHS+E CSNK+LFQLV 
Sbjct: 301 GLSSVLFNLFSFEMKILTNERSAERHSILDVILRAVEALSGNEEHSRELCSNKELFQLVR 360

Query: 361 ELVKLPDAFEVSSSCISAVILIANILSDLPDLAFDMSQDLSFLQGLLDIFSFAGDDLEAR 420
           +LVKLPDAFEVSSSCISAV+LIANILSD+PDLAF+MSQDLSFLQGLLDIFSF GDD EAR
Sbjct: 361 DLVKLPDAFEVSSSCISAVVLIANILSDVPDLAFEMSQDLSFLQGLLDIFSFVGDDFEAR 420

Query: 421 DAVWSIIARILVHVEEKAMSRPRMFECVSLLVSKTDLIEDDLLDHRMTELNKKEDGLTSA 480
           DAVWSIIARILV V+E  MSRP++FE VSLLVSKTDLIEDDLLDH MTE NK+EDG+TSA
Sbjct: 421 DAVWSIIARILVRVQENVMSRPKLFEYVSLLVSKTDLIEDDLLDHCMTESNKEEDGMTSA 480

Query: 481 CTKSNSRCISLGRIIAILNCWAASKDEGTDVRDEYRAEDIDVNRLLSCCCKHSEWL 537
           CTKSNSRCISL RII+ILN W ASKDEGTDVRDEY  ED+DVNRLL+CC KHSE L
Sbjct: 481 CTKSNSRCISLRRIISILNHWTASKDEGTDVRDEYCLEDVDVNRLLTCCSKHSEEL 536

BLAST of CmaCh20G005400 vs. NCBI nr
Match: gi|659117533|ref|XP_008458652.1| (PREDICTED: uncharacterized protein LOC103497988 isoform X1 [Cucumis melo])

HSP 1 Score: 860.9 bits (2223), Expect = 1.4e-246
Identity = 453/536 (84.51%), Positives = 480/536 (89.55%), Query Frame = 1

Query: 1   MEVGSDSDPIEAELDPELESVEGGKGPAHHPSAPFDELFDISTTVDPSYIISLIRKLLPS 60
           MEVGSDSDPIEAEL+ ++E VE   GPAHHPSAP DELFDISTTVDPSYIISLIRKLLP 
Sbjct: 1   MEVGSDSDPIEAELEADVEPVEDCNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPL 60

Query: 61  NASNLRNSYGIRDDDGNASVTNMDESDAYLSGDQVLSSSGTVNECQGIEIADGSDKLADR 120
           NASN RNS     D G+ SV  MDE D YLSGDQ+LSSSGTV++C G+EIADGS KLAD+
Sbjct: 61  NASNTRNSCENGHDGGDTSVNKMDEGDGYLSGDQLLSSSGTVSKCLGLEIADGSGKLADK 120

Query: 121 EGEDEGACPRSEQRISSSEENVWEEYGCILWDLSASKSHAELMVQNLVLEVLSANLMVSQ 180
           EGEDEGAC +SEQ ISS EE VWEEYGCILWDLSAS+S AELMVQNLVLEVLSANLMVSQ
Sbjct: 121 EGEDEGACLKSEQLISSPEEKVWEEYGCILWDLSASRSQAELMVQNLVLEVLSANLMVSQ 180

Query: 181 SVRVMEIVLGIIGNLACHEVPMKHIVDKSGLITTIVNQLFLDDAQCLCEVCRLLDAGLQS 240
           SVRVMEI LGIIGNLACHEVPMKHIV KSGLITTIV+QLFLDDAQCLCEVCRLL+ GLQS
Sbjct: 181 SVRVMEISLGIIGNLACHEVPMKHIVAKSGLITTIVSQLFLDDAQCLCEVCRLLNTGLQS 240

Query: 241 SECAIWAGALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL 300
           SEC IWA ALN EHVLSRILWVSENTLNPQLIEKSVGLLSTIIES QEVVH LLPCLMKL
Sbjct: 241 SECVIWAEALNFEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESHQEVVHALLPCLMKL 300

Query: 301 GLSSALFNLFSFEMKILTNERSAERYSILDAILRAVEALSGIEEHSQEFCSNKKLFQLVC 360
           GLSS LFNLFSFEMKILTNERSAER+SILD ILRAVE LSGIEEHS E CSNK+LFQLV 
Sbjct: 301 GLSSVLFNLFSFEMKILTNERSAERHSILDVILRAVETLSGIEEHSHELCSNKELFQLVR 360

Query: 361 ELVKLPDAFEVSSSCISAVILIANILSDLPDLAFDMSQDLSFLQGLLDIFSFAGDDLEAR 420
           +LVKLPDAFEVSSSCISAV+LIANILSD+PDLAF+MSQDLSFLQGL D FSFAGDDLEAR
Sbjct: 361 DLVKLPDAFEVSSSCISAVVLIANILSDVPDLAFEMSQDLSFLQGLFDTFSFAGDDLEAR 420

Query: 421 DAVWSIIARILVHVEEKAMSRPRMFECVSLLVSKTDLIEDDLLDHRMTELNKKEDGLTSA 480
           DAVWSIIARILV V+E  MSRP++ E VSLLVSKTDLIEDDLLDH MTE NK+EDG+TSA
Sbjct: 421 DAVWSIIARILVRVQENVMSRPKLLEYVSLLVSKTDLIEDDLLDHCMTESNKEEDGMTSA 480

Query: 481 CTKSNSRCISLGRIIAILNCWAASKDEGTDVRDEYRAEDIDVNRLLSCCCKHSEWL 537
           CTKSNSRCISL RII+ILN W ASKDEGTDVRDEY  ED+DVNRLL+CC KHSE L
Sbjct: 481 CTKSNSRCISLRRIISILNHWTASKDEGTDVRDEYCVEDVDVNRLLTCCSKHSEEL 536

BLAST of CmaCh20G005400 vs. NCBI nr
Match: gi|659117537|ref|XP_008458654.1| (PREDICTED: uncharacterized protein LOC103497988 isoform X3 [Cucumis melo])

HSP 1 Score: 843.6 bits (2178), Expect = 2.3e-241
Identity = 447/536 (83.40%), Positives = 474/536 (88.43%), Query Frame = 1

Query: 1   MEVGSDSDPIEAELDPELESVEGGKGPAHHPSAPFDELFDISTTVDPSYIISLIRKLLPS 60
           MEVGSDSDPIEAEL+ ++E VE   GPAHHPSAP DELFDISTTVDPSYIISLIRKLLP 
Sbjct: 1   MEVGSDSDPIEAELEADVEPVEDCNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPL 60

Query: 61  NASNLRNSYGIRDDDGNASVTNMDESDAYLSGDQVLSSSGTVNECQGIEIADGSDKLADR 120
           NASN RNS     D G+ SV  MDE D YLSGDQ+LSSSGTV++C G+EIADGS KLAD+
Sbjct: 61  NASNTRNSCENGHDGGDTSVNKMDEGDGYLSGDQLLSSSGTVSKCLGLEIADGSGKLADK 120

Query: 121 EGEDEGACPRSEQRISSSEENVWEEYGCILWDLSASKSHAELMVQNLVLEVLSANLMVSQ 180
           EGEDEGAC +SEQ ISS EE VWEEYGCILWDLSAS+S AELMVQNLVLEVLSANLMVSQ
Sbjct: 121 EGEDEGACLKSEQLISSPEEKVWEEYGCILWDLSASRSQAELMVQNLVLEVLSANLMVSQ 180

Query: 181 SVRVMEIVLGIIGNLACHEVPMKHIVDKSGLITTIVNQLFLDDAQCLCEVCRLLDAGLQS 240
           SVRVMEI LGIIGNLACHEVPMKHIV KSGLITTIV+QLFLDDAQCLCEVCRLL+ GLQS
Sbjct: 181 SVRVMEISLGIIGNLACHEVPMKHIVAKSGLITTIVSQLFLDDAQCLCEVCRLLNTGLQS 240

Query: 241 SECAIWAGALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL 300
           SEC IWA ALN EHVLSRILWVSENTLNPQLIEKSVGLLSTIIES QEVVH LLPCLMKL
Sbjct: 241 SECVIWAEALNFEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESHQEVVHALLPCLMKL 300

Query: 301 GLSSALFNLFSFEMKILTNERSAERYSILDAILRAVEALSGIEEHSQEFCSNKKLFQLVC 360
           GLSS LFNLFSFEMKILTNERSAER+SILD ILRAVE LSGIEEHS E CSNK+LFQLV 
Sbjct: 301 GLSSVLFNLFSFEMKILTNERSAERHSILDVILRAVETLSGIEEHSHELCSNKELFQLVR 360

Query: 361 ELVKLPDAFEVSSSCISAVILIANILSDLPDLAFDMSQDLSFLQGLLDIFSFAGDDLEAR 420
           +LVKLPDAFEVSSSCISAV+LIANILSD+PDLAF+MS      QGL D FSFAGDDLEAR
Sbjct: 361 DLVKLPDAFEVSSSCISAVVLIANILSDVPDLAFEMS------QGLFDTFSFAGDDLEAR 420

Query: 421 DAVWSIIARILVHVEEKAMSRPRMFECVSLLVSKTDLIEDDLLDHRMTELNKKEDGLTSA 480
           DAVWSIIARILV V+E  MSRP++ E VSLLVSKTDLIEDDLLDH MTE NK+EDG+TSA
Sbjct: 421 DAVWSIIARILVRVQENVMSRPKLLEYVSLLVSKTDLIEDDLLDHCMTESNKEEDGMTSA 480

Query: 481 CTKSNSRCISLGRIIAILNCWAASKDEGTDVRDEYRAEDIDVNRLLSCCCKHSEWL 537
           CTKSNSRCISL RII+ILN W ASKDEGTDVRDEY  ED+DVNRLL+CC KHSE L
Sbjct: 481 CTKSNSRCISLRRIISILNHWTASKDEGTDVRDEYCVEDVDVNRLLTCCSKHSEEL 530

BLAST of CmaCh20G005400 vs. NCBI nr
Match: gi|659117539|ref|XP_008458655.1| (PREDICTED: uncharacterized protein LOC103497988 isoform X4 [Cucumis melo])

HSP 1 Score: 809.3 bits (2089), Expect = 4.8e-231
Identity = 434/536 (80.97%), Positives = 457/536 (85.26%), Query Frame = 1

Query: 1   MEVGSDSDPIEAELDPELESVEGGKGPAHHPSAPFDELFDISTTVDPSYIISLIRKLLPS 60
           MEVGSDSDPIEAEL+ ++E VE   GPAHHPSAP DELFDISTTVDPSYIISLIRKLLP 
Sbjct: 1   MEVGSDSDPIEAELEADVEPVEDCNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPL 60

Query: 61  NASNLRNSYGIRDDDGNASVTNMDESDAYLSGDQVLSSSGTVNECQGIEIADGSDKLADR 120
           NASN RNS     D G+ SV  MDE                          DGS KLAD+
Sbjct: 61  NASNTRNSCENGHDGGDTSVNKMDE--------------------------DGSGKLADK 120

Query: 121 EGEDEGACPRSEQRISSSEENVWEEYGCILWDLSASKSHAELMVQNLVLEVLSANLMVSQ 180
           EGEDEGAC +SEQ ISS EE VWEEYGCILWDLSAS+S AELMVQNLVLEVLSANLMVSQ
Sbjct: 121 EGEDEGACLKSEQLISSPEEKVWEEYGCILWDLSASRSQAELMVQNLVLEVLSANLMVSQ 180

Query: 181 SVRVMEIVLGIIGNLACHEVPMKHIVDKSGLITTIVNQLFLDDAQCLCEVCRLLDAGLQS 240
           SVRVMEI LGIIGNLACHEVPMKHIV KSGLITTIV+QLFLDDAQCLCEVCRLL+ GLQS
Sbjct: 181 SVRVMEISLGIIGNLACHEVPMKHIVAKSGLITTIVSQLFLDDAQCLCEVCRLLNTGLQS 240

Query: 241 SECAIWAGALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL 300
           SEC IWA ALN EHVLSRILWVSENTLNPQLIEKSVGLLSTIIES QEVVH LLPCLMKL
Sbjct: 241 SECVIWAEALNFEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESHQEVVHALLPCLMKL 300

Query: 301 GLSSALFNLFSFEMKILTNERSAERYSILDAILRAVEALSGIEEHSQEFCSNKKLFQLVC 360
           GLSS LFNLFSFEMKILTNERSAER+SILD ILRAVE LSGIEEHS E CSNK+LFQLV 
Sbjct: 301 GLSSVLFNLFSFEMKILTNERSAERHSILDVILRAVETLSGIEEHSHELCSNKELFQLVR 360

Query: 361 ELVKLPDAFEVSSSCISAVILIANILSDLPDLAFDMSQDLSFLQGLLDIFSFAGDDLEAR 420
           +LVKLPDAFEVSSSCISAV+LIANILSD+PDLAF+MSQDLSFLQGL D FSFAGDDLEAR
Sbjct: 361 DLVKLPDAFEVSSSCISAVVLIANILSDVPDLAFEMSQDLSFLQGLFDTFSFAGDDLEAR 420

Query: 421 DAVWSIIARILVHVEEKAMSRPRMFECVSLLVSKTDLIEDDLLDHRMTELNKKEDGLTSA 480
           DAVWSIIARILV V+E  MSRP++ E VSLLVSKTDLIEDDLLDH MTE NK+EDG+TSA
Sbjct: 421 DAVWSIIARILVRVQENVMSRPKLLEYVSLLVSKTDLIEDDLLDHCMTESNKEEDGMTSA 480

Query: 481 CTKSNSRCISLGRIIAILNCWAASKDEGTDVRDEYRAEDIDVNRLLSCCCKHSEWL 537
           CTKSNSRCISL RII+ILN W ASKDEGTDVRDEY  ED+DVNRLL+CC KHSE L
Sbjct: 481 CTKSNSRCISLRRIISILNHWTASKDEGTDVRDEYCVEDVDVNRLLTCCSKHSEEL 510

BLAST of CmaCh20G005400 vs. NCBI nr
Match: gi|659117535|ref|XP_008458653.1| (PREDICTED: uncharacterized protein LOC103497988 isoform X2 [Cucumis melo])

HSP 1 Score: 796.6 bits (2056), Expect = 3.2e-227
Identity = 437/563 (77.62%), Positives = 466/563 (82.77%), Query Frame = 1

Query: 1   MEVGSDSDPIEAELDPELESVEGGKGPAHHPSAPFDELFDISTTVDPSYIISLIRKLLPS 60
           MEVGSDSDPIEAEL+ ++E VE   GPAHHPSAP DELFDISTTVDPSYIISLIRKLLP 
Sbjct: 1   MEVGSDSDPIEAELEADVEPVEDCNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPL 60

Query: 61  NASNLRNSYGIRDDDGNASVTNMDESDAYLSGDQVLSSSGTVNECQGIEIADGSDKLADR 120
           NASN RNS     D G+ SV  MDE D YLSGDQ+LSSSGTV++C G+EIADGS KLAD+
Sbjct: 61  NASNTRNSCENGHDGGDTSVNKMDEGDGYLSGDQLLSSSGTVSKCLGLEIADGSGKLADK 120

Query: 121 EGEDEGACPRSEQRISSSEENVWEEYGCILWDLSASKSHAELMVQNLVLEVLSANLMVSQ 180
           EGEDEGAC +SEQ ISS EE VWEEYGCILWDLSAS+S AELMVQNLVLEVLSANLMVSQ
Sbjct: 121 EGEDEGACLKSEQLISSPEEKVWEEYGCILWDLSASRSQAELMVQNLVLEVLSANLMVSQ 180

Query: 181 SVRVMEIVLGIIGNLACHEVPMKHIVDKSGLITTIVNQLFLDDAQCLCEVCRLLDAGLQS 240
           SVRVMEI LGIIGNLACHEVPMKHIV KSGLITTIV+QLFLDDAQCLCEVCRLL+ GLQS
Sbjct: 181 SVRVMEISLGIIGNLACHEVPMKHIVAKSGLITTIVSQLFLDDAQCLCEVCRLLNTGLQS 240

Query: 241 SECAIWAGALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL 300
           SEC IWA ALN EHVLSRILWVSENTLNPQLIEKSVGLLSTIIES QEVVH LLPCLMKL
Sbjct: 241 SECVIWAEALNFEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESHQEVVHALLPCLMKL 300

Query: 301 GLSSALFNLFSFEMKILTNERSAERYSILDAILRAVEALSGIEEHSQEFCSNKKLFQLVC 360
           GLSS LFNLFSFEMKILTNERSAER+SILD ILRAVE LSGIEEHS E CSNK+LFQLV 
Sbjct: 301 GLSSVLFNLFSFEMKILTNERSAERHSILDVILRAVETLSGIEEHSHELCSNKELFQLVR 360

Query: 361 ELVKLPDAFEVSSSCISAVILIANILSDLPDLAFDMSQDLSFLQGLLDIFSFAGDDLEAR 420
           +LVKLPDAFEVSSSCISAV+LIANILSD+PDLAF+MSQDLSFLQGL D FSFAGDDLEAR
Sbjct: 361 DLVKLPDAFEVSSSCISAVVLIANILSDVPDLAFEMSQDLSFLQGLFDTFSFAGDDLEAR 420

Query: 421 DAVWSIIARILVHVEEKAMSRPRMFECVSLLVSKTDLIEDDLLDHRMTELNKKEDGLTSA 480
           DAVWSIIARILV V+E  MSRP++ E VSLLVSKTDLIEDDLLDH MTE NK+EDG+TSA
Sbjct: 421 DAVWSIIARILVRVQENVMSRPKLLEYVSLLVSKTDLIEDDLLDHCMTESNKEEDGMTSA 480

Query: 481 CTKSNSRCISLGRIIAILNCWAASKDEGTDVRDEYRAEDIDVNRLLSCCCKHSEWLGIRT 540
           CTKSNSRCIS             S+D  T       A  +   + ++    HSE  G   
Sbjct: 481 CTKSNSRCIS-----------GVSRDGNTHFVTHSVA--LSHFQYVTSMPLHSELKG--- 537

Query: 541 LKCITILHAVLDIILNHYQLKER 564
                      DIILNHYQLKER
Sbjct: 541 ----------QDIILNHYQLKER 537
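
The percentages in each HSP summary line follow directly from the match counts (matches divided by alignment length). As a minimal sketch, assuming the summary line keeps the "Name = n/m (p%)" layout shown above, the figures can be re-derived as follows; the helper name `parse_hsp_stats` is hypothetical and not part of any BLAST tool:

```python
import re

def parse_hsp_stats(line):
    """Hypothetical helper: extract 'Name = n/m' pairs from an HSP summary line
    and recompute each percentage, rounded to two decimals."""
    stats = {}
    # Only fields written as 'Name = n/m' are captured; 'Query Frame = 1' is skipped.
    for name, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        n, m = int(num), int(den)
        stats[name] = (n, m, round(100 * n / m, 2))
    return stats

# Summary line of the isoform X4 hit, copied from the alignment above.
summary = "Identity = 434/536 (80.97%), Positives = 457/536 (85.26%), Query Frame = 1"
hsp = parse_hsp_stats(summary)
print(hsp["Identity"])   # (matches, alignment length, percent identity)
```

The denominator is the alignment length including gaps, which is why the isoform X4 hit uses 536 columns although its subject sequence ends at residue 510.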

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
A0A0A0KDI1_CUCSA  1.3e-248  84.89     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G149940 PE=4 SV=1
M5W7F7_PRUPE      9.2e-157  59.30     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004180mg PE=4 SV=1
A0A061GW25_THECC  1.5e-151  60.08     ARM repeat superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_041...
A0A061GVV3_THECC  1.5e-151  60.08     ARM repeat superfamily protein, putative isoform 2 OS=Theobroma cacao GN=TCM_041...
A0A067KE10_JATCU  5.8e-151  56.23     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_11230 PE=4 SV=1

Match Name   E-value   Identity  Description
AT5G22820.2  4.1e-124  48.53     ARM repeat superfamily protein

Match Name                        E-value   Identity  Description
gi|449456176|ref|XP_004145826.1|  1.9e-248  84.89     PREDICTED: uncharacterized protein LOC101215373 [Cucumis sativus]
gi|659117533|ref|XP_008458652.1|  1.4e-246  84.51     PREDICTED: uncharacterized protein LOC103497988 isoform X1 [Cucumis melo]
gi|659117537|ref|XP_008458654.1|  2.3e-241  83.40     PREDICTED: uncharacterized protein LOC103497988 isoform X3 [Cucumis melo]
gi|659117539|ref|XP_008458655.1|  4.8e-231  80.97     PREDICTED: uncharacterized protein LOC103497988 isoform X4 [Cucumis melo]
gi|659117535|ref|XP_008458653.1|  3.2e-227  77.62     PREDICTED: uncharacterized protein LOC103497988 isoform X2 [Cucumis melo]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR011989   ARM-like
IPR016024   ARM-type_fold
Vocabulary: Molecular Function
Term        Definition
GO:0005488  binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005488 binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh20G005400.1  CmaCh20G005400.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description         Source   Source Term       Source Description  Alignment
IPR011989  Armadillo-like helical  GENE3D   G3DSA:1.25.10.10                      coord: 135..405, score: 4.
IPR016024  Armadillo-type fold     unknown  SSF48371          ARM repeat          coord: 136..439, score: 3.2
None       No IPR available        PANTHER  PTHR23424         SERUM AMYLOID A     coord: 122..536, score: 4.2E-165; coord: 7..62, score: 4.2E
None       No IPR available        PANTHER  PTHR23424:SF4     PROTEIN SAAL1       coord: 122..536, score: 4.2E-165; coord: 7..62, score: 4.2E

The following gene(s) are paralogous to this gene:

None