CmaCh17G013380.1 (mRNA) Cucurbita maxima (Rimu)

NameCmaCh17G013380.1
TypemRNA
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionSmr (Small MutS-related) domain protein
LocationCma_Chr17 : 8988334 .. 8992506 (+)
Sequence length2051
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGTGGGGGAGGGGTAAATCTCCTGGCTGGGCAGCGGTTAACCTTAAGCAACAGAATAGTGGCCTTCAAGATGAAATTGACCCGGACCCATTCCCACCAATGTCTACCGCTCATTCCTTTCTGCCACCCTGTGAAAACGTACACAGAGTTAATGGTCGTTCAGGGAGGTCTTCACAAGAAAATTTTGGTGCAGAAAAGACAATACTTGGTAATTCTAGCATTCGAAGTAGCAAGAAGTTGGTTGAAGAATCCACTGATGTTTTAGCCTTCTGGAAGCTTAAAGAGCTTCATTCTTGGGCTGATATCAGCTTGATTGTGGATATAATGGAAGCTGTAAATAATAACTTCAACGAGGCGTCTAAGTTATTAAAAACAATGGTTTCTAGTGACAATTTTGAGATCAATAATGAGATGAGCACCTTAGGACTGCATTCCTCTAATGATGTATCATTGGTGAGGGGTAAATCTTCTGGCTGGGAAGAATTTAACCTTAAGCAACAAAATAGAGGCTTTCAAGATAGAATTGATCCGAAACCATTCCCACCGATGCCAAGTTCCCTTTCCTCTTTGCCACCCCGTGAAAACTTGCACGGTGTTAATGGCCGTCCAGGGGGATCCTCCTCATCTTCACCTCTTCCTTCTGCTGATTCTCTAACTTTGCCAGAAAATTACGGTGCAAAGAAAATACTTGGTGATTCTACCATTCAAAATGGAAGGAAGGTGGTTGAAGAAACCACTGACGTTTTAGCCTTTTGGAAGCTTAAGGAGCTTCATACTTGGGCTGATTTTAGCTTGATTGTGGATATAATGGAAGCTGTAGATAATAACTTCAATGAGGCGTCTACTTTTTTAAACAAAATGGTTTCTAGTGACAATGTTGAGATCTGTAACGAGATGAGCACCCTAGGACTGCATTCTGCTGATGGATTATCGTGCAATGGGAAGAATGATGTAACCATATCATCAGAAAGAACTGTTAATAATCCCATCCCTAGTTCCACACTAAAGGATGTGCAAGACATGCATCAAAATATTAATGATAAATTGTTTGAAAATAATTATCATGAAAGAAATTTCTTTCACAATGTTGGAAATCCAAAAATAGCTCTTTATTGCTCAAAGTCTGCTCCTATTGAGCCCGAGTGGGAAGAAGATGATATTTACCTGAGCCATCGGAAAGATGCTATAGCAATGATGAGGTAAATAAGTCTTCTGTCTACTTTATTATCGGGTTGTTTTTCTACTTCAATTTAATTTTTTTTTTCCTTAATAGATATATTTGTTAAAATTTCCATAACAAAAACTTATTTTTCTTTTTCCTGCTTCTTTTATCATTGTGATGCACCACTGGGTTTCTGGGGTTCTTTGTATTTATTATATATTTTTACCGTGTGTGAGAAGAACGTTTTAAAGAATTTATATGTATGGGGGATCTGGTTGTACCTTGAGTTATGAGTAGACAACAGTCTGTTTGTATGAATAAGCTAATTTATGTTGATAGCTCAAGCTTTGGAAATTAATAGAAAGTCGACGTGATAAGAGGGAGTTATTGAATCCAAGTCCATGGTAATTTTTTGTTTTTAATTCAAATTGACACTGTAATCATACTCAAAAGAAATGTTAAAAAATAATTGTTGGACAATTATTCTTGACCTAGGCTCTCTTGTATTGTGGCTTTGGTCTTATGTTGTAGTGTAAAATATATGGTTCTTGGGAGAAGGTTCAAGTACTCAATGCTATAGGAGTTGATGTCTCGCTTAATATGATCCGATTTATTTGTAATTTTCTTTAATCAATAGGTCTGCATCTCAACATTCAAGGGTAGCCACTAATGCCTATCTTCGGAAGGATCATGCTTCTGCCAAGTATCATTCATCAAGGGCTCAAGAACAATGGCTAGCTGCAAAAATGTTAAATGCTAAGGCAGCCAATGAAATTTTACAAACAAGGAATAGTGAAAATGGGCTCTGGAAGTTGGACTTACATGGGCTTCATGCAGCAGAGGCTGTTCAAGCCTTGCAAGATCACTTGCTGAAAATCGAAACTTGGAATGCCTCCAATCGGTCGTTGTCGCCAAAGAAAGCTGAAAGGAAGGGTTTCTATCGTGTTTCATCCCTTGAGTATCTTAGTTGTCTGGGCGTAAAGTTGGACAAAGAATTACAATCACCATTACCTAGGCATAGGCCGACATCATTGGAAGTCATAACAGGTAAATTGTATCTGTTGCTGTTTAACTCAACATTTGTGTTTTCTTAAAGTTAGGTTTAACAGGCAAATTTCATTCTTTTCACATTTGTTTTTCTTGAGAATGGAATCATTGTATGACAATGTAACTGATAACTTTAGAGGCTGACTACGCTTGTGTTGCCTTTAGCCTTTATCAATTCCAGAGATAGATTGGCATTTTCTACATTATTCAATACTTTTGGTTACAGTAGCACAGTGGACATTGTGGTATGCCAACCTATTAATTGAAAGTTTTCTCTCTGCTGTATCTAGGCTTCAATCCTAAACGCTCTTCAAAACCCTCCTCACGTGCTAGTATCTAGTAGTTTTTTGGCTTATTATTCCATATTGCATATAAAAATCTATCAAGATTTGTTCTTGGGTTGATAATCCACTATTGATAGGGAAGAGTTACATCTGCCACCATGCGAGAAAAGGCTGTATGAATTGTGAGAAAAAGGTTTTAAGAGTGTGGTAACCTTTCCCTAGTAGATAAGAGAATGAAGCATTCCTTATAAGGGTGTGCAAACCTCTCCATAGTGGACGCGTTTTAAAATCTTGAGGGGAAGTCCCGAATGGAAAACCCAAAGAGGACAATGTTTTCTAGCGGCGGGCTTGGGCTGTTACGAATGGTATCAAAGCCAGACACCGGGCGGTGTGCCAGCAAGGACGCTGGGTCCCCAAGGTGGTGGATTGTGAGTTCCCACATCGGTTGGAGAGGGGAACGAATCATTTCTTATAAGGGAGTAGAAACCTCTCCTTAGTGGACGCGTTTTAAAACCTTGAGGGGAAGCCCATAAGGGAAAGTTCAAAGAGGACCATGAGCTTATTTCTTTGCATTTATTGTGTTTAGATTTGCATGCGAGAGACATTCAAGTAGTCTCAATCAAACTATCTTGTGAAGTGATTCGAATCACAAGTTAAATACTCTGGATCTTGATCATCTAGGGAAGTCATTCTAAAACTTTCCCTTGGGATGATTCTTTATCGTTTTTGGGGGTTTTTGGATAATTTCGATCTAAAAGTCTCATTCTTCAACAAGATCGTTAGATCAAATCTCAAGAGTTTCATAACCCTTGGTGCATCTTACCCATGCGTGTACTTGGAGAAAGGCACTTGGGGAAGTAATGAGCTATTAGTTAGCTGGGGATGGCCGGGTGGATGCATGTTTCGCGACTGCGGGTCCCATTTGTAATGAAGGACAAAATGGAAATCTGCTTAGCTGAGGCCAGGAGGCTATATATATATTAGATATAACAATGAGTTTCATCATAGGTTGATTTGACTGAAGGGAAAGATTTAAATGTCGGTGACGCTAGCATATTTTTTTCGTAAAATAACGAATTCTCATGTGCACTACTGATTCAAACATTGACGATTGTACTCATGATGTTTTGTTCGTTCTCAAAGGAATAGGTAAACATAGCAGGGGGGAGGCTGCTCTACCAAAGGCCGTGACAAGTTTTCTTAGTGAAAATGGGTGAGTCTACTTTTCTTTCCATTTTGACTTTAACCAAAAAGAAGGGTAAAAATGTTTATAGATTCATTCTACTTCAACCACTTGATAATTTGCAGGTACCGTTTTGAACAGTTGAGGCCTGGAACGATCAGCGTTCGACCGAAGTTTCGTAGGTAAACGACTCTCCCTTATTAGTAGTAGCTTCAAAGTTATTCAGTCAGAACGTAGATTAGAGTTGTTGGAGGTTGGGTAAGTGAAACGTGTAAAATGACTGAAAATGTTCGTTATTTAGAAGGAGCTCTGTAGGATTGGTTGTTAACCACATTACTGCATTGTTATTTTGTTGTCTTTCCTGGAAAAGATAGCACCTAGGAAGAAGAGGTTAGAAGAACTCTGGAATCCTGGGATTGTATTCTATTATGATTCATTATGGCGATTGCACTAGTTGAGTCTACTTCTCTAATCCATTGAAAATATAATAGCTG

mRNA sequence

ATGTCGTGGGGGAGGGGTAAATCTCCTGGCTGGGCAGCGGTTAACCTTAAGCAACAGAATAGTGGCCTTCAAGATGAAATTGACCCGGACCCATTCCCACCAATGTCTACCGCTCATTCCTTTCTGCCACCCTGTGAAAACGTACACAGAGTTAATGGTCGTTCAGGGAGGTCTTCACAAGAAAATTTTGGTGCAGAAAAGACAATACTTGGTAATTCTAGCATTCGAAGTAGCAAGAAGTTGGTTGAAGAATCCACTGATGTTTTAGCCTTCTGGAAGCTTAAAGAGCTTCATTCTTGGGCTGATATCAGCTTGATTGTGGATATAATGGAAGCTGTAAATAATAACTTCAACGAGGCGTCTAAGTTATTAAAAACAATGGTTTCTAGTGACAATTTTGAGATCAATAATGAGATGAGCACCTTAGGACTGCATTCCTCTAATGATGTATCATTGGTGAGGGGTAAATCTTCTGGCTGGGAAGAATTTAACCTTAAGCAACAAAATAGAGGCTTTCAAGATAGAATTGATCCGAAACCATTCCCACCGATGCCAAGTTCCCTTTCCTCTTTGCCACCCCGTGAAAACTTGCACGGTGTTAATGGCCGTCCAGGGGGATCCTCCTCATCTTCACCTCTTCCTTCTGCTGATTCTCTAACTTTGCCAGAAAATTACGGTGCAAAGAAAATACTTGGTGATTCTACCATTCAAAATGGAAGGAAGGTGGTTGAAGAAACCACTGACGTTTTAGCCTTTTGGAAGCTTAAGGAGCTTCATACTTGGGCTGATTTTAGCTTGATTGTGGATATAATGGAAGCTGTAGATAATAACTTCAATGAGGCGTCTACTTTTTTAAACAAAATGGTTTCTAGTGACAATGTTGAGATCTGTAACGAGATGAGCACCCTAGGACTGCATTCTGCTGATGGATTATCGTGCAATGGGAAGAATGATGTAACCATATCATCAGAAAGAACTGTTAATAATCCCATCCCTAGTTCCACACTAAAGGATGTGCAAGACATGCATCAAAATATTAATGATAAATTGTTTGAAAATAATTATCATGAAAGAAATTTCTTTCACAATGTTGGAAATCCAAAAATAGCTCTTTATTGCTCAAAGTCTGCTCCTATTGAGCCCGAGTGGGAAGAAGATGATATTTACCTGAGCCATCGGAAAGATGCTATAGCAATGATGAGGTCTGCATCTCAACATTCAAGGGTAGCCACTAATGCCTATCTTCGGAAGGATCATGCTTCTGCCAAGTATCATTCATCAAGGGCTCAAGAACAATGGCTAGCTGCAAAAATGTTAAATGCTAAGGCAGCCAATGAAATTTTACAAACAAGGAATAGTGAAAATGGGCTCTGGAAGTTGGACTTACATGGGCTTCATGCAGCAGAGGCTGTTCAAGCCTTGCAAGATCACTTGCTGAAAATCGAAACTTGGAATGCCTCCAATCGGTCGTTGTCGCCAAAGAAAGCTGAAAGGAAGGGTTTCTATCGTGTTTCATCCCTTGAGTATCTTAGTTGTCTGGGCGTAAAGTTGGACAAAGAATTACAATCACCATTACCTAGGCATAGGCCGACATCATTGGAAGTCATAACAGGAATAGGTAAACATAGCAGGGGGGAGGCTGCTCTACCAAAGGCCGTGACAAGTTTTCTTAGTGAAAATGGGTACCGTTTTGAACAGTTGAGGCCTGGAACGATCAGCGTTCGACCGAAGTTTCGTAGGTAAACGACTCTCCCTTATTAGTAGTAGCTTCAAAGTTATTCAGTCAGAACGTAGATTAGAGTTGTTGGAGGTTGGGTAAGTGAAACGTGTAAAATGACTGAAAATGTTCGTTATTTAGAAGGAGCTCTGTAGGATTGGTTGTTAACCACATTACTGCATTGTTATTTTGTTGTCTTTCCTGGAAAAGATAGCACCTAGGAAGAAGAGGTTAGAAGAACTCTGGAATCCTGGGATTGTATTCTATTATGATTCATTATGGCGATTGCACTAGTTGAGTCTACTTCTCTAATCCATTGAAAATATAATAGCTG

Coding sequence (CDS)

ATGTCGTGGGGGAGGGGTAAATCTCCTGGCTGGGCAGCGGTTAACCTTAAGCAACAGAATAGTGGCCTTCAAGATGAAATTGACCCGGACCCATTCCCACCAATGTCTACCGCTCATTCCTTTCTGCCACCCTGTGAAAACGTACACAGAGTTAATGGTCGTTCAGGGAGGTCTTCACAAGAAAATTTTGGTGCAGAAAAGACAATACTTGGTAATTCTAGCATTCGAAGTAGCAAGAAGTTGGTTGAAGAATCCACTGATGTTTTAGCCTTCTGGAAGCTTAAAGAGCTTCATTCTTGGGCTGATATCAGCTTGATTGTGGATATAATGGAAGCTGTAAATAATAACTTCAACGAGGCGTCTAAGTTATTAAAAACAATGGTTTCTAGTGACAATTTTGAGATCAATAATGAGATGAGCACCTTAGGACTGCATTCCTCTAATGATGTATCATTGGTGAGGGGTAAATCTTCTGGCTGGGAAGAATTTAACCTTAAGCAACAAAATAGAGGCTTTCAAGATAGAATTGATCCGAAACCATTCCCACCGATGCCAAGTTCCCTTTCCTCTTTGCCACCCCGTGAAAACTTGCACGGTGTTAATGGCCGTCCAGGGGGATCCTCCTCATCTTCACCTCTTCCTTCTGCTGATTCTCTAACTTTGCCAGAAAATTACGGTGCAAAGAAAATACTTGGTGATTCTACCATTCAAAATGGAAGGAAGGTGGTTGAAGAAACCACTGACGTTTTAGCCTTTTGGAAGCTTAAGGAGCTTCATACTTGGGCTGATTTTAGCTTGATTGTGGATATAATGGAAGCTGTAGATAATAACTTCAATGAGGCGTCTACTTTTTTAAACAAAATGGTTTCTAGTGACAATGTTGAGATCTGTAACGAGATGAGCACCCTAGGACTGCATTCTGCTGATGGATTATCGTGCAATGGGAAGAATGATGTAACCATATCATCAGAAAGAACTGTTAATAATCCCATCCCTAGTTCCACACTAAAGGATGTGCAAGACATGCATCAAAATATTAATGATAAATTGTTTGAAAATAATTATCATGAAAGAAATTTCTTTCACAATGTTGGAAATCCAAAAATAGCTCTTTATTGCTCAAAGTCTGCTCCTATTGAGCCCGAGTGGGAAGAAGATGATATTTACCTGAGCCATCGGAAAGATGCTATAGCAATGATGAGGTCTGCATCTCAACATTCAAGGGTAGCCACTAATGCCTATCTTCGGAAGGATCATGCTTCTGCCAAGTATCATTCATCAAGGGCTCAAGAACAATGGCTAGCTGCAAAAATGTTAAATGCTAAGGCAGCCAATGAAATTTTACAAACAAGGAATAGTGAAAATGGGCTCTGGAAGTTGGACTTACATGGGCTTCATGCAGCAGAGGCTGTTCAAGCCTTGCAAGATCACTTGCTGAAAATCGAAACTTGGAATGCCTCCAATCGGTCGTTGTCGCCAAAGAAAGCTGAAAGGAAGGGTTTCTATCGTGTTTCATCCCTTGAGTATCTTAGTTGTCTGGGCGTAAAGTTGGACAAAGAATTACAATCACCATTACCTAGGCATAGGCCGACATCATTGGAAGTCATAACAGGAATAGGTAAACATAGCAGGGGGGAGGCTGCTCTACCAAAGGCCGTGACAAGTTTTCTTAGTGAAAATGGGTACCGTTTTGAACAGTTGAGGCCTGGAACGATCAGCGTTCGACCGAAGTTTCGTAGGTAA

Protein sequence

MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRSSQENFGAEKTILGNSSIRSSKKLVEESTDVLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGVNGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKGFYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
BLAST of CmaCh17G013380.1 vs. TrEMBL
Match: A0A0A0KA90_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G374690 PE=4 SV=1)

HSP 1 Score: 865.9 bits (2236), Expect = 2.7e-248
Identity = 457/610 (74.92%), Postives = 500/610 (81.97%), Query Frame = 1

Query: 1   MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRS-- 60
           MSW RGKS GWAA NLKQQN+GLQDE+D DPFPPMST  S LPP EN+  VNG SG+S  
Sbjct: 1   MSWVRGKSSGWAAFNLKQQNNGLQDEVDRDPFPPMSTTLSSLPPRENLRGVNGHSGKSFS 60

Query: 61  ---------------------SQENFGAEKTILGNSSIRSSKKLVEESTDVLAFWKLKEL 120
                                +  NFGA+KTILG ++I+S KKLVEE+ DVL+FWKLKEL
Sbjct: 61  LAPIPSADSPTLPVKFGAKKTTLGNFGAKKTILGGTNIQSGKKLVEETNDVLSFWKLKEL 120

Query: 121 HSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKS 180
           H WADISLI+DIMEAVNN+FNEAS LL TMVSSDN EINN+MSTLGLHSSND+  + GKS
Sbjct: 121 HPWADISLIMDIMEAVNNDFNEASTLLNTMVSSDNLEINNKMSTLGLHSSNDLLWMAGKS 180

Query: 181 SGWEEFNLKQQNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGVNGRPGGSSSSSPLPSAD 240
            GWEEFNLKQ N+G QD +D + FPPM ++ SSLPP ENLHGV GR G S +S PLPS D
Sbjct: 181 PGWEEFNLKQHNKGLQDEMDLEAFPPMLTNRSSLPPYENLHGVYGRSGRSFASEPLPSVD 240

Query: 241 SLTLPENYGAKK-ILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDN 300
           SLT PENYGAK  I  DS+IQ+G+KVVEE TDVLAFWKLKE+H+WADFSLIVDIM+AV+N
Sbjct: 241 SLTSPENYGAKNTIADDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN 300

Query: 301 NFNEASTFLNKMVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTL 360
           NF+EAST L  MVSSDN EI NE+STLGLHSA+ L CNG NDV+I+SER +N PI SST+
Sbjct: 301 NFDEASTLLKTMVSSDNFEINNEISTLGLHSANDLLCNGNNDVSIASERMINAPILSSTV 360

Query: 361 KDVQDMHQNIN------DKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYL 420
           K VQ +HQN N       KLF N+Y ERN FHN GN KIAL CSKS PIEPEWEEDDIYL
Sbjct: 361 KAVQGIHQNNNTSREDYTKLFANDYFERNSFHNTGNSKIALGCSKSVPIEPEWEEDDIYL 420

Query: 421 SHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQT 480
           SHRKDAIAMMRSASQHSR ATNAY RKDHASAKYHSSRA+EQWLAAKMLN KAANEILQT
Sbjct: 421 SHRKDAIAMMRSASQHSRAATNAYRRKDHASAKYHSSRAEEQWLAAKMLNDKAANEILQT 480

Query: 481 RNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKGFYRVSSLEYL 540
           RNS+NGLWKLDLHGLHAAEAVQAL DHLLKIET NASNRSLSPKKAERKGF R SSLEYL
Sbjct: 481 RNSKNGLWKLDLHGLHAAEAVQALHDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYL 540

Query: 541 SCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPG 581
           SC+  KLDKE  SP  RHRPTSLEVITGIGKHS+GEAALPKAV SFL+ENGYRFEQ RPG
Sbjct: 541 SCMESKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVASFLTENGYRFEQTRPG 600

BLAST of CmaCh17G013380.1 vs. TrEMBL
Match: F6HGK1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0130g00370 PE=4 SV=1)

HSP 1 Score: 374.0 bits (959), Expect = 3.3e-100
Identity = 229/443 (51.69%), Postives = 290/443 (65.46%), Query Frame = 1

Query: 150 VSLVRGKSSGWEEFNLKQ-QNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGVNGRPGGSS 209
           +S   GKS GW  F+LKQ Q +G +  +D +P+PP+PSS +SL P  N    NG  G S 
Sbjct: 1   MSSASGKSPGWAAFDLKQRQKQGLEPELDKEPYPPIPSSFTSLRPCRN-SASNGCSGRSF 60

Query: 210 SSSPLPSADSLTLPENYGAKKIL--GDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSL 269
           SS  +PS +  TL EN   KK +  G+S  +   KV E +  V+AF KLKEL++WAD SL
Sbjct: 61  SSLLVPSVNFPTLEENKDCKKPMQGGNSGNKQQTKVAEVSNLVIAFNKLKELYSWADNSL 120

Query: 270 IVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERT 329
           I DIM AVDN+ ++AST L  MVS+ + E   E S + L+S  G   N   +  + ++  
Sbjct: 121 IEDIMAAVDNDIDKASTLLGAMVSTGSFEENKETSIVELNSTSG---NPYENCKLQADNG 180

Query: 330 VNNPIPSSTLKDVQDMHQNINDKLFENN--------YHERNFFHNVGNPKIALYCSKSAP 389
           V   + + T+  + ++   I D L +NN           +N F +  +  + L   KS P
Sbjct: 181 VF--LGNGTV--LSELSSTIGDLLIDNNKGLTDECGSSGKNLFDDAADMTLILGRMKSIP 240

Query: 390 IEPEWEEDDIYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKM 449
           IEPEWEEDD+YLSHRKDAI  MRSASQHSR ATNA+LR DH SAK  S +A+++W+ A+ 
Sbjct: 241 IEPEWEEDDVYLSHRKDAIRFMRSASQHSRAATNAFLRGDHVSAKQFSLKAKDEWVKAER 300

Query: 450 LNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAER 509
           LN+KAANEIL  RNS N LWKLDLHGLHAAEAVQALQ+HL KIET    NRS+SP +A+ 
Sbjct: 301 LNSKAANEILDIRNSNNDLWKLDLHGLHAAEAVQALQEHLWKIETQMPFNRSVSPNRAKT 360

Query: 510 K-GFYRVSSLEYLSCL-GVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSF 569
           K G  R  SLE  SC+   +LDK  Q  L R RPTSL+VITG G HSRG+AALP AV SF
Sbjct: 361 KVGILRSPSLESFSCVDNEELDK--QWTLSRQRPTSLQVITGRGNHSRGQAALPTAVRSF 420

Query: 570 LSENGYRFEQLRPGTISVRPKFR 580
           L+E+GYRFE+ RPG I+VRPKFR
Sbjct: 421 LNEHGYRFEEARPGVIAVRPKFR 433

BLAST of CmaCh17G013380.1 vs. TrEMBL
Match: A0A061DK57_THECC (Smr (Small MutS Related) domain-containing protein, putative isoform 1 OS=Theobroma cacao GN=TCM_001646 PE=4 SV=1)

HSP 1 Score: 347.4 bits (890), Expect = 3.3e-92
Identity = 212/432 (49.07%), Postives = 266/432 (61.57%), Query Frame = 1

Query: 154 RGKSSGWEEFNLKQ-QNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGVNGRPGGSSSSSP 213
           +G+SSGW  F+LKQ Q +G     +  PFPPMP+SL ++ P  NL   N     S SS  
Sbjct: 18  KGESSGWSAFDLKQRQKQGLVPETEDDPFPPMPNSLPAICPCINLAKSNDLSARSFSSVL 77

Query: 214 LPSADSLTLPEN--YGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDI 273
            PS +  T  +N  Y     +G     +G KVVE+  + LA  KLKELH WA+ SLI D+
Sbjct: 78  KPSDNFPTSKQNKDYTKPINMGKPIENDGDKVVEQNNNNLALKKLKELHCWAENSLIEDL 137

Query: 274 MEAVDNNFNEASTFLNKMVSSDNVEICNEMSTLGLHSA-DGLSCNGKNDVTISSERTVNN 333
           + A D + +EAS  L  M+S    E   E     + SA      N   D  IS+ +T   
Sbjct: 138 LLAADGDVHEASALLKGMMSISGTEDIKETKNNEMSSAISDFPGNAYCDREISTGKTAKL 197

Query: 334 PIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIY 393
              SS   + +D    + D       HE   F    N K+ L    S P EPEWEEDD+Y
Sbjct: 198 VCQSSKADEREDNLDKLTDM------HENKLFDGASNMKLILGQLTSIPFEPEWEEDDVY 257

Query: 394 LSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQ 453
           LSHRKDAI MMRSASQHSR A+NA+LR DH +A+ HS  A+E+WLAA+ LNAKAA+EIL+
Sbjct: 258 LSHRKDAIRMMRSASQHSRAASNAFLRGDHVAAQQHSQNAREEWLAAQRLNAKAASEILR 317

Query: 454 TRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKG-FYRVSSLE 513
            RNS+N LWKLDLHGLHAAEAVQAL +HL ++ET   + RS+SP + +        SS+E
Sbjct: 318 IRNSDNDLWKLDLHGLHAAEAVQALHEHLRRLETQVPAGRSVSPNRFKANNRIVHSSSVE 377

Query: 514 YLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLR 573
             S +  KLDK+  S   R RPTSL+VITG+G HSRG+AALP AV SFL ENGYRF++ R
Sbjct: 378 TFSSMD-KLDKQQTS--SRQRPTSLQVITGVGNHSRGQAALPAAVRSFLIENGYRFDEAR 437

Query: 574 PGTISVRPKFRR 581
           PG I+VRPKFRR
Sbjct: 438 PGLITVRPKFRR 440

BLAST of CmaCh17G013380.1 vs. TrEMBL
Match: M5WI32_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006112mg PE=4 SV=1)

HSP 1 Score: 327.0 bits (837), Expect = 4.6e-86
Identity = 205/440 (46.59%), Postives = 273/440 (62.05%), Query Frame = 1

Query: 156 KSSGWEEFNLKQ-QNRGFQDRIDPKPFPPMPSSLSSLPPRENL---HGVNGRPGGSSSSS 215
           KS GW  F+LKQ Q +G + + D   FPP+ ++L SL P EN+   + ++GRP     S 
Sbjct: 7   KSGGWAAFDLKQRQKQGLEPQTDTDHFPPILTTLPSLHPCENVSRNNDLSGRP----FSC 66

Query: 216 PLPSADSLTLPENYGAKKIL--GDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVD 275
            L   D  T  EN   K+ L  GDS+   G  + +  +      K+ +L+ WAD SLI D
Sbjct: 67  VLHPVDFPTSTENRDGKRPLLYGDSS---GTSMEDNRSSKK---KIMDLYPWADDSLIED 126

Query: 276 IMEAVDNNFNEASTFLNKMVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNN 335
           IM AV ++  +AST L  MVS  + E   E         D    N  +D+   S++T + 
Sbjct: 127 IMAAVGDDITKASTLLKAMVSPSSFEENKE--------TDISKINSNSDI-YQSDKTKHT 186

Query: 336 PIPSSTLKDVQDMHQNINDKLFENNY-----HE---RNFFHNVGNPKIALYCSKSAPIEP 395
             P  +  D+ D++      L ENN      H+   +N  ++    K+ L   +S P+EP
Sbjct: 187 SFPLESAADIADLNSTFEKCLEENNIELLNAHDFCGKNLPNDAATMKLTLGSLESVPVEP 246

Query: 396 EWEEDDIYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNA 455
           EWEEDD+YL HRKDA+ MMRSASQHS+ ATNA++R DH SA+ HS++A+E+WLAA+ LN 
Sbjct: 247 EWEEDDVYLRHRKDALRMMRSASQHSKAATNAFVRGDHFSAQRHSNKAREEWLAAESLNN 306

Query: 456 KAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAE-RKG 515
           KAA +IL  RNS+N +WKLDLHGLHA+EA+QAL++HL +IET   SN S+SP K    K 
Sbjct: 307 KAAKKILNIRNSKNDVWKLDLHGLHASEAIQALREHLQRIETKVLSNHSVSPNKVRMEKR 366

Query: 516 FYRVSSLEYLSCLGV-KLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSE 575
             R SSLE  +C+   KLD+  Q      RPTSL+VITGIG HSRG+AALP AV SFL++
Sbjct: 367 IIRSSSLESFNCMDTEKLDQ--QKAPSTQRPTSLQVITGIGNHSRGQAALPTAVGSFLND 425

Query: 576 NGYRFEQLRPGTISVRPKFR 580
           NGYRFE+LRPG I+VRPKFR
Sbjct: 427 NGYRFEELRPGVITVRPKFR 425

BLAST of CmaCh17G013380.1 vs. TrEMBL
Match: A0A0D2TIK4_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_012G048200 PE=4 SV=1)

HSP 1 Score: 323.6 bits (828), Expect = 5.1e-85
Identity = 203/432 (46.99%), Postives = 264/432 (61.11%), Query Frame = 1

Query: 154 RGKSSGWEEFNLKQ-QNRGFQDRIDPKPFPPMP-SSLSSLPPRENLHGVNGRPGGSSSSS 213
           + KSSGW  F+LKQ Q +      +  PFPP+  ++L   P + N + ++ R   S SS 
Sbjct: 18  KDKSSGWTAFDLKQRQKQALVPETENDPFPPVAMAALHPCPSQTNCNDLSAR---SFSSV 77

Query: 214 PLPSADSLTLPENYGA-KKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDI 273
             PS+D  TL +N  + K I     I N  K+     + LA  KLKE+H WA+ SLI DI
Sbjct: 78  LKPSSDFPTLKQNKNSIKSINVGKPIGNEDKIAGVNNNDLALKKLKEIHCWAENSLIEDI 137

Query: 274 MEAVDNNFNEASTFLNKMVSSDNVEICNEMSTLGLHSA-DGLSCNGKNDVTISSERTVNN 333
           + A +N+ +EAS  L +M+   + E  ++     + SA      N   D+ + S +T ++
Sbjct: 138 LLATNNDIHEASALLKQMMPRSSTEEIDKAKNNEMGSAIANFPSNANCDICLPSGKTADH 197

Query: 334 PIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIY 393
              SS   + ++  + + D       HE   F +  N K+ L    S PIEPEWEEDD+Y
Sbjct: 198 VGQSSKANEREENLKILTD------VHENKLFDDHSNMKLILGQLTSIPIEPEWEEDDVY 257

Query: 394 LSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQ 453
           LSHRKDAI MMRSASQHSR ATNA+LR DH SA+ HS  A+E+WLAA+ LNAKAA EIL 
Sbjct: 258 LSHRKDAIRMMRSASQHSRAATNAFLRGDHFSAQQHSQNAREEWLAAQRLNAKAAREILS 317

Query: 454 TRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAE-RKGFYRVSSLE 513
            RNS+N LWKLDLHGLHAAEAVQAL +HL ++ET  +   S+SP   +   G  R SS+ 
Sbjct: 318 IRNSDNDLWKLDLHGLHAAEAVQALHEHLRRLETRVSGGCSVSPNGVKANNGIVRSSSVG 377

Query: 514 YLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLR 573
            +S +  KL K   S   R  P SLEVITG+G HSRG+AALP AV  FL ENGYRF++ R
Sbjct: 378 TISSMD-KLGKPQTS--SRQVPASLEVITGVGNHSRGQAALPTAVRGFLIENGYRFDETR 437

Query: 574 PGTISVRPKFRR 581
           PG I+VRPKFRR
Sbjct: 438 PGLITVRPKFRR 437

BLAST of CmaCh17G013380.1 vs. TAIR10
Match: AT5G23520.1 (AT5G23520.1 smr (Small MutS Related) domain-containing protein)

HSP 1 Score: 279.3 bits (713), Expect = 5.6e-75
Identity = 187/449 (41.65%), Postives = 260/449 (57.91%), Query Frame = 1

Query: 150 VSLVRGKSSGWEEFNLKQ-QNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGVNGR----- 209
           +S ++GKSSGW  F+LKQ Q +G +  ++  PFPP+ +S+++        GV GR     
Sbjct: 1   MSWMKGKSSGWTAFDLKQRQKQGLESEVEGDPFPPVSTSVNAS------FGVRGRLRRNH 60

Query: 210 -PGGSSSSSPL--PSA-DSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELH 269
            P   S SS L  PS   +LT  ++ G ++  G    +     +   +  LAF KLKE++
Sbjct: 61  EPSEKSFSSVLLPPSRFPALTENKDCGNQERGGCCRRKPDTLSLPVNSHDLAFTKLKEMN 120

Query: 270 TWADFSLIVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEMSTLGLHSADGLSCNGKNDV 329
           +WAD +LI D++ + +++F  A  FL  MVSS   +   E  T  +   +G S + +   
Sbjct: 121 SWADDNLIRDVLLSTEDDFEMALAFLKGMVSSGKED---EEPTSKI---EGYSSDNRRSE 180

Query: 330 TISSERTVNNPIPS---STLKDV--QDMHQNINDKLFENNYHERNFFHNVGNPKIALYCS 389
             + E+TV + +     ST +D    D+  +       N      F  ++      +   
Sbjct: 181 YRTFEKTVTSSVKMAARSTFEDAGKYDLENSDGSSFLVNASDNEKFPDDISELDSIIQRL 240

Query: 390 KSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWL 449
           +S PIEPEWEEDD+YLSHRKDA+ +MRSAS HSR A NA+ R DHASAK HS +A+E WL
Sbjct: 241 QSIPIEPEWEEDDLYLSHRKDALKVMRSASNHSRAAQNAFQRYDHASAKQHSDKAREDWL 300

Query: 450 AAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPK 509
           AA+ LNA+AA +I+   N +N +WKLDLHGLHA EAVQALQ+ L  IE     NRS+SP 
Sbjct: 301 AAEKLNAEAAKKIIGITNKDNDIWKLDLHGLHATEAVQALQERLQMIEGHFTVNRSVSPN 360

Query: 510 KAERK-GFYRVSSLEYLSCLGVKLDKE---LQSPLPRHRPTSLEVITGIGKHSRGEAALP 569
           +   K    R +S E       +LD+E    Q    R    SL+VITGIGKHSRG+A+LP
Sbjct: 361 RGRSKNAALRSASQEPFG----RLDEEGMHCQRTSSRELRNSLQVITGIGKHSRGQASLP 420

Query: 570 KAVTSFLSENGYRFEQLRPGTISVRPKFR 580
            AV +F  +N YRF++ RPG I+VRPKFR
Sbjct: 421 LAVKTFFEDNRYRFDETRPGVITVRPKFR 433

BLAST of CmaCh17G013380.1 vs. NCBI nr
Match: gi|659100734|ref|XP_008451240.1| (PREDICTED: uncharacterized protein LOC103492590 [Cucumis melo])

HSP 1 Score: 868.6 bits (2243), Expect = 6.1e-249
Identity = 461/610 (75.57%), Postives = 498/610 (81.64%), Query Frame = 1

Query: 1   MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRS-- 60
           MSW RGKS GWAA NLKQQN+G+QDE+D DPFPPMST  S LPP EN+  VNGRSGRS  
Sbjct: 1   MSWVRGKSSGWAAFNLKQQNNGIQDEVDGDPFPPMSTTLSSLPPRENLRGVNGRSGRSFS 60

Query: 61  ---------------------SQENFGAEKTILGNSSIRSSKKLVEESTDVLAFWKLKEL 120
                                +  NF A+KTILG S+I+S KK+VEE+ DVL+FWKLKEL
Sbjct: 61  FAPIPSADSPTLPGKCGAKKTTLGNFSAKKTILGASNIQSGKKMVEETNDVLSFWKLKEL 120

Query: 121 HSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKS 180
           H WADISLI+DIMEAVNN+FNEAS LL TMVSSDN EINNEMS LGLHSSND+S + GKS
Sbjct: 121 HPWADISLIMDIMEAVNNDFNEASTLLNTMVSSDNLEINNEMSNLGLHSSNDLSWMMGKS 180

Query: 181 SGWEEFNLKQQNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGVNGRPGGSSSSSPLPSAD 240
            GWEEFNL+Q NRG Q   DP+ FPPM ++  SLPP ENLHGV GR G S +S PLPSAD
Sbjct: 181 PGWEEFNLQQHNRGLQGEKDPEAFPPMLTNHPSLPPYENLHGVYGRLGRSFASEPLPSAD 240

Query: 241 SLTLPENYGAKKIL-GDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDN 300
           SLT P NYGAK  +  DS IQ+G+KVVEE TDVLAFWKLKE+H+WADFSLIVDIM+AV+N
Sbjct: 241 SLTSPGNYGAKNTIPDDSGIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN 300

Query: 301 NFNEASTFLNKMVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTL 360
           NF+EAST L  MVSSDN EI NE+STLGLHSA+ L CNG NDV+ISSERT+N PI S TL
Sbjct: 301 NFDEASTLLKTMVSSDNFEINNEISTLGLHSANDLLCNGDNDVSISSERTINGPILSPTL 360

Query: 361 KDVQDMHQNIND------KLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYL 420
           K  Q MHQN N       KLF N+Y ERNFF N GN KIAL CSKS PIEPEWEEDDIYL
Sbjct: 361 KAAQGMHQNDNTGGEDCTKLFVNDYFERNFFPNAGNSKIALGCSKSVPIEPEWEEDDIYL 420

Query: 421 SHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQT 480
           SHRKDAIAMMRSASQHSR ATNAY RKDHASAKYHSSRAQEQWLAAKMLN KAANEILQT
Sbjct: 421 SHRKDAIAMMRSASQHSRAATNAYRRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILQT 480

Query: 481 RNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKGFYRVSSLEYL 540
           RNS+NGLWKLDLHGLHAAEAVQALQDHLLKIET NASNRSLSPKKAERKGF R SSLEYL
Sbjct: 481 RNSKNGLWKLDLHGLHAAEAVQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYL 540

Query: 541 SCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPG 581
           SC+  KLDKE  SP  RHRPTSLEVITGIGKHS+GEAALPKAVTSFL+ENGYRFEQ RPG
Sbjct: 541 SCMDAKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQTRPG 600

BLAST of CmaCh17G013380.1 vs. NCBI nr
Match: gi|449462475|ref|XP_004148966.1| (PREDICTED: uncharacterized protein LOC101223137 [Cucumis sativus])

HSP 1 Score: 865.9 bits (2236), Expect = 3.9e-248
Identity = 457/610 (74.92%), Postives = 500/610 (81.97%), Query Frame = 1

Query: 1   MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRS-- 60
           MSW RGKS GWAA NLKQQN+GLQDE+D DPFPPMST  S LPP EN+  VNG SG+S  
Sbjct: 1   MSWVRGKSSGWAAFNLKQQNNGLQDEVDRDPFPPMSTTLSSLPPRENLRGVNGHSGKSFS 60

Query: 61  ---------------------SQENFGAEKTILGNSSIRSSKKLVEESTDVLAFWKLKEL 120
                                +  NFGA+KTILG ++I+S KKLVEE+ DVL+FWKLKEL
Sbjct: 61  LAPIPSADSPTLPVKFGAKKTTLGNFGAKKTILGGTNIQSGKKLVEETNDVLSFWKLKEL 120

Query: 121 HSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKS 180
           H WADISLI+DIMEAVNN+FNEAS LL TMVSSDN EINN+MSTLGLHSSND+  + GKS
Sbjct: 121 HPWADISLIMDIMEAVNNDFNEASTLLNTMVSSDNLEINNKMSTLGLHSSNDLLWMAGKS 180

Query: 181 SGWEEFNLKQQNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGVNGRPGGSSSSSPLPSAD 240
            GWEEFNLKQ N+G QD +D + FPPM ++ SSLPP ENLHGV GR G S +S PLPS D
Sbjct: 181 PGWEEFNLKQHNKGLQDEMDLEAFPPMLTNRSSLPPYENLHGVYGRSGRSFASEPLPSVD 240

Query: 241 SLTLPENYGAKK-ILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDN 300
           SLT PENYGAK  I  DS+IQ+G+KVVEE TDVLAFWKLKE+H+WADFSLIVDIM+AV+N
Sbjct: 241 SLTSPENYGAKNTIADDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN 300

Query: 301 NFNEASTFLNKMVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTL 360
           NF+EAST L  MVSSDN EI NE+STLGLHSA+ L CNG NDV+I+SER +N PI SST+
Sbjct: 301 NFDEASTLLKTMVSSDNFEINNEISTLGLHSANDLLCNGNNDVSIASERMINAPILSSTV 360

Query: 361 KDVQDMHQNIN------DKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYL 420
           K VQ +HQN N       KLF N+Y ERN FHN GN KIAL CSKS PIEPEWEEDDIYL
Sbjct: 361 KAVQGIHQNNNTSREDYTKLFANDYFERNSFHNTGNSKIALGCSKSVPIEPEWEEDDIYL 420

Query: 421 SHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQT 480
           SHRKDAIAMMRSASQHSR ATNAY RKDHASAKYHSSRA+EQWLAAKMLN KAANEILQT
Sbjct: 421 SHRKDAIAMMRSASQHSRAATNAYRRKDHASAKYHSSRAEEQWLAAKMLNDKAANEILQT 480

Query: 481 RNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKGFYRVSSLEYL 540
           RNS+NGLWKLDLHGLHAAEAVQAL DHLLKIET NASNRSLSPKKAERKGF R SSLEYL
Sbjct: 481 RNSKNGLWKLDLHGLHAAEAVQALHDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYL 540

Query: 541 SCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPG 581
           SC+  KLDKE  SP  RHRPTSLEVITGIGKHS+GEAALPKAV SFL+ENGYRFEQ RPG
Sbjct: 541 SCMESKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVASFLTENGYRFEQTRPG 600

BLAST of CmaCh17G013380.1 vs. NCBI nr
Match: gi|225463171|ref|XP_002267329.1| (PREDICTED: uncharacterized protein LOC100263151 [Vitis vinifera])

HSP 1 Score: 374.0 bits (959), Expect = 4.7e-100
Identity = 229/443 (51.69%), Postives = 290/443 (65.46%), Query Frame = 1

Query: 150 VSLVRGKSSGWEEFNLKQ-QNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGVNGRPGGSS 209
           +S   GKS GW  F+LKQ Q +G +  +D +P+PP+PSS +SL P  N    NG  G S 
Sbjct: 1   MSSASGKSPGWAAFDLKQRQKQGLEPELDKEPYPPIPSSFTSLRPCRN-SASNGCSGRSF 60

Query: 210 SSSPLPSADSLTLPENYGAKKIL--GDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSL 269
           SS  +PS +  TL EN   KK +  G+S  +   KV E +  V+AF KLKEL++WAD SL
Sbjct: 61  SSLLVPSVNFPTLEENKDCKKPMQGGNSGNKQQTKVAEVSNLVIAFNKLKELYSWADNSL 120

Query: 270 IVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERT 329
           I DIM AVDN+ ++AST L  MVS+ + E   E S + L+S  G   N   +  + ++  
Sbjct: 121 IEDIMAAVDNDIDKASTLLGAMVSTGSFEENKETSIVELNSTSG---NPYENCKLQADNG 180

Query: 330 VNNPIPSSTLKDVQDMHQNINDKLFENN--------YHERNFFHNVGNPKIALYCSKSAP 389
           V   + + T+  + ++   I D L +NN           +N F +  +  + L   KS P
Sbjct: 181 VF--LGNGTV--LSELSSTIGDLLIDNNKGLTDECGSSGKNLFDDAADMTLILGRMKSIP 240

Query: 390 IEPEWEEDDIYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKM 449
           IEPEWEEDD+YLSHRKDAI  MRSASQHSR ATNA+LR DH SAK  S +A+++W+ A+ 
Sbjct: 241 IEPEWEEDDVYLSHRKDAIRFMRSASQHSRAATNAFLRGDHVSAKQFSLKAKDEWVKAER 300

Query: 450 LNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAER 509
           LN+KAANEIL  RNS N LWKLDLHGLHAAEAVQALQ+HL KIET    NRS+SP +A+ 
Sbjct: 301 LNSKAANEILDIRNSNNDLWKLDLHGLHAAEAVQALQEHLWKIETQMPFNRSVSPNRAKT 360

Query: 510 K-GFYRVSSLEYLSCL-GVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSF 569
           K G  R  SLE  SC+   +LDK  Q  L R RPTSL+VITG G HSRG+AALP AV SF
Sbjct: 361 KVGILRSPSLESFSCVDNEELDK--QWTLSRQRPTSLQVITGRGNHSRGQAALPTAVRSF 420

Query: 570 LSENGYRFEQLRPGTISVRPKFR 580
           L+E+GYRFE+ RPG I+VRPKFR
Sbjct: 421 LNEHGYRFEEARPGVIAVRPKFR 433

BLAST of CmaCh17G013380.1 vs. NCBI nr
Match: gi|590709627|ref|XP_007048605.1| (Smr (Small MutS Related) domain-containing protein, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 347.4 bits (890), Expect = 4.7e-92
Identity = 212/432 (49.07%), Postives = 266/432 (61.57%), Query Frame = 1

Query: 154 RGKSSGWEEFNLKQ-QNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGVNGRPGGSSSSSP 213
           +G+SSGW  F+LKQ Q +G     +  PFPPMP+SL ++ P  NL   N     S SS  
Sbjct: 18  KGESSGWSAFDLKQRQKQGLVPETEDDPFPPMPNSLPAICPCINLAKSNDLSARSFSSVL 77

Query: 214 LPSADSLTLPEN--YGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDI 273
            PS +  T  +N  Y     +G     +G KVVE+  + LA  KLKELH WA+ SLI D+
Sbjct: 78  KPSDNFPTSKQNKDYTKPINMGKPIENDGDKVVEQNNNNLALKKLKELHCWAENSLIEDL 137

Query: 274 MEAVDNNFNEASTFLNKMVSSDNVEICNEMSTLGLHSA-DGLSCNGKNDVTISSERTVNN 333
           + A D + +EAS  L  M+S    E   E     + SA      N   D  IS+ +T   
Sbjct: 138 LLAADGDVHEASALLKGMMSISGTEDIKETKNNEMSSAISDFPGNAYCDREISTGKTAKL 197

Query: 334 PIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIY 393
              SS   + +D    + D       HE   F    N K+ L    S P EPEWEEDD+Y
Sbjct: 198 VCQSSKADEREDNLDKLTDM------HENKLFDGASNMKLILGQLTSIPFEPEWEEDDVY 257

Query: 394 LSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQ 453
           LSHRKDAI MMRSASQHSR A+NA+LR DH +A+ HS  A+E+WLAA+ LNAKAA+EIL+
Sbjct: 258 LSHRKDAIRMMRSASQHSRAASNAFLRGDHVAAQQHSQNAREEWLAAQRLNAKAASEILR 317

Query: 454 TRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKG-FYRVSSLE 513
            RNS+N LWKLDLHGLHAAEAVQAL +HL ++ET   + RS+SP + +        SS+E
Sbjct: 318 IRNSDNDLWKLDLHGLHAAEAVQALHEHLRRLETQVPAGRSVSPNRFKANNRIVHSSSVE 377

Query: 514 YLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLR 573
             S +  KLDK+  S   R RPTSL+VITG+G HSRG+AALP AV SFL ENGYRF++ R
Sbjct: 378 TFSSMD-KLDKQQTS--SRQRPTSLQVITGVGNHSRGQAALPAAVRSFLIENGYRFDEAR 437

Query: 574 PGTISVRPKFRR 581
           PG I+VRPKFRR
Sbjct: 438 PGLITVRPKFRR 440

BLAST of CmaCh17G013380.1 vs. NCBI nr
Match: gi|470132710|ref|XP_004302219.1| (PREDICTED: uncharacterized protein LOC101307600 [Fragaria vesca subsp. vesca])

HSP 1 Score: 335.9 bits (860), Expect = 1.4e-88
Identity = 200/430 (46.51%), Postives = 270/430 (62.79%), Query Frame = 1

Query: 150 VSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGVNGRPGGSSS 209
           +S  + KS GW  F+LKQ+ +G   +ID  PFPP+ S++ SL P    +  +  P    S
Sbjct: 1   MSQQQAKSHGWAAFDLKQRQKGRAPQIDEDPFPPIVSTVKSLHPTVLRN--DEAPRKPFS 60

Query: 210 SSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVD 269
           S  LPS D  +L EN   ++ L D         VE+    +   K+K+ ++WAD SL+ D
Sbjct: 61  SVFLPSVDVSSLAENRNGERSLLDGNSSRKHTPVEDQRSSIK--KIKDHYSWADDSLVED 120

Query: 270 IMEAVDNNFNEASTFLNKMVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNN 329
           IM AVDN+   AS  L  MVS    E+  E S  G+ S+   S + K     SSE   + 
Sbjct: 121 IMAAVDNDITNASNLLKAMVSPSRSEVNKETSISGVDSSTDASLSDK---CFSSESAADI 180

Query: 330 PIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIY 389
              SST++    + +N      +N+ + +   ++ GN K+     +S PIEPEWEEDD+Y
Sbjct: 181 AELSSTIEKC--LEENNIKWSNDNDLYGQKLSNDAGNVKLTTSSLESVPIEPEWEEDDVY 240

Query: 390 LSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQ 449
           L  RKDA+ MMRSASQHS+ A NA+ R DH SA+ +S +A+E+WLAA+ LN +AA EIL 
Sbjct: 241 LRIRKDALRMMRSASQHSKAAANAFGRGDHYSAQQYSIKAREEWLAAESLNNRAAKEILS 300

Query: 450 TRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKGFYRVSSLEY 509
            RNS+N +WKLDLHGLHA+EA++ALQ+HL KIE    SN S  P +A+ +   + SS+E 
Sbjct: 301 IRNSKNDVWKLDLHGLHASEAIRALQEHLEKIERKILSNHSALPNRAKMESNIQHSSVES 360

Query: 510 LSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRP 569
            SC   ++ ++LQ  + R RPTSL+VITGIG HSRG+AALP AV SFLSENGYRFE+LRP
Sbjct: 361 FSCTDTEIVEQLQ--VSRQRPTSLQVITGIGNHSRGQAALPTAVRSFLSENGYRFEELRP 419

Query: 570 GTISVRPKFR 580
           G I+VRPKFR
Sbjct: 421 GAITVRPKFR 419

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KA90_CUCSA2.7e-24874.92Uncharacterized protein OS=Cucumis sativus GN=Csa_7G374690 PE=4 SV=1[more]
F6HGK1_VITVI3.3e-10051.69Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0130g00370 PE=4 SV=... [more]
A0A061DK57_THECC3.3e-9249.07Smr (Small MutS Related) domain-containing protein, putative isoform 1 OS=Theobr... [more]
M5WI32_PRUPE4.6e-8646.59Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006112mg PE=4 SV=1[more]
A0A0D2TIK4_GOSRA5.1e-8546.99Uncharacterized protein OS=Gossypium raimondii GN=B456_012G048200 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G23520.15.6e-7541.65 smr (Small MutS Related) domain-containing protein[more]
Match NameE-valueIdentityDescription
gi|659100734|ref|XP_008451240.1|6.1e-24975.57PREDICTED: uncharacterized protein LOC103492590 [Cucumis melo][more]
gi|449462475|ref|XP_004148966.1|3.9e-24874.92PREDICTED: uncharacterized protein LOC101223137 [Cucumis sativus][more]
gi|225463171|ref|XP_002267329.1|4.7e-10051.69PREDICTED: uncharacterized protein LOC100263151 [Vitis vinifera][more]
gi|590709627|ref|XP_007048605.1|4.7e-9249.07Smr (Small MutS Related) domain-containing protein, putative isoform 1 [Theobrom... [more]
gi|470132710|ref|XP_004302219.1|1.4e-8846.51PREDICTED: uncharacterized protein LOC101307600 [Fragaria vesca subsp. vesca][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR002625Smr_dom
IPR013899DUF1771
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmaCh17G013380CmaCh17G013380gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmaCh17G013380.1CmaCh17G013380.1-proteinpolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmaCh17G013380.1.exon.4CmaCh17G013380.1.exon.4exon
CmaCh17G013380.1.exon.3CmaCh17G013380.1.exon.3exon
CmaCh17G013380.1.exon.2CmaCh17G013380.1.exon.2exon
CmaCh17G013380.1.exon.1CmaCh17G013380.1.exon.1exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmaCh17G013380.1.CDS.1CmaCh17G013380.1.CDS.1CDS
CmaCh17G013380.1.CDS.2CmaCh17G013380.1.CDS.2CDS
CmaCh17G013380.1.CDS.3CmaCh17G013380.1.CDS.3CDS
CmaCh17G013380.1.CDS.4CmaCh17G013380.1.CDS.4CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmaCh17G013380.1.three_prime_UTR.1CmaCh17G013380.1.three_prime_UTR.1three_prime_UTR


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002625Smr domainPFAMPF01713Smrcoord: 460..560
score: 2.
IPR002625Smr domainSMARTSM00463SMR_2coord: 457..577
score: 1.
IPR002625Smr domainPROFILEPS50828SMRcoord: 460..577
score: 18
IPR002625Smr domainunknownSSF160443SMR domain-likecoord: 526..575
score: 1.5
IPR013899Domain of unknown function DUF1771PFAMPF08590DUF1771coord: 389..452
score: 1.8
IPR013899Domain of unknown function DUF1771SMARTSM01162DUF1771_2coord: 388..453
score: 1.6
NoneNo IPR availablePANTHERPTHR13308UNCHARACTERIZEDcoord: 533..580
score: 2.9E-66coord: 339..498
score: 2.9E-66coord: 63..132
score: 2.9