Cp4.1LG12g10490 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG12g10490
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: smr (Small MutS Related) domain-containing protein
Location: Cp4.1LG12 : 9432070 .. 9437043 (+)
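The location field can be turned into a scaffold name, coordinates, strand, and a span length. The sketch below is only an illustration: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this record does not state explicitly.

```python
import re

def parse_location(location: str):
    """Split a 'scaffold : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start: int, end: int) -> int:
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG12 : 9432070 .. 9437043 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 4974 bp on the forward strand of Cp4.1LG12.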
The following sequences are available for this feature:

Gene sequence (with intron)

GGGGAAAAAAGCGCTCGCAAATCTAGAGAGAATCCTGTCGTAGATGGAATAAAGAAATTGAGAGAAAAGCATTCAATCTCCTTCATTCAATTTTTTACTTCATCCCTTTGAAGGAATTCTTAATCGAAGTCAAAGATCAGAGTATGTGGGGTAGGGTTGAAGTTGAACTGTGCCCTTTGAAGATTTCCCGATTTCTTTGTTTCCTTTTCCGCTCTGGAATTCGATCTATGCAGCAAAATTTTCACCCAAATTTCTCTACTTTCTATCTCCGAATTGAATTTCCGTTTCCAAAATCGATTACGAAGGGGCACAAACGGCGGAGTTGGGGATTTGAGATAGTGAGAAGACTCGGTTAGCGGATTTGGGAGTCGAATCTTTTCAACTTCTTGTGTCTCCTGCTCCTTCAGGTTTGAACCCTAATTTTCTCGTTGAAGCTGGAAGGTATTCTTATGGATTTTCTGGTATTTGTTTCTTCTGTTGCTTCCTTTTGGTTCGGTTACTTGTTTAATCTATGTTGATTTCCTGATACGATTGAAGTATTTTTCTTCTGATCGTTGACTTCGTTTTTCCTCCTTTTTTGTTGGGGATTCTTGGATTGAGCGTGTCAGAATGTTAGACCTTTTTATCTTTACTGAAATAAGCTATTTTCGTCGTAATCTTGCTTTGGTTTGTGTCTCTTTTTCGAAGTCTACTGATGATTTGGCCATCGATAGTGGCTTTATATTAATGTTTGATTCCGGTCGTGCTAGAATGGGTCTGTTGAAATTTTCTATATTCCAGGTATGAACAGCTCTGTTTCTTCCAGTCGTGCTACCTCTTCGAGTTGCAGTTTTATTGGATTGTCGGAACTACAAGACATATTCATAGTAGGAGGGGTTTGAGTTAGACCCGGAGTGTGAAATTTTAATTTACTGGTGCTCAACTCCGTATGCTTTATTTTCAGTTCCTTGCATTGGAAGATGTCGTGGGGGAGGGGTAAATCGCCTGGATGGGCAGCGGTTAACCTTAAGCAACAGAATAGTGGCCTTCAAGATGAAATTGACCCGGACCCATTCCCACCAATGTCTACCGCTCTTTCCTTTCTGCCACCCCGTGAAAACGTACACAGAGTTAATGGTCGTTCAGGGAGATCTTTCTCATCGACACCCCTTCCTTCTGCCGATTCTTTAATGTCACCAAAAAATTTTGGTGCAAAAAAGACCATACCTGGTAATTCTAGCATTCGAAGTGGCAAGAAGTTGGTTGAAGAATCCACTGATGTTTTAGCCTTCTGGAAGCTTAAAGAGCTTCATTCTTGGGCTGATATCAGTTTGATTGTGGATATAATGGAAGCTGTAAATAATAACTTCAACGAGGCGTCTAAGTTATTAAAAACAATGGTTTCTAGTGACAATTTTGAGATCAATAATGAGATGAGCACCTTAGGACTGCATTCCTCTAATGATGTATCATTGGTGAGGGGTAAATCTCCTGGCTGGGAAGAATATAACCTTAAGCAACAAAATAGAGGCCTTCAAGATAGAATTGATCCGAAACCATTCCCACCGATGCCAAGTGCCCTTTCCTCTTTGCCACCCCGTGAAAACTTGCACGGAGTTACTGGCCGTCCAGGGAGATCCTCCTCATCTTCACCTCTTCCTTCTGCTGATTCTCTAACTTCGCCAGAAAATTACAGTGCAAAGAAAATACTTGGTGATTCTAGCATTCAAAATGGAAGGAAGGTGGTTGAAGAAACCACTGACGTTTTAGCCTTTTGGAAGCTTAAGGAGCTTCATACTTGGGCTGATTTTAGCTTGATTGTGGATATAATGGAAGCTGTAGATAATAACTTCAATGAGGCATCTACTTATTTAAACAAAATGGTTTCTAGTGACAATGTTGAGATCTGTAACGAGATGAGCACCTTAGGACTGCATTCTGCTGATGGACTATCGTGCTATGGGAAGAATGATGTAACTATATCATTAGGAAGAACTGTTAATAATCCCATCCCTAGTTCC
ACACTAAAGGATGTGCAAGACATGCATCAAAATATTAATGATAAATTGTTTGAAAATAATTATCATGAAAGAAATTTCTTTCACAATGTTGGAAATCCAAAAATAGCTCTTTATTGCTCAAAGTCTGCTCCTATTGAGCCCGAGTGGGAAGAAGATGATATTTACCTGAGCCATCGGAAAGATGCTATAGCAATGATGAGGTAAATAAGTCTTCTGTCTACTTTATTATCGGGTTGTTTTTCTACTTCAATTTAATTTTGTTTCCCTTAATAAATATATTTTTTAAAATTTCTATAACAAAAACTTCTTTTTTTTCCTGCGTTTATCATTGTGATGCACCACTAGGTTTCTCGGGCTCTTTGTATTTATTATATATTTTTACCGTGTGTGAGAAGAACGTTTTAAAGAATTTATATGTATGGGGGATCTCTTCTATAGGTTGTACCTTGAGTTATGAGTAGACAACAGTCTGTTTGTATGAATAAGCTAATTTATGTTGATAGCTCAAGCTTTGGGGATTAGTAGAAAGTCGACGTGTTAAGAGAGAGTTATTGAATCCAAGTCCATGGTAAATTTTTTTTTTAACTCAAATTGACACTGTAATCATACTCAAAAGAAATGTTAAAAAATAATTGTTGGACAATTATTCTTGACCTAGGCTCTCTAGTATTGTGGCTTTGGTTTTATGTTGTAGTGTAAAATATATGGTTCTTGGGAGAAGGTTCAAGTACTCAATGGTATAGGAGTTGATGTCTTGCTTAACATGATCCGATGTTATTTGTAATTTTCTTTAATCAATAGGTCTGCATCTCAACATTCAAGGGCAGCCACTAATGCCTATCTTCGGAAGGATCATGCTTCAGCCAAGTATCATTCATCAAGGGCTCAAGAACAATGGCTAGCTGCAAAAATGTTAAATGCTAAGGCAGCCAATGAAATTTTACAAACAAGGAATAGTGAAAATGGGCTCTGGAAGTTGGACCTACATGGGCTTCATGCAGCAGAGGCTGTTCAAGCCTTGCAAGATCACTTGCTGAAAATCGAAACTCGGAATGCCTCCAATCGGTCGTTGTCGCCAAAGAAAGCTGAAAGGAAGGGTTTCCATCGTGTTTCATCCCTTGAGTATCTTAGTTGTATGGGCGTAAAGTTGGACAAAGAATTACAATCACCATTACCTAGGCATAGGCCGACATCATTGGAAGTCATAACAGGTAAATTTGTACCTGTTGCTGTTTAACTCAACATTTGTGTTTTCTTAAAGATTGGTTGTCTATATTTTTTGTTTTGACGTTTTTGTTCATTCTTTTCACATTTATTTTTCTTGAGAATGGAATCATTGTATGACAATGTAACTGATAACTTTAGAGGTTGACTACGCTTGTGTTGGCTTTAGCCTTTATCAATTCCAGAGATAGATTGGCATTTTCTACATTATTCAGTTCATCAAGATTTGTTCTTGGATTGATACTCCACTATTGATAGGCTGTATGAATTGTGAGGAAAAGTTTAGGAGTCCAGAAATTTCCGAGAGAGGGGAATGAAGTATTCCTTATAAGGGTGTAAAACCTCTCCCTAGTGGACACGTTTTAAAACCTTGAAGGGAAAGCCTAAAGAGGACAATACGTGCTAGCGGTGGGCTTGGGCGGTTACAAATGGTATCAGAGCCAGACACCAGACTGTATGTCAGTGAGGATGTTGGGCTCCCAAGGGGGGTGGATTGTGAGATCCCACATCGGTTGGAGAGGGGAACAAAGCATTCCTCGTAAGGGTGTGGAAACCTCTCCTTAAAACCTTGAGGGGAATCTCGGAAGGGAAAGTACAAAGAGGACTATGAGCTTGTTTCTTTGCATTTATTGCATGCAAGAGTCATTCAAGTAACCTTAATCAAAGTATCTTGTGAAGTGATTCGAATCACGAGTTTAGAAACGAGTTTTCTTTACTCTTGATCTTGATCATCAAGGTAAGTCATTCTAAAACTTTCCCCTAGGTTATTCTTTATC
ATTTTTGGAGTTTTTAGATAATTTAGATCTAAAGATCTTGTTCTTCAGCAAGGTTGTTAGATCAAATATCAAGAGTTTCATAGTGCATCTTACCCATGCGTGTACTTGGAGGAAGGTTCTTGGGGAAGTAATGAGCAATTAGTTAGCTGGGGATGGCCGGGTGGATACACGTTTCGTGACTGTGGGTCCCATTTTGTAATGAAGGACAAAATAGAAATCTGTTTAGCTGAGGCCGGGAGGCTATATATATTAGATATAACAATGAGTTTCATCATAGGCTGATTTGACTGAAGGGAAAGATTTAAATGTCGGTGACGCTAGCATATTTTTTTTTTGTAAAATAACGAATTCTCACGTGCACTACTGATTCAAACATTGACGATTGTACTCATGATGTTTTGTTCGTTCCCAAAGGAATAGGTAAACATAGCAGGGGGGAGGCTGCTCTACCAAAGGCCGTGACAAGTTTTCTTAGTGAAAATGGGTGAGTCTATCGTTTCTTTCCATTTTGACTTGAACCCAAAAGAAGGGTAAAAATGTTTATAGATTCATTCTACTTCAACCACACTTGATAATTTGCAGGTACCGTTTTGAACAGTTGAGGCCTGGAACGATCAGCGTTCGCCCAAAGTTTCGTAGGTAAATGAGTACCCCTTATTAGTAATAGCTTCAAAGTTATTCAGTCAGAACGTAGATTAGAGTCGTTGGAGGTTGGGTAAGTGAAACGTGTAAAATGACTGAAAACGTTCGTTATTTAGAAGGAGCTCGGTAGGATTGATTGTTAACCACATTACTGCATTGTTATTTTGTTGTCTTTCCTGGAAAAGATGGCACCTAGGAAGAAGAGGTTAGAAGAACTCTGGAATCCTGGGATTGTATTCTATTATGATTCATTATGGCGATTGCACTAGTTGAGTCTACTTCTCTAATCTATTGAAAATATAATAAGCTGATTTGCTAATGACTTCATCAAT

mRNA sequence

GGGGAAAAAAGCGCTCGCAAATCTAGAGAGAATCCTGTCGTAGATGGAATAAAGAAATTGAGAGAAAAGCATTCAATCTCCTTCATTCAATTTTTTACTTCATCCCTTTGAAGGAATTCTTAATCGAAGTCAAAGATCAGAGTATGTGGGGTAGGGTTGAAGTTGAACTGTGCCCTTTGAAGATTTCCCGATTTCTTTGTTTCCTTTTCCGCTCTGGAATTCGATCTATGCAGCAAAATTTTCACCCAAATTTCTCTACTTTCTATCTCCGAATTGAATTTCCGTTTCCAAAATCGATTACGAAGGGGCACAAACGGCGGAGTTGGGGATTTGAGATAGTGAGAAGACTCGGTTAGCGGATTTGGGAGTCGAATCTTTTCAACTTCTTGTGTCTCCTGCTCCTTCAGGTATGAACAGCTCTGTTTCTTCCAGTCGTGCTACCTCTTCGAGTTGCAGTTTTATTGGATTGTCGGAACTACAAGACATATTCATAGTAGGAGGGGTTTGAGTTAGACCCGGAGTGTGAAATTTTAATTTACTGGTGCTCAACTCCGTATGCTTTATTTTCAGTTCCTTGCATTGGAAGATGTCGTGGGGGAGGGGTAAATCGCCTGGATGGGCAGCGGTTAACCTTAAGCAACAGAATAGTGGCCTTCAAGATGAAATTGACCCGGACCCATTCCCACCAATGTCTACCGCTCTTTCCTTTCTGCCACCCCGTGAAAACGTACACAGAGTTAATGGTCGTTCAGGGAGATCTTTCTCATCGACACCCCTTCCTTCTGCCGATTCTTTAATGTCACCAAAAAATTTTGGTGCAAAAAAGACCATACCTGGTAATTCTAGCATTCGAAGTGGCAAGAAGTTGGTTGAAGAATCCACTGATGTTTTAGCCTTCTGGAAGCTTAAAGAGCTTCATTCTTGGGCTGATATCAGTTTGATTGTGGATATAATGGAAGCTGTAAATAATAACTTCAACGAGGCGTCTAAGTTATTAAAAACAATGGTTTCTAGTGACAATTTTGAGATCAATAATGAGATGAGCACCTTAGGACTGCATTCCTCTAATGATGTATCATTGGTGAGGGGTAAATCTCCTGGCTGGGAAGAATATAACCTTAAGCAACAAAATAGAGGCCTTCAAGATAGAATTGATCCGAAACCATTCCCACCGATGCCAAGTGCCCTTTCCTCTTTGCCACCCCGTGAAAACTTGCACGGAGTTACTGGCCGTCCAGGGAGATCCTCCTCATCTTCACCTCTTCCTTCTGCTGATTCTCTAACTTCGCCAGAAAATTACAGTGCAAAGAAAATACTTGGTGATTCTAGCATTCAAAATGGAAGGAAGGTGGTTGAAGAAACCACTGACGTTTTAGCCTTTTGGAAGCTTAAGGAGCTTCATACTTGGGCTGATTTTAGCTTGATTGTGGATATAATGGAAGCTGTAGATAATAACTTCAATGAGGCATCTACTTATTTAAACAAAATGGTTTCTAGTGACAATGTTGAGATCTGTAACGAGATGAGCACCTTAGGACTGCATTCTGCTGATGGACTATCGTGCTATGGGAAGAATGATGTAACTATATCATTAGGAAGAACTGTTAATAATCCCATCCCTAGTTCCACACTAAAGGATGTGCAAGACATGCATCAAAATATTAATGATAAATTGTTTGAAAATAATTATCATGAAAGAAATTTCTTTCACAATGTTGGAAATCCAAAAATAGCTCTTTATTGCTCAAAGTCTGCTCCTATTGAGCCCGAGTGGGAAGAAGATGATATTTACCTGAGCCATCGGAAAGATGCTATAGCAATGATGAGGTCTGCATCTCAACATTCAAGGGCAGCCACTAATGCCTATCTTCGGAAGGATCATGCTTCAGCCAAGTATCATTCATCAAGGGCTCAAGAACAATGGCTAGCTGCAAAAATGTTAAATGCTAAGGCAGCCAATGAAATTTTACAAACAAGGAATAGTGAAAATGGGCTCTGGA
AGTTGGACCTACATGGGCTTCATGCAGCAGAGGCTGTTCAAGCCTTGCAAGATCACTTGCTGAAAATCGAAACTCGGAATGCCTCCAATCGGTCGTTGTCGCCAAAGAAAGCTGAAAGGAAGGGTTTCCATCGTGTTTCATCCCTTGAGTATCTTAGTTGTATGGGCGTAAAGTTGGACAAAGAATTACAATCACCATTACCTAGGCATAGGCCGACATCATTGGAAGTCATAACAGGAATAGGTAAACATAGCAGGGGGGAGGCTGCTCTACCAAAGGCCGTGACAAGTTTTCTTAGTGAAAATGGGTACCGTTTTGAACAGTTGAGGCCTGGAACGATCAGCGTTCGCCCAAAGTTTCGTAGGTAAATGAGTACCCCTTATTAGTAATAGCTTCAAAGTTATTCAGTCAGAACGTAGATTAGAGTCGTTGGAGGTTGGGTAAGTGAAACGTGTAAAATGACTGAAAACGTTCGTTATTTAGAAGGAGCTCGGTAGGATTGATTGTTAACCACATTACTGCATTGTTATTTTGTTGTCTTTCCTGGAAAAGATGGCACCTAGGAAGAAGAGGTTAGAAGAACTCTGGAATCCTGGGATTGTATTCTATTATGATTCATTATGGCGATTGCACTAGTTGAGTCTACTTCTCTAATCTATTGAAAATATAATAAGCTGATTTGCTAATGACTTCATCAAT

Coding sequence (CDS)

ATGTCGTGGGGGAGGGGTAAATCGCCTGGATGGGCAGCGGTTAACCTTAAGCAACAGAATAGTGGCCTTCAAGATGAAATTGACCCGGACCCATTCCCACCAATGTCTACCGCTCTTTCCTTTCTGCCACCCCGTGAAAACGTACACAGAGTTAATGGTCGTTCAGGGAGATCTTTCTCATCGACACCCCTTCCTTCTGCCGATTCTTTAATGTCACCAAAAAATTTTGGTGCAAAAAAGACCATACCTGGTAATTCTAGCATTCGAAGTGGCAAGAAGTTGGTTGAAGAATCCACTGATGTTTTAGCCTTCTGGAAGCTTAAAGAGCTTCATTCTTGGGCTGATATCAGTTTGATTGTGGATATAATGGAAGCTGTAAATAATAACTTCAACGAGGCGTCTAAGTTATTAAAAACAATGGTTTCTAGTGACAATTTTGAGATCAATAATGAGATGAGCACCTTAGGACTGCATTCCTCTAATGATGTATCATTGGTGAGGGGTAAATCTCCTGGCTGGGAAGAATATAACCTTAAGCAACAAAATAGAGGCCTTCAAGATAGAATTGATCCGAAACCATTCCCACCGATGCCAAGTGCCCTTTCCTCTTTGCCACCCCGTGAAAACTTGCACGGAGTTACTGGCCGTCCAGGGAGATCCTCCTCATCTTCACCTCTTCCTTCTGCTGATTCTCTAACTTCGCCAGAAAATTACAGTGCAAAGAAAATACTTGGTGATTCTAGCATTCAAAATGGAAGGAAGGTGGTTGAAGAAACCACTGACGTTTTAGCCTTTTGGAAGCTTAAGGAGCTTCATACTTGGGCTGATTTTAGCTTGATTGTGGATATAATGGAAGCTGTAGATAATAACTTCAATGAGGCATCTACTTATTTAAACAAAATGGTTTCTAGTGACAATGTTGAGATCTGTAACGAGATGAGCACCTTAGGACTGCATTCTGCTGATGGACTATCGTGCTATGGGAAGAATGATGTAACTATATCATTAGGAAGAACTGTTAATAATCCCATCCCTAGTTCCACACTAAAGGATGTGCAAGACATGCATCAAAATATTAATGATAAATTGTTTGAAAATAATTATCATGAAAGAAATTTCTTTCACAATGTTGGAAATCCAAAAATAGCTCTTTATTGCTCAAAGTCTGCTCCTATTGAGCCCGAGTGGGAAGAAGATGATATTTACCTGAGCCATCGGAAAGATGCTATAGCAATGATGAGGTCTGCATCTCAACATTCAAGGGCAGCCACTAATGCCTATCTTCGGAAGGATCATGCTTCAGCCAAGTATCATTCATCAAGGGCTCAAGAACAATGGCTAGCTGCAAAAATGTTAAATGCTAAGGCAGCCAATGAAATTTTACAAACAAGGAATAGTGAAAATGGGCTCTGGAAGTTGGACCTACATGGGCTTCATGCAGCAGAGGCTGTTCAAGCCTTGCAAGATCACTTGCTGAAAATCGAAACTCGGAATGCCTCCAATCGGTCGTTGTCGCCAAAGAAAGCTGAAAGGAAGGGTTTCCATCGTGTTTCATCCCTTGAGTATCTTAGTTGTATGGGCGTAAAGTTGGACAAAGAATTACAATCACCATTACCTAGGCATAGGCCGACATCATTGGAAGTCATAACAGGAATAGGTAAACATAGCAGGGGGGAGGCTGCTCTACCAAAGGCCGTGACAAGTTTTCTTAGTGAAAATGGGTACCGTTTTGAACAGTTGAGGCCTGGAACGATCAGCGTTCGCCCAAAGTTTCGTAGGTAA

Protein sequence

MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPKNFGAKKTIPGNSSIRSGKKLVEESTDVLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEYNLKQQNRGLQDRIDPKPFPPMPSALSSLPPRENLHGVTGRPGRSSSSSPLPSADSLTSPENYSAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGLSCYGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNASNRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
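The protein sequence follows from the CDS by reading it codon by codon. As a minimal sketch, assuming the standard nuclear genetic code applies (the record does not name a translation table), the hypothetical `translate` helper below reproduces the first and last residues shown above from the corresponding ends of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 11 codons and last 5 codons of the CDS listed in this record.
print(translate("ATGTCGTGGGGGAGGGGTAAATCGCCTGGATGG"))
print(translate("AAGTTTCGTAGGTAA"))
```

Passing the full CDS string to `translate` should give the complete protein sequence, provided the standard-code assumption holds.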
BLAST of Cp4.1LG12g10490 vs. TrEMBL
Match: A0A0A0KA90_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G374690 PE=4 SV=1)

HSP 1 Score: 895.6 bits (2313), Expect = 3.3e-257
Identity = 471/610 (77.21%), Positives = 512/610 (83.93%), Query Frame = 1

Query: 1   MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFS 60
           MSW RGKS GWAA NLKQQN+GLQDE+D DPFPPMST LS LPPREN+  VNG SG+SFS
Sbjct: 1   MSWVRGKSSGWAAFNLKQQNNGLQDEVDRDPFPPMSTTLSSLPPRENLRGVNGHSGKSFS 60

Query: 61  STPLPSADSLMSPK----------NFGAKKTIPGNSSIRSGKKLVEESTDVLAFWKLKEL 120
             P+PSADS   P           NFGAKKTI G ++I+SGKKLVEE+ DVL+FWKLKEL
Sbjct: 61  LAPIPSADSPTLPVKFGAKKTTLGNFGAKKTILGGTNIQSGKKLVEETNDVLSFWKLKEL 120

Query: 121 HSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKS 180
           H WADISLI+DIMEAVNN+FNEAS LL TMVSSDN EINN+MSTLGLHSSND+  + GKS
Sbjct: 121 HPWADISLIMDIMEAVNNDFNEASTLLNTMVSSDNLEINNKMSTLGLHSSNDLLWMAGKS 180

Query: 181 PGWEEYNLKQQNRGLQDRIDPKPFPPMPSALSSLPPRENLHGVTGRPGRSSSSSPLPSAD 240
           PGWEE+NLKQ N+GLQD +D + FPPM +  SSLPP ENLHGV GR GRS +S PLPS D
Sbjct: 181 PGWEEFNLKQHNKGLQDEMDLEAFPPMLTNRSSLPPYENLHGVYGRSGRSFASEPLPSVD 240

Query: 241 SLTSPENYSAKK-ILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDN 300
           SLTSPENY AK  I  DSSIQ+G+KVVEE TDVLAFWKLKE+H+WADFSLIVDIM+AV+N
Sbjct: 241 SLTSPENYGAKNTIADDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN 300

Query: 301 NFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGLSCYGKNDVTISLGRTVNNPIPSSTL 360
           NF+EAST L  MVSSDN EI NE+STLGLHSA+ L C G NDV+I+  R +N PI SST+
Sbjct: 301 NFDEASTLLKTMVSSDNFEINNEISTLGLHSANDLLCNGNNDVSIASERMINAPILSSTV 360

Query: 361 KDVQDMHQNIND------KLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYL 420
           K VQ +HQN N       KLF N+Y ERN FHN GN KIAL CSKS PIEPEWEEDDIYL
Sbjct: 361 KAVQGIHQNNNTSREDYTKLFANDYFERNSFHNTGNSKIALGCSKSVPIEPEWEEDDIYL 420

Query: 421 SHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQT 480
           SHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRA+EQWLAAKMLN KAANEILQT
Sbjct: 421 SHRKDAIAMMRSASQHSRAATNAYRRKDHASAKYHSSRAEEQWLAAKMLNDKAANEILQT 480

Query: 481 RNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNASNRSLSPKKAERKGFHRVSSLEYL 540
           RNS+NGLWKLDLHGLHAAEAVQAL DHLLKIET+NASNRSLSPKKAERKGF R SSLEYL
Sbjct: 481 RNSKNGLWKLDLHGLHAAEAVQALHDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYL 540

Query: 541 SCMGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPG 594
           SCM  KLDKE  SP  RHRPTSLEVITGIGKHS+GEAALPKAV SFL+ENGYRFEQ RPG
Sbjct: 541 SCMESKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVASFLTENGYRFEQTRPG 600
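The identity and positives percentages in the HSP header are the match counts divided by the alignment length, which includes gap columns. A small sketch of that arithmetic follows; the function name `percent_of_alignment` is hypothetical and the counts are taken from the header of the hit above.

```python
def percent_of_alignment(count: int, alignment_length: int) -> float:
    """Express a match count as a percentage of the alignment length."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)

# Counts reported for the Cucumis sativus hit A0A0A0KA90_CUCSA.
identity = percent_of_alignment(471, 610)
positives = percent_of_alignment(512, 610)
print(f"identity {identity:.2f}%, positives {positives:.2f}%")
```

The same calculation applies to every other HSP header in this record, using that hit's own alignment length as the denominator.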

BLAST of Cp4.1LG12g10490 vs. TrEMBL
Match: F6HGK1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0130g00370 PE=4 SV=1)

HSP 1 Score: 378.3 bits (970), Expect = 1.8e-101
Identity = 231/439 (52.62%), Positives = 294/439 (66.97%), Query Frame = 1

Query: 163 VSLVRGKSPGWEEYNLKQ-QNRGLQDRIDPKPFPPMPSALSSLPPRENLHGVTGRPGRSS 222
           +S   GKSPGW  ++LKQ Q +GL+  +D +P+PP+PS+ +SL P  N     G  GRS 
Sbjct: 1   MSSASGKSPGWAAFDLKQRQKQGLEPELDKEPYPPIPSSFTSLRPCRN-SASNGCSGRSF 60

Query: 223 SSSPLPSADSLTSPENYSAKKIL--GDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSL 282
           SS  +PS +  T  EN   KK +  G+S  +   KV E +  V+AF KLKEL++WAD SL
Sbjct: 61  SSLLVPSVNFPTLEENKDCKKPMQGGNSGNKQQTKVAEVSNLVIAFNKLKELYSWADNSL 120

Query: 283 IVDIMEAVDNNFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGL---SCYGKNDVTISL 342
           I DIM AVDN+ ++AST L  MVS+ + E   E S + L+S  G    +C  + D  + L
Sbjct: 121 IEDIMAAVDNDIDKASTLLGAMVSTGSFEENKETSIVELNSTSGNPYENCKLQADNGVFL 180

Query: 343 GRTVNNPIPSSTLKDVQ-DMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPE 402
           G        SST+ D+  D ++ + D+   +    +N F +  +  + L   KS PIEPE
Sbjct: 181 GNGTVLSELSSTIGDLLIDNNKGLTDECGSSG---KNLFDDAADMTLILGRMKSIPIEPE 240

Query: 403 WEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAK 462
           WEEDD+YLSHRKDAI  MRSASQHSRAATNA+LR DH SAK  S +A+++W+ A+ LN+K
Sbjct: 241 WEEDDVYLSHRKDAIRFMRSASQHSRAATNAFLRGDHVSAKQFSLKAKDEWVKAERLNSK 300

Query: 463 AANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNASNRSLSPKKAERK-GF 522
           AANEIL  RNS N LWKLDLHGLHAAEAVQALQ+HL KIET+   NRS+SP +A+ K G 
Sbjct: 301 AANEILDIRNSNNDLWKLDLHGLHAAEAVQALQEHLWKIETQMPFNRSVSPNRAKTKVGI 360

Query: 523 HRVSSLEYLSCM-GVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSEN 582
            R  SLE  SC+   +LDK  Q  L R RPTSL+VITG G HSRG+AALP AV SFL+E+
Sbjct: 361 LRSPSLESFSCVDNEELDK--QWTLSRQRPTSLQVITGRGNHSRGQAALPTAVRSFLNEH 420

Query: 583 GYRFEQLRPGTISVRPKFR 593
           GYRFE+ RPG I+VRPKFR
Sbjct: 421 GYRFEEARPGVIAVRPKFR 433

BLAST of Cp4.1LG12g10490 vs. TrEMBL
Match: A0A061DK57_THECC (Smr (Small MutS Related) domain-containing protein, putative isoform 1 OS=Theobroma cacao GN=TCM_001646 PE=4 SV=1)

HSP 1 Score: 352.1 bits (902), Expect = 1.4e-93
Identity = 218/435 (50.11%), Positives = 275/435 (63.22%), Query Frame = 1

Query: 167 RGKSPGWEEYNLKQ-QNRGLQDRIDPKPFPPMPSALSSLPPRENLHGVTGRPGRSSSSSP 226
           +G+S GW  ++LKQ Q +GL    +  PFPPMP++L ++ P  NL        RS SS  
Sbjct: 18  KGESSGWSAFDLKQRQKQGLVPETEDDPFPPMPNSLPAICPCINLAKSNDLSARSFSSVL 77

Query: 227 LPSADSLTSPEN--YSAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDI 286
            PS +  TS +N  Y+    +G     +G KVVE+  + LA  KLKELH WA+ SLI D+
Sbjct: 78  KPSDNFPTSKQNKDYTKPINMGKPIENDGDKVVEQNNNNLALKKLKELHCWAENSLIEDL 137

Query: 287 MEAVDNNFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGLSCYGKN---DVTISLGRTV 346
           + A D + +EAS  L  M+S    E   E     + SA  +S +  N   D  IS G+T 
Sbjct: 138 LLAADGDVHEASALLKGMMSISGTEDIKETKNNEMSSA--ISDFPGNAYCDREISTGKTA 197

Query: 347 NNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD 406
                SS   + +D    + D       HE   F    N K+ L    S P EPEWEEDD
Sbjct: 198 KLVCQSSKADEREDNLDKLTDM------HENKLFDGASNMKLILGQLTSIPFEPEWEEDD 257

Query: 407 IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEI 466
           +YLSHRKDAI MMRSASQHSRAA+NA+LR DH +A+ HS  A+E+WLAA+ LNAKAA+EI
Sbjct: 258 VYLSHRKDAIRMMRSASQHSRAASNAFLRGDHVAAQQHSQNAREEWLAAQRLNAKAASEI 317

Query: 467 LQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNASNRSLSPK--KAERKGFHRVS 526
           L+ RNS+N LWKLDLHGLHAAEAVQAL +HL ++ET+  + RS+SP   KA  +  H  S
Sbjct: 318 LRIRNSDNDLWKLDLHGLHAAEAVQALHEHLRRLETQVPAGRSVSPNRFKANNRIVHS-S 377

Query: 527 SLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFE 586
           S+E  S M  KLDK+  S   R RPTSL+VITG+G HSRG+AALP AV SFL ENGYRF+
Sbjct: 378 SVETFSSMD-KLDKQQTS--SRQRPTSLQVITGVGNHSRGQAALPAAVRSFLIENGYRFD 437

Query: 587 QLRPGTISVRPKFRR 594
           + RPG I+VRPKFRR
Sbjct: 438 EARPGLITVRPKFRR 440

BLAST of Cp4.1LG12g10490 vs. TrEMBL
Match: M5WI32_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006112mg PE=4 SV=1)

HSP 1 Score: 327.4 bits (838), Expect = 3.6e-86
Identity = 206/440 (46.82%), Positives = 271/440 (61.59%), Query Frame = 1

Query: 169 KSPGWEEYNLKQ-QNRGLQDRIDPKPFPPMPSALSSLPPRENL---HGVTGRPGRSSSSS 228
           KS GW  ++LKQ Q +GL+ + D   FPP+ + L SL P EN+   + ++GRP     S 
Sbjct: 7   KSGGWAAFDLKQRQKQGLEPQTDTDHFPPILTTLPSLHPCENVSRNNDLSGRP----FSC 66

Query: 229 PLPSADSLTSPENYSAKKIL--GDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVD 288
            L   D  TS EN   K+ L  GDSS   G  + +  +      K+ +L+ WAD SLI D
Sbjct: 67  VLHPVDFPTSTENRDGKRPLLYGDSS---GTSMEDNRSSKK---KIMDLYPWADDSLIED 126

Query: 289 IMEAVDNNFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGLSCYGKNDVTISLGRTVNN 348
           IM AV ++  +AST L  MVS  + E   E           +S    N       +T + 
Sbjct: 127 IMAAVGDDITKASTLLKAMVSPSSFEENKETD---------ISKINSNSDIYQSDKTKHT 186

Query: 349 PIPSSTLKDVQDMHQNINDKLFENN--------YHERNFFHNVGNPKIALYCSKSAPIEP 408
             P  +  D+ D++      L ENN        +  +N  ++    K+ L   +S P+EP
Sbjct: 187 SFPLESAADIADLNSTFEKCLEENNIELLNAHDFCGKNLPNDAATMKLTLGSLESVPVEP 246

Query: 409 EWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNA 468
           EWEEDD+YL HRKDA+ MMRSASQHS+AATNA++R DH SA+ HS++A+E+WLAA+ LN 
Sbjct: 247 EWEEDDVYLRHRKDALRMMRSASQHSKAATNAFVRGDHFSAQRHSNKAREEWLAAESLNN 306

Query: 469 KAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNASNRSLSPKKAE-RKG 528
           KAA +IL  RNS+N +WKLDLHGLHA+EA+QAL++HL +IET+  SN S+SP K    K 
Sbjct: 307 KAAKKILNIRNSKNDVWKLDLHGLHASEAIQALREHLQRIETKVLSNHSVSPNKVRMEKR 366

Query: 529 FHRVSSLEYLSCMGV-KLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSE 588
             R SSLE  +CM   KLD+  Q      RPTSL+VITGIG HSRG+AALP AV SFL++
Sbjct: 367 IIRSSSLESFNCMDTEKLDQ--QKAPSTQRPTSLQVITGIGNHSRGQAALPTAVGSFLND 425

Query: 589 NGYRFEQLRPGTISVRPKFR 593
           NGYRFE+LRPG I+VRPKFR
Sbjct: 427 NGYRFEELRPGVITVRPKFR 425

BLAST of Cp4.1LG12g10490 vs. TrEMBL
Match: A0A067F5N4_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g014250mg PE=4 SV=1)

HSP 1 Score: 325.9 bits (834), Expect = 1.0e-85
Identity = 208/439 (47.38%), Positives = 268/439 (61.05%), Query Frame = 1

Query: 163 VSLVRGKSPGWEEYNLKQ-QNRGLQDRIDPKPFPPMPSALSSLPPRENLHGVTGRPGRSS 222
           +SL R KSPGW  ++LKQ Q +GL    D   +PP+ S L+SL   EN+   T    +  
Sbjct: 1   MSLTRVKSPGWAAFDLKQRQKQGLAPETDKDSYPPISSTLTSLRNCENVSRNTDVLVKPF 60

Query: 223 SSSPLPSADSLTSPENYSAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIV 282
           SS   PS +  T  E         D   ++G K +E+ +  LA  KLK LH+WAD SLI 
Sbjct: 61  SSVLRPSVEFPTLTEENEC-----DYKGKHGHKAIEQHSRDLALKKLKALHSWADNSLIE 120

Query: 283 DIMEAVDNNFNEASTYLNKMVSSDNVEICNEMSTLGLHSA--DGLSCYGKNDVTISLGRT 342
           D+MEAVDN+   AS  L  MVSS      N+ + +   S+  D   CY K      L + 
Sbjct: 121 DLMEAVDNDIKRASNLLEGMVSSSGSAEENKETKIAESSSTIDESPCYRKGGEICFLEKA 180

Query: 343 VNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNP----KIALYCSKSAPIEPE 402
           ++    S+T  D       +ND   E+     +   NV +     K  +    S PIEPE
Sbjct: 181 LDLSNLSTTTGD------GVNDNFIESVDVRASSVINVSDKDDGMKSIMERLSSLPIEPE 240

Query: 403 WEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAK 462
           WEEDD+YL HRKDA+ MMRSASQHS+AA NAYLR DH SA+ HS +A+++WL A+ LN+K
Sbjct: 241 WEEDDVYLVHRKDAMKMMRSASQHSKAANNAYLRGDHFSAQQHSLKARKEWLIAERLNSK 300

Query: 463 AANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNASNRSLSPKKAERK-GF 522
           AA EIL  RNSEN +WKLDLHGLHAAEAVQALQ+ L KIE +   N S+SPKK + K G 
Sbjct: 301 AAKEILGIRNSENDMWKLDLHGLHAAEAVQALQERLQKIEMQRPMNCSVSPKKVKSKNGM 360

Query: 523 HRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENG 582
              +SLE   CM +++  + +S L R    SL+VITGIG HSRG+AALP AV +FLSE+G
Sbjct: 361 VCTASLESFGCMDMEVVDKQRSSL-RQIQKSLQVITGIGNHSRGQAALPTAVKNFLSESG 420

Query: 583 YRFEQLRPGTISVRPKFRR 594
           YRF++ RPG I+VRPKFR+
Sbjct: 421 YRFDEARPGVITVRPKFRQ 427

BLAST of Cp4.1LG12g10490 vs. TAIR10
Match: AT5G23520.1 (AT5G23520.1 smr (Small MutS Related) domain-containing protein)

HSP 1 Score: 275.4 bits (703), Expect = 8.2e-74
Identity = 180/446 (40.36%), Positives = 248/446 (55.61%), Query Frame = 1

Query: 163 VSLVRGKSPGWEEYNLKQ-QNRGLQDRIDPKPFPPMPSALSSLPPRENLHGVTGRPGRSS 222
           +S ++GKS GW  ++LKQ Q +GL+  ++  PFPP+ +++++        GV GR  R+ 
Sbjct: 1   MSWMKGKSSGWTAFDLKQRQKQGLESEVEGDPFPPVSTSVNAS------FGVRGRLRRNH 60

Query: 223 SSSPLPSADSLTSPENYSAKKILGDSSIQNGRKVVEETTDVL---------AFWKLKELH 282
             S    +  L  P  + A     D   Q          D L         AF KLKE++
Sbjct: 61  EPSEKSFSSVLLPPSRFPALTENKDCGNQERGGCCRRKPDTLSLPVNSHDLAFTKLKEMN 120

Query: 283 TWADFSLIVDIMEAVDNNFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGLSCYGKNDV 342
           +WAD +LI D++ + +++F  A  +L  MVSS   +        G  S +  S Y   + 
Sbjct: 121 SWADDNLIRDVLLSTEDDFEMALAFLKGMVSSGKEDEEPTSKIEGYSSDNRRSEYRTFEK 180

Query: 343 TISLGRTVNNPIPSSTLKDV--QDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSA 402
           T++    +      ST +D    D+  +       N      F  ++      +   +S 
Sbjct: 181 TVTSSVKM---AARSTFEDAGKYDLENSDGSSFLVNASDNEKFPDDISELDSIIQRLQSI 240

Query: 403 PIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAK 462
           PIEPEWEEDD+YLSHRKDA+ +MRSAS HSRAA NA+ R DHASAK HS +A+E WLAA+
Sbjct: 241 PIEPEWEEDDLYLSHRKDALKVMRSASNHSRAAQNAFQRYDHASAKQHSDKAREDWLAAE 300

Query: 463 MLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNASNRSLSPKKAE 522
            LNA+AA +I+   N +N +WKLDLHGLHA EAVQALQ+ L  IE     NRS+SP +  
Sbjct: 301 KLNAEAAKKIIGITNKDNDIWKLDLHGLHATEAVQALQERLQMIEGHFTVNRSVSPNRGR 360

Query: 523 RKGFH-RVSSLEYLSCMGVKLDKE---LQSPLPRHRPTSLEVITGIGKHSRGEAALPKAV 582
            K    R +S E       +LD+E    Q    R    SL+VITGIGKHSRG+A+LP AV
Sbjct: 361 SKNAALRSASQEPFG----RLDEEGMHCQRTSSRELRNSLQVITGIGKHSRGQASLPLAV 420

Query: 583 TSFLSENGYRFEQLRPGTISVRPKFR 593
            +F  +N YRF++ RPG I+VRPKFR
Sbjct: 421 KTFFEDNRYRFDETRPGVITVRPKFR 433

BLAST of Cp4.1LG12g10490 vs. NCBI nr
Match: gi|659100734|ref|XP_008451240.1| (PREDICTED: uncharacterized protein LOC103492590 [Cucumis melo])

HSP 1 Score: 897.9 bits (2319), Expect = 9.6e-258
Identity = 474/610 (77.70%), Positives = 510/610 (83.61%), Query Frame = 1

Query: 1   MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFS 60
           MSW RGKS GWAA NLKQQN+G+QDE+D DPFPPMST LS LPPREN+  VNGRSGRSFS
Sbjct: 1   MSWVRGKSSGWAAFNLKQQNNGIQDEVDGDPFPPMSTTLSSLPPRENLRGVNGRSGRSFS 60

Query: 61  STPLPSADSLMSPK----------NFGAKKTIPGNSSIRSGKKLVEESTDVLAFWKLKEL 120
             P+PSADS   P           NF AKKTI G S+I+SGKK+VEE+ DVL+FWKLKEL
Sbjct: 61  FAPIPSADSPTLPGKCGAKKTTLGNFSAKKTILGASNIQSGKKMVEETNDVLSFWKLKEL 120

Query: 121 HSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKS 180
           H WADISLI+DIMEAVNN+FNEAS LL TMVSSDN EINNEMS LGLHSSND+S + GKS
Sbjct: 121 HPWADISLIMDIMEAVNNDFNEASTLLNTMVSSDNLEINNEMSNLGLHSSNDLSWMMGKS 180

Query: 181 PGWEEYNLKQQNRGLQDRIDPKPFPPMPSALSSLPPRENLHGVTGRPGRSSSSSPLPSAD 240
           PGWEE+NL+Q NRGLQ   DP+ FPPM +   SLPP ENLHGV GR GRS +S PLPSAD
Sbjct: 181 PGWEEFNLQQHNRGLQGEKDPEAFPPMLTNHPSLPPYENLHGVYGRLGRSFASEPLPSAD 240

Query: 241 SLTSPENYSAKKIL-GDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDN 300
           SLTSP NY AK  +  DS IQ+G+KVVEE TDVLAFWKLKE+H+WADFSLIVDIM+AV+N
Sbjct: 241 SLTSPGNYGAKNTIPDDSGIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN 300

Query: 301 NFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGLSCYGKNDVTISLGRTVNNPIPSSTL 360
           NF+EAST L  MVSSDN EI NE+STLGLHSA+ L C G NDV+IS  RT+N PI S TL
Sbjct: 301 NFDEASTLLKTMVSSDNFEINNEISTLGLHSANDLLCNGDNDVSISSERTINGPILSPTL 360

Query: 361 KDVQDMHQNIND------KLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYL 420
           K  Q MHQN N       KLF N+Y ERNFF N GN KIAL CSKS PIEPEWEEDDIYL
Sbjct: 361 KAAQGMHQNDNTGGEDCTKLFVNDYFERNFFPNAGNSKIALGCSKSVPIEPEWEEDDIYL 420

Query: 421 SHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQT 480
           SHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRAQEQWLAAKMLN KAANEILQT
Sbjct: 421 SHRKDAIAMMRSASQHSRAATNAYRRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILQT 480

Query: 481 RNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNASNRSLSPKKAERKGFHRVSSLEYL 540
           RNS+NGLWKLDLHGLHAAEAVQALQDHLLKIET+NASNRSLSPKKAERKGF R SSLEYL
Sbjct: 481 RNSKNGLWKLDLHGLHAAEAVQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYL 540

Query: 541 SCMGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPG 594
           SCM  KLDKE  SP  RHRPTSLEVITGIGKHS+GEAALPKAVTSFL+ENGYRFEQ RPG
Sbjct: 541 SCMDAKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQTRPG 600

BLAST of Cp4.1LG12g10490 vs. NCBI nr
Match: gi|449462475|ref|XP_004148966.1| (PREDICTED: uncharacterized protein LOC101223137 [Cucumis sativus])

HSP 1 Score: 895.6 bits (2313), Expect = 4.7e-257
Identity = 471/610 (77.21%), Positives = 512/610 (83.93%), Query Frame = 1

Query: 1   MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFS 60
           MSW RGKS GWAA NLKQQN+GLQDE+D DPFPPMST LS LPPREN+  VNG SG+SFS
Sbjct: 1   MSWVRGKSSGWAAFNLKQQNNGLQDEVDRDPFPPMSTTLSSLPPRENLRGVNGHSGKSFS 60

Query: 61  STPLPSADSLMSPK----------NFGAKKTIPGNSSIRSGKKLVEESTDVLAFWKLKEL 120
             P+PSADS   P           NFGAKKTI G ++I+SGKKLVEE+ DVL+FWKLKEL
Sbjct: 61  LAPIPSADSPTLPVKFGAKKTTLGNFGAKKTILGGTNIQSGKKLVEETNDVLSFWKLKEL 120

Query: 121 HSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKS 180
           H WADISLI+DIMEAVNN+FNEAS LL TMVSSDN EINN+MSTLGLHSSND+  + GKS
Sbjct: 121 HPWADISLIMDIMEAVNNDFNEASTLLNTMVSSDNLEINNKMSTLGLHSSNDLLWMAGKS 180

Query: 181 PGWEEYNLKQQNRGLQDRIDPKPFPPMPSALSSLPPRENLHGVTGRPGRSSSSSPLPSAD 240
           PGWEE+NLKQ N+GLQD +D + FPPM +  SSLPP ENLHGV GR GRS +S PLPS D
Sbjct: 181 PGWEEFNLKQHNKGLQDEMDLEAFPPMLTNRSSLPPYENLHGVYGRSGRSFASEPLPSVD 240

Query: 241 SLTSPENYSAKK-ILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDN 300
           SLTSPENY AK  I  DSSIQ+G+KVVEE TDVLAFWKLKE+H+WADFSLIVDIM+AV+N
Sbjct: 241 SLTSPENYGAKNTIADDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN 300

Query: 301 NFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGLSCYGKNDVTISLGRTVNNPIPSSTL 360
           NF+EAST L  MVSSDN EI NE+STLGLHSA+ L C G NDV+I+  R +N PI SST+
Sbjct: 301 NFDEASTLLKTMVSSDNFEINNEISTLGLHSANDLLCNGNNDVSIASERMINAPILSSTV 360

Query: 361 KDVQDMHQNIND------KLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYL 420
           K VQ +HQN N       KLF N+Y ERN FHN GN KIAL CSKS PIEPEWEEDDIYL
Sbjct: 361 KAVQGIHQNNNTSREDYTKLFANDYFERNSFHNTGNSKIALGCSKSVPIEPEWEEDDIYL 420

Query: 421 SHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQT 480
           SHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRA+EQWLAAKMLN KAANEILQT
Sbjct: 421 SHRKDAIAMMRSASQHSRAATNAYRRKDHASAKYHSSRAEEQWLAAKMLNDKAANEILQT 480

Query: 481 RNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNASNRSLSPKKAERKGFHRVSSLEYL 540
           RNS+NGLWKLDLHGLHAAEAVQAL DHLLKIET+NASNRSLSPKKAERKGF R SSLEYL
Sbjct: 481 RNSKNGLWKLDLHGLHAAEAVQALHDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYL 540

Query: 541 SCMGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPG 594
           SCM  KLDKE  SP  RHRPTSLEVITGIGKHS+GEAALPKAV SFL+ENGYRFEQ RPG
Sbjct: 541 SCMESKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVASFLTENGYRFEQTRPG 600

BLAST of Cp4.1LG12g10490 vs. NCBI nr
Match: gi|225463171|ref|XP_002267329.1| (PREDICTED: uncharacterized protein LOC100263151 [Vitis vinifera])

HSP 1 Score: 378.3 bits (970), Expect = 2.5e-101
Identity = 231/439 (52.62%), Positives = 294/439 (66.97%), Query Frame = 1

Query: 163 VSLVRGKSPGWEEYNLKQ-QNRGLQDRIDPKPFPPMPSALSSLPPRENLHGVTGRPGRSS 222
           +S   GKSPGW  ++LKQ Q +GL+  +D +P+PP+PS+ +SL P  N     G  GRS 
Sbjct: 1   MSSASGKSPGWAAFDLKQRQKQGLEPELDKEPYPPIPSSFTSLRPCRN-SASNGCSGRSF 60

Query: 223 SSSPLPSADSLTSPENYSAKKIL--GDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSL 282
           SS  +PS +  T  EN   KK +  G+S  +   KV E +  V+AF KLKEL++WAD SL
Sbjct: 61  SSLLVPSVNFPTLEENKDCKKPMQGGNSGNKQQTKVAEVSNLVIAFNKLKELYSWADNSL 120

Query: 283 IVDIMEAVDNNFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGL---SCYGKNDVTISL 342
           I DIM AVDN+ ++AST L  MVS+ + E   E S + L+S  G    +C  + D  + L
Sbjct: 121 IEDIMAAVDNDIDKASTLLGAMVSTGSFEENKETSIVELNSTSGNPYENCKLQADNGVFL 180

Query: 343 GRTVNNPIPSSTLKDVQ-DMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPE 402
           G        SST+ D+  D ++ + D+   +    +N F +  +  + L   KS PIEPE
Sbjct: 181 GNGTVLSELSSTIGDLLIDNNKGLTDECGSSG---KNLFDDAADMTLILGRMKSIPIEPE 240

Query: 403 WEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAK 462
           WEEDD+YLSHRKDAI  MRSASQHSRAATNA+LR DH SAK  S +A+++W+ A+ LN+K
Sbjct: 241 WEEDDVYLSHRKDAIRFMRSASQHSRAATNAFLRGDHVSAKQFSLKAKDEWVKAERLNSK 300

Query: 463 AANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNASNRSLSPKKAERK-GF 522
           AANEIL  RNS N LWKLDLHGLHAAEAVQALQ+HL KIET+   NRS+SP +A+ K G 
Sbjct: 301 AANEILDIRNSNNDLWKLDLHGLHAAEAVQALQEHLWKIETQMPFNRSVSPNRAKTKVGI 360

Query: 523 HRVSSLEYLSCM-GVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSEN 582
            R  SLE  SC+   +LDK  Q  L R RPTSL+VITG G HSRG+AALP AV SFL+E+
Sbjct: 361 LRSPSLESFSCVDNEELDK--QWTLSRQRPTSLQVITGRGNHSRGQAALPTAVRSFLNEH 420

Query: 583 GYRFEQLRPGTISVRPKFR 593
           GYRFE+ RPG I+VRPKFR
Sbjct: 421 GYRFEEARPGVIAVRPKFR 433

BLAST of Cp4.1LG12g10490 vs. NCBI nr
Match: gi|590709627|ref|XP_007048605.1| (Smr (Small MutS Related) domain-containing protein, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 352.1 bits (902), Expect = 2.0e-93
Identity = 218/435 (50.11%), Positives = 275/435 (63.22%), Query Frame = 1

Query: 167 RGKSPGWEEYNLKQ-QNRGLQDRIDPKPFPPMPSALSSLPPRENLHGVTGRPGRSSSSSP 226
           +G+S GW  ++LKQ Q +GL    +  PFPPMP++L ++ P  NL        RS SS  
Sbjct: 18  KGESSGWSAFDLKQRQKQGLVPETEDDPFPPMPNSLPAICPCINLAKSNDLSARSFSSVL 77

Query: 227 LPSADSLTSPEN--YSAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDI 286
            PS +  TS +N  Y+    +G     +G KVVE+  + LA  KLKELH WA+ SLI D+
Sbjct: 78  KPSDNFPTSKQNKDYTKPINMGKPIENDGDKVVEQNNNNLALKKLKELHCWAENSLIEDL 137

Query: 287 MEAVDNNFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGLSCYGKN---DVTISLGRTV 346
           + A D + +EAS  L  M+S    E   E     + SA  +S +  N   D  IS G+T 
Sbjct: 138 LLAADGDVHEASALLKGMMSISGTEDIKETKNNEMSSA--ISDFPGNAYCDREISTGKTA 197

Query: 347 NNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD 406
                SS   + +D    + D       HE   F    N K+ L    S P EPEWEEDD
Sbjct: 198 KLVCQSSKADEREDNLDKLTDM------HENKLFDGASNMKLILGQLTSIPFEPEWEEDD 257

Query: 407 IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEI 466
           +YLSHRKDAI MMRSASQHSRAA+NA+LR DH +A+ HS  A+E+WLAA+ LNAKAA+EI
Sbjct: 258 VYLSHRKDAIRMMRSASQHSRAASNAFLRGDHVAAQQHSQNAREEWLAAQRLNAKAASEI 317

Query: 467 LQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNASNRSLSPK--KAERKGFHRVS 526
           L+ RNS+N LWKLDLHGLHAAEAVQAL +HL ++ET+  + RS+SP   KA  +  H  S
Sbjct: 318 LRIRNSDNDLWKLDLHGLHAAEAVQALHEHLRRLETQVPAGRSVSPNRFKANNRIVHS-S 377

Query: 527 SLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFE 586
           S+E  S M  KLDK+  S   R RPTSL+VITG+G HSRG+AALP AV SFL ENGYRF+
Sbjct: 378 SVETFSSMD-KLDKQQTS--SRQRPTSLQVITGVGNHSRGQAALPAAVRSFLIENGYRFD 437

Query: 587 QLRPGTISVRPKFRR 594
           + RPG I+VRPKFRR
Sbjct: 438 EARPGLITVRPKFRR 440

BLAST of Cp4.1LG12g10490 vs. NCBI nr
Match: gi|645219001|ref|XP_008233514.1| (PREDICTED: uncharacterized protein LOC103332547 isoform X1 [Prunus mume])

HSP 1 Score: 331.6 bits (849), Expect = 2.7e-87
Identity = 209/460 (45.43%), Positives = 278/460 (60.43%), Query Frame = 1

Query: 152 MSTLGLHSSNDVSLVRGKSPGWEEYNLKQ-QNRGLQDRIDPKPFPPMPSALSSLPPRENL 211
           M+   ++    +S  R KS GW  ++LKQ Q +GL+ + D   FPP+ + L SL P EN+
Sbjct: 1   MTVASIYLCRKMSRGRAKSGGWAAFDLKQRQKQGLEPQTDTDHFPPILTTLPSLHPCENV 60

Query: 212 ---HGVTGRPGRSSSSSPLPSADSLTSPENYSAKKIL--GDSS---IQNGRKVVEETTDV 271
              + ++GRP     S  L   D  TS EN   K++L  GDSS   +++ R   +     
Sbjct: 61  SRNNDLSGRP----FSCVLHPVDFPTSTENRDGKRLLLYGDSSGTSMEDNRSTKK----- 120

Query: 272 LAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNKMVSSDNVEICNEMSTLGLHSAD 331
               K+ +L+ WAD SLI DIM AV ++  +AST L  MVS  + E   E          
Sbjct: 121 ----KIMDLYPWADDSLIEDIMAAVGDDITKASTLLKAMVSPSSFEENKETD-------- 180

Query: 332 GLSCYGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFF-------- 391
            +S    N       +T +   P  +  D+ D++      L ENN    N          
Sbjct: 181 -ISKINSNSDIYQSDKTKHTSFPLESAADIADLNSTFEKCLEENNIELLNAHDFCGKKLP 240

Query: 392 HNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHAS 451
           ++    K+ L   +S P+EPEWEEDD+YL HRKDA+ MMRSASQHS+AATNA++R DH S
Sbjct: 241 NDAATTKLTLGSLESVPVEPEWEEDDVYLRHRKDALRMMRSASQHSKAATNAFVRGDHFS 300

Query: 452 AKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKI 511
           A+ HS++A+E+WLAA+ LN KAA +IL  RNS+N +WKLDLHGLHA+EA+QAL++HL +I
Sbjct: 301 AQRHSNKAREEWLAAESLNNKAAKKILNIRNSKNDVWKLDLHGLHASEAIQALREHLQRI 360

Query: 512 ETRNASNRSLSPKKAE-RKGFHRVSSLEYLSCMGV-KLDKELQSPLPRHRPTSLEVITGI 571
           ET+  SN S+SP K    K   R SSLE  +CM   KLD+  Q      RPTSL+VITGI
Sbjct: 361 ETKVLSNHSVSPNKVRMEKRIIRSSSLESFNCMDTEKLDQ--QKAPSTQRPTSLQVITGI 420

Query: 572 GKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFR 593
           G HSRG+AALP AV SFL++NGYRFE+LRPG I+VRPKFR
Sbjct: 421 GNHSRGQAALPTAVRSFLNDNGYRFEELRPGVITVRPKFR 436
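
The percentages in an HSP summary follow directly from the reported counts, so they can be recomputed as a sanity check. The following is a minimal Python sketch, not part of the database record: the function name `parse_hsp_summary` and both regular expressions are illustrative assumptions about the text layout shown in this report, and the pattern deliberately tolerates the "Postives" spelling found in some exports.

```python
import re

# Hypothetical patterns for the two HSP summary lines of a text BLAST report.
HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)")
# "Posi?tives" accepts both "Positives" and the misspelt "Postives".
HSP_COUNTS = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_summary(text):
    """Return bit score, raw score, expect value and recomputed percentages."""
    m = HSP_SCORE.search(text)
    if m is None:
        raise ValueError("no HSP score line found")
    summary = {
        "bits": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "expect": float(m.group(3)),
    }
    for label, hits, length, pct in HSP_COUNTS.findall(text):
        key = "identity" if label == "Identity" else "positives"
        summary[key] = {
            "hits": int(hits),
            "length": int(length),
            "reported_pct": float(pct),
            # Percentage recomputed from the counts, rounded like the report.
            "computed_pct": round(100 * int(hits) / int(length), 2),
        }
    return summary


# Summary lines of the Prunus mume hit (XP_008233514.1) from this record.
sample = """HSP 1 Score: 331.6 bits (849), Expect = 2.7e-87
Identity = 209/460 (45.43%), Positives = 278/460 (60.43%), Query Frame = 1"""

result = parse_hsp_summary(sample)
print(result["identity"]["computed_pct"], result["positives"]["computed_pct"])
```

For this hit the recomputed values agree with the reported ones: 209/460 gives 45.43% identity and 278/460 gives 60.43% positives.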

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
A0A0A0KA90_CUCSA  3.3e-257  77.21     Uncharacterized protein OS=Cucumis sativus GN=Csa_7G374690 PE=4 SV=1
F6HGK1_VITVI      1.8e-101  52.62     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0130g00370 PE=4 SV=...
A0A061DK57_THECC  1.4e-93   50.11     Smr (Small MutS Related) domain-containing protein, putative isoform 1 OS=Theobr...
M5WI32_PRUPE      3.6e-86   46.82     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006112mg PE=4 SV=1
A0A067F5N4_CITSI  1.0e-85   47.38     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g014250mg PE=4 SV=1

Match Name   E-value  Identity  Description
AT5G23520.1  8.2e-74  40.36     smr (Small MutS Related) domain-containing protein

Match Name                        E-value   Identity  Description
gi|659100734|ref|XP_008451240.1|  9.6e-258  77.70     PREDICTED: uncharacterized protein LOC103492590 [Cucumis melo]
gi|449462475|ref|XP_004148966.1|  4.7e-257  77.21     PREDICTED: uncharacterized protein LOC101223137 [Cucumis sativus]
gi|225463171|ref|XP_002267329.1|  2.5e-101  52.62     PREDICTED: uncharacterized protein LOC100263151 [Vitis vinifera]
gi|590709627|ref|XP_007048605.1|  2.0e-93   50.11     Smr (Small MutS Related) domain-containing protein, putative isoform 1 [Theobrom...
gi|645219001|ref|XP_008233514.1|  2.7e-87   45.43     PREDICTED: uncharacterized protein LOC103332547 isoform X1 [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR013899  DUF1771
IPR002625  Smr_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG12g10490.1  Cp4.1LG12g10490.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                     Source   Source Term  Source Description  Alignment
IPR002625  Smr domain                          PFAM     PF01713      Smr                 coord: 473..573, score: 2.
IPR002625  Smr domain                          SMART    SM00463      SMR_2               coord: 470..590, score: 2.6
IPR002625  Smr domain                          PROFILE  PS50828      SMR                 coord: 473..590, score: 20
IPR002625  Smr domain                          unknown  SSF160443    SMR domain-like     coord: 542..588, score: 1.83E-10; coord: 471..502, score: 1.83
IPR013899  Domain of unknown function DUF1771  PFAM     PF08590      DUF1771             coord: 402..465, score: 1.3
IPR013899  Domain of unknown function DUF1771  SMART    SM01162      DUF1771_2           coord: 401..466, score: 3.8
None       No IPR available                    PANTHER  PTHR13308    UNCHARACTERIZED     coord: 58..145, score: 6.9E-69; coord: 352..511, score: 6.9E-69; coord: 546..593, score: 6.9