CmaCh04G003920 (gene) Cucurbita maxima (Rimu)

Name: CmaCh04G003920
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Dentin sialophosphoprotein-related, putative isoform 5
Location: Cma_Chr04 : 1985016 .. 1992589 (-)
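
As a quick sanity check on the location field, the following sketch parses a coordinate string of this form and computes the span of the feature. The helper name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse a string like 'Cma_Chr04 : 1985016 .. 1992589 (-)' (hypothetical helper)."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so span = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr04 : 1985016 .. 1992589 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the span is 7574 bp; the `(-)` indicates that the feature is annotated on the reverse strand.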
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, exon, CDS
GGAGAATAGTAAAGTATTCGTCCACCCAAAAAGAAAAAAAAAAAGAAACTCTCGAAGTCGAAGCCTTCCTGGAGCTCTCTTCAACCAGACTGAAGAACCCGCCTAGGGCTCACCTTTGCTTTTTTGTTTTTGTCTTTGCTGGGATTTTTTTCTTCGATCCCCAATCGTTGAATATCTGGATTGCTTGGGATTTTCTGAGCGGGATTTTTGGGGTCCAGGTGAGTTTTTTTTTTTTTTTTTTTTTTTTAATAATTTTTTTTGGTGGATCGCTACTGGTTCTTCTGATTTTTGCTGTAGTTCTTCTTCTTCTTTTCGTGATTCTATTTTCAGCTGTTTGGTTGCTGAGAAAGTGGGTTGGAAATTGGAATTGAAAAGGGTTTTCTGAAATTGTTTGATGAGATTTGGTTTCACTGAAAGGGAGTAAAATTGTTGGGAGTTAATGAATTTTTGCTTTGGTTTATGGAGTTTTTTTAGTGGTGGCTCGATTTTTTTAGCTGTTTGGTTGCTGAGAAAGTGGGTTGGAATTCAAAAGGGTTTTCTGGAATTGTTTGATGGGAGTTGGTTTCACTGAAAGGATGTGAAATGGTTGAGTTTCTGATGGATTAATTGAAAGAGGGGATGTTTCTTGTTTGAAACTGTATTTTTATTTTTGTGGCTGTGATTCCTTATAATATTCAACCTTTTTCACTGTATATGGTTCATGAATTCATTTTCTTTTTGTTTTCTCTGGAAACCCATTGAATCAATATGCCTTTTAAGATCACTTTTGGCTTAACTTGCTGGAAGGGAAATAATAGATCTAAGAAGAGCAAAATTTAATTGTTTGGCGTGTTTGTTTGACCTGTACTGCAAACGGAAAAAAACTTCCTGAAGAAGAAATAATAAAATTCCTCAAATAGCATGCCCCTTTCCTGAAAAAATGATATTTATTACTAGAATTAGTGAAAGAAATCTTACTAGCTTTCTTCTTCACATTTTCTGCTGTTTTCCCCTTTACAGTCTCCATAAGTAAGAGCCTGAGAAAGTGTTGTTTCATCTTTCTCTTGTTCTTAATTCAGATTTACTGCTGGGCAAGTTCGTGATTGAGACATGGGATTGTATCAAACAGTTTTATGCTTCTTAAATGGTGGATAAAGAATCTTGAAGGAATGTCTGACTTGTGTATGTACAAGGTATAGTTTCATTTTCCCATTTTTCTCTTCTTCTGATGATAAGTTTGAACTAATGTTGATTTTTCCCCCTAGTAGAGATGCTGAACATAGATTGAGAGTGTTAATTCTCGAAAACAAGTTTAGCAAACTGATATGAATCCTCAATCCTTTTGTTAGTGAGATGGTTTTGGAAGGCAAGAGTCCTTAAATAGCAAAATTATTTATGTGGACGCTTCACTGAACAGCGTTAACACTCAACACGATAGGTTTTCTTATGTATACGAGGAGTGAAGAGACCCTTCGTAATAAGTTTTTTGCACTGTCAGCTTGCCTCTAAAGCTTGGGATTTTATGTATCAAACATTTGGGTCGAGTTGGTGTCCTCGTAAGGATGCTGAGGAATGGCTCCCGAAGTTGCTTTTGGATTGGTGGTTTAGGGATAAAGCAAGAATGCTTTTGTTACATGCTATTAGAGCTCACTTGGAAGGAAAGGAATGAGAAGCTTTTCAAAGATACGTTAAATGCCATTCACAAACCTTTTTGTAATTACTCTTATCTTTTCTATCAACTTAGATTGGAGGCATTTTGTTTGATCCTCTTAGGCATGAGAATATCTTGTAGATTGGTTTGATCGTGGGAGCACGAGTTTTAGATGTGTAATCTTTACCCTTTTGTTGAGTTGATCATTTATTGGCTCAAAAACCTTGACTAATTTGGTGAATGACAATCTTATTCCTTTGGCCAATTTTTTGTTTTAGCTATACCACTACTCACCTCATTTAGGTTGAGTACTAGCAAAAATACCTTATCTTTTGTTTATTATGGTCTTCATCATAGATTTAGACATTTA
TTTGATACTAACAATGCTAGGTTCCTCTAAAGAGCTATTGATCAATGTTACACCTTGAAGTTCATATTGATTTATTTTTTATCCATGCATCTGAAGTATTGCAAATACTATCTCTGTTACATGAAAGAGCTCCAAATTTACCACCAGATGATCGATCTTATGCCATTGATAGGATTCAAATGATTTCTTCTCTAATCTAGGATTCAAATGATTTCTTCTCTAATCTAGGATTCAAATGATATCGTCTCTAATCTTTTCACTTAGACCGAGCATCCAACTTGGCTTTCTATTATTATATCTTCTATTTGTGTGTGATTCTTTACTTGTCCTTATAATAATCTTGGCCGATGTAGTTTGCGTGTGTATTCTACTTATATGCATTTGGGTAGAGGACCAAATTGTTGATTCAAAGCAACTTGTAGATTTGCTTAGCTGTAGCCCTTTTCTGTTACCATTGTGTCTCAGAGGTATGTATCTGTTTTGCATAGCCAATTTGGCTTCCAACTGAAGACCCCTTTTTTCTAATTCCAAATCTAGACCCATCCATATAATATCATCATATTAACAGGACTTTTGCAATTCAAGTTCAAGCTAACGACCATGAAATGGATGCCGTAGTATAAGAAGATATCACTGGTATTTGCTGATCTTGTGGTCGGTCATAAGTTGGTGTAGCTGATGTTGCCATTGGGGTGAAGGCGACATCACTAGTAACTGGTATTCTGAAATCCTTAAATTTGCTTCATTCTGAATCAACATGTCTCTGCATATTGTGTTTTACAGTTAAATGGAGATTGATTTTAGCTTCTTGATCCACATTGGAGGCAGAAAGCTTCCACTCTAACGATCTCTTATAGATGTGCACTCTATAGATACAAGAAGCTAATGGTCAATAGTCTATGCAAAGCTCTAGCTTAAATGGGCTCTGTATTATTCTTTGTTTTCCAATAAGTATCTAACTATTGGTTAGAAAAAATCTATCTCGAGAAGTCAACAATTAAATCCAAGATTAAAACTATTGACTAAGGGCGGCTATGCGTCATTTTTAATTTTCTGAAAGAAAAGAAGCAATAATTTGAGCACTGTGTGAAATCCTCCTGCCCTATTATAACTGATAAACATGATGTTTGAGATGATCGGCGAGGATTTATGCACTTAGCAATCATTTTTCAGTTGCAATCTCTAATCCTGTATATTCTGTGGAATCAATGATTTATGGATGGATGTTATATAACAATATACTTCACTATTTCTTTCTATTCTTTTATTTATCTATTAGATTGAAAGGTCTCTTGCCATTGTGATTTTCTTTGTTTAACTCATTTAAGATTTTAACTCTATTTTGGTCCTCAAACATTTATATTAATTTTAGTCCTAAAACTTTTGAGTGCTCTACTTTATTCTCTAAACTTTCAAATATTTTGTTTTCAATCTTAAATTGGATGATCCAAAATTCATTTTGAGACAATTGACATCTAATTTGTTACACATAGCTTTAATTACTTGTCTACATAAACATCTACTCATAATGTAAATGCGTGAAATATGTCACATCTGGTTTCTACGAAATAACTTCTTTTGAAAAAAATTGATGACAAGGACTTAAATGATTTTTAAAAGTTTGAGAATTGAATTTGAGCATTTGAAATTTAAAAACTACAGCGGCACAAGCATAAATTTTTGGGGCCAAAATAGAATGTAAACCAACAATAGCTACTTCATAGATGTATTTATGCCCTGTGGTTTATGTTGTTTGCATGTAGCATTGGTTGTTTGGGTTTAGGGAGTGTTTGGAGTGACTATCAAGTGATCGACAACACTTTTTTAACACTTAAGAACTCTTTTTCAAGCACTTCGAATGTCATTCCAAACATGACTTCAGTTCACTTCTATAATGTGTGTTGTATTGGCTTATGTATTTTATGAGATATAGCAATTTATCCATATTGTTTTAGTGTAAGGGAGAAAGTATGTATTTTCCTTTTATTTTGAATTCTCCA
GTCCCGTAGCTTACTACCTTGTAACGGTGTTGCAGCTCAAGGATAATATTTGGAAGGACTTCACTGGGAATGATGATTATATAGTGCCTCATCTTGGTGACGAACTTTGGGATCTATTTGACGAGCGGAGTGATAGTCTTGAGAATTCTAATCCCAAAATTAGCTTTACAGATGATTCTTACAATACAACTAAGTTCAGTTTTCTTGGTAAGGAGAAAGTAATAACACAAACATCAATCATGAAGAATACAGTTCTGGAAAAAGATTCATGGCCTCATACACCTGATGGTGTTCCTACTTCAAACAGTGACTCGTTCAAAGATGTGAAAATGGAACCAAACGGTCTTAGTATGCCAAACCGCTGCTTTAAAACAGACGTGGGTACAGACTTTGATTACTGCACAGATGATCATATTGTAACTGACAATAGTGCTGCAGATGAGAATGACATGTACCAATATTCTGTCAGTCACATGTCTCAAACAGGTAATGATATTAGTTTTCTGGATGATGATTGTGAAAACAAAGAAAACAACGATCTTCTGTTTTATGGGTGGCAAGATATGGAAAGCTTTGAGGACGTAGATAGGATGTTCAGGTAATCAACCTTGTTCATTGTTTTTTTCCCATTTGTTTTGATTTTATGATATAAAGTTGAATTCTTTGATCCACTGTGATTGTGGTTTATGCTTGATACAGAAATTGTGATTCAACATTTGGGCTCGGGAATCTAAGCAATGAAGATGATCTATGCTGGTTTTCCCCATCCCATGGCTCAGAAAAACTTGAAGAAAAACCAACAAAGTCGAACTTCAATTTTTCGTGCTGTGAAGGAAGTACAATAAATGATGCATTGGAATTTAATGAAGGTTCTAATCTCCTGAATTCCGATCCATCATCTGATGGTTTGAACAAAAATAATATTTTAACTGGGCGTAAGGTAAATGATGGGATTGACTCTGCTGCTGTTAGTCACTTGTTAGCTTCTGACATACCAGATAGAAAAAGAAATTCTAGAGGTGACTTGATACCTAAACAACAGGTTTGTCTTTGTTCTAATTTTTCTATTTGAAATTGAGTTTTCTGGATCTTTCTTTATCCTTAAACCAAAAATGAAATAAAACCTTTCTCATTTTATTTGTAATTGAGATCTTCTCTCCCCAGCCTAGTTATGTTCATTGTAAAAAATGAGTGAAGAACGAACCATCTTTTCATGTAACGAAGCATTGTAGTAAAACTCTTAAGGCTCGTTGTAGGATAAAGTAAGAACTCTCCGAGCGCATCCTTCTGGAACTCATCAAATTGAAAACTTATGACTGTAACCTAAATAATGCACAAAGTTCTTCAAGGCATAAATAAGGAGCCATACATGATAGTATAGAATATCTTCCCTATTAAGGTTTCCAAGGCATAAATAGGGATCCTGATCTGATACTCGAGTGGCATACGAACAGGAAGTTTGTGTCTGGACCTAGAATATAGCAATGCGTATTAACAAATTTTGGTGTCTTCAGCTCTTTCTCGTACAAAATCTACAATTAACGCCCACTTATTTCTGCAGGAGTCTTCTTTTGCATCTAATCAACTACAGTCTGTACCTAGCTCTCATAATCCTTCCTTTGACGCTCCAACAATTGCAGCAAATGAAAATAGAGAAAAACTGTACCACCAGGATTTACCAGCCTCATTCAATAAGAATATCACTTTGATGTCTATGCCAAATTCGGAAACATTCAGTACTTCATTTCCAGTTAGAAAGCATGCACCAAGATCTGAAAGTGAAATTGATGATGGTCACAGTGAAACTGGAGTAGTTAGCCGAGAAAGTCGAGCGGAATTAGATTCTTCAAATGCACCGGATATATCTTGCCAGAGCACTACGCGGGATGGAATCTCACTGGAAGCAACTAGTTTTTGCCAGCTTCAACAAGTAATGGAGCAGGTAGTAGTATTTTACATGCTTTTCACATCATATAATAGTATCCCCTGTCTATCTGCAT
GGAAAGAGTCATTTGCATGCATAGTAGTTCTTCGTCCTTCATGCAATTAGCGAGTCGTTCTTTGCTAACATACCCTAGCCAATAATCTGTCCATGTCCATAACAAACATTTGGTGGTTTTCTTCAATCCCTTCAATTTGTGGTATTCATTGCTCGCATGTCTGAAATTCTACAAGTCGTAAATTAGATTGTCCATATACCTTAAACAACATCGTCACGAGGAGACACCAGCACAACATGACACATGCACGTCAACACGTCATTTTCTAGAAAATTAGGACATGGACATGGACATTGAAAAGTTTGTTTATAAAAGGATGTACATTTGAATTATATTTGAAGACTTGCCTTACATTCCAATGATTTTACATTATGTGTTTTGAGTTTATATATCTATATTTCAAACTAATGTGAAAGAACCACAGAACCTCATAAAGCTCATTGAATGTCTTACAATTCATTACACATGAGAGCATTGCATTTCCATAATCTTGAGGAAAATGTTTTTATGGTTTATACAAAAAACTAAAAATGAGCCAAGATTCACAAAATGCTTTGATAATGAACATTAGACTCCTTCACTGGGAAATGTGAAATTGTGATATTATTCTTCAAACTAACGGCTTGAATTACTTATTTCAGTTGGATATTCGAACAAAACTATGCATAAGAGATAGTTTATATCGCCTGGCAAGAAGTGCGGAGCAGAGACATAACTGTGCAAATATTAATGAGAATATTAGAGAAGATAAACTTGTGAGAATTGCACCGTTGATCGATCAAGACGCAAACAGGCAAGTTATTACTCACCTTATTATTATCAAGTTGGGGCACTTCAGTTTTCTTTCTGTTACCTTCTAGCAATGTGTGTACTTAGGCCTCATTTAGTTCATCTAAATGAGGATGATGGAAAGTTGAATGAAAGCAAATTGAAACGAAAAATTCTAGATTATGTGTACGAAAACTAACTTGGTTAATATTTACTAAAACTATAGGAATGCCAATTGGCCTGTATGGCTGGCTGGTTATCAGTGGTTTCTTGTTTCTGTAAATGCAGAGGATGGGTCAACCAACAGCATTTTACTTATTTTTCTTCTTCTTGTGTATAAACAATGCAGTTAAGTTTGAGCTTCTTTTTCTCTTTCTTTTCCCCGTCTATGGAGTTATTTCGAACTTTCTTACCGTGGATTTCGGTTTTAACAGGAGTGGAGGTTTTTTGGATTTGGAAACTGATACCAATCCTATTGACCGATCAGTCGCCCACTTGCTGTTTCACCGGCCTTCGGATCCGTCTGTAGTGCCCACTGGTGGTAATGCCTTGCCTCTGAAATCTCCCAAACTGGTAAGCTTTTTACACCTTTCAGTTTTCTTCCTGTTCTGTATGGAATAAAATCATTTCCAGTTAGAGAAACTCGAGAGCGTCTTGTTTTTTCTTATCAATATAGTTCGTCTTAACATGTCACCAATTTGCAGGTGCCAGCCGAAAAACAAAATTTTCAGGACGAAACCGGGGGAGCTACAGCTTATGCAGCAGATCAAAAGCCGCTGTCAAATGGGAAAAACTATGAGCAGTAG

mRNA sequence

GGAGAATAGTAAAGTATTCGTCCACCCAAAAAGAAAAAAAAAAAGAAACTCTCGAAGTCGAAGCCTTCCTGGAGCTCTCTTCAACCAGACTGAAGAACCCGCCTAGGGCTCACCTTTGCTTTTTTGTTTTTGTCTTTGCTGGGATTTTTTTCTTCGATCCCCAATCGTTGAATATCTGGATTGCTTGGGATTTTCTGAGCGGGATTTTTGGGGTCCAGTCTCCATAAGTAAGAGCCTGAGAAAGTGTTGTTTCATCTTTCTCTTGTTCTTAATTCAGATTTACTGCTGGGCAAGTTCGTGATTGAGACATGGGATTGTATCAAACAGTTTTATGCTTCTTAAATGGTGGATAAAGAATCTTGAAGGAATGTCTGACTTGTGTATGTACAAGCTCAAGGATAATATTTGGAAGGACTTCACTGGGAATGATGATTATATAGTGCCTCATCTTGGTGACGAACTTTGGGATCTATTTGACGAGCGGAGTGATAGTCTTGAGAATTCTAATCCCAAAATTAGCTTTACAGATGATTCTTACAATACAACTAAGTTCAGTTTTCTTGGTAAGGAGAAAGTAATAACACAAACATCAATCATGAAGAATACAGTTCTGGAAAAAGATTCATGGCCTCATACACCTGATGGTGTTCCTACTTCAAACAGTGACTCGTTCAAAGATGTGAAAATGGAACCAAACGGTCTTAGTATGCCAAACCGCTGCTTTAAAACAGACGTGGGTACAGACTTTGATTACTGCACAGATGATCATATTGTAACTGACAATAGTGCTGCAGATGAGAATGACATGTACCAATATTCTGTCAGTCACATGTCTCAAACAGGTAATGATATTAGTTTTCTGGATGATGATTGTGAAAACAAAGAAAACAACGATCTTCTGTTTTATGGGTGGCAAGATATGGAAAGCTTTGAGGACGTAGATAGGATGTTCAGAAATTGTGATTCAACATTTGGGCTCGGGAATCTAAGCAATGAAGATGATCTATGCTGGTTTTCCCCATCCCATGGCTCAGAAAAACTTGAAGAAAAACCAACAAAGTCGAACTTCAATTTTTCGTGCTGTGAAGGAAGTACAATAAATGATGCATTGGAATTTAATGAAGGTTCTAATCTCCTGAATTCCGATCCATCATCTGATGGTTTGAACAAAAATAATATTTTAACTGGGCGTAAGGTAAATGATGGGATTGACTCTGCTGCTGTTAGTCACTTGTTAGCTTCTGACATACCAGATAGAAAAAGAAATTCTAGAGGTGACTTGATACCTAAACAACAGGAGTCTTCTTTTGCATCTAATCAACTACAGTCTGTACCTAGCTCTCATAATCCTTCCTTTGACGCTCCAACAATTGCAGCAAATGAAAATAGAGAAAAACTGTACCACCAGGATTTACCAGCCTCATTCAATAAGAATATCACTTTGATGTCTATGCCAAATTCGGAAACATTCAGTACTTCATTTCCAGTTAGAAAGCATGCACCAAGATCTGAAAGTGAAATTGATGATGGTCACAGTGAAACTGGAGTAGTTAGCCGAGAAAGTCGAGCGGAATTAGATTCTTCAAATGCACCGGATATATCTTGCCAGAGCACTACGCGGGATGGAATCTCACTGGAAGCAACTAGTTTTTGCCAGCTTCAACAAGTAATGGAGCAGTTGGATATTCGAACAAAACTATGCATAAGAGATAGTTTATATCGCCTGGCAAGAAGTGCGGAGCAGAGACATAACTGTGCAAATATTAATGAGAATATTAGAGAAGATAAACTTGTGAGAATTGCACCGTTGATCGATCAAGACGCAAACAGGAATGCCAATTGGCCTGTATGGCTGGCTGGTTATCAGTGGTTTCTTGTTTCTGTAAATGCAGAGGATGGGAGTGGAGGTTTTTTGGATTTGGAAACTGATACCAATCCTATTGACCGATCAGTCGCCCACTTGCTGTTTCACCGGCCTTCGGATCCGTCTGTAGTGCCCA
CTGGTGGTAATGCCTTGCCTCTGAAATCTCCCAAACTGGTGCCAGCCGAAAAACAAAATTTTCAGGACGAAACCGGGGGAGCTACAGCTTATGCAGCAGATCAAAAGCCGCTGTCAAATGGGAAAAACTATGAGCAGTAG

Coding sequence (CDS)

ATGCTTCTTAAATGGTGGATAAAGAATCTTGAAGGAATGTCTGACTTGTGTATGTACAAGCTCAAGGATAATATTTGGAAGGACTTCACTGGGAATGATGATTATATAGTGCCTCATCTTGGTGACGAACTTTGGGATCTATTTGACGAGCGGAGTGATAGTCTTGAGAATTCTAATCCCAAAATTAGCTTTACAGATGATTCTTACAATACAACTAAGTTCAGTTTTCTTGGTAAGGAGAAAGTAATAACACAAACATCAATCATGAAGAATACAGTTCTGGAAAAAGATTCATGGCCTCATACACCTGATGGTGTTCCTACTTCAAACAGTGACTCGTTCAAAGATGTGAAAATGGAACCAAACGGTCTTAGTATGCCAAACCGCTGCTTTAAAACAGACGTGGGTACAGACTTTGATTACTGCACAGATGATCATATTGTAACTGACAATAGTGCTGCAGATGAGAATGACATGTACCAATATTCTGTCAGTCACATGTCTCAAACAGGTAATGATATTAGTTTTCTGGATGATGATTGTGAAAACAAAGAAAACAACGATCTTCTGTTTTATGGGTGGCAAGATATGGAAAGCTTTGAGGACGTAGATAGGATGTTCAGAAATTGTGATTCAACATTTGGGCTCGGGAATCTAAGCAATGAAGATGATCTATGCTGGTTTTCCCCATCCCATGGCTCAGAAAAACTTGAAGAAAAACCAACAAAGTCGAACTTCAATTTTTCGTGCTGTGAAGGAAGTACAATAAATGATGCATTGGAATTTAATGAAGGTTCTAATCTCCTGAATTCCGATCCATCATCTGATGGTTTGAACAAAAATAATATTTTAACTGGGCGTAAGGTAAATGATGGGATTGACTCTGCTGCTGTTAGTCACTTGTTAGCTTCTGACATACCAGATAGAAAAAGAAATTCTAGAGGTGACTTGATACCTAAACAACAGGAGTCTTCTTTTGCATCTAATCAACTACAGTCTGTACCTAGCTCTCATAATCCTTCCTTTGACGCTCCAACAATTGCAGCAAATGAAAATAGAGAAAAACTGTACCACCAGGATTTACCAGCCTCATTCAATAAGAATATCACTTTGATGTCTATGCCAAATTCGGAAACATTCAGTACTTCATTTCCAGTTAGAAAGCATGCACCAAGATCTGAAAGTGAAATTGATGATGGTCACAGTGAAACTGGAGTAGTTAGCCGAGAAAGTCGAGCGGAATTAGATTCTTCAAATGCACCGGATATATCTTGCCAGAGCACTACGCGGGATGGAATCTCACTGGAAGCAACTAGTTTTTGCCAGCTTCAACAAGTAATGGAGCAGTTGGATATTCGAACAAAACTATGCATAAGAGATAGTTTATATCGCCTGGCAAGAAGTGCGGAGCAGAGACATAACTGTGCAAATATTAATGAGAATATTAGAGAAGATAAACTTGTGAGAATTGCACCGTTGATCGATCAAGACGCAAACAGGAATGCCAATTGGCCTGTATGGCTGGCTGGTTATCAGTGGTTTCTTGTTTCTGTAAATGCAGAGGATGGGAGTGGAGGTTTTTTGGATTTGGAAACTGATACCAATCCTATTGACCGATCAGTCGCCCACTTGCTGTTTCACCGGCCTTCGGATCCGTCTGTAGTGCCCACTGGTGGTAATGCCTTGCCTCTGAAATCTCCCAAACTGGTGCCAGCCGAAAAACAAAATTTTCAGGACGAAACCGGGGGAGCTACAGCTTATGCAGCAGATCAAAAGCCGCTGTCAAATGGGAAAAACTATGAGCAGTAG

Protein sequence

MLLKWWIKNLEGMSDLCMYKLKDNIWKDFTGNDDYIVPHLGDELWDLFDERSDSLENSNPKISFTDDSYNTTKFSFLGKEKVITQTSIMKNTVLEKDSWPHTPDGVPTSNSDSFKDVKMEPNGLSMPNRCFKTDVGTDFDYCTDDHIVTDNSAADENDMYQYSVSHMSQTGNDISFLDDDCENKENNDLLFYGWQDMESFEDVDRMFRNCDSTFGLGNLSNEDDLCWFSPSHGSEKLEEKPTKSNFNFSCCEGSTINDALEFNEGSNLLNSDPSSDGLNKNNILTGRKVNDGIDSAAVSHLLASDIPDRKRNSRGDLIPKQQESSFASNQLQSVPSSHNPSFDAPTIAANENREKLYHQDLPASFNKNITLMSMPNSETFSTSFPVRKHAPRSESEIDDGHSETGVVSRESRAELDSSNAPDISCQSTTRDGISLEATSFCQLQQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCANINENIREDKLVRIAPLIDQDANRNANWPVWLAGYQWFLVSVNAEDGSGGFLDLETDTNPIDRSVAHLLFHRPSDPSVVPTGGNALPLKSPKLVPAEKQNFQDETGGATAYAADQKPLSNGKNYEQ
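
The relationship between the coding sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the standard genetic code (the record does not say which table was used), and the function name `translate` is illustrative. The two test fragments are the first 24 and the last 15 bases of the CDS listed above.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGCTTCTTAAATGGTGGATAAAG"))  # start of the CDS
print(translate("AACTATGAGCAGTAG"))           # end of the CDS, including the stop codon
```

The two fragments translate to `MLLKWWIK` and `NYEQ`, which match the first and last residues of the protein sequence above.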
BLAST of CmaCh04G003920 vs. Swiss-Prot
Match: LNK1_ARATH (Protein LNK1 OS=Arabidopsis thaliana GN=LNK1 PE=1 SV=1)

HSP 1 Score: 166.4 bits (420), Expect = 9.7e-40
Identity = 186/646 (28.79%), Positives = 269/646 (41.64%), Query Frame = 1

Query: 13  MSDLCMYKLKDNIWKDFTGNDDYIVPHLGDELWDLFDERSDSLENSNPKISFTDDSYNTT 72
           MSDL +++L D +  +F GNDD IVP    E     D     +  SN K    DD  +  
Sbjct: 1   MSDLYIHELGDYLSDEFHGNDDGIVPDSAYE-----DGGQFPILVSNRKKRRNDDMGS-- 60

Query: 73  KFSFLGKEKVITQTSIMKNT-VLEKDSWPHTPDGVPTSNSDSFKDVKMEPNGLSMPNRCF 132
                G   + + T I +   +L K+ WP    G  + + D+     ++   L   N   
Sbjct: 61  -----GTNHLKSNTFIKREANMLGKNPWPEKDSGGSSVSRDTGTGKDVQDMTLEDTNTSD 120

Query: 133 KTDVGTDFD----YCTDDHIVTDNSAADENDMYQYSVSHMSQTGNDISFLDDDCENKENN 192
               G   D    + T D ++ D SAA  + +Y YS++ +    ND+SF D+   +KE N
Sbjct: 121 HGFNGGHVDVVENFSTGDPMLCDTSAATNDGVYNYSLNSIPDAENDLSFFDNG--DKEKN 180

Query: 193 DLLF---------------------YGWQDMESFEDV----------------------D 252
           DL +                     +G   + +  D+                      D
Sbjct: 181 DLFYGWGDIGNFEDVDNMLRSCDSTFGLDSLNNEGDLGWFSSAQPNEETAGAMTDDLKPD 240

Query: 253 RMFRNCDSTFGLGNLSNEDDLCWFSPSHGSE----------KLEEKPTKSNFNFSCCEGS 312
           +M  N  +      L  ED L    P+H  E            + K +++ F+ S  +  
Sbjct: 241 KMLENQRTAM----LQVEDFLNNSEPNHAVEDEYGYTIEDDSAQGKSSQNVFDTSLQKKD 300

Query: 313 T----INDALEFNEGSNLLNSDPSSDGLNKNNILT-----GRKVNDGIDSAAVSHLLASD 372
                +   LE  +  +L + D  SDG ++N+         R++ D       S     D
Sbjct: 301 ILMLDVEANLEKKQTDHLHHLDGKSDGFSENSFTLQHSGISREIMDTNQYYPPSAFQQRD 360

Query: 373 IP----------------DRKRNSRGDLIPKQQESS---FASNQLQSVPSSHNPSFDAPT 432
           +P                + K   + +  P    +S   + SN  QS+ S   P+ D   
Sbjct: 361 VPYSHFNCEQPSVQVSACESKSGIKSENKPSPSSASNESYTSNHAQSIESLQGPTVDDRF 420

Query: 433 IAANENREKLYH-QDLPASFNKNITLMSMPNSETFSTSFPVRKHAPRSESEIDDGHSETG 492
               E R  L   QD+P SF  N    S  +S  F  + P++K        +++ H    
Sbjct: 421 RKVFETRANLLPGQDMPPSFAANTKKSSKTDSMVFPDAAPIQKIG------LENDH---- 480

Query: 493 VVSRESRAELDSSNAPDISCQSTTRDGISLEATSFCQLQQVMEQLDIRTKLCIRDSLYRL 552
              R++  EL++SN    SC S+  D ISLEATSF QLQQV+EQLD+RTKLCIRDSLYRL
Sbjct: 481 ---RKAATELETSNMQGSSCVSSVVDDISLEATSFRQLQQVIEQLDVRTKLCIRDSLYRL 540

Query: 553 ARSAEQRHNCANINENIREDKLVRIAPLIDQDANRNANWPVWLAGYQWFLVSVNAEDGSG 572
           A+SAEQRH+  N  E      LV                                 D   
Sbjct: 541 AKSAEQRHHGGNRPEKGAGSHLV-----------------------------TGEADKYA 585
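
The percentages on the HSP summary lines are the ratio of matching columns to alignment length. The sketch below parses one such line and recomputes them as a consistency check. The regular expression and the helper name `hsp_percentages` are assumptions made for illustration; they are not part of any BLAST tooling.

```python
import re

# Matches "Identity = 186/646 (28.79%)" and the corresponding "Positives" field;
# "Pos\w*tives" also tolerates misspelled labels such as "Postives".
HSP_STATS = re.compile(r"(Identity|Pos\w*tives) = (\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")

def hsp_percentages(line):
    """Return {label: (reported_percent, recomputed_percent)} for one HSP summary line."""
    result = {}
    for label, matches, length, reported in HSP_STATS.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        recomputed = round(100 * int(matches) / int(length), 2)
        result[key] = (float(reported), recomputed)
    return result

stats = hsp_percentages(
    "Identity = 186/646 (28.79%), Positives = 269/646 (41.64%), Query Frame = 1"
)
for key, (reported, recomputed) in stats.items():
    print(key, reported, recomputed)
```

For the LNK1_ARATH hit, 186 identical columns over an alignment length of 646 gives 28.79% identity, in agreement with the reported value.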

BLAST of CmaCh04G003920 vs. Swiss-Prot
Match: LNK2_ARATH (Protein LNK2 OS=Arabidopsis thaliana GN=LNK2 PE=1 SV=1)

HSP 1 Score: 67.0 bits (162), Expect = 8.0e-10
Identity = 52/147 (35.37%), Positives = 72/147 (48.98%), Query Frame = 1

Query: 434 SLEATSFCQLQQVMEQLDIRTKLCIRDSLYRLARSAEQRH---NCANINENIREDKLVRI 493
           S E     +LQ V+ +LD+ T+ CIRDSL+RLA SA QRH   + ++ N+  ++D+ V  
Sbjct: 489 SAEFAVLYRLQDVVAKLDMGTRTCIRDSLFRLAGSAAQRHYTSDTSHSNKTSQDDQEV-- 548

Query: 494 APLIDQDANRNANWPVWLAGYQWFLVSVNAEDGSGGFLDLETDTNPIDRSVAHLLFHRPS 553
              I ++ +R          Y++            G  D E  TNP DR+VAHLLFHRP 
Sbjct: 549 ---IPREESR----------YRY-----------AGMPDTEAVTNPTDRTVAHLLFHRPF 608

Query: 554 DPSVVPTGGNALPLKSPKLVPAEKQNF 578
           D              S K+   EK NF
Sbjct: 609 DMLAAKRMEGPESPASSKMGTEEKGNF 609

BLAST of CmaCh04G003920 vs. TrEMBL
Match: A0A0A0KVV5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G107450 PE=4 SV=1)

HSP 1 Score: 766.9 bits (1979), Expect = 1.8e-218
Identity = 406/515 (78.83%), Positives = 431/515 (83.69%), Query Frame = 1

Query: 89  MKNTVLEKDSWPHTPDGVPTS-NSDSFKDVKMEPNGLSMPNRCFKTDVGTDFDYCTDDHI 148
           MK+TVLEKDSW HTPDGVP+S NSDSFKD KME + LSM N CFKTDVGTD DYCTDDHI
Sbjct: 1   MKSTVLEKDSWSHTPDGVPSSLNSDSFKDAKMESSSLSMSNHCFKTDVGTDLDYCTDDHI 60

Query: 149 VTDNSAADENDMYQYSVSHMSQTGNDISFLDDDCENKENNDLLFYGWQDMESFEDVDRMF 208
           VTDNSAADENDMYQYSVSHMSQT NDISFLDDD ENKENNDLL+YGWQD+ SFEDVDRMF
Sbjct: 61  VTDNSAADENDMYQYSVSHMSQTDNDISFLDDDRENKENNDLLYYGWQDIGSFEDVDRMF 120

Query: 209 RNCDSTFGLGNLSNEDDLCWFSPSHGSEKLEEKPTKSNFNFSCCEGSTINDALEFNEGSN 268
           RNCDSTFGLGNLSNEDDL WFSPSHG+EKLE+ P+K NF FSCCEGSTINDA EFNE SN
Sbjct: 121 RNCDSTFGLGNLSNEDDLRWFSPSHGTEKLED-PSKPNFKFSCCEGSTINDATEFNEESN 180

Query: 269 LLNSDPSSDGLNKNNILTGRKVNDGI----DSAAVSHLLASDIPDRKRNSRGDLIPKQQE 328
            +NS+ S DGLN+NNIL G K+NDGI    DSAA+SHL A+D+ DRK NS GDLIPK+QE
Sbjct: 181 PVNSEASPDGLNRNNILNGCKMNDGITDIGDSAAISHLSAADMSDRKGNSSGDLIPKKQE 240

Query: 329 SSFASNQLQSVPSSHNPSFDAPTIAANENREKLYHQDLPASFNKNITLMSMPNSETFSTS 388
           SS+ASNQL S   SH PSFDAPTI ANENREKLYHQDLPASFNKN T MS P+SETF+TS
Sbjct: 241 SSYASNQLHS---SHYPSFDAPTIGANENREKLYHQDLPASFNKNFTFMSAPSSETFNTS 300

Query: 389 FPVRKHAPRSESEIDDGHSETGVVSRESRAELDSSNAPDISCQSTTRDGISLEATSFCQL 448
           FPVRK APRSESEIDDGHSE+GVVSR SR ELDSSNA D  C+ST  DGISLEATSF QL
Sbjct: 301 FPVRKQAPRSESEIDDGHSESGVVSRGSRVELDSSNAQDKPCRSTMLDGISLEATSFRQL 360

Query: 449 QQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCANINENIREDKLVRIAPLIDQDANRNAN 508
           QQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCAN+NEN  EDK VR+A  IDQDANR   
Sbjct: 361 QQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCANLNENTGEDKFVRVASSIDQDANR--- 420

Query: 509 WPVWLAGYQWFLVSVNAEDGSGGFLDLETDTNPIDRSVAHLLFHRPSDPSVVPTGGNALP 568
                               SGGFLDLETDTNPIDRSVAHLLFHRPSDPS++P GGN L 
Sbjct: 421 --------------------SGGFLDLETDTNPIDRSVAHLLFHRPSDPSLMPAGGNTLS 480

Query: 569 LKSPKLVPAEKQNFQDETGGATAYAADQKPLSNGK 599
           LKS KLVPAEKQNFQDETGGA A  ADQK LSNGK
Sbjct: 481 LKSHKLVPAEKQNFQDETGGAAA-CADQKLLSNGK 487

BLAST of CmaCh04G003920 vs. TrEMBL
Match: W9QVG7_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_017214 PE=4 SV=1)

HSP 1 Score: 264.2 bits (674), Expect = 3.8e-67
Identity = 194/536 (36.19%), Positives = 279/536 (52.05%), Query Frame = 1

Query: 21  LKDNIWKDFTGNDDYIVPHLGDELWDLFDERSDSLENSNPKISFTDDSYNTTKFSFLGKE 80
           L+DN+W +F  +DD+IVPH G++  D      DS +    ++     + +T+   + G +
Sbjct: 90  LEDNVWDEFGDSDDHIVPHSGNKHADKSLAEGDSRKKPQSEVIGVASNGDTSTTHYTGGK 149

Query: 81  KVITQTSIM-KNTVLEKDSWPHTPDGV-PTSNSDSFKDV-KMEPNGLSMPNRCFKTD--V 140
              +   +   NT+LEK SW  T DGV P+ ++DS K+V  +  +G  + N  FK+D   
Sbjct: 150 GNRSIPFVTDNNTMLEKGSWSDTLDGVFPSCDTDSIKEVTSLASDGTRISNNSFKSDNVE 209

Query: 141 GTDFDYCTDDHIVTDNSAADENDMYQYSVSHMSQTGNDISFLDDDCENKENNDLLFYGWQ 200
               ++C DD I+ D   A +N +Y+Y +S + QTG ++SF D+D E++EN DLL+YGW 
Sbjct: 210 SGGSEFCVDDSIMGDGCTAVDNSLYRYPLSRIPQTGYELSFFDNDREDRENGDLLYYGWP 269

Query: 201 DMESFEDVDRMFRNCDSTFGLGNLSNEDDLCWFSPSHGSE-------------------- 260
           D+ +FEDVDRMFR+CDSTFG+ +L+N+++L WFS S+ +E                    
Sbjct: 270 DIGNFEDVDRMFRSCDSTFGMESLNNDEELSWFSSSNATEGSEDALKSGLKFAGPNAGSS 329

Query: 261 --------------KLEEK----PTKSNFNFSCCEGSTINDALEFNEGS----------- 320
                         KLE +    P +   + +  EG T +  LE N GS           
Sbjct: 330 KGVSEHYDAFKPDSKLESRDILMPKEQPKHDAQLEGETTDQYLE-NGGSFHHYDYLKQLT 389

Query: 321 -------NLLNSDPSSDGLNKNNILTGRKVND----GIDSAAVSHLLASD----IPDRKR 380
                  +L +   S+ G+ ++  + G          +    V H   SD     P    
Sbjct: 390 DMKQPHGDLSSQFYSAPGIEQHKQIAGPNSMSYNQRQLPYMHVDHTRQSDQISVCPTPSV 449

Query: 381 NSRGDLIPKQ-QESSFASNQLQSVPSSHNPSFDAPTIAANENREKLYH-QDLPASFNKNI 440
            S  D  P   +ESS+ SNQLQ + +   P  + P    +EN EK +  Q   +SF  N 
Sbjct: 450 KSEKDGKPSSLKESSYGSNQLQCMENFRGPLTETPIATISENTEKPHSCQGFQSSFIMNF 509

Query: 441 TLMSMPNSETFSTSFPVRKHAPRSESEIDDGHSETGVVSRESRAELDSSNAPDISCQSTT 486
              +  N   F      +K   +SE++I  GHS+    S E  AE+DSSN  + SC S+ 
Sbjct: 510 ENAATTNPVVFGNKVSKQKEEQKSENQIG-GHSDIEGASNEIPAEVDSSNLQESSCMSSV 569

BLAST of CmaCh04G003920 vs. TrEMBL
Match: A0A061E394_THECC (Dentin sialophosphoprotein-related, putative isoform 3 OS=Theobroma cacao GN=TCM_007494 PE=4 SV=1)

HSP 1 Score: 218.8 bits (556), Expect = 1.8e-53
Identity = 128/322 (39.75%), Positives = 200/322 (62.11%), Query Frame = 1

Query: 13  MSDLCMYKLKDNIWKDFTGNDDYIVPHLGDELWDLFDERSDSLENSNPKIS-FTDDSYNT 72
           MSDLCMY+L+DN+W +F  +DD+IVPH  DE    F  + D  +     ++  T ++ NT
Sbjct: 1   MSDLCMYELEDNVWDEFGASDDHIVPHTVDEYGAQFKVQDDVRKKRRHDVTGVTSNANNT 60

Query: 73  TKFSFLGKEKVITQTSIMKNTVLEKDSWPHTPDGV-PTS-NSDSFKDVKMEPNGLSMPNR 132
           TK+  LG+++    T ++KN +LEK SW H+PDG+ PTS ++DS  +  M  +     + 
Sbjct: 61  TKYGILGEKEKGLHT-LIKNRMLEKGSWSHSPDGIFPTSGDNDSHNEATMASDDSRTSSH 120

Query: 133 CFKT----DVGTDFDYCTDDHIVTDNSAADENDMYQYSVSHMSQTGNDISFLDDDCENKE 192
             KT     VG++F  C D+ ++ D  A ++N++YQY +++MSQT +D+SF D++ E+KE
Sbjct: 121 GLKTGNIDSVGSEF--CADEPVLVDKCATEDNNVYQYPLNNMSQTDDDLSFFDNNHEDKE 180

Query: 193 NNDLLFYGWQDMESFEDVDRMFRNCDSTFGLGNLSNEDDLCWFSPSHGSEKLEEKPTKSN 252
           N+DLL+YGW D+ +FEDVDRMFR+CDSTFGLG+LSNEDDLCWFS S  +E   + P K++
Sbjct: 181 NSDLLYYGWADIGNFEDVDRMFRSCDSTFGLGSLSNEDDLCWFSSSQATEGSHD-PLKAD 240

Query: 253 FNFSCCEGSTINDALEFNEGSNLLNSDPSSDGLNKNNILTGRKV---NDGIDSAAVSHLL 312
                   + +N   E    S   ++ PS+   NK ++    K+   N   D+A ++ + 
Sbjct: 241 --------AKLNSVPEDCATSRPDSAGPSTIDSNKKSVFLSDKISPLNSSSDNAGLAPMS 300

Query: 313 ASDIPDRKRNSRGDLIPKQQES 325
           + ++ + +  S+ D IP +Q S
Sbjct: 301 SLNVSNTESESKDDPIPNEQIS 310

BLAST of CmaCh04G003920 vs. TrEMBL
Match: A0A061E1A2_THECC (Dentin sialophosphoprotein-related, putative isoform 7 OS=Theobroma cacao GN=TCM_007494 PE=4 SV=1)

HSP 1 Score: 218.8 bits (556), Expect = 1.8e-53
Identity = 128/322 (39.75%), Positives = 200/322 (62.11%), Query Frame = 1

Query: 13  MSDLCMYKLKDNIWKDFTGNDDYIVPHLGDELWDLFDERSDSLENSNPKIS-FTDDSYNT 72
           MSDLCMY+L+DN+W +F  +DD+IVPH  DE    F  + D  +     ++  T ++ NT
Sbjct: 1   MSDLCMYELEDNVWDEFGASDDHIVPHTVDEYGAQFKVQDDVRKKRRHDVTGVTSNANNT 60

Query: 73  TKFSFLGKEKVITQTSIMKNTVLEKDSWPHTPDGV-PTS-NSDSFKDVKMEPNGLSMPNR 132
           TK+  LG+++    T ++KN +LEK SW H+PDG+ PTS ++DS  +  M  +     + 
Sbjct: 61  TKYGILGEKEKGLHT-LIKNRMLEKGSWSHSPDGIFPTSGDNDSHNEATMASDDSRTSSH 120

Query: 133 CFKT----DVGTDFDYCTDDHIVTDNSAADENDMYQYSVSHMSQTGNDISFLDDDCENKE 192
             KT     VG++F  C D+ ++ D  A ++N++YQY +++MSQT +D+SF D++ E+KE
Sbjct: 121 GLKTGNIDSVGSEF--CADEPVLVDKCATEDNNVYQYPLNNMSQTDDDLSFFDNNHEDKE 180

Query: 193 NNDLLFYGWQDMESFEDVDRMFRNCDSTFGLGNLSNEDDLCWFSPSHGSEKLEEKPTKSN 252
           N+DLL+YGW D+ +FEDVDRMFR+CDSTFGLG+LSNEDDLCWFS S  +E   + P K++
Sbjct: 181 NSDLLYYGWADIGNFEDVDRMFRSCDSTFGLGSLSNEDDLCWFSSSQATEGSHD-PLKAD 240

Query: 253 FNFSCCEGSTINDALEFNEGSNLLNSDPSSDGLNKNNILTGRKV---NDGIDSAAVSHLL 312
                   + +N   E    S   ++ PS+   NK ++    K+   N   D+A ++ + 
Sbjct: 241 --------AKLNSVPEDCATSRPDSAGPSTIDSNKKSVFLSDKISPLNSSSDNAGLAPMS 300

Query: 313 ASDIPDRKRNSRGDLIPKQQES 325
           + ++ + +  S+ D IP +Q S
Sbjct: 301 SLNVSNTESESKDDPIPNEQIS 310

BLAST of CmaCh04G003920 vs. TrEMBL
Match: A0A061E968_THECC (Dentin sialophosphoprotein-related, putative isoform 6 OS=Theobroma cacao GN=TCM_007494 PE=4 SV=1)

HSP 1 Score: 218.8 bits (556), Expect = 1.8e-53
Identity = 128/322 (39.75%), Positives = 200/322 (62.11%), Query Frame = 1

Query: 13  MSDLCMYKLKDNIWKDFTGNDDYIVPHLGDELWDLFDERSDSLENSNPKIS-FTDDSYNT 72
           MSDLCMY+L+DN+W +F  +DD+IVPH  DE    F  + D  +     ++  T ++ NT
Sbjct: 1   MSDLCMYELEDNVWDEFGASDDHIVPHTVDEYGAQFKVQDDVRKKRRHDVTGVTSNANNT 60

Query: 73  TKFSFLGKEKVITQTSIMKNTVLEKDSWPHTPDGV-PTS-NSDSFKDVKMEPNGLSMPNR 132
           TK+  LG+++    T ++KN +LEK SW H+PDG+ PTS ++DS  +  M  +     + 
Sbjct: 61  TKYGILGEKEKGLHT-LIKNRMLEKGSWSHSPDGIFPTSGDNDSHNEATMASDDSRTSSH 120

Query: 133 CFKT----DVGTDFDYCTDDHIVTDNSAADENDMYQYSVSHMSQTGNDISFLDDDCENKE 192
             KT     VG++F  C D+ ++ D  A ++N++YQY +++MSQT +D+SF D++ E+KE
Sbjct: 121 GLKTGNIDSVGSEF--CADEPVLVDKCATEDNNVYQYPLNNMSQTDDDLSFFDNNHEDKE 180

Query: 193 NNDLLFYGWQDMESFEDVDRMFRNCDSTFGLGNLSNEDDLCWFSPSHGSEKLEEKPTKSN 252
           N+DLL+YGW D+ +FEDVDRMFR+CDSTFGLG+LSNEDDLCWFS S  +E   + P K++
Sbjct: 181 NSDLLYYGWADIGNFEDVDRMFRSCDSTFGLGSLSNEDDLCWFSSSQATEGSHD-PLKAD 240

Query: 253 FNFSCCEGSTINDALEFNEGSNLLNSDPSSDGLNKNNILTGRKV---NDGIDSAAVSHLL 312
                   + +N   E    S   ++ PS+   NK ++    K+   N   D+A ++ + 
Sbjct: 241 --------AKLNSVPEDCATSRPDSAGPSTIDSNKKSVFLSDKISPLNSSSDNAGLAPMS 300

Query: 313 ASDIPDRKRNSRGDLIPKQQES 325
           + ++ + +  S+ D IP +Q S
Sbjct: 301 SLNVSNTESESKDDPIPNEQIS 310

BLAST of CmaCh04G003920 vs. TAIR10
Match: AT5G64170.2 (AT5G64170.2 dentin sialophosphoprotein-related)

HSP 1 Score: 166.4 bits (420), Expect = 5.4e-41
Identity = 186/646 (28.79%), Positives = 269/646 (41.64%), Query Frame = 1

Query: 13  MSDLCMYKLKDNIWKDFTGNDDYIVPHLGDELWDLFDERSDSLENSNPKISFTDDSYNTT 72
           MSDL +++L D +  +F GNDD IVP    E     D     +  SN K    DD  +  
Sbjct: 1   MSDLYIHELGDYLSDEFHGNDDGIVPDSAYE-----DGGQFPILVSNRKKRRNDDMGS-- 60

Query: 73  KFSFLGKEKVITQTSIMKNT-VLEKDSWPHTPDGVPTSNSDSFKDVKMEPNGLSMPNRCF 132
                G   + + T I +   +L K+ WP    G  + + D+     ++   L   N   
Sbjct: 61  -----GTNHLKSNTFIKREANMLGKNPWPEKDSGGSSVSRDTGTGKDVQDMTLEDTNTSD 120

Query: 133 KTDVGTDFD----YCTDDHIVTDNSAADENDMYQYSVSHMSQTGNDISFLDDDCENKENN 192
               G   D    + T D ++ D SAA  + +Y YS++ +    ND+SF D+   +KE N
Sbjct: 121 HGFNGGHVDVVENFSTGDPMLCDTSAATNDGVYNYSLNSIPDAENDLSFFDNG--DKEKN 180

Query: 193 DLLF---------------------YGWQDMESFEDV----------------------D 252
           DL +                     +G   + +  D+                      D
Sbjct: 181 DLFYGWGDIGNFEDVDNMLRSCDSTFGLDSLNNEGDLGWFSSAQPNEETAGAMTDDLKPD 240

Query: 253 RMFRNCDSTFGLGNLSNEDDLCWFSPSHGSE----------KLEEKPTKSNFNFSCCEGS 312
           +M  N  +      L  ED L    P+H  E            + K +++ F+ S  +  
Sbjct: 241 KMLENQRTAM----LQVEDFLNNSEPNHAVEDEYGYTIEDDSAQGKSSQNVFDTSLQKKD 300

Query: 313 T----INDALEFNEGSNLLNSDPSSDGLNKNNILT-----GRKVNDGIDSAAVSHLLASD 372
                +   LE  +  +L + D  SDG ++N+         R++ D       S     D
Sbjct: 301 ILMLDVEANLEKKQTDHLHHLDGKSDGFSENSFTLQHSGISREIMDTNQYYPPSAFQQRD 360

Query: 373 IP----------------DRKRNSRGDLIPKQQESS---FASNQLQSVPSSHNPSFDAPT 432
           +P                + K   + +  P    +S   + SN  QS+ S   P+ D   
Sbjct: 361 VPYSHFNCEQPSVQVSACESKSGIKSENKPSPSSASNESYTSNHAQSIESLQGPTVDDRF 420

Query: 433 IAANENREKLYH-QDLPASFNKNITLMSMPNSETFSTSFPVRKHAPRSESEIDDGHSETG 492
               E R  L   QD+P SF  N    S  +S  F  + P++K        +++ H    
Sbjct: 421 RKVFETRANLLPGQDMPPSFAANTKKSSKTDSMVFPDAAPIQKIG------LENDH---- 480

Query: 493 VVSRESRAELDSSNAPDISCQSTTRDGISLEATSFCQLQQVMEQLDIRTKLCIRDSLYRL 552
              R++  EL++SN    SC S+  D ISLEATSF QLQQV+EQLD+RTKLCIRDSLYRL
Sbjct: 481 ---RKAATELETSNMQGSSCVSSVVDDISLEATSFRQLQQVIEQLDVRTKLCIRDSLYRL 540

Query: 553 ARSAEQRHNCANINENIREDKLVRIAPLIDQDANRNANWPVWLAGYQWFLVSVNAEDGSG 572
           A+SAEQRH+  N  E      LV                                 D   
Sbjct: 541 AKSAEQRHHGGNRPEKGAGSHLV-----------------------------TGEADKYA 585

BLAST of CmaCh04G003920 vs. TAIR10
Match: AT3G54500.3 (AT3G54500.3 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 67.0 bits (162), Expect = 4.5e-11
Identity = 52/147 (35.37%), Positives = 72/147 (48.98%), Query Frame = 1

Query: 434 SLEATSFCQLQQVMEQLDIRTKLCIRDSLYRLARSAEQRH---NCANINENIREDKLVRI 493
           S E     +LQ V+ +LD+ T+ CIRDSL+RLA SA QRH   + ++ N+  ++D+ V  
Sbjct: 495 SAEFAVLYRLQDVVAKLDMGTRTCIRDSLFRLAGSAAQRHYTSDTSHSNKTSQDDQEV-- 554

Query: 494 APLIDQDANRNANWPVWLAGYQWFLVSVNAEDGSGGFLDLETDTNPIDRSVAHLLFHRPS 553
              I ++ +R          Y++            G  D E  TNP DR+VAHLLFHRP 
Sbjct: 555 ---IPREESR----------YRY-----------AGMPDTEAVTNPTDRTVAHLLFHRPF 614

Query: 554 DPSVVPTGGNALPLKSPKLVPAEKQNF 578
           D              S K+   EK NF
Sbjct: 615 DMLAAKRMEGPESPASSKMGTEEKGNF 615

BLAST of CmaCh04G003920 vs. NCBI nr
Match: gi|449438785|ref|XP_004137168.1| (PREDICTED: uncharacterized protein LOC101215423 isoform X1 [Cucumis sativus])

HSP 1 Score: 895.6 bits (2313), Expect = 4.8e-257
Identity = 467/591 (79.02%), Positives = 501/591 (84.77%), Query Frame = 1

Query: 13  MSDLCMYKLKDNIWKDFTGNDDYIVPHLGDELWDLFDERSDSLENSNPKISFTDDSYNTT 72
           MSDLCMYKLKDNIWKDFT NDDY+VPHLGDELWDLF+ +SD+LENSN ++ FT+D+Y+ T
Sbjct: 1   MSDLCMYKLKDNIWKDFTENDDYLVPHLGDELWDLFEVQSDNLENSNCRVGFTNDAYSAT 60

Query: 73  KFSFLGKEKVITQTSIMKNTVLEKDSWPHTPDGVPTS-NSDSFKDVKMEPNGLSMPNRCF 132
           KFSFLGKEKV TQTSIMK+TVLEKDSW HTPDGVP+S NSDSFKD KME + LSM N CF
Sbjct: 61  KFSFLGKEKVKTQTSIMKSTVLEKDSWSHTPDGVPSSLNSDSFKDAKMESSSLSMSNHCF 120

Query: 133 KTDVGTDFDYCTDDHIVTDNSAADENDMYQYSVSHMSQTGNDISFLDDDCENKENNDLLF 192
           KTDVGTD DYCTDDHIVTDNSAADENDMYQYSVSHMSQT NDISFLDDD ENKENNDLL+
Sbjct: 121 KTDVGTDLDYCTDDHIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDRENKENNDLLY 180

Query: 193 YGWQDMESFEDVDRMFRNCDSTFGLGNLSNEDDLCWFSPSHGSEKLEEKPTKSNFNFSCC 252
           YGWQD+ SFEDVDRMFRNCDSTFGLGNLSNEDDL WFSPSHG+EKLE+ P+K NF FSCC
Sbjct: 181 YGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDLRWFSPSHGTEKLED-PSKPNFKFSCC 240

Query: 253 EGSTINDALEFNEGSNLLNSDPSSDGLNKNNILTGRKVNDGI----DSAAVSHLLASDIP 312
           EGSTINDA EFNE SN +NS+ S DGLN+NNIL G K+NDGI    DSAA+SHL A+D+ 
Sbjct: 241 EGSTINDATEFNEESNPVNSEASPDGLNRNNILNGCKMNDGITDIGDSAAISHLSAADMS 300

Query: 313 DRKRNSRGDLIPKQQESSFASNQLQSVPSSHNPSFDAPTIAANENREKLYHQDLPASFNK 372
           DRK NS GDLIPK+QESS+ASNQL S   SH PSFDAPTI ANENREKLYHQDLPASFNK
Sbjct: 301 DRKGNSSGDLIPKKQESSYASNQLHS---SHYPSFDAPTIGANENREKLYHQDLPASFNK 360

Query: 373 NITLMSMPNSETFSTSFPVRKHAPRSESEIDDGHSETGVVSRESRAELDSSNAPDISCQS 432
           N T MS P+SETF+TSFPVRK APRSESEIDDGHSE+GVVSR SR ELDSSNA D  C+S
Sbjct: 361 NFTFMSAPSSETFNTSFPVRKQAPRSESEIDDGHSESGVVSRGSRVELDSSNAQDKPCRS 420

Query: 433 TTRDGISLEATSFCQLQQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCANINENIREDKL 492
           T  DGISLEATSF QLQQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCAN+NEN  EDK 
Sbjct: 421 TMLDGISLEATSFRQLQQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCANLNENTGEDKF 480

Query: 493 VRIAPLIDQDANRNANWPVWLAGYQWFLVSVNAEDGSGGFLDLETDTNPIDRSVAHLLFH 552
           VR+A  IDQDANR                       SGGFLDLETDTNPIDRSVAHLLFH
Sbjct: 481 VRVASSIDQDANR-----------------------SGGFLDLETDTNPIDRSVAHLLFH 540

Query: 553 RPSDPSVVPTGGNALPLKSPKLVPAEKQNFQDETGGATAYAADQKPLSNGK 599
           RPSDPS++P GGN L LKS KLVPAEKQNFQDETGGA A  ADQK LSNGK
Sbjct: 541 RPSDPSLMPAGGNTLSLKSHKLVPAEKQNFQDETGGAAA-CADQKLLSNGK 563

BLAST of CmaCh04G003920 vs. NCBI nr
Match: gi|659111228|ref|XP_008455643.1| (PREDICTED: dentin sialophosphoprotein isoform X1 [Cucumis melo])

HSP 1 Score: 888.3 bits (2294), Expect = 7.7e-255
Identity = 463/591 (78.34%), Positives = 499/591 (84.43%), Query Frame = 1

Query: 13  MSDLCMYKLKDNIWKDFTGNDDYIVPHLGDELWDLFDERSDSLENSNPKISFTDDSYNTT 72
           MSDLCMYKLKDNIWK FT NDDYIVPHLGDELWDLF+ +SD+LENSN ++ FT+D+Y+ T
Sbjct: 1   MSDLCMYKLKDNIWKGFTENDDYIVPHLGDELWDLFEVQSDNLENSNCRVGFTNDAYSAT 60

Query: 73  KFSFLGKEKVITQTSIMKNTVLEKDSWPHTPDGVPTS-NSDSFKDVKMEPNGLSMPNRCF 132
            FSFLGKEKV TQTSIMK+TVLEKDSW H PDGVP+S NSDSFKD KME + LSM N CF
Sbjct: 61  NFSFLGKEKVKTQTSIMKSTVLEKDSWSHAPDGVPSSLNSDSFKDAKMESSSLSMSNHCF 120

Query: 133 KTDVGTDFDYCTDDHIVTDNSAADENDMYQYSVSHMSQTGNDISFLDDDCENKENNDLLF 192
           KTDVGTD DYCTDDHIVTDNSAADENDMYQYSVSHMSQT NDISFLDDD ENKENNDLL+
Sbjct: 121 KTDVGTDLDYCTDDHIVTDNSAADENDMYQYSVSHMSQTDNDISFLDDDRENKENNDLLY 180

Query: 193 YGWQDMESFEDVDRMFRNCDSTFGLGNLSNEDDLCWFSPSHGSEKLEEKPTKSNFNFSCC 252
           YGWQD+ SFEDVDRMFRNCDSTFGLGNLSNEDDL WFSPSHG+EKLE+ P+KSNF FSCC
Sbjct: 181 YGWQDIGSFEDVDRMFRNCDSTFGLGNLSNEDDLRWFSPSHGTEKLED-PSKSNFKFSCC 240

Query: 253 EGSTINDALEFNEGSNLLNSDPSSDGLNKNNILTGRKVNDGI----DSAAVSHLLASDIP 312
           EGSTINDA EFNE SN +NS+PS +GLN+NNIL G K+NDGI    DSAA+SHL A+D+ 
Sbjct: 241 EGSTINDATEFNEESNPVNSEPSPEGLNRNNILNGCKMNDGITDISDSAAMSHLSAADMS 300

Query: 313 DRKRNSRGDLIPKQQESSFASNQLQSVPSSHNPSFDAPTIAANENREKLYHQDLPASFNK 372
           DRK NS GDLIPK+QESS+ASNQL S   SH PSFD PTIAANENREK+YHQ+LPASFNK
Sbjct: 301 DRKGNSSGDLIPKKQESSYASNQLHS---SHYPSFDTPTIAANENREKVYHQNLPASFNK 360

Query: 373 NITLMSMPNSETFSTSFPVRKHAPRSESEIDDGHSETGVVSRESRAELDSSNAPDISCQS 432
           N T MS P+ ETF+TSFPVRK APRSESEIDDGHSE+GVVSR SR ELDSSNA D  C+S
Sbjct: 361 NFTFMSAPSPETFNTSFPVRKQAPRSESEIDDGHSESGVVSRGSRVELDSSNAQDKPCRS 420

Query: 433 TTRDGISLEATSFCQLQQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCANINENIREDKL 492
           T  DGISLEATSF QLQQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCAN+NEN  EDK 
Sbjct: 421 TMLDGISLEATSFRQLQQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCANVNENTGEDKF 480

Query: 493 VRIAPLIDQDANRNANWPVWLAGYQWFLVSVNAEDGSGGFLDLETDTNPIDRSVAHLLFH 552
           VR+A  IDQDANR                       SGGFLDLETDTNPIDRSVAHLLFH
Sbjct: 481 VRVASTIDQDANR-----------------------SGGFLDLETDTNPIDRSVAHLLFH 540

Query: 553 RPSDPSVVPTGGNALPLKSPKLVPAEKQNFQDETGGATAYAADQKPLSNGK 599
           RPSDPS++P GGN L LKS KLVPAEKQNFQDETGGA A  ADQK LSNGK
Sbjct: 541 RPSDPSLMPAGGNTLSLKSHKLVPAEKQNFQDETGGAAA-CADQKLLSNGK 563

BLAST of CmaCh04G003920 vs. NCBI nr
Match: gi|778691932|ref|XP_011653378.1| (PREDICTED: uncharacterized protein LOC101215423 isoform X2 [Cucumis sativus])

HSP 1 Score: 766.9 bits (1979), Expect = 2.6e-218
Identity = 406/515 (78.83%), Positives = 431/515 (83.69%), Query Frame = 1

Query: 89  MKNTVLEKDSWPHTPDGVPTS-NSDSFKDVKMEPNGLSMPNRCFKTDVGTDFDYCTDDHI 148
           MK+TVLEKDSW HTPDGVP+S NSDSFKD KME + LSM N CFKTDVGTD DYCTDDHI
Sbjct: 1   MKSTVLEKDSWSHTPDGVPSSLNSDSFKDAKMESSSLSMSNHCFKTDVGTDLDYCTDDHI 60

Query: 149 VTDNSAADENDMYQYSVSHMSQTGNDISFLDDDCENKENNDLLFYGWQDMESFEDVDRMF 208
           VTDNSAADENDMYQYSVSHMSQT NDISFLDDD ENKENNDLL+YGWQD+ SFEDVDRMF
Sbjct: 61  VTDNSAADENDMYQYSVSHMSQTDNDISFLDDDRENKENNDLLYYGWQDIGSFEDVDRMF 120

Query: 209 RNCDSTFGLGNLSNEDDLCWFSPSHGSEKLEEKPTKSNFNFSCCEGSTINDALEFNEGSN 268
           RNCDSTFGLGNLSNEDDL WFSPSHG+EKLE+ P+K NF FSCCEGSTINDA EFNE SN
Sbjct: 121 RNCDSTFGLGNLSNEDDLRWFSPSHGTEKLED-PSKPNFKFSCCEGSTINDATEFNEESN 180

Query: 269 LLNSDPSSDGLNKNNILTGRKVNDGI----DSAAVSHLLASDIPDRKRNSRGDLIPKQQE 328
            +NS+ S DGLN+NNIL G K+NDGI    DSAA+SHL A+D+ DRK NS GDLIPK+QE
Sbjct: 181 PVNSEASPDGLNRNNILNGCKMNDGITDIGDSAAISHLSAADMSDRKGNSSGDLIPKKQE 240

Query: 329 SSFASNQLQSVPSSHNPSFDAPTIAANENREKLYHQDLPASFNKNITLMSMPNSETFSTS 388
           SS+ASNQL S   SH PSFDAPTI ANENREKLYHQDLPASFNKN T MS P+SETF+TS
Sbjct: 241 SSYASNQLHS---SHYPSFDAPTIGANENREKLYHQDLPASFNKNFTFMSAPSSETFNTS 300

Query: 389 FPVRKHAPRSESEIDDGHSETGVVSRESRAELDSSNAPDISCQSTTRDGISLEATSFCQL 448
           FPVRK APRSESEIDDGHSE+GVVSR SR ELDSSNA D  C+ST  DGISLEATSF QL
Sbjct: 301 FPVRKQAPRSESEIDDGHSESGVVSRGSRVELDSSNAQDKPCRSTMLDGISLEATSFRQL 360

Query: 449 QQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCANINENIREDKLVRIAPLIDQDANRNAN 508
           QQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCAN+NEN  EDK VR+A  IDQDANR   
Sbjct: 361 QQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCANLNENTGEDKFVRVASSIDQDANR--- 420

Query: 509 WPVWLAGYQWFLVSVNAEDGSGGFLDLETDTNPIDRSVAHLLFHRPSDPSVVPTGGNALP 568
                               SGGFLDLETDTNPIDRSVAHLLFHRPSDPS++P GGN L 
Sbjct: 421 --------------------SGGFLDLETDTNPIDRSVAHLLFHRPSDPSLMPAGGNTLS 480

Query: 569 LKSPKLVPAEKQNFQDETGGATAYAADQKPLSNGK 599
           LKS KLVPAEKQNFQDETGGA A  ADQK LSNGK
Sbjct: 481 LKSHKLVPAEKQNFQDETGGAAA-CADQKLLSNGK 487

BLAST of CmaCh04G003920 vs. NCBI nr
Match: gi|659111232|ref|XP_008455645.1| (PREDICTED: dentin sialophosphoprotein isoform X2 [Cucumis melo])

HSP 1 Score: 763.5 bits (1970), Expect = 2.9e-217
Identity = 403/515 (78.25%), Positives = 431/515 (83.69%), Query Frame = 1

Query: 89  MKNTVLEKDSWPHTPDGVPTS-NSDSFKDVKMEPNGLSMPNRCFKTDVGTDFDYCTDDHI 148
           MK+TVLEKDSW H PDGVP+S NSDSFKD KME + LSM N CFKTDVGTD DYCTDDHI
Sbjct: 1   MKSTVLEKDSWSHAPDGVPSSLNSDSFKDAKMESSSLSMSNHCFKTDVGTDLDYCTDDHI 60

Query: 149 VTDNSAADENDMYQYSVSHMSQTGNDISFLDDDCENKENNDLLFYGWQDMESFEDVDRMF 208
           VTDNSAADENDMYQYSVSHMSQT NDISFLDDD ENKENNDLL+YGWQD+ SFEDVDRMF
Sbjct: 61  VTDNSAADENDMYQYSVSHMSQTDNDISFLDDDRENKENNDLLYYGWQDIGSFEDVDRMF 120

Query: 209 RNCDSTFGLGNLSNEDDLCWFSPSHGSEKLEEKPTKSNFNFSCCEGSTINDALEFNEGSN 268
           RNCDSTFGLGNLSNEDDL WFSPSHG+EKLE+ P+KSNF FSCCEGSTINDA EFNE SN
Sbjct: 121 RNCDSTFGLGNLSNEDDLRWFSPSHGTEKLED-PSKSNFKFSCCEGSTINDATEFNEESN 180

Query: 269 LLNSDPSSDGLNKNNILTGRKVNDGI----DSAAVSHLLASDIPDRKRNSRGDLIPKQQE 328
            +NS+PS +GLN+NNIL G K+NDGI    DSAA+SHL A+D+ DRK NS GDLIPK+QE
Sbjct: 181 PVNSEPSPEGLNRNNILNGCKMNDGITDISDSAAMSHLSAADMSDRKGNSSGDLIPKKQE 240

Query: 329 SSFASNQLQSVPSSHNPSFDAPTIAANENREKLYHQDLPASFNKNITLMSMPNSETFSTS 388
           SS+ASNQL S   SH PSFD PTIAANENREK+YHQ+LPASFNKN T MS P+ ETF+TS
Sbjct: 241 SSYASNQLHS---SHYPSFDTPTIAANENREKVYHQNLPASFNKNFTFMSAPSPETFNTS 300

Query: 389 FPVRKHAPRSESEIDDGHSETGVVSRESRAELDSSNAPDISCQSTTRDGISLEATSFCQL 448
           FPVRK APRSESEIDDGHSE+GVVSR SR ELDSSNA D  C+ST  DGISLEATSF QL
Sbjct: 301 FPVRKQAPRSESEIDDGHSESGVVSRGSRVELDSSNAQDKPCRSTMLDGISLEATSFRQL 360

Query: 449 QQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCANINENIREDKLVRIAPLIDQDANRNAN 508
           QQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCAN+NEN  EDK VR+A  IDQDANR   
Sbjct: 361 QQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCANVNENTGEDKFVRVASTIDQDANR--- 420

Query: 509 WPVWLAGYQWFLVSVNAEDGSGGFLDLETDTNPIDRSVAHLLFHRPSDPSVVPTGGNALP 568
                               SGGFLDLETDTNPIDRSVAHLLFHRPSDPS++P GGN L 
Sbjct: 421 --------------------SGGFLDLETDTNPIDRSVAHLLFHRPSDPSLMPAGGNTLS 480

Query: 569 LKSPKLVPAEKQNFQDETGGATAYAADQKPLSNGK 599
           LKS KLVPAEKQNFQDETGGA A  ADQK LSNGK
Sbjct: 481 LKSHKLVPAEKQNFQDETGGAAA-CADQKLLSNGK 487

BLAST of CmaCh04G003920 vs. NCBI nr
Match: gi|703090769|ref|XP_010094170.1| (hypothetical protein L484_017214 [Morus notabilis])

HSP 1 Score: 264.2 bits (674), Expect = 5.4e-67
Identity = 194/536 (36.19%), Positives = 279/536 (52.05%), Query Frame = 1

Query: 21  LKDNIWKDFTGNDDYIVPHLGDELWDLFDERSDSLENSNPKISFTDDSYNTTKFSFLGKE 80
           L+DN+W +F  +DD+IVPH G++  D      DS +    ++     + +T+   + G +
Sbjct: 90  LEDNVWDEFGDSDDHIVPHSGNKHADKSLAEGDSRKKPQSEVIGVASNGDTSTTHYTGGK 149

Query: 81  KVITQTSIM-KNTVLEKDSWPHTPDGV-PTSNSDSFKDV-KMEPNGLSMPNRCFKTD--V 140
              +   +   NT+LEK SW  T DGV P+ ++DS K+V  +  +G  + N  FK+D   
Sbjct: 150 GNRSIPFVTDNNTMLEKGSWSDTLDGVFPSCDTDSIKEVTSLASDGTRISNNSFKSDNVE 209

Query: 141 GTDFDYCTDDHIVTDNSAADENDMYQYSVSHMSQTGNDISFLDDDCENKENNDLLFYGWQ 200
               ++C DD I+ D   A +N +Y+Y +S + QTG ++SF D+D E++EN DLL+YGW 
Sbjct: 210 SGGSEFCVDDSIMGDGCTAVDNSLYRYPLSRIPQTGYELSFFDNDREDRENGDLLYYGWP 269

Query: 201 DMESFEDVDRMFRNCDSTFGLGNLSNEDDLCWFSPSHGSE-------------------- 260
           D+ +FEDVDRMFR+CDSTFG+ +L+N+++L WFS S+ +E                    
Sbjct: 270 DIGNFEDVDRMFRSCDSTFGMESLNNDEELSWFSSSNATEGSEDALKSGLKFAGPNAGSS 329

Query: 261 --------------KLEEK----PTKSNFNFSCCEGSTINDALEFNEGS----------- 320
                         KLE +    P +   + +  EG T +  LE N GS           
Sbjct: 330 KGVSEHYDAFKPDSKLESRDILMPKEQPKHDAQLEGETTDQYLE-NGGSFHHYDYLKQLT 389

Query: 321 -------NLLNSDPSSDGLNKNNILTGRKVND----GIDSAAVSHLLASD----IPDRKR 380
                  +L +   S+ G+ ++  + G          +    V H   SD     P    
Sbjct: 390 DMKQPHGDLSSQFYSAPGIEQHKQIAGPNSMSYNQRQLPYMHVDHTRQSDQISVCPTPSV 449

Query: 381 NSRGDLIPKQ-QESSFASNQLQSVPSSHNPSFDAPTIAANENREKLYH-QDLPASFNKNI 440
            S  D  P   +ESS+ SNQLQ + +   P  + P    +EN EK +  Q   +SF  N 
Sbjct: 450 KSEKDGKPSSLKESSYGSNQLQCMENFRGPLTETPIATISENTEKPHSCQGFQSSFIMNF 509

Query: 441 TLMSMPNSETFSTSFPVRKHAPRSESEIDDGHSETGVVSRESRAELDSSNAPDISCQSTT 486
              +  N   F      +K   +SE++I  GHS+    S E  AE+DSSN  + SC S+ 
Sbjct: 510 ENAATTNPVVFGNKVSKQKEEQKSENQIG-GHSDIEGASNEIPAEVDSSNLQESSCMSSV 569
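
The percentages in each HSP summary line follow directly from the match counts, for example 463/591 = 78.34% identity for the Cucumis melo isoform X1 hit. The following is a minimal Python sketch, not part of the database's own tooling, that parses such a line and recomputes both percentages. The names `parse_hsp_stats` and `percent` are hypothetical helpers introduced for illustration, and the pattern assumes the line layout shown on this page (it accepts both "Positives" and the "Postives" spelling).

```python
import re

# Pattern for the HSP summary line as printed on this page (an assumption
# about the layout, not an official BLAST grammar). "Posi?tives" accepts
# both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return match counts, alignment lengths and reported percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def percent(matches, length):
    """Recompute a percentage from raw counts, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


if __name__ == "__main__":
    line = ("Identity = 463/591 (78.34%), Postives = 499/591 (84.43%), "
            "Query Frame = 1")
    stats = parse_hsp_stats(line)
    for key, (matches, length, reported) in stats.items():
        # Compare the recomputed value with the one reported on the page.
        recomputed = percent(matches, length)
        print(key, matches, length, reported, recomputed)
```

Run against the other summary lines in this section, the same check can flag any record whose reported percentage disagrees with its counts.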

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
LNK1_ARATH        9.7e-40   28.79     Protein LNK1 OS=Arabidopsis thaliana GN=LNK1 PE=1 SV=1
LNK2_ARATH        8.0e-10   35.37     Protein LNK2 OS=Arabidopsis thaliana GN=LNK2 PE=1 SV=1

Match Name        E-value   Identity  Description
A0A0A0KVV5_CUCSA  1.8e-218  78.83     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G107450 PE=4 SV=1
W9QVG7_9ROSA      3.8e-67   36.19     Uncharacterized protein OS=Morus notabilis GN=L484_017214 PE=4 SV=1
A0A061E394_THECC  1.8e-53   39.75     Dentin sialophosphoprotein-related, putative isoform 3 OS=Theobroma cacao GN=TCM...
A0A061E1A2_THECC  1.8e-53   39.75     Dentin sialophosphoprotein-related, putative isoform 7 OS=Theobroma cacao GN=TCM...
A0A061E968_THECC  1.8e-53   39.75     Dentin sialophosphoprotein-related, putative isoform 6 OS=Theobroma cacao GN=TCM...

Match Name        E-value   Identity  Description
AT5G64170.2       5.4e-41   28.79     dentin sialophosphoprotein-related
AT3G54500.3       4.5e-11   35.37     FUNCTIONS IN: molecular_function unknown

Match Name                        E-value   Identity  Description
gi|449438785|ref|XP_004137168.1|  4.8e-257  79.02     PREDICTED: uncharacterized protein LOC101215423 isoform X1 [Cucumis sativus]
gi|659111228|ref|XP_008455643.1|  7.7e-255  78.34     PREDICTED: dentin sialophosphoprotein isoform X1 [Cucumis melo]
gi|778691932|ref|XP_011653378.1|  2.6e-218  78.83     PREDICTED: uncharacterized protein LOC101215423 isoform X2 [Cucumis sativus]
gi|659111232|ref|XP_008455645.1|  2.9e-217  78.25     PREDICTED: dentin sialophosphoprotein isoform X2 [Cucumis melo]
gi|703090769|ref|XP_010094170.1|  5.4e-67   36.19     hypothetical protein L484_017214 [Morus notabilis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh04G003920.1  CmaCh04G003920.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term  IPR Description   Source   Source Term    Source Description                       Alignment
None      No IPR available  PANTHER  PTHR33334      FAMILY NOT NAMED                         coord: 526..566, score: 6.8E-105; coord: 19..502, score: 6.8E
None      No IPR available  PANTHER  PTHR33334:SF5  DENTIN SIALOPHOSPHOPROTEIN-LIKE PROTEIN  coord: 526..566, score: 6.8E-105; coord: 19..502, score: 6.8E

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                 Block
CmaCh04G003920  CmaCh16G002530  Cucurbita maxima (Rimu)  cmacmaB351