Csor.00g140520 (gene) Silver-seed gourd (wild; sororia) v1

Overview
Name: Csor.00g140520
Type: gene
Organism: Cucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
Description: Glycosyltransferase
Location: Csor_Chr01: 13862456 .. 13867736 (-)
RNA-Seq Expression: Csor.00g140520
Synteny: Csor.00g140520
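
The Location field above can be read mechanically. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse 'Chrom: start .. end (strand)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: 1-based inclusive coordinates, so span = end - start + 1.
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Csor_Chr01: 13862456 .. 13867736 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption, the `length` value is the genomic span of the gene including introns; the minus strand means the displayed sequence is the reverse complement of the reference strand.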
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCATCCTATATGTCAAAACTACGAGCACCGGTTCACTTTCTAGGGTGGGAAGGTGGATTTAAAAGTTCACTGGAGTTTCTTTTTGGGCAACAGAAACTCTTGTGTGGCAGTTCGAGCCTGTTGCACTCAGTACCTTATTCCTCACTCACAGAATTGCATGCACTCTTAAGACCTGGCACTATTTCTGGTGCTAGTTCAGAGCTAGTTAATAGTAGGAGGAATATCTCTGTTCTTGGAGCAATTTCTCGCACATTTTCTATTCCTTCTGTGTCAGGCCCTGCGTTACAGACCTGTGGGTATCACATTGATTGTGCCATTGCTGAATCCAATCAATATTCAACTCGCAGCAAGTTTCAAGACAAACCAATGGCTGCTTGTGGTTCTAGAGCTGGACTTGGTGAATGTTCTCTCGAGAATCTAAGTTTCAGGATTGCACGCACCTCTCCTCCAGCGATCAGTCCTAGTATTTGTTTCAACAAAAGAAGCGTTGATTGCTGCCCAAAAGCCAGCATGAGTTTGAAAAATCAGGAGCAGCCTAGCAATAATGTGATATATGGATACTTTACATACAATGTTGCAAAAAGGTTTTGCAGCAGTTACCTACATGCTGGGTTGGGAGCAAGGGATCTTCATAGTTCGTCCACTTCTTCCCTAGCTGCTGGTTCTGCCCCCAATTTATCATTTGATAATTCTGCACGGGAGGAACAACTTGCTAACTCTACCGATTCATCCGCACAGTATGTTCTCCACCTGATTCCTTGGTTTAAAATGAAAATATTATAGTCTATATGCTTAGTTTAAATTACATCACAACTTCAGTTTAAACTTACCAGATTGAATGTTTTGAATATTATTGTCAATTGAACTTGTTCTTTGGTCGTCCCATTCTTATAGATCCAATTGTTGAGGGTGAAATGTGAATCATAGCCTGTTGTCCGTGCTTAATGAACATTTGTGGTTTTTTCTTTTCCTGTCAAAATGAATCTGTTTTTTTTTTTTTTTTTTTTTTCAATATTCCAATATTGTACCCTATGGTGCTTCGTTGTTTTTGGATCTCATTATGGAACTCTTCAAGATTGCCGGCACAAAGTATGCTGTGATACCGCTCTTCACACTTTTAAGTGACAGTATCAGTTTGAATGCAAGTGTTCTTCTTTGCTCTAAATTTCCTACTATATTCTTCACAGAAAGATTCCGAAAGGCAAATCAATGAAACTGGTTTCTGGGTCTTGCTATCTGCCCCACCCTGATAAAGAAGATACTGGTGGAGAGGACGCTCACTTTATTTGTGTGGATGAACAAGCTATAGGGGTGGCTGATGGTGTGGGTGGTTGGGCAGATCTTGGTGTTGATGCTGGACAGTATTCCCGAGAACTCATGTCTAATTCAGTTAATGCAGTTCAAGAGGAGCCCAAGGGCTCAATTGATCCAGCTAGAGTCTTGGAGAAGGCTCACTCGAAAACAAAAGCCAAAGGCTCCTCCACTGCTTGTATCATAGCGCTTACAGAACAAGTATGTTGTTTCACTTGAGGCCATCATTCTTTGTAAGTTCAAATATAGGCAAGTTGTTTTAGGTTTTGAAAGTTGGGAGTCTGATCTTATTTTATCAAGACCAAACTCTCTGTTTTCAATTTTCAGATCCTTGATTTGTTGTTTCATTATAATTTTTCTTTCTTTGACCAAAATAATTTCTGGAGGTTTCTATTTAAGTATAGCCTAAGTCGTCCTAGTTTGATCTTTTGGATGGAATATATAACAAGAGAAGAACTCTTTGATGTCTTCTAGTGCTTGCTACAACATTTAAAATTTTCCCATAGTTTAGTGAGGTGGAGCTTGATACTTCCCTATATAATATAATTATATCTATCAATAATTTTAGCTGGTTTCTGTGCAAGTACTTGCTCATATCTCGTGTAGTTTTTATTTTTATTTTTACTTTTTATTGCAGGGGCTCCATGCAATCAATTTAGGAGACAGTGGATTTATGGTGGTTAGAG
ACGGATGCACAATATTCAGATCTCCTGTGCAGCAGCATGATTTTAACTTCACCTTTCAATTGGAGAGTGGAAACAATGGCGATTTACCTAGCTCTGGACAGGTCAGTATTCTCACCAGCTTCTCTGTGTCTGTAGTTGTCATTCCCGTTCCTGGTACTTTTGTTTGTACCTACAGTAGTTGCTTCGTGATTTTTCCGAACGAGAGAGAGTAGAGTTAGCAACCTCCCAAAATTGAGAGTATTGGAATTTCTTGGCAATAACCTTAACTTGATGATGATGGGATTATCAAAACTGATTCAGGTCTTCTCCGTCCCTGTTGCTCCTGGAGATGTCATAATTGCTGGCACTGATGGACTCTTTGATAACTTGTACAACAACGAGATCACCGCAGTGGTGGTTCATGCCATGAGAGCTGGCTTAGGCTCTCAGGTGACTGCCCAGAAGATAGCTGCTCTTGCACGCCAGCGAGCTCAAGATAAAGACCGACAAACACCTTTCTCCACTGCTGCTCAAGATGCTGGGTTTCGGTACTACGGAGGCAAGCTTGATTTTTTTTTTTTTTTTTTTTTGGTTTTTCCATCCCTGTTTTCTTAGCGAGAAATGGTGTTCTTGTTTGGAATATGATTTTTGGTCAAATGGGTAATTGTGAACTAGCGGATCCAACGGGTGGCTTCGAGCGTGTGGATAATTCCCACACGCCATTGCTAAAATGAAAAAAAAAAGTATTTAATTTTCTGGTGATTTAATAAAATATTGTTCAAAATTTAATTGTGATGTATAGAAAGATATCTTCACGTCTGAATTAGGGTCGGAAGTGACACTTTTTCATACTTTGAATTTAAACACTTTTATTATTACTTGAATATGTTAAATTGAAATAAATAAATAAATAAATACATTTGGAAAGTTCTAAAAGTTGACTTTCTTTCCATTAACCATTTTTTTAATATATAAAAAAACCTCAAATTCCAACGTTGACTTTAAGAAGTAGGGATAGAGAGGGGTTAATCTTTGATAGGTAGCTTTCAAGGAGGGGCATAAATGGAAAAGAAAAAAGAGAGCACACGTTTGTCCTCAACCAGGCCCAGGCCCAGGCCCATCTGGTTCCCATTTTCTTACCCTTTCTGCACCAAACCAAGACAAAATAGAAGTCATAGTCATACGCCTCTCCTCTCTTCTTTCTCTCTCATTCCAGTTCCGACGACCAAAATTTCTGGTGCTTTAAGGTAACAGAAAACAGTCCCATTTTCTTTATTTATGGAATTGTAACGATCCCTTCCTTCTACTTCAAGGGAATGCCAAATCTCTAATTCCTAAAACATTCTCTGGTTGTTTCCCCTTGAAGTCTATACTCTCCTGCTTCACTCACTCCCCAACTGCGCCTGATTTCTCCATGGCCGGCCGTAAAGACAAAGCTCAGTCTGCCCGCGTCTCTCGAATCGTCATCGCCATCGCAATCGGAGTTCTTGTTGGCTGTCTTTTTGCTTTCTTGTATCCTCATGGACTTTTCGCCTCTGATCTGCCTGTCCAAAACCGTCGCCTCGGCAAATCCGAGTTTCTGGTTTGTTGCTTGTTGCCCTAATTCCTGTTTCTGAATTTCTATACCGATTTCAATTGTCATTTATTTTTTATTTTAGCGAAATGAACCCGAATTGACATGTATGTTTTGAGGTTTCAAGTGTTTCGGATTGTACTTTATTGTTCTATAATTTATATATCTTTGAGAGGAGTTCGATTTCTTTTTTTTCCTTTTACTGTTTGTCTTTTCTTCTTGAGCGATGAAATTCATAATCAATTCTTTGATTAATTATTTTGGCATTCTTTTAAAGATTTGTGGGTAGTGTGAGCACTGTGGTAGCTAGATCAATTCTTCTGTTTGTGTTGTTTTCATTTTAGTTTGACTTGTATGTTCATGATTTGGTTTGTGCATTTCTTTTGAATCTATACTCTTTAGAAGATGTTAGTTTCAGTTTAATGACCTTAAAAAGTTTTAGGTTTCA
TTTACTCGCTAAATTAAATCAAAACATAAAATATCACATTATCTAACGAACTTATCCAGTTGATATCTAGCTTAATTGAGTTTTGATGGAATCGTTGGACTTTTGGAGATTCTGGAGGAAAATTTTCGTATATCTTCTATTTCCTAACTTTTTTTTCTTACACAGGTTCAGTCTTCTTCTCCTTGCGAATCGTCGGAGCGGTTCAAGATGCTTAAAGGCCACGTCGTTTCAATATTAGAGAAGAACTCCCAGTTGGAGAAGCGTATAAAGGATCTAACAGGGGAACTGAGGATTGTGGAACAAACAAAAGATCATGCTCAGAAGCAATATTTGGCGCTCAGTGAAAATCACAAGGCTGGTCCATTTGGTACTGTCAAAGGTCTTAGAACCAACCCTACCGTAATCCCTGATGAATCTGTAAACCCTCGATTGGCGAAGCTCCTGGAGAAAGTTGCTATCCAGAGGGAGCTGATTGTGACACTCGCGAATTCTAATGTACAACCCATGCTGGAGGTTTGGTTTACGAGTATCCAGAAGGTCGGTATACCGAATTATTTAGTTGTGGCTCTGGATGACCAGACGGAAGAATTCTGCAAATCCCATAATGTTCCTGTCTACACGAGAGATCCAGACAAGAGTGTTGATTTAATCGGAAAGGAAGGAGGCAACCACCAAGTCTCGGCATTGAAGTTTCGGATTTTGAGGGAGTTCTTGCAACTTGGATACAGTGTTCTTCTCTCAGACGTCGATATAGTCTACTTACAGAATCCTTTCGATCATCTTTACCGGGATTCAGATGTGGAGTCGATGAGTGATGGTCACAGCAATATGACAGCTTATGGATACAACGATGTATTTGATGAACCTGCCATGGGCTGGGCTAGATATGCACACACTATGAGAATATGGGTTTACAACTCTGGTTTCTTCTACATTAGGCCTACACTGCCTTCGTTTGAGCTTTTGGATCGTGTCGCGACTCGGCTTTCTCAAGAAAAAGCATGGGACCAAGCTGTTTTTAACGAGGAACTCTTTTATCCTTCTCGTCCTGGACGCGATGGACTTCATGCCTCCAAGAGAACCATGGATATGTATCTTTTCATGAACAGTAAGGTACTCTTCAAGACTGTTCGTAAGGACCCGAAACTCAGACAGTTGAAACCCGTCATTGTTCATATTAATTACCATCCCGACAAGTATCCAAGAATGAAAGCAGTCGTCGAATTCTACGTGAACGGTCAGCAAAATGCTCTGGATTCGTTCCCAGATGGTTCTGAATGA

mRNA sequence

ATGCCATCCTATATGTCAAAACTACGAGCACCGGTTCACTTTCTAGGGTGGGAAGGTGGATTTAAAAGTTCACTGGAGTTTCTTTTTGGGCAACAGAAACTCTTGTGTGGCAGTTCGAGCCTGTTGCACTCAGTACCTTATTCCTCACTCACAGAATTGCATGCACTCTTAAGACCTGGCACTATTTCTGGTGCTAGTTCAGAGCTAGTTAATAGTAGGAGGAATATCTCTGTTCTTGGAGCAATTTCTCGCACATTTTCTATTCCTTCTGTGTCAGGCCCTGCGTTACAGACCTGTGGGTATCACATTGATTGTGCCATTGCTGAATCCAATCAATATTCAACTCGCAGCAAGTTTCAAGACAAACCAATGGCTGCTTGTGGTTCTAGAGCTGGACTTGGTGAATGTTCTCTCGAGAATCTAAGTTTCAGGATTGCACGCACCTCTCCTCCAGCGATCAGTCCTAGTATTTGTTTCAACAAAAGAAGCGTTGATTGCTGCCCAAAAGCCAGCATGAGTTTGAAAAATCAGGAGCAGCCTAGCAATAATGTGATATATGGATACTTTACATACAATGTTGCAAAAAGGTTTTGCAGCAGTTACCTACATGCTGGGTTGGGAGCAAGGGATCTTCATAGTTCGTCCACTTCTTCCCTAGCTGCTGGTTCTGCCCCCAATTTATCATTTGATAATTCTGCACGGGAGGAACAACTTGCTAACTCTACCGATTCATCCGCACAAAAGATTCCGAAAGGCAAATCAATGAAACTGGTTTCTGGGTCTTGCTATCTGCCCCACCCTGATAAAGAAGATACTGGTGGAGAGGACGCTCACTTTATTTGTGTGGATGAACAAGCTATAGGGGTGGCTGATGGTGTGGGTGGTTGGGCAGATCTTGGTGTTGATGCTGGACAGTATTCCCGAGAACTCATGTCTAATTCAGTTAATGCAGTTCAAGAGGAGCCCAAGGGCTCAATTGATCCAGCTAGAGTCTTGGAGAAGGCTCACTCGAAAACAAAAGCCAAAGGCTCCTCCACTGCTTGTATCATAGCGCTTACAGAACAAGGGCTCCATGCAATCAATTTAGGAGACAGTGGATTTATGGTGGTTAGAGACGGATGCACAATATTCAGATCTCCTGTGCAGCAGCATGATTTTAACTTCACCTTTCAATTGGAGAGTGGAAACAATGGCGATTTACCTAGCTCTGGACAGGTCTTCTCCGTCCCTGTTGCTCCTGGAGATGTCATAATTGCTGGCACTGATGGACTCTTTGATAACTTGTACAACAACGAGATCACCGCAGTGGTGGTTCATGCCATGAGAGCTGGCTTAGGCTCTCAGGTGACTGCCCAGAAGATAGCTGCTCTTGCACGCCAGCGAGCTCAAGATAAAGACCGACAAACACCTTTCTCCACTGCTGCTCAAGATGCTGGGTTTCGTTCCGACGACCAAAATTTCTGGTGCTTTAAGTCTATACTCTCCTGCTTCACTCACTCCCCAACTGCGCCTGATTTCTCCATGGCCGGCCGTAAAGACAAAGCTCAGTCTGCCCGCGTCTCTCGAATCGTCATCGCCATCGCAATCGGAGTTCTTGTTGGCTGTCTTTTTGCTTTCTTGTATCCTCATGGACTTTTCGCCTCTGATCTGCCTGTCCAAAACCGTCGCCTCGGCAAATCCGAGTTTCTGGTTCAGTCTTCTTCTCCTTGCGAATCGTCGGAGCGGTTCAAGATGCTTAAAGGCCACGTCGTTTCAATATTAGAGAAGAACTCCCAGTTGGAGAAGCGTATAAAGGATCTAACAGGGGAACTGAGGATTGTGGAACAAACAAAAGATCATGCTCAGAAGCAATATTTGGCGCTCAGTGAAAATCACAAGGCTGGTCCATTTGGTACTGTCAAAGGTCTTAGAACCAACCCTACCGTAATCCCTGATGAATCTGTAAACCCTCGATTGGCGAAGCTCCTGGAGAAAGTTGCTATCCAGAGGGAGCTGATTGT
GACACTCGCGAATTCTAATGTACAACCCATGCTGGAGGTTTGGTTTACGAGTATCCAGAAGGTCGGTATACCGAATTATTTAGTTGTGGCTCTGGATGACCAGACGGAAGAATTCTGCAAATCCCATAATGTTCCTGTCTACACGAGAGATCCAGACAAGAGTGTTGATTTAATCGGAAAGGAAGGAGGCAACCACCAAGTCTCGGCATTGAAGTTTCGGATTTTGAGGGAGTTCTTGCAACTTGGATACAGTGTTCTTCTCTCAGACGTCGATATAGTCTACTTACAGAATCCTTTCGATCATCTTTACCGGGATTCAGATGTGGAGTCGATGAGTGATGGTCACAGCAATATGACAGCTTATGGATACAACGATGTATTTGATGAACCTGCCATGGGCTGGGCTAGATATGCACACACTATGAGAATATGGGTTTACAACTCTGGTTTCTTCTACATTAGGCCTACACTGCCTTCGTTTGAGCTTTTGGATCGTGTCGCGACTCGGCTTTCTCAAGAAAAAGCATGGGACCAAGCTGTTTTTAACGAGGAACTCTTTTATCCTTCTCGTCCTGGACGCGATGGACTTCATGCCTCCAAGAGAACCATGGATATGTATCTTTTCATGAACAGTAAGGTACTCTTCAAGACTGTTCGTAAGGACCCGAAACTCAGACAGTTGAAACCCGTCATTGTTCATATTAATTACCATCCCGACAAGTATCCAAGAATGAAAGCAGTCGTCGAATTCTACGTGAACGGTCAGCAAAATGCTCTGGATTCGTTCCCAGATGGTTCTGAATGA

Coding sequence (CDS)

ATGCCATCCTATATGTCAAAACTACGAGCACCGGTTCACTTTCTAGGGTGGGAAGGTGGATTTAAAAGTTCACTGGAGTTTCTTTTTGGGCAACAGAAACTCTTGTGTGGCAGTTCGAGCCTGTTGCACTCAGTACCTTATTCCTCACTCACAGAATTGCATGCACTCTTAAGACCTGGCACTATTTCTGGTGCTAGTTCAGAGCTAGTTAATAGTAGGAGGAATATCTCTGTTCTTGGAGCAATTTCTCGCACATTTTCTATTCCTTCTGTGTCAGGCCCTGCGTTACAGACCTGTGGGTATCACATTGATTGTGCCATTGCTGAATCCAATCAATATTCAACTCGCAGCAAGTTTCAAGACAAACCAATGGCTGCTTGTGGTTCTAGAGCTGGACTTGGTGAATGTTCTCTCGAGAATCTAAGTTTCAGGATTGCACGCACCTCTCCTCCAGCGATCAGTCCTAGTATTTGTTTCAACAAAAGAAGCGTTGATTGCTGCCCAAAAGCCAGCATGAGTTTGAAAAATCAGGAGCAGCCTAGCAATAATGTGATATATGGATACTTTACATACAATGTTGCAAAAAGGTTTTGCAGCAGTTACCTACATGCTGGGTTGGGAGCAAGGGATCTTCATAGTTCGTCCACTTCTTCCCTAGCTGCTGGTTCTGCCCCCAATTTATCATTTGATAATTCTGCACGGGAGGAACAACTTGCTAACTCTACCGATTCATCCGCACAAAAGATTCCGAAAGGCAAATCAATGAAACTGGTTTCTGGGTCTTGCTATCTGCCCCACCCTGATAAAGAAGATACTGGTGGAGAGGACGCTCACTTTATTTGTGTGGATGAACAAGCTATAGGGGTGGCTGATGGTGTGGGTGGTTGGGCAGATCTTGGTGTTGATGCTGGACAGTATTCCCGAGAACTCATGTCTAATTCAGTTAATGCAGTTCAAGAGGAGCCCAAGGGCTCAATTGATCCAGCTAGAGTCTTGGAGAAGGCTCACTCGAAAACAAAAGCCAAAGGCTCCTCCACTGCTTGTATCATAGCGCTTACAGAACAAGGGCTCCATGCAATCAATTTAGGAGACAGTGGATTTATGGTGGTTAGAGACGGATGCACAATATTCAGATCTCCTGTGCAGCAGCATGATTTTAACTTCACCTTTCAATTGGAGAGTGGAAACAATGGCGATTTACCTAGCTCTGGACAGGTCTTCTCCGTCCCTGTTGCTCCTGGAGATGTCATAATTGCTGGCACTGATGGACTCTTTGATAACTTGTACAACAACGAGATCACCGCAGTGGTGGTTCATGCCATGAGAGCTGGCTTAGGCTCTCAGGTGACTGCCCAGAAGATAGCTGCTCTTGCACGCCAGCGAGCTCAAGATAAAGACCGACAAACACCTTTCTCCACTGCTGCTCAAGATGCTGGGTTTCGTTCCGACGACCAAAATTTCTGGTGCTTTAAGTCTATACTCTCCTGCTTCACTCACTCCCCAACTGCGCCTGATTTCTCCATGGCCGGCCGTAAAGACAAAGCTCAGTCTGCCCGCGTCTCTCGAATCGTCATCGCCATCGCAATCGGAGTTCTTGTTGGCTGTCTTTTTGCTTTCTTGTATCCTCATGGACTTTTCGCCTCTGATCTGCCTGTCCAAAACCGTCGCCTCGGCAAATCCGAGTTTCTGGTTCAGTCTTCTTCTCCTTGCGAATCGTCGGAGCGGTTCAAGATGCTTAAAGGCCACGTCGTTTCAATATTAGAGAAGAACTCCCAGTTGGAGAAGCGTATAAAGGATCTAACAGGGGAACTGAGGATTGTGGAACAAACAAAAGATCATGCTCAGAAGCAATATTTGGCGCTCAGTGAAAATCACAAGGCTGGTCCATTTGGTACTGTCAAAGGTCTTAGAACCAACCCTACCGTAATCCCTGATGAATCTGTAAACCCTCGATTGGCGAAGCTCCTGGAGAAAGTTGCTATCCAGAGGGAGCTGATTGT
GACACTCGCGAATTCTAATGTACAACCCATGCTGGAGGTTTGGTTTACGAGTATCCAGAAGGTCGGTATACCGAATTATTTAGTTGTGGCTCTGGATGACCAGACGGAAGAATTCTGCAAATCCCATAATGTTCCTGTCTACACGAGAGATCCAGACAAGAGTGTTGATTTAATCGGAAAGGAAGGAGGCAACCACCAAGTCTCGGCATTGAAGTTTCGGATTTTGAGGGAGTTCTTGCAACTTGGATACAGTGTTCTTCTCTCAGACGTCGATATAGTCTACTTACAGAATCCTTTCGATCATCTTTACCGGGATTCAGATGTGGAGTCGATGAGTGATGGTCACAGCAATATGACAGCTTATGGATACAACGATGTATTTGATGAACCTGCCATGGGCTGGGCTAGATATGCACACACTATGAGAATATGGGTTTACAACTCTGGTTTCTTCTACATTAGGCCTACACTGCCTTCGTTTGAGCTTTTGGATCGTGTCGCGACTCGGCTTTCTCAAGAAAAAGCATGGGACCAAGCTGTTTTTAACGAGGAACTCTTTTATCCTTCTCGTCCTGGACGCGATGGACTTCATGCCTCCAAGAGAACCATGGATATGTATCTTTTCATGAACAGTAAGGTACTCTTCAAGACTGTTCGTAAGGACCCGAAACTCAGACAGTTGAAACCCGTCATTGTTCATATTAATTACCATCCCGACAAGTATCCAAGAATGAAAGCAGTCGTCGAATTCTACGTGAACGGTCAGCAAAATGCTCTGGATTCGTTCCCAGATGGTTCTGAATGA

Protein sequence

MPSYMSKLRAPVHFLGWEGGFKSSLEFLFGQQKLLCGSSSLLHSVPYSSLTELHALLRPGTISGASSELVNSRRNISVLGAISRTFSIPSVSGPALQTCGYHIDCAIAESNQYSTRSKFQDKPMAACGSRAGLGECSLENLSFRIARTSPPAISPSICFNKRSVDCCPKASMSLKNQEQPSNNVIYGYFTYNVAKRFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLSFDNSAREEQLANSTDSSAQKIPKGKSMKLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVADGVGGWADLGVDAGQYSRELMSNSVNAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIALTEQGLHAINLGDSGFMVVRDGCTIFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPVAPGDVIIAGTDGLFDNLYNNEITAVVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPFSTAAQDAGFRSDDQNFWCFKSILSCFTHSPTAPDFSMAGRKDKAQSARVSRIVIAIAIGVLVGCLFAFLYPHGLFASDLPVQNRRLGKSEFLVQSSSPCESSERFKMLKGHVVSILEKNSQLEKRIKDLTGELRIVEQTKDHAQKQYLALSENHKAGPFGTVKGLRTNPTVIPDESVNPRLAKLLEKVAIQRELIVTLANSNVQPMLEVWFTSIQKVGIPNYLVVALDDQTEEFCKSHNVPVYTRDPDKSVDLIGKEGGNHQVSALKFRILREFLQLGYSVLLSDVDIVYLQNPFDHLYRDSDVESMSDGHSNMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTLPSFELLDRVATRLSQEKAWDQAVFNEELFYPSRPGRDGLHASKRTMDMYLFMNSKVLFKTVRKDPKLRQLKPVIVHINYHPDKYPRMKAVVEFYVNGQQNALDSFPDGSE
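
The relationship between the coding sequence and the protein sequence can be checked with a short translation routine. This is a sketch under the assumption that the standard genetic code applies; the function name `translate` is hypothetical, and only the first and last few codons of the CDS shown above are used as input.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last four codons of the CDS listed above.
print(translate("ATGCCATCCTATATGTCAAAA"))
print(translate("GGTTCTGAATGA"))
```

Running the same function on the full CDS and comparing the result with the listed protein sequence is a quick way to confirm that the two entries agree.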
Homology
BLAST of Csor.00g140520 vs. ExPASy Swiss-Prot
Match: Q9C9Q5 (Arabinosyltransferase RRA2 OS=Arabidopsis thaliana OX=3702 GN=RRA2 PE=2 SV=1)

HSP 1 Score: 598.6 bits (1542), Expect = 1.2e-169
Identity = 292/429 (68.07%), Positives = 349/429 (81.35%), Query Frame = 0

Query: 508 MAGRKDKAQSARVSRIVIAIAIGVLVGCLFAFLYPHGLF--ASDLPVQNRRLGKSEFLVQ 567
           MAGR+D+ Q  R SRI IAI +G+L+GC+ + L+P+G F   S L     R+ KS     
Sbjct: 1   MAGRRDRIQQLRGSRIAIAIFVGILIGCVCSVLFPNGFFNSGSSLIANEERISKST-STD 60

Query: 568 SSSPCESSERFKMLKGHVVSILEKNSQLEKRIKDLTGELRIVEQTKDHAQKQYLALSENH 627
             + CESSER KMLK     I  KN++L K++++LT ++R+ EQ  ++A+KQ L L    
Sbjct: 61  GLASCESSERVKMLKSDFSIISVKNAELRKQVRELTEKVRLAEQETENARKQVLVLGSEI 120

Query: 628 KAGPFGTVKGLRTNPTVIPDESVNPRLAKLLEKVAIQRELIVTLANSNVQPMLEVWFTSI 687
           KAGPFGTVK LRTNPTV+PDESVNPRLAKLLEKVA+ +E+IV LANSNV+PMLE+   S+
Sbjct: 121 KAGPFGTVKSLRTNPTVVPDESVNPRLAKLLEKVAVNKEIIVVLANSNVKPMLELQIASV 180

Query: 688 QKVGIPNYLVVALDDQTEEFCKSHNVPVYTRDPDKSVDLIGKEGGNHQVSALKFRILREF 747
           ++VGI NYL+VALDD  E FC+S  V  Y RDPDK+VD++GK GGNH VS LKFR+LREF
Sbjct: 181 KRVGIQNYLIVALDDSMESFCESKEVVFYKRDPDKAVDMVGKSGGNHAVSGLKFRVLREF 240

Query: 748 LQLGYSVLLSDVDIVYLQNPFDHLYRDSDVESMSDGHSNMTAYGYNDVFDEPAMGWARYA 807
           LQLGYSVLLSDVDIV+LQNPF HL+RDSDVESMSDGH N TAYG+NDVFDEP+MGWARYA
Sbjct: 241 LQLGYSVLLSDVDIVFLQNPFSHLHRDSDVESMSDGHDNNTAYGFNDVFDEPSMGWARYA 300

Query: 808 HTMRIWVYNSGFFYIRPTLPSFELLDRVATRLSQEKAWDQAVFNEELFYPSRPGRDGLHA 867
           HTMRIWV+NSGFFY+RPT+PS +LLDRVA  LS+ +AWDQAVFNE+LFYPS PG  GLHA
Sbjct: 301 HTMRIWVFNSGFFYLRPTIPSIDLLDRVADTLSKSEAWDQAVFNEQLFYPSHPGYTGLHA 360

Query: 868 SKRTMDMYLFMNSKVLFKTVRKDPKLRQLKPVIVHINYHPDKYPRMKAVVEFYVNGQQNA 927
           SKR MDMY FMNSKVLFKTVRK+ +L++LKPVIVH+NYHPDK  RM AVVEFYVNG+Q+A
Sbjct: 361 SKRVMDMYEFMNSKVLFKTVRKNQELKKLKPVIVHLNYHPDKLERMHAVVEFYVNGKQDA 420

Query: 928 LDSFPDGSE 935
           LDSFPDGS+
Sbjct: 421 LDSFPDGSD 428
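
The percentages in an HSP summary line are simply the matched positions divided by the alignment length. The sketch below recomputes them for the RRA2 hit above; the helper name `hsp_percentages` is hypothetical, and the regular expression is an assumption about the line layout shown on this page rather than a general BLAST parser.

```python
import re

def hsp_percentages(line):
    """Recompute percent identity and positives from an HSP summary line."""
    stats = {}
    # Matches e.g. 'Identity = 292/429'; tolerates the variant spelling 'Postives'.
    pattern = r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)"
    for label, matched, aligned in re.findall(pattern, line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = round(100 * int(matched) / int(aligned), 2)
    return stats

line = "Identity = 292/429 (68.07%), Positives = 349/429 (81.35%), Query Frame = 0"
print(hsp_percentages(line))
```

The denominator is the alignment length including gap columns, which is why it can exceed the length of either aligned segment.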

BLAST of Csor.00g140520 vs. ExPASy Swiss-Prot
Match: Q9LN62 (Arabinosyltransferase RRA3 OS=Arabidopsis thaliana OX=3702 GN=RRA3 PE=2 SV=1)

HSP 1 Score: 596.7 bits (1537), Expect = 4.6e-169
Identity = 295/429 (68.76%), Positives = 351/429 (81.82%), Query Frame = 0

Query: 508 MAGRKDKAQSARVSRIVIAIAIGVLVGCLFAFLYPHGLFASDLPVQ-NRRLGKSEFLVQS 567
           MAGR+D++Q  R SRI IAI IG+ +GC+ A L+P+G F S   ++ +  L KS   V  
Sbjct: 1   MAGRRDRSQQLRGSRIAIAILIGIFIGCVCAVLFPYGFFNSSSSLKASEHLSKSSNQV-G 60

Query: 568 SSPCESSERFKMLKGHVVSILEKNSQLEKRIKDLTGELRIVEQTKDHAQKQYLALSENHK 627
           SS CES ER KMLK   V++ EKN++L+K++++LT +LR+ EQ  D+A+KQ LAL    K
Sbjct: 61  SSACESPERVKMLKSDFVTLSEKNAELKKQVRELTEKLRLAEQGSDNARKQVLALGTQIK 120

Query: 628 AGPFGTVKGLRTNPTVIPDESVNPRLAKLLEKVAIQRELIVTLANSNVQPMLEVWFTSIQ 687
           AGPFGTVK LRTNPT++PDES+NPRLAK+LE++A+ +E+IV LAN+NV+ MLEV   SI+
Sbjct: 121 AGPFGTVKSLRTNPTILPDESINPRLAKILEEIAVDKEVIVALANANVKAMLEVQIASIK 180

Query: 688 KVGIPNYLVVALDDQTEEFCKSHNVPVYTRDPDKSVDLIGKEGGNHQVSALKFRILREFL 747
           +VGI NYLVVALDD  E  CK ++V  Y RDPDK VD +GK GGNH VS LKFR+LREFL
Sbjct: 181 RVGITNYLVVALDDYIENLCKENDVAYYKRDPDKDVDTVGKTGGNHAVSGLKFRVLREFL 240

Query: 748 QLGYSVLLSDVDIVYLQNPFDHLYRDSDVESMSDGHSNMTAYGYNDVFDEPAMGWARYAH 807
           QLGY VLLSDVDIV+LQNPF HLYRDSDVESMSDGH N TAYG+NDVFDEPAMGWARYAH
Sbjct: 241 QLGYGVLLSDVDIVFLQNPFSHLYRDSDVESMSDGHDNHTAYGFNDVFDEPAMGWARYAH 300

Query: 808 TMRIWVYNSGFFYIRPTLPSFELLDRVATRLSQEKAWDQAVFNEELFYPSRPGRDGLHAS 867
           TMRIWV+NSGFFY+RPT+PS ELLDRVA RLS+ K WDQAVFNEELFYPS P    LHAS
Sbjct: 301 TMRIWVFNSGFFYLRPTIPSIELLDRVADRLSKAKVWDQAVFNEELFYPSHPEYTALHAS 360

Query: 868 KRTMDMYLFMNSKVLFKTVRKDPKL-RQLKPVIVHINYHPDKYPRMKAVVEFYVNGQQNA 927
           KR MDMY FMNSKVLFKTVRK+ +L +++KPVIVH+NYHPDK  RM+AVVEFYVNG+Q+A
Sbjct: 361 KRVMDMYEFMNSKVLFKTVRKNHELKKKVKPVIVHVNYHPDKLNRMQAVVEFYVNGKQDA 420

Query: 928 LDSFPDGSE 935
           LDSFPDGSE
Sbjct: 421 LDSFPDGSE 428

BLAST of Csor.00g140520 vs. ExPASy Swiss-Prot
Match: Q9SUK9 (Probable protein phosphatase 2C 55 OS=Arabidopsis thaliana OX=3702 GN=At4g16580 PE=2 SV=2)

HSP 1 Score: 504.6 bits (1298), Expect = 2.4e-141
Identity = 288/466 (61.80%), Positives = 341/466 (73.18%), Query Frame = 0

Query: 26  EFLFGQQKLLCGSSSL---LHSVPYSSLTELHALLRPGTISGASSE--LVNSRRNISVLG 85
           E L  Q K+L G  +L    +   Y+  T  +  L P     ASS+  L+N RRN+SV+G
Sbjct: 6   ESLQKQVKILIGLGNLGFGGYRGLYTRFTNPNGFLEP-----ASSDLLLINERRNLSVIG 65

Query: 86  AISRTFSIPSVSGPALQTCGYHIDCAIAESNQYSTRSKFQDKPMAACGSRAGLGECSLEN 145
           A+SRTFS+PSVSGPA Q CGYHID  +++                 C S A LG  SL  
Sbjct: 66  AVSRTFSVPSVSGPAFQVCGYHIDLLLSD----------------PCKSMASLGSKSL-- 125

Query: 146 LSFRIARTSPPAISPSICFNKRSVDCCPKA--SMSLKNQEQPSNNVIYGYFTYNVAKRFC 205
               + R S   +S        S D   +   SM L+ ++    + I  YF Y  AKR+ 
Sbjct: 126 ---FVDRHSASLVSKRFTGGMVSGDGPNRGRISMRLRGKDHNEKSTICAYFAYRGAKRWI 185

Query: 206 SSYLH---AGLGARDLHSSSTSSLAAGSAPNLSFDNSAREEQLANSTDSSAQKIPKGKSM 265
             YL+    G+G R LHSS ++ L+AG+AP++S DNS  +EQ+ +S+DS A K+   K +
Sbjct: 186 --YLNQQRRGMGFRGLHSSLSNRLSAGNAPDVSLDNSVTDEQVRDSSDSVAAKLCT-KPL 245

Query: 266 KLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVADGVGGWADLGVDAGQYSRELMSNSV 325
           KLVSGSCYLPHPDKE TGGEDAHFIC +EQA+GVADGVGGWA+LG+DAG YSRELMSNSV
Sbjct: 246 KLVSGSCYLPHPDKEATGGEDAHFICAEEQALGVADGVGGWAELGIDAGYYSRELMSNSV 305

Query: 326 NAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIALTEQGLHAINLGDSGFMVVRDGCT 385
           NA+Q+EPKGSIDPARVLEKAH+ TK++GSSTACIIALT QGLHAINLGDSGFMVVR+G T
Sbjct: 306 NAIQDEPKGSIDPARVLEKAHTCTKSQGSSTACIIALTNQGLHAINLGDSGFMVVREGHT 365

Query: 386 IFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPVAPGDVIIAGTDGLFDNLYNNEITA 445
           +FRSPVQQHDFNFT+QLESG NGDLPSSGQVF+V VAPGDVIIAGTDGLFDNLYNNEITA
Sbjct: 366 VFRSPVQQHDFNFTYQLESGRNGDLPSSGQVFTVAVAPGDVIIAGTDGLFDNLYNNEITA 425

Query: 446 VVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPFSTAAQDAGFR 482
           +VVHA+RA +  QVTAQKIAALARQRAQDK+RQTPFSTAAQDAGFR
Sbjct: 426 IVVHAVRANIDPQVTAQKIAALARQRAQDKNRQTPFSTAAQDAGFR 442

BLAST of Csor.00g140520 vs. ExPASy Swiss-Prot
Match: Q9C9Q6 (Arabinosyltransferase RRA1 OS=Arabidopsis thaliana OX=3702 GN=RRA1 PE=2 SV=1)

HSP 1 Score: 504.2 bits (1297), Expect = 3.1e-141
Identity = 257/426 (60.33%), Positives = 309/426 (72.54%), Query Frame = 0

Query: 508 MAGRKDKAQSARVSRIVIAIAIGVLVGCLFAFLYPHGLFASDLPVQNRRLGKSEFLVQSS 567
           MA RK+K Q  R   I IA+ +G+ +GC+   L P+          N R  K      +S
Sbjct: 1   MAVRKEKVQPFRECGIAIAVLVGIFIGCVCTILIPNDFV-------NFRSSK-----VAS 60

Query: 568 SPCESSERFKMLKGHVVSILEKNSQLEKRIKDLTGELRIVEQTKDHAQKQYLALSENHKA 627
           + CES ER KM K     I EKN +L K++ DLT ++R+ EQ             E  KA
Sbjct: 61  ASCESPERVKMFKAEFAIISEKNGELRKQVSDLTEKVRLAEQ------------KEVIKA 120

Query: 628 GPFGTVKGLRTNPTVIPDESVNPRLAKLLEKVAIQRELIVTLANSNVQPMLEVWFTSIQK 687
           GPFGTV GL+TNPTV PDES NPRLAKLLEKVA+ +E+IV LAN+NV+PMLEV   S+++
Sbjct: 121 GPFGTVTGLQTNPTVAPDESANPRLAKLLEKVAVNKEIIVVLANNNVKPMLEVQIASVKR 180

Query: 688 VGIPNYLVVALDDQTEEFCKSHNVPVYTRDPDKSVDLIGKEGGNHQVSALKFRILREFLQ 747
           VGI NYLVV LDD  E FCKS+ V  Y RDPD ++D++GK   +  VS LKFR+LREFLQ
Sbjct: 181 VGIQNYLVVPLDDSLESFCKSNEVAYYKRDPDNAIDVVGKSRRSSDVSGLKFRVLREFLQ 240

Query: 748 LGYSVLLSDVDIVYLQNPFDHLYRDSDVESMSDGHSNMTAYGYNDVFDEPAMGWARYAHT 807
           LGY VLLSDVDIV+LQNPF HLYRDSDVESMSDGH N TAYG+NDVFD+P M  +R  +T
Sbjct: 241 LGYGVLLSDVDIVFLQNPFGHLYRDSDVESMSDGHDNNTAYGFNDVFDDPTMTRSRTVYT 300

Query: 808 MRIWVYNSGFFYIRPTLPSFELLDRVATRLSQEKAWDQAVFNEELFYPSRPGRDGLHASK 867
            RIWV+NSGFFY+RPTLPS ELLDRV   LS+   WDQAVFN+ LFYPS PG  GL+ASK
Sbjct: 301 NRIWVFNSGFFYLRPTLPSIELLDRVTDTLSKSGGWDQAVFNQHLFYPSHPGYTGLYASK 360

Query: 868 RTMDMYLFMNSKVLFKTVRKDPKLRQLKPVIVHINYHPDKYPRMKAVVEFYVNGQQNALD 927
           R MD+Y FMNS+VLFKTVRKD ++++LKPVI+H+NYH DK  RM+A VEFYVNG+Q+ALD
Sbjct: 361 RVMDVYEFMNSRVLFKTVRKDEEMKKLKPVIIHMNYHSDKLERMQAAVEFYVNGKQDALD 402

Query: 928 SFPDGS 934
            F DGS
Sbjct: 421 RFRDGS 402

BLAST of Csor.00g140520 vs. ExPASy Swiss-Prot
Match: Q9LVQ8 (Probable protein phosphatase 2C 80 OS=Arabidopsis thaliana OX=3702 GN=At5g66720 PE=2 SV=1)

HSP 1 Score: 357.5 bits (916), Expect = 4.7e-97
Identity = 208/374 (55.61%), Positives = 261/374 (69.79%), Query Frame = 0

Query: 113 YSTRSKFQDKPMAACGSRAGLGECSLENL----SFRIARTSPPAISPSICFNKRSVDCCP 172
           +S  S+F+ + MAA GS    G+  L++L    S  +  T   +   S   N      CP
Sbjct: 38  FSDSSRFR-QAMAASGSLPVFGDACLDDLVTTCSNGLDFTKKRSSGGSFTIN------CP 97

Query: 173 KASMSLKNQEQPSNNVIYGYFTYNVAKRFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLS 232
            ASM L  +     N +  +  Y+V      S    G  ++ +H+S  +  + G A  LS
Sbjct: 98  VASMRLGKRGGMMKNRLVCH--YSVVDPLEKSRALFGTLSKSVHTSPMACFSVGPAHELS 157

Query: 233 FDNSAREEQLANSTDSSAQKIPKGKSMKLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIG 292
             N   +E    +T S        KS++LVSGSCYLPHP+KE TGGEDAHFIC +EQAIG
Sbjct: 158 SLNGGSQESPPTTTTSL-------KSLRLVSGSCYLPHPEKEATGGEDAHFICDEEQAIG 217

Query: 293 VADGVGGWADLGVDAGQYSRELMSNSVNAVQEEPKG-SIDPARVLEKAHSKTKAKGSSTA 352
           VADGVGGWA++GV+AG +SRELMS SV+A+QE+ KG SIDP  VLEKAHS+TKAKGSSTA
Sbjct: 218 VADGVGGWAEVGVNAGLFSRELMSYSVSAIQEQHKGSSIDPLVVLEKAHSQTKAKGSSTA 277

Query: 353 CIIALTEQGLHAINLGDSGFMVVRDGCTIFRSPVQQHDFNFTFQLESGNNGDLPSSGQVF 412
           CII L ++GLHAINLGDSGF VVR+G T+F+SPVQQH FNFT+QLESGN+ D+PSSGQVF
Sbjct: 278 CIIVLKDKGLHAINLGDSGFTVVREGTTVFQSPVQQHGFNFTYQLESGNSADVPSSGQVF 337

Query: 413 SVPVAPGDVIIAGTDGLFDNLYNNEITAVVVHAMRAGLGSQVTAQKIAALARQRAQDKDR 472
           ++ V  GDVI+AGTDG++DNLYN EIT VVV ++RAGL  + TAQKIA LARQRA DK R
Sbjct: 338 TIDVQSGDVIVAGTDGVYDNLYNEEITGVVVSSVRAGLDPKGTAQKIAELARQRAVDKKR 395

Query: 473 QTPFSTAAQDAGFR 482
           Q+PF+TAAQ+AG+R
Sbjct: 398 QSPFATAAQEAGYR 395

BLAST of Csor.00g140520 vs. NCBI nr
Match: KAG6608522.1 (Arabinosyltransferase RRA2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1857 bits (4809), Expect = 0.0
Identity = 934/934 (100.00%), Positives = 934/934 (100.00%), Query Frame = 0

Query: 1   MPSYMSKLRAPVHFLGWEGGFKSSLEFLFGQQKLLCGSSSLLHSVPYSSLTELHALLRPG 60
           MPSYMSKLRAPVHFLGWEGGFKSSLEFLFGQQKLLCGSSSLLHSVPYSSLTELHALLRPG
Sbjct: 1   MPSYMSKLRAPVHFLGWEGGFKSSLEFLFGQQKLLCGSSSLLHSVPYSSLTELHALLRPG 60

Query: 61  TISGASSELVNSRRNISVLGAISRTFSIPSVSGPALQTCGYHIDCAIAESNQYSTRSKFQ 120
           TISGASSELVNSRRNISVLGAISRTFSIPSVSGPALQTCGYHIDCAIAESNQYSTRSKFQ
Sbjct: 61  TISGASSELVNSRRNISVLGAISRTFSIPSVSGPALQTCGYHIDCAIAESNQYSTRSKFQ 120

Query: 121 DKPMAACGSRAGLGECSLENLSFRIARTSPPAISPSICFNKRSVDCCPKASMSLKNQEQP 180
           DKPMAACGSRAGLGECSLENLSFRIARTSPPAISPSICFNKRSVDCCPKASMSLKNQEQP
Sbjct: 121 DKPMAACGSRAGLGECSLENLSFRIARTSPPAISPSICFNKRSVDCCPKASMSLKNQEQP 180

Query: 181 SNNVIYGYFTYNVAKRFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLSFDNSAREEQLAN 240
           SNNVIYGYFTYNVAKRFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLSFDNSAREEQLAN
Sbjct: 181 SNNVIYGYFTYNVAKRFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLSFDNSAREEQLAN 240

Query: 241 STDSSAQKIPKGKSMKLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVADGVGGWADLG 300
           STDSSAQKIPKGKSMKLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVADGVGGWADLG
Sbjct: 241 STDSSAQKIPKGKSMKLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVADGVGGWADLG 300

Query: 301 VDAGQYSRELMSNSVNAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIALTEQGLHAI 360
           VDAGQYSRELMSNSVNAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIALTEQGLHAI
Sbjct: 301 VDAGQYSRELMSNSVNAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIALTEQGLHAI 360

Query: 361 NLGDSGFMVVRDGCTIFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPVAPGDVIIAG 420
           NLGDSGFMVVRDGCTIFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPVAPGDVIIAG
Sbjct: 361 NLGDSGFMVVRDGCTIFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPVAPGDVIIAG 420

Query: 421 TDGLFDNLYNNEITAVVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPFSTAAQDAGF 480
           TDGLFDNLYNNEITAVVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPFSTAAQDAGF
Sbjct: 421 TDGLFDNLYNNEITAVVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPFSTAAQDAGF 480

Query: 481 RSDDQNFWCFKSILSCFTHSPTAPDFSMAGRKDKAQSARVSRIVIAIAIGVLVGCLFAFL 540
           RSDDQNFWCFKSILSCFTHSPTAPDFSMAGRKDKAQSARVSRIVIAIAIGVLVGCLFAFL
Sbjct: 481 RSDDQNFWCFKSILSCFTHSPTAPDFSMAGRKDKAQSARVSRIVIAIAIGVLVGCLFAFL 540

Query: 541 YPHGLFASDLPVQNRRLGKSEFLVQSSSPCESSERFKMLKGHVVSILEKNSQLEKRIKDL 600
           YPHGLFASDLPVQNRRLGKSEFLVQSSSPCESSERFKMLKGHVVSILEKNSQLEKRIKDL
Sbjct: 541 YPHGLFASDLPVQNRRLGKSEFLVQSSSPCESSERFKMLKGHVVSILEKNSQLEKRIKDL 600

Query: 601 TGELRIVEQTKDHAQKQYLALSENHKAGPFGTVKGLRTNPTVIPDESVNPRLAKLLEKVA 660
           TGELRIVEQTKDHAQKQYLALSENHKAGPFGTVKGLRTNPTVIPDESVNPRLAKLLEKVA
Sbjct: 601 TGELRIVEQTKDHAQKQYLALSENHKAGPFGTVKGLRTNPTVIPDESVNPRLAKLLEKVA 660

Query: 661 IQRELIVTLANSNVQPMLEVWFTSIQKVGIPNYLVVALDDQTEEFCKSHNVPVYTRDPDK 720
           IQRELIVTLANSNVQPMLEVWFTSIQKVGIPNYLVVALDDQTEEFCKSHNVPVYTRDPDK
Sbjct: 661 IQRELIVTLANSNVQPMLEVWFTSIQKVGIPNYLVVALDDQTEEFCKSHNVPVYTRDPDK 720

Query: 721 SVDLIGKEGGNHQVSALKFRILREFLQLGYSVLLSDVDIVYLQNPFDHLYRDSDVESMSD 780
           SVDLIGKEGGNHQVSALKFRILREFLQLGYSVLLSDVDIVYLQNPFDHLYRDSDVESMSD
Sbjct: 721 SVDLIGKEGGNHQVSALKFRILREFLQLGYSVLLSDVDIVYLQNPFDHLYRDSDVESMSD 780

Query: 781 GHSNMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTLPSFELLDRVATRLSQE 840
           GHSNMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTLPSFELLDRVATRLSQE
Sbjct: 781 GHSNMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTLPSFELLDRVATRLSQE 840

Query: 841 KAWDQAVFNEELFYPSRPGRDGLHASKRTMDMYLFMNSKVLFKTVRKDPKLRQLKPVIVH 900
           KAWDQAVFNEELFYPSRPGRDGLHASKRTMDMYLFMNSKVLFKTVRKDPKLRQLKPVIVH
Sbjct: 841 KAWDQAVFNEELFYPSRPGRDGLHASKRTMDMYLFMNSKVLFKTVRKDPKLRQLKPVIVH 900

Query: 901 INYHPDKYPRMKAVVEFYVNGQQNALDSFPDGSE 934
           INYHPDKYPRMKAVVEFYVNGQQNALDSFPDGSE
Sbjct: 901 INYHPDKYPRMKAVVEFYVNGQQNALDSFPDGSE 934

BLAST of Csor.00g140520 vs. NCBI nr
Match: KAG7037845.1 (Arabinosyltransferase RRA3 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1838 bits (4761), Expect = 0.0
Identity = 931/956 (97.38%), Positives = 931/956 (97.38%), Query Frame = 0

Query: 1   MPSYMSKLRAPVHFLGWEGGFKSSLEFLFGQQKLLCGSSSLLHSVPYSSLTELHALLRPG 60
           MPSYMSKLRAPVHFLGWEGGFKSSLEFLFGQQKLLCGSSSL HSVPYSSLTELHALLRPG
Sbjct: 1   MPSYMSKLRAPVHFLGWEGGFKSSLEFLFGQQKLLCGSSSLFHSVPYSSLTELHALLRPG 60

Query: 61  TISGASSELVNSRRNISVLGAISRTFSIPSVSGPALQTCGYHIDCAIAESNQYSTRSKFQ 120
           TISGASSELVNSRRNISVLGAISRTFSIPSVSGPALQTCGYHIDCAIAESNQYSTRSKFQ
Sbjct: 61  TISGASSELVNSRRNISVLGAISRTFSIPSVSGPALQTCGYHIDCAIAESNQYSTRSKFQ 120

Query: 121 DKPMAACGSRAGLGECSLENLSFRIARTSPPAISPSICFNKRSVDCCPKASMSLKNQEQP 180
           DKPMAACGSRAGLGECSLENLSFRIARTSPPAISPSICFNKRSVDCCPKASMSLKNQEQP
Sbjct: 121 DKPMAACGSRAGLGECSLENLSFRIARTSPPAISPSICFNKRSVDCCPKASMSLKNQEQP 180

Query: 181 SNNVIYGYFTYNVAKRFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLSFDNSAREEQLAN 240
           SNNVIYGYFTYNVAKRFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLSFDNSAREEQLAN
Sbjct: 181 SNNVIYGYFTYNVAKRFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLSFDNSAREEQLAN 240

Query: 241 STDSSAQKIPKGKSMKLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVADGVGGWADLG 300
           STDSSAQKIPKGKSMKLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVADGVGGWADLG
Sbjct: 241 STDSSAQKIPKGKSMKLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVADGVGGWADLG 300

Query: 301 VDAGQYSRELMSNSVNAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIALTEQGLHAI 360
           VDAGQYSRELMSNSVNAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIALTEQGLHAI
Sbjct: 301 VDAGQYSRELMSNSVNAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIALTEQGLHAI 360

Query: 361 NLGDSGFMVVRDGCTIFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPVAPGDVIIAG 420
           NLGDSGFMVVRDGCTIFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPVAPGDVIIAG
Sbjct: 361 NLGDSGFMVVRDGCTIFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPVAPGDVIIAG 420

Query: 421 TDGLFDNLYNNEITAVVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPFSTAAQDAGF 480
           TDGLFDNLYNNEITAVVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPFSTAAQDAGF
Sbjct: 421 TDGLFDNLYNNEITAVVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPFSTAAQDAGF 480

Query: 481 R----------------------SDDQNFWCFKSILSCFTHSPTAPDFSMAGRKDKAQSA 540
           R                      SDDQNFWCFKSILSCFTHSP APDFSMAGRKDKAQSA
Sbjct: 481 RYYGGKLDDITVVVSYVASSNDNSDDQNFWCFKSILSCFTHSPAAPDFSMAGRKDKAQSA 540

Query: 541 RVSRIVIAIAIGVLVGCLFAFLYPHGLFASDLPVQNRRLGKSEFLVQSSSPCESSERFKM 600
           RVSRIVIAIAIGVLVGCLFAFLYPHGLFASDLPVQNRRL KSEFLVQSSSPCESSERFKM
Sbjct: 541 RVSRIVIAIAIGVLVGCLFAFLYPHGLFASDLPVQNRRLAKSEFLVQSSSPCESSERFKM 600

Query: 601 LKGHVVSILEKNSQLEKRIKDLTGELRIVEQTKDHAQKQYLALSENHKAGPFGTVKGLRT 660
           LKGHVVSILEKNSQLEKRIKDLTGELRIVEQTKDHAQKQYLALSENHKAGPFGTVKGLRT
Sbjct: 601 LKGHVVSILEKNSQLEKRIKDLTGELRIVEQTKDHAQKQYLALSENHKAGPFGTVKGLRT 660

Query: 661 NPTVIPDESVNPRLAKLLEKVAIQRELIVTLANSNVQPMLEVWFTSIQKVGIPNYLVVAL 720
           NPTVIPDESVNPRLAKLLEKVAIQRELIVTLANSNVQPMLEVWFTSIQKVGIPNYLVVAL
Sbjct: 661 NPTVIPDESVNPRLAKLLEKVAIQRELIVTLANSNVQPMLEVWFTSIQKVGIPNYLVVAL 720

Query: 721 DDQTEEFCKSHNVPVYTRDPDKSVDLIGKEGGNHQVSALKFRILREFLQLGYSVLLSDVD 780
           DDQTEEFCKSHNVPVYTRDPDKSVDLIGKEGGNHQVSALKFRILREFLQLGYSVLLSDVD
Sbjct: 721 DDQTEEFCKSHNVPVYTRDPDKSVDLIGKEGGNHQVSALKFRILREFLQLGYSVLLSDVD 780

Query: 781 IVYLQNPFDHLYRDSDVESMSDGHSNMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFF 840
           IVYLQNPFDHLYRDSDVESMSDGHSNMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFF
Sbjct: 781 IVYLQNPFDHLYRDSDVESMSDGHSNMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFF 840

Query: 841 YIRPTLPSFELLDRVATRLSQEKAWDQAVFNEELFYPSRPGRDGLHASKRTMDMYLFMNS 900
           YIRPTLPSFELLDRVATRLSQEKAWDQAVFNEELFYPSRPGRDGLHASKRTMDMYLFMNS
Sbjct: 841 YIRPTLPSFELLDRVATRLSQEKAWDQAVFNEELFYPSRPGRDGLHASKRTMDMYLFMNS 900

Query: 901 KVLFKTVRKDPKLRQLKPVIVHINYHPDKYPRMKAVVEFYVNGQQNALDSFPDGSE 934
           KVLFKTVRKDPKLRQLKPVIVHINYHPDKYPRMKAVVEFYVNGQQNALDSFPDGSE
Sbjct: 901 KVLFKTVRKDPKLRQLKPVIVHINYHPDKYPRMKAVVEFYVNGQQNALDSFPDGSE 956

BLAST of Csor.00g140520 vs. NCBI nr
Match: KAF9833896.1 (hypothetical protein H0E87_030678 [Populus deltoides])

HSP 1 Score: 1227 bits (3174), Expect = 0.0
Identity = 640/944 (67.80%), Positives = 746/944 (79.03%), Query Frame = 0

Query: 1   MPS-YMSKLRAPVHF------LGWEGGFKSSLEFLFGQQKLLCGSSSLLHSVPYSSLTEL 60
           MPS Y S+LR+ V        +G EG  ++  E L GQ K    +  L HSV  +SLT+L
Sbjct: 1   MPSTYFSRLRSAVQNGIQRSGIGQEGVLQN-FESLIGQGKFRFCNYRLFHSVCVASLTDL 60

Query: 61  HALLRPGTISGASSE--LVNSRRNISVLGAISRTFSIPSVSGPALQTCGYHIDCAIAESN 120
             LLRPGT+  ASS+  +VN +RNISV+GA+SRT S+PSVSGP+ Q CGYHID A+ ++N
Sbjct: 61  QLLLRPGTVVAASSDSLVVNRKRNISVVGAVSRTLSVPSVSGPSFQVCGYHIDRALCDNN 120

Query: 121 QYSTRSKFQDKPMAACGSRAGLGECSLENLSFRIARTSPPAISPSICFNKRSVDCCPKAS 180
           Q     K  +KPMAA  SRA  GE  LENL+ R+        +P I +   S     KAS
Sbjct: 121 QILASGKPYNKPMAARASRAVFGESLLENLTSRVGHLPSSTNNPCISYGSSSSQSFRKAS 180

Query: 181 MSLKNQEQPSNNVIYGYFTYNVAKRFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLSFDN 240
           MSLKN EQP+N+ IYGYF YNVAKR+     +   G RD  SS+ S  AAG+AP+++++N
Sbjct: 181 MSLKNHEQPTNSPIYGYFVYNVAKRWSDFSPYMETGFRDFQSSAHSCFAAGTAPDVTYEN 240

Query: 241 SAREEQLANSTDSSAQKIPKGKSMKLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVAD 300
           S REEQ   S  SS QKI  GK +KL+SGSCYLPHPDKE+TGGEDAHFIC DE A+GVAD
Sbjct: 241 STREEQPEGSA-SSEQKISTGKMLKLLSGSCYLPHPDKEETGGEDAHFICADEHAVGVAD 300

Query: 301 GVGGWADLGVDAGQYSRELMSNSVNAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIA 360
           GVGGWAD G+D+G YSRELMSNSV AVQEEPKGSIDPARVLEKAHS TKAKGSSTACIIA
Sbjct: 301 GVGGWADHGIDSGLYSRELMSNSVTAVQEEPKGSIDPARVLEKAHSSTKAKGSSTACIIA 360

Query: 361 LTEQGLHAINLGDSGFMVVRDGCTIFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPV 420
           LT+QGLHAINLGDSGF+VVRDGCT+FRSPVQQH FNFT+QLE+GNNGDLPSSGQVF++PV
Sbjct: 361 LTDQGLHAINLGDSGFIVVRDGCTVFRSPVQQHGFNFTYQLENGNNGDLPSSGQVFTIPV 420

Query: 421 APGDVIIAGTDGLFDNLYNNEITAVVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPF 480
           APGDVI+AGTDGLFDNLYNNEI AVVVHAMRAGL  Q TAQKIAALARQRAQDKDRQTPF
Sbjct: 421 APGDVIVAGTDGLFDNLYNNEINAVVVHAMRAGLEPQATAQKIAALARQRAQDKDRQTPF 480

Query: 481 STAAQDAGFRSDDQNFWCFKSILSCFTHSPTAPDFSMAGRKDKAQSARVSRIVIAIAIGV 540
           STAAQDAGFR           ++S  T S    +     R++K QS + SRI +AI IG+
Sbjct: 481 STAAQDAGFRYYGGKLDDITVVVSYITSSDN--EGMAVLRREKGQSLQGSRIAVAILIGI 540

Query: 541 LVGCLFAFLYPHGLFASDLPVQNRRLGKSEFLVQSSSPCESSERFKMLKGHVVSILEKNS 600
           L+GC+FA  YPHG F+S+    +RR+  S      SS CES ER KM+K  +V I EKN+
Sbjct: 541 LLGCVFAVFYPHGFFSSNPTGSHRRIANSNLQTGLSS-CESPERIKMVKADIVLISEKNA 600

Query: 601 QLEKRIKDLTGELRIVEQTKDHAQKQYLALSENHKAGPFGTVKGLRTNPTVIPDESVNPR 660
           +++K++++L  +L++ EQ +DHAQKQ L L +  KAGPFGTVKGLRTNPTV+PDESVNPR
Sbjct: 601 EMKKQVRELNEKLQLAEQGQDHAQKQVLLLGKQQKAGPFGTVKGLRTNPTVVPDESVNPR 660

Query: 661 LAKLLEKVAIQRELIVTLANSNVQPMLEVWFTSIQKVGIPNYLVVALDDQTEEFCKSHNV 720
           LAKLLE+VA+++ELIV LANSNV+ MLEVWF +I+K GI NYLVVALDD   +FCKS++V
Sbjct: 661 LAKLLEEVAVRKELIVALANSNVKTMLEVWFANIKKAGIRNYLVVALDDHIVDFCKSNDV 720

Query: 721 PVYTRDPDKSVDLIGKEGGNHQVSALKFRILREFLQLGYSVLLSDVDIVYLQNPFDHLYR 780
           PVY RDPD  +D + + GGNH VS LKFRILREFLQLGYSVLLSDVDI+YLQNPFDHLYR
Sbjct: 721 PVYKRDPDGGIDSVARTGGNHAVSGLKFRILREFLQLGYSVLLSDVDIIYLQNPFDHLYR 780

Query: 781 DSDVESMSDGHSNMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTLPSFELLD 840
           DSDVESMSDGH NMTAYG++DVF+EPAMGWARYAHTMRIWVYNSGFFYIRPTLPS ELLD
Sbjct: 781 DSDVESMSDGHDNMTAYGFDDVFNEPAMGWARYAHTMRIWVYNSGFFYIRPTLPSIELLD 840

Query: 841 RVATRLSQE-KAWDQAVFNEELFYPSRPGRDGLHASKRTMDMYLFMNSKVLFKTVRKDPK 900
           RVA RLS+E  +WDQAVFNEELF PS PG DGLHA+KRTMDM+LFMNSKVLFKTVRKDP 
Sbjct: 841 RVAGRLSREPNSWDQAVFNEELFTPSHPGYDGLHAAKRTMDMFLFMNSKVLFKTVRKDPA 900

Query: 901 LRQLKPVIVHINYHPDKYPRMKAVVEFYVNGQQNALDSFPDGSE 934
           L+ LKPVIVH+NYHPDK  RM+AVVEFYVNG+Q+ALD FPDGS+
Sbjct: 901 LKTLKPVIVHVNYHPDKLRRMQAVVEFYVNGKQDALDPFPDGSD 939

BLAST of Csor.00g140520 vs. NCBI nr
Match: BBG94933.1 (Protein phosphatase 2C family protein, partial [Prunus dulcis])

HSP 1 Score: 1224 bits (3168), Expect = 0.0
Identity = 646/968 (66.74%), Positives = 745/968 (76.96%), Query Frame = 0

Query: 15  LGWEGGFKSSLEFLFGQQKLLCGSSSLLHSVPYSSLTELHALLRPGTISGA--SSELVNS 74
           +G EGG + SL+ L GQ KLL G+S L  S P+S++++LHA L PGT+  A   S+LVN 
Sbjct: 52  VGQEGGLQDSLDGLIGQGKLLFGNSKLFQSRPFSTISDLHAFLSPGTVFAARSDSQLVNQ 111

Query: 75  RRNISVLGAISRTFSIPSVSGPALQTCGYHIDCAIAESNQYSTRSKFQDKPMAACGSRAG 134
           R+NISV+G ISR  S PSVSGP+LQ CGYHIDCA++E  Q+ TRSKFQ+KPMAACGSR  
Sbjct: 112 RKNISVVGEISRIISTPSVSGPSLQVCGYHIDCALSEPCQFITRSKFQNKPMAACGSRTV 171

Query: 135 LGECSLENLSFRIARTSP-PAISPSICFNKRSVDCCPKASMSLKNQEQPSNNVIYGYFTY 194
           +G C  +N + R    S  P  S +   N++  DC   ASMSLK +   + N I+GYF Y
Sbjct: 172 VGGCYPDNFTSRRGLLSMVPESSCTFYNNRKGSDCFQAASMSLKKRGLSNTNAIFGYFIY 231

Query: 195 NVAKRFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLSFDNSAREEQLANSTDSSAQKIPK 254
            V KR+ +S    G G+R+ HSSST  L+AG+A ++SFDNSA EEQL++S DSS QK+  
Sbjct: 232 EVGKRWSNSSPTKGSGSREFHSSSTC-LSAGTAQDVSFDNSAPEEQLSSSADSSDQKVTD 291

Query: 255 GKSMKLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVADGVGGWADLGVDAGQYSRELM 314
           GKS+KL SGS YLPHPDKE+TGGEDAHFICV+EQAIGVADGVGGWADLGV++G YSRELM
Sbjct: 292 GKSLKLTSGSYYLPHPDKEETGGEDAHFICVNEQAIGVADGVGGWADLGVNSGLYSRELM 351

Query: 315 SNSVNAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIALTEQGLHAINLGDSGFMVVR 374
           SNSV AVQEEPKGS+DPARVLEKAHS TKAKGSSTACIIALTEQG+HAINLGDSGF+VVR
Sbjct: 352 SNSVAAVQEEPKGSVDPARVLEKAHSSTKAKGSSTACIIALTEQGIHAINLGDSGFIVVR 411

Query: 375 DGCTIFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPVAPGDVIIAGTDGLFDNLYNN 434
           DGCT+FRSPVQQHDFNFT+QLESG+NGDLPSSGQVF+VPVAPGDVIIAGTDGLFDNLYNN
Sbjct: 412 DGCTVFRSPVQQHDFNFTYQLESGSNGDLPSSGQVFTVPVAPGDVIIAGTDGLFDNLYNN 471

Query: 435 EITAVVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPFSTAAQDAGFR---------- 494
           EITAVVVHA+RAGLG QVTAQKIAALARQRAQD+DRQTPFSTAAQDAGFR          
Sbjct: 472 EITAVVVHAIRAGLGPQVTAQKIAALARQRAQDRDRQTPFSTAAQDAGFRYYGGKLDDIT 531

Query: 495 ---SDDQNFWCFKSILSCFTHSPTA-------PDFS-------------------MAGRK 554
              S ++    F S+    + S          P  S                   MAGR+
Sbjct: 532 VVVSYERGSSTFASLTFSSSSSTRTLISDCDIPSLSLSLTLQERLCTARGKGKEGMAGRR 591

Query: 555 D------KAQSARVSRIVIAIAIGVLVGCLFAFLYPHGLFASDLPVQNRRLGKSEFLVQS 614
           D      K QS R SRIV AI +GVL+G + AF +P G F+SD P+Q+RR GK +  VQ 
Sbjct: 592 DGSLMRDKTQSFRGSRIVTAIVVGVLLGSVCAFFFPRGFFSSDPPIQSRRFGKLDLQVQ- 651

Query: 615 SSPCESSERFKMLKGHVVSILEKNSQLEKRIKDLTGELRIVEQTKDHAQKQYLALSENHK 674
                                           DLT +LR+ EQ KDHA +Q+  L + HK
Sbjct: 652 --------------------------------DLTEKLRLAEQGKDHAHEQFSVLGKPHK 711

Query: 675 AGPFGTVKGLRTNPTVIPDESVNPRLAKLLEKVAIQRELIVTLANSNVQPMLEVWFTSIQ 734
           AGP GTVKGLRTNPTVIPDESVNPRLAK+LE VA+Q+ELIV LANSNV+ MLE+WFTSI+
Sbjct: 712 AGPLGTVKGLRTNPTVIPDESVNPRLAKILEDVAVQKELIVALANSNVKAMLEIWFTSIK 771

Query: 735 KVGIPNYLVVALDDQTEEFCKSHNVPVYTRDPDKSVDLIGKEGGNHQVSALKFRILREFL 794
           +VGI NYLVV LDD+ EEFC +++VPVY RDPD  +D I K GGNH VS LKFRILREFL
Sbjct: 772 RVGITNYLVVGLDDEIEEFCIANDVPVYKRDPDDGIDSIAKTGGNHAVSGLKFRILREFL 831

Query: 795 QLGYSVLLSDVDIVYLQNPFDHLYRDSDVESMSDGHSNMTAYGYNDVFDEPAMGWARYAH 854
           QLGYSVLLSDVDIVYLQNPF+HLYRDSDVESMSDGH+NMTAYG+NDVFDEP+MGWARYAH
Sbjct: 832 QLGYSVLLSDVDIVYLQNPFNHLYRDSDVESMSDGHNNMTAYGFNDVFDEPSMGWARYAH 891

Query: 855 TMRIWVYNSGFFYIRPTLPSFELLDRVATRLSQEKAWDQAVFNEELFYPSRPGRDGLHAS 914
           TMRIWVYNSGFFYIRPTLPS ELLDRVA RLS+EKAWDQAVFNEELF+PS PG DGLHAS
Sbjct: 892 TMRIWVYNSGFFYIRPTLPSIELLDRVAGRLSKEKAWDQAVFNEELFFPSHPGYDGLHAS 951

Query: 915 KRTMDMYLFMNSKVLFKTVRKDPKLRQLKPVIVHINYHPDKYPRMKAVVEFYVNGQQNAL 934
           KRTMD YLFMNSKVLFKTVRKD  L++LKPVI+H+NYHPDK PRMKA++EFYVNG+Q+AL
Sbjct: 952 KRTMDFYLFMNSKVLFKTVRKDANLKKLKPVILHVNYHPDKLPRMKAIMEFYVNGKQDAL 985

BLAST of Csor.00g140520 vs. NCBI nr
Match: XP_022768910.1 (probable protein phosphatase 2C 55 isoform X1 [Durio zibethinus])

HSP 1 Score: 1210 bits (3131), Expect = 0.0
Identity = 635/941 (67.48%), Positives = 731/941 (77.68%), Query Frame = 0

Query: 1   MPS-YMSKLRAPVH--FLGWEGGFKSSLEFLFGQQKLLCGSSSLLHSVPYSSLTELHALL 60
           MPS +  +LR+ V       EGG + S+E L G  K+  GS    HS+ +S L +L  +L
Sbjct: 1   MPSTFFWRLRSAVQNGIQRTEGGLQDSIEVLIGAGKVGFGSCRFFHSLRFSGLADLQGIL 60

Query: 61  RPGTISGASSE--LVNSRRNISVLGAISRTFSIPSVSGPALQTCGYHIDCAIAESNQYST 120
           + GT   A S+  L N RRNISV+GA SRT S+PSVSGPA Q CGYHIDCA+A+S+Q S+
Sbjct: 61  QTGTFLAARSDSLLANRRRNISVVGAFSRTISVPSVSGPAFQVCGYHIDCALADSSQISS 120

Query: 121 R-SKFQDKPMAACGSRAGLGECSLENLSFRIARTSPPAISPSICFNKRSVDCCPKASMSL 180
             SKFQ KPMAA  S   +G   ++ L  +    S    S  I +  RS++ C KA MSL
Sbjct: 121 LLSKFQSKPMAASSSGVIIGGYLVDTLKLKHEHLSSSTSSADIFYGNRSLNSCTKARMSL 180

Query: 181 KNQEQPSNNVIYGYFTYNVAKRFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLSFDNSAR 240
           KN+E+P+N+ IYGYF YNV KR+C+     G G+R  HSS  S L+AG+AP++SFDNS R
Sbjct: 181 KNREKPNNSPIYGYFIYNVGKRWCNFNPSLGSGSRAFHSSLPSFLSAGTAPDVSFDNSGR 240

Query: 241 EEQLANSTDSSAQKIPKGKSMKLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVADGVG 300
           EEQ+ANS+ SS +KI  GK++KL+SGSC LPHP KEDTGGEDAHFICVDEQAIGVADGVG
Sbjct: 241 EEQVANSSVSSEEKISAGKTLKLLSGSCCLPHPAKEDTGGEDAHFICVDEQAIGVADGVG 300

Query: 301 GWADLGVDAGQYSRELMSNSVNAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIALTE 360
           GWADLGVDAGQYSRELMSNSV+A+QEEPKGSIDPARVLEKAHS TKAKGSSTACIIALT+
Sbjct: 301 GWADLGVDAGQYSRELMSNSVSAIQEEPKGSIDPARVLEKAHSSTKAKGSSTACIIALTD 360

Query: 361 QGLHAINLGDSGFMVVRDGCTIFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPVAPG 420
           QGLHAINLGDSGFMVVRDGCTIFRSPVQQHDFNFT+QLESG+NGDLPSSGQVF+VPVAPG
Sbjct: 361 QGLHAINLGDSGFMVVRDGCTIFRSPVQQHDFNFTYQLESGSNGDLPSSGQVFAVPVAPG 420

Query: 421 DVIIAGTDGLFDNLYNNEITAVVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPFSTA 480
           DVIIAGTDGLFDNLYNNEITAVVVHA+RAGLG QVTAQKIAALARQRAQD+DRQTPFSTA
Sbjct: 421 DVIIAGTDGLFDNLYNNEITAVVVHAVRAGLGPQVTAQKIAALARQRAQDRDRQTPFSTA 480

Query: 481 AQDAGFRSDDQNFWCFKSILSCFTHSPTAPDFSMAGRKDKAQSARVSRIVIAIAIGVLVG 540
           AQDAGFR                            G+ D                     
Sbjct: 481 AQDAGFRY-------------------------YGGKLD--------------------- 540

Query: 541 CLFAFLYPHGLFASDLPVQNRRLGKSEFLVQSSSPCESSERFKMLKGHVVSILEKNSQLE 600
                         D+ V    +  SE +   SS CESSER KMLK  +VS+ EKNS+L+
Sbjct: 541 --------------DITVVVSYITSSEEI--GSSSCESSERIKMLKSEIVSLSEKNSELK 600

Query: 601 KRIKDLTGELRIVEQTKDHAQKQYLALSENHKAGPFGTVKGLRTNPTVIPDESVNPRLAK 660
           K +KDLT +L++ EQ KDHAQKQ+L L E HKAGP GTVK LRTNPTV+PD+SVNPRLAK
Sbjct: 601 KEVKDLTEKLQLAEQGKDHAQKQFLMLGEQHKAGPVGTVKALRTNPTVVPDDSVNPRLAK 660

Query: 661 LLEKVAIQRELIVTLANSNVQPMLEVWFTSIQKVGIPNYLVVALDDQTEEFCKSHNVPVY 720
           +LE+VA+++ELIV LANSNV+ MLEVWF+SI++VGI NYLV+ALDDQ  E CKS+NVPVY
Sbjct: 661 ILEEVAVRKELIVALANSNVKEMLEVWFSSIKRVGITNYLVIALDDQIVELCKSNNVPVY 720

Query: 721 TRDPDKSVDLIGKEGGNHQVSALKFRILREFLQLGYSVLLSDVDIVYLQNPFDHLYRDSD 780
            RDPD+ +D +G+ GGNH VS LKFRILREFLQLGY VLLSDVDIVYLQNPF+HLYRDSD
Sbjct: 721 KRDPDEGIDAVGRTGGNHAVSGLKFRILREFLQLGYGVLLSDVDIVYLQNPFNHLYRDSD 780

Query: 781 VESMSDGHSNMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTLPSFELLDRVA 840
           VESM+DGH+NMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPT+PS ELLDRVA
Sbjct: 781 VESMTDGHNNMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTIPSIELLDRVA 840

Query: 841 TRLS-QEKAWDQAVFNEELFYPSRPGRDGLHASKRTMDMYLFMNSKVLFKTVRKDPKLRQ 900
            R++ Q+ +WDQAVFNEELF+PS PG DGLHA KRTMD Y+FMNSKVLFKTVR+D KL++
Sbjct: 841 DRMARQQNSWDQAVFNEELFFPSHPGYDGLHAVKRTMDFYMFMNSKVLFKTVRRDAKLKK 879

Query: 901 LKPVIVHINYHPDKYPRMKAVVEFYVNGQQNALDSFPDGSE 934
           LKPVIVH+NYHPDK  RMKAVVEFYV G+Q+ALD FPDGSE
Sbjct: 901 LKPVIVHVNYHPDKLRRMKAVVEFYVKGKQDALDPFPDGSE 879

BLAST of Csor.00g140520 vs. ExPASy TrEMBL
Match: A0A4Y1QST1 (Glycosyltransferase (Fragment) OS=Prunus dulcis OX=3755 GN=Prudu_003335 PE=3 SV=1)

HSP 1 Score: 1224 bits (3168), Expect = 0.0
Identity = 646/968 (66.74%), Positives = 745/968 (76.96%), Query Frame = 0

Query: 15  LGWEGGFKSSLEFLFGQQKLLCGSSSLLHSVPYSSLTELHALLRPGTISGA--SSELVNS 74
           +G EGG + SL+ L GQ KLL G+S L  S P+S++++LHA L PGT+  A   S+LVN 
Sbjct: 52  VGQEGGLQDSLDGLIGQGKLLFGNSKLFQSRPFSTISDLHAFLSPGTVFAARSDSQLVNQ 111

Query: 75  RRNISVLGAISRTFSIPSVSGPALQTCGYHIDCAIAESNQYSTRSKFQDKPMAACGSRAG 134
           R+NISV+G ISR  S PSVSGP+LQ CGYHIDCA++E  Q+ TRSKFQ+KPMAACGSR  
Sbjct: 112 RKNISVVGEISRIISTPSVSGPSLQVCGYHIDCALSEPCQFITRSKFQNKPMAACGSRTV 171

Query: 135 LGECSLENLSFRIARTSP-PAISPSICFNKRSVDCCPKASMSLKNQEQPSNNVIYGYFTY 194
           +G C  +N + R    S  P  S +   N++  DC   ASMSLK +   + N I+GYF Y
Sbjct: 172 VGGCYPDNFTSRRGLLSMVPESSCTFYNNRKGSDCFQAASMSLKKRGLSNTNAIFGYFIY 231

Query: 195 NVAKRFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLSFDNSAREEQLANSTDSSAQKIPK 254
            V KR+ +S    G G+R+ HSSST  L+AG+A ++SFDNSA EEQL++S DSS QK+  
Sbjct: 232 EVGKRWSNSSPTKGSGSREFHSSSTC-LSAGTAQDVSFDNSAPEEQLSSSADSSDQKVTD 291

Query: 255 GKSMKLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVADGVGGWADLGVDAGQYSRELM 314
           GKS+KL SGS YLPHPDKE+TGGEDAHFICV+EQAIGVADGVGGWADLGV++G YSRELM
Sbjct: 292 GKSLKLTSGSYYLPHPDKEETGGEDAHFICVNEQAIGVADGVGGWADLGVNSGLYSRELM 351

Query: 315 SNSVNAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIALTEQGLHAINLGDSGFMVVR 374
           SNSV AVQEEPKGS+DPARVLEKAHS TKAKGSSTACIIALTEQG+HAINLGDSGF+VVR
Sbjct: 352 SNSVAAVQEEPKGSVDPARVLEKAHSSTKAKGSSTACIIALTEQGIHAINLGDSGFIVVR 411

Query: 375 DGCTIFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPVAPGDVIIAGTDGLFDNLYNN 434
           DGCT+FRSPVQQHDFNFT+QLESG+NGDLPSSGQVF+VPVAPGDVIIAGTDGLFDNLYNN
Sbjct: 412 DGCTVFRSPVQQHDFNFTYQLESGSNGDLPSSGQVFTVPVAPGDVIIAGTDGLFDNLYNN 471

Query: 435 EITAVVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPFSTAAQDAGFR---------- 494
           EITAVVVHA+RAGLG QVTAQKIAALARQRAQD+DRQTPFSTAAQDAGFR          
Sbjct: 472 EITAVVVHAIRAGLGPQVTAQKIAALARQRAQDRDRQTPFSTAAQDAGFRYYGGKLDDIT 531

Query: 495 ---SDDQNFWCFKSILSCFTHSPTA-------PDFS-------------------MAGRK 554
              S ++    F S+    + S          P  S                   MAGR+
Sbjct: 532 VVVSYERGSSTFASLTFSSSSSTRTLISDCDIPSLSLSLTLQERLCTARGKGKEGMAGRR 591

Query: 555 D------KAQSARVSRIVIAIAIGVLVGCLFAFLYPHGLFASDLPVQNRRLGKSEFLVQS 614
           D      K QS R SRIV AI +GVL+G + AF +P G F+SD P+Q+RR GK +  VQ 
Sbjct: 592 DGSLMRDKTQSFRGSRIVTAIVVGVLLGSVCAFFFPRGFFSSDPPIQSRRFGKLDLQVQ- 651

Query: 615 SSPCESSERFKMLKGHVVSILEKNSQLEKRIKDLTGELRIVEQTKDHAQKQYLALSENHK 674
                                           DLT +LR+ EQ KDHA +Q+  L + HK
Sbjct: 652 --------------------------------DLTEKLRLAEQGKDHAHEQFSVLGKPHK 711

Query: 675 AGPFGTVKGLRTNPTVIPDESVNPRLAKLLEKVAIQRELIVTLANSNVQPMLEVWFTSIQ 734
           AGP GTVKGLRTNPTVIPDESVNPRLAK+LE VA+Q+ELIV LANSNV+ MLE+WFTSI+
Sbjct: 712 AGPLGTVKGLRTNPTVIPDESVNPRLAKILEDVAVQKELIVALANSNVKAMLEIWFTSIK 771

Query: 735 KVGIPNYLVVALDDQTEEFCKSHNVPVYTRDPDKSVDLIGKEGGNHQVSALKFRILREFL 794
           +VGI NYLVV LDD+ EEFC +++VPVY RDPD  +D I K GGNH VS LKFRILREFL
Sbjct: 772 RVGITNYLVVGLDDEIEEFCIANDVPVYKRDPDDGIDSIAKTGGNHAVSGLKFRILREFL 831

Query: 795 QLGYSVLLSDVDIVYLQNPFDHLYRDSDVESMSDGHSNMTAYGYNDVFDEPAMGWARYAH 854
           QLGYSVLLSDVDIVYLQNPF+HLYRDSDVESMSDGH+NMTAYG+NDVFDEP+MGWARYAH
Sbjct: 832 QLGYSVLLSDVDIVYLQNPFNHLYRDSDVESMSDGHNNMTAYGFNDVFDEPSMGWARYAH 891

Query: 855 TMRIWVYNSGFFYIRPTLPSFELLDRVATRLSQEKAWDQAVFNEELFYPSRPGRDGLHAS 914
           TMRIWVYNSGFFYIRPTLPS ELLDRVA RLS+EKAWDQAVFNEELF+PS PG DGLHAS
Sbjct: 892 TMRIWVYNSGFFYIRPTLPSIELLDRVAGRLSKEKAWDQAVFNEELFFPSHPGYDGLHAS 951

Query: 915 KRTMDMYLFMNSKVLFKTVRKDPKLRQLKPVIVHINYHPDKYPRMKAVVEFYVNGQQNAL 934
           KRTMD YLFMNSKVLFKTVRKD  L++LKPVI+H+NYHPDK PRMKA++EFYVNG+Q+AL
Sbjct: 952 KRTMDFYLFMNSKVLFKTVRKDANLKKLKPVILHVNYHPDKLPRMKAIMEFYVNGKQDAL 985

BLAST of Csor.00g140520 vs. ExPASy TrEMBL
Match: A0A6P6AVU3 (Glycosyltransferase OS=Durio zibethinus OX=66656 GN=LOC111312683 PE=3 SV=1)

HSP 1 Score: 1210 bits (3131), Expect = 0.0
Identity = 635/941 (67.48%), Positives = 731/941 (77.68%), Query Frame = 0

Query: 1   MPS-YMSKLRAPVH--FLGWEGGFKSSLEFLFGQQKLLCGSSSLLHSVPYSSLTELHALL 60
           MPS +  +LR+ V       EGG + S+E L G  K+  GS    HS+ +S L +L  +L
Sbjct: 1   MPSTFFWRLRSAVQNGIQRTEGGLQDSIEVLIGAGKVGFGSCRFFHSLRFSGLADLQGIL 60

Query: 61  RPGTISGASSE--LVNSRRNISVLGAISRTFSIPSVSGPALQTCGYHIDCAIAESNQYST 120
           + GT   A S+  L N RRNISV+GA SRT S+PSVSGPA Q CGYHIDCA+A+S+Q S+
Sbjct: 61  QTGTFLAARSDSLLANRRRNISVVGAFSRTISVPSVSGPAFQVCGYHIDCALADSSQISS 120

Query: 121 R-SKFQDKPMAACGSRAGLGECSLENLSFRIARTSPPAISPSICFNKRSVDCCPKASMSL 180
             SKFQ KPMAA  S   +G   ++ L  +    S    S  I +  RS++ C KA MSL
Sbjct: 121 LLSKFQSKPMAASSSGVIIGGYLVDTLKLKHEHLSSSTSSADIFYGNRSLNSCTKARMSL 180

Query: 181 KNQEQPSNNVIYGYFTYNVAKRFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLSFDNSAR 240
           KN+E+P+N+ IYGYF YNV KR+C+     G G+R  HSS  S L+AG+AP++SFDNS R
Sbjct: 181 KNREKPNNSPIYGYFIYNVGKRWCNFNPSLGSGSRAFHSSLPSFLSAGTAPDVSFDNSGR 240

Query: 241 EEQLANSTDSSAQKIPKGKSMKLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVADGVG 300
           EEQ+ANS+ SS +KI  GK++KL+SGSC LPHP KEDTGGEDAHFICVDEQAIGVADGVG
Sbjct: 241 EEQVANSSVSSEEKISAGKTLKLLSGSCCLPHPAKEDTGGEDAHFICVDEQAIGVADGVG 300

Query: 301 GWADLGVDAGQYSRELMSNSVNAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIALTE 360
           GWADLGVDAGQYSRELMSNSV+A+QEEPKGSIDPARVLEKAHS TKAKGSSTACIIALT+
Sbjct: 301 GWADLGVDAGQYSRELMSNSVSAIQEEPKGSIDPARVLEKAHSSTKAKGSSTACIIALTD 360

Query: 361 QGLHAINLGDSGFMVVRDGCTIFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPVAPG 420
           QGLHAINLGDSGFMVVRDGCTIFRSPVQQHDFNFT+QLESG+NGDLPSSGQVF+VPVAPG
Sbjct: 361 QGLHAINLGDSGFMVVRDGCTIFRSPVQQHDFNFTYQLESGSNGDLPSSGQVFAVPVAPG 420

Query: 421 DVIIAGTDGLFDNLYNNEITAVVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPFSTA 480
           DVIIAGTDGLFDNLYNNEITAVVVHA+RAGLG QVTAQKIAALARQRAQD+DRQTPFSTA
Sbjct: 421 DVIIAGTDGLFDNLYNNEITAVVVHAVRAGLGPQVTAQKIAALARQRAQDRDRQTPFSTA 480

Query: 481 AQDAGFRSDDQNFWCFKSILSCFTHSPTAPDFSMAGRKDKAQSARVSRIVIAIAIGVLVG 540
           AQDAGFR                            G+ D                     
Sbjct: 481 AQDAGFRY-------------------------YGGKLD--------------------- 540

Query: 541 CLFAFLYPHGLFASDLPVQNRRLGKSEFLVQSSSPCESSERFKMLKGHVVSILEKNSQLE 600
                         D+ V    +  SE +   SS CESSER KMLK  +VS+ EKNS+L+
Sbjct: 541 --------------DITVVVSYITSSEEI--GSSSCESSERIKMLKSEIVSLSEKNSELK 600

Query: 601 KRIKDLTGELRIVEQTKDHAQKQYLALSENHKAGPFGTVKGLRTNPTVIPDESVNPRLAK 660
           K +KDLT +L++ EQ KDHAQKQ+L L E HKAGP GTVK LRTNPTV+PD+SVNPRLAK
Sbjct: 601 KEVKDLTEKLQLAEQGKDHAQKQFLMLGEQHKAGPVGTVKALRTNPTVVPDDSVNPRLAK 660

Query: 661 LLEKVAIQRELIVTLANSNVQPMLEVWFTSIQKVGIPNYLVVALDDQTEEFCKSHNVPVY 720
           +LE+VA+++ELIV LANSNV+ MLEVWF+SI++VGI NYLV+ALDDQ  E CKS+NVPVY
Sbjct: 661 ILEEVAVRKELIVALANSNVKEMLEVWFSSIKRVGITNYLVIALDDQIVELCKSNNVPVY 720

Query: 721 TRDPDKSVDLIGKEGGNHQVSALKFRILREFLQLGYSVLLSDVDIVYLQNPFDHLYRDSD 780
            RDPD+ +D +G+ GGNH VS LKFRILREFLQLGY VLLSDVDIVYLQNPF+HLYRDSD
Sbjct: 721 KRDPDEGIDAVGRTGGNHAVSGLKFRILREFLQLGYGVLLSDVDIVYLQNPFNHLYRDSD 780

Query: 781 VESMSDGHSNMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTLPSFELLDRVA 840
           VESM+DGH+NMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPT+PS ELLDRVA
Sbjct: 781 VESMTDGHNNMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTIPSIELLDRVA 840

Query: 841 TRLS-QEKAWDQAVFNEELFYPSRPGRDGLHASKRTMDMYLFMNSKVLFKTVRKDPKLRQ 900
            R++ Q+ +WDQAVFNEELF+PS PG DGLHA KRTMD Y+FMNSKVLFKTVR+D KL++
Sbjct: 841 DRMARQQNSWDQAVFNEELFFPSHPGYDGLHAVKRTMDFYMFMNSKVLFKTVRRDAKLKK 879

Query: 901 LKPVIVHINYHPDKYPRMKAVVEFYVNGQQNALDSFPDGSE 934
           LKPVIVH+NYHPDK  RMKAVVEFYV G+Q+ALD FPDGSE
Sbjct: 901 LKPVIVHVNYHPDKLRRMKAVVEFYVKGKQDALDPFPDGSE 879

BLAST of Csor.00g140520 vs. ExPASy TrEMBL
Match: A0A6P6AVS4 (Glycosyltransferase OS=Durio zibethinus OX=66656 GN=LOC111312683 PE=3 SV=1)

HSP 1 Score: 1161 bits (3003), Expect = 0.0
Identity = 619/941 (65.78%), Positives = 710/941 (75.45%), Query Frame = 0

Query: 1   MPS-YMSKLRAPVH--FLGWEGGFKSSLEFLFGQQKLLCGSSSLLHSVPYSSLTELHALL 60
           MPS +  +LR+ V       EGG + S+E L G  K+  GS    HS+ +S L +L  +L
Sbjct: 1   MPSTFFWRLRSAVQNGIQRTEGGLQDSIEVLIGAGKVGFGSCRFFHSLRFSGLADLQGIL 60

Query: 61  RPGTISGASSE--LVNSRRNISVLGAISRTFSIPSVSGPALQTCGYHIDCAIAESNQYST 120
           + GT   A S+  L N RRNISV+GA SRT S+PSVSGPA Q CGYHIDCA+A+S+Q S+
Sbjct: 61  QTGTFLAARSDSLLANRRRNISVVGAFSRTISVPSVSGPAFQVCGYHIDCALADSSQISS 120

Query: 121 R-SKFQDKPMAACGSRAGLGECSLENLSFRIARTSPPAISPSICFNKRSVDCCPKASMSL 180
             SKFQ KPMAA  S   +G   ++ L  +    S    S  I +  RS++ C KA MSL
Sbjct: 121 LLSKFQSKPMAASSSGVIIGGYLVDTLKLKHEHLSSSTSSADIFYGNRSLNSCTKARMSL 180

Query: 181 KNQEQPSNNVIYGYFTYNVAKRFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLSFDNSAR 240
           KN+                              +R  HSS  S L+AG+AP++SFDNS R
Sbjct: 181 KNR------------------------------SRAFHSSLPSFLSAGTAPDVSFDNSGR 240

Query: 241 EEQLANSTDSSAQKIPKGKSMKLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVADGVG 300
           EEQ+ANS+ SS +KI  GK++KL+SGSC LPHP KEDTGGEDAHFICVDEQAIGVADGVG
Sbjct: 241 EEQVANSSVSSEEKISAGKTLKLLSGSCCLPHPAKEDTGGEDAHFICVDEQAIGVADGVG 300

Query: 301 GWADLGVDAGQYSRELMSNSVNAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIALTE 360
           GWADLGVDAGQYSRELMSNSV+A+QEEPKGSIDPARVLEKAHS TKAKGSSTACIIALT+
Sbjct: 301 GWADLGVDAGQYSRELMSNSVSAIQEEPKGSIDPARVLEKAHSSTKAKGSSTACIIALTD 360

Query: 361 QGLHAINLGDSGFMVVRDGCTIFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPVAPG 420
           QGLHAINLGDSGFMVVRDGCTIFRSPVQQHDFNFT+QLESG+NGDLPSSGQVF+VPVAPG
Sbjct: 361 QGLHAINLGDSGFMVVRDGCTIFRSPVQQHDFNFTYQLESGSNGDLPSSGQVFAVPVAPG 420

Query: 421 DVIIAGTDGLFDNLYNNEITAVVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPFSTA 480
           DVIIAGTDGLFDNLYNNEITAVVVHA+RAGLG QVTAQKIAALARQRAQD+DRQTPFSTA
Sbjct: 421 DVIIAGTDGLFDNLYNNEITAVVVHAVRAGLGPQVTAQKIAALARQRAQDRDRQTPFSTA 480

Query: 481 AQDAGFRSDDQNFWCFKSILSCFTHSPTAPDFSMAGRKDKAQSARVSRIVIAIAIGVLVG 540
           AQDAGFR                            G+ D                     
Sbjct: 481 AQDAGFRY-------------------------YGGKLD--------------------- 540

Query: 541 CLFAFLYPHGLFASDLPVQNRRLGKSEFLVQSSSPCESSERFKMLKGHVVSILEKNSQLE 600
                         D+ V    +  SE +   SS CESSER KMLK  +VS+ EKNS+L+
Sbjct: 541 --------------DITVVVSYITSSEEI--GSSSCESSERIKMLKSEIVSLSEKNSELK 600

Query: 601 KRIKDLTGELRIVEQTKDHAQKQYLALSENHKAGPFGTVKGLRTNPTVIPDESVNPRLAK 660
           K +KDLT +L++ EQ KDHAQKQ+L L E HKAGP GTVK LRTNPTV+PD+SVNPRLAK
Sbjct: 601 KEVKDLTEKLQLAEQGKDHAQKQFLMLGEQHKAGPVGTVKALRTNPTVVPDDSVNPRLAK 660

Query: 661 LLEKVAIQRELIVTLANSNVQPMLEVWFTSIQKVGIPNYLVVALDDQTEEFCKSHNVPVY 720
           +LE+VA+++ELIV LANSNV+ MLEVWF+SI++VGI NYLV+ALDDQ  E CKS+NVPVY
Sbjct: 661 ILEEVAVRKELIVALANSNVKEMLEVWFSSIKRVGITNYLVIALDDQIVELCKSNNVPVY 720

Query: 721 TRDPDKSVDLIGKEGGNHQVSALKFRILREFLQLGYSVLLSDVDIVYLQNPFDHLYRDSD 780
            RDPD+ +D +G+ GGNH VS LKFRILREFLQLGY VLLSDVDIVYLQNPF+HLYRDSD
Sbjct: 721 KRDPDEGIDAVGRTGGNHAVSGLKFRILREFLQLGYGVLLSDVDIVYLQNPFNHLYRDSD 780

Query: 781 VESMSDGHSNMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTLPSFELLDRVA 840
           VESM+DGH+NMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPT+PS ELLDRVA
Sbjct: 781 VESMTDGHNNMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTIPSIELLDRVA 840

Query: 841 TRLS-QEKAWDQAVFNEELFYPSRPGRDGLHASKRTMDMYLFMNSKVLFKTVRKDPKLRQ 900
            R++ Q+ +WDQAVFNEELF+PS PG DGLHA KRTMD Y+FMNSKVLFKTVR+D KL++
Sbjct: 841 DRMARQQNSWDQAVFNEELFFPSHPGYDGLHAVKRTMDFYMFMNSKVLFKTVRRDAKLKK 849

Query: 901 LKPVIVHINYHPDKYPRMKAVVEFYVNGQQNALDSFPDGSE 934
           LKPVIVH+NYHPDK  RMKAVVEFYV G+Q+ALD FPDGSE
Sbjct: 901 LKPVIVHVNYHPDKLRRMKAVVEFYVKGKQDALDPFPDGSE 849

BLAST of Csor.00g140520 vs. ExPASy TrEMBL
Match: A0A5N6NI82 (Glycosyltransferase OS=Mikania micrantha OX=192012 GN=E3N88_20854 PE=3 SV=1)

HSP 1 Score: 1099 bits (2842), Expect = 0.0
Identity = 589/1008 (58.43%), Positives = 714/1008 (70.83%), Query Frame = 0

Query: 18   EGGFKSSLEFLFGQQKLLCGSSSLLHSVPYSSLTELHALLRPGTISGASSEL--VNSRRN 77
            E  F+ SLE L    KLL G       V +    +L++LL+  +   A   L   + ++N
Sbjct: 25   EATFQGSLEALLAHGKLLFGKPRFYSGV-FVKPGDLNSLLQQYSPCAAQLNLQPASKKKN 84

Query: 78   ISVLGAISRTFSIPSVSGPALQTCGYHIDCAIAESNQYSTRSKFQDKPMAACGSRAGLGE 137
            ISV+GA+SRTFS PSVSGP+ Q CG+HID   + S+++S+       PMA C SR+ LG 
Sbjct: 85   ISVMGAVSRTFSTPSVSGPSFQVCGFHIDNLQSGSSRFSSGISNLKMPMALCSSRSILGR 144

Query: 138  CSLENLSFRIARTSPPAISPSICFNKRSVDCCPKASMSLKNQEQPSNNVIYGYFTYNVAK 197
              +  +       +    S SI +  RS  CC K SM+ +N+EQ  ++ +YGYF Y+ AK
Sbjct: 145  SYMSTIISTRENLTGSIDSLSISYTSRSFHCCRKVSMNSRNKEQSDSSSVYGYFIYHAAK 204

Query: 198  RFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLSFDNSAREEQLANSTDSSAQKIPKGKSM 257
                     G   +  H S  + L AG+A ++  DN   ++QL NS DSS +K+   + +
Sbjct: 205  TNSIFDPFLGFQWKSFHISVPACLTAGTASDVFSDNRVHDDQLTNSADSSNRKLLSDRPL 264

Query: 258  KLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVADGVGGWADLGVDAGQYSRELMSNSV 317
            KL+SGSCYLPHPDKE+TGGEDAHFIC DEQAIGVADGVGGWADLG+DAG+Y+RELMSNSV
Sbjct: 265  KLLSGSCYLPHPDKEETGGEDAHFICSDEQAIGVADGVGGWADLGIDAGKYARELMSNSV 324

Query: 318  NAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIALTEQGLHAINLGDSGFMVVRDGCT 377
            +AVQ+EPKGS+DPARVLEKA++KTKAKGSSTACIIALT QGL+AINLGDSGFMVVRDGCT
Sbjct: 325  SAVQDEPKGSVDPARVLEKAYTKTKAKGSSTACIIALTNQGLNAINLGDSGFMVVRDGCT 384

Query: 378  IFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPVAPGDVIIAGTDGLFDNLYNNEITA 437
            +FRSP QQHDFNFT+QLE+G+N DLPSSGQVFSVPVAPGDVIIAGTDGLFDNLYNN+ITA
Sbjct: 385  VFRSPAQQHDFNFTYQLENGSNSDLPSSGQVFSVPVAPGDVIIAGTDGLFDNLYNNDITA 444

Query: 438  VVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPFSTAAQDAGFRSDDQNFWCFKSI-- 497
            +VVHA+RAGL  QVTAQKIAALARQRAQ+KDRQTPFS AAQ+AGFR   +    F+S+  
Sbjct: 445  IVVHAVRAGLEPQVTAQKIAALARQRAQEKDRQTPFSAAAQEAGFRWKHEPIE-FRSVVE 504

Query: 498  -LSCFTHSP--------------------------------------------------- 557
             L C  +                                                     
Sbjct: 505  SLECVIYRQFLGGLVVVIIIIIKLFVGDSIEAVNQISHRVPGAVSVPLVADWQAEAHMGG 564

Query: 558  ---------------------------TAPDFSMAG---RKDK--AQSARVSRIVIAIAI 617
                                        +P  +MAG   R+DK  AQS R SRI +AI I
Sbjct: 565  TRGQKGLLAQRTLEDETKSIEIIYRVHLSPKIAMAGPVARRDKNAAQSIRGSRIAVAIVI 624

Query: 618  GVLVGCLFAFLYPHGLFASD--LPVQNRRLGKSEFLVQSSSPCESSERFKMLKGHVVSIL 677
            G+L G +FA LYPHG F+++    +Q RRL KS   + S+S CESSER  MLK  +  + 
Sbjct: 625  GILFGGIFALLYPHGFFSANHASQLQGRRLAKSILQIGSTS-CESSERVNMLKSDLADLS 684

Query: 678  EKNSQLEKRIKDLTGELRIVEQTKDHAQKQYLALSENHKAGPFGTVKGLRTNPTVIPDES 737
             KN +L+K+++DLT ++   EQ    A++Q + + E  KAGPFGTVKG+RTNP V+PD++
Sbjct: 685  TKNDELKKQVRDLTKKVMAAEQKNGKAEQQVIVVGEPQKAGPFGTVKGIRTNPIVLPDDT 744

Query: 738  VNPRLAKLLEKVAIQRELIVTLANSNVQPMLEVWFTSIQKVGIPNYLVVALDDQTEEFCK 797
            VNPRL K+L+KVA+Q ELIV LANSNV+ MLEVWFTSI+KVGIPNYLVVALD++  +FCK
Sbjct: 745  VNPRLLKILKKVAVQNELIVALANSNVKEMLEVWFTSIKKVGIPNYLVVALDNRIADFCK 804

Query: 798  SHNVPVYTRDPDKSVDLIGKEGGNHQVSALKFRILREFLQLGYSVLLSDVDIVYLQNPFD 857
             ++VP YTRDPD+ +D + K GGNH VS LKFRILREFLQLGYSVLLSDVDIVYLQNPFD
Sbjct: 805  ENDVPYYTRDPDEDIDSVAKTGGNHAVSGLKFRILREFLQLGYSVLLSDVDIVYLQNPFD 864

Query: 858  HLYRDSDVESMSDGHSNMTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTLPSF 917
            H+YRDSDVESMSDGH NMTAYGYNDV D+P+MGWARYAHTMRIWVYNSGFFY+RPTLP+ 
Sbjct: 865  HIYRDSDVESMSDGHDNMTAYGYNDVSDDPSMGWARYAHTMRIWVYNSGFFYLRPTLPAI 924

Query: 918  ELLDRVATRLSQE-KAWDQAVFNEELFYPSRPGRDGLHASKRTMDMYLFMNSKVLFKTVR 934
            ELLDRVA RLS    AWDQAVFNE+LF+PS PG  GLHASKRTMD Y+FMNSK LFK VR
Sbjct: 925  ELLDRVAERLSHPPSAWDQAVFNEQLFFPSYPGYTGLHASKRTMDRYMFMNSKTLFKQVR 984

BLAST of Csor.00g140520 vs. ExPASy TrEMBL
Match: A0A1S4D7C0 (Glycosyltransferase OS=Nicotiana tabacum OX=4097 GN=LOC107826729 PE=3 SV=1)

HSP 1 Score: 1073 bits (2774), Expect = 0.0
Identity = 561/921 (60.91%), Positives = 671/921 (72.86%), Query Frame = 0

Query: 16  GWEGGFKSSLEFLFGQQKLLCGSSSLLHSVPYSSLTELHALLRPGTISGASSEL--VNSR 75
           G E   +  +E L  +++LL G      SVP + L++LH ++RPGT++ A + L  VN R
Sbjct: 23  GQESRLQDLVEILAAEERLLFGK--FFCSVPSAGLSDLHVIVRPGTLAAAQANLNIVNQR 82

Query: 76  RNISVLGAISRTFSIPSVSGPALQTCGYHIDCAIAESNQYSTRSKFQDKPMAACGSRAGL 135
           +N SV+ AI R  SIPSVSGPA Q CGYHID  ++E  Q S  +     PMA CGSR  +
Sbjct: 83  KNFSVVSAIPRALSIPSVSGPAFQVCGYHIDRLLSEPTQVSLETDSHKAPMAICGSRTSV 142

Query: 136 GECSLENLSFRIARTSPPAISPSICFNKRSVDCCPKASMSLKNQEQPSNNVIYGYFTYNV 195
           G CS   ++ R  +      SP+  ++ R+ D   KASMSL+N  QP++ V+YGYFTYN 
Sbjct: 143 G-CSSSKMTSRHLKPCFSVNSPTTLYSSRNFDNSQKASMSLRNNNQPNDFVVYGYFTYNA 202

Query: 196 AKRFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLSFDNSAREEQLANSTDSSAQKIPKGK 255
            K    S ++ G G +  HSSS + ++AG+AP++SFDNS RE   A+S +S  Q I   +
Sbjct: 203 VKSKGISNVYEGFGFKGFHSSSAACISAGAAPDVSFDNSLREVHPASSANSPEQNIHIDR 262

Query: 256 SMKLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVADGVGGWADLGVDAGQYSRELMSN 315
           S+KL SGSCYLPHPDKE+ GGEDAHFIC+DEQAIGVADGVGGWAD+GVDAGQY+RELMSN
Sbjct: 263 SLKLNSGSCYLPHPDKEEKGGEDAHFICIDEQAIGVADGVGGWADVGVDAGQYARELMSN 322

Query: 316 SVNAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIALTEQGLHAINLGDSGFMVVRDG 375
           SV+A++EEPKGS+DPARVLEKA+S TKAKGSSTACIIALT++GLHAINLGDSGF+VVRDG
Sbjct: 323 SVSAIREEPKGSVDPARVLEKAYSHTKAKGSSTACIIALTDEGLHAINLGDSGFLVVRDG 382

Query: 376 CTIFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPVAPGDVIIAGTDGLFDNLYNNEI 435
           CT+FRSPVQQHDFNFTFQLESG+ GDLPSSG+V+ +PVAPGDVIIAGTDGLFDNLYN++I
Sbjct: 383 CTVFRSPVQQHDFNFTFQLESGSAGDLPSSGEVYKIPVAPGDVIIAGTDGLFDNLYNSDI 442

Query: 436 TAVVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPFSTAAQDAGFRSDDQNFWCFKSI 495
           TA+VVHA RAGL  QVTAQKIAALARQRA D                             
Sbjct: 443 TAIVVHATRAGLAPQVTAQKIAALARQRAXDP---------------------------- 502

Query: 496 LSCFTHSPTAPDFSMAGRKDKAQSARVSRIVIAIAIGVLVGCLFAFLYPHGLFASDLPVQ 555
                                                           PH L  S+L V 
Sbjct: 503 ------------------------------------------------PHPLSKSNLQV- 562

Query: 556 NRRLGKSEFLVQSSSPCESSERFKMLKGHVVSILEKNSQLEKRIKDLTGELRIVEQTKDH 615
                        SS CES+ER  ML      + EKN++L++++++L  +L++  Q    
Sbjct: 563 ------------GSSNCESTERVNMLNSENRKLSEKNAELQRQVRELNQKLQVAAQGNGR 622

Query: 616 AQKQYLALSENHKAGPFGTVKGLRTNPTVIPDESVNPRLAKLLEKVAIQRELIVTLANSN 675
           AQ+Q +  S+  KAGPFGTVK LRTNP V+PDESVNPRLAK+L ++A+ +E+IV LANSN
Sbjct: 623 AQEQLVVSSQPQKAGPFGTVKSLRTNPPVVPDESVNPRLAKILAEIAVSKEVIVALANSN 682

Query: 676 VQPMLEVWFTSIQKVGIPNYLVVALDDQTEEFCKSHNVPVYTRDPDKSVDLIGKEGGNHQ 735
           V+ MLEVWF SI+KVGIPNYLVVALDD   +FCK ++VPVY RDPD +VD IGK GGNH 
Sbjct: 683 VRSMLEVWFNSIKKVGIPNYLVVALDDAIVDFCKENDVPVYKRDPDDNVDFIGKNGGNHA 742

Query: 736 VSALKFRILREFLQLGYSVLLSDVDIVYLQNPFDHLYRDSDVESMSDGHSNMTAYGYNDV 795
           VS LKFRILREFLQLGYSVLLSDVDIVYLQNPFDHLYRDSDVESMSDGH+NMTAYGYNDV
Sbjct: 743 VSGLKFRILREFLQLGYSVLLSDVDIVYLQNPFDHLYRDSDVESMSDGHNNMTAYGYNDV 802

Query: 796 FDEPAMGWARYAHTMRIWVYNSGFFYIRPTLPSFELLDRVATRLS-QEKAWDQAVFNEEL 855
           FDEP+MGWARYAHTMRIWVYNSGFFYIRPT+PS ELLDRVA RL+ Q  +WDQAVFNEEL
Sbjct: 803 FDEPSMGWARYAHTMRIWVYNSGFFYIRPTIPSIELLDRVADRLTKQPNSWDQAVFNEEL 851

Query: 856 FYPSRPGRDGLHASKRTMDMYLFMNSKVLFKTVRKDPKLRQLKPVIVHINYHPDKYPRMK 915
            +PS PG  GL+AS+RTMD+YLFMNSKVLFKTVRKD  L++LKPVIVH+NYHPDK+PRMK
Sbjct: 863 AFPSHPGYVGLYASRRTMDIYLFMNSKVLFKTVRKDANLKKLKPVIVHVNYHPDKFPRMK 851

Query: 916 AVVEFYVNGQQNALDSFPDGS 933
           AVVE+YVNG+Q+ALD+FPDGS
Sbjct: 923 AVVEYYVNGKQDALDAFPDGS 851

BLAST of Csor.00g140520 vs. TAIR 10
Match: AT1G75110.1 (Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 598.6 bits (1542), Expect = 8.7e-171
Identity = 292/429 (68.07%), Positives = 349/429 (81.35%), Query Frame = 0

Query: 508 MAGRKDKAQSARVSRIVIAIAIGVLVGCLFAFLYPHGLF--ASDLPVQNRRLGKSEFLVQ 567
           MAGR+D+ Q  R SRI IAI +G+L+GC+ + L+P+G F   S L     R+ KS     
Sbjct: 1   MAGRRDRIQQLRGSRIAIAIFVGILIGCVCSVLFPNGFFNSGSSLIANEERISKST-STD 60

Query: 568 SSSPCESSERFKMLKGHVVSILEKNSQLEKRIKDLTGELRIVEQTKDHAQKQYLALSENH 627
             + CESSER KMLK     I  KN++L K++++LT ++R+ EQ  ++A+KQ L L    
Sbjct: 61  GLASCESSERVKMLKSDFSIISVKNAELRKQVRELTEKVRLAEQETENARKQVLVLGSEI 120

Query: 628 KAGPFGTVKGLRTNPTVIPDESVNPRLAKLLEKVAIQRELIVTLANSNVQPMLEVWFTSI 687
           KAGPFGTVK LRTNPTV+PDESVNPRLAKLLEKVA+ +E+IV LANSNV+PMLE+   S+
Sbjct: 121 KAGPFGTVKSLRTNPTVVPDESVNPRLAKLLEKVAVNKEIIVVLANSNVKPMLELQIASV 180

Query: 688 QKVGIPNYLVVALDDQTEEFCKSHNVPVYTRDPDKSVDLIGKEGGNHQVSALKFRILREF 747
           ++VGI NYL+VALDD  E FC+S  V  Y RDPDK+VD++GK GGNH VS LKFR+LREF
Sbjct: 181 KRVGIQNYLIVALDDSMESFCESKEVVFYKRDPDKAVDMVGKSGGNHAVSGLKFRVLREF 240

Query: 748 LQLGYSVLLSDVDIVYLQNPFDHLYRDSDVESMSDGHSNMTAYGYNDVFDEPAMGWARYA 807
           LQLGYSVLLSDVDIV+LQNPF HL+RDSDVESMSDGH N TAYG+NDVFDEP+MGWARYA
Sbjct: 241 LQLGYSVLLSDVDIVFLQNPFSHLHRDSDVESMSDGHDNNTAYGFNDVFDEPSMGWARYA 300

Query: 808 HTMRIWVYNSGFFYIRPTLPSFELLDRVATRLSQEKAWDQAVFNEELFYPSRPGRDGLHA 867
           HTMRIWV+NSGFFY+RPT+PS +LLDRVA  LS+ +AWDQAVFNE+LFYPS PG  GLHA
Sbjct: 301 HTMRIWVFNSGFFYLRPTIPSIDLLDRVADTLSKSEAWDQAVFNEQLFYPSHPGYTGLHA 360

Query: 868 SKRTMDMYLFMNSKVLFKTVRKDPKLRQLKPVIVHINYHPDKYPRMKAVVEFYVNGQQNA 927
           SKR MDMY FMNSKVLFKTVRK+ +L++LKPVIVH+NYHPDK  RM AVVEFYVNG+Q+A
Sbjct: 361 SKRVMDMYEFMNSKVLFKTVRKNQELKKLKPVIVHLNYHPDKLERMHAVVEFYVNGKQDA 420

Query: 928 LDSFPDGSE 935
           LDSFPDGS+
Sbjct: 421 LDSFPDGSD 428

BLAST of Csor.00g140520 vs. TAIR 10
Match: AT1G19360.1 (Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 596.7 bits (1537), Expect = 3.3e-170
Identity = 295/429 (68.76%), Positives = 351/429 (81.82%), Query Frame = 0

Query: 508 MAGRKDKAQSARVSRIVIAIAIGVLVGCLFAFLYPHGLFASDLPVQ-NRRLGKSEFLVQS 567
           MAGR+D++Q  R SRI IAI IG+ +GC+ A L+P+G F S   ++ +  L KS   V  
Sbjct: 1   MAGRRDRSQQLRGSRIAIAILIGIFIGCVCAVLFPYGFFNSSSSLKASEHLSKSSNQV-G 60

Query: 568 SSPCESSERFKMLKGHVVSILEKNSQLEKRIKDLTGELRIVEQTKDHAQKQYLALSENHK 627
           SS CES ER KMLK   V++ EKN++L+K++++LT +LR+ EQ  D+A+KQ LAL    K
Sbjct: 61  SSACESPERVKMLKSDFVTLSEKNAELKKQVRELTEKLRLAEQGSDNARKQVLALGTQIK 120

Query: 628 AGPFGTVKGLRTNPTVIPDESVNPRLAKLLEKVAIQRELIVTLANSNVQPMLEVWFTSIQ 687
           AGPFGTVK LRTNPT++PDES+NPRLAK+LE++A+ +E+IV LAN+NV+ MLEV   SI+
Sbjct: 121 AGPFGTVKSLRTNPTILPDESINPRLAKILEEIAVDKEVIVALANANVKAMLEVQIASIK 180

Query: 688 KVGIPNYLVVALDDQTEEFCKSHNVPVYTRDPDKSVDLIGKEGGNHQVSALKFRILREFL 747
           +VGI NYLVVALDD  E  CK ++V  Y RDPDK VD +GK GGNH VS LKFR+LREFL
Sbjct: 181 RVGITNYLVVALDDYIENLCKENDVAYYKRDPDKDVDTVGKTGGNHAVSGLKFRVLREFL 240

Query: 748 QLGYSVLLSDVDIVYLQNPFDHLYRDSDVESMSDGHSNMTAYGYNDVFDEPAMGWARYAH 807
           QLGY VLLSDVDIV+LQNPF HLYRDSDVESMSDGH N TAYG+NDVFDEPAMGWARYAH
Sbjct: 241 QLGYGVLLSDVDIVFLQNPFSHLYRDSDVESMSDGHDNHTAYGFNDVFDEPAMGWARYAH 300

Query: 808 TMRIWVYNSGFFYIRPTLPSFELLDRVATRLSQEKAWDQAVFNEELFYPSRPGRDGLHAS 867
           TMRIWV+NSGFFY+RPT+PS ELLDRVA RLS+ K WDQAVFNEELFYPS P    LHAS
Sbjct: 301 TMRIWVFNSGFFYLRPTIPSIELLDRVADRLSKAKVWDQAVFNEELFYPSHPEYTALHAS 360

Query: 868 KRTMDMYLFMNSKVLFKTVRKDPKL-RQLKPVIVHINYHPDKYPRMKAVVEFYVNGQQNA 927
           KR MDMY FMNSKVLFKTVRK+ +L +++KPVIVH+NYHPDK  RM+AVVEFYVNG+Q+A
Sbjct: 361 KRVMDMYEFMNSKVLFKTVRKNHELKKKVKPVIVHVNYHPDKLNRMQAVVEFYVNGKQDA 420

Query: 928 LDSFPDGSE 935
           LDSFPDGSE
Sbjct: 421 LDSFPDGSE 428

BLAST of Csor.00g140520 vs. TAIR 10
Match: AT4G16580.1 (Protein phosphatase 2C family protein)

HSP 1 Score: 504.6 bits (1298), Expect = 1.7e-142
Identity = 288/466 (61.80%), Positives = 341/466 (73.18%), Query Frame = 0

Query: 26  EFLFGQQKLLCGSSSL---LHSVPYSSLTELHALLRPGTISGASSE--LVNSRRNISVLG 85
           E L  Q K+L G  +L    +   Y+  T  +  L P     ASS+  L+N RRN+SV+G
Sbjct: 6   ESLQKQVKILIGLGNLGFGGYRGLYTRFTNPNGFLEP-----ASSDLLLINERRNLSVIG 65

Query: 86  AISRTFSIPSVSGPALQTCGYHIDCAIAESNQYSTRSKFQDKPMAACGSRAGLGECSLEN 145
           A+SRTFS+PSVSGPA Q CGYHID  +++                 C S A LG  SL  
Sbjct: 66  AVSRTFSVPSVSGPAFQVCGYHIDLLLSD----------------PCKSMASLGSKSL-- 125

Query: 146 LSFRIARTSPPAISPSICFNKRSVDCCPKA--SMSLKNQEQPSNNVIYGYFTYNVAKRFC 205
               + R S   +S        S D   +   SM L+ ++    + I  YF Y  AKR+ 
Sbjct: 126 ---FVDRHSASLVSKRFTGGMVSGDGPNRGRISMRLRGKDHNEKSTICAYFAYRGAKRWI 185

Query: 206 SSYLH---AGLGARDLHSSSTSSLAAGSAPNLSFDNSAREEQLANSTDSSAQKIPKGKSM 265
             YL+    G+G R LHSS ++ L+AG+AP++S DNS  +EQ+ +S+DS A K+   K +
Sbjct: 186 --YLNQQRRGMGFRGLHSSLSNRLSAGNAPDVSLDNSVTDEQVRDSSDSVAAKLCT-KPL 245

Query: 266 KLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIGVADGVGGWADLGVDAGQYSRELMSNSV 325
           KLVSGSCYLPHPDKE TGGEDAHFIC +EQA+GVADGVGGWA+LG+DAG YSRELMSNSV
Sbjct: 246 KLVSGSCYLPHPDKEATGGEDAHFICAEEQALGVADGVGGWAELGIDAGYYSRELMSNSV 305

Query: 326 NAVQEEPKGSIDPARVLEKAHSKTKAKGSSTACIIALTEQGLHAINLGDSGFMVVRDGCT 385
           NA+Q+EPKGSIDPARVLEKAH+ TK++GSSTACIIALT QGLHAINLGDSGFMVVR+G T
Sbjct: 306 NAIQDEPKGSIDPARVLEKAHTCTKSQGSSTACIIALTNQGLHAINLGDSGFMVVREGHT 365

Query: 386 IFRSPVQQHDFNFTFQLESGNNGDLPSSGQVFSVPVAPGDVIIAGTDGLFDNLYNNEITA 445
           +FRSPVQQHDFNFT+QLESG NGDLPSSGQVF+V VAPGDVIIAGTDGLFDNLYNNEITA
Sbjct: 366 VFRSPVQQHDFNFTYQLESGRNGDLPSSGQVFTVAVAPGDVIIAGTDGLFDNLYNNEITA 425

Query: 446 VVVHAMRAGLGSQVTAQKIAALARQRAQDKDRQTPFSTAAQDAGFR 482
           +VVHA+RA +  QVTAQKIAALARQRAQDK+RQTPFSTAAQDAGFR
Sbjct: 426 IVVHAVRANIDPQVTAQKIAALARQRAQDKNRQTPFSTAAQDAGFR 442

BLAST of Csor.00g140520 vs. TAIR 10
Match: AT1G75120.1 (Nucleotide-diphospho-sugar transferase family protein )

HSP 1 Score: 504.2 bits (1297), Expect = 2.2e-142
Identity = 257/426 (60.33%), Positives = 309/426 (72.54%), Query Frame = 0

Query: 508 MAGRKDKAQSARVSRIVIAIAIGVLVGCLFAFLYPHGLFASDLPVQNRRLGKSEFLVQSS 567
           MA RK+K Q  R   I IA+ +G+ +GC+   L P+          N R  K      +S
Sbjct: 1   MAVRKEKVQPFRECGIAIAVLVGIFIGCVCTILIPNDFV-------NFRSSK-----VAS 60

Query: 568 SPCESSERFKMLKGHVVSILEKNSQLEKRIKDLTGELRIVEQTKDHAQKQYLALSENHKA 627
           + CES ER KM K     I EKN +L K++ DLT ++R+ EQ             E  KA
Sbjct: 61  ASCESPERVKMFKAEFAIISEKNGELRKQVSDLTEKVRLAEQ------------KEVIKA 120

Query: 628 GPFGTVKGLRTNPTVIPDESVNPRLAKLLEKVAIQRELIVTLANSNVQPMLEVWFTSIQK 687
           GPFGTV GL+TNPTV PDES NPRLAKLLEKVA+ +E+IV LAN+NV+PMLEV   S+++
Sbjct: 121 GPFGTVTGLQTNPTVAPDESANPRLAKLLEKVAVNKEIIVVLANNNVKPMLEVQIASVKR 180

Query: 688 VGIPNYLVVALDDQTEEFCKSHNVPVYTRDPDKSVDLIGKEGGNHQVSALKFRILREFLQ 747
           VGI NYLVV LDD  E FCKS+ V  Y RDPD ++D++GK   +  VS LKFR+LREFLQ
Sbjct: 181 VGIQNYLVVPLDDSLESFCKSNEVAYYKRDPDNAIDVVGKSRRSSDVSGLKFRVLREFLQ 240

Query: 748 LGYSVLLSDVDIVYLQNPFDHLYRDSDVESMSDGHSNMTAYGYNDVFDEPAMGWARYAHT 807
           LGY VLLSDVDIV+LQNPF HLYRDSDVESMSDGH N TAYG+NDVFD+P M  +R  +T
Sbjct: 241 LGYGVLLSDVDIVFLQNPFGHLYRDSDVESMSDGHDNNTAYGFNDVFDDPTMTRSRTVYT 300

Query: 808 MRIWVYNSGFFYIRPTLPSFELLDRVATRLSQEKAWDQAVFNEELFYPSRPGRDGLHASK 867
            RIWV+NSGFFY+RPTLPS ELLDRV   LS+   WDQAVFN+ LFYPS PG  GL+ASK
Sbjct: 301 NRIWVFNSGFFYLRPTLPSIELLDRVTDTLSKSGGWDQAVFNQHLFYPSHPGYTGLYASK 360

Query: 868 RTMDMYLFMNSKVLFKTVRKDPKLRQLKPVIVHINYHPDKYPRMKAVVEFYVNGQQNALD 927
           R MD+Y FMNS+VLFKTVRKD ++++LKPVI+H+NYH DK  RM+A VEFYVNG+Q+ALD
Sbjct: 361 RVMDVYEFMNSRVLFKTVRKDEEMKKLKPVIIHMNYHSDKLERMQAAVEFYVNGKQDALD 402

Query: 928 SFPDGS 934
            F DGS
Sbjct: 421 RFRDGS 402

BLAST of Csor.00g140520 vs. TAIR 10
Match: AT5G66720.1 (Protein phosphatase 2C family protein )

HSP 1 Score: 357.5 bits (916), Expect = 3.4e-98
Identity = 208/374 (55.61%), Positives = 261/374 (69.79%), Query Frame = 0

Query: 113 YSTRSKFQDKPMAACGSRAGLGECSLENL----SFRIARTSPPAISPSICFNKRSVDCCP 172
           +S  S+F+ + MAA GS    G+  L++L    S  +  T   +   S   N      CP
Sbjct: 38  FSDSSRFR-QAMAASGSLPVFGDACLDDLVTTCSNGLDFTKKRSSGGSFTIN------CP 97

Query: 173 KASMSLKNQEQPSNNVIYGYFTYNVAKRFCSSYLHAGLGARDLHSSSTSSLAAGSAPNLS 232
            ASM L  +     N +  +  Y+V      S    G  ++ +H+S  +  + G A  LS
Sbjct: 98  VASMRLGKRGGMMKNRLVCH--YSVVDPLEKSRALFGTLSKSVHTSPMACFSVGPAHELS 157

Query: 233 FDNSAREEQLANSTDSSAQKIPKGKSMKLVSGSCYLPHPDKEDTGGEDAHFICVDEQAIG 292
             N   +E    +T S        KS++LVSGSCYLPHP+KE TGGEDAHFIC +EQAIG
Sbjct: 158 SLNGGSQESPPTTTTSL-------KSLRLVSGSCYLPHPEKEATGGEDAHFICDEEQAIG 217

Query: 293 VADGVGGWADLGVDAGQYSRELMSNSVNAVQEEPKG-SIDPARVLEKAHSKTKAKGSSTA 352
           VADGVGGWA++GV+AG +SRELMS SV+A+QE+ KG SIDP  VLEKAHS+TKAKGSSTA
Sbjct: 218 VADGVGGWAEVGVNAGLFSRELMSYSVSAIQEQHKGSSIDPLVVLEKAHSQTKAKGSSTA 277

Query: 353 CIIALTEQGLHAINLGDSGFMVVRDGCTIFRSPVQQHDFNFTFQLESGNNGDLPSSGQVF 412
           CII L ++GLHAINLGDSGF VVR+G T+F+SPVQQH FNFT+QLESGN+ D+PSSGQVF
Sbjct: 278 CIIVLKDKGLHAINLGDSGFTVVREGTTVFQSPVQQHGFNFTYQLESGNSADVPSSGQVF 337

Query: 413 SVPVAPGDVIIAGTDGLFDNLYNNEITAVVVHAMRAGLGSQVTAQKIAALARQRAQDKDR 472
           ++ V  GDVI+AGTDG++DNLYN EIT VVV ++RAGL  + TAQKIA LARQRA DK R
Sbjct: 338 TIDVQSGDVIVAGTDGVYDNLYNEEITGVVVSSVRAGLDPKGTAQKIAELARQRAVDKKR 395

Query: 473 QTPFSTAAQDAGFR 482
           Q+PF+TAAQ+AG+R
Sbjct: 398 QSPFATAAQEAGYR 395
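
The identity and positives percentages in each HSP header are the match counts divided by the alignment length. As a minimal sketch (the helper name `parse_hsp_stats` and the returned keys are hypothetical, not part of any BLAST tool), the header line can be parsed and the percentages recomputed like this:

```python
import re

# Matches the HSP statistics line as printed in this report.
# "Posi?tives" accepts both the "Postives" and "Positives" spellings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract match counts from an HSP header and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, length, reported_identity, positives, _, reported_positives = match.groups()
    identical, length, positives = int(identical), int(length), int(positives)
    return {
        "identical": identical,
        "positives": positives,
        "aligned_length": length,
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100.0 * identical / length, 2),
        "positives_pct": round(100.0 * positives / length, 2),
        # Percentages as printed in the report, for comparison.
        "reported_identity_pct": float(reported_identity),
        "reported_positives_pct": float(reported_positives),
    }


stats = parse_hsp_stats(
    "Identity = 295/429 (68.76%), Postives = 351/429 (81.82%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the AT1G19360.1 hit, the recomputed values agree with the printed ones: 295 of 429 aligned positions are identical and 351 of 429 are scored as positive.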

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9C9Q5 | 1.2e-169 | 68.07 | Arabinosyltransferase RRA2 OS=Arabidopsis thaliana OX=3702 GN=RRA2 PE=2 SV=1
Q9LN62 | 4.6e-169 | 68.76 | Arabinosyltransferase RRA3 OS=Arabidopsis thaliana OX=3702 GN=RRA3 PE=2 SV=1
Q9SUK9 | 2.4e-141 | 61.80 | Probable protein phosphatase 2C 55 OS=Arabidopsis thaliana OX=3702 GN=At4g16580 ...
Q9C9Q6 | 3.1e-141 | 60.33 | Arabinosyltransferase RRA1 OS=Arabidopsis thaliana OX=3702 GN=RRA1 PE=2 SV=1
Q9LVQ8 | 4.7e-97 | 55.61 | Probable protein phosphatase 2C 80 OS=Arabidopsis thaliana OX=3702 GN=At5g66720 ...

Match Name | E-value | Identity | Description
KAG6608522.1 | 0.0 | 100.00 | Arabinosyltransferase RRA2, partial [Cucurbita argyrosperma subsp. sororia]
KAG7037845.1 | 0.0 | 97.38 | Arabinosyltransferase RRA3 [Cucurbita argyrosperma subsp. argyrosperma]
KAF9833896.1 | 0.0 | 67.80 | hypothetical protein H0E87_030678 [Populus deltoides]
BBG94933.1 | 0.0 | 66.74 | Protein phosphatase 2C family protein, partial [Prunus dulcis]
XP_022768910.1 | 0.0 | 67.48 | probable protein phosphatase 2C 55 isoform X1 [Durio zibethinus]

Match Name | E-value | Identity | Description
A0A4Y1QST1 | 0.0 | 66.74 | Glycosyltransferase (Fragment) OS=Prunus dulcis OX=3755 GN=Prudu_003335 PE=3 SV=...
A0A6P6AVU3 | 0.0 | 67.48 | Glycosyltransferase OS=Durio zibethinus OX=66656 GN=LOC111312683 PE=3 SV=1
A0A6P6AVS4 | 0.0 | 65.78 | Glycosyltransferase OS=Durio zibethinus OX=66656 GN=LOC111312683 PE=3 SV=1
A0A5N6NI82 | 0.0 | 58.43 | Glycosyltransferase OS=Mikania micrantha OX=192012 GN=E3N88_20854 PE=3 SV=1
A0A1S4D7C0 | 0.0 | 60.91 | Glycosyltransferase OS=Nicotiana tabacum OX=4097 GN=LOC107826729 PE=3 SV=1

Match Name | E-value | Identity | Description
AT1G75110.1 | 8.7e-171 | 68.07 | Nucleotide-diphospho-sugar transferase family protein
AT1G19360.1 | 3.3e-170 | 68.76 | Nucleotide-diphospho-sugar transferase family protein
AT4G16580.1 | 1.7e-142 | 61.80 | Protein phosphatase 2C family protein
AT1G75120.1 | 2.2e-142 | 60.33 | Nucleotide-diphospho-sugar transferase family protein
AT5G66720.1 | 3.4e-98 | 55.61 | Protein phosphatase 2C family protein

InterPro
Analysis Name: InterPro Annotations of Silver-seed gourd (sororia) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 583..617
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 233..249
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 233..252
None | No IPR available | PANTHER | PTHR46581:SF6 | GLYCOSYLTRANSFERASE | coord: 512..934
IPR001932 | PPM-type phosphatase domain | SMART | SM00331 | PP2C_SIG_2 | coord: 271..492; e-value: 0.0013; score: 14.0
IPR001932 | PPM-type phosphatase domain | SMART | SM00332 | PP2C_4 | coord: 252..438; e-value: 1.3E-4; score: 11.7
IPR001932 | PPM-type phosphatase domain | PFAM | PF07228 | SpoIIE | coord: 285..464; e-value: 5.0E-7; score: 29.8
IPR001932 | PPM-type phosphatase domain | PROSITE | PS51746 | PPM_2 | coord: 261..480; score: 20.019505
IPR029044 | Nucleotide-diphospho-sugar transferases | GENE3D | 3.90.550.10 | Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain A | coord: 664..918; e-value: 6.0E-6; score: 27.8
IPR029044 | Nucleotide-diphospho-sugar transferases | SUPERFAMILY | 53448 | Nucleotide-diphospho-sugar transferases | coord: 666..866
IPR005069 | Nucleotide-diphospho-sugar transferase | PFAM | PF03407 | Nucleotid_trans | coord: 690..907; e-value: 6.0E-54; score: 183.2
IPR036457 | PPM-type phosphatase domain superfamily | GENE3D | 3.60.40.10 |  | coord: 263..469; e-value: 1.7E-18; score: 69.1
IPR036457 | PPM-type phosphatase domain superfamily | SUPERFAMILY | 81606 | PP2C-like | coord: 262..462
IPR044290 | Arabinosyltransferase RRA1/2/3 | PANTHER | PTHR46581 | ARABINOSYLTRANSFERASE RRA3 | coord: 512..934
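
The InterPro coordinates above cluster into two stretches of the protein. As a rough sketch (the function name `merge_intervals` is hypothetical, and the coil and disorder predictions are deliberately left out), merging the overlapping signature hits shows the two regions:

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) intervals into sorted, disjoint regions."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


# Coordinates copied from the InterPro table (protein positions).
phosphatase_hits = [
    (271, 492),  # SMART SM00331
    (252, 438),  # SMART SM00332
    (285, 464),  # PFAM PF07228
    (261, 480),  # PROSITE PS51746
    (263, 469),  # GENE3D 3.60.40.10
    (262, 462),  # SUPERFAMILY 81606
]
transferase_hits = [
    (512, 934),  # PANTHER PTHR46581 and PTHR46581:SF6
    (664, 918),  # GENE3D 3.90.550.10
    (666, 866),  # SUPERFAMILY 53448
    (690, 907),  # PFAM PF03407
]

regions = merge_intervals(phosphatase_hits + transferase_hits)
print(regions)
```

The merged regions do not overlap: the PPM-type phosphatase signatures span 252..492 and the glycosyltransferase signatures span 512..934. This is consistent with the BLAST alignments, where the protein phosphatase 2C hits align to query positions up to 482 and the arabinosyltransferase hits start at query position 508.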

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Csor.00g140520.m01 | Csor.00g140520.m01 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0071555 cell wall organization
biological_process GO:0006470 protein dephosphorylation
biological_process GO:0080147 root hair cell development
cellular_component GO:0000139 Golgi membrane
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016757 glycosyltransferase activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0004722 protein serine/threonine phosphatase activity
molecular_function GO:0016791 phosphatase activity
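
The GO rows above follow a simple three-field layout: category, accession, term name. A small sketch of grouping them by category, assuming whitespace-separated rows exactly as listed (the function name `group_go_terms` is hypothetical):

```python
from collections import defaultdict

# GO rows copied from the table above.
GO_ROWS = """\
biological_process GO:0071555 cell wall organization
biological_process GO:0006470 protein dephosphorylation
biological_process GO:0080147 root hair cell development
cellular_component GO:0000139 Golgi membrane
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016757 glycosyltransferase activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0004722 protein serine/threonine phosphatase activity
molecular_function GO:0016791 phosphatase activity
"""


def group_go_terms(text):
    """Group 'category accession name' rows into {category: [(accession, name)]}."""
    grouped = defaultdict(list)
    for line in text.strip().splitlines():
        # The term name may contain spaces, so split only on the first two gaps.
        category, accession, name = line.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)


go_terms = group_go_terms(GO_ROWS)
for category, terms in go_terms.items():
    print(category, [accession for accession, _ in terms])
```

Grouped this way, the molecular_function terms include both glycosyltransferase and phosphatase activities, matching the two sets of domain signatures reported by InterPro.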