Cla022052 (gene) Watermelon (97103) v1

NameCla022052
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionSubtilisin-like protease (AHRD V1 **-- Q9FGU4_ARATH); contains Interpro domain(s) IPR015500 Peptidase S8, subtilisin-related
LocationChr8 : 19897946 .. 19905217 (+)
   



The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGAAATGGAAGAATCAACAATGAAGAAGATGAACAAACAACTGCTGAATTGGATTACTTGCAACTTTTATCCTCTGTTATTCCAAGGTTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTATTTCATATTATAATAATCAAGGATTGATTTCCTCAAAAATTAATGTATTTTAGTTGGAATTAAAATGGGTTTTTGTTGCTTTTTGAGTAGGAAAGAGAAAGAGAGAGGATCAAGGGATGTGGTCCACCAATACAACCATGCTTTCAAAGGGTTTTCAGCAATGCTAACTGAGGAAGAAGCTTCTTCTCTGTCTGGTAATTTTTTGCTTTCTTATTTTTTTTTAAACCCTTCATATTAGTTTGTATATTTTGGTAATTTCAATGAAGACAAAATTAACATATAAATAAATAAATATAAATGTTGGGGCTAAAAAAAACTTATCCATCAATATTAAAGAGAAAGACCTTGTCTTTCTTATTCAATTTCTTGACTATAAAAAATTAGTTGTAGAGTAGTAGTTGGGCCATTTCTTAATATTACTTTTCAGTTATTTAAAAATATTATTCAACCCTTACACCTACTATTACAACTTATAAATAAAATCACTGGATGAACGATTGAGGTAAAGAGAGAGTTGTGAGATCTGAAAATTGCAAACTTCTAACAATCACAGTTTAAAGAGTTAATAAGTTATAAATTGATGTTTTTTTTAGTCGTGGGACTTATGAGTTTCTTGGACTAAATAAAGGTAGCTTCACAATGCTTCAGCCTTTTCGCAGCATTGTGTGTAAAGTGCAACGATCTTTTATTTGGACAATTTCACTCTTTTTCTTCTCTTGGCTATTCTAAATCCAAGACAAGAATGCACTTTTTATGTGAATTTCATTGGAGTTCACTTCATAACGCCGAATGAAAACTCGAGATTTTAAATTTTGAGTGAATGAGTTACTATGATTTAAATCAAACATTCTCATGTTAACGAATAATGATACATAGATTAAATTATAGGTTAAATTTTGAAGTCTCTTAGTAATTTAACATTACATATCTTAGACAATATTATAATAATTTTTTTTTTTAAAAAAAGGTATGTCATTACTATTCAATCAAATTAGTATATTATAAACTATATGGTATAAAAGAGAATGATATGATATGTAGATTTGTATAGATGGTAGAAGATACTAAACTTAGCATTTGTTTGTGTATTTTTCAAGGCTTTTTCCACATGGTCCACTGGCTTATTTAACATTGTCTCTATTTCATTTATTATTGGTATGTGTGTGATATATAATTAATAATAATTATAAGGTGAATTGAGAGACAAATAACCTTATATATAATAAAGAAAATTTTGATGTGGAATAGTACAATAATCACAGCACATGGTATCATGATGCCTACATCCATGCATTTGGTTTATTAAAAAAGATGAGTACATTAAATTGGATATACTTCAATCTCATACTAAAATTTCTCCAAAATTATCTAAATTACTTTATGTTTAAAATATAGTGAACCATCAAATTTATCTAAGCTTAATACTTCAAAGTTTTAAATCAAAGTGAATTTTATATTAATAGATAAAAAGTCCTAGACATGGTGGAAAATTTTAGTTGTTTTTTCTCAAATTTTGTTGAGAAAATTTTTATTTTACTTTCCCCATTTCGACAAATCCAGAAAATTTTGATTTTTTTTTTTTTGGAATTAGTTTAGTTAGATTAGTTCAAGTTTTTCTATATTATGATCATGATTATACATAAGTGTATAGTTTCACACACCAATGTCATTTTTGACGAAGGATGTGCTACTTTTTTTTTTTTTTTTTAATTTCAAACCTTTTTCAATTTGCAGCTTTATTTTCAATTCAAAACTTTTGTTTAAGAATAAATTTAATTTGACAAAACCACAAAAAAGAACCTATATTTATTTTTAATCTCATTCGAATTTTTTCCATTAAATTTTTCACATCATTTCACCTCAAAATTGAATCGTGAGTTTTTTTTTTTTTTTTCCTTTTACCATCCTAAAACTTTTATTTTCACATCATTTCATCTCAAAATATTTTTAAACTTTATATTTCTTTTGTAATTATTTTTCCATTTCTTTTTTAAATTTCTCCTCTTGCATCATCTTTTACACAGACATTTTCAATATACATCTAAATGTTGTCTTACTTTATAGAAATGATTATTGAAAATGGTCAATTATAGTCTAAATGCACAATCAAAGTTTCATTAGCTGATTATAACAATTTTCATAAAGTTTCTCGAAATTTCTATCGATATTTCCATCAACATCAATGTCTTGAACCTTGGTTTAGATTCATGCTATAAATGCTAACTATTGAAACTTTTGATTTCGATAGTGATTTAATATTTTTGATAGAGACAAAATATTCATAGATTCTTGCCTTTCTTTATCTAAATCTTTTACTAGGAAAGATACTAATGTGGTTGATTTTTTTCCTTACCTAATAAGGACTTTTATCCTAAGGAAGGTAAATTAAACTTTCCTTAGAGGTATCAAAAAAAACTAAAAAGAAAAAAGTTGCAAGGATAAGAGAATTTGATTCCTTAAGAAAAGGACATTGGTTCCTCAAGAATATATGAAGGAAATCCCTTTCTATAACATGACATAGAATTGATAAAATAAAGAAGAGGAATGTTCTAAATCTACAACCTATACAATTAAAATTATAGAAAGAAAAAAAATACAACTTATGCAACTTTAGAAAAACATAGATATTTAACCACTTACGCTCTTACGCTCTACCAAGGTTCACTTATTTGTTGGATTGTGAATATATAATATATTCCCTAACTCAGCTCTAAAGCTATCATATTAACTAACATAAGTAGAGTTCAATAAATCTCAATTCCAACGAAAAATCATTAGATATCTTACGGATGAATCTTTCTTTTTTCTTTTCCGACTCCAATTAGATTCAGGTTAGATGTACTTTTTAGTTATTGGGAGAATTCAATTTCTCCTATTTATGCAAAAAAAATCAACTCTTAAAGATCTTTTCCTTTTTCTTTTGGAAGTTGCAAGATGAAACAATAGACCTATTGACGTTGTAAGAACCAAATGATTCTTTGCATTACATATTTGTCATAAATGTTAAATATTCAAATTTTCATCTCCACAATGTTGACTAAAAAAATAATAAATAAGACACCATACTTTTATCTAAAATATCAAATGTTCAAATTCTCACTTAACTTTTTTTAACCAGAAAAATCAACTAATCAAAACGAGTAAATTCAAATTCTCAGCTTTACCATATATTGAAACAATTGAATAGTCTTAGAATTATAGTCCATCACAATTGTTTTTGAAAAAAAAAAAAAAAAACACAATTGTATAATTATAACCACACTCACAATTCACGATTTTGCTATTCATAGTACAATAGGAGTGAGAGGATTTGAAGTTTGGAACCCAAAATCCTTGATTGATATTGCATGTTAGTTAAGTTATACTTTATATAATATAAGTAATAATAGAAATAATTACATAATTAAAAAGAAGAAATATTTAGCCAAATAATTCATTCTACATCAAACAGATAAGATTTGAGCTATCCATGTATAAAATCCATAAAAATATCATCACAAAAATATAATATAATAATTATGACGTCAATTAGTTTATTTATTTATTTTAATATGCTTCTCTTCTTGAAGTGAACAATAAAAAGCATAAACAATTGAATTTTTTTTTCTTATATTAAAAAAACACAAATAATATAAACATTTAGTACATTATTTGGTAAAATGTTTGTAATTAGAAAAAGGAAATTTTGATAAAAATGTGGAGGAAACATCAGGTATTGATGGAATCGTGTCGGTGTTCCCTGATCCGACGCTTCACCTCCACACTACACGTTCTTGGGATTTCTTGGACTCCATCTCCGGCCTCCGCCCTCCCACGCCACTTCCGCCGCCGCATTCTTACCCCAGTACATCGGATGTCATCGTCGGCGTCATTGACACCGGTAATTTCCACTTTCCTTTCTCATTACAATACGAACCATGGGTTCTTGAATCTTCATCATACTTCATTGCATAAAGTTAGAGAGAACCTAATTTCTGACAATGTGGTCTCGACGGACGCCTTTCGGAAGAGACAATGAAATTAATAATCTGTATGTTGTGGATTAGGGATTTGGCCGGAGTCTCAATCTTTCAACGATGAGGGGATTGGAGAAATTCCATCTAAATGGAAAGGAGTCTGTATGGAGGCACCTGATTTCAAGAAGTCTAATTGCAACAGGTCATAATTATTGTATAGACTCTCATTCATTTTGGTGGTTGTAATGTAGAATTTAAAATTCTTCATTCAACATTCATCTAAGAGAAAATGTAGTGGAAAATGGTGAGGCTTCTGATGGAATTATAAGTTTGTGTCTGTAGGAAGTTGATAGGTGCAAGATACTATAATGTTATAGAACTCAATGGGAATGATAGCCATGTGGGGGCTCCGAAGGGCACACCGAGGGATTCGCTCGGCCATGGGACCCACACTGCGTCGATAGCAGCCGGAGCCAGAGTCCCCAATGCAAGTTACTTTGGCTTAGCGAGAGGGACGGCGAGAGGCGGTGGCATTCCTTCCACAAGGATTGCAAGCTATAAGGTTTGTGCTGGTGTTGGATGCTCTGGTGCTGCAATTCTCAAAGCCATTGATGATGCAGTTAGGGATGGAGTTGATATCATTTCGATCTCGATTGGGATTGGTTCCCCTTTGTTTCAATCTGATTATTTGAATGACCCAATTGCCATTGGAGCATTCCATGCCCAACTAATGGGAGTTTTGGTTGTCTGCTCTGGTGGGAATGATGGCCCTGACCCTAACACTGTGGGGAACGTTGCTCCTTGGATTTTCACTGTTGCTGCTTCTAATATTGACAGGGATTTCCAGTCCTCTGTGGTTCTTGGCAATGGGAAGACTTTTCATGTAAGATACTTTCCTTTTAATGGTTTCTAAAATCTGTCTATTGATTATGAATTGTCATGCTAATTCTTTTAAGGAAATTAGGACTCCTGATGCAAGTTTGATTTTTATGTATAAATTGAAGTTTAGGAAGCTCTTTTTGAATTCCATAAATCAATTTTATACTATTTTCCAGGGGACTGCTATAAATCTCTCAAATCTTACTAGCTCAAAGACTTATCCTCTTGTATTTGGAAAGGACGCTGCTGCTAAATTCACACCCGTATCAGAAGCAAGGTTCATAATATTCTAATCACTTTTGACATGAAACTTTCAAACTTGCATACTGTTTGAATTTGAAAACAGTTTTATGCTTTCTGGAACAAAAGTTTAGTGGAACGTAATCTTTGAATAGCCGTTTTCCTTGTTTTCTTTTTGTTTGTTTTCATTCATGAATGTTCTAGTTTTTAAAAACACTGAATTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCGTAAATTGGGACTTTAAAATCCTGATGAAAACCTGTACAAGTCCTCCAAAACCACTCTTGAAAATAGTTTTGAAAACAAAAGTAGATTCGGTTTTTCCATGTAGCTCCCAAACATGTTCTCTTTTTTCTTCGGAAAATAGATAATACTAAGGAGGGTAATGGTTATGAATTATGAGTTCGAAAGAAGATTGCATGCCCCTTCCCATTGCATTAGCTAACCTTCTGTACAATACATTGATGACAGGAATTGTTATCCAGGATCGCTGGATCGATCAAAAGTTGCAGGCAAGATTGTCGTTTGTGCCTCTGATGACTTCAGTACTTCAAGGACAATAAAGGAACTGGTCGTTCAAGATGCTAAAGCCATGGGGTTAATACTGATCAATGAGGCATCAAAAACTGTGCCAATGGATTCAAACATTTTCCCATTTACACAAATTGGGAATTCAGATGGTCTTCAGATTCTTGAGTACATTAACTCCACCAAGTAAGCAACAGATTGATTCTATTGTTTCTTTTTCTGACATGTTTAAACCTTCATTCTGATATAATATTTTCTTCTTCTTCCATAATCTCAAACAATACAGGAACCCAACAGCCACAATTCTCAGAACAGTGGAAGTTCAAAGACTCAAACCGGCTCCAATTGTGGCTTATTTCTCATCGAGAGGCCCATCTCCACTTACCGAAAACATTCTCAAGGTCGATCTCCTTCTAAACACTATTCATTCGAAGTCAGTACTATGAGTTATTAATGTATATTTGTGTACATTAAGTAATGATGGACATTTAATTTACAATGCAGCCGGATATTACAGCTCCGGGAGTATCCATTTTAGCAGCTATGATTCCAAAGAGTGATGGAGATAGTGGTCCAATTGGTAAGAAGCCTTCCAATTATGCAATGAGATCCGGGACGTCAATGGCATGCCCTCATGTAGCAGGCGCTGCTGCATTCATCAAATCGGTTTATCACGACTGGAGTTCTTCCATGATCAAATCTGCACTCATGACAACAGGTTACCTTCTGAAATCCCAAGTTCTATGTATTTCATTATCAAGCCTTGATTAGACTAGACGCCTGTACTTTTTATGGTTTATCGTCACAAAAGATGCATGGAAATCAAGTGGTTTGGTTGATTCCCTTTTCTTGATGTATATTTTCATTTACAGCAACTCAATACGATAATCAAAGGAAATTCATGAGAAACAGCACAAACAACCCTTCAAACCCACATGAGATGGGAGCTGGAGAAATAAGCCCCATAAAAGCTCTTAATCCTGGATTGGTTTTCGAAACTACGAACGAAGATTATCTTCGTTTCCTCTGTTATTATGGGTATTCAAACAAGGTCATAAGATCTGTGTCGAAACAAAACTTCAGCTGTCCGAAAACTTCAAAAGAAGATCTTATCTCCAATGTCAATTATCCATCCATCTCCATTGGAAAACTAGACAGTAAACAAGCTGCCAAAGTAATAGAAAGAACTGTGACAAACGTAGGAGCTCCAGATGCCACTTACATTGCCAAGGTTCATTCTTCAGAGGGTTTAATAGTGAAAGTGAATCCAAGGAAGATTGTTTTCTCTGAGAAGGTAAAGAAAGTGACCTTCAAAGTTTCATTTTACGGCAAGGAGGCTCGCAGCGGCTACAACTTCGGGACGATCACGTGGCGGGACACTGCACATTCCGTTCGAACTTTTTTTGCTGTAAATGTAATATAA

mRNA sequence

ATGGGAAATGGAAGAATCAACAATGAAGAAGATGAACAAACAACTGCTGAATTGGATTACTTGCAACTTTTATCCTCTGTTATTCCAAGGAAAGAGAAAGAGAGAGGATCAAGGGATGTGGTCCACCAATACAACCATGCTTTCAAAGGGTTTTCAGCAATGCTAACTGAGGAAGAAGCTTCTTCTCTGTCTGGTATTGATGGAATCGTGTCGGTGTTCCCTGATCCGACGCTTCACCTCCACACTACACGTTCTTGGGATTTCTTGGACTCCATCTCCGGCCTCCGCCCTCCCACGCCACTTCCGCCGCCGCATTCTTACCCCAGTACATCGGATGTCATCGTCGGCGTCATTGACACCGGGATTTGGCCGGAGTCTCAATCTTTCAACGATGAGGGGATTGGAGAAATTCCATCTAAATGGAAAGGAGTCTGTATGGAGGCACCTGATTTCAAGAAGTCTAATTGCAACAGGAAGTTGATAGGTGCAAGATACTATAATGTTATAGAACTCAATGGGAATGATAGCCATGTGGGGGCTCCGAAGGGCACACCGAGGGATTCGCTCGGCCATGGGACCCACACTGCGTCGATAGCAGCCGGAGCCAGAGTCCCCAATGCAAGTTACTTTGGCTTAGCGAGAGGGACGGCGAGAGGCGGTGGCATTCCTTCCACAAGGATTGCAAGCTATAAGGTTTGTGCTGGTGTTGGATGCTCTGGTGCTGCAATTCTCAAAGCCATTGATGATGCAGTTAGGGATGGAGTTGATATCATTTCGATCTCGATTGGGATTGGTTCCCCTTTGTTTCAATCTGATTATTTGAATGACCCAATTGCCATTGGAGCATTCCATGCCCAACTAATGGGAGTTTTGGTTGTCTGCTCTGGTGGGAATGATGGCCCTGACCCTAACACTGTGGGGAACGTTGCTCCTTGGATTTTCACTGTTGCTGCTTCTAATATTGACAGGGATTTCCAGTCCTCTGTGGTTCTTGGCAATGGGAAGACTTTTCATGGGACTGCTATAAATCTCTCAAATCTTACTAGCTCAAAGACTTATCCTCTTGTATTTGGAAAGGACGCTGCTGCTAAATTCACACCCGTATCAGAAGCAAGGAATTGTTATCCAGGATCGCTGGATCGATCAAAAGTTGCAGGCAAGATTGTCGTTTGTGCCTCTGATGACTTCAGTACTTCAAGGACAATAAAGGAACTGGTCGTTCAAGATGCTAAAGCCATGGGGTTAATACTGATCAATGAGGCATCAAAAACTGTGCCAATGGATTCAAACATTTTCCCATTTACACAAATTGGGAATTCAGATGGTCTTCAGATTCTTGAGTACATTAACTCCACCAAGAACCCAACAGCCACAATTCTCAGAACAGTGGAAGTTCAAAGACTCAAACCGGCTCCAATTGTGGCTTATTTCTCATCGAGAGGCCCATCTCCACTTACCGAAAACATTCTCAAGCCGGATATTACAGCTCCGGGAGTATCCATTTTAGCAGCTATGATTCCAAAGAGTGATGGAGATAGTGGTCCAATTGGTAAGAAGCCTTCCAATTATGCAATGAGATCCGGGACGTCAATGGCATGCCCTCATGTAGCAGGCGCTGCTGCATTCATCAAATCGGTTTATCACGACTGGAGTTCTTCCATGATCAAATCTGCACTCATGACAACAGCAACTCAATACGATAATCAAAGGAAATTCATGAGAAACAGCACAAACAACCCTTCAAACCCACATGAGATGGGAGCTGGAGAAATAAGCCCCATAAAAGCTCTTAATCCTGGATTGGTTTTCGAAACTACGAACGAAGATTATCTTCGTTTCCTCTGTTATTATGGGTATTCAAACAAGGTCATAAGATCTGTGTCGAAACAAAACTTCAGCTGTCCGAAAACTTCAAAAGAAGATCTTATCTCCAATGTCAATTATCCATCCATCTCCATTGGAAAACTAGACAGTAAACAAGCTGCCAAAGTAATAGAAAGAACTGTGACAAACGTAGGAGCTCCAGATGCCACTTACATTGCCAAGGTTCATTCTTCAGAGGGTTTAATAGTGAAAGTGAATCCAAGGAAGATTGTTTTCTCTGAGAAGGTAAAGAAAGTGACCTTCAAAGTTTCATTTTACGGCAAGGAGGCTCGCAGCGGCTACAACTTCGGGACGATCACGTGGCGGGACACTGCACATTCCGTTCGAACTTTTTTTGCTGTAAATGTAATATAA

Coding sequence (CDS)

ATGGGAAATGGAAGAATCAACAATGAAGAAGATGAACAAACAACTGCTGAATTGGATTACTTGCAACTTTTATCCTCTGTTATTCCAAGGAAAGAGAAAGAGAGAGGATCAAGGGATGTGGTCCACCAATACAACCATGCTTTCAAAGGGTTTTCAGCAATGCTAACTGAGGAAGAAGCTTCTTCTCTGTCTGGTATTGATGGAATCGTGTCGGTGTTCCCTGATCCGACGCTTCACCTCCACACTACACGTTCTTGGGATTTCTTGGACTCCATCTCCGGCCTCCGCCCTCCCACGCCACTTCCGCCGCCGCATTCTTACCCCAGTACATCGGATGTCATCGTCGGCGTCATTGACACCGGGATTTGGCCGGAGTCTCAATCTTTCAACGATGAGGGGATTGGAGAAATTCCATCTAAATGGAAAGGAGTCTGTATGGAGGCACCTGATTTCAAGAAGTCTAATTGCAACAGGAAGTTGATAGGTGCAAGATACTATAATGTTATAGAACTCAATGGGAATGATAGCCATGTGGGGGCTCCGAAGGGCACACCGAGGGATTCGCTCGGCCATGGGACCCACACTGCGTCGATAGCAGCCGGAGCCAGAGTCCCCAATGCAAGTTACTTTGGCTTAGCGAGAGGGACGGCGAGAGGCGGTGGCATTCCTTCCACAAGGATTGCAAGCTATAAGGTTTGTGCTGGTGTTGGATGCTCTGGTGCTGCAATTCTCAAAGCCATTGATGATGCAGTTAGGGATGGAGTTGATATCATTTCGATCTCGATTGGGATTGGTTCCCCTTTGTTTCAATCTGATTATTTGAATGACCCAATTGCCATTGGAGCATTCCATGCCCAACTAATGGGAGTTTTGGTTGTCTGCTCTGGTGGGAATGATGGCCCTGACCCTAACACTGTGGGGAACGTTGCTCCTTGGATTTTCACTGTTGCTGCTTCTAATATTGACAGGGATTTCCAGTCCTCTGTGGTTCTTGGCAATGGGAAGACTTTTCATGGGACTGCTATAAATCTCTCAAATCTTACTAGCTCAAAGACTTATCCTCTTGTATTTGGAAAGGACGCTGCTGCTAAATTCACACCCGTATCAGAAGCAAGGAATTGTTATCCAGGATCGCTGGATCGATCAAAAGTTGCAGGCAAGATTGTCGTTTGTGCCTCTGATGACTTCAGTACTTCAAGGACAATAAAGGAACTGGTCGTTCAAGATGCTAAAGCCATGGGGTTAATACTGATCAATGAGGCATCAAAAACTGTGCCAATGGATTCAAACATTTTCCCATTTACACAAATTGGGAATTCAGATGGTCTTCAGATTCTTGAGTACATTAACTCCACCAAGAACCCAACAGCCACAATTCTCAGAACAGTGGAAGTTCAAAGACTCAAACCGGCTCCAATTGTGGCTTATTTCTCATCGAGAGGCCCATCTCCACTTACCGAAAACATTCTCAAGCCGGATATTACAGCTCCGGGAGTATCCATTTTAGCAGCTATGATTCCAAAGAGTGATGGAGATAGTGGTCCAATTGGTAAGAAGCCTTCCAATTATGCAATGAGATCCGGGACGTCAATGGCATGCCCTCATGTAGCAGGCGCTGCTGCATTCATCAAATCGGTTTATCACGACTGGAGTTCTTCCATGATCAAATCTGCACTCATGACAACAGCAACTCAATACGATAATCAAAGGAAATTCATGAGAAACAGCACAAACAACCCTTCAAACCCACATGAGATGGGAGCTGGAGAAATAAGCCCCATAAAAGCTCTTAATCCTGGATTGGTTTTCGAAACTACGAACGAAGATTATCTTCGTTTCCTCTGTTATTATGGGTATTCAAACAAGGTCATAAGATCTGTGTCGAAACAAAACTTCAGCTGTCCGAAAACTTCAAAAGAAGATCTTATCTCCAATGTCAATTATCCATCCATCTCCATTGGAAAACTAGACAGTAAACAAGCTGCCAAAGTAATAGAAAGAACTGTGACAAACGTAGGAGCTCCAGATGCCACTTACATTGCCAAGGTTCATTCTTCAGAGGGTTTAATAGTGAAAGTGAATCCAAGGAAGATTGTTTTCTCTGAGAAGGTAAAGAAAGTGACCTTCAAAGTTTCATTTTACGGCAAGGAGGCTCGCAGCGGCTACAACTTCGGGACGATCACGTGGCGGGACACTGCACATTCCGTTCGAACTTTTTTTGCTGTAAATGTAATATAA

Protein sequence

MGNGRINNEEDEQTTAELDYLQLLSSVIPRKEKERGSRDVVHQYNHAFKGFSAMLTEEEASSLSGIDGIVSVFPDPTLHLHTTRSWDFLDSISGLRPPTPLPPPHSYPSTSDVIVGVIDTGIWPESQSFNDEGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNVIELNGNDSHVGAPKGTPRDSLGHGTHTASIAAGARVPNASYFGLARGTARGGGIPSTRIASYKVCAGVGCSGAAILKAIDDAVRDGVDIISISIGIGSPLFQSDYLNDPIAIGAFHAQLMGVLVVCSGGNDGPDPNTVGNVAPWIFTVAASNIDRDFQSSVVLGNGKTFHGTAINLSNLTSSKTYPLVFGKDAAAKFTPVSEARNCYPGSLDRSKVAGKIVVCASDDFSTSRTIKELVVQDAKAMGLILINEASKTVPMDSNIFPFTQIGNSDGLQILEYINSTKNPTATILRTVEVQRLKPAPIVAYFSSRGPSPLTENILKPDITAPGVSILAAMIPKSDGDSGPIGKKPSNYAMRSGTSMACPHVAGAAAFIKSVYHDWSSSMIKSALMTTATQYDNQRKFMRNSTNNPSNPHEMGAGEISPIKALNPGLVFETTNEDYLRFLCYYGYSNKVIRSVSKQNFSCPKTSKEDLISNVNYPSISIGKLDSKQAAKVIERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGKEARSGYNFGTITWRDTAHSVRTFFAVNVI
BLAST of Cla022052 vs. Swiss-Prot
Match: CRSP_ARATH (CO(2)-response secreted protease OS=Arabidopsis thaliana GN=CRSP PE=2 SV=1)

HSP 1 Score: 639.0 bits (1647), Expect = 6.3e-182
Identity = 341/710 (48.03%), Postives = 450/710 (63.38%), Query Frame = 1

Query: 34  ERGSRDVVHQYNHAFKGFSAMLTEEEASSLSGIDGIVSVFPDPTLHLHTTRSWDFLDSIS 93
           +R + D++H Y H F GF+A LT EEA  ++   G+VSVFPDP   LHTT SWDFL   +
Sbjct: 61  KRRANDLLHTYKHGFSGFAARLTAEEAKVIAKKPGVVSVFPDPHFQLHTTHSWDFLKYQT 120

Query: 94  GLRPPTPLPPPHSYPSTSDVIVGVIDTGIWPESQSFNDEGIGEIPSKWKGVCMEAPDFKK 153
            ++  +  PP  +   + D IVG++DTGIWPES+SFND+ +G IPS+WKG CMEA DFK 
Sbjct: 121 SVKVDSG-PPSSASDGSYDSIVGILDTGIWPESESFNDKDMGPIPSRWKGTCMEAKDFKS 180

Query: 154 SNCNRKLIGARYYNVIELNGNDSHVGAPKGTPRDSLGHGTHTASIAAGARVPNASYFGLA 213
           SNCNRK+IGARYY     N +D    +   T RD +GHG+H +S  AG+ V NASY+G+A
Sbjct: 181 SNCNRKIIGARYYK----NPDDD---SEYYTTRDVIGHGSHVSSTIAGSAVENASYYGVA 240

Query: 214 RGTARGGGIPSTRIASYKVCAGVGCSGAAILKAIDDAVRDGVDIISISIGIGSPLFQSDY 273
            GTA+GG   + RIA YKVC   GC+G++IL A DDA+ DGVD++S+S+G  +P +    
Sbjct: 241 SGTAKGGS-QNARIAMYKVCNPGGCTGSSILAAFDDAIADGVDVLSLSLG--APAYARID 300

Query: 274 LN-DPIAIGAFHAQLMGVLVVCSGGNDGPDPNTVGNVAPWIFTVAASNIDRDFQSSVVLG 333
           LN DPIAIGAFHA   G+LV+CS GNDGPD  TV N APWI TVAA+ IDRDF+S VVLG
Sbjct: 301 LNTDPIAIGAFHAVEQGILVICSAGNDGPDGGTVTNTAPWIMTVAANTIDRDFESDVVLG 360

Query: 334 NGKTFHGTAINLSNLTSSKTYPLVFGKDAAAKFTPVSEARNCYPGSLDRSKVAGKIVVCA 393
             K   G  I+ SN++ S  YPL+ GK A +       AR C   SLD+ KV GKIV+C 
Sbjct: 361 GNKVIKGEGIHFSNVSKSPVYPLIHGKSAKSADASEGSARACDSDSLDQEKVKGKIVLCE 420

Query: 394 SDDFSTSRTIKELVVQDAKAMGLILINEASKTVPMDSNIFPFTQIGNSDGLQILEYINST 453
           +   S   +     V+     G + +++ ++ V      FP T I + +  +I  Y+NST
Sbjct: 421 NVGGSYYASSARDEVKSKGGTGCVFVDDRTRAVASAYGSFPTTVIDSKEAAEIFSYLNST 480

Query: 454 KNPTATILRTVEVQRLKPAPIVAYFSSRGPSPLTENILKPDITAPGVSILAAMIPKSDGD 513
           K+P ATIL T  V++  PAP VAYFSSRGPS LT +ILKPDITAPGVSILAA    +D  
Sbjct: 481 KDPVATILPTATVEKFTPAPAVAYFSSRGPSSLTRSILKPDITAPGVSILAAW-TGNDSS 540

Query: 514 SGPIGKKPSNYAMRSGTSMACPHVAGAAAFIKSVYHDWSSSMIKSALMTTATQYDNQRKF 573
               GK  S Y + SGTSMA PHV+  A+ IKS +  W  S I+SA+MTTATQ +N +  
Sbjct: 541 ISLEGKPASQYNVISGTSMAAPHVSAVASLIKSQHPTWGPSAIRSAIMTTATQTNNDKGL 600

Query: 574 MRNSTNNPSNPHEMGAGEISPIKALNPGLVFETTNEDYLRFLCYYGYSNKVIRSVSK--- 633
           +   T   + P++ GAGE+S   ++ PGLV+ETT  DYL FLCYYGY+   I+++SK   
Sbjct: 601 ITTETGATATPYDSGAGELSSTASMQPGLVYETTETDYLNFLCYYGYNVTTIKAMSKAFP 660

Query: 634 QNFSCPKTSKEDLISNVNYPSISIGKLDSKQAAKVIERTVTNVGAP-DATYIAKVHSSEG 693
           +NF+CP  S  DLIS +NYPSI I        +K + RTVTNVG   +A Y   V +  G
Sbjct: 661 ENFTCPADSNLDLISTINYPSIGISGFKG-NGSKTVTRTVTNVGEDGEAVYTVSVETPPG 720

Query: 694 LIVKVNPRKIVFSEKVKKVTFKVSFYGKEARSGYNFGTITWRDTAHSVRT 739
             ++V P K+ F++  +K+T++V      +     FG +TW +  + VR+
Sbjct: 721 FNIQVTPEKLQFTKDGEKLTYQVIVSATASLKQDVFGALTWSNAKYKVRS 757

BLAST of Cla022052 vs. Swiss-Prot
Match: SBT51_ARATH (Subtilisin-like protease SBT5.1 OS=Arabidopsis thaliana GN=SBT5.1 PE=3 SV=1)

HSP 1 Score: 578.2 bits (1489), Expect = 1.3e-163
Identity = 324/740 (43.78%), Postives = 450/740 (60.81%), Query Frame = 1

Query: 19  DYLQLLSSVIPRKEKERGSRDVVHQYNHAFKGFSAMLTEEEASSLSGIDGIVSVFPDPTL 78
           D+++LLSS++     +R  +  +H+Y H F GF+A L+E+EA  ++   G++SVFPD  L
Sbjct: 49  DHVELLSSLL-----QRSGKTPMHRYKHGFSGFAAHLSEDEAHLIAKQPGVLSVFPDQML 108

Query: 79  HLHTTRSWDFLDSISGLRPPTPLPPPHSYPST---SDVIVGVIDTGIWPESQSFNDEGIG 138
            LHTTRSWDFL   S  R        +   S     D I+G +D+GIWPE+QSFND  +G
Sbjct: 109 QLHTTRSWDFLVQESYQRDTYFTEMNYEQESEMHEGDTIIGFLDSGIWPEAQSFNDRHMG 168

Query: 139 EIPSKWKGVCMEAPDFKKSN--CNRKLIGARYYNVIELNGNDSHVGAPKGTPRDSLGHGT 198
            +P KWKG CM     +  +  CNRKLIGARYYN      +   +     TPRD LGHGT
Sbjct: 169 PVPEKWKGTCMRGKKTQPDSFRCNRKLIGARYYN------SSFFLDPDYETPRDFLGHGT 228

Query: 199 HTASIAAGARVPNASYFGLARGTARGGGIPSTRIASYKVCAGVGCSGAAILKAIDDAVRD 258
           H ASIAAG  + NASY+GLA G  RGG  PS+RIA Y+ C+ +GC G++IL A DDA+ D
Sbjct: 229 HVASIAAGQIIANASYYGLASGIMRGGS-PSSRIAMYRACSLLGCRGSSILAAFDDAIAD 288

Query: 259 GVDIISISIGIGSPLFQSDYLNDPIAIGAFHAQLMGVLVVCSGGNDGPDPNTVGNVAPWI 318
           GVD+ISIS+G    L+  + L DP++IG+FHA   G+ VVCS GN GP   +V N APW+
Sbjct: 289 GVDVISISMG----LWPDNLLEDPLSIGSFHAVERGITVVCSVGNSGPSSQSVFNAAPWM 348

Query: 319 FTVAASNIDRDFQSSVVLGN--GKTFHGTAINLSNLTSSKTYPLVFGKDAAAKFTPVSEA 378
            TVAAS IDR F+S+++LG    +   G  IN++N+  ++ YPL+  + A         A
Sbjct: 349 ITVAASTIDRGFESNILLGGDENRLIEGFGINIANIDKTQAYPLIHARSAKKIDANEEAA 408

Query: 379 RNCYPGSLDRSKVAGKIVVCASDDFSTSRTIKELVVQDAKAMGLILINEASKTVPMDSNI 438
           RNC P +LD++ V GKIVVC SD  +     K   V+    +G++L+++ S  +      
Sbjct: 409 RNCAPDTLDQTIVKGKIVVCDSDLDNQVIQWKSDEVKRLGGIGMVLVDDESMDLSFIDPS 468

Query: 439 FPFTQIGNSDGLQILEYINSTKNPTATILRTVEVQRLKPAPIVAYFSSRGPSPLTENILK 498
           F  T I   DG+QI+ YINST+ P ATI+ T        AP +  FSSRGP  LT +ILK
Sbjct: 469 FLVTIIKPEDGIQIMSYINSTREPIATIMPTRSRTGHMLAPSIPSFSSRGPYLLTRSILK 528

Query: 499 PDITAPGVSILAAMIPKSDGDSGPIGKKPSNYAMRSGTSMACPHVAGAAAFIKSVYHDWS 558
           PDI APGV+ILA+ +   D ++ P GK P  + + SGTSM+CPHV+G AA +KS Y  WS
Sbjct: 529 PDIAAPGVNILASWL-VGDRNAAPEGKPPPLFNIESGTSMSCPHVSGIAARLKSRYPSWS 588

Query: 559 SSMIKSALMTTATQYDNQRKFMRNSTNNPSNPHEMGAGEISPIKALNPGLVFETTNEDYL 618
            + I+SA+MTTA Q  N    +   T   + P++ GAG+++     +PGL++ET + DYL
Sbjct: 589 PAAIRSAIMTTAVQMTNTGSHITTETGEKATPYDFGAGQVTIFGPSSPGLIYETNHMDYL 648

Query: 619 RFLCYYGYSNKVIRSVSK---QNFSCPKTSKEDLISNVNYPSISIGKLDSKQAAKVIERT 678
            FL YYG+++  I+ +S    Q F+CP+ S    ISN+NYPSISI   + K++ +V  RT
Sbjct: 649 NFLGYYGFTSDQIKKISNRIPQGFACPEQSNRGDISNINYPSISISNFNGKESRRV-SRT 708

Query: 679 VTNV-----GAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGKEA-RSGY 738
           VTNV     G  D  Y   + + EGL+V+V PR++ F +   K++++V F          
Sbjct: 709 VTNVASRLIGDEDTVYTVSIDAPEGLLVRVIPRRLHFRKIGDKLSYQVIFSSTTTILKDD 768

Query: 739 NFGTITWRDTAHSVRTFFAV 743
            FG+ITW +  ++VR+ F V
Sbjct: 769 AFGSITWSNGMYNVRSPFVV 770

BLAST of Cla022052 vs. Swiss-Prot
Match: SBT4C_ARATH (Subtilisin-like protease SBT4.12 OS=Arabidopsis thaliana GN=SBT4.12 PE=2 SV=1)

HSP 1 Score: 520.8 bits (1340), Expect = 2.5e-146
Identity = 313/748 (41.84%), Postives = 433/748 (57.89%), Query Frame = 1

Query: 4   GRINNEEDEQTTAELDYLQLLSSVIPRKEKERGSRDVVHQYNHAFKGFSAMLTEEEASSL 63
           G +++  D   T+  D++ +L  V      E     +V  Y  +F GF+A LTE E + +
Sbjct: 38  GSLSSRADYIPTS--DHMSILQQVTGESSIEGR---LVRSYKRSFNGFAARLTESERTLI 97

Query: 64  SGIDGIVSVFPDPTLHLHTTRSWDFLDSISGLRPPTPLPPPHSYPSTSDVIVGVIDTGIW 123
           + I+G+VSVFP+  L LHTT SWDF+    G      L         SD I+GVIDTGIW
Sbjct: 98  AEIEGVVSVFPNKILQLHTTTSWDFMGVKEGKNTKRNLA------IESDTIIGVIDTGIW 157

Query: 124 PESQSFNDEGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNVIELNGNDSHVGAPKG 183
           PES+SF+D+G G  P KWKGVC    +F    CN KLIGAR Y               +G
Sbjct: 158 PESKSFSDKGFGPPPKKWKGVCSGGKNF---TCNNKLIGARDYT-------------SEG 217

Query: 184 TPRDSLGHGTHTASIAAGARVPNASYFGLARGTARGGGIPSTRIASYKVCAGVGCSGAAI 243
           T RD+ GHGTHTAS AAG  V + S+FG+  GT RGG +P++RIA+YKVC   GCS  A+
Sbjct: 218 T-RDTSGHGTHTASTAAGNAVKDTSFFGIGNGTVRGG-VPASRIAAYKVCTDSGCSSEAL 277

Query: 244 LKAIDDAVRDGVDIISISIGIGSPLFQSDYLNDPIAIGAFHAQLMGVLVVCSGGNDGPDP 303
           L + DDA+ DGVD+I+ISIG   P   S + +DPIAIGAFHA   G+L V S GN GP P
Sbjct: 278 LSSFDDAIADGVDLITISIGFQFP---SIFEDDPIAIGAFHAMAKGILTVSSAGNSGPKP 337

Query: 304 NTVGNVAPWIFTVAASNIDRDFQSSVVLGNGKTFHGTAINLSNLTSSKTYPLVFGKDAAA 363
            TV +VAPWIFTVAAS  +R F + VVLGNGKT  G ++N  ++   K YPLV+GK AA+
Sbjct: 338 TTVSHVAPWIFTVAASTTNRGFITKVVLGNGKTLAGRSVNAFDM-KGKKYPLVYGKSAAS 397

Query: 364 KFTPVSEARNCYPGSLDRSKVAGKIVVCASDDFSTSRTIKELVVQDAKAMGLI-LINEAS 423
                  A  C P  L++S+V GKI+VC                + AK++G I +I+++ 
Sbjct: 398 SACDAKTAALCAPACLNKSRVKGKILVCGGPS----------GYKIAKSVGAIAIIDKSP 457

Query: 424 KTVPMDSNIFPFTQIGNSDGLQILEYINSTKNPTATILRTVEVQRLKPAPIVAYFSSRGP 483
           +     ++  P + +   D   ++ YI S  +P A +L+T  +   + +P++A FSSRGP
Sbjct: 458 RPDVAFTHHLPASGLKAKDFKSLVSYIESQDSPQAAVLKTETIFN-RTSPVIASFSSRGP 517

Query: 484 SPLTENILKPDITAPGVSILAAMIPKSDGDSGPIGKKPSNYAMRSGTSMACPHVAGAAAF 543
           + +  +ILKPDITAPGV ILAA  P  +G+      +   Y++ SGTSMACPHVAG AA+
Sbjct: 518 NTIAVDILKPDITAPGVEILAAFSP--NGEPSEDDTRRVKYSVFSGTSMACPHVAGVAAY 577

Query: 544 IKSVYHDWSSSMIKSALMTTATQYDNQRKFMRNSTNNPSNPHEMGAGEISPIKALNPGLV 603
           +K+ Y  WS SMI+SA+MTTA     + + +       S     GAG + P+ ALNPGLV
Sbjct: 578 VKTFYPRWSPSMIQSAIMTTAWPVKAKGRGI------ASTEFAYGAGHVDPMAALNPGLV 637

Query: 604 FETTNEDYLRFLCYYGYSNKVIRSVSKQNFSCPKTSKEDLISNVNYPSISIGKLDSKQA- 663
           +E    D++ FLC   Y++K ++ +S     C K +K  L  N+NYPS+S  KL    + 
Sbjct: 638 YELDKADHIAFLCGMNYTSKTLKIISGDTVKCSKKNK-ILPRNLNYPSMS-AKLSGTDST 697

Query: 664 -AKVIERTVTNVGAPDATYIAKVHSSEG--LIVKVNPRKIVFSEKVKKVTFKVSFYGKEA 723
            +    RT+TNVG P++TY +KV +  G  L +KV P  + F    +K +F V+  G + 
Sbjct: 698 FSVTFNRTLTNVGTPNSTYKSKVVAGHGSKLSIKVTPSVLYFKTVNEKQSFSVTVTGSDV 731

Query: 724 RSGY-NFGTITWRDTAHSVRTFFAVNVI 746
            S   +   + W D  H+VR+   V ++
Sbjct: 758 DSEVPSSANLIWSDGTHNVRSPIVVYIM 731

BLAST of Cla022052 vs. Swiss-Prot
Match: SBT44_ARATH (Subtilisin-like protease SBT4.4 OS=Arabidopsis thaliana GN=SBT4.4 PE=2 SV=1)

HSP 1 Score: 515.4 bits (1326), Expect = 1.0e-144
Identity = 311/737 (42.20%), Postives = 422/737 (57.26%), Query Frame = 1

Query: 12  EQTTAELDYLQLLSSVIPRKEKERGSRDVVHQYNHAFKGFSAMLTEEEASSLSGIDGIVS 71
           E+ T   D++ +L  +      E     +V  Y  +F GF+A LTE E   L+G++ +VS
Sbjct: 46  EEYTPMSDHMSILQEITGESLIENR---LVRSYKKSFNGFAARLTESERKRLAGMERVVS 105

Query: 72  VFPDPTLHLHTTRSWDFLDSISGLRPPTPLPPPHSYPSTSDVIVGVIDTGIWPESQSFND 131
           VFP   L L TT SW+F+    G++         +    SD I+GVID+GI+PES SF+D
Sbjct: 106 VFPSRKLKLQTTSSWNFMGLKEGIKTK------RTRSIESDTIIGVIDSGIYPESDSFSD 165

Query: 132 EGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNVIELNGNDSHVGAPKGTPRDSLGH 191
           +G G  P KWKG C    +F    CN K+IGAR Y   +   N         T RD  GH
Sbjct: 166 QGFGPPPKKWKGTCAGGKNF---TCNNKVIGARDYTA-KSKANQ--------TARDYSGH 225

Query: 192 GTHTASIAAGARVPNASYFGLARGTARGGGIPSTRIASYKVCAGVGCSGAAILKAIDDAV 251
           GTHTASIAAG  V N++++GL  GTARGG +P+ RIA YKVC   GC G A++ A DDA+
Sbjct: 226 GTHTASIAAGNAVANSNFYGLGNGTARGG-VPAARIAVYKVCDNEGCDGEAMMSAFDDAI 285

Query: 252 RDGVDIISISIGIGS-PLFQSDYLNDPIAIGAFHAQLMGVLVVCSGGNDGPDPNTVGNVA 311
            DGVD+ISISI + + P F+ D    PIAIGAFHA  +GVL V + GN+GP  +TV + A
Sbjct: 286 ADGVDVISISIVLDNIPPFEED----PIAIGAFHAMAVGVLTVNAAGNNGPKISTVTSTA 345

Query: 312 PWIFTVAASNIDRDFQSSVVLGNGKTFHGTAINLSNLTSSKTYPLVFGKDAAAKFTPVSE 371
           PW+F+VAAS  +R F + VVLG+GK   G ++N  ++  +  YPLV+GK AA     V +
Sbjct: 346 PWVFSVAASVTNRAFMAKVVLGDGKILIGRSVNTYDMNGTN-YPLVYGKSAALSTCSVDK 405

Query: 372 ARNCYPGSLDRSKVAGKIVVCASDDFSTSRTIKELVVQDAKAMGLILINEASKTVPMDSN 431
           AR C P  LD   V GKIV+C S       T   +  Q   A+G I+ N       + S 
Sbjct: 406 ARLCEPKCLDGKLVKGKIVLCDS-------TKGLIEAQKLGAVGSIVKNPEPDRAFIRS- 465

Query: 432 IFPFTQIGNSDGLQILEYINSTKNPTATILRTVEVQRLKPAPIVAYFSSRGPSPLTENIL 491
            FP + + N D   ++ Y+NSTKNP AT+L++ E+   + AP+VA FSSRGPS +  +IL
Sbjct: 466 -FPVSFLSNDDYKSLVSYMNSTKNPKATVLKSEEISNQR-APLVASFSSRGPSSIVSDIL 525

Query: 492 KPDITAPGVSILAAMIPKSDGDSGPIGKKPSNYAMRSGTSMACPHVAGAAAFIKSVYHDW 551
           KPDITAPGV ILAA  P S         +   Y++ SGTSMACPHVAG AA++K+ +  W
Sbjct: 526 KPDITAPGVEILAAYSPDSSPTESEFDTRRVKYSVLSGTSMACPHVAGVAAYVKTFHPQW 585

Query: 552 SSSMIKSALMTTATQYDNQRKFMRNSTNNPSNPHEMGAGEISPIKALNPGLVFETTNEDY 611
           S SMI+SA+MTTA   +       + +   S     G+G + PI A+NPGLV+E T  D+
Sbjct: 586 SPSMIQSAIMTTAWPMN------ASGSGFVSTEFAYGSGHVDPIDAINPGLVYELTKADH 645

Query: 612 LRFLCYYGYSNKVIRSVSKQNFSCPKTSKEDLISNVNYPSISIGKLDSKQAAKVIERTVT 671
           + FLC   Y++  +R +S  N +C K   + L  N+NYP++S     +K      +RTVT
Sbjct: 646 INFLCGLNYTSDHLRIISGDNSTCTKEISKTLPRNLNYPTMSAKVSGTKPFNITFQRTVT 705

Query: 672 NVGAPDATYIAKVHSSEG--LIVKVNPRKIVFSEKVKKVTFKVSFYGKEARSGYNFGT-- 731
           NVG   +TY AKV    G  L +KV+PR +      +K +F V+       S  + GT  
Sbjct: 706 NVGMQKSTYNAKVVKFPGSKLSIKVSPRVLSMKSMNEKQSFMVTV------SSDSIGTKQ 733

Query: 732 -----ITWRDTAHSVRT 739
                + W D  H+VR+
Sbjct: 766 PVSANLIWSDGTHNVRS 733

BLAST of Cla022052 vs. Swiss-Prot
Match: AIR3_ARATH (Subtilisin-like protease SBT5.3 OS=Arabidopsis thaliana GN=AIR3 PE=2 SV=1)

HSP 1 Score: 511.5 bits (1316), Expect = 1.5e-143
Identity = 297/723 (41.08%), Postives = 413/723 (57.12%), Query Frame = 1

Query: 33  KERGSRDVVHQYNHAFKGFSAMLTEEEASSLSGIDGIVSVFPDPTLHLHTTRSWDFLDSI 92
           +ER +  + + Y     GF+A L  + A  +S    +VSVFP+  L LHTTRSWDFL   
Sbjct: 68  RERATDAIFYSYTKHINGFAAHLDHDLAYEISKHPEVVSVFPNKALKLHTTRSWDFL--- 127

Query: 93  SGLRPPTPLPPPHSYPST---SDVIVGVIDTGIWPESQSFNDEGIGEIPSKWKGVCMEAP 152
            GL   + +P    +       D I+  +DTG+WPES+SF DEG+G IPS+WKG+C    
Sbjct: 128 -GLEHNSYVPSSSIWRKARFGEDTIIANLDTGVWPESKSFRDEGLGPIPSRWKGICQNQK 187

Query: 153 DFKKSNCNRKLIGARYYNVIELNGNDSHVGAPKGTPRDSLGHGTHTASIAAGARVPNASY 212
           D    +CNRKLIGARY+N         H+ +   +PRD  GHG+HT S AAG  VP  S 
Sbjct: 188 D-ATFHCNRKLIGARYFNK-GYAAAVGHLNSSFDSPRDLDGHGSHTLSTAAGDFVPGVSI 247

Query: 213 FGLARGTARGGGIPSTRIASYKVC----AGVGCSGAAILKAIDDAVRDGVDIISISIGIG 272
           FG   GTA+GG  P  R+A+YKVC     G  C  A +L A D A+ DG D+IS+S+G G
Sbjct: 248 FGQGNGTAKGGS-PRARVAAYKVCWPPVKGNECYDADVLAAFDAAIHDGADVISVSLG-G 307

Query: 273 SPLFQSDYLNDPIAIGAFHAQLMGVLVVCSGGNDGPDPNTVGNVAPWIFTVAASNIDRDF 332
            P   + + ND +AIG+FHA    ++VVCS GN GP  +TV NVAPW  TV AS +DR+F
Sbjct: 308 EP---TSFFNDSVAIGSFHAAKKRIVVVCSAGNSGPADSTVSNVAPWQITVGASTMDREF 367

Query: 333 QSSVVLGNGKTFHGTAINLSNLTSSKTYPLVFGKDAAAKFTPVSEARNCYPGSLDRSKVA 392
            S++VLGNGK + G +++ + L  +K YP++   +A AK     +A+ C  GSLD  K  
Sbjct: 368 ASNLVLGNGKHYKGQSLSSTALPHAKFYPIMASVNAKAKNASALDAQLCKLGSLDPIKTK 427

Query: 393 GKIVVCASDDFSTSRTIKELVVQDAKAMGLILINE--ASKTVPMDSNIFPFTQIGNSDGL 452
           GKI+VC        R  K   V     +G++L N       +  D ++ P TQ+ + D  
Sbjct: 428 GKILVCLRG--QNGRVEKGRAVALGGGIGMVLENTYVTGNDLLADPHVLPATQLTSKDSF 487

Query: 453 QILEYINSTKNPTATILRTVEVQRLKPAPIVAYFSSRGPSPLTENILKPDITAPGVSILA 512
            +  YI+ TK P A I  +     LKPAP++A FSS+GPS +   ILKPDITAPGVS++A
Sbjct: 488 AVSRYISQTKKPIAHITPSRTDLGLKPAPVMASFSSKGPSIVAPQILKPDITAPGVSVIA 547

Query: 513 AMIPKSDGDSGPIGKKPSNYAMRSGTSMACPHVAGAAAFIKSVYHDWSSSMIKSALMTTA 572
           A        +     +   +   SGTSM+CPH++G A  +K+ Y  WS + I+SA+MTTA
Sbjct: 548 AYTGAVSPTNEQFDPRRLLFNAISGTSMSCPHISGIAGLLKTRYPSWSPAAIRSAIMTTA 607

Query: 573 TQYDNQRKFMRNSTNNPSNPHEMGAGEISPIKALNPGLVFETTNEDYLRFLCYYGYSNKV 632
           T  D+    ++N+TN  + P   GAG + P  A+NPGLV++   +DYL FLC  GY+   
Sbjct: 608 TIMDDIPGPIQNATNMKATPFSFGAGHVQPNLAVNPGLVYDLGIKDYLNFLCSLGYNASQ 667

Query: 633 IRSVSKQNFSCPKTSKEDLISNVNYPSISIGKLDSKQAAKVIERTVTNVGAPDATYIAKV 692
           I   S  NF+C  +S +  + N+NYPSI++  L S +    + RTV NVG P + Y  KV
Sbjct: 668 ISVFSGNNFTC--SSPKISLVNLNYPSITVPNLTSSKV--TVSRTVKNVGRP-SMYTVKV 727

Query: 693 HSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGKEAR--SGYNFGTITWRDTAHSVRTFFA 745
           ++ +G+ V V P  + F++  ++ TFKV     +     GY FG + W D  H VR+   
Sbjct: 728 NNPQGVYVAVKPTSLNFTKVGEQKTFKVILVKSKGNVAKGYVFGELVWSDKKHRVRSPIV 772

BLAST of Cla022052 vs. TrEMBL
Match: A0A0A0KJ05_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G401370 PE=4 SV=1)

HSP 1 Score: 1381.7 bits (3575), Expect = 0.0e+00
Identity = 694/748 (92.78%), Postives = 720/748 (96.26%), Query Frame = 1

Query: 1   MGNGRINNEEDEQTTA-ELDYLQLLSSVIP-RKEKERGSRDVV-HQYNHAFKGFSAMLTE 60
           MGNG     EDEQT   ELDY QLLSSVIP RKEKE GSR VV HQY+HAFKGFSAMLTE
Sbjct: 47  MGNG-----EDEQTAGDELDYFQLLSSVIPSRKEKESGSRAVVIHQYHHAFKGFSAMLTE 106

Query: 61  EEASSLSGIDGIVSVFPDPTLHLHTTRSWDFLDSISGLRPPTPLPPPHSYPSTSDVIVGV 120
           EEASSLSGIDGIVSVFPDPTL LHTTRSWDFLDSISGLRPPTPLPPPHSYPS+SDVIVGV
Sbjct: 107 EEASSLSGIDGIVSVFPDPTLQLHTTRSWDFLDSISGLRPPTPLPPPHSYPSSSDVIVGV 166

Query: 121 IDTGIWPESQSFNDEGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNVIELNGNDSH 180
           IDTGI+PESQSFNDEGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNV+ELNGNDSH
Sbjct: 167 IDTGIFPESQSFNDEGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNVVELNGNDSH 226

Query: 181 VGAPKGTPRDSLGHGTHTASIAAGARVPNASYFGLARGTARGGGIPSTRIASYKVCAGVG 240
           VG PKGTPRDS GHGTHT+SIAAGARVPNASYFGLARGTARGGG PSTRIASYKVCAGVG
Sbjct: 227 VGPPKGTPRDSHGHGTHTSSIAAGARVPNASYFGLARGTARGGGSPSTRIASYKVCAGVG 286

Query: 241 CSGAAILKAIDDAVRDGVDIISISIGIGSPLFQSDYLNDPIAIGAFHAQLMGVLVVCSGG 300
           CSGAAILKAIDDA++DGVDIISISIGIGSPLFQSDYLNDPIAIGA HAQLMGVLVVCS G
Sbjct: 287 CSGAAILKAIDDAIKDGVDIISISIGIGSPLFQSDYLNDPIAIGALHAQLMGVLVVCSAG 346

Query: 301 NDGPDPNTVGNVAPWIFTVAASNIDRDFQSSVVLGNGKTFHGTAINLSNLTSSKTYPLVF 360
           NDGPDPNTVGNVAPWIFTVAASNIDRDFQS+VVLGNGKTF GTAINLSNLTSSKTYPLVF
Sbjct: 347 NDGPDPNTVGNVAPWIFTVAASNIDRDFQSTVVLGNGKTFPGTAINLSNLTSSKTYPLVF 406

Query: 361 GKDAAAKFTPVSEARNCYPGSLDRSKVAGKIVVCASDDFSTSRTIKELVVQDAKAMGLIL 420
           G+DAAAKFTP SEARNC+PGSLDRSKVAGKIVVCASDDFSTSR IKELVVQDAKAMGLIL
Sbjct: 407 GQDAAAKFTPTSEARNCFPGSLDRSKVAGKIVVCASDDFSTSRIIKELVVQDAKAMGLIL 466

Query: 421 INEASKTVPMDSNIFPFTQIGNSDGLQILEYINSTKNPTATILRTVEVQRLKPAPIVAYF 480
           INEASK+VPMDSNIFPFTQIGNS+GLQILEYINSTKNPTATIL+TVEV+RLKPAP VAYF
Sbjct: 467 INEASKSVPMDSNIFPFTQIGNSEGLQILEYINSTKNPTATILKTVEVRRLKPAPTVAYF 526

Query: 481 SSRGPSPLTENILKPDITAPGVSILAAMIPKSDGDSGPIGKKPSNYAMRSGTSMACPHVA 540
           SSRGPSPLTENILKPDITAPGVSILAAMIPKSD D+GPIGKKPSNYAM+SGTSMACPHVA
Sbjct: 527 SSRGPSPLTENILKPDITAPGVSILAAMIPKSDEDTGPIGKKPSNYAMKSGTSMACPHVA 586

Query: 541 GAAAFIKSVYHDWSSSMIKSALMTTATQYDNQRKFMRNSTNNPSNPHEMGAGEISPIKAL 600
           GAAAFIKSVYHDWSSSMIKSALMTTATQYDNQRK+MRN+T+NPSNPHEMGAGEISPIKAL
Sbjct: 587 GAAAFIKSVYHDWSSSMIKSALMTTATQYDNQRKYMRNTTDNPSNPHEMGAGEISPIKAL 646

Query: 601 NPGLVFETTNEDYLRFLCYYGYSNKVIRSVSKQNFSCPKTSKEDLISNVNYPSISIGKLD 660
           NPGLVFETTNED+L FLCYYGYSNKVIRS+ KQNF+CPKTSKEDLISNVNYPSISI KLD
Sbjct: 647 NPGLVFETTNEDHLLFLCYYGYSNKVIRSMLKQNFTCPKTSKEDLISNVNYPSISIAKLD 706

Query: 661 SKQAAKVIERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGKE 720
            KQAAKV+ERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGKE
Sbjct: 707 RKQAAKVVERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGKE 766

Query: 721 ARSGYNFGTITWRDTAHSVRTFFAVNVI 746
           AR+GYNFG+ITWRDTAHSVRTFFAVNV+
Sbjct: 767 ARNGYNFGSITWRDTAHSVRTFFAVNVV 789

BLAST of Cla022052 vs. TrEMBL
Match: M5X7H3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001918mg PE=4 SV=1)

HSP 1 Score: 1025.8 bits (2651), Expect = 2.7e-296
Identity = 518/744 (69.62%), Postives = 607/744 (81.59%), Query Frame = 1

Query: 2   GNGRINNEEDEQTTAELDYLQLLSSVIPRKEKERGSRDVVHQYNHAFKGFSAMLTEEEAS 61
           GNGR+   ED    AE  YLQ+LSS+IP  E ER S  ++H+YNHAF+GFSAMLTE EAS
Sbjct: 8   GNGRVLGAED---AAESAYLQMLSSIIPSHEIERLS--IIHKYNHAFRGFSAMLTETEAS 67

Query: 62  SLSGIDGIVSVFPDPTLHLHTTRSWDFLDSISGLRPPTPLPPPHSYPSTSDVIVGVIDTG 121
            LSG D +VS+FPD  L LHTTRSWDFL++ SG  P       +    +SDVI+G+IDTG
Sbjct: 68  VLSGHDDVVSIFPDSILELHTTRSWDFLEAESGRLPSNK----YQRGLSSDVIIGMIDTG 127

Query: 122 IWPESQSFNDEGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNVI-ELNGNDSHVGA 181
           IWPES SFNDEGIG +PS+WKGVCME  DF+KSNCNRKLIGARYYNV    +GN S +  
Sbjct: 128 IWPESSSFNDEGIGAVPSRWKGVCMEGSDFRKSNCNRKLIGARYYNVPWTRDGNQSSLAR 187

Query: 182 PKGTPRDSLGHGTHTASIAAGARVPNASYFGLARGTARGGGIPSTRIASYKVCAGVGCSG 241
            KG+PRDS+GHGTHTAS AAG +V NASY+GLA+GTARGG +PS RIA YK C+ VGCSG
Sbjct: 188 TKGSPRDSVGHGTHTASTAAGVQVLNASYYGLAQGTARGG-LPSARIACYKACSDVGCSG 247

Query: 242 AAILKAIDDAVRDGVDIISISIGIGSPLFQSDYLNDPIAIGAFHAQLMGVLVVCSGGNDG 301
           A ILKAIDDA+RDGVDIISISIG+ S LFQSDYLNDPIAIGAFHA+ MGV+V+CSGGNDG
Sbjct: 248 ATILKAIDDAIRDGVDIISISIGMSS-LFQSDYLNDPIAIGAFHAEQMGVMVICSGGNDG 307

Query: 302 PDPNTVGNVAPWIFTVAASNIDRDFQSSVVLGNGKTFHGTAINLSNLTSSKTYPLVFGKD 361
           PDP T+ N APWIFTVAASNIDRDFQS++VLGNGK F G+AIN SNLT S+TYPLVFGKD
Sbjct: 308 PDPYTIVNTAPWIFTVAASNIDRDFQSNIVLGNGKNFTGSAINFSNLTRSRTYPLVFGKD 367

Query: 362 AAAKFTPVSEARNCYPGSLDRSKVAGKIVVCASDDFSTSRTIKELVVQDAKAMGLILINE 421
            A  +TPVSEARNCYPGSLD  KV GKIVVC  DD + SR IK+LVV+DAKA GLILI+E
Sbjct: 368 VAGYYTPVSEARNCYPGSLDPKKVVGKIVVCVDDDPAVSRKIKKLVVEDAKAKGLILIDE 427

Query: 422 ASKTVPMDSNIFPFTQIGNSDGLQILEYINSTKNPTATILRTVEVQRLKPAPIVAYFSSR 481
           A K+VP DS IFP+T++GN  G QIL+YINSTKNPTATIL TV+V R +PAP VAYFSSR
Sbjct: 428 AEKSVPFDSGIFPYTEVGNIAGFQILQYINSTKNPTATILPTVDVPRYRPAPAVAYFSSR 487

Query: 482 GPSPLTENILKPDITAPGVSILAAMIPKSDGDSGPIGKKPSNYAMRSGTSMACPHVAGAA 541
           GP+ LTENILKPDI APGV+ILAA+ PK++  + P GKKPS ++++SGTSMACPHV GAA
Sbjct: 488 GPAELTENILKPDIMAPGVAILAAIAPKNETGTVPNGKKPSTFSIKSGTSMACPHVTGAA 547

Query: 542 AFIKSVYHDWSSSMIKSALMTTATQYDNQRKFMRNSTNNPSNPHEMGAGEISPIKALNPG 601
           AFIKSV+  W+SSMIKSALMTTAT ++N +K + NS+N  +NPHE+G GEI+P+KAL+PG
Sbjct: 548 AFIKSVHRRWTSSMIKSALMTTATVFNNMKKPLTNSSNTFANPHEVGVGEINPLKALSPG 607

Query: 602 LVFETTNEDYLRFLCYYGYSNKVIRSVSKQNFSCPKTSKEDLISNVNYPSISIGKLDSKQ 661
           LVFETT E+YL FLCYYGY  K IRS+S   F CPK+S ++LISNVNYPSISI KL+  Q
Sbjct: 608 LVFETTTENYLEFLCYYGYPEKNIRSMSNTKFICPKSSIDELISNVNYPSISISKLNRHQ 667

Query: 662 AAKVIERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGKEARS 721
            AK I+RT TNV A ++TYIAKVH+  GLIVKV P K+VF+E V++V+F+VSFYGKEA  
Sbjct: 668 PAKTIQRTATNVAALNSTYIAKVHAPAGLIVKVLPEKLVFAEGVRRVSFQVSFYGKEAPR 727

Query: 722 GYNFGTITWRDTAHSVRTFFAVNV 745
           GYNFG+ITW D  HSVRT F+VNV
Sbjct: 728 GYNFGSITWFDGRHSVRTVFSVNV 740

BLAST of Cla022052 vs. TrEMBL
Match: K7MD65_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_15G224500 PE=4 SV=1)

HSP 1 Score: 1024.6 bits (2648), Expect = 5.9e-296
Identity = 519/746 (69.57%), Postives = 607/746 (81.37%), Query Frame = 1

Query: 1   MGNGRINNEEDEQTTAELDYLQLLSSVIPRKEKERGSRDVVHQYNHAFKGFSAMLTEEEA 60
           MGN   NN   E    E  +L LLSS+IP ++ ER +  + H ++HAF GFSA+LTE EA
Sbjct: 35  MGNSSPNNIGVEGQILESSHLHLLSSIIPSEQSERIA--LTHHFSHAFSGFSALLTEGEA 94

Query: 61  SSLSGIDGIVSVFPDPTLHLHTTRSWDFLDSISGLRPPTPLPPPHSYPSTSDVIVGVIDT 120
           S+LSG D +VSVFPDP L LHTTRSWDFL+S  G++P +   P     S+SD+I+GVIDT
Sbjct: 95  SALSGHDSVVSVFPDPVLQLHTTRSWDFLESDLGMKPYSYGTPKLHQHSSSDIIIGVIDT 154

Query: 121 GIWPESQSFNDEGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNVIELNG-NDSHVG 180
           GIWPES SF DEGIGEIPS+WKGVCME  DFKKSNCNRKLIGARYYN++  +G N +H+ 
Sbjct: 155 GIWPESPSFRDEGIGEIPSRWKGVCMEGSDFKKSNCNRKLIGARYYNILATSGDNQTHIE 214

Query: 181 APKGTPRDSLGHGTHTASIAAGARVPNASYFGLARGTARGGGIPSTRIASYKVCAGVGCS 240
           A KG+PRDS+GHGTHTASIAAG  V NASYFGLA+GTARGG  PSTRIA+YK C+  GCS
Sbjct: 215 ATKGSPRDSVGHGTHTASIAAGVHVNNASYFGLAQGTARGGS-PSTRIAAYKTCSDEGCS 274

Query: 241 GAAILKAIDDAVRDGVDIISISIGIGSPLFQSDYLNDPIAIGAFHAQLMGVLVVCSGGND 300
           GA ILKAIDDAV+DGVDIISISIG+ S LFQSD+L+DPIAIGAFHA+  GVLVVCS GND
Sbjct: 275 GATILKAIDDAVKDGVDIISISIGLSS-LFQSDFLSDPIAIGAFHAEQKGVLVVCSAGND 334

Query: 301 GPDPNTVGNVAPWIFTVAASNIDRDFQSSVVLGNGKTFHGTAINLSNLTSSKTYPLVFGK 360
           GPDP TV N APWIFT+AASNIDR+FQS++VLGNGK F GT IN SNLT SK + LVFG+
Sbjct: 335 GPDPFTVVNTAPWIFTIAASNIDRNFQSTIVLGNGKYFQGTGINFSNLTHSKMHRLVFGE 394

Query: 361 DAAAKFTPVSEARNCYPGSLDRSKVAGKIVVCASDDFSTSRTIKELVVQDAKAMGLILIN 420
             AAKF P SEARNC+PGSLD +K AG IVVC +DD + SR IK+LVVQDA+A+G+ILIN
Sbjct: 395 QVAAKFVPASEARNCFPGSLDFNKTAGSIVVCVNDDPTVSRQIKKLVVQDARAIGIILIN 454

Query: 421 EASKTVPMDSNIFPFTQIGNSDGLQILEYINSTKNPTATILRTVEVQRLKPAPIVAYFSS 480
           E +K  P D+  FPFTQ+GN +G QIL+YINSTKNPTATIL T EV RLKP+PIVA FSS
Sbjct: 455 EDNKDAPFDAGAFPFTQVGNLEGHQILQYINSTKNPTATILPTTEVSRLKPSPIVASFSS 514

Query: 481 RGPSPLTENILKPDITAPGVSILAAMIPKS-DGDSGPIGKKPSNYAMRSGTSMACPHVAG 540
           RGPS LTEN+LKPD+ APGV ILAA+IPK+ +  S PIGKKPS YA++SGTSMACPHV G
Sbjct: 515 RGPSSLTENVLKPDVMAPGVGILAAVIPKTKEPGSVPIGKKPSLYAIKSGTSMACPHVTG 574

Query: 541 AAAFIKSVYHDWSSSMIKSALMTTATQYDNQRKFMRNSTNNPSNPHEMGAGEISPIKALN 600
           AAAFIKSV+  WSSSMIKSALMTTAT Y+N RK + NS+N+ ++PHEMG GEI+P++ALN
Sbjct: 575 AAAFIKSVHTKWSSSMIKSALMTTATNYNNLRKPLTNSSNSIADPHEMGVGEINPLRALN 634

Query: 601 PGLVFETTNEDYLRFLCYYGYSNKVIRSVSKQNFSCPKTSKEDLISNVNYPSISIGKLDS 660
           PGLVFET  EDYLRFLCY+GYS K+IRS+SK NF+CPK S E LISNVNYPSIS+  L  
Sbjct: 635 PGLVFETDVEDYLRFLCYFGYSQKIIRSMSKTNFNCPKNSSEGLISNVNYPSISVSTLKK 694

Query: 661 KQAAKVIERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGKEA 720
           +Q AKVI R VTNVG+ +ATY AKV + EGL+VKV P K+VFSE V+++T+KVSFYGKEA
Sbjct: 695 QQKAKVITRKVTNVGSLNATYTAKVLAPEGLVVKVIPNKLVFSEGVQRMTYKVSFYGKEA 754

Query: 721 RSGYNFGTITWRDTAHSVRTFFAVNV 745
           RSGYNFG++TW D  H V T FAV V
Sbjct: 755 RSGYNFGSLTWLDGHHYVHTVFAVKV 776

BLAST of Cla022052 vs. TrEMBL
Match: I1M0I4_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_13G187000 PE=4 SV=1)

HSP 1 Score: 1020.8 bits (2638), Expect = 8.6e-295
Identity = 521/748 (69.65%), Postives = 604/748 (80.75%), Query Frame = 1

Query: 1   MGNGRINNEEDEQTTAELDYLQLLSSVIPRKEKERGSRDVVHQYNHAFKGFSAMLTEEEA 60
           MGN   N    E   AE  +LQLLS +IP +E ER +  + H ++HAF GFSAMLTE EA
Sbjct: 35  MGNSSPNKIGVESQIAESSHLQLLSLIIPSEESERIA--LTHHFSHAFSGFSAMLTESEA 94

Query: 61  SSLSGIDGIVSVFPDPTLHLHTTRSWDFLDSISGLRP--PTPLPPPHSYPSTSDVIVGVI 120
           S+LSG DG+VSVFPDP L LHTTRSWDFL+S  G++P      P  H +PST D+I+GVI
Sbjct: 95  SALSGHDGVVSVFPDPVLELHTTRSWDFLESELGMKPYYSHGTPTLHKHPST-DIIIGVI 154

Query: 121 DTGIWPESQSFNDEGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNVIELNG-NDSH 180
           DTGIWPES SF DEGIGEIPSKWKGVCME  DFKKSNCNRKLIGARYY +   +G N +H
Sbjct: 155 DTGIWPESPSFRDEGIGEIPSKWKGVCMEGRDFKKSNCNRKLIGARYYKIQATSGDNQTH 214

Query: 181 VGAPKGTPRDSLGHGTHTASIAAGARVPNASYFGLARGTARGGGIPSTRIASYKVCAGVG 240
           + A KG+PRD++GHGTHTASIAAG  V NASYFGLA+GTARGG  PSTRIA+YK C+  G
Sbjct: 215 IEAAKGSPRDTVGHGTHTASIAAGVHVNNASYFGLAKGTARGGS-PSTRIAAYKTCSDEG 274

Query: 241 CSGAAILKAIDDAVRDGVDIISISIGIGSPLFQSDYLNDPIAIGAFHAQLMGVLVVCSGG 300
           CSGA ILKAIDDAV+DGVDIISISIG+ S LFQSD+L+DPIAIGAFHA+  GVLVVCS G
Sbjct: 275 CSGATILKAIDDAVKDGVDIISISIGLSS-LFQSDFLSDPIAIGAFHAEQKGVLVVCSAG 334

Query: 301 NDGPDPNTVGNVAPWIFTVAASNIDRDFQSSVVLGNGKTFHGTAINLSNLTSSKTYPLVF 360
           NDGPDP TV N APWIFT+AASNIDR+FQS++VLGNGK   GT IN SNLT SK + LVF
Sbjct: 335 NDGPDPFTVVNSAPWIFTIAASNIDRNFQSTIVLGNGKYLQGTGINFSNLTHSKMHRLVF 394

Query: 361 GKDAAAKFTPVSEARNCYPGSLDRSKVAGKIVVCASDDFSTSRTIKELVVQDAKAMGLIL 420
           G+  AAKF P SEARNC+PGSLD +K AG IVVC +DD S SR IK+LVVQDA+A+G+IL
Sbjct: 395 GEQVAAKFVPASEARNCFPGSLDFNKTAGNIVVCVNDDPSVSRRIKKLVVQDARAVGIIL 454

Query: 421 INEASKTVPMDSNIFPFTQIGNSDGLQILEYINSTKNPTATILRTVEVQRLKPAPIVAYF 480
           INE +K  P D+ +FPFTQ+GN +G QIL+YINSTKNPTATIL T EV R KP+PIVA F
Sbjct: 455 INENNKDAPFDAGVFPFTQVGNLEGHQILKYINSTKNPTATILPTTEVARSKPSPIVASF 514

Query: 481 SSRGPSPLTENILKPDITAPGVSILAAMIPKS-DGDSGPIGKKPSNYAMRSGTSMACPHV 540
           SSRGPS LTENILKPD+ APGV ILAA+IPKS +  S PIGKKPS YA++SGTSMACPHV
Sbjct: 515 SSRGPSSLTENILKPDVMAPGVGILAAVIPKSKEPGSVPIGKKPSLYAIKSGTSMACPHV 574

Query: 541 AGAAAFIKSVYHDWSSSMIKSALMTTATQYDNQRKFMRNSTNNPSNPHEMGAGEISPIKA 600
            GAAAFIKSV+  WSSSMIKSALMTTAT Y+N RK + NS+N+ + PHEMG GEI+P++A
Sbjct: 575 TGAAAFIKSVHKKWSSSMIKSALMTTATNYNNMRKPLTNSSNSIAGPHEMGVGEINPLRA 634

Query: 601 LNPGLVFETTNEDYLRFLCYYGYSNKVIRSVSKQNFSCPKTSKEDLISNVNYPSISIGKL 660
           LNPGLVFET  EDYLRFLCY+GYS K+IRS+S+ NF+CPK S EDLIS+VNYPSISI  L
Sbjct: 635 LNPGLVFETDVEDYLRFLCYFGYSQKIIRSISETNFNCPKNSSEDLISSVNYPSISISTL 694

Query: 661 DSKQAAKVIERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGK 720
             +Q AKVI RTVTNVG  +ATY AKV + +GL+V+V P K+VFSE V+++T+KVSFYGK
Sbjct: 695 KRQQKAKVITRTVTNVGYLNATYTAKVRAPQGLVVEVIPNKLVFSEGVQRMTYKVSFYGK 754

Query: 721 EARSGYNFGTITWRDTAHSVRTFFAVNV 745
           EA  GYNFG++TW D  H V T FAV V
Sbjct: 755 EAHGGYNFGSLTWLDGHHYVHTVFAVKV 777

BLAST of Cla022052 vs. TrEMBL
Match: A0A0B2PRM9_GLYSO (Subtilisin-like protease OS=Glycine soja GN=glysoja_040552 PE=4 SV=1)

HSP 1 Score: 1019.6 bits (2635), Expect = 1.9e-294
Identity = 520/748 (69.52%), Postives = 604/748 (80.75%), Query Frame = 1

Query: 1   MGNGRINNEEDEQTTAELDYLQLLSSVIPRKEKERGSRDVVHQYNHAFKGFSAMLTEEEA 60
           MGN   N    E   AE  +LQLLS +IP +E ER +  + H ++HAF GFSAMLTE EA
Sbjct: 35  MGNSSPNKIGVESQIAESSHLQLLSLIIPSEESERIA--LTHHFSHAFSGFSAMLTESEA 94

Query: 61  SSLSGIDGIVSVFPDPTLHLHTTRSWDFLDSISGLRP--PTPLPPPHSYPSTSDVIVGVI 120
           S+LSG DG+VSVFPDP L LHTTRSWDFL+S  G++P      P  H +PST D+I+G+I
Sbjct: 95  SALSGHDGVVSVFPDPVLELHTTRSWDFLESDLGMKPYYSHGTPTLHKHPST-DIIIGLI 154

Query: 121 DTGIWPESQSFNDEGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNVIELNG-NDSH 180
           DTGIWPES SF DEGIGEIPSKWKGVCME  DFKKSNCNRKLIGARYY +   +G N +H
Sbjct: 155 DTGIWPESPSFRDEGIGEIPSKWKGVCMEGRDFKKSNCNRKLIGARYYKIQATSGDNQTH 214

Query: 181 VGAPKGTPRDSLGHGTHTASIAAGARVPNASYFGLARGTARGGGIPSTRIASYKVCAGVG 240
           + A KG+PRD++GHGTHTASIAAG  V NASYFGLA+GTARGG  PSTRIA+YK C+  G
Sbjct: 215 IEAAKGSPRDTVGHGTHTASIAAGVHVNNASYFGLAKGTARGGS-PSTRIAAYKTCSDEG 274

Query: 241 CSGAAILKAIDDAVRDGVDIISISIGIGSPLFQSDYLNDPIAIGAFHAQLMGVLVVCSGG 300
           CSGA ILKAIDDAV+DGVDIISISIG+ S LFQSD+L+DPIAIGAFHA+  GVLVVCS G
Sbjct: 275 CSGATILKAIDDAVKDGVDIISISIGLSS-LFQSDFLSDPIAIGAFHAEQKGVLVVCSAG 334

Query: 301 NDGPDPNTVGNVAPWIFTVAASNIDRDFQSSVVLGNGKTFHGTAINLSNLTSSKTYPLVF 360
           NDGPDP TV N APWIFT+AASNIDR+FQS++VLGNGK   GT IN SNLT SK + LVF
Sbjct: 335 NDGPDPFTVVNSAPWIFTIAASNIDRNFQSTIVLGNGKYLQGTGINFSNLTHSKMHRLVF 394

Query: 361 GKDAAAKFTPVSEARNCYPGSLDRSKVAGKIVVCASDDFSTSRTIKELVVQDAKAMGLIL 420
           G+  AAKF P SEARNC+PGSLD +K AG IVVC +DD S SR IK+LVVQDA+A+G+IL
Sbjct: 395 GEQVAAKFVPASEARNCFPGSLDFNKTAGNIVVCVNDDPSVSRRIKKLVVQDARAVGIIL 454

Query: 421 INEASKTVPMDSNIFPFTQIGNSDGLQILEYINSTKNPTATILRTVEVQRLKPAPIVAYF 480
           INE +K  P D+ +FPFTQ+GN +G QIL+YINSTKNPTATIL T EV R KP+PIVA F
Sbjct: 455 INENNKDAPFDAGVFPFTQVGNLEGHQILKYINSTKNPTATILPTTEVARSKPSPIVASF 514

Query: 481 SSRGPSPLTENILKPDITAPGVSILAAMIPKS-DGDSGPIGKKPSNYAMRSGTSMACPHV 540
           SSRGPS LTENILKPD+ APGV ILAA+IPKS +  S PIGKKPS YA++SGTSMACPHV
Sbjct: 515 SSRGPSSLTENILKPDVMAPGVGILAAVIPKSKEPGSVPIGKKPSLYAIKSGTSMACPHV 574

Query: 541 AGAAAFIKSVYHDWSSSMIKSALMTTATQYDNQRKFMRNSTNNPSNPHEMGAGEISPIKA 600
            GAAAFIKSV+  WSSSMIKSALMTTAT Y+N RK + NS+N+ + PHEMG GEI+P++A
Sbjct: 575 TGAAAFIKSVHKKWSSSMIKSALMTTATNYNNMRKPLTNSSNSIAGPHEMGVGEINPLRA 634

Query: 601 LNPGLVFETTNEDYLRFLCYYGYSNKVIRSVSKQNFSCPKTSKEDLISNVNYPSISIGKL 660
           LNPGLVFET  EDYLRFLCY+GYS K+IRS+S+ NF+CPK S EDLIS+VNYPSISI  L
Sbjct: 635 LNPGLVFETDVEDYLRFLCYFGYSQKIIRSISETNFNCPKNSSEDLISSVNYPSISISTL 694

Query: 661 DSKQAAKVIERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGK 720
             +Q AKVI RTVTNVG  +ATY AKV + +GL+V+V P K+VFSE V+++T+KVSFYGK
Sbjct: 695 KRQQKAKVITRTVTNVGYLNATYTAKVRAPQGLVVEVIPNKLVFSEGVQRMTYKVSFYGK 754

Query: 721 EARSGYNFGTITWRDTAHSVRTFFAVNV 745
           EA  GYNFG++TW D  H V T FAV V
Sbjct: 755 EAHGGYNFGSLTWLDGHHYVHTVFAVKV 777

BLAST of Cla022052 vs. NCBI nr
Match: gi|659089241|ref|XP_008445401.1| (PREDICTED: subtilisin-like protease [Cucumis melo])

HSP 1 Score: 1398.3 bits (3618), Expect = 0.0e+00
Identity = 695/749 (92.79%), Postives = 722/749 (96.40%), Query Frame = 1

Query: 1   MGNGRINNEEDEQTTAELDYLQLLSSVIP---RKEKERGSRDVV-HQYNHAFKGFSAMLT 60
           MGNG    E++E    ELDYLQLLSSVIP    KEKE GSRDVV HQY+HAFKGFSAMLT
Sbjct: 1   MGNG----EDEETAGGELDYLQLLSSVIPSRKEKEKENGSRDVVIHQYHHAFKGFSAMLT 60

Query: 61  EEEASSLSGIDGIVSVFPDPTLHLHTTRSWDFLDSISGLRPPTPLPPPHSYPSTSDVIVG 120
           EEEASSLSGIDGIVSVFPDPTL LHTTRSWDFLDSISGLRPPTPLPPPH YPS+SDVIVG
Sbjct: 61  EEEASSLSGIDGIVSVFPDPTLQLHTTRSWDFLDSISGLRPPTPLPPPHFYPSSSDVIVG 120

Query: 121 VIDTGIWPESQSFNDEGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNVIELNGNDS 180
           VIDTGIWPESQSFNDEG+GEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNV+ELNGNDS
Sbjct: 121 VIDTGIWPESQSFNDEGVGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNVVELNGNDS 180

Query: 181 HVGAPKGTPRDSLGHGTHTASIAAGARVPNASYFGLARGTARGGGIPSTRIASYKVCAGV 240
           HVG PKGTPRDSLGHG+HT+SIAAGARVPNASYFGLARGTARGGG PSTRIASYKVCAGV
Sbjct: 181 HVGPPKGTPRDSLGHGSHTSSIAAGARVPNASYFGLARGTARGGGTPSTRIASYKVCAGV 240

Query: 241 GCSGAAILKAIDDAVRDGVDIISISIGIGSPLFQSDYLNDPIAIGAFHAQLMGVLVVCSG 300
           GCSGAAILKAIDDA++DGVDIISISIGIGSPLFQSDYLNDPIAIGA HAQL GVLVVCS 
Sbjct: 241 GCSGAAILKAIDDAIKDGVDIISISIGIGSPLFQSDYLNDPIAIGALHAQLRGVLVVCSA 300

Query: 301 GNDGPDPNTVGNVAPWIFTVAASNIDRDFQSSVVLGNGKTFHGTAINLSNLTSSKTYPLV 360
           GNDGPDPNTVGNVAPWIFTVAASNIDRDFQS+VVLGNGKTFHGT INLSNLTSSKTYPLV
Sbjct: 301 GNDGPDPNTVGNVAPWIFTVAASNIDRDFQSTVVLGNGKTFHGTGINLSNLTSSKTYPLV 360

Query: 361 FGKDAAAKFTPVSEARNCYPGSLDRSKVAGKIVVCASDDFSTSRTIKELVVQDAKAMGLI 420
           FGKDAAAKFTP SEARNCYPGSLDRSKVAGKIVVCASDDFSTSRTIKELVVQDAKA+GLI
Sbjct: 361 FGKDAAAKFTPTSEARNCYPGSLDRSKVAGKIVVCASDDFSTSRTIKELVVQDAKAVGLI 420

Query: 421 LINEASKTVPMDSNIFPFTQIGNSDGLQILEYINSTKNPTATILRTVEVQRLKPAPIVAY 480
           LINEASK+VPMDSNIFPFTQIGNS+GLQILEYINSTKNPTATIL+TVEV+RLKPAP VAY
Sbjct: 421 LINEASKSVPMDSNIFPFTQIGNSEGLQILEYINSTKNPTATILKTVEVRRLKPAPTVAY 480

Query: 481 FSSRGPSPLTENILKPDITAPGVSILAAMIPKSDGDSGPIGKKPSNYAMRSGTSMACPHV 540
           FSSRGPSPLTENILKPDITAPGVSILAAMIPKSDGD+GPIGKKPSNYAM+SGTSMACPHV
Sbjct: 481 FSSRGPSPLTENILKPDITAPGVSILAAMIPKSDGDTGPIGKKPSNYAMKSGTSMACPHV 540

Query: 541 AGAAAFIKSVYHDWSSSMIKSALMTTATQYDNQRKFMRNSTNNPSNPHEMGAGEISPIKA 600
           AGAAAFIKSVYHDWSSSMIKSALMTTATQYDNQRK+MRN+TNNPSNPHEMGAGEISPIKA
Sbjct: 541 AGAAAFIKSVYHDWSSSMIKSALMTTATQYDNQRKYMRNTTNNPSNPHEMGAGEISPIKA 600

Query: 601 LNPGLVFETTNEDYLRFLCYYGYSNKVIRSVSKQNFSCPKTSKEDLISNVNYPSISIGKL 660
           LNPGLVFETTNEDYL FLCYYGYSNKV+RS+ KQNF+CPKTSKEDLISNVNYPSISIGKL
Sbjct: 601 LNPGLVFETTNEDYLLFLCYYGYSNKVLRSMLKQNFTCPKTSKEDLISNVNYPSISIGKL 660

Query: 661 DSKQAAKVIERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGK 720
           D KQAAKV+ERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGK
Sbjct: 661 DRKQAAKVVERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGK 720

Query: 721 EARSGYNFGTITWRDTAHSVRTFFAVNVI 746
           EAR+GYNFG+ITWRDTAHSVRTFFAVNV+
Sbjct: 721 EARNGYNFGSITWRDTAHSVRTFFAVNVV 745

BLAST of Cla022052 vs. NCBI nr
Match: gi|778715825|ref|XP_011657463.1| (PREDICTED: CO(2)-response secreted protease-like [Cucumis sativus])

HSP 1 Score: 1381.7 bits (3575), Expect = 0.0e+00
Identity = 694/748 (92.78%), Postives = 720/748 (96.26%), Query Frame = 1

Query: 1   MGNGRINNEEDEQTTA-ELDYLQLLSSVIP-RKEKERGSRDVV-HQYNHAFKGFSAMLTE 60
           MGNG     EDEQT   ELDY QLLSSVIP RKEKE GSR VV HQY+HAFKGFSAMLTE
Sbjct: 47  MGNG-----EDEQTAGDELDYFQLLSSVIPSRKEKESGSRAVVIHQYHHAFKGFSAMLTE 106

Query: 61  EEASSLSGIDGIVSVFPDPTLHLHTTRSWDFLDSISGLRPPTPLPPPHSYPSTSDVIVGV 120
           EEASSLSGIDGIVSVFPDPTL LHTTRSWDFLDSISGLRPPTPLPPPHSYPS+SDVIVGV
Sbjct: 107 EEASSLSGIDGIVSVFPDPTLQLHTTRSWDFLDSISGLRPPTPLPPPHSYPSSSDVIVGV 166

Query: 121 IDTGIWPESQSFNDEGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNVIELNGNDSH 180
           IDTGI+PESQSFNDEGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNV+ELNGNDSH
Sbjct: 167 IDTGIFPESQSFNDEGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNVVELNGNDSH 226

Query: 181 VGAPKGTPRDSLGHGTHTASIAAGARVPNASYFGLARGTARGGGIPSTRIASYKVCAGVG 240
           VG PKGTPRDS GHGTHT+SIAAGARVPNASYFGLARGTARGGG PSTRIASYKVCAGVG
Sbjct: 227 VGPPKGTPRDSHGHGTHTSSIAAGARVPNASYFGLARGTARGGGSPSTRIASYKVCAGVG 286

Query: 241 CSGAAILKAIDDAVRDGVDIISISIGIGSPLFQSDYLNDPIAIGAFHAQLMGVLVVCSGG 300
           CSGAAILKAIDDA++DGVDIISISIGIGSPLFQSDYLNDPIAIGA HAQLMGVLVVCS G
Sbjct: 287 CSGAAILKAIDDAIKDGVDIISISIGIGSPLFQSDYLNDPIAIGALHAQLMGVLVVCSAG 346

Query: 301 NDGPDPNTVGNVAPWIFTVAASNIDRDFQSSVVLGNGKTFHGTAINLSNLTSSKTYPLVF 360
           NDGPDPNTVGNVAPWIFTVAASNIDRDFQS+VVLGNGKTF GTAINLSNLTSSKTYPLVF
Sbjct: 347 NDGPDPNTVGNVAPWIFTVAASNIDRDFQSTVVLGNGKTFPGTAINLSNLTSSKTYPLVF 406

Query: 361 GKDAAAKFTPVSEARNCYPGSLDRSKVAGKIVVCASDDFSTSRTIKELVVQDAKAMGLIL 420
           G+DAAAKFTP SEARNC+PGSLDRSKVAGKIVVCASDDFSTSR IKELVVQDAKAMGLIL
Sbjct: 407 GQDAAAKFTPTSEARNCFPGSLDRSKVAGKIVVCASDDFSTSRIIKELVVQDAKAMGLIL 466

Query: 421 INEASKTVPMDSNIFPFTQIGNSDGLQILEYINSTKNPTATILRTVEVQRLKPAPIVAYF 480
           INEASK+VPMDSNIFPFTQIGNS+GLQILEYINSTKNPTATIL+TVEV+RLKPAP VAYF
Sbjct: 467 INEASKSVPMDSNIFPFTQIGNSEGLQILEYINSTKNPTATILKTVEVRRLKPAPTVAYF 526

Query: 481 SSRGPSPLTENILKPDITAPGVSILAAMIPKSDGDSGPIGKKPSNYAMRSGTSMACPHVA 540
           SSRGPSPLTENILKPDITAPGVSILAAMIPKSD D+GPIGKKPSNYAM+SGTSMACPHVA
Sbjct: 527 SSRGPSPLTENILKPDITAPGVSILAAMIPKSDEDTGPIGKKPSNYAMKSGTSMACPHVA 586

Query: 541 GAAAFIKSVYHDWSSSMIKSALMTTATQYDNQRKFMRNSTNNPSNPHEMGAGEISPIKAL 600
           GAAAFIKSVYHDWSSSMIKSALMTTATQYDNQRK+MRN+T+NPSNPHEMGAGEISPIKAL
Sbjct: 587 GAAAFIKSVYHDWSSSMIKSALMTTATQYDNQRKYMRNTTDNPSNPHEMGAGEISPIKAL 646

Query: 601 NPGLVFETTNEDYLRFLCYYGYSNKVIRSVSKQNFSCPKTSKEDLISNVNYPSISIGKLD 660
           NPGLVFETTNED+L FLCYYGYSNKVIRS+ KQNF+CPKTSKEDLISNVNYPSISI KLD
Sbjct: 647 NPGLVFETTNEDHLLFLCYYGYSNKVIRSMLKQNFTCPKTSKEDLISNVNYPSISIAKLD 706

Query: 661 SKQAAKVIERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGKE 720
            KQAAKV+ERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGKE
Sbjct: 707 RKQAAKVVERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGKE 766

Query: 721 ARSGYNFGTITWRDTAHSVRTFFAVNVI 746
           AR+GYNFG+ITWRDTAHSVRTFFAVNV+
Sbjct: 767 ARNGYNFGSITWRDTAHSVRTFFAVNVV 789

BLAST of Cla022052 vs. NCBI nr
Match: gi|470145117|ref|XP_004308189.1| (PREDICTED: CO(2)-response secreted protease-like [Fragaria vesca subsp. vesca])

HSP 1 Score: 1030.0 bits (2662), Expect = 2.0e-297
Identity = 514/745 (68.99%), Postives = 604/745 (81.07%), Query Frame = 1

Query: 1   MGNGRINNEEDEQTTAELDYLQLLSSVIPRKEKERGSRDVVHQYNHAFKGFSAMLTEEEA 60
           MG+   +    E  +AE  YL++LSS+IP  ++ER S  ++H+YNHAF+GFSAMLTE EA
Sbjct: 30  MGSSLSDGNRREAESAESAYLEMLSSIIPSHQRERTS--IIHKYNHAFRGFSAMLTESEA 89

Query: 61  SSLSGIDGIVSVFPDPTLHLHTTRSWDFLDSISGLRPPTPLPPPHSYPSTSDVIVGVIDT 120
           S+LSG   +VS+FPD  L LHTTRSWDF+   +G  P      P    ++ DVI+GVIDT
Sbjct: 90  SALSGHADVVSIFPDSILELHTTRSWDFIQE-AGAEPGGVSYHPRPTTTSDDVIIGVIDT 149

Query: 121 GIWPESQSFNDEGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNV-IELNGNDSHVG 180
           GIWPES SFNDEGIG +PS+WKGVCME PDFKKSNCNRKLIGARYYNV +   GN SH+ 
Sbjct: 150 GIWPESPSFNDEGIGAVPSRWKGVCMEGPDFKKSNCNRKLIGARYYNVEMTRIGNQSHLA 209

Query: 181 APKGTPRDSLGHGTHTASIAAGARVPNASYFGLARGTARGGGIPSTRIASYKVCAGVGCS 240
           AP G+PRDS+GHGTHT S AAGARVP+ASY+GLA+GT++GG +PS RIA YK C+ VGCS
Sbjct: 210 APNGSPRDSVGHGTHTTSTAAGARVPDASYYGLAQGTSKGG-LPSARIACYKACSDVGCS 269

Query: 241 GAAILKAIDDAVRDGVDIISISIGIGSPLFQSDYLNDPIAIGAFHAQLMGVLVVCSGGND 300
           GA ILKAIDDA+RDGVD+ISISIG+ S LFQ DYLNDPIAIGAFHA+ MGV+V+CSGGND
Sbjct: 270 GATILKAIDDAIRDGVDMISISIGLSS-LFQPDYLNDPIAIGAFHAEQMGVMVICSGGND 329

Query: 301 GPDPNTVGNVAPWIFTVAASNIDRDFQSSVVLGNGKTFHGTAINLSNLTSSKTYPLVFGK 360
           GPDP TV N APWIFTVAASNIDRDFQSSVVLGNG+TF G+AIN SNLT S+TYPLVFGK
Sbjct: 330 GPDPYTVVNTAPWIFTVAASNIDRDFQSSVVLGNGRTFTGSAINFSNLTRSRTYPLVFGK 389

Query: 361 DAAAKFTPVSEARNCYPGSLDRSKVAGKIVVCASDDFSTSRTIKELVVQDAKAMGLILIN 420
           DAAA FTPVSEA NCYPGS D  KVAGKIVVC +DD + SR IK+LVV DAKA GLILI+
Sbjct: 390 DAAANFTPVSEASNCYPGSFDPKKVAGKIVVCVADDQTVSRKIKKLVVDDAKAKGLILID 449

Query: 421 EASKTVPMDSNIFPFTQIGNSDGLQILEYINSTKNPTATILRTVEVQRLKPAPIVAYFSS 480
           E  KTVP DS +FPF  +G++ G QIL YINSTKNP ATIL TV+V R +PAP VAYFSS
Sbjct: 450 EEEKTVPFDSGVFPFVNVGDAVGSQILNYINSTKNPRATILPTVDVHRYRPAPTVAYFSS 509

Query: 481 RGPSPLTENILKPDITAPGVSILAAMIPKSDGDSGPIGKKPSNYAMRSGTSMACPHVAGA 540
           RGP+ LTENILKPDI APGV+ILAA+ PK++  S P G+KPS ++++SGTSMACPHV GA
Sbjct: 510 RGPAQLTENILKPDIMAPGVAILAAICPKNEPGSVPDGEKPSKFSIKSGTSMACPHVTGA 569

Query: 541 AAFIKSVYHDWSSSMIKSALMTTATQYDNQRKFMRNSTNNPSNPHEMGAGEISPIKALNP 600
           AAFIKSV+  W+SSMIKSALMTTAT Y+N +K + NSTNN +NPHE+G GEI+PIKALNP
Sbjct: 570 AAFIKSVHRGWTSSMIKSALMTTATMYNNMKKPLINSTNNYANPHEVGVGEINPIKALNP 629

Query: 601 GLVFETTNEDYLRFLCYYGYSNKVIRSVSKQNFSCPKTSKEDLISNVNYPSISIGKLDSK 660
           GLVFET  E+YL FLCYYGY  K IR +S   F+CPK S E LISN+NYPSIS+ KL+  
Sbjct: 630 GLVFETITENYLEFLCYYGYKEKDIRLMSNTKFNCPKVSTEKLISNINYPSISVSKLNRH 689

Query: 661 QAAKVIERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGKEAR 720
           Q    I+RT TNVGAP++TYIAKV++  GL+VKV P KIVF+E V+KV+F+VSFYGKEA 
Sbjct: 690 QPVMTIKRTATNVGAPNSTYIAKVNAPVGLVVKVLPEKIVFAEGVRKVSFQVSFYGKEAP 749

Query: 721 SGYNFGTITWRDTAHSVRTFFAVNV 745
           +GY+FG+ITW D  HSV T F+VNV
Sbjct: 750 TGYSFGSITWFDGRHSVNTVFSVNV 769

BLAST of Cla022052 vs. NCBI nr
Match: gi|645251529|ref|XP_008231725.1| (PREDICTED: subtilisin-like protease [Prunus mume])

HSP 1 Score: 1026.2 bits (2652), Expect = 2.9e-296
Identity = 520/744 (69.89%), Postives = 605/744 (81.32%), Query Frame = 1

Query: 2   GNGRINNEEDEQTTAELDYLQLLSSVIPRKEKERGSRDVVHQYNHAFKGFSAMLTEEEAS 61
           GNGR+   ED    AE  YLQ+LSS+IP  E ER S  ++H+YNHAF+GFSAMLTE EAS
Sbjct: 42  GNGRVLGAED---AAESTYLQMLSSIIPSHEIERIS--IIHKYNHAFRGFSAMLTETEAS 101

Query: 62  SLSGIDGIVSVFPDPTLHLHTTRSWDFLDSISGLRPPTPLPPPHSYPSTSDVIVGVIDTG 121
            LSG D +VS+FPD  L LHTTRSWDFL+S SG  P       +    +SDVI+G+IDTG
Sbjct: 102 ILSGHDDVVSIFPDSILELHTTRSWDFLESESGRLPSNK----YQRGLSSDVIIGMIDTG 161

Query: 122 IWPESQSFNDEGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNVI-ELNGNDSHVGA 181
           IWPES SFNDEGIG +PS+WKGVCME  DF+KSNCNRKLIGARYYNV    +GN S +  
Sbjct: 162 IWPESSSFNDEGIGAVPSRWKGVCMEGSDFRKSNCNRKLIGARYYNVPWTRDGNQSSLAR 221

Query: 182 PKGTPRDSLGHGTHTASIAAGARVPNASYFGLARGTARGGGIPSTRIASYKVCAGVGCSG 241
            KG+PRDS+GHGTHTASIAAG +V NASY+GLA GTA+GG +PS RIA YK C+ VGCSG
Sbjct: 222 TKGSPRDSVGHGTHTASIAAGVQVLNASYYGLALGTAKGG-LPSARIACYKACSDVGCSG 281

Query: 242 AAILKAIDDAVRDGVDIISISIGIGSPLFQSDYLNDPIAIGAFHAQLMGVLVVCSGGNDG 301
           A ILKAIDDA+RDGVDIISISIGI S LFQSDYLNDPIAIGAFHA+ MGV+V+CSGGNDG
Sbjct: 282 ATILKAIDDAIRDGVDIISISIGISS-LFQSDYLNDPIAIGAFHAEQMGVMVICSGGNDG 341

Query: 302 PDPNTVGNVAPWIFTVAASNIDRDFQSSVVLGNGKTFHGTAINLSNLTSSKTYPLVFGKD 361
           PDP T+ N APWIFTVAASNIDRDFQS++VLGNGK F G+AIN SNLT S+TYPLVFGKD
Sbjct: 342 PDPYTIVNTAPWIFTVAASNIDRDFQSNIVLGNGKNFTGSAINFSNLTRSRTYPLVFGKD 401

Query: 362 AAAKFTPVSEARNCYPGSLDRSKVAGKIVVCASDDFSTSRTIKELVVQDAKAMGLILINE 421
            A  +TPVSEARNCYPGSLD  KV GKIVVC  DD + SR IK+LVV+DAKA GLILI+E
Sbjct: 402 VAGYYTPVSEARNCYPGSLDPKKVVGKIVVCVDDDPAVSRKIKKLVVEDAKAKGLILIDE 461

Query: 422 ASKTVPMDSNIFPFTQIGNSDGLQILEYINSTKNPTATILRTVEVQRLKPAPIVAYFSSR 481
           A K+VP DS IFP+T++GN  G QIL+YINSTKNPTATIL TV+V R +PAP VAYFSSR
Sbjct: 462 AEKSVPFDSGIFPYTEVGNIAGFQILQYINSTKNPTATILPTVDVPRYRPAPAVAYFSSR 521

Query: 482 GPSPLTENILKPDITAPGVSILAAMIPKSDGDSGPIGKKPSNYAMRSGTSMACPHVAGAA 541
           GP+ LTENILKPDI APGV+ILAA+ PK++  + P GKKPS ++++SGTSMACPHV GAA
Sbjct: 522 GPAELTENILKPDIMAPGVAILAAVAPKNETGTVPNGKKPSTFSIKSGTSMACPHVTGAA 581

Query: 542 AFIKSVYHDWSSSMIKSALMTTATQYDNQRKFMRNSTNNPSNPHEMGAGEISPIKALNPG 601
           AFIKSV+  W+SSMIKSALMTTAT ++N +K + NS+N  +NPHE+G GEI+P+KAL+PG
Sbjct: 582 AFIKSVHRRWTSSMIKSALMTTATVFNNMKKPLTNSSNTFANPHEVGVGEINPLKALSPG 641

Query: 602 LVFETTNEDYLRFLCYYGYSNKVIRSVSKQNFSCPKTSKEDLISNVNYPSISIGKLDSKQ 661
           LVFETT E+YL FLCYYGY  K IRS+S   F CPK S ++LISNVNYPSISI KL   Q
Sbjct: 642 LVFETTTENYLEFLCYYGYPEKNIRSMSNTKFVCPKISIDELISNVNYPSISISKLHRHQ 701

Query: 662 AAKVIERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGKEARS 721
            AK I+RT TNV A ++TYIAKVH+  GLIVKV P K+VF+E V++V+F+VSFYGK+A  
Sbjct: 702 PAKTIQRTATNVAALNSTYIAKVHAPVGLIVKVLPEKLVFAEGVRRVSFQVSFYGKKAPR 761

Query: 722 GYNFGTITWRDTAHSVRTFFAVNV 745
           GYNFGTITW D  HSVRT F+VNV
Sbjct: 762 GYNFGTITWFDGRHSVRTVFSVNV 774

BLAST of Cla022052 vs. NCBI nr
Match: gi|596045909|ref|XP_007220237.1| (hypothetical protein PRUPE_ppa001918mg [Prunus persica])

HSP 1 Score: 1025.8 bits (2651), Expect = 3.8e-296
Identity = 518/744 (69.62%), Postives = 607/744 (81.59%), Query Frame = 1

Query: 2   GNGRINNEEDEQTTAELDYLQLLSSVIPRKEKERGSRDVVHQYNHAFKGFSAMLTEEEAS 61
           GNGR+   ED    AE  YLQ+LSS+IP  E ER S  ++H+YNHAF+GFSAMLTE EAS
Sbjct: 8   GNGRVLGAED---AAESAYLQMLSSIIPSHEIERLS--IIHKYNHAFRGFSAMLTETEAS 67

Query: 62  SLSGIDGIVSVFPDPTLHLHTTRSWDFLDSISGLRPPTPLPPPHSYPSTSDVIVGVIDTG 121
            LSG D +VS+FPD  L LHTTRSWDFL++ SG  P       +    +SDVI+G+IDTG
Sbjct: 68  VLSGHDDVVSIFPDSILELHTTRSWDFLEAESGRLPSNK----YQRGLSSDVIIGMIDTG 127

Query: 122 IWPESQSFNDEGIGEIPSKWKGVCMEAPDFKKSNCNRKLIGARYYNVI-ELNGNDSHVGA 181
           IWPES SFNDEGIG +PS+WKGVCME  DF+KSNCNRKLIGARYYNV    +GN S +  
Sbjct: 128 IWPESSSFNDEGIGAVPSRWKGVCMEGSDFRKSNCNRKLIGARYYNVPWTRDGNQSSLAR 187

Query: 182 PKGTPRDSLGHGTHTASIAAGARVPNASYFGLARGTARGGGIPSTRIASYKVCAGVGCSG 241
            KG+PRDS+GHGTHTAS AAG +V NASY+GLA+GTARGG +PS RIA YK C+ VGCSG
Sbjct: 188 TKGSPRDSVGHGTHTASTAAGVQVLNASYYGLAQGTARGG-LPSARIACYKACSDVGCSG 247

Query: 242 AAILKAIDDAVRDGVDIISISIGIGSPLFQSDYLNDPIAIGAFHAQLMGVLVVCSGGNDG 301
           A ILKAIDDA+RDGVDIISISIG+ S LFQSDYLNDPIAIGAFHA+ MGV+V+CSGGNDG
Sbjct: 248 ATILKAIDDAIRDGVDIISISIGMSS-LFQSDYLNDPIAIGAFHAEQMGVMVICSGGNDG 307

Query: 302 PDPNTVGNVAPWIFTVAASNIDRDFQSSVVLGNGKTFHGTAINLSNLTSSKTYPLVFGKD 361
           PDP T+ N APWIFTVAASNIDRDFQS++VLGNGK F G+AIN SNLT S+TYPLVFGKD
Sbjct: 308 PDPYTIVNTAPWIFTVAASNIDRDFQSNIVLGNGKNFTGSAINFSNLTRSRTYPLVFGKD 367

Query: 362 AAAKFTPVSEARNCYPGSLDRSKVAGKIVVCASDDFSTSRTIKELVVQDAKAMGLILINE 421
            A  +TPVSEARNCYPGSLD  KV GKIVVC  DD + SR IK+LVV+DAKA GLILI+E
Sbjct: 368 VAGYYTPVSEARNCYPGSLDPKKVVGKIVVCVDDDPAVSRKIKKLVVEDAKAKGLILIDE 427

Query: 422 ASKTVPMDSNIFPFTQIGNSDGLQILEYINSTKNPTATILRTVEVQRLKPAPIVAYFSSR 481
           A K+VP DS IFP+T++GN  G QIL+YINSTKNPTATIL TV+V R +PAP VAYFSSR
Sbjct: 428 AEKSVPFDSGIFPYTEVGNIAGFQILQYINSTKNPTATILPTVDVPRYRPAPAVAYFSSR 487

Query: 482 GPSPLTENILKPDITAPGVSILAAMIPKSDGDSGPIGKKPSNYAMRSGTSMACPHVAGAA 541
           GP+ LTENILKPDI APGV+ILAA+ PK++  + P GKKPS ++++SGTSMACPHV GAA
Sbjct: 488 GPAELTENILKPDIMAPGVAILAAIAPKNETGTVPNGKKPSTFSIKSGTSMACPHVTGAA 547

Query: 542 AFIKSVYHDWSSSMIKSALMTTATQYDNQRKFMRNSTNNPSNPHEMGAGEISPIKALNPG 601
           AFIKSV+  W+SSMIKSALMTTAT ++N +K + NS+N  +NPHE+G GEI+P+KAL+PG
Sbjct: 548 AFIKSVHRRWTSSMIKSALMTTATVFNNMKKPLTNSSNTFANPHEVGVGEINPLKALSPG 607

Query: 602 LVFETTNEDYLRFLCYYGYSNKVIRSVSKQNFSCPKTSKEDLISNVNYPSISIGKLDSKQ 661
           LVFETT E+YL FLCYYGY  K IRS+S   F CPK+S ++LISNVNYPSISI KL+  Q
Sbjct: 608 LVFETTTENYLEFLCYYGYPEKNIRSMSNTKFICPKSSIDELISNVNYPSISISKLNRHQ 667

Query: 662 AAKVIERTVTNVGAPDATYIAKVHSSEGLIVKVNPRKIVFSEKVKKVTFKVSFYGKEARS 721
            AK I+RT TNV A ++TYIAKVH+  GLIVKV P K+VF+E V++V+F+VSFYGKEA  
Sbjct: 668 PAKTIQRTATNVAALNSTYIAKVHAPAGLIVKVLPEKLVFAEGVRRVSFQVSFYGKEAPR 727

Query: 722 GYNFGTITWRDTAHSVRTFFAVNV 745
           GYNFG+ITW D  HSVRT F+VNV
Sbjct: 728 GYNFGSITWFDGRHSVRTVFSVNV 740

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
CRSP_ARATH6.3e-18248.03CO(2)-response secreted protease OS=Arabidopsis thaliana GN=CRSP PE=2 SV=1[more]
SBT51_ARATH1.3e-16343.78Subtilisin-like protease SBT5.1 OS=Arabidopsis thaliana GN=SBT5.1 PE=3 SV=1[more]
SBT4C_ARATH2.5e-14641.84Subtilisin-like protease SBT4.12 OS=Arabidopsis thaliana GN=SBT4.12 PE=2 SV=1[more]
SBT44_ARATH1.0e-14442.20Subtilisin-like protease SBT4.4 OS=Arabidopsis thaliana GN=SBT4.4 PE=2 SV=1[more]
AIR3_ARATH1.5e-14341.08Subtilisin-like protease SBT5.3 OS=Arabidopsis thaliana GN=AIR3 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KJ05_CUCSA0.0e+0092.78Uncharacterized protein OS=Cucumis sativus GN=Csa_6G401370 PE=4 SV=1[more]
M5X7H3_PRUPE2.7e-29669.62Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001918mg PE=4 SV=1[more]
K7MD65_SOYBN5.9e-29669.57Uncharacterized protein OS=Glycine max GN=GLYMA_15G224500 PE=4 SV=1[more]
I1M0I4_SOYBN8.6e-29569.65Uncharacterized protein OS=Glycine max GN=GLYMA_13G187000 PE=4 SV=1[more]
A0A0B2PRM9_GLYSO1.9e-29469.52Subtilisin-like protease OS=Glycine soja GN=glysoja_040552 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|659089241|ref|XP_008445401.1|0.0e+0092.79PREDICTED: subtilisin-like protease [Cucumis melo][more]
gi|778715825|ref|XP_011657463.1|0.0e+0092.78PREDICTED: CO(2)-response secreted protease-like [Cucumis sativus][more]
gi|470145117|ref|XP_004308189.1|2.0e-29768.99PREDICTED: CO(2)-response secreted protease-like [Fragaria vesca subsp. vesca][more]
gi|645251529|ref|XP_008231725.1|2.9e-29669.89PREDICTED: subtilisin-like protease [Prunus mume][more]
gi|596045909|ref|XP_007220237.1|3.8e-29669.62hypothetical protein PRUPE_ppa001918mg [Prunus persica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR000209Peptidase_S8/S53_dom
IPR003137PA_domain
IPR010259S8pro/Inhibitor_I9
IPR015500Peptidase_S8_subtilisin-rel
IPR023828Peptidase_S8_Ser-AS
Vocabulary: Molecular Function
TermDefinition
GO:0004252serine-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0032440 2-alkenal reductase [NAD(P)] activity
molecular_function GO:0004252 serine-type endopeptidase activity
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0008233 peptidase activity
molecular_function GO:0008236 serine-type peptidase activity
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
WMU75997watermelon EST collection version 2.0transcribed_cluster
WMU78867watermelon EST collection version 2.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla022052Cla022052.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
WMU75997WMU75997transcribed_cluster
WMU78867WMU78867transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000209Peptidase S8/S53 domainGENE3DG3DSA:3.40.50.200coord: 111..331
score: 3.8E-71coord: 470..597
score: 3.8
IPR000209Peptidase S8/S53 domainPFAMPF00082Peptidase_S8coord: 111..576
score: 5.0
IPR000209Peptidase S8/S53 domainunknownSSF52743Subtilisin-likecoord: 474..601
score: 2.09E-71coord: 106..360
score: 2.09
IPR003137PA domainPFAMPF02225PAcoord: 352..445
score: 8.
IPR010259Peptidase S8 propeptide/proteinase inhibitor I9GENE3DG3DSA:3.30.70.80coord: 39..75
score: 5.
IPR010259Peptidase S8 propeptide/proteinase inhibitor I9PFAMPF05922Inhibitor_I9coord: 10..81
score: 8.9
IPR015500Peptidase S8, subtilisin-relatedPRINTSPR00723SUBTILISINcoord: 527..543
score: 6.6E-13coord: 187..200
score: 6.6E-13coord: 110..129
score: 6.6
IPR015500Peptidase S8, subtilisin-relatedPANTHERPTHR10795PROPROTEIN CONVERTASE SUBTILISIN/KEXINcoord: 1..343
score: 0.0coord: 366..744
score:
IPR023828Peptidase S8, subtilisin, Ser-active sitePROSITEPS00138SUBTILASE_SERcoord: 528..538
scor
NoneNo IPR availablePANTHERPTHR10795:SF444SUBTILISIN-LIKE SERINE ENDOPEPTIDASE FAMILY PROTEINcoord: 1..343
score: 0.0coord: 366..744
score: