CaUC06G108310 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC06G108310
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Aspartic proteinase CDR1-like
Location: Ciama_Chr06: 1972770 .. 1980593 (-)
RNA-Seq Expression: CaUC06G108310
Synteny: CaUC06G108310
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCACCCATTTTCTTTCTTATTTTCTTAATCTCCTCTGCCGTCGTCTCAGCCGCCACCACAGGCCGTGACTATGGCTTCACCGTCGAACTCATCCGCCGTGACTACCCCAAGTCCCCTATGTACAACCGATCGCAGACTCACTACCATCGCATCGCCGACGCCCTCCGCCGCTCCATCAGCCGTAACACGGCCGCGCTGACAGACACGGCGGAGGCCCCTATTTACAGCAACAGAGGCGAATACCTCATGGAATTATCCGTCGGAACGCCGCCGTTTTCGATTCTAGCTGTTGCTGACACAGGGAGCGACATCATTTGGACCCAATGCAAACCATGCAAAAATTGCTACCAACAAAACGCGCCAATGTTTACCCCGAGTAAATCGGCGACATACAAAAAACTGTCGTGCTCCTCTCCGATTTGCTTGTTTGCTGGTGAGAGTGGTTCATGTTCCTCTCAGTCTGAGTGCTTGTACTCGATTTCTTACGGCGATAGGTCCCACAGCCAAGGAGATTTTGCCGTTGATACGGTTACTATGGGGTCTACCTCTGGCCGCCCCGTGGCGTTTCCTCGTATGGCCATTGGTTGTGGTCATGACAACGCTGGCACTTTCGATGCTAATGTTTCTGGCATTGTCGGCCTTGGGCTAGGTCCGGCTTCCCTCGTCACGCAAATGGGACCCTCCGCTGGCGGAAAGTTCTCTTACTGTTTAACTCCGATTGGAAGCAATACTATCGAGTCCAGCAAACTTAACTTTGGCTCTAATGCCGTCGTCTCCGGCTCTAGCGCCGTCTCAACTCCTATATATATTAGTGGTAAACAAATTATTACTACCGAACTCAAAAGCTTAAATTGATTTATTATTATTATTATTATTATTATTATTATTATTATTTTTATTGATGCTCAATCATTCATACTGGGTGGGTGGATTTGTATATATTTAAGTTAATCTTTTGTTCATATATTTTTTAACAGATAGATTCAAAAGTTTCTACTGGCTCAAGTTAGAAGGCGTGAGCGTAGGGGAGAAAAAATTTGAATTTCCAGTCTCTTCAATATTAGGCGGAGAAGCAAACATGATCATTGACTCAGGCACGACGCTTACTTTCCTCCCCATGCATTTATACAACAACTTCTCCACCACAATTTCCAACTCGATAAACCTCCAGCGGACGAATGACCCAAATCAATTCTTAGATTACTGCTTCGCAACTACCACCGATGACTACAAAGCGCCGCCCGTCACGATGCACTTTGAAGGTGCCGATGTGCCCCTCCCCCAAGAAAACGTGTTCGTTAGGGTGTCGGACGACGTTGTTTGCTTGGCCTTCTGTCCCGGCCAGGACAACCACATTATGATCTATGGCAACATTGCCCAGAACAACTTCTTGGTTGGTTATGATATTAACACCATGTCTGTTTCTTTCAAGCCGGCAGATTGCATTGCCATGTGATTCTCACGTGCTCTTCCTGATTTATTTTAGTTTATATTTTTTAAATGTTGTATCTTCTTTGTTTTAGAGATGAAAGTATTTCTATACATGCAAAATTATTATAGTACTATTCATATATATAAATGTTGAGAAAATATTATATTTCTTACAATAGTTTTTTATTTTTTATTTTTATTTTTATTTTTATTATTGTTATTTTTTGCGTTGGTCTCTACTCTCCAATTTTTTTATATATATAGAATAATTCTAGTTTGGTCTTAAAATCTTAAGTAAGCAGTAAATTTTATAATACATCTCAACTTTTTTGTAACAAAATTATAATCAATGTGTAAAAGTTATTCGAGAATTTTCTTCATTATTACATTAGGTGTGATATATCGATAATTTATATAAAAAGATAGATATATAAAACATTATGAAATATAAATTATAGAGATATTAAAGACGGAGCGAAACTAAATATAAAATTAAAATTAGAAATTATTTGTTGGCCTGATCTTAATAAAAAATGAACCATTTGATATATATTTAGTTTCCAAAA
CGCATTGAAAAAATCTATTATCTAATATCGATATTATATTAAAAAGTGGGGTAGTTCGAAAAACAGTTTTGAACATTTGTTGTCCTCACTTTAATTTACTTATTTACAAAATATATTAAAAGTTTAATTATAAATAGTAAATAAATAAATTATTACTAGTTATTACATAATTAATTAATTATTAATTATCGATAATAAGTGTAATAGAAAACTCAAAGGTTTGAAAAAGTTAAAAATTTTTCTGTACATCACAAGAATTTTCCGTTAATTTAATTAAAACAAATGAAAATTTGATAGATTTATAATTCAAAAAACTTTGTAACGTCTCTCAACTTTTACTACTATACAAACGAGCTTTTTTTATCACCATTTTTCACCCAATACCTAAAGCTTCATTTTTTCTTTTTTGTTTTCACACAACATTTTTCTCCAAGTCTCTTCAATCTTCAAAACACACCTCAACTATATCTCGAATATACATATGTCAATATCTATATCTTGAATCTTAAAATTGAAATGTTACACACCTCAACTTTGATGAATTTAAATAACAATATTGTTTACCCATTATTGTATGCATTGTTCTTTTTATAATCAATATATATATATATATATATATATATATTTTTTTTTTTTTGTTTAAATATTTGTTGAAAAAAAATGAATGAACTATTTATTTTGAGGATTGTGTTAGCTAGAGTCTTGAATTTTCTTTGTTACCCATTTTAATGGTTCTTTTATAATTAGTATATTGTTATTTTATTTAAAATAATTTTAAGAAAAACAATAGATTGATCTGCTACACATAAATAATATATATATATAAAAGCCAATCATGTCTATAACTTTTTTTTAATTACATCACATGTGGATTAGGAGGGGTTGAACCCTAGACTTCGATCATTTGATGGTACAAGTTTTTATTTGAGTTGAGTTATGCATATGTTGTTAACCATTCTATCTTTTACATTTTGAGCTTTTTTTTTTTTATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTAGGTAGTACTTATTGTGTTATTCACTAGTTTTTTAAAAAAAGTAAATTACATGTTTAGTCCTCAAATTTTTAGTTTGTGT
CTAATATCTATTTGATTTGTGGATGGTATAAAGTATCCAATTGGTCTAAGAACATATTGTGGCTATTTGAGTCCCTAGAGTTTTGAAAGTGTATAACAAGTCTTTAAATTTCATTTTGTATCCAATATGTTCCTAAAATTTTAATTTTATGTCTACAACTTCATTGACTTATTTGTTATTTTCCAAAACTCACATAATCACAAAATTAAAATCTTTTTACTCCTTTTAGATATAAAACATAATTGTATGAAATACAAAATTGTTTAAAATTTTTAATGTTTTAAAAATTTATTAAAAACAAAATTAAAAATACAAAATCCTATCATATACAAAAATAAAAATTTAGAGGCTCATTAACCAAGTTTAAAAAGTTTAAATTTATGGAGTAAATTTTTTGTGTTGGTAATATGAAGACCATGTTAGTTTGATTTTTTTTTTTTCTTTTTTGTTGAGAAATTGGTTGTTGATACACTTTTGATTCATACCCTATCTTAAACAAATAATTATGCATCTCAATTCTTTTACTAAGATCCTTCTCAGCCTCTAAACTGTTAAATATTTATTTTAATTTTTAATCAAAAAGTTATTATTATAAGGTCGTACCTAATTTTGAAATAAATTACCTTTAGTTTTCTATCGAACAAAACTACTTTTTTAAAGTTATGGTTTATACTCCCATAAACTCTAAATAAACTTTTATTAATTTCTTAAATTTTAACTATCGAGTCATTGGAATTTTGATCTTGTGTTTTATAAGTTCATATATTAAATATTTTTGAAAATTTAGAAATTAATTAAAATAATTTAAAATTTTTAAAGATTCATTAAATACCTTTTTAAAGTTTATAGATGTAGTTATTAGCCTCTTTTTAAAGTTTAGAAATTAATAATTCAAGTGGAGGTAAACCAAGTATTGGAATCAATTAAACAAAAGAGGAATGTTTATGAACCAAGCTTATAATTTATCATGTTGAGGTAATTATTTTTATGTTTATCACCATTCAACATGAAACTATACGGACAATTTATGACAATAGAATAATGATAATGGCTAAAATGGTCATTTTAACCAATTTTGAAAGATTGAAATAAACATTTTGATACTTATTTTTAAGAAAAGAAAAGGTGTGAAGGGCAAAGATGTCATTTGAGGATGTCTTGACATCTTAAAAACATATGTGGCAAAATGAGGCATCCAAAATTAGATAGATATCTCTTGTATGAACATGGACATACAAATTAGGTAATCTAAAATTCGGAAAATCTCATATTTTTACGTAGCAAATGACCACCCATTTTAATGATATATGGAGTGGAGTAATTGAATATCTAATACAAGCATTATGTCTATTGAACTATACTTATTAATGCAAACAAACAATTCACATTTTAAAACACAAATATACATTTTATTGTTCGAAGAGAAAATCAAACAAATATAAATTTTAACGTGTATTGCTTAGATTATAGCAAATTTTAAAAATATTTATGTATTTGACTTTTGATAAATCTTTGCATAGATTTTTGGGTAAATCCTTGAAAATTTATATTCTTACCATGCGTTTGTCGAAGTTAAAAAAATTGATAGAGTTAGGTTCTAACACATGGTTCTCTAAAATAATAACAAAAATTTTGTTTTTTCTAACCAAATCAAATTCTAGATCATATTCTCTCAATCTAAATTTGATCCAAACGGTCAATACTTTTTGGTTTTCTCCAAATACAAACGAGTATGTCTAAATTTATATTGAAAAACTAAAGCAAACTAGTAGATCGATTTGAATTTGTTTGATTTTTTGATTTTTTTATTGCACTCCCAAACGTTATTAAAAAGTAGACGATAGAACAAAAAAAAATCGTATGATTATATTCAACAAGCCATTTATTATTATTATTATTTTAAACTAAAACTAACTAGTTCATAGAAGTTTGAATTATCGTTTAAAACTTTTTCCCTCATCATTTCTACTTATTTCAACAAACAATTAACTTTACTACAC
ATCCTAAGAGCAATATTATTAAAAAGAAAATCGTAGGTTGGAATCTTTATTTCATGACCAAATGAATATTAGGAAAAAAGGAAAAAAGAAAAAGAAAAAGAAAAAGAGAAAGAAAACAAACAATTTTACCCTTGGAATTCCAAAATTTGAAAGAAAGAAAAAAAAAAAAACCAAATATGGCAATGTTCGATGAATTATGTCGGTCATAATTGCCCAAAAAAAAAACCTCACTGTCACTACCCCATTTATCACATTATTACCATATCATTTTGACAAAAACGAGTCGAATTAAAGGAAACCAAGCGTTTCTTAATTATCAATATCAAATTTACAAATTTATTTCCCATCGATTCATTTCCATTTTTGATAACCTCTCAAAATTCTCTCTTCTATATAATTCAATCACGTTCCTCATGCATTGCTTCATCCCCACCAATTCATAAATTATGGCACCCATTTTCTCTCTTATTTTCTTAATCTCCTTCGCCGTCTCGGCCGCCGTCAGCCGTGACTATGGCTTCACTGTCGAACTCATCCACCGTGACTCCCCCAAGTCCCCTATGTACAACCCATCTGAGACTCACTACCACCGCCTCGCCAATGCCCTCCGCCGTTCCATCAGCCGTAACACAGCGGCATTGACAGACACAGCGGAGGCTCCTATTTACAACTATAGAGGCCAATACCTCATGGAAATATCCCTCGGAACGCCGCCGTTTTCGATTCTAGCTGTTGCTGACACAGGAAGCGACATCGTTTGGACCCAATGTGAACCATGCCCAAATTGCTACGAACAAAGCGCGCCAATGTTTAACCCGAGTAAATCGGCGACTTACAAAAATGTGGCGTGTTCCTCGCCGATTTGCTCGTTTGCTGGTGAGGAACGTTCTTGTTCCGCTCAGTCCGAGTGTTTGTACTCGATTACTTACGGCGATAGTTCCCACAGCCAAGGAGATCTTGCCGTTGATACCGTTACTATGGGGTCCACCTCTGGTCGCCAGGTGGCGTTTCCTCGTATTGCCATTGGTTGTGGTCATGACAATGCTGGAACATTTGATGCTAATGTCTCCGGCATTGTTGGGCTCGGGCAAGGTCCAGCTTCGCTTGTCTCACAATTGGGACCGGCTACTGGCGGAAAATTCTCTTACTGTTTAGCTCCAATCGGAAATGGCACTATTGAGTCCAGCAAACTTAACTTTGGCTCTAACGCGATCGTTTCCGGCTCTAAAGCTGTATCGACTCCTATTTATACTAGTGGTAAACAATTACTACTAAACTCAAAAGCTTAATTGATGTATATATTGGGTTTTTAATCTTTTGTCCATATGTTGAACAGATACCTACAAAACCTTCTACTCACTCAAGCTAGAAGCTGTGAGCGTAGGGGAGAGCAAATTTGATTTTCCAGTAGTCTCTTCAAGATTAGGCGGAGAAGCAAACATCATCATCGACTCTGGCACGACGCTTACTTTACTCCCAACGGATTTATACAACAACTTCGCCACTGAAATTTCCGGCTCGATAAACCTCCAGCGCACGAATGATCCGAATCAATACTTAGATGATTGCTACGCGACTACCACTGATGACTATGAAGCGCCACCCGTAACCATGCACTTTGAAGGCGCTGATGTACCCCTCCAACGAGAAAACGTGTTCATTAGAGTGTCGGATGACGCTGTTTGCTTGGCTTTTAAAGCAGCTGGGCAGGATGAGGACAATATTTTTATCTATGGCAACATTTCCCAGAACAACTTCTTGGTTGGTTATGATACTAAGAACATGTCTGTTTCTTTCAAGCCCGCGGATTGCATTTCCATGTGA

mRNA sequence

ATGGCACCCATTTTCTTTCTTATTTTCTTAATCTCCTCTGCCGTCGTCTCAGCCGCCACCACAGGCCGTGACTATGGCTTCACCGTCGAACTCATCCGCCGTGACTACCCCAAGTCCCCTATGTACAACCGATCGCAGACTCACTACCATCGCATCGCCGACGCCCTCCGCCGCTCCATCAGCCGTAACACGGCCGCGCTGACAGACACGGCGGAGGCCCCTATTTACAGCAACAGAGGCGAATACCTCATGGAATTATCCGTCGGAACGCCGCCGTTTTCGATTCTAGCTGTTGCTGACACAGGGAGCGACATCATTTGGACCCAATGCAAACCATGCAAAAATTGCTACCAACAAAACGCGCCAATGTTTACCCCGAGTAAATCGGCGACATACAAAAAACTGTCGTGCTCCTCTCCGATTTGCTTGTTTGCTGGTGAGAGTGGTTCATGTTCCTCTCAGTCTGAGTGCTTGTACTCGATTTCTTACGGCGATAGGTCCCACAGCCAAGGAGATTTTGCCGTTGATACGGTTACTATGGGGTCTACCTCTGGCCGCCCCGTGGCGTTTCCTCGTATGGCCATTGGTTGTGGTCATGACAACGCTGGCACTTTCGATGCTAATGTTTCTGGCATTGTCGGCCTTGGGCTAGGTCCGGCTTCCCTCGTCACGCAAATGGGACCCTCCGCTGGCGGAAAGTTCTCTTACTGTTTAACTCCGATTGGAAGCAATACTATCGAGTCCAGCAAACTTAACTTTGGCTCTAATGCCGTCGTCTCCGGCTCTAGCGCCGTCTCAACTCCTATATATATTAGTGATAGATTCAAAAGTTTCTACTGGCTCAAGTTAGAAGGCGTGAGCGTAGGGGAGAAAAAATTTGAATTTCCAGTCTCTTCAATATTAGGCGGAGAAGCAAACATGATCATTGACTCAGGCACGACGCTTACTTTCCTCCCCATGCATTTATACAACAACTTCTCCACCACAATTTCCAACTCGATAAACCTCCAGCGGACGAATGACCCAAATCAATTCTTAGATTACTGCTTCGCAACTACCACCGATGACTACAAAGCGCCGCCCGTCACGATGCACTTTGAAGGTGCCGATGTGCCCCTCCCCCAAGAAAACGTGTTCGTTAGGGTGTCGGACGACGTTGTTTGCTTGGCCTTCTGTCCCGGCCAGGACAACCACATTATGATCTATGGCAACATTGCCCAGAACAACTTCTTGGTTGGTTATGATATTAACACCATGTCTGTTTCTTTCAAGCCGGCAGATTGCATTGCCATCCGTGACTATGGCTTCACTGTCGAACTCATCCACCGTGACTCCCCCAAGTCCCCTATGTACAACCCATCTGAGACTCACTACCACCGCCTCGCCAATGCCCTCCGCCGTTCCATCAGCCGTAACACAGCGGCATTGACAGACACAGCGGAGGCTCCTATTTACAACTATAGAGGCCAATACCTCATGGAAATATCCCTCGGAACGCCGCCGTTTTCGATTCTAGCTGTTGCTGACACAGGAAGCGACATCGTTTGGACCCAATGTGAACCATGCCCAAATTGCTACGAACAAAGCGCGCCAATGTTTAACCCGAGTAAATCGGCGACTTACAAAAATGTGGCGTGTTCCTCGCCGATTTGCTCGTTTGCTGGTGAGGAACGTTCTTGTTCCGCTCAGTCCGAGTGTTTGTACTCGATTACTTACGGCGATAGTTCCCACAGCCAAGGAGATCTTGCCGTTGATACCGTTACTATGGGGTCCACCTCTGGTCGCCAGGTGGCGTTTCCTCGTATTGCCATTGGTTGTGGTCATGACAATGCTGGAACATTTGATGCTAATGTCTCCGGCATTGTTGGGCTCGGGCAAGGTCCAGCTTCGCTTGTCTCACAATTGGGACCGGCTACTGGCGGAAAATTCTCTTACTGTTTAGCTCCAATCGGAAATGGCACTATTGAGTCCAGCAAACTTAACTTTGGCTCTAACGCGAT
CGTTTCCGGCTCTAAAGCTGTATCGACTCCTATTTATACTAGTGATACCTACAAAACCTTCTACTCACTCAAGCTAGAAGCTGTGAGCGTAGGGGAGAGCAAATTTGATTTTCCAGTAGTCTCTTCAAGATTAGGCGGAGAAGCAAACATCATCATCGACTCTGGCACGACGCTTACTTTACTCCCAACGGATTTATACAACAACTTCGCCACTGAAATTTCCGGCTCGATAAACCTCCAGCGCACGAATGATCCGAATCAATACTTAGATGATTGCTACGCGACTACCACTGATGACTATGAAGCGCCACCCGTAACCATGCACTTTGAAGGCGCTGATGTACCCCTCCAACGAGAAAACGTGTTCATTAGAGTGTCGGATGACGCTGTTTGCTTGGCTTTTAAAGCAGCTGGGCAGGATGAGGACAATATTTTTATCTATGGCAACATTTCCCAGAACAACTTCTTGGTTGGTTATGATACTAAGAACATGTCTGTTTCTTTCAAGCCCGCGGATTGCATTTCCATGTGA

Coding sequence (CDS)

ATGGCACCCATTTTCTTTCTTATTTTCTTAATCTCCTCTGCCGTCGTCTCAGCCGCCACCACAGGCCGTGACTATGGCTTCACCGTCGAACTCATCCGCCGTGACTACCCCAAGTCCCCTATGTACAACCGATCGCAGACTCACTACCATCGCATCGCCGACGCCCTCCGCCGCTCCATCAGCCGTAACACGGCCGCGCTGACAGACACGGCGGAGGCCCCTATTTACAGCAACAGAGGCGAATACCTCATGGAATTATCCGTCGGAACGCCGCCGTTTTCGATTCTAGCTGTTGCTGACACAGGGAGCGACATCATTTGGACCCAATGCAAACCATGCAAAAATTGCTACCAACAAAACGCGCCAATGTTTACCCCGAGTAAATCGGCGACATACAAAAAACTGTCGTGCTCCTCTCCGATTTGCTTGTTTGCTGGTGAGAGTGGTTCATGTTCCTCTCAGTCTGAGTGCTTGTACTCGATTTCTTACGGCGATAGGTCCCACAGCCAAGGAGATTTTGCCGTTGATACGGTTACTATGGGGTCTACCTCTGGCCGCCCCGTGGCGTTTCCTCGTATGGCCATTGGTTGTGGTCATGACAACGCTGGCACTTTCGATGCTAATGTTTCTGGCATTGTCGGCCTTGGGCTAGGTCCGGCTTCCCTCGTCACGCAAATGGGACCCTCCGCTGGCGGAAAGTTCTCTTACTGTTTAACTCCGATTGGAAGCAATACTATCGAGTCCAGCAAACTTAACTTTGGCTCTAATGCCGTCGTCTCCGGCTCTAGCGCCGTCTCAACTCCTATATATATTAGTGATAGATTCAAAAGTTTCTACTGGCTCAAGTTAGAAGGCGTGAGCGTAGGGGAGAAAAAATTTGAATTTCCAGTCTCTTCAATATTAGGCGGAGAAGCAAACATGATCATTGACTCAGGCACGACGCTTACTTTCCTCCCCATGCATTTATACAACAACTTCTCCACCACAATTTCCAACTCGATAAACCTCCAGCGGACGAATGACCCAAATCAATTCTTAGATTACTGCTTCGCAACTACCACCGATGACTACAAAGCGCCGCCCGTCACGATGCACTTTGAAGGTGCCGATGTGCCCCTCCCCCAAGAAAACGTGTTCGTTAGGGTGTCGGACGACGTTGTTTGCTTGGCCTTCTGTCCCGGCCAGGACAACCACATTATGATCTATGGCAACATTGCCCAGAACAACTTCTTGGTTGGTTATGATATTAACACCATGTCTGTTTCTTTCAAGCCGGCAGATTGCATTGCCATCCGTGACTATGGCTTCACTGTCGAACTCATCCACCGTGACTCCCCCAAGTCCCCTATGTACAACCCATCTGAGACTCACTACCACCGCCTCGCCAATGCCCTCCGCCGTTCCATCAGCCGTAACACAGCGGCATTGACAGACACAGCGGAGGCTCCTATTTACAACTATAGAGGCCAATACCTCATGGAAATATCCCTCGGAACGCCGCCGTTTTCGATTCTAGCTGTTGCTGACACAGGAAGCGACATCGTTTGGACCCAATGTGAACCATGCCCAAATTGCTACGAACAAAGCGCGCCAATGTTTAACCCGAGTAAATCGGCGACTTACAAAAATGTGGCGTGTTCCTCGCCGATTTGCTCGTTTGCTGGTGAGGAACGTTCTTGTTCCGCTCAGTCCGAGTGTTTGTACTCGATTACTTACGGCGATAGTTCCCACAGCCAAGGAGATCTTGCCGTTGATACCGTTACTATGGGGTCCACCTCTGGTCGCCAGGTGGCGTTTCCTCGTATTGCCATTGGTTGTGGTCATGACAATGCTGGAACATTTGATGCTAATGTCTCCGGCATTGTTGGGCTCGGGCAAGGTCCAGCTTCGCTTGTCTCACAATTGGGACCGGCTACTGGCGGAAAATTCTCTTACTGTTTAGCTCCAATCGGAAATGGCACTATTGAGTCCAGCAAACTTAACTTTGGCTCTAACGCGAT
CGTTTCCGGCTCTAAAGCTGTATCGACTCCTATTTATACTAGTGATACCTACAAAACCTTCTACTCACTCAAGCTAGAAGCTGTGAGCGTAGGGGAGAGCAAATTTGATTTTCCAGTAGTCTCTTCAAGATTAGGCGGAGAAGCAAACATCATCATCGACTCTGGCACGACGCTTACTTTACTCCCAACGGATTTATACAACAACTTCGCCACTGAAATTTCCGGCTCGATAAACCTCCAGCGCACGAATGATCCGAATCAATACTTAGATGATTGCTACGCGACTACCACTGATGACTATGAAGCGCCACCCGTAACCATGCACTTTGAAGGCGCTGATGTACCCCTCCAACGAGAAAACGTGTTCATTAGAGTGTCGGATGACGCTGTTTGCTTGGCTTTTAAAGCAGCTGGGCAGGATGAGGACAATATTTTTATCTATGGCAACATTTCCCAGAACAACTTCTTGGTTGGTTATGATACTAAGAACATGTCTGTTTCTTTCAAGCCCGCGGATTGCATTTCCATGTGA

Protein sequence

MAPIFFLIFLISSAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISRNTAALTDTAEAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESGSCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRPVAFPRMAIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEFPVSSILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTMSVSFKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISRNTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGEERSCSAQSECLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNGTIESSKLNFGSNAIVSGSKAVSTPIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADCISM
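
The protein sequence above can be checked against the coding sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE`, `translate`, and `cds_start` are illustrative and not part of the database record, and only the first 54 nucleotides of the CDS are used.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Codons containing ambiguous bases (for example N) are reported as 'X'.
    Trailing bases that do not fill a whole codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 codons of the CDS listed above.
cds_start = "ATGGCACCCATTTTCTTTCTTATTTTCTTAATCTCCTCTGCCGTCGTCTCAGCC"
print(translate(cds_start))  # MAPIFFLIFLISSAVVSA
```

The printed peptide matches the first 18 residues of the protein sequence; passing the full CDS to the same function would be the way to compare the whole translation.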
Homology
BLAST of CaUC06G108310 vs. NCBI nr
Match: KAA3468560.1 (aspartic proteinase CDR1-like [Gossypium australe])

HSP 1 Score: 750.4 bits (1936), Expect = 1.7e-212
Identity = 413/852 (48.47%), Positives = 545/852 (63.97%), Query Frame = 0

Query: 1   MAPIFFLIFLISSAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSI 60
           MA I   I  +S+     A  G   GF+VEL  RD   SP YN  +T   R+ +ALRRS 
Sbjct: 12  MAAIVLAILALSTLCSIEAQKG---GFSVELFHRDSINSPFYNPLETTSDRVTNALRRSF 71

Query: 61  SR-----NTAALTDTAEAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKN 120
           +R       +  T  AE+ + ++ GEYLM++S+GTP F I+A+ADTGSD+IWTQCKPC  
Sbjct: 72  NRVHRFKTNSVPTTAAESDLTADSGEYLMKISLGTPRFDIVAIADTGSDLIWTQCKPCSQ 131

Query: 121 CYQQNAPMFTPSKSATYKKLSCSSPICLFAGESGSCSSQSECLYSISYGDRSHSQGDFAV 180
           C++Q+AP F PSKS+TY+K+SCS+  C+   E  SCS+   C Y++SYGD S S GD A 
Sbjct: 132 CFKQDAPFFDPSKSSTYRKISCSASQCIDL-ERTSCSTDHSCQYAVSYGDSSFSDGDLAA 191

Query: 181 DTVTMGSTSGRPVAFPRMAIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFS 240
           DT+T+ S +GRPV FP+  IGCG  N GTFD   SGI+GLG G  SL++Q+  S  GKFS
Sbjct: 192 DTLTLASITGRPVTFPKTVIGCGTSNGGTFDEKTSGIIGLGGGQVSLISQLRTSVAGKFS 251

Query: 241 YCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEF 300
           YCL PI S    SSK+NFGSNA+VSG   VSTP+ +     +FY+L LE ++VG K+ +F
Sbjct: 252 YCLLPI-SQAGNSSKINFGSNAIVSGPGVVSTPL-VKKSPDTFYFLTLEAITVGTKRIKF 311

Query: 301 PVSSILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTD 360
             SS+   E N+IIDSGTTLT LP   Y+   + +++ I+ +R   P + L  C+    D
Sbjct: 312 TGSSLGSEEGNIIIDSGTTLTLLPSDFYSEVESAMTSQISAKRIEGP-EGLSLCY-NAKD 371

Query: 361 DYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYD 420
           ++K P VT+HF  AD+ L   N F+RVSD  +C +F    D  + IYGN++Q +FL+GYD
Sbjct: 372 EFKIPDVTVHFTNADMKLKPLNTFIRVSDTAICFSFSSLDD--VAIYGNLSQMDFLIGYD 431

Query: 421 INTMSVSFKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR---- 480
               +VS               VELIHRDS KSP YNP ET + R+ NA RRS SR    
Sbjct: 432 TQKQTVS---------------VELIHRDSIKSPFYNPFETTFDRVTNAFRRSFSRVHRF 491

Query: 481 -NTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSA 540
              +  T  A   I    G+YLM ISLGTP FS++A+ADTGSD++WTQC PC  C++Q A
Sbjct: 492 YPNSITTTEANPDIIVNTGEYLMNISLGTPSFSVVALADTGSDLIWTQCSPCSQCFKQDA 551

Query: 541 PMFNPSKSATYKNVACSSPIC-SFAGEERSCSAQSECLYSITYGDSSHSQGDLAVDTVTM 600
           P+F+P+KS+TY+ ++CSS  C +  G   +    + C+YS+TYGD+S S+GD+A DT+T+
Sbjct: 552 PLFDPTKSSTYRKMSCSSNSCENIQGGTCASPTDTSCIYSVTYGDNSFSKGDIAYDTLTL 611

Query: 601 GSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAP 660
           GST+G+ VA P   IGCG++NAGTF    SGI+GLG G  SL++QLG    GKFSYCL P
Sbjct: 612 GSTTGQAVALPDTIIGCGNNNAGTFSGKASGIIGLGGGEISLINQLGSPINGKFSYCLLP 671

Query: 661 IGNGTIESSKLNFGSNAIVSGSKAVSTPIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSS 720
           +     +SSK+NFGSNAIVSG   VSTP+       TFY L L+A+SVG  + +F    S
Sbjct: 672 M-TQIGKSSKMNFGSNAIVSGPGTVSTPLIEKSP-NTFYFLTLKAISVGTQRIEFK--GS 731

Query: 721 RLG-GEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYE 780
            LG  E NI+IDSGTTLTL+P+D Y+   + +    N  R   P Q  + CY     ++E
Sbjct: 732 SLGTDEGNIVIDSGTTLTLIPSDFYSQLESAMDSQFNGIRAQGP-QGFNLCY-VAIHEFE 791

Query: 781 APPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDT 840
           AP VT+HF  ADV L+  N F++V D   C AF  A     NI IYGN++Q NFL+GYDT
Sbjct: 792 APEVTVHFANADVKLKTLNTFVKVDDTTACFAFSPA----QNIAIYGNLAQMNFLIGYDT 828
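
The Identity and Positives percentages in an HSP header are the corresponding match counts divided by the alignment length. The sketch below, with a hypothetical helper `hsp_percent` that is not part of any BLAST tool, recomputes the two values reported for the KAA3468560.1 hit from the counts in its header.

```python
def hsp_percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100 * matches / aligned_length, 2)


# Counts taken from the HSP header of the KAA3468560.1 hit.
print(hsp_percent(413, 852))  # identical positions
print(hsp_percent(545, 852))  # identical or conservatively substituted positions
```

Note that the denominator is the alignment length including gap columns (852 here), not the length of the query protein.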

BLAST of CaUC06G108310 vs. NCBI nr
Match: RDY01103.1 (Aspartic proteinase CDR1, partial [Mucuna pruriens])

HSP 1 Score: 750.4 bits (1936), Expect = 1.7e-212
Identity = 403/846 (47.64%), Positives = 540/846 (63.83%), Query Frame = 0

Query: 26  GFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISR------NTAALTDTAEAPIYSNR 85
           GF+VELI RD PKSP YN  +T + ++ +A  RS SR       + A   T ++ I SN+
Sbjct: 23  GFSVELIHRDSPKSPFYNPIETPFQQLNNAFHRSFSRVNHFYPKSKASQKTPQSVITSNQ 82

Query: 86  GEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSS 145
           GEYL++ S+GTPPF ++ +ADTGSD+IW+QCKPC  CY Q  P+F PSKS+TY+ +SC S
Sbjct: 83  GEYLVKYSIGTPPFEVMGIADTGSDLIWSQCKPCDQCYNQTTPLFDPSKSSTYEPVSCYS 142

Query: 146 PICLFAGES--GSCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRPVAFPRMAIGC 205
            +C   G++   S +    C Y++SYGD SHSQG  A DT T+ ST+G  VAF +++IGC
Sbjct: 143 RVCQLLGKTYCYSANGDPNCEYTVSYGDGSHSQGTLAFDTFTLDSTTGSSVAFTKISIGC 202

Query: 206 GHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNA 265
           G +NAGTFD+  SGIVGLG G  SL++Q+GPS   KFSYCL P+  +    SKLNFG NA
Sbjct: 203 GVNNAGTFDSKGSGIVGLGGGVVSLISQIGPSIDFKFSYCLVPLFESK-SISKLNFGENA 262

Query: 266 VVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEFPVSSILG-GEANMIIDSGTTLT 325
           VV+G   VSTPI I     +FY+LKLEG+SVG K+ EF   S       N+IIDSGTTLT
Sbjct: 263 VVAGPGTVSTPI-IPGPVDTFYYLKLEGMSVGSKRIEFICDSTSNVANGNIIIDSGTTLT 322

Query: 326 FLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDD-YKAPPVTMHFEGADVPLPQ 385
            LP   Y      ++  INL+R N  +Q L  C+ +  ++  + P +T HF GADV L  
Sbjct: 323 ILPEKFYTKLELEVAAHINLERVNSTDQILSLCYQSPPNNAIETPIITAHFSGADVVLNS 382

Query: 386 ENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTMSVSFKPADCIAIRDY- 445
            N F+ VS+ V C AF P   N   I+GN+AQ N+LVGYD+   +VSFKP DC  I    
Sbjct: 383 LNTFISVSNYVTCFAFAPMATN--SIFGNLAQMNYLVGYDLQRKTVSFKPTDCTKIGKLE 442

Query: 446 ------GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR--------NTAALTDTA 505
                 GF+V+LIHRDS KSP+YNPSE+ + +L +A +RS +R          +  T T 
Sbjct: 443 SEALKGGFSVQLIHRDSSKSPLYNPSESAFQQLKSAFQRSFNRVNHFYPKSKVSRKTKTP 502

Query: 506 EAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSAT 565
           ++ I    G+YL++ S+GTPPF ++ V DTGSD++W+QC+PC  CY Q+ P+F+ SKS+T
Sbjct: 503 QSVITWNHGEYLVKYSIGTPPFEVMGVFDTGSDLIWSQCKPCKECYNQTNPLFDYSKSST 562

Query: 566 YKNVACSSPICSFAGEERSCSAQSE--CLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVA 625
           Y+ + C S +C   G E +C + S+  C Y++ YGD SHS+G LA DT+T+ ST+   +A
Sbjct: 563 YEPIHCKSRVCKSLG-EANCYSHSDPTCEYTVIYGDGSHSRGFLAFDTLTLPSTTDSSIA 622

Query: 626 FPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNGTIESS 685
           FP+I  GCG +N G FD   SGIVG+G G  SL+SQ+GP+   KFSYCL P+ + +  +S
Sbjct: 623 FPKIFFGCGVNNGGIFDPKASGIVGVGGGAVSLISQIGPSIDFKFSYCLVPLFSESESTS 682

Query: 686 KLNFGSNAIVSGSKAVSTPIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANII 745
           KLNFG NA+V+G   VSTPI       TFY LKL+ +SVG  + +    S    G+ NII
Sbjct: 683 KLNFGENAVVAGPGTVSTPIIPGPV-NTFYYLKLKGMSVGSKRIELISDSKSNNGKGNII 742

Query: 746 IDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDD-YEAPPVTMHFE 805
           IDSGTTLT LP  LY    +E++  I L+R + P   L  CY + +++  EAP +T HF 
Sbjct: 743 IDSGTTLTFLPQKLYTKLESEVAAQIKLERVHSPEHVLSLCYKSPSNNAIEAPIITAHFS 802

Query: 806 GADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKP 844
           GADV L   N F+ VSD+  C AF    + +    I+GNI+Q N LVGYD +  +VSFKP
Sbjct: 803 GADVVLNSLNTFVSVSDNVTCFAFAPVMRSDS---IFGNIAQMNHLVGYDLQKKTVSFKP 859

BLAST of CaUC06G108310 vs. NCBI nr
Match: TKY49535.1 (Aspartic proteinase CDR1 [Spatholobus suberectus])

HSP 1 Score: 742.7 bits (1916), Expect = 3.6e-210
Identity = 409/863 (47.39%), Positives = 541/863 (62.69%), Query Frame = 0

Query: 26  GFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSIS------RNTAALTDTAEAPIYSNR 85
           GF+V+LI RD PKSP YN ++T + ++ +A  RS +      R +     T ++ I SN+
Sbjct: 33  GFSVQLIHRDSPKSPFYNPTETPFQQLNNAFHRSFNRVNYFYRKSKVSQKTPQSVITSNQ 92

Query: 86  GEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSS 145
           GEYL++ S+GTPPF ++ +ADTGSD+IW QCKPC  CY Q  P+F PSKS TY+ +SC S
Sbjct: 93  GEYLVQYSIGTPPFEVMGIADTGSDLIWLQCKPCDQCYNQTNPLFDPSKSVTYEPVSCYS 152

Query: 146 PICLFAGESGSCSSQS--ECLYSISYGDRSHSQGDFAVDTVTMGSTSGRPVAFPRMAIGC 205
            +C   G++  C S S   C Y+ SYGD SHSQG+ A DT+T+GST+G  VAFP++ IGC
Sbjct: 153 RVCQSVGQT-YCYSDSVPNCEYTASYGDGSHSQGNLAFDTLTLGSTTGSSVAFPKIPIGC 212

Query: 206 GHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNA 265
           G +NAGTFD+  SGIVGLG G  SL +Q+GPS   KFSYCL P+  +   +SKLNFG NA
Sbjct: 213 GVNNAGTFDSKGSGIVGLGGGVVSLTSQIGPSIDFKFSYCLVPLFESE-GTSKLNFGENA 272

Query: 266 VVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEFPVSSILG-GEANMIIDSGTTLT 325
           VV+GS  VSTPI I     +FY+LKLEG+SVG K+ EF   S     E N+IIDSGTTLT
Sbjct: 273 VVAGSGTVSTPI-IPSSIDTFYYLKLEGMSVGSKRIEFVGDSTSNDAEGNIIIDSGTTLT 332

Query: 326 FLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDD-YKAPPVTMHFEGADVPLPQ 385
           FLP  +Y    + ++  I+L+R N   + L  C+ +  ++  +AP +T+HF GADV L  
Sbjct: 333 FLPEKIYAKLESEVAAQISLERVNSTAEILSLCYKSPANNAIQAPLITVHFTGADVGLNS 392

Query: 386 ENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTMSVSFKPADCIAIRDY- 445
            N FV VSDDV C AF P       ++GN+AQ N LVGYD+   +V+F     +A     
Sbjct: 393 LNTFVSVSDDVTCFAFAPVASG--SLFGNLAQMNHLVGYDLLKKTVTFDSTHNMATYSNN 452

Query: 446 -------------------------GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSI 505
                                    GF+V+LIHRDSPKSP YNP+ET + +L NA  RS 
Sbjct: 453 LFVFSTLTLSTICFCGIPLTEALKGGFSVQLIHRDSPKSPFYNPAETPFQQLNNAFHRSF 512

Query: 506 SR------NTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCP 565
           +R       +    +T ++ I +  G+YL++ S+GTPPF ++ +ADTGSD+VWTQC+PC 
Sbjct: 513 NRANHFYPKSKVSQETPQSVITSNHGEYLVKYSIGTPPFEVMGIADTGSDLVWTQCKPCE 572

Query: 566 NCYEQSAPMFNPSKSATYKNVACSSPICSFAGEE--RSCSAQSECLYSITYGDSSHSQGD 625
            CY Q+ P+F+PSKS TY+ V+C S +C    +    S +    C Y+++YGD SHS+G+
Sbjct: 573 QCYNQTNPLFDPSKSVTYEPVSCYSSLCLSVRQSNCHSDTGDPNCEYTVSYGDGSHSRGN 632

Query: 626 LAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGG 685
           LA +T+T+GST+G  VA P+I IGCG +NAG FD+  SGIVGLG G  SL++QLG A   
Sbjct: 633 LAFETLTLGSTTGSSVAIPKIPIGCGVNNAGEFDSKGSGIVGLGGGALSLITQLGSAIDY 692

Query: 686 KFSYCLAPIGNGTIESSKLNFGSNAIVSGSKAVSTPIYTSDTYKTFYSLKLEAVSVGESK 745
           KFSYCL P+      +SKLNFG NA+V+G   VSTPI       TFY LKLE +SVG  +
Sbjct: 693 KFSYCLVPLFESK-STSKLNFGENAVVAGPGTVSTPIIPG-IVNTFYLLKLEGMSVGPKR 752

Query: 746 FDFPVVSSRLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYA 805
            +F   S+      N IIDSGTTLT LP   Y    +E++  INL+R   P+Q L  CY 
Sbjct: 753 IEFVGDSTSNDAVGNTIIDSGTTLTFLPKYFYRKLESEVAAQINLERVKSPDQSLSLCYK 812

Query: 806 TTTDD-YEAPPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQN 844
           +  ++  +AP +T HF GADV L   N F+ VSD+  C AF +    E    I+GNI+Q 
Sbjct: 813 SPPNNAIQAPLITAHFTGADVVLNSLNTFVGVSDNVTCFAFASL---ETEYSIFGNIAQT 872

BLAST of CaUC06G108310 vs. NCBI nr
Match: XP_038876324.1 (aspartic proteinase CDR1-like [Benincasa hispida])

HSP 1 Score: 726.5 bits (1874), Expect = 2.7e-205
Identity = 354/412 (85.92%), Positives = 380/412 (92.23%), Query Frame = 0

Query: 432 RDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISRNTAALTDTAEAPIYNYRGQY 491
           R++GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISRNTAA+TDTA APIYNYRGQY
Sbjct: 24  REFGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISRNTAAVTDTAVAPIYNYRGQY 83

Query: 492 LMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPIC 551
           LM+ISLGTPPFSI+AVADTGSD++WTQCEPCPNCYEQSAPMFNPSKS TYKNV CSSPIC
Sbjct: 84  LMKISLGTPPFSIIAVADTGSDVIWTQCEPCPNCYEQSAPMFNPSKSTTYKNVPCSSPIC 143

Query: 552 SFAGEERSCSAQSECLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNA 611
           S+AGE+ SCSA SECLYSI+YGD SHSQGD AVDTVTMGSTSG  V FP +AIGCGHDNA
Sbjct: 144 SYAGEDSSCSAHSECLYSISYGDRSHSQGDFAVDTVTMGSTSGSPVTFPHMAIGCGHDNA 203

Query: 612 GTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNGTIESSKLNFGSNAIVSGS 671
           GTFDA+VSGIVGLGQG ASLVSQ+GPATGGKFSYCLAPIGN + ESSKLNFGSNA VSGS
Sbjct: 204 GTFDASVSGIVGLGQGSASLVSQMGPATGGKFSYCLAPIGNSSAESSKLNFGSNADVSGS 263

Query: 672 KAVSTPIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLPTD 731
           +AVSTPIYTS  YKTFYSLKLEAVSVGE+KFDFP+VSSRLGGE NIIIDSGTTLT LP D
Sbjct: 264 EAVSTPIYTSVKYKTFYSLKLEAVSVGENKFDFPIVSSRLGGEGNIIIDSGTTLTFLPVD 323

Query: 732 LYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENVFIR 791
           LYNNFAT IS SINLQRT+DPNQ+LD C+ATTTDDYEAP VTMHFEGADVPL RENVFIR
Sbjct: 324 LYNNFATTISDSINLQRTDDPNQFLDYCFATTTDDYEAPSVTMHFEGADVPLNRENVFIR 383

Query: 792 VSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADCISM 844
           +SDD VCLAFKA+  D++ IFIYGNISQNNFLVGYD KNM VSFK ADC++M
Sbjct: 384 ISDDIVCLAFKASQDDQEMIFIYGNISQNNFLVGYDIKNMVVSFKQADCVAM 435

BLAST of CaUC06G108310 vs. NCBI nr
Match: KAF4377251.1 (hypothetical protein F8388_012352 [Cannabis sativa])

HSP 1 Score: 696.4 bits (1796), Expect = 3.0e-196
Identity = 407/915 (44.48%), Positives = 551/915 (60.22%), Query Frame = 0

Query: 4   IFFLIFLISSAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISRN 63
           I  L+ L+S +  S + +    GF+VE+I RD   SP+YN SQTH  R+A+A RRSI+R 
Sbjct: 11  IVLLLLLLSLSSYSHSYSSD--GFSVEIIHRDSAVSPLYNPSQTHSQRLANAFRRSITRA 70

Query: 64  TAAL------TDTAEAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCY 123
           +         T + E+ +Y++ GEYLM +S+GTPPF ILA+ADTGSD+ WTQC PCK CY
Sbjct: 71  STLYSSHQLSTSSVESTLYTDGGEYLMSISIGTPPFDILAIADTGSDLTWTQCSPCKKCY 130

Query: 124 QQNAPMFTPSKSATYKKLSCSSPICLFA-GESGSCSS-QSECLYSISYGDRSHSQGDFAV 183
           +Q AP+F P+ S TY+  +C S +C  A G   SCSS    C YS+SYGD+S S G+ A 
Sbjct: 131 KQVAPLFKPNSSKTYRDATCDSSVCKSATGAKTSCSSLDDSCQYSVSYGDQSFSNGNIAT 190

Query: 184 DTVTMGSTSGRPVAFPRMAIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFS 243
           D +T+ STSGRPV FP   IGC H++ GTFD   SGIVGLG G  SL +Q+  S GGKFS
Sbjct: 191 DVLTLSSTSGRPVTFPNFIIGCSHNSDGTFDERGSGIVGLGGGVDSLTSQLTSSIGGKFS 250

Query: 244 YCLTPI---GSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVG--- 303
           YCL P    G+ T  SS L+FGSNAVVSG+  VSTPI +     +FY+L LEG++VG   
Sbjct: 251 YCLVPFISGGNQTKNSSTLSFGSNAVVSGAGVVSTPI-VKGETDTFYYLTLEGITVGSLN 310

Query: 304 ---EKKF-EFPVSS---ILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSI-NLQRTND 363
               KKF  F  SS       + N+IIDSGTTLT +P   Y++F + +++ + N +R  D
Sbjct: 311 GKKNKKFINFRSSSSTPAAVSQGNIIIDSGTTLTLVPEEFYSDFESALASELKNEKRVED 370

Query: 364 PNQFLDYCFATTT-DDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIM 423
           P+  L  C+  ++  D+ +P +TM+F+GADV L Q N FV+VSD VVCL+F   Q   I 
Sbjct: 371 PSGTLSLCYQISSGKDFVSPSITMNFKGADVELSQLNTFVQVSDTVVCLSFVSAQG--IA 430

Query: 424 IYGNIAQNNFLVGYDINTMSVSFKPADCIAI------RDYGFTVELIHRDSPKSPMYNPS 483
           IYGN+AQ NFL    +  + ++F     +++       D    +ELIHRDSPKSP Y+ S
Sbjct: 431 IYGNLAQMNFLHNIIVWLLFIAFTTITTVSLISCKNNGDDIINLELIHRDSPKSPFYSSS 490

Query: 484 ETHYHRLANALRRSISR------------------NTAALTDTAEAPIYNYRGQYLMEIS 543
           +TH+ RL+ AL RS  R                   T   T   ++ ++  RG+YL+ IS
Sbjct: 491 QTHWQRLSMALERSTHRTNHLILTKKKNKNNISTTTTTTTTSAGQSELFPSRGEYLINIS 550

Query: 544 LGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGE 603
           +GTPPF ILA+ADTGSD++WTQC PCP+C+ Q  P+F P  S+T+  + C S  C +  +
Sbjct: 551 IGTPPFPILAIADTGSDLIWTQCHPCPHCFTQKGPLFRPESSSTFHLLPCKSEQCMYLDK 610

Query: 604 ERSCSAQSE-------CLYSITYGDSSHSQGDLAVDTVTM------------------GS 663
           + +    S+       C Y+ +YGDSS++ G LA++T+T                    S
Sbjct: 611 QSTLCNISDSSPPSPPCRYTYSYGDSSYTNGTLALETLTFSSSSSSSSSSSSSSSSSSSS 670

Query: 664 TSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAP-- 723
           +S    +FP    GCG  N G F    SGI+GLG G  SL+SQ+G +  GKFSYCL P  
Sbjct: 671 SSSSSSSFPNRIFGCGFRNGGDFSGLESGIIGLGAGKLSLISQMGSSINGKFSYCLVPES 730

Query: 724 IGNGTIESSKLNFGSNAIVSGSKAVSTPIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSS 783
           +   +  SSKL FG +++V G +  STP+  +   + +Y L L+AVSVG  KFD    SS
Sbjct: 731 LSTPSSSSSKLYFGGSSLVPGPEVSSTPLLINTNVQNYYYLALKAVSVGSMKFDL-ASSS 790

Query: 784 RLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSI-NLQRTNDPNQYLDDCYATTTDDY- 841
           + GG  N+IIDSGT LT LPT LY    T +  SI NL+   DPN Y+  CY T +DD  
Sbjct: 791 KKGG--NMIIDSGTMLTYLPTKLYKALETIMIKSIKNLEFGKDPNGYMSLCYKTKSDDIM 850

BLAST of CaUC06G108310 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 414.8 bits (1065), Expect = 2.3e-114
Identity = 208/414 (50.24%), Positives = 282/414 (68.12%), Query Frame = 0

Query: 435 GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR----NTAALTDTAEAPIYNYRGQ 494
           GFT +LIHRDSPKSP YNP ET   RL NA+ RS++R         T   +  + +  G+
Sbjct: 30  GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEKDNTPQPQIDLTSNSGE 89

Query: 495 YLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPI 554
           YLM +S+GTPPF I+A+ADTGSD++WTQC PC +CY Q  P+F+P  S+TYK+V+CSS  
Sbjct: 90  YLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQ 149

Query: 555 CSFAGEERSCSA-QSECLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHD 614
           C+    + SCS   + C YS++YGD+S+++G++AVDT+T+GS+  R +    I IGCGH+
Sbjct: 150 CTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHN 209

Query: 615 NAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNGTIESSKLNFGSNAIVS 674
           NAGTF+   SGIVGLG GP SL+ QLG +  GKFSYCL P+ +   ++SK+NFG+NAIVS
Sbjct: 210 NAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIVS 269

Query: 675 GSKAVSTPIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLP 734
           GS  VSTP+    + +TFY L L+++SVG  +  +    S    E NIIIDSGTTLTLLP
Sbjct: 270 GSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSE-SSEGNIIIDSGTTLTLLP 329

Query: 735 TDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENVF 794
           T+ Y+     ++ SI+ ++  DP   L  CY + T D + P +TMHF+GADV L   N F
Sbjct: 330 TEFYSELEDAVASSIDAEKKQDPQSGLSLCY-SATGDLKVPVITMHFDGADVKLDSSNAF 389

Query: 795 IRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADCISM 844
           ++VS+D VC AF+ +     +  IYGN++Q NFLVGYDT + +VSFKP DC  M
Sbjct: 390 VQVSEDLVCFAFRGS----PSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAKM 437

BLAST of CaUC06G108310 vs. ExPASy Swiss-Prot
Match: Q3EBM5 (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 334.3 bits (856), Expect = 3.9e-90
Identity = 188/445 (42.25%), Positives = 266/445 (59.78%), Query Frame = 0

Query: 5   FFLIFLISSAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISRNT 64
           FFL F      V+ +++G    F+VELI RD P SP+YN   T   R+  A  RS+SR+ 
Sbjct: 9   FFLFF-----SVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSR 68

Query: 65  AALTDTAEAPIYSN----RGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQN 124
                 ++  + S      GE+ M +++GTPP  + A+ADTGSD+ W QCKPC+ CY++N
Sbjct: 69  RFNHQLSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKEN 128

Query: 125 APMFTPSKSATYKKLSCSSPIC--LFAGESGSCSSQSECLYSISYGDRSHSQGDFAVDTV 184
            P+F   KS+TYK   C S  C  L + E G   S + C Y  SYGD+S S+GD A +TV
Sbjct: 129 GPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETV 188

Query: 185 TMGSTSGRPVAFPRMAIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCL 244
           ++ S SG PV+FP    GCG++N GTFD   SGI+GLG G  SL++Q+G S   KFSYCL
Sbjct: 189 SIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCL 248

Query: 245 TPIGSNTIESSKLNFGSNAVVSG----SSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFE 304
           +   + T  +S +N G+N++ S     S  VSTP+ +     ++Y+L LE +SVG+KK  
Sbjct: 249 SHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPL-VDKEPLTYYYLTLEAISVGKKKIP 308

Query: 305 FPVSS--------ILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSI-NLQRTNDPNQF 364
           +  SS        +     N+IIDSGTTLT L    ++ FS+ +  S+   +R +DP   
Sbjct: 309 YTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGL 368

Query: 365 LDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNI 424
           L +CF + + +   P +T+HF GADV L   N FV++S+D+VCL+  P     + IYGN 
Sbjct: 369 LSHCFKSGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVCLSMVP--TTEVAIYGNF 428

Query: 425 AQNNFLVGYDINTMSVSFKPADCIA 431
           AQ +FLVGYD+ T +VSF+  DC A
Sbjct: 429 AQMDFLVGYDLETRTVSFQHMDCSA 445
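The percentages in an HSP summary line are the ratio of matching alignment columns to the alignment length, so they can be recomputed from the raw counts. Below is a minimal Python sketch of that arithmetic. The helper name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool or of this database.

```python
import re

# Hypothetical helper (not part of any BLAST tool): recompute the
# percentages printed in an HSP summary line from its raw counts.
# The pattern accepts both "Positives" and the "Postives" spelling
# that some result pages emit.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(summary: str) -> tuple[float, float]:
    """Return (identity %, positives %), each rounded to two decimals."""
    m = HSP_RE.search(summary)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )


# Counts taken from the Q3EBM5 hit: 188 identical and 266 positive
# columns out of an alignment length of 445.
line = "Identity = 188/445 (42.25%), Positives = 266/445 (59.78%), Query Frame = 0"
print(hsp_percentages(line))
```

The denominator is the alignment length including gap columns, not the length of either sequence, which is why hits of similar identity can have very different scores.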

BLAST of CaUC06G108310 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 260.0 bits (663), Expect = 9.3e-68
Identity = 160/416 (38.46%), Positives = 221/416 (53.12%), Query Frame = 0

Query: 435 GFTVELIHRDSPKSPMYNPSETHYHRLANALRRS---ISRNTAALTDTA--EAPIYNYRG 494
           GF + L H DS K      + T +  L  A+ R    + R  A L   +  E  +Y   G
Sbjct: 40  GFQIMLEHVDSGK------NLTKFQLLERAIERGSRRLQRLEAMLNGPSGVETSVYAGDG 99

Query: 495 QYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSP 554
           +YLM +S+GTP     A+ DTGSD++WTQC+PC  C+ QS P+FNP  S+++  + CSS 
Sbjct: 100 EYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQ 159

Query: 555 ICSFAGEERSCSAQSECLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHD 614
           +C  A    +CS  + C Y+  YGD S +QG +  +T+T GS     V+ P I  GCG +
Sbjct: 160 LCQ-ALSSPTCS-NNFCQYTYGYGDGSETQGSMGTETLTFGS-----VSIPNITFGCGEN 219

Query: 615 NAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNGTIESSKLNFGSNAIVS 674
           N G    N +G+VG+G+GP SL SQL      KFSYC+ PIG+ T  +  L   +N++ +
Sbjct: 220 NQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTPSNLLLGSLANSVTA 279

Query: 675 GSKAVSTPIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRL---GGEANIIIDSGTTLT 734
           GS   +T +  S    TFY + L  +SVG ++      +  L    G   IIIDSGTTLT
Sbjct: 280 GSP--NTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLT 339

Query: 735 LLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTD--DYEAPPVTMHFEGADVPLQ 794
               + Y +   E    INL   N  +   D C+ T +D  + + P   MHF+G D+ L 
Sbjct: 340 YFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDLELP 399

Query: 795 RENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADC 841
            EN FI  S+  +CLA    G     + I+GNI Q N LV YDT N  VSF  A C
Sbjct: 400 SENYFISPSNGLICLAM---GSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CaUC06G108310 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 247.3 bits (630), Expect = 6.2e-64
Identity = 144/395 (36.46%), Positives = 219/395 (55.44%), Query Frame = 0

Query: 44  RSQTHYHRIADALRRSISRN---TAALTDTA--EAPIYSNRGEYLMELSVGTPPFSILAV 103
           ++ T Y  I  A++R   R     A L  ++  E P+Y+  GEYLM +++GTP  S  A+
Sbjct: 53  KNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAI 112

Query: 104 ADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESGSCSSQSECL 163
            DTGSD+IWTQC+PC  C+ Q  P+F P  S+++  L C S  C     S +C++ +EC 
Sbjct: 113 MDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDL-PSETCNN-NECQ 172

Query: 164 YSISYGDRSHSQGDFAVDTVTMGSTSGRPVAFPRMAIGCGHDNAGTFDANVSGIVGLGLG 223
           Y+  YGD S +QG  A +T T  ++S      P +A GCG DN G    N +G++G+G G
Sbjct: 173 YTYGYGDGSTTQGYMATETFTFETSS-----VPNIAFGCGEDNQGFGQGNGAGLIGMGWG 232

Query: 224 PASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSF 283
           P SL +Q+G    G+FSYC+T  GS++  +  L   ++ V  GS   ST +  S    ++
Sbjct: 233 PLSLPSQLGV---GQFSYCMTSYGSSSPSTLALGSAASGVPEGSP--STTLIHSSLNPTY 292

Query: 284 YWLKLEGVSVGEKKFEFPVSSIL---GGEANMIIDSGTTLTFLPMHLYNNFSTTISNSIN 343
           Y++ L+G++VG      P S+      G   MIIDSGTTLT+LP   YN  +   ++ IN
Sbjct: 293 YYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQIN 352

Query: 344 LQRTNDPNQFLDYCFATTTD--DYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCP 403
           L   ++ +  L  CF   +D    + P ++M F+G  + L ++N+ +  ++ V+CLA   
Sbjct: 353 LPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQNILISPAEGVICLAMGS 412

Query: 404 GQDNHIMIYGNIAQNNFLVGYDINTMSVSFKPADC 429
                I I+GNI Q    V YD+  ++VSF P  C
Sbjct: 413 SSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CaUC06G108310 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 211.8 bits (538), Expect = 2.9e-53
Identity = 138/361 (38.23%), Positives = 188/361 (52.08%), Query Frame = 0

Query: 80  GEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSS 139
           GEY   L VGTP   +  V DTGSDI+W QC PC+ CY Q+ P+F P KS TY  + CSS
Sbjct: 140 GEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSS 199

Query: 140 PICLFAGESGSCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRPVAFPRMAIGCGH 199
           P C     +G  + +  CLY +SYGD S + GDF+ +T+T      + V     A+GCGH
Sbjct: 200 PHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGV-----ALGCGH 259

Query: 200 DNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVV 259
           DN G F    +G++GLG G  S   Q G     KFSYCL    +++  SS        VV
Sbjct: 260 DNEGLF-VGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS--------VV 319

Query: 260 SGSSAVS-----TPIYISDRFKSFYWLKLEGVSVGEKKFEFPVSSILG----GEANMIID 319
            G++AVS     TP+  + +  +FY++ L G+SVG  +     +S+      G   +IID
Sbjct: 320 FGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIID 379

Query: 320 SGTTLTFLPMHLYNNFSTTIS-NSINLQRTNDPNQFLDYCF-ATTTDDYKAPPVTMHFEG 379
           SGT++T L    Y          +  L+R  D + F D CF  +  ++ K P V +HF G
Sbjct: 380 SGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLF-DTCFDLSNMNEVKVPTVVLHFRG 439

Query: 380 ADVPLPQENVFVRV-SDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTMSVSFKPAD 429
           ADV LP  N  + V ++   C AF  G    + I GNI Q  F V YD+ +  V F P  
Sbjct: 440 ADVSLPATNYLIPVDTNGKFCFAFA-GTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGG 484

BLAST of CaUC06G108310 vs. ExPASy TrEMBL
Match: A0A371HE86 (Aspartic proteinase CDR1 (Fragment) OS=Mucuna pruriens OX=157652 GN=CDR1 PE=3 SV=1)

HSP 1 Score: 750.4 bits (1936), Expect = 8.4e-213
Identity = 403/846 (47.64%), Positives = 540/846 (63.83%), Query Frame = 0

Query: 26  GFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISR------NTAALTDTAEAPIYSNR 85
           GF+VELI RD PKSP YN  +T + ++ +A  RS SR       + A   T ++ I SN+
Sbjct: 23  GFSVELIHRDSPKSPFYNPIETPFQQLNNAFHRSFSRVNHFYPKSKASQKTPQSVITSNQ 82

Query: 86  GEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSS 145
           GEYL++ S+GTPPF ++ +ADTGSD+IW+QCKPC  CY Q  P+F PSKS+TY+ +SC S
Sbjct: 83  GEYLVKYSIGTPPFEVMGIADTGSDLIWSQCKPCDQCYNQTTPLFDPSKSSTYEPVSCYS 142

Query: 146 PICLFAGES--GSCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRPVAFPRMAIGC 205
            +C   G++   S +    C Y++SYGD SHSQG  A DT T+ ST+G  VAF +++IGC
Sbjct: 143 RVCQLLGKTYCYSANGDPNCEYTVSYGDGSHSQGTLAFDTFTLDSTTGSSVAFTKISIGC 202

Query: 206 GHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNA 265
           G +NAGTFD+  SGIVGLG G  SL++Q+GPS   KFSYCL P+  +    SKLNFG NA
Sbjct: 203 GVNNAGTFDSKGSGIVGLGGGVVSLISQIGPSIDFKFSYCLVPLFESK-SISKLNFGENA 262

Query: 266 VVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEFPVSSILG-GEANMIIDSGTTLT 325
           VV+G   VSTPI I     +FY+LKLEG+SVG K+ EF   S       N+IIDSGTTLT
Sbjct: 263 VVAGPGTVSTPI-IPGPVDTFYYLKLEGMSVGSKRIEFICDSTSNVANGNIIIDSGTTLT 322

Query: 326 FLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDD-YKAPPVTMHFEGADVPLPQ 385
            LP   Y      ++  INL+R N  +Q L  C+ +  ++  + P +T HF GADV L  
Sbjct: 323 ILPEKFYTKLELEVAAHINLERVNSTDQILSLCYQSPPNNAIETPIITAHFSGADVVLNS 382

Query: 386 ENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTMSVSFKPADCIAIRDY- 445
            N F+ VS+ V C AF P   N   I+GN+AQ N+LVGYD+   +VSFKP DC  I    
Sbjct: 383 LNTFISVSNYVTCFAFAPMATN--SIFGNLAQMNYLVGYDLQRKTVSFKPTDCTKIGKLE 442

Query: 446 ------GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR--------NTAALTDTA 505
                 GF+V+LIHRDS KSP+YNPSE+ + +L +A +RS +R          +  T T 
Sbjct: 443 SEALKGGFSVQLIHRDSSKSPLYNPSESAFQQLKSAFQRSFNRVNHFYPKSKVSRKTKTP 502

Query: 506 EAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSAT 565
           ++ I    G+YL++ S+GTPPF ++ V DTGSD++W+QC+PC  CY Q+ P+F+ SKS+T
Sbjct: 503 QSVITWNHGEYLVKYSIGTPPFEVMGVFDTGSDLIWSQCKPCKECYNQTNPLFDYSKSST 562

Query: 566 YKNVACSSPICSFAGEERSCSAQSE--CLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVA 625
           Y+ + C S +C   G E +C + S+  C Y++ YGD SHS+G LA DT+T+ ST+   +A
Sbjct: 563 YEPIHCKSRVCKSLG-EANCYSHSDPTCEYTVIYGDGSHSRGFLAFDTLTLPSTTDSSIA 622

Query: 626 FPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNGTIESS 685
           FP+I  GCG +N G FD   SGIVG+G G  SL+SQ+GP+   KFSYCL P+ + +  +S
Sbjct: 623 FPKIFFGCGVNNGGIFDPKASGIVGVGGGAVSLISQIGPSIDFKFSYCLVPLFSESESTS 682

Query: 686 KLNFGSNAIVSGSKAVSTPIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANII 745
           KLNFG NA+V+G   VSTPI       TFY LKL+ +SVG  + +    S    G+ NII
Sbjct: 683 KLNFGENAVVAGPGTVSTPIIPGPV-NTFYYLKLKGMSVGSKRIELISDSKSNNGKGNII 742

Query: 746 IDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDD-YEAPPVTMHFE 805
           IDSGTTLT LP  LY    +E++  I L+R + P   L  CY + +++  EAP +T HF 
Sbjct: 743 IDSGTTLTFLPQKLYTKLESEVAAQIKLERVHSPEHVLSLCYKSPSNNAIEAPIITAHFS 802

Query: 806 GADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKP 844
           GADV L   N F+ VSD+  C AF    + +    I+GNI+Q N LVGYD +  +VSFKP
Sbjct: 803 GADVVLNSLNTFVSVSDNVTCFAFAPVMRSDS---IFGNIAQMNHLVGYDLQKKTVSFKP 859

BLAST of CaUC06G108310 vs. ExPASy TrEMBL
Match: A0A5B6VH54 (Aspartic proteinase CDR1-like OS=Gossypium australe OX=47621 GN=EPI10_014435 PE=3 SV=1)

HSP 1 Score: 750.4 bits (1936), Expect = 8.4e-213
Identity = 413/852 (48.47%), Positives = 545/852 (63.97%), Query Frame = 0

Query: 1   MAPIFFLIFLISSAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSI 60
           MA I   I  +S+     A  G   GF+VEL  RD   SP YN  +T   R+ +ALRRS 
Sbjct: 12  MAAIVLAILALSTLCSIEAQKG---GFSVELFHRDSINSPFYNPLETTSDRVTNALRRSF 71

Query: 61  SR-----NTAALTDTAEAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKN 120
           +R       +  T  AE+ + ++ GEYLM++S+GTP F I+A+ADTGSD+IWTQCKPC  
Sbjct: 72  NRVHRFKTNSVPTTAAESDLTADSGEYLMKISLGTPRFDIVAIADTGSDLIWTQCKPCSQ 131

Query: 121 CYQQNAPMFTPSKSATYKKLSCSSPICLFAGESGSCSSQSECLYSISYGDRSHSQGDFAV 180
           C++Q+AP F PSKS+TY+K+SCS+  C+   E  SCS+   C Y++SYGD S S GD A 
Sbjct: 132 CFKQDAPFFDPSKSSTYRKISCSASQCIDL-ERTSCSTDHSCQYAVSYGDSSFSDGDLAA 191

Query: 181 DTVTMGSTSGRPVAFPRMAIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFS 240
           DT+T+ S +GRPV FP+  IGCG  N GTFD   SGI+GLG G  SL++Q+  S  GKFS
Sbjct: 192 DTLTLASITGRPVTFPKTVIGCGTSNGGTFDEKTSGIIGLGGGQVSLISQLRTSVAGKFS 251

Query: 241 YCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEF 300
           YCL PI S    SSK+NFGSNA+VSG   VSTP+ +     +FY+L LE ++VG K+ +F
Sbjct: 252 YCLLPI-SQAGNSSKINFGSNAIVSGPGVVSTPL-VKKSPDTFYFLTLEAITVGTKRIKF 311

Query: 301 PVSSILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTD 360
             SS+   E N+IIDSGTTLT LP   Y+   + +++ I+ +R   P + L  C+    D
Sbjct: 312 TGSSLGSEEGNIIIDSGTTLTLLPSDFYSEVESAMTSQISAKRIEGP-EGLSLCY-NAKD 371

Query: 361 DYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYD 420
           ++K P VT+HF  AD+ L   N F+RVSD  +C +F    D  + IYGN++Q +FL+GYD
Sbjct: 372 EFKIPDVTVHFTNADMKLKPLNTFIRVSDTAICFSFSSLDD--VAIYGNLSQMDFLIGYD 431

Query: 421 INTMSVSFKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR---- 480
               +VS               VELIHRDS KSP YNP ET + R+ NA RRS SR    
Sbjct: 432 TQKQTVS---------------VELIHRDSIKSPFYNPFETTFDRVTNAFRRSFSRVHRF 491

Query: 481 -NTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSA 540
              +  T  A   I    G+YLM ISLGTP FS++A+ADTGSD++WTQC PC  C++Q A
Sbjct: 492 YPNSITTTEANPDIIVNTGEYLMNISLGTPSFSVVALADTGSDLIWTQCSPCSQCFKQDA 551

Query: 541 PMFNPSKSATYKNVACSSPIC-SFAGEERSCSAQSECLYSITYGDSSHSQGDLAVDTVTM 600
           P+F+P+KS+TY+ ++CSS  C +  G   +    + C+YS+TYGD+S S+GD+A DT+T+
Sbjct: 552 PLFDPTKSSTYRKMSCSSNSCENIQGGTCASPTDTSCIYSVTYGDNSFSKGDIAYDTLTL 611

Query: 601 GSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAP 660
           GST+G+ VA P   IGCG++NAGTF    SGI+GLG G  SL++QLG    GKFSYCL P
Sbjct: 612 GSTTGQAVALPDTIIGCGNNNAGTFSGKASGIIGLGGGEISLINQLGSPINGKFSYCLLP 671

Query: 661 IGNGTIESSKLNFGSNAIVSGSKAVSTPIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSS 720
           +     +SSK+NFGSNAIVSG   VSTP+       TFY L L+A+SVG  + +F    S
Sbjct: 672 M-TQIGKSSKMNFGSNAIVSGPGTVSTPLIEKSP-NTFYFLTLKAISVGTQRIEFK--GS 731

Query: 721 RLG-GEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYE 780
            LG  E NI+IDSGTTLTL+P+D Y+   + +    N  R   P Q  + CY     ++E
Sbjct: 732 SLGTDEGNIVIDSGTTLTLIPSDFYSQLESAMDSQFNGIRAQGP-QGFNLCY-VAIHEFE 791

Query: 781 APPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDT 840
           AP VT+HF  ADV L+  N F++V D   C AF  A     NI IYGN++Q NFL+GYDT
Sbjct: 792 APEVTVHFANADVKLKTLNTFVKVDDTTACFAFSPA----QNIAIYGNLAQMNFLIGYDT 828

BLAST of CaUC06G108310 vs. ExPASy TrEMBL
Match: A0A7J6G2M2 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=F8388_012352 PE=3 SV=1)

HSP 1 Score: 696.4 bits (1796), Expect = 1.4e-196
Identity = 407/915 (44.48%), Positives = 551/915 (60.22%), Query Frame = 0

Query: 4   IFFLIFLISSAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISRN 63
           I  L+ L+S +  S + +    GF+VE+I RD   SP+YN SQTH  R+A+A RRSI+R 
Sbjct: 11  IVLLLLLLSLSSYSHSYSSD--GFSVEIIHRDSAVSPLYNPSQTHSQRLANAFRRSITRA 70

Query: 64  TAAL------TDTAEAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCY 123
           +         T + E+ +Y++ GEYLM +S+GTPPF ILA+ADTGSD+ WTQC PCK CY
Sbjct: 71  STLYSSHQLSTSSVESTLYTDGGEYLMSISIGTPPFDILAIADTGSDLTWTQCSPCKKCY 130

Query: 124 QQNAPMFTPSKSATYKKLSCSSPICLFA-GESGSCSS-QSECLYSISYGDRSHSQGDFAV 183
           +Q AP+F P+ S TY+  +C S +C  A G   SCSS    C YS+SYGD+S S G+ A 
Sbjct: 131 KQVAPLFKPNSSKTYRDATCDSSVCKSATGAKTSCSSLDDSCQYSVSYGDQSFSNGNIAT 190

Query: 184 DTVTMGSTSGRPVAFPRMAIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFS 243
           D +T+ STSGRPV FP   IGC H++ GTFD   SGIVGLG G  SL +Q+  S GGKFS
Sbjct: 191 DVLTLSSTSGRPVTFPNFIIGCSHNSDGTFDERGSGIVGLGGGVDSLTSQLTSSIGGKFS 250

Query: 244 YCLTPI---GSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVG--- 303
           YCL P    G+ T  SS L+FGSNAVVSG+  VSTPI +     +FY+L LEG++VG   
Sbjct: 251 YCLVPFISGGNQTKNSSTLSFGSNAVVSGAGVVSTPI-VKGETDTFYYLTLEGITVGSLN 310

Query: 304 ---EKKF-EFPVSS---ILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSI-NLQRTND 363
               KKF  F  SS       + N+IIDSGTTLT +P   Y++F + +++ + N +R  D
Sbjct: 311 GKKNKKFINFRSSSSTPAAVSQGNIIIDSGTTLTLVPEEFYSDFESALASELKNEKRVED 370

Query: 364 PNQFLDYCFATTT-DDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIM 423
           P+  L  C+  ++  D+ +P +TM+F+GADV L Q N FV+VSD VVCL+F   Q   I 
Sbjct: 371 PSGTLSLCYQISSGKDFVSPSITMNFKGADVELSQLNTFVQVSDTVVCLSFVSAQG--IA 430

Query: 424 IYGNIAQNNFLVGYDINTMSVSFKPADCIAI------RDYGFTVELIHRDSPKSPMYNPS 483
           IYGN+AQ NFL    +  + ++F     +++       D    +ELIHRDSPKSP Y+ S
Sbjct: 431 IYGNLAQMNFLHNIIVWLLFIAFTTITTVSLISCKNNGDDIINLELIHRDSPKSPFYSSS 490

Query: 484 ETHYHRLANALRRSISR------------------NTAALTDTAEAPIYNYRGQYLMEIS 543
           +TH+ RL+ AL RS  R                   T   T   ++ ++  RG+YL+ IS
Sbjct: 491 QTHWQRLSMALERSTHRTNHLILTKKKNKNNISTTTTTTTTSAGQSELFPSRGEYLINIS 550

Query: 544 LGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGE 603
           +GTPPF ILA+ADTGSD++WTQC PCP+C+ Q  P+F P  S+T+  + C S  C +  +
Sbjct: 551 IGTPPFPILAIADTGSDLIWTQCHPCPHCFTQKGPLFRPESSSTFHLLPCKSEQCMYLDK 610

Query: 604 ERSCSAQSE-------CLYSITYGDSSHSQGDLAVDTVTM------------------GS 663
           + +    S+       C Y+ +YGDSS++ G LA++T+T                    S
Sbjct: 611 QSTLCNISDSSPPSPPCRYTYSYGDSSYTNGTLALETLTFSSSSSSSSSSSSSSSSSSSS 670

Query: 664 TSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAP-- 723
           +S    +FP    GCG  N G F    SGI+GLG G  SL+SQ+G +  GKFSYCL P  
Sbjct: 671 SSSSSSSFPNRIFGCGFRNGGDFSGLESGIIGLGAGKLSLISQMGSSINGKFSYCLVPES 730

Query: 724 IGNGTIESSKLNFGSNAIVSGSKAVSTPIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSS 783
           +   +  SSKL FG +++V G +  STP+  +   + +Y L L+AVSVG  KFD    SS
Sbjct: 731 LSTPSSSSSKLYFGGSSLVPGPEVSSTPLLINTNVQNYYYLALKAVSVGSMKFDL-ASSS 790

Query: 784 RLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSI-NLQRTNDPNQYLDDCYATTTDDY- 841
           + GG  N+IIDSGT LT LPT LY    T +  SI NL+   DPN Y+  CY T +DD  
Sbjct: 791 KKGG--NMIIDSGTMLTYLPTKLYKALETIMIKSIKNLEFGKDPNGYMSLCYKTKSDDIM 850

BLAST of CaUC06G108310 vs. ExPASy TrEMBL
Match: A0A3Q7HJU2 (Uncharacterized protein OS=Solanum lycopersicum OX=4081 PE=3 SV=1)

HSP 1 Score: 684.1 bits (1764), Expect = 7.4e-193
Identity = 377/865 (43.58%), Positives = 517/865 (59.77%), Query Frame = 0

Query: 5   FFLIFLISSAVVSAATTGRDY----GFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSI 64
           F   F+  + +VS   T  D+    GFT+ LI RD P SP+YN S T  +R+ +A  RS 
Sbjct: 10  FLSTFVFLTLLVSCRNTISDHRVENGFTLHLIHRDSPLSPLYNSSITQSNRLINAFHRSF 69

Query: 65  SR------NTAALTDTAEAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCK 124
           SR      ++    +T  + I    GEY+M+LS+GTPP  I+A+ADTGSD+ WTQC+PC 
Sbjct: 70  SRASFFKKSSFVTPNTIRSDISPIPGEYIMKLSIGTPPVEIVAIADTGSDLTWTQCEPCL 129

Query: 125 NCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESGSCSSQSECLYSISYGDRSHSQGDFA 184
           NC++Q++P+F   KS++YK   C +  C   G S SC   + C Y +SYGD+S++ GD A
Sbjct: 130 NCFEQSSPLFDSKKSSSYKTAGCDTKECTSIG-SSSCVKGNVCEYQMSYGDQSYTIGDLA 189

Query: 185 VDTVTMGST-SGRPVAFPRMAIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGK 244
            D  T  ST S   VA P +A GCGH N GTF+ + SGI+GLG G  S++ Q+     GK
Sbjct: 190 FDIFTFPSTNSSENVAIPNVAFGCGHHNGGTFNNHTSGIIGLGGGNVSIINQLDKEINGK 249

Query: 245 FSYCLTPIGSNTIES---SKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGE 304
           FSYCL  I   +  S   S +NFGS+A VSG   VSTP+ I     +FY+L LEGVSVG 
Sbjct: 250 FSYCLISIALGSPISNVTSHINFGSSASVSGPDVVSTPL-IKKEPSTFYYLNLEGVSVGN 309

Query: 305 KKFEFPVSSILGG--EANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDPNQFLDY 364
           +  +F  S +  G  E N+IIDSGTTLT LP   Y++  +T+ +SI+  R  DP+     
Sbjct: 310 RTLKFKSSKVSSGGEEGNIIIDSGTTLTLLPNEFYSSLESTLVDSISATRKEDPSGTFRL 369

Query: 365 CFATTTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQN 424
           C+ +      AP +T HF  AD+ L   + F ++ + +VCL   P  +  I I+GN+AQ 
Sbjct: 370 CYESKNGTIDAPTITTHFTNADLELSPSSTFAQIEEGLVCLTIVPADE--IAIFGNLAQG 429

Query: 425 NFLVGYDINTMSVSFKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRS 484
           NFL+GYD+    +SFKPADC       FT++LIHRDSP SP +NPS T Y RL +AL RS
Sbjct: 430 NFLIGYDLVANKISFKPADCT-----NFTLDLIHRDSPLSPFHNPSNTPYERLQHALYRS 489

Query: 485 ISRNT---AALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNC 544
            SR +       +  E+ +    G+YLM+IS+GTPP   L +ADTGSD+ WTQC+PC NC
Sbjct: 490 FSRASFLKKKYVNPIESTLIPSGGEYLMKISIGTPPIDTLVIADTGSDLTWTQCKPCVNC 549

Query: 545 YEQSAPMFNPSKSATYKNVACSSPICSFAGEERSCSAQSECLYSITYGDSSHSQGDLAVD 604
           ++Q  P+FNP KS++YK + C++ +C     + S    S C Y ++YGD SH+ GDL+++
Sbjct: 550 FKQLTPIFNPKKSSSYKTIGCNNKLC-----QGSLCNNSRCNYEVSYGDQSHTMGDLSIE 609

Query: 605 TVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSY 664
           T T  STS + V+ P I  GCGHDN GTF    SGI+GLG G  S+V+Q+     GKFSY
Sbjct: 610 TFTFSSTSSQNVSIPNIVFGCGHDNGGTFPNVTSGIIGLGGGNVSIVNQMHQQIKGKFSY 669

Query: 665 CLAPIG---NGTIESSKLNFGSNAIVSGSKAVSTPIYTSDTYKTFYSLKLEAVSVGESKF 724
           CL P+    + +  +S +NFG+ A VSG   VSTP+   +   TFY L LE +S+G    
Sbjct: 670 CLIPLESLLDNSNATSHINFGNCATVSGPNVVSTPLIKKEP-STFYYLNLERISIGNRTV 729

Query: 725 D---FPVVSSRLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDC 784
           +   FPVV        NIIIDSGTTLT +P   Y N  + +  SIN  + +DP+     C
Sbjct: 730 EFNSFPVVVGGDDDPGNIIIDSGTTLTYVPDAFYLNLESMLILSINATKKDDPSSSFRLC 789

Query: 785 YATTTD-DYEAPPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNIS 844
           Y +  +   + P +  HF  AD+ L   N+F +V +  VCL     G   + I I+GN++
Sbjct: 790 YESNKNGTIDVPKIVAHFTNADLELSTSNIFTKVVEGIVCLTIVPGG---NQISIFGNLA 849

BLAST of CaUC06G108310 vs. ExPASy TrEMBL
Match: F6HJ51 (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_02s0087g00230 PE=3 SV=1)

HSP 1 Score: 677.9 bits (1748), Expect = 5.3e-191
Identity = 365/873 (41.81%), Positives = 520/873 (59.56%), Query Frame = 0

Query: 4   IFFLI----FLISSAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRS 63
           IFF +    FL     V+ A  G   GF+V+LI RD P SP ++ S+T   R+ DA RRS
Sbjct: 8   IFFNVVVVGFLFQLLEVALARGG---GFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRRS 67

Query: 64  IS-----RNTAALTDTAEAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCK 123
           +S     R TA  +D  ++ I  + GEYLM L +GTPP  ++A+ DTGSD+ WTQC+PC 
Sbjct: 68  VSRVGRFRPTAMTSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCT 127

Query: 124 NCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESGSCSSQSECLYSISYGDRSHSQGDFA 183
           +CY+Q  P+F P  S+TY+  SC +  CL  G+  SCS + +C +  SY D S + G+ A
Sbjct: 128 HCYKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLA 187

Query: 184 VDTVTMGSTSGRPVAFPRMAIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKF 243
            +T+T+ ST+G+PV+FP  A GCGH + G FD + SGIVGLG G  SL++Q+  +  G F
Sbjct: 188 SETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLF 247

Query: 244 SYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFE 303
           SYCL P+ +++  SS++NFG++  VSG   VSTP+ +     +FY+L LEG+SVG+K+  
Sbjct: 248 SYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPL-VQKSPDTFYYLTLEGISVGKKRLP 307

Query: 304 FPVSS--ILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFAT 363
           +   S      E N+I+DSGTT TFLP   Y+    +++NSI  +R  DPN     C+  
Sbjct: 308 YKGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCY-N 367

Query: 364 TTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLV 423
           TT +  AP +T HF+ A+V L   N F+R+ +D+VC    P  D  I + GN+AQ NFLV
Sbjct: 368 TTAEINAPIITAHFKDANVELQPLNTFMRMQEDLVCFTVAPTSD--IGVLGNLAQVNFLV 427

Query: 424 GYDINTMSVS-------------------FKPADCIAIRDYGFTVELIHRDSPKSPMYNP 483
           G+D+    +S                   F   +       GF+V+LIHRDSP SP ++P
Sbjct: 428 GFDLRKKRISSMEVFGVKIFFNVVVVGFLFHLLEVGLASGGGFSVDLIHRDSPHSPFFDP 487

Query: 484 SETHYHRLANALRRSIS-----RNTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVA 543
           S+T   RL +A  RS S     R +A  +D  ++ +    G+Y+M +S+GTPP  ++A+ 
Sbjct: 488 SKTRTERLTDAFHRSASRVGRFRQSAMTSDGIQSRLVPSAGEYIMNLSIGTPPVPVIAIV 547

Query: 544 DTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGEERSCSAQSECLY 603
           DTGSD+ WTQC PC +CY+Q  P F+P  S+TY++ +C +  C   G +RSC    +C +
Sbjct: 548 DTGSDLTWTQCRPCTHCYKQVVPFFDPKNSSTYRDSSCGTSFCLALGNDRSCRNGKKCTF 607

Query: 604 SITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGP 663
             +Y D S + G+LAV+T+T+ ST+G+ V+FP  A GC H + G FD + SGIVGLG   
Sbjct: 608 MYSYADGSFTGGNLAVETLTVASTAGKPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGVAE 667

Query: 664 ASLVSQLGPATGGKFSYCLAPIGNGTIESSKLNFGSNAIVSGSKAVSTPIYTSDTYKTFY 723
            S++SQL     G+FSYCL P+   +  SS++NFG + IVSG+  VSTP+        +Y
Sbjct: 668 LSMISQLKSTINGRFSYCLLPVFTDSSMSSRINFGRSGIVSGAGTVSTPLVMKGPDTYYY 727

Query: 724 SLKLEAVSVGESKFDFPVVSSRLG-GEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQ 783
            + LE  SVG+ +  +   S +    E NII+DSGTT T LP + Y      ++ SI  +
Sbjct: 728 LITLEGFSVGKKRLSYKGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGK 787

Query: 784 RTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQD 841
           R  DPN     CY TT D  +AP +T HF+ A+V LQ  N F+R+ +D VC         
Sbjct: 788 RVRDPNGISSLCYNTTVDQIDAPIITAHFKDANVELQPWNTFLRMQEDLVCFTVLPT--- 847

BLAST of CaUC06G108310 vs. TAIR 10
Match: AT2G28220.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 435.3 bits (1118), Expect = 1.1e-121
Identity = 308/859 (35.86%), Positives = 431/859 (50.17%), Query Frame = 0

Query: 1   MAPIFFLIFL-ISSAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRS 60
           +A    ++FL I +  +   T    +GFT++LI+R         RS +   R++    + 
Sbjct: 18  LATTMIVLFLQIITCFLFTTTVSSPHGFTIDLIQR---------RSNSSSFRLSKNQLQG 77

Query: 61  ISRNTAALTDTAEAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQ 120
            S     L D            YLM+L VGTPPF I A  DTGSD+IWTQC PC +CY Q
Sbjct: 78  ASPYADTLFD---------YNIYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQ 137

Query: 121 NAPMFTPSKSATYKKLSCSSPICLFAGESGSCSSQSECLYSISYGDRSHSQGDFAVDTVT 180
             P+F PSKS+T+ +  C                   C Y I Y D ++S+G  A +TVT
Sbjct: 138 FDPIFDPSKSSTFNEQRCHG---------------KSCHYEIIYEDNTYSKGILATETVT 197

Query: 181 MGSTSGRPVAFPRMAIGCG-----HDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKF 240
           + STSG P       IGCG      DN+G F ++ SGIVGL +GP SL++QM     G  
Sbjct: 198 IHSTSGEPFVMAETTIGCGLHNTDLDNSG-FASSSSGIVGLNMGPRSLISQMDLPYPGLI 257

Query: 241 SYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFE 300
           SYC +  G     +SK+NFG+NA+V+G   V+  ++I  +   FY+L L+ VSV + + E
Sbjct: 258 SYCFSGQG-----TSKINFGTNAIVAGDGTVAADMFIK-KDNPFYYLNLDAVSVEDNRIE 317

Query: 301 FPVSSILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDP--NQFLDYCFAT 360
              +     + N++IDSG+T+T+ P+   N     +   +   R  DP  N  L Y F+ 
Sbjct: 318 TLGTPFHAEDGNIVIDSGSTVTYFPVSYCNLVRKAVEQVVTAVRVPDPSGNDMLCY-FSE 377

Query: 361 TTDDYKAPPVTMHFE-GADVPLPQENVFVRV-SDDVVCLAFCPGQDNHIMIYGNIAQNNF 420
           T D +  P +TMHF  GAD+ L + N+++   S  + CLA          I+GN AQNNF
Sbjct: 378 TIDIF--PVITMHFSGGADLVLDKYNMYMESNSGGLFCLAIICNSPTQEAIFGNRAQNNF 437

Query: 421 LVGYDINTMSVSFKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSIS 480
           LVGYD +++                    L+   SP                        
Sbjct: 438 LVGYDSSSL--------------------LLQGASP------------------------ 497

Query: 481 RNTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSA 540
                  DT    +Y+Y   YLM++ +GTPPF I+A  DTGSDI+WTQC PCPNCY Q A
Sbjct: 498 -----YADT----LYDY-SIYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFA 557

Query: 541 PMFNPSKSATYKNVACSSPICSFAGEERSCSAQSECLYSITYGDSSHSQGDLAVDTVTMG 600
           P+F+PSKS+T++   C+                + C Y I Y D ++S+G LA +TVT+ 
Sbjct: 558 PIFDPSKSSTFREQRCNG---------------NSCHYEIIYADKTYSKGILATETVTIP 617

Query: 601 STSGRQVAFPRIAIGCGHDNAGT----FDANVSGIVGLGQGPASLVSQLGPATGGKFSYC 660
           STSG         IGCG DN       F ++ SGIVGL  GP SL+SQ+     G  SYC
Sbjct: 618 STSGEPFVMAETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYC 677

Query: 661 LAPIGNGTIESSKLNFGSNAIVSGSKAVSTPIYTSDTYKTFYSLKLEAVSVGESKFDFPV 720
            +  G GT   SK+NFG+NAIV+G   V+  ++       FY L L+AVSV E      +
Sbjct: 678 FS--GQGT---SKINFGTNAIVAGDGTVAADMFIKKD-NPFYYLNLDAVSV-EDNLIATL 737

Query: 721 VSSRLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDD 780
            +     + NI IDSGTTLT  P    N     +   +   +  D       CY + T D
Sbjct: 738 GTPFHAEDGNIFIDSGTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNLLCYYSDTID 754

Query: 781 YEAPPVTMHFE-GADVPLQRENVFIR-VSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLV 840
              P +TMHF  GAD+ L + N+++  ++    CLA      D     ++GN +QNNFLV
Sbjct: 798 I-FPVITMHFSGGADLVLDKYNMYLETITGGIFCLAIGC--NDPSMPAVFGNRAQNNFLV 754

Query: 841 GYDTKNMSVSFKPADCISM 844
           GYD  +  +SF P +C ++
Sbjct: 858 GYDPSSNVISFSPTNCSAL 754

BLAST of CaUC06G108310 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 414.8 bits (1065), Expect = 1.6e-115
Identity = 208/414 (50.24%), Positives = 282/414 (68.12%), Query Frame = 0

Query: 435 GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR----NTAALTDTAEAPIYNYRGQ 494
           GFT +LIHRDSPKSP YNP ET   RL NA+ RS++R         T   +  + +  G+
Sbjct: 30  GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEKDNTPQPQIDLTSNSGE 89

Query: 495 YLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPI 554
           YLM +S+GTPPF I+A+ADTGSD++WTQC PC +CY Q  P+F+P  S+TYK+V+CSS  
Sbjct: 90  YLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQ 149

Query: 555 CSFAGEERSCSA-QSECLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHD 614
           C+    + SCS   + C YS++YGD+S+++G++AVDT+T+GS+  R +    I IGCGH+
Sbjct: 150 CTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHN 209

Query: 615 NAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNGTIESSKLNFGSNAIVS 674
           NAGTF+   SGIVGLG GP SL+ QLG +  GKFSYCL P+ +   ++SK+NFG+NAIVS
Sbjct: 210 NAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIVS 269

Query: 675 GSKAVSTPIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLP 734
           GS  VSTP+    + +TFY L L+++SVG  +  +    S    E NIIIDSGTTLTLLP
Sbjct: 270 GSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSE-SSEGNIIIDSGTTLTLLP 329

Query: 735 TDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENVF 794
           T+ Y+     ++ SI+ ++  DP   L  CY + T D + P +TMHF+GADV L   N F
Sbjct: 330 TEFYSELEDAVASSIDAEKKQDPQSGLSLCY-SATGDLKVPVITMHFDGADVKLDSSNAF 389

Query: 795 IRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADCISM 844
           ++VS+D VC AF+ +     +  IYGN++Q NFLVGYDT + +VSFKP DC  M
Sbjct: 390 VQVSEDLVCFAFRGS----PSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAKM 437

BLAST of CaUC06G108310 vs. TAIR 10
Match: AT1G64830.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 399.8 bits (1026), Expect = 5.3e-111
Identity = 216/436 (49.54%), Positives = 293/436 (67.20%), Query Frame = 0

Query: 1   MAPIFFLIFLISSAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSI 60
           MA + F   L+S  ++S        GFT++LI RD PKSP YN ++T   R+ +A+RRS 
Sbjct: 1   MASLIFAT-LLSLLLLSNVNAYPKDGFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRS- 60

Query: 61  SRNTAALTDTAEAP------IYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCK 120
           +R+T   ++   +P      I SNRGEYLM +S+GTPP  ILA+ADTGSD+IWTQC PC+
Sbjct: 61  ARSTLQFSNDDASPNSPQSFITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCE 120

Query: 121 NCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESGSCSS-QSECLYSISYGDRSHSQGDF 180
           +CYQQ +P+F P +S+TY+K+SCSS  C  A E  SCS+ ++ C Y+I+YGD S+++GD 
Sbjct: 121 DCYQQTSPLFDPKESSTYRKVSCSSSQCR-ALEDASCSTDENTCSYTITYGDNSYTKGDV 180

Query: 181 AVDTVTMGSTSGRPVAFPRMAIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGK 240
           AVDTVTMGS+  RPV+   M IGCGH+N GTFD   SGI+GLG G  SLV+Q+  S  GK
Sbjct: 181 AVDTVTMGSSGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGK 240

Query: 241 FSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKF 300
           FSYCL P  S T  +SK+NFG+N +VSG   VST +   D   ++Y+L LE +SVG KK 
Sbjct: 241 FSYCLVPFTSETGLTSKINFGTNGIVSGDGVVSTSMVKKDP-ATYYFLNLEAISVGSKKI 300

Query: 301 EFPVSSILG-GEANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFAT 360
           +F  S+I G GE N++IDSGTTLT LP + Y    + ++++I  +R  DP+  L  C+  
Sbjct: 301 QF-TSTIFGTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYRD 360

Query: 361 TTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLV 420
           ++  +K P +T+HF+G DV L   N FV VS+DV C AF   +   + I+GN+AQ NFLV
Sbjct: 361 SS-SFKVPDITVHFKGGDVKLGNLNTFVAVSEDVSCFAFAANE--QLTIFGNLAQMNFLV 420

Query: 421 GYDINTMSVSFKPADC 429
           GYD  + +VSFK  DC
Sbjct: 421 GYDTVSGTVSFKKTDC 428

BLAST of CaUC06G108310 vs. TAIR 10
Match: AT1G31450.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 340.9 bits (873), Expect = 2.9e-93
Identity = 187/435 (42.99%), Positives = 267/435 (61.38%), Query Frame = 0

Query: 419 MSVSFKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISRNTAALTD 478
           +++SF  A   +      TVELIHRDSP SP+YNP  T   RL  A  RSISR+    T 
Sbjct: 12  LAISFFFASNSSANRENLTVELIHRDSPHSPLYNPHHTVSDRLNAAFLRSISRSRRFTTK 71

Query: 479 T-AEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSK 538
           T  ++ + +  G+Y M IS+GTPP  + A+ADTGSD+ W QC+PC  CY+Q++P+F+  K
Sbjct: 72  TDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKK 131

Query: 539 SATYKNVACSSPICSFAGE-ERSCSAQSE-CLYSITYGDSSHSQGDLAVDTVTMGSTSGR 598
           S+TYK  +C S  C    E E  C    + C Y  +YGD+S ++GD+A +T+++ S+SG 
Sbjct: 132 SSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGS 191

Query: 599 QVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNGTI 658
            V+FP    GCG++N GTF+   SGI+GLG GP SLVSQLG + G KFSYCL+     T 
Sbjct: 192 SVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATTN 251

Query: 659 ESSKLNFGSNAIVSG----SKAVSTPIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRL 718
            +S +N G+N+I S     S  ++TP+   D  +T+Y L LEAV+VG++K  +      L
Sbjct: 252 GTSVINLGTNSIPSNPSKDSATLTTPLIQKDP-ETYYFLTLEAVTVGKTKLPYTGGGYGL 311

Query: 719 GGEA-----NIIIDSGTTLTLLPTDLYNNFATEISGSI-NLQRTNDPNQYLDDCYATTTD 778
            G++     NIIIDSGTTLTLL +  Y++F T +  S+   +R +DP   L  C+ +   
Sbjct: 312 NGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFKSGDK 371

Query: 779 DYEAPPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVG 838
           +   P +TMHF  ADV L   N F+++++D VCL+     +    + IYGN+ Q +FLVG
Sbjct: 372 EIGLPAITMHFTNADVKLSPINAFVKLNEDTVCLSMIPTTE----VAIYGNMVQMDFLVG 431

Query: 839 YDTKNMSVSFKPADC 841
           YD +  +VSF+  DC
Sbjct: 432 YDLETKTVSFQRMDC 441

BLAST of CaUC06G108310 vs. TAIR 10
Match: AT2G35615.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 334.3 bits (856), Expect = 2.8e-91
Identity = 188/445 (42.25%), Positives = 266/445 (59.78%), Query Frame = 0

Query: 5   FFLIFLISSAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISRNT 64
           FFL F      V+ +++G    F+VELI RD P SP+YN   T   R+  A  RS+SR+ 
Sbjct: 9   FFLFF-----SVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSR 68

Query: 65  AALTDTAEAPIYSN----RGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQN 124
                 ++  + S      GE+ M +++GTPP  + A+ADTGSD+ W QCKPC+ CY++N
Sbjct: 69  RFNHQLSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKEN 128

Query: 125 APMFTPSKSATYKKLSCSSPIC--LFAGESGSCSSQSECLYSISYGDRSHSQGDFAVDTV 184
            P+F   KS+TYK   C S  C  L + E G   S + C Y  SYGD+S S+GD A +TV
Sbjct: 129 GPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETV 188

Query: 185 TMGSTSGRPVAFPRMAIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCL 244
           ++ S SG PV+FP    GCG++N GTFD   SGI+GLG G  SL++Q+G S   KFSYCL
Sbjct: 189 SIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCL 248

Query: 245 TPIGSNTIESSKLNFGSNAVVSG----SSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFE 304
           +   + T  +S +N G+N++ S     S  VSTP+ +     ++Y+L LE +SVG+KK  
Sbjct: 249 SHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPL-VDKEPLTYYYLTLEAISVGKKKIP 308

Query: 305 FPVSS--------ILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSI-NLQRTNDPNQF 364
           +  SS        +     N+IIDSGTTLT L    ++ FS+ +  S+   +R +DP   
Sbjct: 309 YTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGL 368

Query: 365 LDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNI 424
           L +CF + + +   P +T+HF GADV L   N FV++S+D+VCL+  P     + IYGN 
Sbjct: 369 LSHCFKSGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVCLSMVP--TTEVAIYGNF 428

Query: 425 AQNNFLVGYDINTMSVSFKPADCIA 431
           AQ +FLVGYD+ T +VSF+  DC A
Sbjct: 429 AQMDFLVGYDLETRTVSFQHMDCSA 445

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
KAA3468560.1 | 1.7e-212 | 48.47 | aspartic proteinase CDR1-like [Gossypium australe]
RDY01103.1 | 1.7e-212 | 47.64 | Aspartic proteinase CDR1, partial [Mucuna pruriens]
TKY49535.1 | 3.6e-210 | 47.39 | Aspartic proteinase CDR1 [Spatholobus suberectus]
XP_038876324.1 | 2.7e-205 | 85.92 | aspartic proteinase CDR1-like [Benincasa hispida]
KAF4377251.1 | 3.0e-196 | 44.48 | hypothetical protein F8388_012352 [Cannabis sativa]

Match Name | E-value | Identity | Description
Q6XBF8 | 2.3e-114 | 50.24 | Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1
Q3EBM5 | 3.9e-90 | 42.25 | Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561...
Q766C3 | 9.3e-68 | 38.46 | Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q766C2 | 6.2e-64 | 36.46 | Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Q9LNJ3 | 2.9e-53 | 38.23 | Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...

Match Name | E-value | Identity | Description
A0A371HE86 | 8.4e-213 | 47.64 | Aspartic proteinase CDR1 (Fragment) OS=Mucuna pruriens OX=157652 GN=CDR1 PE=3 SV...
A0A5B6VH54 | 8.4e-213 | 48.47 | Aspartic proteinase CDR1-like OS=Gossypium australe OX=47621 GN=EPI10_014435 PE=...
A0A7J6G2M2 | 1.4e-196 | 44.48 | Uncharacterized protein OS=Cannabis sativa OX=3483 GN=F8388_012352 PE=3 SV=1
A0A3Q7HJU2 | 7.4e-193 | 43.58 | Uncharacterized protein OS=Solanum lycopersicum OX=4081 PE=3 SV=1
F6HJ51 | 5.3e-191 | 41.81 | Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_02s0087g00230 PE=3 SV=...

Match Name | E-value | Identity | Description
AT2G28220.1 | 1.1e-121 | 35.86 | Eukaryotic aspartyl protease family protein
AT5G33340.1 | 1.6e-115 | 50.24 | Eukaryotic aspartyl protease family protein
AT1G64830.1 | 5.3e-111 | 49.54 | Eukaryotic aspartyl protease family protein
AT1G31450.1 | 2.9e-93 | 42.99 | Eukaryotic aspartyl protease family protein
AT2G35615.1 | 2.8e-91 | 42.25 | Eukaryotic aspartyl protease family protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR032861 | Xylanase inhibitor, N-terminal | PFAM | PF14543 | TAXi_N | coord: 82..255; e-value: 2.1E-53; score: 181.2
IPR032861 | Xylanase inhibitor, N-terminal | PFAM | PF14543 | TAXi_N | coord: 491..664; e-value: 1.9E-55; score: 187.9
IPR032799 | Xylanase inhibitor, C-terminal | PFAM | PF14541 | TAXi_C | coord: 687..836; e-value: 1.9E-25; score: 89.5
IPR032799 | Xylanase inhibitor, C-terminal | PFAM | PF14541 | TAXi_C | coord: 279..424; e-value: 3.9E-26; score: 91.8
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 253..433; e-value: 1.9E-43; score: 150.2
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 662..843; e-value: 3.0E-42; score: 146.3
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 468..661; e-value: 7.2E-55; score: 188.0
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 59..252; e-value: 4.0E-52; score: 179.0
IPR021109 | Aspartic peptidase domain superfamily | SUPERFAMILY | 50630 | Acid proteases | coord: 75..430
IPR021109 | Aspartic peptidase domain superfamily | SUPERFAMILY | 50630 | Acid proteases | coord: 485..841
None | No IPR available | PANTHER | PTHR47967:SF66 | ASPARTIC PROTEINASE CDR1-RELATED | coord: 7..428
None | No IPR available | PANTHER | PTHR47967:SF66 | ASPARTIC PROTEINASE CDR1-RELATED | coord: 434..840
None | No IPR available | PANTHER | PTHR47967 | OS07G0603500 PROTEIN-RELATED | coord: 7..428
None | No IPR available | PANTHER | PTHR47967 | OS07G0603500 PROTEIN-RELATED | coord: 434..840
IPR001969 | Aspartic peptidase, active site | PROSITE | PS00141 | ASP_PROTEASE | coord: 717..728
IPR033121 | Peptidase family A1 domain | PROSITE | PS51767 | PEPTIDASE_A1 | coord: 491..836; score: 45.012131
IPR033121 | Peptidase family A1 domain | PROSITE | PS51767 | PEPTIDASE_A1 | coord: 82..424; score: 41.89922
IPR034161 | Pepsin-like domain, plant | CDD | cd05476 | pepsin_A_like_plant | coord: 81..428; e-value: 3.27701E-89; score: 281.074
IPR034161 | Pepsin-like domain, plant | CDD | cd05476 | pepsin_A_like_plant | coord: 490..840; e-value: 5.50885E-83; score: 264.895

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CaUC06G108310.1 | CaUC06G108310.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006508 | proteolysis
molecular_function | GO:0004190 | aspartic-type endopeptidase activity