Cucsa.084600 (gene) Cucumber (Gy14) v1

Name: Cucsa.084600
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: Xyloglucan-specific endoglucanase inhibitor protein 2
Location: scaffold00862 : 310128 .. 311413 (-)
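
The location string above can be split into scaffold, coordinates, and strand to get the feature span. The following is a minimal sketch; the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Parse a location such as 'scaffold00862 : 310128 .. 311413 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "scaffold": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("scaffold00862 : 310128 .. 311413 (-)")
print(loc["scaffold"], loc["strand"], loc["length"])  # length is 1286 under the inclusive assumption
```

The "(-)" strand marker indicates the feature lies on the reverse strand of the scaffold, so a sequence extracted from the scaffold itself would need to be reverse-complemented before comparison with the sequences listed here.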
The following sequences are available for this feature:

Gene sequence (with intron)

CTCTCATTGCTCCTCTCTACAAACACCATACCTCTCTCCTCTACTCCATCTCCCTCCACCTCAAAACTCCCCTCCGCCCAGCCTCCCTCTACCTCGACCTCGGCGGCGCCTTCTCCTGGATCGACTGCTACCAAAATTACAACTCTTCCTCTTACAAATTCGTCCTTTGCAATACCCCTCTCTCCAATTCCTTCAACCAGGCTATTTGCGGCTCCTGCGTCCAAGCTCCCTCTCCTATCTGTGCTAACGACACCATCTTCTCTTACGCCTATCCAGAAAACCCATCCCTCAGAGATCATTTTGTTGATTACGATCACCCTAAGCTCACCGATTCCGAGAATGTCATCACTGATGTTCTTGCTCTCTCCACCACAGACGGCTCCACATCCGGTCCACTCCGTCGCATTCCTGAATTTCCTTTCGCATGCGTCAAGACCAATTTCCTCCGAGAAGTTGCTAAGAATGTCATTGGCCTAGCCGCGCTCGGCCGTTCCAACTTATCGATTCCATCGGTGATTAGTGCAAAATTCAATAGCCCTAAGTATTTTGCCATTTGTTTATCAGGAGCGAGGTCAGGGCCTGGTGTTGCTTTCTTCGGATCTAAAGGCCCGTACAGATTTTCCCCCAATGTTGATCTTTCTAAATCCCTAACTTACACACCATTGCTCTTCAATCCGGTTAGCGCCTCGATTTACACCTATTGGTTACCGTCTTACGAGTATTACGTTGGACTCTCCGCCATTAGAATCAACGGCAAGGTGGTGCCGTTCAACACATCTTTATTGTCGTTTGAGCCGATTCACGGTCGCGGTGGGGCTAAGATCAGCACCTCCACCAATTACGCGTTGCTACGGAGTTCGATTTATAGAGCATTCGCGACGGTGTTTATGAAGGAAGCGGTTGTACTCAACTTCAAGTTGATAAATGCGGTAGAGCCGTTCGGAGTGTGCTATGAGGCAAAGAGCGTGGGAGTGACGGCGGAAGGACAGGCGAAGGCTCCGGTGGTGGATTTGGTTATGGAGAAAGAGAAAGTGGTGTGGAAATTAGGGGGGAGGAATACGATGGTGAGGATTAAGAAGAAGGGAGTGGATGCTTGGTGCTTGGGATTCATCAATGGCGGAGAATTTCCAAGAACGCCGATCGTGATCGGAGGTTTGCAAATGGAAGATCATTTGTTGCAGTTCGATCTTGAAAATTTCAGATTTGGATTTAGCTCTTCGGCATTAACGGAGGGAACTTCATGTTCAAAATTCGACTTCACTTCTGCAAACAACACTTTCTTTTAA

mRNA sequence

CTCTCATTGCTCCTCTCTACAAACACCATACCTCTCTCCTCTACTCCATCTCCCTCCACCTCAAAACTCCCCTCCGCCCAGCCTCCCTCTACCTCGACCTCGGCGGCGCCTTCTCCTGGATCGACTGCTACCAAAATTACAACTCTTCCTCTTACAAATTCGTCCTTTGCAATACCCCTCTCTCCAATTCCTTCAACCAGGCTATTTGCGGCTCCTGCGTCCAAGCTCCCTCTCCTATCTGTGCTAACGACACCATCTTCTCTTACGCCTATCCAGAAAACCCATCCCTCAGAGATCATTTTGTTGATTACGATCACCCTAAGCTCACCGATTCCGAGAATGTCATCACTGATGTTCTTGCTCTCTCCACCACAGACGGCTCCACATCCGGTCCACTCCGTCGCATTCCTGAATTTCCTTTCGCATGCGTCAAGACCAATTTCCTCCGAGAAGTTGCTAAGAATGTCATTGGCCTAGCCGCGCTCGGCCGTTCCAACTTATCGATTCCATCGGTGATTAGTGCAAAATTCAATAGCCCTAAGTATTTTGCCATTTGTTTATCAGGAGCGAGGTCAGGGCCTGGTGTTGCTTTCTTCGGATCTAAAGGCCCGTACAGATTTTCCCCCAATGTTGATCTTTCTAAATCCCTAACTTACACACCATTGCTCTTCAATCCGGTTAGCGCCTCGATTTACACCTATTGGTTACCGTCTTACGAGTATTACGTTGGACTCTCCGCCATTAGAATCAACGGCAAGGTGGTGCCGTTCAACACATCTTTATTGTCGTTTGAGCCGATTCACGGTCGCGGTGGGGCTAAGATCAGCACCTCCACCAATTACGCGTTGCTACGGAGTTCGATTTATAGAGCATTCGCGACGGTGTTTATGAAGGAAGCGGTTGTACTCAACTTCAAGTTGATAAATGCGGTAGAGCCGTTCGGAGTGTGCTATGAGGCAAAGAGCGTGGGAGTGACGGCGGAAGGACAGGCGAAGGCTCCGGTGGTGGATTTGGTTATGGAGAAAGAGAAAGTGGTGTGGAAATTAGGGGGGAGGAATACGATGGTGAGGATTAAGAAGAAGGGAGTGGATGCTTGGTGCTTGGGATTCATCAATGGCGGAGAATTTCCAAGAACGCCGATCGTGATCGGAGGTTTGCAAATGGAAGATCATTTGTTGCAGTTCGATCTTGAAAATTTCAGATTTGGATTTAGCTCTTCGGCATTAACGGAGGGAACTTCATGTTCAAAATTCGACTTCACTTCTGCAAACAACACTTTCTTTTAA

Coding sequence (CDS)

CTCTCATTGCTCCTCTCTACAAACACCATACCTCTCTCCTCTACTCCATCTCCCTCCACCTCAAAACTCCCCTCCGCCCAGCCTCCCTCTACCTCGACCTCGGCGGCGCCTTCTCCTGGATCGACTGCTACCAAAATTACAACTCTTCCTCTTACAAATTCGTCCTTTGCAATACCCCTCTCTCCAATTCCTTCAACCAGGCTATTTGCGGCTCCTGCGTCCAAGCTCCCTCTCCTATCTGTGCTAACGACACCATCTTCTCTTACGCCTATCCAGAAAACCCATCCCTCAGAGATCATTTTGTTGATTACGATCACCCTAAGCTCACCGATTCCGAGAATGTCATCACTGATGTTCTTGCTCTCTCCACCACAGACGGCTCCACATCCGGTCCACTCCGTCGCATTCCTGAATTTCCTTTCGCATGCGTCAAGACCAATTTCCTCCGAGAAGTTGCTAAGAATGTCATTGGCCTAGCCGCGCTCGGCCGTTCCAACTTATCGATTCCATCGGTGATTAGTGCAAAATTCAATAGCCCTAAGTATTTTGCCATTTGTTTATCAGGAGCGAGGTCAGGGCCTGGTGTTGCTTTCTTCGGATCTAAAGGCCCGTACAGATTTTCCCCCAATGTTGATCTTTCTAAATCCCTAACTTACACACCATTGCTCTTCAATCCGGTTAGCGCCTCGATTTACACCTATTGGTTACCGTCTTACGAGTATTACGTTGGACTCTCCGCCATTAGAATCAACGGCAAGGTGGTGCCGTTCAACACATCTTTATTGTCGTTTGAGCCGATTCACGGTCGCGGTGGGGCTAAGATCAGCACCTCCACCAATTACGCGTTGCTACGGAGTTCGATTTATAGAGCATTCGCGACGGTGTTTATGAAGGAAGCGGTTGTACTCAACTTCAAGTTGATAAATGCGGTAGAGCCGTTCGGAGTGTGCTATGAGGCAAAGAGCGTGGGAGTGACGGCGGAAGGACAGGCGAAGGCTCCGGTGGTGGATTTGGTTATGGAGAAAGAGAAAGTGGTGTGGAAATTAGGGGGGAGGAATACGATGGTGAGGATTAAGAAGAAGGGAGTGGATGCTTGGTGCTTGGGATTCATCAATGGCGGAGAATTTCCAAGAACGCCGATCGTGATCGGAGGTTTGCAAATGGAAGATCATTTGTTGCAGTTCGATCTTGAAAATTTCAGATTTGGATTTAGCTCTTCGGCATTAACGGAGGGAACTTCATGTTCAAAATTCGACTTCACTTCTGCAAACAACACTTTCTTTTAA

Protein sequence

LIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTPLSNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLALSTTDGSTSGPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNSPKYFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTYWLPSYEYYVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEAVVLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIKKKGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTSCSKFDFTSANNTFF*
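
One way to check that the nucleotide and protein records agree is to translate the start of the coding sequence with the standard genetic code. The sketch below is illustrative: the `translate` function and its `offset` parameter are hypothetical names, not part of the database. For the sequences listed on this page, the protein matches when reading starts at the third nucleotide (offset 2), not the first.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(nt, offset=0):
    """Translate a DNA string codon by codon, starting at `offset`.

    Codons containing unknown characters become 'X'; a trailing
    partial codon is ignored.
    """
    nt = nt.upper()
    protein = []
    for i in range(offset, len(nt) - 2, 3):
        protein.append(CODON_TABLE.get(nt[i:i + 3], "X"))
    return "".join(protein)

# First 35 nucleotides of the coding sequence listed above.
cds_prefix = "CTCTCATTGCTCCTCTCTACAAACACCATACCTCT"
print(translate(cds_prefix, offset=2))  # LIAPLYKHHTS
```

The printed peptide matches the first eleven residues of the protein sequence, and the final codons of the listed sequence (TTC TTT TAA) give the terminal "FF*".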
BLAST of Cucsa.084600 vs. Swiss-Prot
Match: 7SBG2_SOYBN (Basic 7S globulin 2 OS=Glycine max PE=1 SV=1)

HSP 1 Score: 189.1 bits (479), Expect = 9.9e-47
Identity = 137/430 (31.86%), Positives = 216/430 (50.23%), Query Frame = 1

Query: 1   LIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTPL 60
           L+ P+    ++ L+  +L  +TPL    + +DL G   W++C Q+Y+S +Y+   C++  
Sbjct: 41  LVLPVQNDASTGLHWANLQKRTPLMQVPVLVDLNGNHLWVNCEQHYSSKTYQAPFCHSTQ 100

Query: 61  SNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA 120
            +  N   C SC  A  P C  +T    +   NP  +           T    +  DVLA
Sbjct: 101 CSRANTHQCLSCPAASRPGCHKNTCGLMS--TNPITQQ----------TGLGELGQDVLA 160

Query: 121 LSTTDGSTS--GPLRRIPEFPFACVKTNFLRE-VAKNVIGLAALGRSNLSIPSVISAKFN 180
           +  T GST   GPL  +P+F F+C  +  L++ + +N+ G+A LG + +S+P+ +++ F 
Sbjct: 161 IHATQGSTQQLGPLVTVPQFLFSCAPSFLLQKGLPRNIQGVAGLGHAPISLPNQLASHFG 220

Query: 181 SPKYFAICLSGARSGPGVAFFGSKGPYRFSP--NVDLSKSLTYTPLLFNPVSASIYTYWL 240
               F  CLS   +  G   FG   P       N D+   L +TPL   P          
Sbjct: 221 LQHQFTTCLSRYPTSKGALIFGD-APNNMQQFHNQDIFHDLAFTPLTVTPQG-------- 280

Query: 241 PSYEYYVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVF 300
              EY V +S+IRIN   V F  + +S   +   GG  ISTST + +L+ S+Y+AF  VF
Sbjct: 281 ---EYNVRVSSIRINQHSV-FPPNKISSTIVGSSGGTMISTSTPHMVLQQSLYQAFTQVF 340

Query: 301 MKEAVVLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEK-VVWKLGGRNTM 360
            ++  +     + +V PFG+C+ +  +          P VDLVM+K    VW++ G + M
Sbjct: 341 AQQ--LEKQAQVKSVAPFGLCFNSNKINAY-------PSVDLVMDKPNGPVWRISGEDLM 400

Query: 361 VRIKKKGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSAL-TEGTS 420
           V+ +  GV   CLG +NGG  PR  + +G  Q+E+ L+ FDL   R GFS+S+L + G  
Sbjct: 401 VQAQP-GVT--CLGVMNGGMQPRAEVTLGTRQLEEKLMVFDLARSRVGFSTSSLHSHGVK 433

Query: 421 CSK-FDFTSA 423
           C   F+F +A
Sbjct: 461 CGDLFNFANA 433
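
The percentages in an HSP summary line are the ratios of identical (or similar) positions to alignment length. The sketch below recomputes them from the raw counts as a consistency check. The function name `hsp_percentages` is hypothetical, and the pattern accepts both the "Positives" spelling and the "Postives" spelling that appears in some exports.

```python
import re

# Matches e.g. "Identity = 137/430 (31.86%), Positives = 216/430 (50.23%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return recomputed and reported identity/positives percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("no HSP statistics found in line")
    ident, ident_len, pos, pos_len = (int(m.group(i)) for i in (1, 2, 4, 5))
    return {
        "identity_pct": round(100 * ident / ident_len, 2),
        "identity_reported": float(m.group(3)),
        "positives_pct": round(100 * pos / pos_len, 2),
        "positives_reported": float(m.group(6)),
    }

stats = hsp_percentages(
    "Identity = 137/430 (31.86%), Positives = 216/430 (50.23%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])  # 31.86 50.23
```

The same check can be run over every HSP summary line in a report to catch transcription errors in the counts or percentages.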

BLAST of Cucsa.084600 vs. Swiss-Prot
Match: 7SB1_SOYBN (Basic 7S globulin OS=Glycine max GN=BG PE=1 SV=2)

HSP 1 Score: 182.2 bits (461), Expect = 1.2e-44
Identity = 135/430 (31.40%), Positives = 216/430 (50.23%), Query Frame = 1

Query: 1   LIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTPL 60
           ++ P+    ++ L+  +L  +TPL    + +DL G   W++C Q Y+S +Y+   C++  
Sbjct: 34  VVLPVQNDGSTGLHWANLQKRTPLMQVPVLVDLNGNHLWVNCEQQYSSKTYQAPFCHSTQ 93

Query: 61  SNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA 120
            +  N   C SC  A  P C  +T    +   NP  +           T    +  DVLA
Sbjct: 94  CSRANTHQCLSCPAASRPGCHKNTCGLMS--TNPITQQ----------TGLGELGEDVLA 153

Query: 121 LSTTDGSTS--GPLRRIPEFPFACVKTNFLRE-VAKNVIGLAALGRSNLSIPSVISAKFN 180
           +  T GST   GPL  +P+F F+C  +  +++ + +N  G+A LG + +S+P+ +++ F 
Sbjct: 154 IHATQGSTQQLGPLVTVPQFLFSCAPSFLVQKGLPRNTQGVAGLGHAPISLPNQLASHFG 213

Query: 181 SPKYFAICLSGARSGPGVAFFG-SKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTYWLP 240
             + F  CLS   +  G   FG +    R   N D+   L +TPL               
Sbjct: 214 LQRQFTTCLSRYPTSKGAIIFGDAPNNMRQFQNQDIFHDLAFTPLTITLQG--------- 273

Query: 241 SYEYYVGLSAIRING-KVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVF 300
             EY V +++IRIN   V P N  + S       GG  ISTST + +L+ S+Y+AF  VF
Sbjct: 274 --EYNVRVNSIRINQHSVFPLN-KISSTIVGSTSGGTMISTSTPHMVLQQSVYQAFTQVF 333

Query: 301 MKEAVVLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEK-VVWKLGGRNTM 360
            ++  +     + +V PFG+C+ +  +          P VDLVM+K    VW++ G + M
Sbjct: 334 AQQ--LPKQAQVKSVAPFGLCFNSNKINAY-------PSVDLVMDKPNGPVWRISGEDLM 393

Query: 361 VRIKKKGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSAL-TEGTS 420
           V+ +  GV   CLG +NGG  PR  I +G  Q+E++L+ FDL   R GFS+S+L + G  
Sbjct: 394 VQAQP-GVT--CLGVMNGGMQPRAEITLGARQLEENLVVFDLARSRVGFSTSSLHSHGVK 427

Query: 421 CSK-FDFTSA 423
           C+  F+F +A
Sbjct: 454 CADLFNFANA 427

BLAST of Cucsa.084600 vs. TrEMBL
Match: A0A0A0K506_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G390050 PE=3 SV=1)

HSP 1 Score: 855.1 bits (2208), Expect = 3.6e-245
Identity = 422/427 (98.83%), Positives = 423/427 (99.06%), Query Frame = 1

Query: 1   LIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTPL 60
           LIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWI CYQNYNSSSYKFVLCNTPL
Sbjct: 25  LIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIHCYQNYNSSSYKFVLCNTPL 84

Query: 61  SNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA 120
           SNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA
Sbjct: 85  SNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA 144

Query: 121 LSTTDGSTSGPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNSPK 180
           LSTT GSTS PLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKF+SPK
Sbjct: 145 LSTTGGSTSAPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFSSPK 204

Query: 181 YFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTYWLPSYEY 240
           YFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTYWLPSYEY
Sbjct: 205 YFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTYWLPSYEY 264

Query: 241 YVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEAV 300
           YVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEAV
Sbjct: 265 YVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEAV 324

Query: 301 VLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIKKK 360
           VLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIKKK
Sbjct: 325 VLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIKKK 384

Query: 361 GVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTSCSKFDFT 420
           GVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSAL EGTSCSKFDFT
Sbjct: 385 GVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALKEGTSCSKFDFT 444

Query: 421 SANNTFF 428
           SANNTFF
Sbjct: 445 SANNTFF 451

BLAST of Cucsa.084600 vs. TrEMBL
Match: A0A067JCA9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21952 PE=3 SV=1)

HSP 1 Score: 396.7 bits (1018), Expect = 3.5e-107
Identity = 205/414 (49.52%), Positives = 276/414 (66.67%), Query Frame = 1

Query: 1   LIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTPL 60
           L+AP+ K   SLLY+I+++LKTPL+P  L+LDLG +F+W+DC+++YNS+SY+ + C++ L
Sbjct: 28  LVAPIQKDPDSLLYTITVYLKTPLQPTKLHLDLGASFTWLDCFRDYNSTSYQHIPCSSSL 87

Query: 61  SNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA 120
             SF+   CG+C     P CAN++ F   YPENP  R   +             + D LA
Sbjct: 88  CTSFHSVACGNCNDTLGPACANNSCF--LYPENPITRQATL----------ATALVDSLA 147

Query: 121 LSTTDGSTSGPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNSPK 180
           L TTDG+T G +  +PEF F+C +   L  +AK V GLAALGRS  S+P  IS   +SP 
Sbjct: 148 LPTTDGTTIGEMVIVPEFVFSCARPFLLNGLAKEVTGLAALGRSQHSLPVQISDAVSSPH 207

Query: 181 YFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTY-WLPSYE 240
           YFA+CLSG+ +  GVAFF + GPY F P VDLSKSL YT LL NPV +++ +Y   PS E
Sbjct: 208 YFALCLSGSSTEHGVAFFATSGPYYFLPRVDLSKSLVYTGLLLNPVGSTVISYNQQPSDE 267

Query: 241 YYVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEA 300
           YY+ L+++++NGK +  N+S LSF+  +G GG K+ST T+Y  L SSIYRAF   F+ E+
Sbjct: 268 YYINLTSVKVNGKPIQLNSSQLSFDE-NGFGGTKLSTDTSYTTLESSIYRAFVEAFVNES 327

Query: 301 VVLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIKK 360
             LN  + + V+PF VCY+A  V  T  G    P VD VME E V W++ G N+MV I++
Sbjct: 328 AGLNLTVTSVVKPFDVCYQATDVISTRVGPG-VPTVDFVMESEDVFWRIFGWNSMVMIER 387

Query: 361 KGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTS 414
            GVD WCLGF++GG   R  IVIGG QMED+LL+FD+E+ R GFSSS L +GTS
Sbjct: 388 DGVDLWCLGFVDGGVNARASIVIGGHQMEDNLLEFDMESKRLGFSSSLLLKGTS 427

BLAST of Cucsa.084600 vs. TrEMBL
Match: A0A061DRU5_THECC (Eukaryotic aspartyl protease family protein, putative OS=Theobroma cacao GN=TCM_004428 PE=3 SV=1)

HSP 1 Score: 388.3 bits (996), Expect = 1.2e-104
Identity = 203/421 (48.22%), Positives = 276/421 (65.56%), Query Frame = 1

Query: 2   IAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTPLS 61
           +AP+ K +++LLYS++L+LKTPL+P  L+LDLG +F+W+DC  +YNSS+Y+ + C +PL 
Sbjct: 29  VAPIGKDNSTLLYSLTLYLKTPLQPTRLHLDLGSSFTWVDCDTDYNSSTYQHIPCGSPLC 88

Query: 62  NSFNQAI-CGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA 121
           +S    + C +C   PSP CAN+T     +PEN   R           T     +TD LA
Sbjct: 89  SSLGHNLSCSNCFNPPSPSCANNTCS--LFPENSITRK----------TAISTALTDSLA 148

Query: 122 LSTTDGSTSGPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNSPK 181
           L T+DGST GP   I  + F+C   + L  +AKNV GLAA GRSN S+P  +S  F+ P+
Sbjct: 149 LPTSDGSTHGPPLLISAYIFSCSPPSLLEGLAKNVTGLAAFGRSNYSLPVQVSDTFSIPR 208

Query: 182 YFAICLSGARSGPGVAFFGSKGPYRFSPN-VDLSKSLTYTPLLFNPVSASIYTY-WLPSY 241
            FA+CL G+ + PGV   GS GPY FS   +DLSKSL YTPL+ NPV +++ TY   PS 
Sbjct: 209 CFALCLPGSTADPGVVLVGSLGPYYFSSQKIDLSKSLIYTPLILNPVGSTVITYVGQPSD 268

Query: 242 EYYVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKE 301
           EYY+ L+AI +NGK +  N S L+ +  +G GG K+STST Y +L +SIY A    F+ E
Sbjct: 269 EYYINLTAINVNGKPIQINGSSLNVDK-NGFGGTKLSTSTTYTVLETSIYNALTDAFVNE 328

Query: 302 AVVLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIK 361
           +  LN  + NAV+PF VCY A  + VT  G    P VDLVM+ + V W++ G N+MV+I 
Sbjct: 329 SSALNLTVTNAVKPFSVCYSAADIIVTRVGPG-VPTVDLVMQSDDVFWRVFGSNSMVQIA 388

Query: 362 KKGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTSCSKFD 420
             G D WCLGF++GG  PRT +VIGG QMED+LLQFDL++ R GF+SS L +GT+C+ F+
Sbjct: 389 GDGGDVWCLGFVDGGVNPRTSVVIGGHQMEDNLLQFDLDSNRLGFTSSVLLKGTTCADFN 435

BLAST of Cucsa.084600 vs. TrEMBL
Match: A0A0D2N1T5_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_004G215700 PE=3 SV=1)

HSP 1 Score: 380.9 bits (977), Expect = 2.0e-102
Identity = 203/424 (47.88%), Positives = 277/424 (65.33%), Query Frame = 1

Query: 2   IAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTPLS 61
           ++P+ K + +LLYS++++LKTPL+P  L+LD+G +FSW+ C   YNS++Y+ +   + L 
Sbjct: 31  VSPIQKDNATLLYSLTVYLKTPLQPTRLHLDVGASFSWVACDAGYNSTTYQHIPWASLLC 90

Query: 62  NSFNQAI-CGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA 121
           +S    + C  C  APSP CAND+     +PEN   R   +    P LTDS       LA
Sbjct: 91  DSLGHNLPCSVCFNAPSPSCANDSCS--LFPENSVTRKIALS---PALTDS-------LA 150

Query: 122 LSTTDGSTSGPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNSPK 181
           L T+DGST GP   +P + F+C  ++ L  +A NV GLAA GRSN S+P+ +S  F+ P+
Sbjct: 151 LPTSDGSTQGPPILLPGYIFSCSPSSLLEGLANNVTGLAAFGRSNYSLPAQVSNTFSVPR 210

Query: 182 YFAICLSGARSGPGVAFFGSKGPYRFSPN-VDLSKSLTYTPLLFNPVSASIYTY-WLPSY 241
            FA+CL G+ S PGVA  GS GPY FSP   DLSK L YTPL+ NPV +++ TY   PS 
Sbjct: 211 CFALCLPGSPSDPGVALIGSVGPYYFSPQKTDLSKLLVYTPLVLNPVGSTVVTYAGEPSD 270

Query: 242 EYYVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKE 301
           EYY+ +++I +NGK +  N+SLL+ E  +G GG KIST+  Y +L SSIY A  T F+ E
Sbjct: 271 EYYINMTSINVNGKPIQINSSLLAVEE-NGSGGTKISTAVPYTVLESSIYNALTTAFVNE 330

Query: 302 AVVLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIK 361
           +  LN  + + V+PFGVCY A  + VT  G    P VD VM+ + V W++ G N+MVRI 
Sbjct: 331 SSALNLTVTDTVKPFGVCYSAADITVTRVGPG-VPTVDFVMQSDNVFWRVFGSNSMVRIT 390

Query: 362 KK-GVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTSCSKF 421
           +  G D WCLGF++GG  PRT +VIGG QM D+LLQFDL+N R GF+SS L +GT+CS F
Sbjct: 391 RDGGGDVWCLGFVDGGVNPRTSVVIGGQQMVDNLLQFDLDNSRLGFTSSVLLKGTTCSNF 440

BLAST of Cucsa.084600 vs. TrEMBL
Match: A0A059AY42_EUCGR (Uncharacterized protein (Fragment) OS=Eucalyptus grandis GN=EUGRSUZ_H01513 PE=3 SV=1)

HSP 1 Score: 367.1 bits (941), Expect = 3.0e-98
Identity = 195/426 (45.77%), Positives = 266/426 (62.44%), Query Frame = 1

Query: 3   APLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSS-YKFVLCNTPLS 62
           AP+ K +T+L Y++S++LKTP +   L L LGG F W+DCY NY SSS Y+ + CN+ + 
Sbjct: 1   APIRKDNTTLRYTLSVYLKTPPQRLDLLLHLGGRFFWVDCYSNYYSSSTYRHIHCNSSVC 60

Query: 63  NSFNQAICGSCV-QAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA 122
             F+   CG C    PSP C+ND+   Y+ PENP +               E+ + D L 
Sbjct: 61  YPFDAVGCGYCSGPPPSPTCSNDSCL-YS-PENPLILK----------VGLEDALVDALG 120

Query: 123 LSTTDGSTSGPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNSPK 182
           L +TDGS++G +  +P F F+C + + L E      GLAALG SN S+P+ ++   + P 
Sbjct: 121 LPSTDGSSAGRVEVVPNFLFSCARPDLLAEFPNGTSGLAALGFSNASLPAQVAKARSLPW 180

Query: 183 YFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTYW-LPSYE 242
            FA+CLSG+RS  GV F G+ GPY F P +DLSK+L+YTP+L NP   +   Y   P+ E
Sbjct: 181 CFALCLSGSRSAAGVTFVGTVGPYNFLPGIDLSKNLSYTPILLNPHGDTFVVYAPKPAAE 240

Query: 243 YYVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEA 302
           YYV ++AI+INGK VP N +LL+ +   G GG  IST   Y  +++SIY A A  F +EA
Sbjct: 241 YYVNVTAIKINGKAVPLNATLLAIDQKSGHGGTLISTLVPYTTVQTSIYGAVARAFAREA 300

Query: 303 -VVLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIK 362
               N      V+PFG+CY A+ V +T  G A  P +DLVM  E  VW+L G N+MVRI+
Sbjct: 301 SAAFNLTATQPVKPFGLCYSARHVNITRAGPA-VPAIDLVMHGEDAVWRLFGANSMVRIQ 360

Query: 363 KKGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTSCSKFD 422
           + G DAWCLGF++GG   R  I++GG QMED+LLQFDL   R GFSSS L  GT+C+ F+
Sbjct: 361 RNGTDAWCLGFVDGGAGFRAAILMGGHQMEDNLLQFDLGKMRLGFSSSVLVRGTTCANFN 413

Query: 423 FTSANN 425
           FT+ N+
Sbjct: 421 FTTGNH 413

BLAST of Cucsa.084600 vs. TAIR10
Match: AT1G03220.1 (AT1G03220.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 311.6 bits (797), Expect = 7.5e-85
Identity = 173/422 (41.00%), Positives = 255/422 (60.43%), Query Frame = 1

Query: 1   LIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTPL 60
           L+ P+ K  ++L Y+  ++ +TPL PAS+  DLGG   W+DC + Y SS+Y+   CN+ +
Sbjct: 31  LLLPVTKDQSTLQYTTVINQRTPLVPASVVFDLGGRELWVDCDKGYVSSTYQSPRCNSAV 90

Query: 61  SNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA 120
            +      CG+C   P P C+N+T      P+N                 S     DV++
Sbjct: 91  CSRAGSTSCGTCFSPPRPGCSNNTCGGI--PDNTVTGT----------ATSGEFALDVVS 150

Query: 121 LSTTDGSTSGPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNSPK 180
           + +T+GS  G + +IP   F C  T  L+ +AK  +G+A +GR N+ +PS  +A F+  +
Sbjct: 151 IQSTNGSNPGRVVKIPNLIFDCGATFLLKGLAKGTVGMAGMGRHNIGLPSQFAAAFSFHR 210

Query: 181 YFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVS-ASIYTYWLPSYE 240
            FA+CL+   SG GVAFFG+ GPY F P + +S SL  TPLL NPVS AS ++    S E
Sbjct: 211 KFAVCLT---SGKGVAFFGN-GPYVFLPGIQIS-SLQTTPLLINPVSTASAFSQGEKSSE 270

Query: 241 YYVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEA 300
           Y++G++AI+I  K VP N +LL      G GG KIS+   Y +L SSIY AF + F+K+A
Sbjct: 271 YFIGVTAIQIVEKTVPINPTLLKINASTGIGGTKISSVNPYTVLESSIYNAFTSEFVKQA 330

Query: 301 VVLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIKK 360
              + K + +V+PFG C+  K+VGVT  G A  P ++LV+  + VVW++ G N+MV +  
Sbjct: 331 AARSIKRVASVKPFGACFSTKNVGVTRLGYA-VPEIELVLHSKDVVWRIFGANSMVSVSD 390

Query: 361 KGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTSCSKFDF 420
              D  CLGF++GG   RT +VIGG Q+ED+L++FDL + +FGFSS+ L   T+C+ F+F
Sbjct: 391 ---DVICLGFVDGGVNARTSVVIGGFQLEDNLIEFDLASNKFGFSSTLLGRQTNCANFNF 431

Query: 421 TS 422
           TS
Sbjct: 451 TS 431

BLAST of Cucsa.084600 vs. TAIR10
Match: AT1G03230.1 (AT1G03230.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 300.1 bits (767), Expect = 2.2e-81
Identity = 164/422 (38.86%), Positives = 253/422 (59.95%), Query Frame = 1

Query: 1   LIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTPL 60
           L+ P+ K  ++L Y+  ++ +TPL PAS+  DLGG   W+DC Q Y S++Y+   CN+ +
Sbjct: 32  LLLPVTKDPSTLQYTTVINQRTPLVPASVVFDLGGREFWVDCDQGYVSTTYRSPRCNSAV 91

Query: 61  SNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA 120
            +      CG+C   P P C+N+T    A+P+N                 S     DV++
Sbjct: 92  CSRAGSIACGTCFSPPRPGCSNNTCG--AFPDNSITG----------WATSGEFALDVVS 151

Query: 121 LSTTDGSTSGPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNSPK 180
           + +T+GS  G   +IP   F+C  T+ L+ +AK  +G+A +GR N+ +P   +A F+  +
Sbjct: 152 IQSTNGSNPGRFVKIPNLIFSCGSTSLLKGLAKGAVGMAGMGRHNIGLPLQFAAAFSFNR 211

Query: 181 YFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASI-YTYWLPSYE 240
            FA+CL+   SG GVAFFG+ GPY F P + +S+ L  TPLL NP +    ++    S E
Sbjct: 212 KFAVCLT---SGRGVAFFGN-GPYVFLPGIQISR-LQKTPLLINPGTTVFEFSKGEKSPE 271

Query: 241 YYVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEA 300
           Y++G++AI+I  K +P + +LL      G GG KIS+   Y +L SSIY+AF + F+++A
Sbjct: 272 YFIGVTAIKIVEKTLPIDPTLLKINASTGIGGTKISSVNPYTVLESSIYKAFTSEFIRQA 331

Query: 301 VVLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIKK 360
              + K + +V+PFG C+  K+VGVT  G A  P + LV+  + VVW++ G N+MV +  
Sbjct: 332 AARSIKRVASVKPFGACFSTKNVGVTRLGYA-VPEIQLVLHSKDVVWRIFGANSMVSVSD 391

Query: 361 KGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTSCSKFDF 420
              D  CLGF++GG  P   +VIGG Q+ED+L++FDL + +FGFSS+ L   T+C+ F+F
Sbjct: 392 ---DVICLGFVDGGVNPGASVVIGGFQLEDNLIEFDLASNKFGFSSTLLGRQTNCANFNF 432

Query: 421 TS 422
           TS
Sbjct: 452 TS 432

BLAST of Cucsa.084600 vs. TAIR10
Match: AT5G19110.1 (AT5G19110.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 153.7 bits (387), Expect = 2.6e-37
Identity = 114/417 (27.34%), Positives = 192/417 (46.04%), Query Frame = 1

Query: 2   IAPLYKHHTSLLYSISLHLKTPLR-PASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTPL 61
           + P+ KH  + L+  + ++ +  + P +L LDLG   +W+DC +  + SS + V C +  
Sbjct: 27  LLPITKHEPTNLFYTTFNVGSAAKSPVNLLLDLGTNLTWLDCRKLKSLSSLRLVTCQS-- 86

Query: 62  SNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA 121
                     +C   P   CA  +   Y  P NP  ++  V         +  V+ D  +
Sbjct: 87  ---------STCKSIPGNGCAGKSCL-YKQP-NPLGQNPVV---------TGRVVQDRAS 146

Query: 122 LSTTDGSTSGPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNSPK 181
           L TTDG        +  F F+C     L+ +   V G+ AL   + S    +++ FN   
Sbjct: 147 LYTTDGGKFLSQVSVRHFTFSCAGEKALQGLPPPVDGVLALSPGSSSFTKQVTSAFNVIP 206

Query: 182 YFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTYWLPSYEY 241
            F++CL  +    G   F   G + F P  + S +    P    P+  +       S +Y
Sbjct: 207 KFSLCLPSS----GTGHFYIAGIHYFIPPFNSSDNPI--PRTLTPIKGT------DSGDY 266

Query: 242 YVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEAV 301
            + + +I + G  +  N  LL+       GGAK+ST  +Y +L++ IY A A  F  +A 
Sbjct: 267 LITVKSIYVGGTALKLNPDLLT-------GGAKLSTVVHYTVLQTDIYNALAQSFTLKAK 326

Query: 302 VLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKE--KVVWKLGGRNTMVRIK 361
            +    + +V PF  C+++++ G         PV+++ +     +V W   G NT+V++K
Sbjct: 327 AMGIAKVPSVAPFKHCFDSRTAGKNLTAGPNVPVIEIGLPGRIGEVKWGFYGANTVVKVK 386

Query: 362 KKGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTSCS 416
           +      CL FI+GG+ P+  +VIG  Q++DH+L+FD       FS S L   TSCS
Sbjct: 387 ET---VMCLAFIDGGKTPKDLMVIGTHQLQDHMLEFDFSGTVLAFSESLLLHNTSCS 399

BLAST of Cucsa.084600 vs. TAIR10
Match: AT5G19100.1 (AT5G19100.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 141.4 bits (355), Expect = 1.3e-33
Identity = 124/420 (29.52%), Positives = 192/420 (45.71%), Query Frame = 1

Query: 4   PLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWI-DCYQNYNSSSYKFVLCNTPLSN 63
           P+YK     +Y+I L + +        LDL GA   + +C     S++Y  + C +    
Sbjct: 33  PIYKDTAKNIYTIPLSIGST-SSEKFVLDLNGAAPLLQNCPTAAKSTTYHPIRCGSTRCK 92

Query: 64  SFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHF-VDYDHPKLTDSENVITDVLAL 123
             N          P+ + A       +   +   RD   + Y    +   ++ ++  L L
Sbjct: 93  YANPNF-----PCPNNVIAKKRTVCLSSDNSRLFRDTVPLLYTFNGVYTRDSEMSSSLTL 152

Query: 124 STTDGSTSGPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNSPKY 183
           + TDG+ +                     + +  IGLA    ++LSIPS + + +  P  
Sbjct: 153 TCTDGAPA---------------------LKQRTIGLA---NTHLSIPSQLISMYQLPHK 212

Query: 184 FAICLSG---ARSGPGVAFFGSKGPYRFSP-NVDLSKSLTYTPLLFNPVSASIYTYWLPS 243
            A+CL     ++S  G  + G KG Y + P + D+SK    TPL+ N  S          
Sbjct: 213 IALCLPSTERSQSHNGDLWIG-KGEYYYLPYDKDVSKIFASTPLIGNGKSG--------- 272

Query: 244 YEYYVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMK 303
            EY + + +I+I  K VP              G  KIST   Y + ++S+Y+A  T F +
Sbjct: 273 -EYLIDVKSIQIGAKTVPIPY-----------GATKISTLAPYTVFQTSLYKALLTAFTE 332

Query: 304 EAVVLNFKLINA--VEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMV 363
                N K+  A  V+PFG C+ +        G    PV+DLV+      W++ G N++V
Sbjct: 333 -----NIKIAKAPAVKPFGACFYSN-------GGRGVPVIDLVLSGG-AKWRIYGSNSLV 384

Query: 364 RIKKKGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTSCS 416
           ++ K  V   CLGF++GG  P+ PIVIGG QMED+L++FDLE  +F FSSS L   TSCS
Sbjct: 393 KVNKNVV---CLGFVDGGVKPKYPIVIGGFQMEDNLVEFDLEASKFSFSSSLLLHNTSCS 384

BLAST of Cucsa.084600 vs. TAIR10
Match: AT5G19120.1 (AT5G19120.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 132.1 bits (331), Expect = 8.1e-31
Identity = 114/415 (27.47%), Positives = 178/415 (42.89%), Query Frame = 1

Query: 1   LIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTPL 60
           ++ P+ K   +  Y   + L     P  L +DL G+  W DC   + SSS   +  ++  
Sbjct: 32  VVFPVVKDLPTGQYLAQIRLGDSPDPVKLVVDLAGSILWFDCSSRHVSSSRNLISGSS-- 91

Query: 61  SNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDH--FVDYDHPKLTDSENVITDV 120
                      C++A      N+ + S +        D    V  D   +T    + +DV
Sbjct: 92  ---------SGCLKAK---VGNERVSSSSSSRKDQNADCELLVKNDAFGITARGELFSDV 151

Query: 121 LALSTTDGSTSGPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNS 180
           +++    GS + P     +  FAC     LR +A    G+  LGR+ +S+PS ++A+ N 
Sbjct: 152 MSV----GSVTSP--GTVDLLFACTPPWLLRGLASGAQGVMGLGRAQISLPSQLAAETNE 211

Query: 181 PKYFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTYWLPSY 240
            +   + LS      GV    S         V  S+SL YTPLL              S 
Sbjct: 212 RRRLTVYLSPLN---GVV---STSSVEEVFGVAASRSLVYTPLLTG-----------SSG 271

Query: 241 EYYVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKE 300
            Y + + +IR+NG+ +           + G    ++ST   Y +L SSIY+ FA  + K 
Sbjct: 272 NYVINVKSIRVNGEKL----------SVEGPLAVELSTVVPYTILESSIYKVFAEAYAKA 331

Query: 301 AVVLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIK 360
           A       +  V PFG+C+ +             P VDL ++ E V W++ G+N MV + 
Sbjct: 332 AG--EATSVPPVAPFGLCFTS---------DVDFPAVDLALQSEMVRWRIHGKNLMVDV- 385

Query: 361 KKGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTS 414
             G    C G ++GG     PIV+GGLQ+E  +L FDL N   GF     ++ TS
Sbjct: 392 --GGGVRCSGIVDGGSSRVNPIVMGGLQLEGFILDFDLGNSMMGFGQRTRSDSTS 385

BLAST of Cucsa.084600 vs. NCBI nr
Match: gi|449462344|ref|XP_004148901.1| (PREDICTED: basic 7S globulin 2 [Cucumis sativus])

HSP 1 Score: 855.1 bits (2208), Expect = 5.1e-245
Identity = 422/427 (98.83%), Positives = 423/427 (99.06%), Query Frame = 1

Query: 1   LIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTPL 60
           LIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWI CYQNYNSSSYKFVLCNTPL
Sbjct: 25  LIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIHCYQNYNSSSYKFVLCNTPL 84

Query: 61  SNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA 120
           SNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA
Sbjct: 85  SNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA 144

Query: 121 LSTTDGSTSGPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNSPK 180
           LSTT GSTS PLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKF+SPK
Sbjct: 145 LSTTGGSTSAPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFSSPK 204

Query: 181 YFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTYWLPSYEY 240
           YFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTYWLPSYEY
Sbjct: 205 YFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTYWLPSYEY 264

Query: 241 YVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEAV 300
           YVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEAV
Sbjct: 265 YVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEAV 324

Query: 301 VLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIKKK 360
           VLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIKKK
Sbjct: 325 VLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIKKK 384

Query: 361 GVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTSCSKFDFT 420
           GVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSAL EGTSCSKFDFT
Sbjct: 385 GVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALKEGTSCSKFDFT 444

Query: 421 SANNTFF 428
           SANNTFF
Sbjct: 445 SANNTFF 451

BLAST of Cucsa.084600 vs. NCBI nr
Match: gi|659100997|ref|XP_008451375.1| (PREDICTED: basic 7S globulin 2 [Cucumis melo])

HSP 1 Score: 729.6 bits (1882), Expect = 3.3e-207
Identity = 362/419 (86.40%), Positives = 384/419 (91.65%), Query Frame = 1

Query: 2   IAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTPLS 61
           IA +Y  H S LYSIS++LKTPLRPASL+LDLGGAFSWIDCY+NYNSSSY+FV+ NTPL+
Sbjct: 31  IASIYNDHNSRLYSISVNLKTPLRPASLHLDLGGAFSWIDCYKNYNSSSYQFVINNTPLA 90

Query: 62  NSFNQAICGSCVQAPSPICANDT---IFSYAYPENPSLRDHFVDYDHPKLTDSENVITDV 121
           +SF Q I  +CV+ PS +C+NDT   IFS+ YPE PS+RD FVDY HP+LTDSE++ITDV
Sbjct: 91  DSFGQGIGSTCVEVPSSMCSNDTVNTIFSFGYPEKPSIRDQFVDYFHPELTDSESLITDV 150

Query: 122 LALSTTDGSTSGPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNS 181
           LALST DGS SG LRR PEFPFACVKTNFLR +AKNVIGLAALGRSNLSIPSVISAKFNS
Sbjct: 151 LALSTPDGSKSGLLRRTPEFPFACVKTNFLRGLAKNVIGLAALGRSNLSIPSVISAKFNS 210

Query: 182 PKYFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTYWLPSY 241
           PK+FAICL GARSGPGVA FGSKGPYRFSPNVDLSKSLTYTPL+FNPVS SIYTYWLPSY
Sbjct: 211 PKFFAICLPGARSGPGVAIFGSKGPYRFSPNVDLSKSLTYTPLVFNPVSGSIYTYWLPSY 270

Query: 242 EYYVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKE 301
           EYYVGLSAIRINGKVVPFNTSLL FE IHGRGGAKISTSTNY LL+SSIYRAFATVFMKE
Sbjct: 271 EYYVGLSAIRINGKVVPFNTSLLPFESIHGRGGAKISTSTNYGLLQSSIYRAFATVFMKE 330

Query: 302 AVVLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIK 361
           A VLNFKLINAVEPFGVCY AKSVGVTAEG AKAPVVDLVMEK KVVWKLGGRNTMVRIK
Sbjct: 331 AAVLNFKLINAVEPFGVCYAAKSVGVTAEGHAKAPVVDLVMEKGKVVWKLGGRNTMVRIK 390

Query: 362 KKGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTSCSKF 418
           KKGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLE FRFGFSSSALTEGTSCSKF
Sbjct: 391 KKGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLEKFRFGFSSSALTEGTSCSKF 449

BLAST of Cucsa.084600 vs. NCBI nr
Match: gi|802792891|ref|XP_012092273.1| (PREDICTED: basic 7S globulin-like [Jatropha curcas])

HSP 1 Score: 414.1 bits (1063), Expect = 3.0e-112
Identity = 205/421 (48.69%), Positives = 278/421 (66.03%), Query Frame = 1

Query: 2   IAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTPLS 61
           + P+ K HT+  Y  +L+LKTPL+     LDLG +F W+DC+ NY S++Y+ + C TPL 
Sbjct: 25  LLPIRKDHTTHQYITTLYLKTPLQATDFVLDLGASFFWVDCHNNYTSTTYQHIPCGTPLC 84

Query: 62  NSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLAL 121
           +SF    CG+C + P P CANDT     +PEN   R            + E  + D LAL
Sbjct: 85  DSFESRACGNCFEPPGPACANDTC--EFFPENSVTRQ----------VNLEAALIDSLAL 144

Query: 122 STTDGSTSGPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNSPKY 181
            TTDGST GPL  +  + F+C +T+ L+ +AK V G+AA+GRSN+S+ + + A F++P Y
Sbjct: 145 PTTDGSTQGPLALVESYIFSCARTSLLQGLAKGVSGMAAMGRSNISLQAQLRAAFSAPSY 204

Query: 182 FAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTYW-LPSYEY 241
           FA+CLSG+R   GVAF G+ GPY F P +D+SKSL+YTPL+ NP+  ++ TY   PS EY
Sbjct: 205 FALCLSGSRQPSGVAFIGTNGPYNFLPGIDVSKSLSYTPLILNPIGGTVITYVNQPSAEY 264

Query: 242 YVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEAV 301
           +VGL++I++NGKVV  N +LL+ +   G GGAKIST   Y +L +SIY+A    F+KE+ 
Sbjct: 265 FVGLTSIKVNGKVVALNQTLLAIDNETGFGGAKISTVVPYTILHTSIYKAVTESFVKESS 324

Query: 302 VLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIKKK 361
            +N  L  A +PF  CY A  +  T  G    P VDLVM+ E V W++ G N+MVR+  K
Sbjct: 325 AMNLTLTKAAKPFSFCYPAMDIVNTRVGPG-VPAVDLVMQGEDVYWRIFGSNSMVRVASK 384

Query: 362 GVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTSCSKFDFT 421
           GVD WCLGF++GG  PRT IVIGG QMED+LLQFDLE+ R GF+SS L +GT C+ F FT
Sbjct: 385 GVDVWCLGFMDGGANPRTSIVIGGYQMEDNLLQFDLESNRLGFTSSLLIKGTKCADFHFT 432

BLAST of Cucsa.084600 vs. NCBI nr
Match: gi|802792887|ref|XP_012092272.1| (PREDICTED: basic 7S globulin 2-like [Jatropha curcas])

HSP 1 Score: 406.8 bits (1044), Expect = 4.8e-110
Identity = 209/422 (49.53%), Positives = 282/422 (66.82%), Query Frame = 1

Query: 1   LIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTPL 60
           L+AP+ K   SLLY+I+++LKTPL+P  L+LDLG +F+W+DC+++YNS+SY+ + C++ L
Sbjct: 28  LVAPIQKDPDSLLYTITVYLKTPLQPTKLHLDLGASFTWLDCFRDYNSTSYQHIPCSSSL 87

Query: 61  SNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA 120
             SF+   CG+C     P CAN++ F   YPENP  R   +             + D LA
Sbjct: 88  CTSFHSVACGNCNDTLGPACANNSCF--LYPENPITRQATL----------ATALVDSLA 147

Query: 121 LSTTDGSTSGPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNSPK 180
           L TTDG+T G +  +PEF F+C +   L  +AK V GLAALGRS  S+P  IS   +SP 
Sbjct: 148 LPTTDGTTIGEMVIVPEFVFSCARPFLLNGLAKEVTGLAALGRSQHSLPVQISDAVSSPH 207

Query: 181 YFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTY-WLPSYE 240
           YFA+CLSG+ +  GVAFF + GPY F P VDLSKSL YT LL NPV +++ +Y   PS E
Sbjct: 208 YFALCLSGSSTEHGVAFFATSGPYYFLPRVDLSKSLVYTGLLLNPVGSTVISYNQQPSDE 267

Query: 241 YYVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEA 300
           YY+ L+++++NGK +  N+S LSF+  +G GG K+ST T+Y  L SSIYRAF   F+ E+
Sbjct: 268 YYINLTSVKVNGKPIQLNSSQLSFDE-NGFGGTKLSTDTSYTTLESSIYRAFVEAFVNES 327

Query: 301 VVLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIKK 360
             LN  + + V+PF VCY+A  V  T  G    P VD VME E V W++ G N+MV I++
Sbjct: 328 AGLNLTVTSVVKPFDVCYQATDVISTRVGPG-VPTVDFVMESEDVFWRIFGWNSMVMIER 387

Query: 361 KGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTSCSKFDF 420
            GVD WCLGF++GG   R  IVIGG QMED+LL+FD+E+ R GFSSS L +GTSC+ F+ 
Sbjct: 388 DGVDLWCLGFVDGGVNARASIVIGGHQMEDNLLEFDMESKRLGFSSSLLLKGTSCANFNL 435

Query: 421 TS 422
           TS
Sbjct: 448 TS 435

BLAST of Cucsa.084600 vs. NCBI nr
Match: gi|643704417|gb|KDP21481.1| (hypothetical protein JCGZ_21952 [Jatropha curcas])

HSP 1 Score: 396.7 bits (1018), Expect = 5.0e-107
Identity = 205/414 (49.52%), Positives = 276/414 (66.67%), Query Frame = 1

Query: 1   LIAPLYKHHTSLLYSISLHLKTPLRPASLYLDLGGAFSWIDCYQNYNSSSYKFVLCNTPL 60
           L+AP+ K   SLLY+I+++LKTPL+P  L+LDLG +F+W+DC+++YNS+SY+ + C++ L
Sbjct: 28  LVAPIQKDPDSLLYTITVYLKTPLQPTKLHLDLGASFTWLDCFRDYNSTSYQHIPCSSSL 87

Query: 61  SNSFNQAICGSCVQAPSPICANDTIFSYAYPENPSLRDHFVDYDHPKLTDSENVITDVLA 120
             SF+   CG+C     P CAN++ F   YPENP  R   +             + D LA
Sbjct: 88  CTSFHSVACGNCNDTLGPACANNSCF--LYPENPITRQATL----------ATALVDSLA 147

Query: 121 LSTTDGSTSGPLRRIPEFPFACVKTNFLREVAKNVIGLAALGRSNLSIPSVISAKFNSPK 180
           L TTDG+T G +  +PEF F+C +   L  +AK V GLAALGRS  S+P  IS   +SP 
Sbjct: 148 LPTTDGTTIGEMVIVPEFVFSCARPFLLNGLAKEVTGLAALGRSQHSLPVQISDAVSSPH 207

Query: 181 YFAICLSGARSGPGVAFFGSKGPYRFSPNVDLSKSLTYTPLLFNPVSASIYTY-WLPSYE 240
           YFA+CLSG+ +  GVAFF + GPY F P VDLSKSL YT LL NPV +++ +Y   PS E
Sbjct: 208 YFALCLSGSSTEHGVAFFATSGPYYFLPRVDLSKSLVYTGLLLNPVGSTVISYNQQPSDE 267

Query: 241 YYVGLSAIRINGKVVPFNTSLLSFEPIHGRGGAKISTSTNYALLRSSIYRAFATVFMKEA 300
           YY+ L+++++NGK +  N+S LSF+  +G GG K+ST T+Y  L SSIYRAF   F+ E+
Sbjct: 268 YYINLTSVKVNGKPIQLNSSQLSFDE-NGFGGTKLSTDTSYTTLESSIYRAFVEAFVNES 327

Query: 301 VVLNFKLINAVEPFGVCYEAKSVGVTAEGQAKAPVVDLVMEKEKVVWKLGGRNTMVRIKK 360
             LN  + + V+PF VCY+A  V  T  G    P VD VME E V W++ G N+MV I++
Sbjct: 328 AGLNLTVTSVVKPFDVCYQATDVISTRVGPG-VPTVDFVMESEDVFWRIFGWNSMVMIER 387

Query: 361 KGVDAWCLGFINGGEFPRTPIVIGGLQMEDHLLQFDLENFRFGFSSSALTEGTS 414
            GVD WCLGF++GG   R  IVIGG QMED+LL+FD+E+ R GFSSS L +GTS
Sbjct: 388 DGVDLWCLGFVDGGVNARASIVIGGHQMEDNLLEFDMESKRLGFSSSLLLKGTS 427
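
The percentages on each HSP summary line follow from the match counts and the alignment length given next to them. As a minimal sketch, the helper below (`hsp_percentages`, a hypothetical name that is not part of this database) recomputes them from a summary line; it assumes the fields keep the `Name = matches/length` layout shown in the records above.

```python
import re

# Hypothetical helper, not part of the database page: recompute the
# percentages on an HSP summary line from its raw match counts.
# Assumes each field is written as "Name = matches/length".
HSP_FIELD = re.compile(r"(\w+) = (\d+)/(\d+)")

def hsp_percentages(line):
    """Return {field name: percent, rounded to 2 decimals}."""
    return {
        name: round(100 * int(matches) / int(length), 2)
        for name, matches, length in HSP_FIELD.findall(line)
    }

# Summary line of the first Jatropha curcas hit; "Query Frame = 1"
# has no matches/length pair, so it is skipped.
line = "Identity = 205/421 (48.69%), Positives = 278/421 (66.03%), Query Frame = 1"
print(hsp_percentages(line))
```

The same call on the other two summary lines should reproduce their printed values, since each is just `100 * matches / length` rounded to two decimals.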

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
7SBG2_SOYBN  9.9e-47  31.86     Basic 7S globulin 2 OS=Glycine max PE=1 SV=1
7SB1_SOYBN   1.2e-44  31.40     Basic 7S globulin OS=Glycine max GN=BG PE=1 SV=2

Match Name        E-value   Identity  Description
A0A0A0K506_CUCSA  3.6e-245  98.83     Uncharacterized protein OS=Cucumis sativus GN=Csa_7G390050 PE=3 SV=1
A0A067JCA9_JATCU  3.5e-107  49.52     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21952 PE=3 SV=1
A0A061DRU5_THECC  1.2e-104  48.22     Eukaryotic aspartyl protease family protein, putative OS=Theobroma cacao GN=TCM_...
A0A0D2N1T5_GOSRA  2.0e-102  47.88     Uncharacterized protein OS=Gossypium raimondii GN=B456_004G215700 PE=3 SV=1
A0A059AY42_EUCGR  3.0e-98   45.77     Uncharacterized protein (Fragment) OS=Eucalyptus grandis GN=EUGRSUZ_H01513 PE=3 ...

Match Name   E-value  Identity  Description
AT1G03220.1  7.5e-85  41.00     Eukaryotic aspartyl protease family protein
AT1G03230.1  2.2e-81  38.86     Eukaryotic aspartyl protease family protein
AT5G19110.1  2.6e-37  27.34     Eukaryotic aspartyl protease family protein
AT5G19100.1  1.3e-33  29.52     Eukaryotic aspartyl protease family protein
AT5G19120.1  8.1e-31  27.47     Eukaryotic aspartyl protease family protein

Match Name                        E-value   Identity  Description
gi|449462344|ref|XP_004148901.1|  5.1e-245  98.83     PREDICTED: basic 7S globulin 2 [Cucumis sativus]
gi|659100997|ref|XP_008451375.1|  3.3e-207  86.40     PREDICTED: basic 7S globulin 2 [Cucumis melo]
gi|802792891|ref|XP_012092273.1|  3.0e-112  48.69     PREDICTED: basic 7S globulin-like [Jatropha curcas]
gi|802792887|ref|XP_012092272.1|  4.8e-110  49.53     PREDICTED: basic 7S globulin 2-like [Jatropha curcas]
gi|643704417|gb|KDP21481.1|       5.0e-107  49.52     hypothetical protein JCGZ_21952 [Jatropha curcas]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001461   Aspartic_peptidase_A1
IPR021109   Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0004190      aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Cucsa.084600.1  Cucsa.084600.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term   IPR Description               Source   Source Term       Source Description                           Alignment
IPR001461  Aspartic peptidase A1 family  PANTHER  PTHR13683         ASPARTYL PROTEASES                           coord: 1..59, score: 3.2E-125; coord: 76..416, score: 3.2E
IPR021109  Aspartic peptidase domain     GENE3D   G3DSA:2.40.70.10                                               coord: 2..200, score: 1.3E-32; coord: 209..415, score: 5.7
IPR021109  Aspartic peptidase domain     unknown  SSF50630          Acid proteases                               coord: 7..413, score: 2.82
None       No IPR available              PANTHER  PTHR13683:SF268   EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEIN  coord: 76..416, score: 3.2E-125; coord: 1..59, score: 3.2E