MS021005 (gene) Bitter gourd (TR) v1

Overview
Name: MS021005
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Aspartyl protease family protein 1-like
Location: scaffold290: 483564 .. 487729 (+)
RNA-Seq Expression: MS021005
Synteny: MS021005
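
The Location field above packs the scaffold name, start, end, and strand into one string. The following is a minimal sketch of how it could be split and the genomic span computed. It assumes the coordinates are 1-based and inclusive, which is common for GFF-style annotation but is not stated on this page; the function names `parse_location` and `span_length` are hypothetical.

```python
import re

# Pattern for strings such as "scaffold290: 483564 .. 487729 (+)".
LOCATION = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("scaffold290: 483564 .. 487729 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 4166 bases on the plus strand of scaffold290; with half-open coordinates the result would differ by one.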
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTCGTCTTCTTCCCTTCCTCCGCTCTCCCTAATGCTCTGCGTTTCCTTTTTCATTTTCAGTTTCGTTTCCCATTTCTCTAATGCCCACGGATCGTTCATGTTCGATGTCCACCACCGCTACTCCGACGCCGTCCGTCGACTCCTCCCCGTCGACGGCTTGCCGGAGGAGGGCACTCTCGAGTACTACGCAGCTATGCTCCGTAGAGACCATTTCTTCCGCGCCCGTCGACTTGCTACCGCTGAAGATCGCCCTCCGCTCACTTTCATCTCCGGCAACGAAACTATTCGACTCAATCCTCTGGGATTGTACGCCTTTCTCTTCATCAATTCTTTCTCTCGCCTTAGATGTGTAATTACAGAATTGTGCTAGGTCATCGAGTTTAGATTTGAATTATGAATCTAAAATGAACGTTCATTCGCTTTCCTATTTGAGTCTCGGTATCTCTTTTGTGGTTCTCATGTTTACTGTTTTTTTTTTTTCTCCCTTATGTGTTGTTGGAACTCAGCCTGCATTACGCTGAAGTTAAGGTGGGGACGCCCGCGGTATCGTATTTAGTGGCGTTGGATACTGGCAGCGATTTGTTCTGGTTACCATGTGACTGTGTTAATTGTGTTACTGGGTTGAATTCATCCAGTGGGGTAATATTTCTTAAACCCTGTTATTTCTGTTCCTCGATTTTAATTTTTCGAGGGTTAGTTGATTCATTCTTGAAGTCCTTGACATGAGAAACACGAACACAAACACATGTGACATGGACATGATGGATTATGTATGTTATTTCCTGAGGAATTAGGAGTTAAGACACTTTGTGCATATATTGTTGAAATTGTGTATTTTGAAAACTAGGTGAGCTTTGTACATATTTTGAGGACTTTCACTCTAAAATCATTGTTTTTTATGCTATGGATAAGAAATTAAAGAAGCCAATATATATTTGGATGGATATGGTAGAATATAACAAACGTTTATCAAGAAATTCTTGTGTCAAGTGGGCATTTCTTGAGTGACTCAAGATCTTATAAGAAAAGGAGATTCAAAATCAAACGCGTCCATTTTTAAATCAATGAAAAATTGTTTCTGGTAAAAAAAAAAATCAATTATATTGTTATATGTTAAAAGAAAAGTTTGATGTGTTCTGTTTATTGTCTTTATCTCATTACTGATTTCCATGGGAATTTCTTCTGTTACTGTTTCTGTCAACAGGTAATAAGATTTAATATCTACAGCCCTAATAATTCATCAACCAGTAAGGAGGTCCCATGTAGTAGCGCCTTGTGTTCACATCCAAGCCAATGCTCTTCACCGAGTGGCACATGCCCATATGTGGTCTCGTACCTTTCTGAAAACACCTCTTCTACTGGCTACTTGGTAGAGGATATATTGCACTTAACCACAAACGATGACCAATCAAAGCCCGTTAATGCAAAGATTACTCTTGGGTAGGTTATCGTTTTATCTCTTCAATAATTGATTTTGTTCTTCTTGTCTCGCCTTTCAATAATAATTGATTTCGTTCTTCATTTCTACTCAGTTGGGTTATTGTTATCTGGATATGTTGTTCATCAGGTGTGGTAAGGACCAGAGTGGTGCATTTTTAAACTCTGCAGCACCAAATGGATTATTTGGGCTAGGTATCGAGAATGTTTCAGTTCCTAGCATCTTGGCAAATGAAGGACTCACTTCAAATTCTTTTTCCTTGTGTTTTGGACGTCATGGAATGGGAAGAATTGAATTTGGAGACAAAGGTAGTCCGGATCAAAGTGAAACACCATTCAACTTAGGACGAAAACAGTAAGTGTGCCACTTCCTAATAGAGTGTATATGCTAACTTCATGTTTTGTAAGCTGGGCAAGTAGAAATTTCCAATTAAATCGTTCTAATGGTCTTTATTTTATTTTGTTTTCTCGTTTTGTGCATGTCAGTCCAACTTACAATATCAGCATCACTCAGATAAATGTGGGAGGAAACGTTTCCAATCTTGAATTTGATGCAAT
TTTTGATTCTGGGACCTCATTTACCTACCTGAACGACCCAGCCTATTCACTTATTTCTGACAAAGTAAGCACACTGTAAACTTATCTGCAGTTTCTTGCGCCATGTCCTTCAGTTTGCTTATCCATGTCATTTTGTAGTTTGATTCTACGGTTGAAGAAAAGCGGTATACAATGAATTCAGACTTCCCTTTTGAAAACTGCTATGAAATGAGGTAACGTGATATGGTTCCATATATCAGCAAGAATAAAAGATAATAAGAATCTAATTTGAGGAAACCTTGATCTCTGAATTGAGTTTCAGAAGAGATCAGGCTTATTAACTCAAAGATTTCAATAAATAATTCTGCACTTCTTTCAGGTGATTTTATTGTATAAGTTACAACATAATGTGAAAAGTATCTCGGAATTGAGACATCCAGACCCATAAAACTCCACCAAAAATAATTAAAGCTTCACAATGGTTAAACCCACTTAACAATTAACTTGATTAAATCTAACCACAAATGACGATAGAAGAAAACTATGCCAACTTTGAAAAAATATCCAAGTATTCAAACATTCAAAATCTGAAACATAGTTAACTAATTTAACAATTATCACGTAATATATTATTTCCATCGTAACGTAATTATACCGTGGGAAGATACCAGTTTAGAGATAAATCTCTCTTTTCATTACCATGTGCTGACTTCCACTTTCTTTCAGCCCAAACCAAAACAACTTCACATACCCTGTAATGAATCTGACGATGAAAGGTGGTAAACAGTTTTTCATCAATCATCCAGTGATTCCGTTCAACGGCAAGGTAAAGTTATGATTAGTATTGTGTAATATTCTCTTTGGCTACTATTGTTAACTTTCTTTCTGATCATGTTCTTCAGGTGACAAGTATTTTTTGTATTGCCATTTTCAGAAGTCCAGACATTAATATCATTGGACGTAAGTATCTCAGTTTTTGGCTGATAGTTTATCGACACCTTGAGGTTTAGTTACCACATCTTGTAGAACTTATCTTTTGTAGGATGGTAATCGATTGCGCATTTAAGTTGCTTATTGCTTGCTTTTCGTTTCATATCAAGAGTCCACCTTTTATTCCTCTAACCCCACATATGAAGGATTGCGTGTAAAAAAAAAAAGTTATTTAGTCCTTCAATTTTTAGGAGCGTGTCTCATAGATCCCTAAACTTTCAATTTGTATCTAAATAGGTTTCTAAACTTTCAATTTTGTGTCTAATAAGCTCTTGAACTTTAAAAGTATCTAATAATTGAGTCATTGATCTATTAGCATTTGTAAAATTTGTTAGGTTAGATACAAAATTGAAAAGTCAAGATCCTGTTAAATACAAGATTCAATTAGGTTAGAGATCCATTAGGTATCAATATTGATGGTTTAGGAATTTATTACACACAAAATTGAAAGCTAAGAACCTATTAGATACTTTTTAAAGTTCACGGACCTATTAGACATAACTTTGAAAGTTAAGGTATAATTTAACCTGAAAAGAAATAGAGAACTTCATTGATGCACAAAACCTAACATGGAGGATACGAGTTGTGAACTTAAATCAACAATTGGCCATAGCTTTGTGCCTTTGTTATACTTTACATGGATTTACTCTAGAAATGTTGTTTGTCACTTCACTGCGAATTTGTTATACTTTACATGGATTTACTCTAGAAATGTTGTTTGTCACTTCACTGCGAATATATCAAGCCGATCTTTTTTCTCATGATGTGTATACATATTACACTTTGCATTCACGCAGAAAACTTCATGGCTGGTTATCACATAGTATTTGACCGTGAAAAGATGGTTTTGGGTTGGAAGAAGTCAAACTGTAAGTATCTCTGCTAAAGTTCTCTTAGTTTCTTCTCTCTTCCGGGATAAAATTTTGCAGCTACCTTATAACATGAGATTATAATTTTCCAGGCAATGGTGACGAGAATGAGATCACCAACAATTTTCCCGTCGATCCATCGCCAGCCCCCGCCCCCGCCCC
CGCCCCCGGAAGAGCAGTCAACCCACAAGCCAACAGCAACAGCAACATTAACAACTCTTCTCGAACTATAGAACCACCAAGACCTGCAGGAAATAGTGGTTCAAACCTTCTAAGTTCAGTCATTCTCACGTTGGTAATGATTCTTTTTCCATTTTTGCTTTTTGTT

mRNA sequence

ATGGCTTCGTCTTCTTCCCTTCCTCCGCTCTCCCTAATGCTCTGCGTTTCCTTTTTCATTTTCAGTTTCGTTTCCCATTTCTCTAATGCCCACGGATCGTTCATGTTCGATGTCCACCACCGCTACTCCGACGCCGTCCGTCGACTCCTCCCCGTCGACGGCTTGCCGGAGGAGGGCACTCTCGAGTACTACGCAGCTATGCTCCGTAGAGACCATTTCTTCCGCGCCCGTCGACTTGCTACCGCTGAAGATCGCCCTCCGCTCACTTTCATCTCCGGCAACGAAACTATTCGACTCAATCCTCTGGGATTCCTGCATTACGCTGAAGTTAAGGTGGGGACGCCCGCGGTATCGTATTTAGTGGCGTTGGATACTGGCAGCGATTTGTTCTGGTTACCATGTGACTGTGTTAATTGTGTTACTGGGTTGAATTCATCCAGTGGGGTAATAAGATTTAATATCTACAGCCCTAATAATTCATCAACCAGTAAGGAGGTCCCATGTAGTAGCGCCTTGTGTTCACATCCAAGCCAATGCTCTTCACCGAGTGGCACATGCCCATATGTGGTCTCGTACCTTTCTGAAAACACCTCTTCTACTGGCTACTTGGTAGAGGATATATTGCACTTAACCACAAACGATGACCAATCAAAGCCCGTTAATGCAAAGATTACTCTTGGGTGTGGTAAGGACCAGAGTGGTGCATTTTTAAACTCTGCAGCACCAAATGGATTATTTGGGCTAGGTATCGAGAATGTTTCAGTTCCTAGCATCTTGGCAAATGAAGGACTCACTTCAAATTCTTTTTCCTTGTGTTTTGGACGTCATGGAATGGGAAGAATTGAATTTGGAGACAAAGGTAGTCCGGATCAAAGTGAAACACCATTCAACTTAGGACGAAAACATCCAACTTACAATATCAGCATCACTCAGATAAATGTGGGAGGAAACGTTTCCAATCTTGAATTTGATGCAATTTTTGATTCTGGGACCTCATTTACCTACCTGAACGACCCAGCCTATTCACTTATTTCTGACAAATTTGATTCTACGGTTGAAGAAAAGCGGTATACAATGAATTCAGACTTCCCTTTTGAAAACTGCTATGAAATGAGCCCAAACCAAAACAACTTCACATACCCTGTAATGAATCTGACGATGAAAGGTGGTAAACAGTTTTTCATCAATCATCCAGTGATTCCGTTCAACGGCAAGCCGATCTTTTTTCTCATGATGTGTATACATATTACACTTTGCATTCACGCAGAAAACTTCATGGCTGGTTATCACATAGTATTTGACCGTGAAAAGATGGTTTTGGGTTGGAAGAAGTCAAACTGCAATGGTGACGAGAATGAGATCACCAACAATTTTCCCGTCGATCCATCGCCAGCCCCCGCCCCCGCCCCCGCCCCCGGAAGAGCAGTCAACCCACAAGCCAACAGCAACAGCAACATTAACAACTCTTCTCGAACTATAGAACCACCAAGACCTGCAGGAAATAGTGGTTCAAACCTTCTAAGTTCAGTCATTCTCACGTTGGTAATGATTCTTTTTCCATTTTTGCTTTTTGTT

Coding sequence (CDS)

ATGGCTTCGTCTTCTTCCCTTCCTCCGCTCTCCCTAATGCTCTGCGTTTCCTTTTTCATTTTCAGTTTCGTTTCCCATTTCTCTAATGCCCACGGATCGTTCATGTTCGATGTCCACCACCGCTACTCCGACGCCGTCCGTCGACTCCTCCCCGTCGACGGCTTGCCGGAGGAGGGCACTCTCGAGTACTACGCAGCTATGCTCCGTAGAGACCATTTCTTCCGCGCCCGTCGACTTGCTACCGCTGAAGATCGCCCTCCGCTCACTTTCATCTCCGGCAACGAAACTATTCGACTCAATCCTCTGGGATTCCTGCATTACGCTGAAGTTAAGGTGGGGACGCCCGCGGTATCGTATTTAGTGGCGTTGGATACTGGCAGCGATTTGTTCTGGTTACCATGTGACTGTGTTAATTGTGTTACTGGGTTGAATTCATCCAGTGGGGTAATAAGATTTAATATCTACAGCCCTAATAATTCATCAACCAGTAAGGAGGTCCCATGTAGTAGCGCCTTGTGTTCACATCCAAGCCAATGCTCTTCACCGAGTGGCACATGCCCATATGTGGTCTCGTACCTTTCTGAAAACACCTCTTCTACTGGCTACTTGGTAGAGGATATATTGCACTTAACCACAAACGATGACCAATCAAAGCCCGTTAATGCAAAGATTACTCTTGGGTGTGGTAAGGACCAGAGTGGTGCATTTTTAAACTCTGCAGCACCAAATGGATTATTTGGGCTAGGTATCGAGAATGTTTCAGTTCCTAGCATCTTGGCAAATGAAGGACTCACTTCAAATTCTTTTTCCTTGTGTTTTGGACGTCATGGAATGGGAAGAATTGAATTTGGAGACAAAGGTAGTCCGGATCAAAGTGAAACACCATTCAACTTAGGACGAAAACATCCAACTTACAATATCAGCATCACTCAGATAAATGTGGGAGGAAACGTTTCCAATCTTGAATTTGATGCAATTTTTGATTCTGGGACCTCATTTACCTACCTGAACGACCCAGCCTATTCACTTATTTCTGACAAATTTGATTCTACGGTTGAAGAAAAGCGGTATACAATGAATTCAGACTTCCCTTTTGAAAACTGCTATGAAATGAGCCCAAACCAAAACAACTTCACATACCCTGTAATGAATCTGACGATGAAAGGTGGTAAACAGTTTTTCATCAATCATCCAGTGATTCCGTTCAACGGCAAGCCGATCTTTTTTCTCATGATGTGTATACATATTACACTTTGCATTCACGCAGAAAACTTCATGGCTGGTTATCACATAGTATTTGACCGTGAAAAGATGGTTTTGGGTTGGAAGAAGTCAAACTGCAATGGTGACGAGAATGAGATCACCAACAATTTTCCCGTCGATCCATCGCCAGCCCCCGCCCCCGCCCCCGCCCCCGGAAGAGCAGTCAACCCACAAGCCAACAGCAACAGCAACATTAACAACTCTTCTCGAACTATAGAACCACCAAGACCTGCAGGAAATAGTGGTTCAAACCTTCTAAGTTCAGTCATTCTCACGTTGGTAATGATTCTTTTTCCATTTTTGCTTTTTGTT

Protein sequence

MASSSSLPPLSLMLCVSFFIFSFVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGTLEYYAAMLRRDHFFRARRLATAEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYLVALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSHPSQCSSPSGTCPYVVSYLSENTSSTGYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSAAPNGLFGLGIENVSVPSILANEGLTSNSFSLCFGRHGMGRIEFGDKGSPDQSETPFNLGRKHPTYNISITQINVGGNVSNLEFDAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMNSDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFFINHPVIPFNGKPIFFLMMCIHITLCIHAENFMAGYHIVFDREKMVLGWKKSNCNGDENEITNNFPVDPSPAPAPAPAPGRAVNPQANSNSNINNSSRTIEPPRPAGNSGSNLLSSVILTLVMILFPFLLFV
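
The protein sequence above is the conceptual translation of the CDS. As a rough illustration, the sketch below translates the first 39 bases of the CDS with the standard genetic code and reproduces the first 13 residues of the protein. The codon table is built by hand rather than taken from a library, and the helper name `translate` is hypothetical; it is not the pipeline used to produce this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate in frame 0; unknown codons become 'X', stop codons '*'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 13 codons of the CDS listed above.
cds_prefix = "ATGGCTTCGTCTTCTTCCCTTCCTCCGCTCTCCCTAATG"
print(translate(cds_prefix))  # MASSSSLPPLSLM
```

Passing the full CDS string to the same function should yield the full protein; that longer run is not shown here.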
Homology
BLAST of MS021005 vs. NCBI nr
Match: XP_022145453.1 (aspartyl protease family protein 1 isoform X1 [Momordica charantia])

HSP 1 Score: 995.3 bits (2572), Expect = 1.9e-286
Identity = 506/529 (95.65%), Positives = 510/529 (96.41%), Query Frame = 0

Query: 1   MASSSSLPPLSLMLCVSFFIFSFVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGT 60
           MASSSSLPPLSLMLCVSFFIFSFVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGT
Sbjct: 1   MASSSSLPPLSLMLCVSFFIFSFVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGT 60

Query: 61  LEYYAAMLRRDHFFRARRLATAEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYL 120
           LEYYAAMLRRDHFFRARRLATAEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYL
Sbjct: 61  LEYYAAMLRRDHFFRARRLATAEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYL 120

Query: 121 VALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSHPSQCS 180
           VALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSH SQCS
Sbjct: 121 VALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSHASQCS 180

Query: 181 SPSGTCPYVVSYLSENTSSTGYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSA 240
           SPSGTCPYVVSYLSENTSSTGYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSA
Sbjct: 181 SPSGTCPYVVSYLSENTSSTGYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSA 240

Query: 241 APNGLFGLGIENVSVPSILANEGLTSNSFSLCFGRHGMGRIEFGDKGSPDQSETPFNLGR 300
           APNGLFGLGIENVSVPSILANEGLTSNSFSLCFGRHGMGRIEFGDKGSPDQSETPFNLGR
Sbjct: 241 APNGLFGLGIENVSVPSILANEGLTSNSFSLCFGRHGMGRIEFGDKGSPDQSETPFNLGR 300

Query: 301 KHPTYNISITQINVGGNVSNLEFDAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMN 360
           KHPTYNISITQINVGGNVSNLEFDAIFDSGTSFTYLNDPAYSLISDKFDS VEEKRYTMN
Sbjct: 301 KHPTYNISITQINVGGNVSNLEFDAIFDSGTSFTYLNDPAYSLISDKFDSMVEEKRYTMN 360

Query: 361 SDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFFINHPVIPFNGK--PIFFLMMCIHITL 420
           SDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFFINHPVIPFNGK   IF + +     +
Sbjct: 361 SDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFFINHPVIPFNGKVTSIFCIAIFRSPDI 420

Query: 421 CIHAENFMAGYHIVFDREKMVLGWKKSNCNGDENEITNNFPVDPSPAPAPAPAPGRAVNP 480
            I  +NFMAGYHIVFDREKMVLGWKKSNCNGDENEITNNFPVDPS  PAPAPAPGRAVNP
Sbjct: 421 NIIGQNFMAGYHIVFDREKMVLGWKKSNCNGDENEITNNFPVDPS--PAPAPAPGRAVNP 480

Query: 481 QA--NSNSNINNSSRTIEPPRPAGNSGSNLLSSVILTLVMILFPFLLFV 526
           QA  NSNSNINNSSRTIEPPRPAGNSGSNLLSSVILTLVMILFPFLLFV
Sbjct: 481 QANSNSNSNINNSSRTIEPPRPAGNSGSNLLSSVILTLVMILFPFLLFV 527
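
Each HSP summary reports its statistics as a count over the alignment length followed by a percentage. The sketch below shows one way to parse such a line and recompute the percentages from the raw counts, using the values of the first hit as input. The regular expression and the names `parse_hsp_stats` and `percent` are illustrative assumptions, not part of BLAST or of this database.

```python
import re

# Matches fields of the form "Label = n/d (p%)"; "Query Frame = 0" is skipped
# because it has no n/d part.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {label: (count, alignment_length, reported_percent)}."""
    return {
        label: (int(n), int(d), float(pct))
        for label, n, d, pct in FIELD.findall(line)
    }


def percent(n, d):
    """Recompute a percentage from raw counts, rounded to two decimals."""
    return round(100.0 * n / d, 2)


line = "Identity = 506/529 (95.65%), Positives = 510/529 (96.41%), Query Frame = 0"
stats = parse_hsp_stats(line)
for label, (n, d, reported) in stats.items():
    print(label, n, d, reported, percent(n, d))
```

For this hit the recomputed values agree with the reported 95.65% and 96.41%. The denominator is the alignment length reported by BLAST, not the length of either sequence.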

BLAST of MS021005 vs. NCBI nr
Match: XP_022145454.1 (aspartyl protease family protein 1 isoform X2 [Momordica charantia])

HSP 1 Score: 932.9 bits (2410), Expect = 1.2e-267
Identity = 472/495 (95.35%), Positives = 476/495 (96.16%), Query Frame = 0

Query: 35  MFDVHHRYSDAVRRLLPVDGLPEEGTLEYYAAMLRRDHFFRARRLATAEDRPPLTFISGN 94
           MFDVHHRYSDAVRRLLPVDGLPEEGTLEYYAAMLRRDHFFRARRLATAEDRPPLTFISGN
Sbjct: 1   MFDVHHRYSDAVRRLLPVDGLPEEGTLEYYAAMLRRDHFFRARRLATAEDRPPLTFISGN 60

Query: 95  ETIRLNPLGFLHYAEVKVGTPAVSYLVALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNI 154
           ETIRLNPLGFLHYAEVKVGTPAVSYLVALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNI
Sbjct: 61  ETIRLNPLGFLHYAEVKVGTPAVSYLVALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNI 120

Query: 155 YSPNNSSTSKEVPCSSALCSHPSQCSSPSGTCPYVVSYLSENTSSTGYLVEDILHLTTND 214
           YSPNNSSTSKEVPCSSALCSH SQCSSPSGTCPYVVSYLSENTSSTGYLVEDILHLTTND
Sbjct: 121 YSPNNSSTSKEVPCSSALCSHASQCSSPSGTCPYVVSYLSENTSSTGYLVEDILHLTTND 180

Query: 215 DQSKPVNAKITLGCGKDQSGAFLNSAAPNGLFGLGIENVSVPSILANEGLTSNSFSLCFG 274
           DQSKPVNAKITLGCGKDQSGAFLNSAAPNGLFGLGIENVSVPSILANEGLTSNSFSLCFG
Sbjct: 181 DQSKPVNAKITLGCGKDQSGAFLNSAAPNGLFGLGIENVSVPSILANEGLTSNSFSLCFG 240

Query: 275 RHGMGRIEFGDKGSPDQSETPFNLGRKHPTYNISITQINVGGNVSNLEFDAIFDSGTSFT 334
           RHGMGRIEFGDKGSPDQSETPFNLGRKHPTYNISITQINVGGNVSNLEFDAIFDSGTSFT
Sbjct: 241 RHGMGRIEFGDKGSPDQSETPFNLGRKHPTYNISITQINVGGNVSNLEFDAIFDSGTSFT 300

Query: 335 YLNDPAYSLISDKFDSTVEEKRYTMNSDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFF 394
           YLNDPAYSLISDKFDS VEEKRYTMNSDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFF
Sbjct: 301 YLNDPAYSLISDKFDSMVEEKRYTMNSDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFF 360

Query: 395 INHPVIPFNGK--PIFFLMMCIHITLCIHAENFMAGYHIVFDREKMVLGWKKSNCNGDEN 454
           INHPVIPFNGK   IF + +     + I  +NFMAGYHIVFDREKMVLGWKKSNCNGDEN
Sbjct: 361 INHPVIPFNGKVTSIFCIAIFRSPDINIIGQNFMAGYHIVFDREKMVLGWKKSNCNGDEN 420

Query: 455 EITNNFPVDPSPAPAPAPAPGRAVNPQA--NSNSNINNSSRTIEPPRPAGNSGSNLLSSV 514
           EITNNFPVDPS  PAPAPAPGRAVNPQA  NSNSNINNSSRTIEPPRPAGNSGSNLLSSV
Sbjct: 421 EITNNFPVDPS--PAPAPAPGRAVNPQANSNSNSNINNSSRTIEPPRPAGNSGSNLLSSV 480

Query: 515 ILTLVMILFPFLLFV 526
           ILTLVMILFPFLLFV
Sbjct: 481 ILTLVMILFPFLLFV 493

BLAST of MS021005 vs. NCBI nr
Match: XP_038906112.1 (aspartyl protease family protein 1 [Benincasa hispida])

HSP 1 Score: 748.4 bits (1931), Expect = 4.1e-212
Identity = 384/527 (72.87%), Positives = 438/527 (83.11%), Query Frame = 0

Query: 1   MASSSSLPPLSLMLCVSFFIFSFVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGT 60
           MAS S+     L LC  F IF+F+SH S+A GSF F++HH YSDAVR++LP+D LP+EGT
Sbjct: 1   MASPST---FFLTLCFFFSIFTFISHSSHALGSFTFNIHHLYSDAVRQILPLDALPQEGT 60

Query: 61  LEYYAAMLRRDHFFRARRLATAEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYL 120
           L+YYAAM+R DHF  +RRL   +D+PPLTF SGN+T+R+NPLGFL+YAEV VGTP  SYL
Sbjct: 61  LDYYAAMVRTDHFVHSRRL--VQDQPPLTFFSGNQTLRINPLGFLYYAEVTVGTPEASYL 120

Query: 121 VALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSHPSQCS 180
           VALDTGSDLFWLPCDCVNC+TG N+S G + FNIYSP+NSSTSKEV CSS+LC+H  QCS
Sbjct: 121 VALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPSNSSTSKEVQCSSSLCAHADQCS 180

Query: 181 SPSGTCPYVVSYLSENTSSTGYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSA 240
           S S TCPY VSYLS+NTSSTGYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFL+SA
Sbjct: 181 SRSDTCPYEVSYLSDNTSSTGYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLSSA 240

Query: 241 APNGLFGLGIENVSVPSILANEGLTSNSFSLCFGRHGMGRIEFGDKGSPDQSETPFNLGR 300
           APNGLFGLGIENVSVPSILAN GL SNSFSLCFG   MGRIEFGDKGSP QSETPFNLGR
Sbjct: 241 APNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGR 300

Query: 301 KHPTYNISITQINVGGNVSNLEFDAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMN 360
           KHPTYNISITQI VGG+VSN++  AIFDSGTSFTYLNDPAYS+ +DKFDS +EEKRYTM+
Sbjct: 301 KHPTYNISITQIGVGGHVSNVDVAAIFDSGTSFTYLNDPAYSVFADKFDSMIEEKRYTMS 360

Query: 361 SDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFFINHPVIPFNGK--PIFFLMMCIHITL 420
           S  PFENCYE+SPNQ  FTYPVMNLTMKGG  F INHPV+  + +   +F L +    ++
Sbjct: 361 SGLPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPVVLISTRLTTLFCLAIARSDSI 420

Query: 421 CIHAENFMAGYHIVFDREKMVLGWKKSNCNGDENEITNNFPVDPSPAPAPAPAPGRAVNP 480
            I  +NFM GYHIVFDREKMVLGWK+SNC G E+E TNN PV P+PAPA AP     +NP
Sbjct: 421 NIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDENTNNLPVGPTPAPAAAPR-STIINP 480

Query: 481 QANSNSNINNSSRTIEPPRPAGNSGSNLLSSVILTLVMILFPFLLFV 526
           QA  NSNINN+S++IE P+P  NS SNLL+SVILT +M + PFLLFV
Sbjct: 481 QA--NSNINNTSQSIEKPKPT-NSSSNLLTSVILTFLMSVGPFLLFV 518

BLAST of MS021005 vs. NCBI nr
Match: TYK11398.1 (aspartyl protease family protein 1-like [Cucumis melo var. makuwa])

HSP 1 Score: 745.7 bits (1924), Expect = 2.7e-211
Identity = 379/527 (71.92%), Positives = 430/527 (81.59%), Query Frame = 0

Query: 1   MASSSSLPPLSLMLCVSFFIFSFVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGT 60
           M SSSS    SL LC    IF+F+SHFS+  GSF F++HH YS AVR++LP    P+EGT
Sbjct: 1   MPSSSS--TFSLTLCFFLSIFTFISHFSHVFGSFTFNIHHLYSPAVRQILPFHSFPDEGT 60

Query: 61  LEYYAAMLRRDHFFRARRLATAEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYL 120
           L+YYAAM+R DHF  +RRL   +D PPLTF+SGNET+R++PLGFL+YAEV VGTP V YL
Sbjct: 61  LDYYAAMVRTDHFVHSRRLGQVQDHPPLTFLSGNETLRISPLGFLYYAEVTVGTPGVPYL 120

Query: 121 VALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSHPSQCS 180
           VALDTGSDLFWLPCDCVNC+TGLN++ G + FNIYSPNNSSTSKEV CSS+LCSHP QCS
Sbjct: 121 VALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSKEVQCSSSLCSHPDQCS 180

Query: 181 SPSGTCPYVVSYLSENTSSTGYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSA 240
            PS TCPY VSYLS+NTSSTGYLVEDILHLTTND QSKPVNA ITLGCGKDQSGAFL+SA
Sbjct: 181 LPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNATITLGCGKDQSGAFLSSA 240

Query: 241 APNGLFGLGIENVSVPSILANEGLTSNSFSLCFGRHGMGRIEFGDKGSPDQSETPFNLGR 300
           APNGLFGLGIENVSVPSILAN GL SNSFSLCFG   MGRIEFGDKGSPDQ+ETPFNLGR
Sbjct: 241 APNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPDQNETPFNLGR 300

Query: 301 KHPTYNISITQINVGGNVSNLEFDAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMN 360
           +HPTYN+SITQI VGG++SNL+   IFDSGTSFTYLNDPAYSL +DKFDS VEEKRYTMN
Sbjct: 301 RHPTYNVSITQIAVGGHISNLDVAVIFDSGTSFTYLNDPAYSLFADKFDSMVEEKRYTMN 360

Query: 361 SDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFFINHPVIPFN--GKPIFFLMMCIHITL 420
           SD PFENCYE+SP+Q  FTYPVMNLTMKGG  F INHP++  +   K +F L +    ++
Sbjct: 361 SDIPFENCYELSPDQTTFTYPVMNLTMKGGGHFVINHPIVLLSAQSKRLFCLAIARSDSI 420

Query: 421 CIHAENFMAGYHIVFDREKMVLGWKKSNCNGDENEITNNFPVDPSPAPAPAPAPGRAVNP 480
            I  +NFM GYHIVFDREKMVLGWK+SNC G E+E TNN PV PS  P PA APG  + P
Sbjct: 421 NIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDENTNNLPVGPS--PTPAAAPGTTIKP 480

Query: 481 QANSNSNINNSSRTIEPPRPAGNSGSNLLSSVILTLVMILFPFLLFV 526
           QA  NSN+NN+++TIE PRP  N  S L +SVILT +M +  FLLFV
Sbjct: 481 QA--NSNVNNTTQTIEKPRPT-NISSKLPTSVILTFLMPVVTFLLFV 520

BLAST of MS021005 vs. NCBI nr
Match: KAA0052941.1 (aspartyl protease family protein 1-like [Cucumis melo var. makuwa])

HSP 1 Score: 743.0 bits (1917), Expect = 1.7e-210
Identity = 378/527 (71.73%), Positives = 429/527 (81.40%), Query Frame = 0

Query: 1   MASSSSLPPLSLMLCVSFFIFSFVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGT 60
           M SSSS    SL LC    IF+F+SHFS+  GSF F++HH YS AVR++LP    P+EGT
Sbjct: 1   MPSSSS--TFSLTLCFFLSIFTFISHFSHVFGSFTFNIHHLYSPAVRQILPFHSFPDEGT 60

Query: 61  LEYYAAMLRRDHFFRARRLATAEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYL 120
           L+YYAAM+R DHF  +RRL   +D PPLTF+SGNET+R++PLGFL+YAEV VGTP V YL
Sbjct: 61  LDYYAAMVRTDHFVHSRRLGQVQDHPPLTFLSGNETLRISPLGFLYYAEVTVGTPGVPYL 120

Query: 121 VALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSHPSQCS 180
           VALDTGSDLFWLPCDCVNC+TGLN++ G + FNIYSPNNSSTSKEV CSS+LCSHP QCS
Sbjct: 121 VALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSKEVQCSSSLCSHPDQCS 180

Query: 181 SPSGTCPYVVSYLSENTSSTGYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSA 240
            PS TCPY VSYLS+NTSSTGYLVEDILHLTTND QSKPVNA ITLGCGKDQSGAFL+SA
Sbjct: 181 LPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNATITLGCGKDQSGAFLSSA 240

Query: 241 APNGLFGLGIENVSVPSILANEGLTSNSFSLCFGRHGMGRIEFGDKGSPDQSETPFNLGR 300
           APNGLFGLGIENVSVPSILAN GL SNSFSLCFG   MGRIEFGDKGSPDQ+ETPFNLGR
Sbjct: 241 APNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPDQNETPFNLGR 300

Query: 301 KHPTYNISITQINVGGNVSNLEFDAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMN 360
           +HPTYN+SITQI VGG++SNL+   IFDSGTSFTYLNDPAYSL +DKFDS VEEKRYTMN
Sbjct: 301 RHPTYNVSITQIAVGGHISNLDVAVIFDSGTSFTYLNDPAYSLFADKFDSMVEEKRYTMN 360

Query: 361 SDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFFINHPVIPFN--GKPIFFLMMCIHITL 420
           SD PFENCYE+SP+Q  FTYPVMNLTMKGG  F INHP++  +   K +F L +    ++
Sbjct: 361 SDIPFENCYELSPDQTTFTYPVMNLTMKGGGHFVINHPIVLLSAQSKRLFCLAIARSDSI 420

Query: 421 CIHAENFMAGYHIVFDREKMVLGWKKSNCNGDENEITNNFPVDPSPAPAPAPAPGRAVNP 480
            I  +NFM GYHIVFDREKMVLGWK+SNC G E+E TNN PV PSP PA AP     + P
Sbjct: 421 NIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDENTNNLPVGPSPTPAAAPGT-TTIKP 480

Query: 481 QANSNSNINNSSRTIEPPRPAGNSGSNLLSSVILTLVMILFPFLLFV 526
           QA  NSN+NN+++TIE PRP  N  S L +SVILT +M +  FLLFV
Sbjct: 481 QA--NSNVNNTTQTIEKPRPT-NISSKLPTSVILTFLMPVVTFLLFV 521

BLAST of MS021005 vs. ExPASy Swiss-Prot
Match: Q8VYV9 (Aspartyl protease family protein 1 OS=Arabidopsis thaliana OX=3702 GN=APF1 PE=2 SV=1)

HSP 1 Score: 495.0 bits (1273), Expect = 1.1e-138
Identity = 255/476 (53.57%), Positives = 329/476 (69.12%), Query Frame = 0

Query: 22  SFVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGTLEYYAAMLRRDHFFRARRLAT 81
           S+V       G F F+ HHR+SD V  +LP DGLP   + +YY  M  RD   R RRLA 
Sbjct: 21  SWVLDRCEGFGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRLIRGRRLAN 80

Query: 82  AEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYLVALDTGSDLFWLPCDCVNCVT 141
            ED+  +TF  GNET+R++ LGFLHYA V VGTP+  ++VALDTGSDLFWLPCDC NCV 
Sbjct: 81  -EDQSLVTFSDGNETVRVDALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVR 140

Query: 142 GLNSSSG-VIRFNIYSPNNSSTSKEVPCSSALCSHPSQCSSPSGTCPYVVSYLSENTSST 201
            L +  G  +  NIYSPN SSTS +VPC+S LC+   +C+SP   CPY + YLS  TSST
Sbjct: 141 ELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSST 200

Query: 202 GYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSAAPNGLFGLGIENVSVPSILA 261
           G LVED+LHL +ND  SK + A++T GCG+ Q+G F + AAPNGLFGLG+E++SVPS+LA
Sbjct: 201 GVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLA 260

Query: 262 NEGLTSNSFSLCFGRHGMGRIEFGDKGSPDQSETPFNLGRKHPTYNISITQINVGGNVSN 321
            EG+ +NSFS+CFG  G GRI FGDKGS DQ ETP N+ + HPTYNI++T+I+VGGN  +
Sbjct: 261 KEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVGGNTGD 320

Query: 322 LEFDAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRY-TMNSDFPFENCYEMSPNQNNFT 381
           LEFDA+FDSGTSFTYL D AY+LIS+ F+S   +KRY T +S+ PFE CY +SPN+++F 
Sbjct: 321 LEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQ 380

Query: 382 YPVMNLTMKGGKQFFINHP--VIPFNGKPIFFLMMCIHITLCIHAENFMAGYHIVFDREK 441
           YP +NLTMKGG  + + HP  VIP     ++ L +     + I  +NFM GY +VFDREK
Sbjct: 381 YPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAIMKIEDISIIGQNFMTGYRVVFDREK 440

Query: 442 MVLGWKKSNCNGDENEITNNFPVDPSPAPAPAPAPGRAVNPQA-NSNSNINNSSRT 493
           ++LGWK+S+C   E       P + S + A  PA   + +P+A N  S   N+S T
Sbjct: 441 LILGWKESDCYTGETS-ARTLPSNRSSSSARPPA--SSFDPEATNIPSQRPNTSTT 492

BLAST of MS021005 vs. ExPASy Swiss-Prot
Match: Q9LX20 (Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=At5g10080 PE=2 SV=1)

HSP 1 Score: 278.1 bits (710), Expect = 2.1e-73
Identity = 183/494 (37.04%), Positives = 248/494 (50.20%), Query Frame = 0

Query: 40  HRYSDAVRRLLPV----DGLPEEGTLEYYAAMLRRDHFFRARRLATAEDRPPLTFISGNE 99
           HR+SD  R  +      D LP + +LEYY  +   D  FR +R+        L    G++
Sbjct: 31  HRFSDEGRASIKTPSSSDSLPNKQSLEYYRLLAESD--FRRQRMNLGAKVQSLVPSEGSK 90

Query: 100 TIRL-NPLGFLHYAEVKVGTPAVSYLVALDTGSDLFWLPCDCVNC---VTGLNSSSGVIR 159
           TI   N  G+LHY  + +GTP+VS+LVALDTGS+L W+PC+CV C    +   SS     
Sbjct: 91  TISSGNDFGWLHYTWIDIGTPSVSFLVALDTGSNLLWIPCNCVQCAPLTSTYYSSLATKD 150

Query: 160 FNIYSPNNSSTSKEVPCSSALCSHPSQCSSPSGTCPYVVSYLSENTSSTGYLVEDILHLT 219
            N Y+P++SSTSK   CS  LC   S C SP   CPY V+YLS NTSS+G LVEDILHLT
Sbjct: 151 LNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLT 210

Query: 220 TNDDQ-----SKPVNAKITLGCGKDQSGAFLNSAAPNGLFGLGIENVSVPSILANEGLTS 279
            N +      S  V A++ +GCGK QSG +L+  AP+GL GLG   +SVPS L+  GL  
Sbjct: 211 YNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMR 270

Query: 280 NSFSLCFGRHGMGRIEFGDKGSPDQSETPFNL--GRKHPTYNISITQINVGGN-VSNLEF 339
           NSFSLCF     GRI FGD G   Q  TPF      K+  Y + +    +G + +    F
Sbjct: 271 NSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSF 330

Query: 340 DAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMNSDFPFENCYEMSPNQNNFTYPVM 399
               DSG SFTYL +  Y  ++ + D  +            +E CYE S        P +
Sbjct: 331 TTFIDSGQSFTYLPEEIYRKVALEIDRHINATSKNFEG-VSWEYCYESSAEPK---VPAI 390

Query: 400 NLTMKGGKQFFINHPVIPFNGKPIFFLMMCIHIT------LCIHAENFMAGYHIVFDREK 459
            L       F I+ P+  F  +    +  C+ I+      +    +N+M GY +VFDRE 
Sbjct: 391 KLKFSHNNTFVIHKPLFVFQ-QSQGLVQFCLPISPSGQEGIGSIGQNYMRGYRMVFDREN 450

Query: 460 MVLGWKKSNCNGDENEITNNFPVDPSPAPAPAPAPGRAVNPQANSNSNINNSSRTIEPPR 512
           M LGW  S C  D+ E     P   SP    +P P      Q+     ++ +     P +
Sbjct: 451 MKLGWSPSKCQEDKIE-----PPQASPGSTSSPNPLPTDEQQSRGGHAVSPAIAGKTPSK 510

BLAST of MS021005 vs. ExPASy Swiss-Prot
Match: Q9S9K4 (Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2)

HSP 1 Score: 128.3 bits (321), Expect = 2.6e-28
Identity = 125/455 (27.47%), Positives = 200/455 (43.96%), Query Frame = 0

Query: 14  LCVSFFIFSFVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGTLEYYAAMLRRDHF 73
           LC+   +F  V  F++A  +F+F   H+++             ++  LE++ +   R H 
Sbjct: 7   LCIVVAVFVIVIEFASA--NFVFKAQHKFAG------------KKKNLEHFKSHDTRRH- 66

Query: 74  FRARRLATAEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYLVALDTGSDLFWLP 133
               R+  + D P    + G+   R++ +G L++ ++K+G+P   Y V +DTGSD+ W+ 
Sbjct: 67  ---SRMLASIDLP----LGGDS--RVDSVG-LYFTKIKLGSPPKEYHVQVDTGSDILWIN 126

Query: 134 C-DCVNCVTGLNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSHPSQCSS--PSGTCPYVV 193
           C  C  C T  N +    R +++  N SSTSK+V C    CS  SQ  S  P+  C Y +
Sbjct: 127 CKPCPKCPTKTNLN---FRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHI 186

Query: 194 SYLSENTSSTGYLVEDILHL--TTNDDQSKPVNAKITLGCGKDQSGAFLN-SAAPNGLFG 253
            Y  E+TS  G  + D+L L   T D ++ P+  ++  GCG DQSG   N  +A +G+ G
Sbjct: 187 VYADESTSD-GKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMG 246

Query: 254 LGIENVSVPSILANEGLTSNSFSLCFGR-HGMGRIEFGDKGSPDQSETPFNLGRKHPTYN 313
            G  N SV S LA  G     FS C     G G    G   SP    TP    + H  YN
Sbjct: 247 FGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMH--YN 306

Query: 314 ISITQINVGGNVSNL------EFDAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMN 373
           + +  ++V G   +L          I DSGT+  Y     Y  + +   +    K + + 
Sbjct: 307 VMLMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFPKVLYDSLIETILARQPVKLHIVE 366

Query: 374 SDFPFENCYEMSPNQNNFTYPV-------MNLTMKGGKQFFINHPVIPFNGKPIFFLMMC 433
             F    C+  S N +    PV       + LT+      F     +   G     L   
Sbjct: 367 ETF---QCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTD 426

Query: 434 IHITLCIHAENFMAGYHIVFDREKMVLGWKKSNCN 449
               + +  +  ++   +V+D +  V+GW   NC+
Sbjct: 427 ERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCS 427

BLAST of MS021005 vs. ExPASy Swiss-Prot
Match: Q4V3D2 (Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1)

HSP 1 Score: 124.0 bits (310), Expect = 5.0e-27
Identity = 116/449 (25.84%), Positives = 198/449 (44.10%), Query Frame = 0

Query: 23  FVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGTLEYYAAMLRRDHFFRARRLATA 82
           FV       G+F+F+V H+++   ++L               + +   D F  AR LA  
Sbjct: 18  FVLVIQVVSGNFVFNVTHKFAGKEKQL---------------SELKSHDSFRHARMLANI 77

Query: 83  EDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYLVALDTGSDLFWLPC-DCVNCVT 142
            D P    + G+   R + +G L++ ++K+G+P   Y V +DTGSD+ W+ C  C  C  
Sbjct: 78  -DLP----LGGDS--RADSIG-LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPV 137

Query: 143 GLNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSH--PSQCSSPSGTCPYVVSYLSENTSS 202
             +     I  ++Y    SSTSK V C    CS    S+       C Y V Y   +TS 
Sbjct: 138 KTDLG---IPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSD 197

Query: 203 TGYLVEDI-LHLTTNDDQSKPVNAKITLGCGKDQSGAF-LNSAAPNGLFGLGIENVSVPS 262
             ++ ++I L   T + ++ P+  ++  GCGK+QSG      +A +G+ G G  N S+ S
Sbjct: 198 GDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIIS 257

Query: 263 ILANEGLTSNSFSLCF-GRHGMGRIEFGDKGSPDQSETPFNLGRKHPTYNISITQINVGG 322
            LA  G T   FS C    +G G    G+  SP    TP    + H  YN+ +  ++V G
Sbjct: 258 QLAAGGSTKRIFSHCLDNMNGGGIFAVGEVESPVVKTTPIVPNQVH--YNVILKGMDVDG 317

Query: 323 N---------VSNLEFDAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMNSDFPFEN 382
           +          +N +   I DSGT+  YL    Y+ + +K  +  + K + +   F    
Sbjct: 318 DPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETFA--- 377

Query: 383 CYEMSPNQNNFTYPVMNLTMKGGKQFFINHPVIPFNGKPIFF--------LMMCIHITLC 442
           C+  + N +   +PV+NL  +   +  +      F+ +   +        +       + 
Sbjct: 378 CFSFTSNTDK-AFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVI 434

Query: 443 IHAENFMAGYHIVFDREKMVLGWKKSNCN 449
           +  +  ++   +V+D E  V+GW   NC+
Sbjct: 438 LLGDLVLSNKLVVYDLENEVIGWADHNCS 434

BLAST of MS021005 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 95.9 bits (237), Expect = 1.4e-18
Identity = 100/363 (27.55%), Positives = 152/363 (41.87%), Query Frame = 0

Query: 106 HYAEVKVGTPAVSYLVALDTGSDLFWLPCD-CVNCVTGLNSSSGVIRFNIYSPNNSSTSK 165
           +++ + VGTPA    + LDTGSD+ W+ C+ C +C    +         +++P +SST K
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDP--------VFNPTSSSTYK 221

Query: 166 EVPCSSALCSHPSQCSSPSGTCPYVVSYLSENTSSTGYLVEDILHLTTNDDQSKPVNAKI 225
            + CS+  CS     +  S  C Y VSY  + + + G L  D    T     S  +N  +
Sbjct: 222 SLTCSAPQCSLLETSACRSNKCLYQVSY-GDGSFTVGELATD----TVTFGNSGKIN-NV 281

Query: 226 TLGCGKDQSGAFLNSAAPNGLFGLGIENVSVPSILANEGLTSNSFSLCFGRHGMGRIEFG 285
            LGCG D  G F  +A   GL GLG   +S+        + + SFS C      G+    
Sbjct: 282 ALGCGHDNEGLFTGAA---GLLGLGGGVLSI-----TNQMKATSFSYCLVDRDSGKSSSL 341

Query: 286 DKGSPD----QSETPFNLGRKHPT-YNISITQINVGGNVSNLEFDAIF------------ 345
           D  S       +  P    +K  T Y + ++  +VGG    L  DAIF            
Sbjct: 342 DFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLP-DAIFDVDASGSGGVIL 401

Query: 346 DSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMNSDFPFENCYEMSPNQNNFTYPVMNLTM 405
           D GT+ T L   AY+ + D F       +   +S   F+ CY+ S + +    P +    
Sbjct: 402 DCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFS-SLSTVKVPTVAFHF 461

Query: 406 KGGKQFFI--NHPVIPFNGKPIF-FLMMCIHITLCIHAENFMAGYHIVFDREKMVLGWKK 448
            GGK   +   + +IP +    F F       +L I       G  I +D  K V+G   
Sbjct: 462 TGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSG 500

BLAST of MS021005 vs. ExPASy TrEMBL
Match: A0A6J1CVY9 (aspartyl protease family protein 1 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111014898 PE=3 SV=1)

HSP 1 Score: 995.3 bits (2572), Expect = 9.3e-287
Identity = 506/529 (95.65%), Positives = 510/529 (96.41%), Query Frame = 0

Query: 1   MASSSSLPPLSLMLCVSFFIFSFVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGT 60
           MASSSSLPPLSLMLCVSFFIFSFVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGT
Sbjct: 1   MASSSSLPPLSLMLCVSFFIFSFVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGT 60

Query: 61  LEYYAAMLRRDHFFRARRLATAEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYL 120
           LEYYAAMLRRDHFFRARRLATAEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYL
Sbjct: 61  LEYYAAMLRRDHFFRARRLATAEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYL 120

Query: 121 VALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSHPSQCS 180
           VALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSH SQCS
Sbjct: 121 VALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSHASQCS 180

Query: 181 SPSGTCPYVVSYLSENTSSTGYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSA 240
           SPSGTCPYVVSYLSENTSSTGYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSA
Sbjct: 181 SPSGTCPYVVSYLSENTSSTGYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSA 240

Query: 241 APNGLFGLGIENVSVPSILANEGLTSNSFSLCFGRHGMGRIEFGDKGSPDQSETPFNLGR 300
           APNGLFGLGIENVSVPSILANEGLTSNSFSLCFGRHGMGRIEFGDKGSPDQSETPFNLGR
Sbjct: 241 APNGLFGLGIENVSVPSILANEGLTSNSFSLCFGRHGMGRIEFGDKGSPDQSETPFNLGR 300

Query: 301 KHPTYNISITQINVGGNVSNLEFDAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMN 360
           KHPTYNISITQINVGGNVSNLEFDAIFDSGTSFTYLNDPAYSLISDKFDS VEEKRYTMN
Sbjct: 301 KHPTYNISITQINVGGNVSNLEFDAIFDSGTSFTYLNDPAYSLISDKFDSMVEEKRYTMN 360

Query: 361 SDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFFINHPVIPFNGK--PIFFLMMCIHITL 420
           SDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFFINHPVIPFNGK   IF + +     +
Sbjct: 361 SDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFFINHPVIPFNGKVTSIFCIAIFRSPDI 420

Query: 421 CIHAENFMAGYHIVFDREKMVLGWKKSNCNGDENEITNNFPVDPSPAPAPAPAPGRAVNP 480
            I  +NFMAGYHIVFDREKMVLGWKKSNCNGDENEITNNFPVDPS  PAPAPAPGRAVNP
Sbjct: 421 NIIGQNFMAGYHIVFDREKMVLGWKKSNCNGDENEITNNFPVDPS--PAPAPAPGRAVNP 480

Query: 481 QA--NSNSNINNSSRTIEPPRPAGNSGSNLLSSVILTLVMILFPFLLFV 526
           QA  NSNSNINNSSRTIEPPRPAGNSGSNLLSSVILTLVMILFPFLLFV
Sbjct: 481 QANSNSNSNINNSSRTIEPPRPAGNSGSNLLSSVILTLVMILFPFLLFV 527

BLAST of MS021005 vs. ExPASy TrEMBL
Match: A0A6J1CVB0 (aspartyl protease family protein 1 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111014898 PE=3 SV=1)

HSP 1 Score: 932.9 bits (2410), Expect = 5.7e-268
Identity = 472/495 (95.35%), Positives = 476/495 (96.16%), Query Frame = 0

Query: 35  MFDVHHRYSDAVRRLLPVDGLPEEGTLEYYAAMLRRDHFFRARRLATAEDRPPLTFISGN 94
           MFDVHHRYSDAVRRLLPVDGLPEEGTLEYYAAMLRRDHFFRARRLATAEDRPPLTFISGN
Sbjct: 1   MFDVHHRYSDAVRRLLPVDGLPEEGTLEYYAAMLRRDHFFRARRLATAEDRPPLTFISGN 60

Query: 95  ETIRLNPLGFLHYAEVKVGTPAVSYLVALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNI 154
           ETIRLNPLGFLHYAEVKVGTPAVSYLVALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNI
Sbjct: 61  ETIRLNPLGFLHYAEVKVGTPAVSYLVALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNI 120

Query: 155 YSPNNSSTSKEVPCSSALCSHPSQCSSPSGTCPYVVSYLSENTSSTGYLVEDILHLTTND 214
           YSPNNSSTSKEVPCSSALCSH SQCSSPSGTCPYVVSYLSENTSSTGYLVEDILHLTTND
Sbjct: 121 YSPNNSSTSKEVPCSSALCSHASQCSSPSGTCPYVVSYLSENTSSTGYLVEDILHLTTND 180

Query: 215 DQSKPVNAKITLGCGKDQSGAFLNSAAPNGLFGLGIENVSVPSILANEGLTSNSFSLCFG 274
           DQSKPVNAKITLGCGKDQSGAFLNSAAPNGLFGLGIENVSVPSILANEGLTSNSFSLCFG
Sbjct: 181 DQSKPVNAKITLGCGKDQSGAFLNSAAPNGLFGLGIENVSVPSILANEGLTSNSFSLCFG 240

Query: 275 RHGMGRIEFGDKGSPDQSETPFNLGRKHPTYNISITQINVGGNVSNLEFDAIFDSGTSFT 334
           RHGMGRIEFGDKGSPDQSETPFNLGRKHPTYNISITQINVGGNVSNLEFDAIFDSGTSFT
Sbjct: 241 RHGMGRIEFGDKGSPDQSETPFNLGRKHPTYNISITQINVGGNVSNLEFDAIFDSGTSFT 300

Query: 335 YLNDPAYSLISDKFDSTVEEKRYTMNSDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFF 394
           YLNDPAYSLISDKFDS VEEKRYTMNSDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFF
Sbjct: 301 YLNDPAYSLISDKFDSMVEEKRYTMNSDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFF 360

Query: 395 INHPVIPFNGK--PIFFLMMCIHITLCIHAENFMAGYHIVFDREKMVLGWKKSNCNGDEN 454
           INHPVIPFNGK   IF + +     + I  +NFMAGYHIVFDREKMVLGWKKSNCNGDEN
Sbjct: 361 INHPVIPFNGKVTSIFCIAIFRSPDINIIGQNFMAGYHIVFDREKMVLGWKKSNCNGDEN 420

Query: 455 EITNNFPVDPSPAPAPAPAPGRAVNPQA--NSNSNINNSSRTIEPPRPAGNSGSNLLSSV 514
           EITNNFPVDPS  PAPAPAPGRAVNPQA  NSNSNINNSSRTIEPPRPAGNSGSNLLSSV
Sbjct: 421 EITNNFPVDPS--PAPAPAPGRAVNPQANSNSNSNINNSSRTIEPPRPAGNSGSNLLSSV 480

Query: 515 ILTLVMILFPFLLFV 526
           ILTLVMILFPFLLFV
Sbjct: 481 ILTLVMILFPFLLFV 493
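
The percentages in each HSP summary are the listed counts divided by the alignment length. The sketch below is not part of the database record: it parses a summary as printed on this page and recomputes the two percentages. The names `HSP_RE` and `parse_hsp` are hypothetical, and the regular expression assumes the exact two-line layout shown above.

```python
import re

# Hypothetical helper for illustration only: matches the two-line HSP summary
# as printed in this record. The "Positives" label is matched loosely so that
# the misspelled variant "Postives" is accepted as well.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>\S+)\s+"
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \([\d.]+%\), "
    r"Posi?tives = (?P<pos>\d+)/\d+ \([\d.]+%\)"
)


def parse_hsp(text):
    """Return the numeric fields of one HSP summary as a dict."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    length = int(m.group("length"))
    identical = int(m.group("ident"))
    positives = int(m.group("pos"))
    return {
        "bit_score": float(m.group("bits")),
        "raw_score": int(m.group("raw")),
        "evalue": float(m.group("evalue")),
        "identical": identical,
        "positives": positives,
        "length": length,
        # Percentages recomputed from the counts, rounded as on the page.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


summary = (
    "HSP 1 Score: 932.9 bits (2410), Expect = 5.7e-268\n"
    "Identity = 472/495 (95.35%), Positives = 476/495 (96.16%), Query Frame = 0"
)
hsp = parse_hsp(summary)
print(hsp["identity_pct"], hsp["positives_pct"])  # 95.35 96.16
```

The recomputed values agree with the figures printed for the A0A6J1CVB0 match (472/495 and 476/495).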

BLAST of MS021005 vs. ExPASy TrEMBL
Match: A0A5D3CJM4 (Aspartyl protease family protein 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold139G00510 PE=3 SV=1)

HSP 1 Score: 745.7 bits (1924), Expect = 1.3e-211
Identity = 379/527 (71.92%), Positives = 430/527 (81.59%), Query Frame = 0

Query: 1   MASSSSLPPLSLMLCVSFFIFSFVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGT 60
           M SSSS    SL LC    IF+F+SHFS+  GSF F++HH YS AVR++LP    P+EGT
Sbjct: 1   MPSSSS--TFSLTLCFFLSIFTFISHFSHVFGSFTFNIHHLYSPAVRQILPFHSFPDEGT 60

Query: 61  LEYYAAMLRRDHFFRARRLATAEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYL 120
           L+YYAAM+R DHF  +RRL   +D PPLTF+SGNET+R++PLGFL+YAEV VGTP V YL
Sbjct: 61  LDYYAAMVRTDHFVHSRRLGQVQDHPPLTFLSGNETLRISPLGFLYYAEVTVGTPGVPYL 120

Query: 121 VALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSHPSQCS 180
           VALDTGSDLFWLPCDCVNC+TGLN++ G + FNIYSPNNSSTSKEV CSS+LCSHP QCS
Sbjct: 121 VALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSKEVQCSSSLCSHPDQCS 180

Query: 181 SPSGTCPYVVSYLSENTSSTGYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSA 240
            PS TCPY VSYLS+NTSSTGYLVEDILHLTTND QSKPVNA ITLGCGKDQSGAFL+SA
Sbjct: 181 LPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNATITLGCGKDQSGAFLSSA 240

Query: 241 APNGLFGLGIENVSVPSILANEGLTSNSFSLCFGRHGMGRIEFGDKGSPDQSETPFNLGR 300
           APNGLFGLGIENVSVPSILAN GL SNSFSLCFG   MGRIEFGDKGSPDQ+ETPFNLGR
Sbjct: 241 APNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPDQNETPFNLGR 300

Query: 301 KHPTYNISITQINVGGNVSNLEFDAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMN 360
           +HPTYN+SITQI VGG++SNL+   IFDSGTSFTYLNDPAYSL +DKFDS VEEKRYTMN
Sbjct: 301 RHPTYNVSITQIAVGGHISNLDVAVIFDSGTSFTYLNDPAYSLFADKFDSMVEEKRYTMN 360

Query: 361 SDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFFINHPVIPFN--GKPIFFLMMCIHITL 420
           SD PFENCYE+SP+Q  FTYPVMNLTMKGG  F INHP++  +   K +F L +    ++
Sbjct: 361 SDIPFENCYELSPDQTTFTYPVMNLTMKGGGHFVINHPIVLLSAQSKRLFCLAIARSDSI 420

Query: 421 CIHAENFMAGYHIVFDREKMVLGWKKSNCNGDENEITNNFPVDPSPAPAPAPAPGRAVNP 480
            I  +NFM GYHIVFDREKMVLGWK+SNC G E+E TNN PV PS  P PA APG  + P
Sbjct: 421 NIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDENTNNLPVGPS--PTPAAAPGTTIKP 480

Query: 481 QANSNSNINNSSRTIEPPRPAGNSGSNLLSSVILTLVMILFPFLLFV 526
           QA  NSN+NN+++TIE PRP  N  S L +SVILT +M +  FLLFV
Sbjct: 481 QA--NSNVNNTTQTIEKPRPT-NISSKLPTSVILTFLMPVVTFLLFV 520

BLAST of MS021005 vs. ExPASy TrEMBL
Match: A0A5A7UHC9 (Aspartyl protease family protein 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold344G00490 PE=3 SV=1)

HSP 1 Score: 743.0 bits (1917), Expect = 8.3e-211
Identity = 378/527 (71.73%), Positives = 429/527 (81.40%), Query Frame = 0

Query: 1   MASSSSLPPLSLMLCVSFFIFSFVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGT 60
           M SSSS    SL LC    IF+F+SHFS+  GSF F++HH YS AVR++LP    P+EGT
Sbjct: 1   MPSSSS--TFSLTLCFFLSIFTFISHFSHVFGSFTFNIHHLYSPAVRQILPFHSFPDEGT 60

Query: 61  LEYYAAMLRRDHFFRARRLATAEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYL 120
           L+YYAAM+R DHF  +RRL   +D PPLTF+SGNET+R++PLGFL+YAEV VGTP V YL
Sbjct: 61  LDYYAAMVRTDHFVHSRRLGQVQDHPPLTFLSGNETLRISPLGFLYYAEVTVGTPGVPYL 120

Query: 121 VALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSHPSQCS 180
           VALDTGSDLFWLPCDCVNC+TGLN++ G + FNIYSPNNSSTSKEV CSS+LCSHP QCS
Sbjct: 121 VALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSKEVQCSSSLCSHPDQCS 180

Query: 181 SPSGTCPYVVSYLSENTSSTGYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSA 240
            PS TCPY VSYLS+NTSSTGYLVEDILHLTTND QSKPVNA ITLGCGKDQSGAFL+SA
Sbjct: 181 LPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNATITLGCGKDQSGAFLSSA 240

Query: 241 APNGLFGLGIENVSVPSILANEGLTSNSFSLCFGRHGMGRIEFGDKGSPDQSETPFNLGR 300
           APNGLFGLGIENVSVPSILAN GL SNSFSLCFG   MGRIEFGDKGSPDQ+ETPFNLGR
Sbjct: 241 APNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPDQNETPFNLGR 300

Query: 301 KHPTYNISITQINVGGNVSNLEFDAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMN 360
           +HPTYN+SITQI VGG++SNL+   IFDSGTSFTYLNDPAYSL +DKFDS VEEKRYTMN
Sbjct: 301 RHPTYNVSITQIAVGGHISNLDVAVIFDSGTSFTYLNDPAYSLFADKFDSMVEEKRYTMN 360

Query: 361 SDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFFINHPVIPFN--GKPIFFLMMCIHITL 420
           SD PFENCYE+SP+Q  FTYPVMNLTMKGG  F INHP++  +   K +F L +    ++
Sbjct: 361 SDIPFENCYELSPDQTTFTYPVMNLTMKGGGHFVINHPIVLLSAQSKRLFCLAIARSDSI 420

Query: 421 CIHAENFMAGYHIVFDREKMVLGWKKSNCNGDENEITNNFPVDPSPAPAPAPAPGRAVNP 480
            I  +NFM GYHIVFDREKMVLGWK+SNC G E+E TNN PV PSP PA AP     + P
Sbjct: 421 NIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDENTNNLPVGPSPTPAAAPGT-TTIKP 480

Query: 481 QANSNSNINNSSRTIEPPRPAGNSGSNLLSSVILTLVMILFPFLLFV 526
           QA  NSN+NN+++TIE PRP  N  S L +SVILT +M +  FLLFV
Sbjct: 481 QA--NSNVNNTTQTIEKPRPT-NISSKLPTSVILTFLMPVVTFLLFV 521

BLAST of MS021005 vs. ExPASy TrEMBL
Match: A0A1S3BKR5 (aspartyl protease family protein 1-like OS=Cucumis melo OX=3656 GN=LOC103490670 PE=3 SV=1)

HSP 1 Score: 737.3 bits (1902), Expect = 4.6e-209
Identity = 378/531 (71.19%), Positives = 429/531 (80.79%), Query Frame = 0

Query: 1   MASSSSLPPLSLMLCVSFFIFSFVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGT 60
           M SSSS    SL LC    IF+F+SHFS+  GSF F++HH YS AVR++LP    P+EGT
Sbjct: 1   MPSSSS--TFSLTLCFFLSIFTFISHFSHVFGSFTFNIHHLYSPAVRQILPFHSFPDEGT 60

Query: 61  LEYYAAMLRRDHFFRARRLATAEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYL 120
           L+YYAAM+R DHF  +RRL   +D PPLTF+SGNET+R++PLGFL+YAEV VGTP V YL
Sbjct: 61  LDYYAAMVRTDHFVHSRRLGQVQDHPPLTFLSGNETLRISPLGFLYYAEVTVGTPGVPYL 120

Query: 121 VALDTGSDLFWLPCDCVNCVTGLNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSHPSQCS 180
           VALDTGSDLFWLPCDCVNC+TGLN++ G + FNIYSPNNSSTSKEV CSS+LCSHP QCS
Sbjct: 121 VALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSKEVQCSSSLCSHPDQCS 180

Query: 181 SPSGTCPYVVSYLSENTSSTGYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSA 240
            PS TCPY VSYLS+NTSSTGYLVEDILHLTTND QSKPVNA ITLGCGKDQSGAFL+SA
Sbjct: 181 LPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNATITLGCGKDQSGAFLSSA 240

Query: 241 APNGLFGLGIENVSVPSILANEGLTSNSFSLCFGRHGMGRIEFGDKGSPDQSETPFNLGR 300
           APNGLFGLGIENVSVPSILAN GL SNSFSLCFG   MGRIEFGDKGSPDQ+ETPFNLGR
Sbjct: 241 APNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPDQNETPFNLGR 300

Query: 301 KHPTYNISITQINVGGNVSNLEFDAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMN 360
           +HPTYN+SITQI VGG++SNL+   IFDSGTSFTYLNDPAYSL +DKFDS VEEKRYTMN
Sbjct: 301 RHPTYNVSITQIAVGGHISNLDVAVIFDSGTSFTYLNDPAYSLFADKFDSMVEEKRYTMN 360

Query: 361 SDFPFENCYEMSPNQNNFTYPVMNLTMKGGKQFFINHPVIPFN--GKPIFFLMMCIHITL 420
           SD PFENCYE+SP+Q  FTYPVMNLTMKGG  F INHP++  +   K +F L +    ++
Sbjct: 361 SDIPFENCYELSPDQTTFTYPVMNLTMKGGGHFVINHPIVLLSAQSKRLFCLAIARSDSI 420

Query: 421 CIHAENFMAGYHIVFDREKMVLGWKKSNC----NGDENEITNNFPVDPSPAPAPAPAPGR 480
            I  +NFM GYHIVFDREKMVLGWK+SNC     G E+E TNN PV PSP PA AP    
Sbjct: 421 NIIGQNFMTGYHIVFDREKMVLGWKESNCEFSGTGYEDENTNNLPVGPSPTPAAAPGT-T 480

Query: 481 AVNPQANSNSNINNSSRTIEPPRPAGNSGSNLLSSVILTLVMILFPFLLFV 526
            + PQA  NSN+NN+++TIE PRP  N  S L +SVILT +M +  FLLFV
Sbjct: 481 TIKPQA--NSNVNNTTQTIEKPRPT-NISSKLPTSVILTFLMPVVTFLLFV 525

BLAST of MS021005 vs. TAIR 10
Match: AT2G17760.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 495.0 bits (1273), Expect = 7.6e-140
Identity = 255/476 (53.57%), Positives = 329/476 (69.12%), Query Frame = 0

Query: 22  SFVSHFSNAHGSFMFDVHHRYSDAVRRLLPVDGLPEEGTLEYYAAMLRRDHFFRARRLAT 81
           S+V       G F F+ HHR+SD V  +LP DGLP   + +YY  M  RD   R RRLA 
Sbjct: 21  SWVLDRCEGFGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRLIRGRRLAN 80

Query: 82  AEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYLVALDTGSDLFWLPCDCVNCVT 141
            ED+  +TF  GNET+R++ LGFLHYA V VGTP+  ++VALDTGSDLFWLPCDC NCV 
Sbjct: 81  -EDQSLVTFSDGNETVRVDALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVR 140

Query: 142 GLNSSSG-VIRFNIYSPNNSSTSKEVPCSSALCSHPSQCSSPSGTCPYVVSYLSENTSST 201
            L +  G  +  NIYSPN SSTS +VPC+S LC+   +C+SP   CPY + YLS  TSST
Sbjct: 141 ELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSST 200

Query: 202 GYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSAAPNGLFGLGIENVSVPSILA 261
           G LVED+LHL +ND  SK + A++T GCG+ Q+G F + AAPNGLFGLG+E++SVPS+LA
Sbjct: 201 GVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLA 260

Query: 262 NEGLTSNSFSLCFGRHGMGRIEFGDKGSPDQSETPFNLGRKHPTYNISITQINVGGNVSN 321
            EG+ +NSFS+CFG  G GRI FGDKGS DQ ETP N+ + HPTYNI++T+I+VGGN  +
Sbjct: 261 KEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVGGNTGD 320

Query: 322 LEFDAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRY-TMNSDFPFENCYEMSPNQNNFT 381
           LEFDA+FDSGTSFTYL D AY+LIS+ F+S   +KRY T +S+ PFE CY +SPN+++F 
Sbjct: 321 LEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQ 380

Query: 382 YPVMNLTMKGGKQFFINHP--VIPFNGKPIFFLMMCIHITLCIHAENFMAGYHIVFDREK 441
           YP +NLTMKGG  + + HP  VIP     ++ L +     + I  +NFM GY +VFDREK
Sbjct: 381 YPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAIMKIEDISIIGQNFMTGYRVVFDREK 440

Query: 442 MVLGWKKSNCNGDENEITNNFPVDPSPAPAPAPAPGRAVNPQA-NSNSNINNSSRT 493
           ++LGWK+S+C   E       P + S + A  PA   + +P+A N  S   N+S T
Sbjct: 441 LILGWKESDCYTGETS-ARTLPSNRSSSSARPPA--SSFDPEATNIPSQRPNTSTT 492

BLAST of MS021005 vs. TAIR 10
Match: AT4G35880.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 432.6 bits (1111), Expect = 4.6e-121
Identity = 231/507 (45.56%), Positives = 327/507 (64.50%), Query Frame = 0

Query: 27  FSNAHGS-FMFDVHHRYSDAVRRLLPVDG----LPEEGTLEYYAAMLRRDHFFRARRL-- 86
           F + +G  F F++HHR+SD V++     G     P +G+ EY+ A++ RD   R RRL  
Sbjct: 21  FGSCNGRIFTFEMHHRFSDEVKQWSDSTGRFAKFPPKGSFEYFNALVLRDWLIRGRRLSE 80

Query: 87  ATAEDRPPLTFISGNETIRLNPLGFLHYAEVKVGTPAVSYLVALDTGSDLFWLPCDCVNC 146
           + +E    LTF  GN T R++ LGFLHY  VK+GTP + ++VALDTGSDLFW+PCDC  C
Sbjct: 81  SESESESSLTFSDGNSTSRISSLGFLHYTTVKLGTPGMRFMVALDTGSDLFWVPCDCGKC 140

Query: 147 V-TGLNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSHPSQCSSPSGTCPYVVSYLSENTS 206
             T   + +     +IY+P  S+T+K+V C+++LC+  +QC     TCPY+VSY+S  TS
Sbjct: 141 APTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTS 200

Query: 207 STGYLVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSAAPNGLFGLGIENVSVPSI 266
           ++G L+ED++HLTT D   + V A +T GCG+ QSG+FL+ AAPNGLFGLG+E +SVPS+
Sbjct: 201 TSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSV 260

Query: 267 LANEGLTSNSFSLCFGRHGMGRIEFGDKGSPDQSETPFNLGRKHPTYNISITQINVGGNV 326
           LA EGL ++SFS+CFG  G+GRI FGDKGS DQ ETPFNL   HP YNI++T++ VG  +
Sbjct: 261 LAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPFNLNPSHPNYNITVTRVRVGTTL 320

Query: 327 SNLEFDAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMNSDFPFENCYEMSPNQNNF 386
            + EF A+FD+GTSFTYL DP Y+ +S+ F S  ++KR++ +S  PFE CY+MS + N  
Sbjct: 321 IDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSNDANAS 380

Query: 387 TYPVMNLTMKGGKQFFINHPVIPFN--GKPIFFLMMCIHITLCIHAENFMAGYHIVFDRE 446
             P ++LTMKG   F IN P+I  +  G+ ++ L +     L I  +N+M GY +VFDRE
Sbjct: 381 LIPSLSLTMKGNSHFTINDPIIVISTEGELVYCLAIVKSSELNIIGQNYMTGYRVVFDRE 440

Query: 447 KMVLGWKKSNCNGDENEITNNFPVDPSPAPAPAPAPGRAVNPQANSNSNINNSSRTIEPP 506
           K+VL WKK +C   E   T     + + A APA A G   +   N++S ++ +++TI   
Sbjct: 441 KLVLAWKKFDCYDIEETNTTVAGTNKTAAVAPAMAAGIKTH---NNSSELHKTNQTIS-- 500

Query: 507 RPAGNSGSNLLSSVILTLVMILFPFLL 524
               NS  N +S  +       F F+L
Sbjct: 501 --KSNSSPNQISKTVDVWSFFRFVFIL 520

BLAST of MS021005 vs. TAIR 10
Match: AT3G51330.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 417.5 bits (1072), Expect = 1.5e-116
Identity = 230/521 (44.15%), Positives = 327/521 (62.76%), Query Frame = 0

Query: 30  AHGSFMFDVHHRYSDAVRRLLPVDGL-PEEGTLEYYAAMLRRDHFFRARRLATAEDRPPL 89
           A G F F+VHH +SD V++ L +D L PE+G+LEY+  + +RD   R R LA+  +  P+
Sbjct: 25  ASGKFSFEVHHMFSDRVKQSLGLDDLVPEKGSLEYFKVLAQRDRLIRGRGLASNNEETPI 84

Query: 90  TFISGNETIRLNPLGFLHYAEVKVGTPAVSYLVALDTGSDLFWLPCDC-VNCV-----TG 149
           TF+ GN TI ++ LGFLHYA V VGTPA  +LVALDTGSDLFWLPC+C   C+      G
Sbjct: 85  TFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGSTCIRDLKEVG 144

Query: 150 LNSSSGVIRFNIYSPNNSSTSKEVPCSSALCSHPSQCSSPSGTCPYVVSYLSENTSSTGY 209
           L+ S  +   N+YSPN SSTS  + CS   C   S+CSSP+ +CPY + YLS++T +TG 
Sbjct: 145 LSQSRPL---NLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKDTFTTGT 204

Query: 210 LVEDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSAAPNGLFGLGIENVSVPSILANE 269
           L ED+LHL T D+  +PV A ITLGCGK+Q+G   +SAA NGL GLG+++ SVPSILA  
Sbjct: 205 LFEDVLHLVTEDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILAKA 264

Query: 270 GLTSNSFSLCFGR--HGMGRIEFGDKGSPDQSETPFNLGRKHPTYNISITQINVGGNVSN 329
            +T+NSFS+CFG     +GRI FGDKG  DQ ETP       PTY +S+T+++VGG+   
Sbjct: 265 KITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSPTYAVSVTEVSVGGDAVG 324

Query: 330 LEFDAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMNSDFPFENCYEMSPNQNNFTY 389
           ++  A+FD+GTSFT+L +P Y LI+  FD  V +KR  ++ + PFE CY++SPN+    +
Sbjct: 325 VQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTILF 384

Query: 390 PVMNLTMKGGKQFFINHPV-IPFN----GKPIFFLMMCIHITLCIHAENFMAGYHIVFDR 449
           P + +T +GG Q F+ +P+ I +N          ++  +   + I  +NFM+GY IVFDR
Sbjct: 385 PRVAMTFEGGSQMFLRNPLFIVWNEDNSAMYCLGILKSVDFKINIIGQNFMSGYRIVFDR 444

Query: 450 EKMVLGWKKSNCNGDENEITNNFPVDPSPAPAPA---------PAPGRAVNPQANSNSNI 509
           E+M+LGWK+S+C  DE+  +   P   + AP+P+         P P  A  PQ       
Sbjct: 445 ERMILGWKRSDCFEDESLESTTPPPPETEAPSPSASTPLPSLLPPPAAATPPQ------- 504

Query: 510 NNSSRTIEPPRPAGNSGSNLLSSVI--LTLVMILFPFLLFV 526
                 I+P     NSG+   ++++   + +++L P L F+
Sbjct: 505 ------IDPRNSTRNSGTGTAANLVPLASQLLLLLPLLAFL 529

BLAST of MS021005 vs. TAIR 10
Match: AT3G51350.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 383.3 bits (983), Expect = 3.2e-106
Identity = 210/510 (41.18%), Positives = 312/510 (61.18%), Query Frame = 0

Query: 30  AHGSFMFDVHHRYSDAVRRLLPV-DGLPEEGTLEYYAAMLRRDHFFRARRLATAEDRPPL 89
           A G F F+VHH +SD+V++ L + D +PE+G+LEY+  +  RD   R R LA+  D  P+
Sbjct: 25  ATGKFGFEVHHIFSDSVKQSLGLGDLVPEQGSLEYFKVLAHRDRLIRGRGLASNNDETPI 84

Query: 90  TFISGNETIRLNPLGFLHYAEVKVGTPAVSYLVALDTGSDLFWLPCDC-VNCVTGLNSSS 149
           TF  GN T+ +  LG L+YA V VGTP  S+LVALDTGSDLFWLPC+C   C+  L    
Sbjct: 85  TFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDL-EDI 144

Query: 150 GV---IRFNIYSPNNSSTSKEVPCSSALCSHPSQCSSPSGTCPYVVSYLSENTSSTGYLV 209
           GV   +  N+Y+PN S+TS  + CS   C    +CSSPS  CPY +SY S +T + G L+
Sbjct: 145 GVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISY-SNSTGTKGTLL 204

Query: 210 EDILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSAAPNGLFGLGIENVSVPSILANEGL 269
           +D+LHL T D+   PV A +TLGCG+ Q+G F  + + NG+ GLGI+  SVPS+LA   +
Sbjct: 205 QDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANI 264

Query: 270 TSNSFSLCFGR--HGMGRIEFGDKGSPDQSETPFNLGRKHPTYNISITQINVGGNVSNLE 329
           T+NSFS+CFGR    +GRI FGD+G  DQ ETPF        Y ++I+ ++V G+  ++ 
Sbjct: 265 TANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPSTAYGVNISGVSVAGDPVDIR 324

Query: 330 FDAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMNSDFPFENCYEMSPNQNNFTYPV 389
             A FD+G+SFT+L +PAY +++  FD  VE++R  ++ + PFE CY++SPN     +P+
Sbjct: 325 LFAKFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVDPELPFEFCYDLSPNATTIQFPL 384

Query: 390 MNLTMKGGKQFFINHPVIPF---NGKPIFFL--MMCIHITLCIHAENFMAGYHIVFDREK 449
           + +T  GG +  +N+P        G  ++ L  +  + + + +  +NF+AGY IVFDRE+
Sbjct: 385 VEMTFIGGSKIILNNPFFTARTQEGNVMYCLGVLKSVGLKINVIGQNFVAGYRIVFDRER 444

Query: 450 MVLGWKKSNCNGDENEITNNFPVDPSPAPAPAPAPGRAVNPQANSNSNINNSSRTIEPPR 509
           M+LGWK+S C  DE+  +      P P    APAP  +  P  +    ++ +   I P  
Sbjct: 445 MILGWKQSLCFEDESLESTT----PPPPEVEAPAPSVSAPPPRSLPPTVSATPPPINPRN 504

Query: 510 PAGNSGSNLLSSVI--LTLVMILFPFLLFV 526
             GN G+   +++I   + +++L P L F+
Sbjct: 505 STGNPGTGGAANLIPLASQLLLLLPLLAFL 528

BLAST of MS021005 vs. TAIR 10
Match: AT3G51340.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 367.5 bits (942), Expect = 1.8e-101
Identity = 216/523 (41.30%), Positives = 305/523 (58.32%), Query Frame = 0

Query: 30  AHGSFMFDVHHRYSDAVRRLLPVDGL-PEEGTLEYYAAMLRRDHFFRARRLATAEDRPPL 89
           A G F F+VHH +SD V++ L  D L PE G+LEY+  +  RD F R R LA+  +  PL
Sbjct: 26  ASGKFSFEVHHMFSDVVKQTLGFDDLVPENGSLEYFKVLAHRDRFIRGRGLASNNEETPL 85

Query: 90  TFISGNETIRLNPLGFLHYAEVKVGTPAVSYLVALDTGSDLFWLPCDC-VNCVTGLNSS- 149
           T I  N T+ LN LGFLHYA V +GTPA  +LVALDTGSDLFWLPC+C   C+  L  + 
Sbjct: 86  TSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDAR 145

Query: 150 -SGVIRFNIYSPNNSSTSKEVPCSSALCSHPSQCSSPSGTCPYVVSYLSENTSSTGYLVE 209
            S  +  N+Y+PN S+TS  + CS   C    +CSSP   CPY ++ LS NT +TG L++
Sbjct: 146 FSESVPLNLYTPNASTTSSSIRCSDKRCFGSGKCSSPESICPYQIA-LSSNTVTTGTLLQ 205

Query: 210 DILHLTTNDDQSKPVNAKITLGCGKDQSGAFLNSAAPNGLFGLGIENVSVPSILANEGLT 269
           D+LHL T D+  KPVNA +TLGCG++Q+GAF    A NG+ GL ++  SVPS+LA   +T
Sbjct: 206 DVLHLVTEDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLSMKEYSVPSLLAKANIT 265

Query: 270 SNSFSLCFGR--HGMGRIEFGDKGSPDQSETPFNLGRKHPTYNISITQINVGGNVSNLEF 329
           +NSFS+CFGR    +GRI FGDKG  DQ ETP         Y +++T ++VGG   ++  
Sbjct: 266 ANSFSMCFGRIISVVGRISFGDKGYTDQEETPLVSLETSTAYGVNVTGVSVGGVPVDVPL 325

Query: 330 DAIFDSGTSFTYLNDPAYSLISDKFDSTVEEKRYTMNSDFPFENCYEM------------ 389
            A+FD+G+SFT L + AY + +  FD  +E+KR  ++ DFPFE CY++            
Sbjct: 326 FALFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEFCYDLREEHLNSDARPR 385

Query: 390 -------SPNQNNFTYPVMNLTMKGGKQFFINHPVIPFNGKPIFFLMMCIHITLCIHAEN 449
                  +P +++F + + N + +     + N       G  ++ L +   I L I  +N
Sbjct: 386 HMQSKCYNPCRDDFRWRIQNDSQESVS--YSN------EGTKMYCLGILKSINLNIIGQN 445

Query: 450 FMAGYHIVFDREKMVLGWKKSNCNGDENEITNNFPVDPSPAPAPAPAPGRAVNPQANSNS 509
            M+G+ IVFDRE+M+LGWK+SNC  DE+  + +    P P    AP P  +  P A S  
Sbjct: 446 LMSGHRIVFDRERMILGWKQSNCFEDESLASES----PPPPEIEAPPPSVSTPPPAAS-- 505

Query: 510 NINNSSRTIEPPRPAGNSGSNLLS--SVILTLVMILFPFLLFV 526
               +  TI+P     NSG+   +  S +   ++ L P L F+
Sbjct: 506 ---ATPPTIDPRNSTRNSGTGGAANLSPLAAQLLFLLPLLAFL 530

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
XP_022145453.1	1.9e-286	95.65	aspartyl protease family protein 1 isoform X1 [Momordica charantia]
XP_022145454.1	1.2e-267	95.35	aspartyl protease family protein 1 isoform X2 [Momordica charantia]
XP_038906112.1	4.1e-212	72.87	aspartyl protease family protein 1 [Benincasa hispida]
TYK11398.1	2.7e-211	71.92	aspartyl protease family protein 1-like [Cucumis melo var. makuwa]
KAA0052941.1	1.7e-210	71.73	aspartyl protease family protein 1-like [Cucumis melo var. makuwa]

Match Name	E-value	Identity (%)	Description
Q8VYV9	1.1e-138	53.57	Aspartyl protease family protein 1 OS=Arabidopsis thaliana OX=3702 GN=APF1 PE=2 ...
Q9LX20	2.1e-73	37.04	Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=At5g10080 ...
Q9S9K4	2.6e-28	27.47	Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2
Q4V3D2	5.0e-27	25.84	Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1
Q9LS40	1.4e-18	27.55	Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP...

Match Name	E-value	Identity (%)	Description
A0A6J1CVY9	9.3e-287	95.65	aspartyl protease family protein 1 isoform X1 OS=Momordica charantia OX=3673 GN=...
A0A6J1CVB0	5.7e-268	95.35	aspartyl protease family protein 1 isoform X2 OS=Momordica charantia OX=3673 GN=...
A0A5D3CJM4	1.3e-211	71.92	Aspartyl protease family protein 1-like OS=Cucumis melo var. makuwa OX=1194695 G...
A0A5A7UHC9	8.3e-211	71.73	Aspartyl protease family protein 1-like OS=Cucumis melo var. makuwa OX=1194695 G...
A0A1S3BKR5	4.6e-209	71.19	aspartyl protease family protein 1-like OS=Cucumis melo OX=3656 GN=LOC103490670 ...

Match Name	E-value	Identity (%)	Description
AT2G17760.1	7.6e-140	53.57	Eukaryotic aspartyl protease family protein
AT4G35880.1	4.6e-121	45.56	Eukaryotic aspartyl protease family protein
AT3G51330.1	1.5e-116	44.15	Eukaryotic aspartyl protease family protein
AT3G51350.1	3.2e-106	41.18	Eukaryotic aspartyl protease family protein
AT3G51340.1	1.8e-101	41.30	Eukaryotic aspartyl protease family protein

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001461	Aspartic peptidase A1 family	PRINTS	PR00792	PEPSIN	coord: 325..336, score: 47.29; coord: 112..132, score: 41.52; coord: 419..434, score: 31.77
IPR001461	Aspartic peptidase A1 family	PANTHER	PTHR13683	ASPARTYL PROTEASES	coord: 13..460
IPR032861	Xylanase inhibitor, N-terminal	PFAM	PF14543	TAXi_N	coord: 107..285, e-value: 1.4E-37, score: 129.6
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 96..285, e-value: 6.0E-43, score: 149.0
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 287..449, e-value: 3.8E-24, score: 87.4
IPR021109	Aspartic peptidase domain superfamily	SUPERFAMILY	50630	Acid proteases	coord: 99..447
IPR032799	Xylanase inhibitor, C-terminal	PFAM	PF14541	TAXi_C	coord: 305..397, e-value: 2.0E-9, score: 37.4
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 476..500
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 453..500
None	No IPR available	PANTHER	PTHR13683:SF826	ASPARTYL PROTEASE FAMILY PROTEIN 1	coord: 13..460
IPR001969	Aspartic peptidase, active site	PROSITE	PS00141	ASP_PROTEASE	coord: 121..132
IPR001969	Aspartic peptidase, active site	PROSITE	PS00141	ASP_PROTEASE	coord: 325..336
IPR033121	Peptidase family A1 domain	PROSITE	PS51767	PEPTIDASE_A1	coord: 106..443, score: 33.168755
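
The `coord: start..end` entries above give the position of each match on the predicted protein. The sketch below is not part of the annotation: it reads a few of those entries and reports the span lengths, assuming the coordinates are 1-based and inclusive (the page does not state this). The helper names `parse_coord`, `span_length`, and `contains` are hypothetical.

```python
import re

# Matches entries of the form "coord: 107..285" as printed in the table.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(cell):
    """Extract (start, end) from a 'coord: start..end' entry."""
    m = COORD_RE.search(cell)
    if m is None:
        raise ValueError("no coordinate found in: " + cell)
    return int(m.group(1)), int(m.group(2))


def span_length(start, end):
    """Number of residues covered, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def contains(outer, inner):
    """True if the inner span lies entirely within the outer span."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


# Values copied from the InterPro rows above.
peptidase_a1 = parse_coord("coord: 106..443")   # PROSITE PS51767
taxi_n = parse_coord("coord: 107..285")         # PFAM PF14543
taxi_c = parse_coord("coord: 305..397")         # PFAM PF14541
active_sites = [parse_coord("coord: 121..132"), parse_coord("coord: 325..336")]

print(span_length(*taxi_n), span_length(*taxi_c))            # 179 93
print(all(contains(peptidase_a1, s) for s in active_sites))  # True
```

Under that assumption, both PROSITE active-site matches and both TAXi domain matches fall inside the Peptidase family A1 domain span (106..443).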

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
MS021005.1	MS021005.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006508	proteolysis
molecular_function	GO:0004190	aspartic-type endopeptidase activity