MC09g1194 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC09g1194
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: protein ALP1-like
Location: MC09: 18025474 .. 18030792 (-)
RNA-Seq Expression: MC09g1194
Synteny: MC09g1194
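
The Location field gives a chromosome, a start and end coordinate, and a strand. A minimal sketch of reading that value and computing the span it covers is shown here. It assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

# Hypothetical helpers for illustration only.
# Assumption: coordinates are 1-based and inclusive (GFF-style).
LOCATION_RE = re.compile(r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split 'MC09: 18025474 .. 18030792 (-)' into (chrom, start, end, strand)."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Number of bases covered by an inclusive [start, end] interval."""
    return end - start + 1


chrom, start, end, strand = parse_location("MC09: 18025474 .. 18030792 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the gene spans 5319 bases on the minus strand of MC09.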
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CACCACCACACACAAGAAGTAAAGAAAGGGAAGATTGGAAGTTGCCAATATCCCCCTCTCTCGTTTTTAATTCTTTCTTTTCAATTTCTCTTTTTTTTTACCTTTCCTTCGGCTACATTGTTGATTGTTGTCCTTCCCCCAACATGTAATTGGCCCCGCGTCAATTTTCTTTTTTACAACTTCTTTGCCTTACTCCAAACTTCTTCTCTCCACCATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCACAGTCATATTTCATATTCTTCCATTTATTATTGCATGCTTTGAAATCCAAAAACCAGAGGAACTTCACTCACCCATCAATACCAAACATATCCTTCTAACCCAAAAACCTCAGAGGCACTACAATCTTCCCCTTTGTGGGTCTGCCAAAAATGCTTGTTGTGTCCTTCGTTTGCTACCCTTTTGTGTTATTATTCCATTTGCTTCGAATCTGAGAGTTTTCTTCATGTGAATTCTGTACTATCCTTCGATTCTGGTGGTGGGTTGTAGTGGGTGTTTCAGCTTTTGGCTCATCTGTGCTTCTTTTATTATTTTTGGGAGTTTCGAATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGAAGGCAGAGAAGAAGGTTGACCAAAATGTCTTGGCTGCTGCTTCACTGTCGTCTCAGCCCCAGCCCTTGGATTGGTGGGACGACTTCTCCCAGAGGATTACTGGTAAACTATTTTGCTTCTCTTGGAACTATGCTTCTCTTTTGCCACTTGCTAATTAATTTGCTCTCTCTGTTGCTTTTTTATTGAATTTGTGCATTGCCATTGCTATTAGTGGCTTGCTGCTTATTGGGTCTCTCGATTAAAATTTTCTGCGTTTGGTGAGGTTTTAGGACGGCGGTAGAATGTAGTGTTTAGTAATCATATTGAGTTCTGCAATTAAAGAACCGTGGTATTCTTCCCCATAATCTGTTAAATCATGAACTCTTTATTTGACATCTTAATAACATTGAGAGGGTCACAAGGGACACACTGTTGCAGTGATCCGGTTAGCATTACTGCTTTTAGAATTCTATGGAGATAGATCAAATCTGATTCAGCAGCAGTTACTAACTTAGATATGTTAGTTCACTTGATAATTGAGGTGTACAAGTAGTGGCCACTCACTTAACTGGTTTCTATAGTTTAGACTGTGATGTCTTGTGAATGATCTTCTTTTTCTCAATGAATTGTTGAAAATTGAGGTCGACTTGCTGTAGTAAGTTATTGTTTCTTAGGAGTTGCATCTTTCAAATGTCGGTTAGGACGTGATGAGATTTTACTTGGTGATTTCTCTAATTGAATGGATCATTCTATTATATCTGCATTGAGATGGATCTTTATATAGAATGATGATAATGCTGCTTGAAGAATGGGTTGTCACTGACATTGTAACTGATTCAGATTGATTATTGTATTAGAATGTTGATTAATTATTATCTCCTCCTCTCCTTAGTTTTAGCAACTGGCCAGATTATCTGGTGTAGGGTATTAGTTTAATCTATGGTTATATTCAACAAAGTTTTCAAAATGGGTTAAATAGGAATTATAACCACAATGTTGATTTCTTAAATATTGGTATATAAAAATATCCATATGGCACCCTTCATTTTTATGGATCGAATTCAAATAGAATTCTTTCACAACTCCAACATCCGTCTTAGAAGGCCTTTCTTCTTCTGCGAGGTGGGCATGAGTAGGCCTTCCTTCTTATCACATGAATTTGATCCCCAATTGTCCTCTTGATTTCCTTTGCTAATGACTGAAATGATATTTTGCTTAATGAAGTAATTAACACTAATAGGAAATAAATATATTTAAGTATTTTTGTGTTGAACTTCACTCCAATTGCCTGATCTTCACCATCATCCTTTTGAAGTGGAAATAAATATATTTAAGTATTTTTGTGTTGAACTTCACTCCAGTTGCCTGATCTTCACCATCATCCTTTTGAAGTTCCTCTATAATATTTTGTCTC
AGCATGGCTGGCAATTATTTTCTTTAGAGAACTGATCAGTATGATCCGTAACCCGGGTCTTCATTATTGTTTTGAATTAAATATATCACACTGAATATAGTACTTTAAAGTTAGAATTTGTCATCCAGATATTTGGAGGAGCGAAGTAGTTAATAAATAGAAGGATGACTCTGAAAGTAATAATGAACACAATGCAGAGAAGAAGGGCAGGGGAAGAACGATATTTGTCTTCAAGCTTCTGTTTTCTATGGTCTGCCATTTTATATGTATAGAAAACAAGAGAATTAGCTTAATATATTGTTCATGTCAATATAGTATGTTAGAACTTAGAAGGCATGCTAAACCATGAGATTGTGAAAACCAAATTTCTTGGTTTTTATTCTCTGTTCTATTATTAAAATAATCAAAACTATTTCCATGAAATCCAATTCATTATATGTTGGTCTATACTATTTTGTAAGCTGATTTTTCCTAGAGATCTTGTTATGACTTAAGAATCTCAACCCCTTTATGATCTTAATTAATTAATTTAAGCAACGAAACATCCTTATGTGGCTGCTTTTGAGGCATCTTCAAAATGTACATAGAAAAACTGACTTGGTTTCTAGTATTCCAAAATGGATACTTATTTTTCACCGTCAATTTTCAGGAGTTAGAATGATAAGAAATTTGGAATTCAAGGACAGTCCTATTTTTATTTTGTATACCTGATTATCACAGTGCAAATAAATGGTTAATGCTCATAAAAAATTATGGTTTCATCATGCTTGGAAAAACAATTATAGTTGGCATTTATGTTGGTTTCTAAGTTTACCCCCAGAAATTTATTAGGACAGCAGTTGAAGTTTTTCTATTGTGAAAATTTCAAGAAAAAAAAAGTACGAAACTAGATTTCTTGATTTTGTTCTCATAATGATCCTTTCAAAATCTCTGATTAAAACTGATCACTTCAAGGGATATTTTCCCAATTAGCAGGAAGTTCATTAGCTTTATCATGATTTACCTCCATATTTCCATTTCTAATGTCCTTCATTCTTTCCTTTTATCTGGTTATTTATCCCTTTTTGCCCAATGCCTTTAGTATATAAGCATTCTTTCAAATTGGACGTGTGCTTCATCGGTGTTGCGATTGCTCTCTATTGTCAAAAACATGTTTCAGGTACCCATTTTAGCAAAAATTCTTATGCCACCTATTCGAACAGGACTATGTTCTTGTATGCATCAACTTTTGTATTTCAATAAAGAACCTTAGATGAGCTGATAACCAAACAACAATGGCTCAAATTTTAGTCACAAATTCCCAGAAATACTAAGAAGTTGGCAATTGAAACATCAAACGCAATATCCAGATTTTGATTTATATTTGATTTGAAGCTATAAAATAACATGATCAAAGTGGGAAGATTTCGTAGACGTTCGTTATATATCCAGCTGTATCTGGAGTTCAGTTTTATTTTCATATTTCATGTTTCTGAAATGTTGGTTACAGTTTAAGATGATTTGGACTTCATAAACTGAACGTCCACCATAAATTAGTTTATATGAAGACTTTTAACTTTTAATTGTCAGAGTCTTCTGAAGTATCAGTCTGTGTAGTAAAATAAAATTTACTCATGTTAAGGAAGTGCTCGGCCTGTTGGCTCATTTAGCAGTTACATTGTTCTAGCCACGAGTTGTGATATAACAAGTAAATCTTTACTTTTTAGTATAAAATCGAAGGGTTTGACGTAAATGTGTGGTCAAACAACTTCGTCGCATTTTATTTTGTTGCGTCGAGTATTTCTACAATGTTTCTTCTGTCATTATGCTCACAAAATTTCACTTTATCTTTGTTTGATGATCAATGAGCTGGACATTTTGGTCTGATATTAATTAGCCTCACAGATAATGTAGGTTGTTATATGTTTACTGTGAATATCTGACTCGGAGCATCTGATTATGATTTTTACCTCCAATCAGGACCATTATCCCAGTCAAAGAATCCAACAAAATTTGAGTCA
GTTTTCAAAATTTCAAGAAAGACATTCAGCTATATCTGTTCTCTTGTTAAGGAAGCTATGATGGCCAAAACTTCCAATTTTACTGACTTAAACGGGAAACCTTTGTCAGTAAACGACCAAGTCGCCGTTGCTCTTAGGCGGCTTAGTTCTGGTGAATCATTATCAATTATTGGTGATTCATTTGGAATGAATCAGTCATCAGTTTCCCAAATAACTTGGCGTTTTGTGGAGGCGATGGAGGAGAAAGGGCTCCACCATCTGTCATGGCCTTCAACAGAAGAAGATATGGATCAGATAAAGTCCAAGTTTAAGAAAATCAAAGGCCTTCCTAATTGTTGTGGCGTAATCGAAACAACGCACATTATGATGACGTTGCCAACAGCAGAATCTGCAAACGGCGTCTGGCTTGACCGTGAGAAAAACTGCAGCATGATCTTGCAAGTGATTGTAGACCCAGAAATGAGATTCTGTGACATCATGGCGGGTTGGCCAGGAAGTCTGAGCGACGCTCTTGTGCTCCAAAGCTCGGGATTTTTCAAACTTTCCCAAGATGGGGAGCGGTTAAATGGAAAGAATATGAAGCTCTCAGAAAGTTCAGAGCTAGGAGAGTATATTATAGGAGACTCTGGTTTCCCCCTCTTGCCATGGCTACTAACTCCTTATCAAGGGAAAGGCCTTTCGGATTATCAGACCGAGTTTAACAAGCGACATTACGCCACCCGGTTGGTGGCTCAAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGGTAATGTGGAAGCCTGATAAACACAGGCTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAGTGATCGATATGGAGGACGAGGTGCAAGACGAGATGCCCTTGTCTCATCATCACGACTCCGGTTACCGACAACAAAGTTGCAAATTCGTCGACAATACTGCCTCTGTCGTGAGGGAGAAGCTCTCCATGTACTTGTCTGGAAAATTGCCTCCCTAGGAGGATCTCGACCCGCATCTGAAGATTCTTTTCTCTCTTTCTTTCCTTTTCATAGATTGATCTGTGTTCCTGTTGATTCAAATTCTGCAAGTTCTACCTGTCCAAATAGTTGTTCTGGATGACATAAATTTAATTCATAGTTCAATATCTATATTTCTGGTGATTATGATTTGCCTTAACAGATACCCACTTCTTTTTTCTACAAACATTATGCTGTGTTTGGCTTTGGCAGCCATTTTTGTAGAACAAAGCAAACTTTTTGCTGAAGAAATTTCACCTTCACCACTGCTCTTTGTCCCTATGATAATGTCTAG

mRNA sequence

CACCACCACACACAAGAAGTAAAGAAAGGGAAGATTGGAAGTTGCCAATATCCCCCTCTCTCGTTTTTAATTCTTTCTTTTCAATTTCTCTTTTTTTTTACCTTTCCTTCGGCTACATTGTTGATTGTTGTCCTTCCCCCAACATGTAATTGGCCCCGCGTCAATTTTCTTTTTTACAACTTCTTTGCCTTACTCCAAACTTCTTCTCTCCACCATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCACAGTCATATTTCATATTCTTCCATTTATTATTGCATGCTTTGAAATCCAAAAACCAGAGGAACTTCACTCACCCATCAATACCAAACATATCCTTCTAACCCAAAAACCTCAGAGGCACTACAATCTTCCCCTTTGTGGGTCTGCCAAAAATGCTTGTTGTGTCCTTCGTTTGCTACCCTTTTGTGTTATTATTCCATTTGCTTCGAATCTGAGAGTTTTCTTCATGTGAATTCTGTACTATCCTTCGATTCTGGTGGTGGGTTGTAGTGGGTGTTTCAGCTTTTGGCTCATCTGTGCTTCTTTTATTATTTTTGGGAGTTTCGAATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGAAGGCAGAGAAGAAGGTTGACCAAAATGTCTTGGCTGCTGCTTCACTGTCGTCTCAGCCCCAGCCCTTGGATTGGTGGGACGACTTCTCCCAGAGGATTACTGGACCATTATCCCAGTCAAAGAATCCAACAAAATTTGAGTCAGTTTTCAAAATTTCAAGAAAGACATTCAGCTATATCTGTTCTCTTGTTAAGGAAGCTATGATGGCCAAAACTTCCAATTTTACTGACTTAAACGGGAAACCTTTGTCAGTAAACGACCAAGTCGCCGTTGCTCTTAGGCGGCTTAGTTCTGGTGAATCATTATCAATTATTGGTGATTCATTTGGAATGAATCAGTCATCAGTTTCCCAAATAACTTGGCGTTTTGTGGAGGCGATGGAGGAGAAAGGGCTCCACCATCTGTCATGGCCTTCAACAGAAGAAGATATGGATCAGATAAAGTCCAAGTTTAAGAAAATCAAAGGCCTTCCTAATTGTTGTGGCGTAATCGAAACAACGCACATTATGATGACGTTGCCAACAGCAGAATCTGCAAACGGCGTCTGGCTTGACCGTGAGAAAAACTGCAGCATGATCTTGCAAGTGATTGTAGACCCAGAAATGAGATTCTGTGACATCATGGCGGGTTGGCCAGGAAGTCTGAGCGACGCTCTTGTGCTCCAAAGCTCGGGATTTTTCAAACTTTCCCAAGATGGGGAGCGGTTAAATGGAAAGAATATGAAGCTCTCAGAAAGTTCAGAGCTAGGAGAGTATATTATAGGAGACTCTGGTTTCCCCCTCTTGCCATGGCTACTAACTCCTTATCAAGGGAAAGGCCTTTCGGATTATCAGACCGAGTTTAACAAGCGACATTACGCCACCCGGTTGGTGGCTCAAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGGTAATGTGGAAGCCTGATAAACACAGGCTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAGTGATCGATATGGAGGACGAGGTGCAAGACGAGATGCCCTTGTCTCATCATCACGACTCCGGTTACCGACAACAAAGTTGCAAATTCGTCGACAATACTGCCTCTGTCGTGAGGGAGAAGCTCTCCATGTACTTGTCTGGAAAATTGCCTCCCTAGGAGGATCTCGACCCGCATCTGAAGATTCTTTTCTCTCTTTCTTTCCTTTTCATAGATTGATCTGTGTTCCTGTTGATTCAAATTCTGCAAGTTCTACCTGTCCAAATAGTTGTTCTGGATGACATAAATTTAATTCATAGTTCAATATCTATATTTCTGGTGATTATGATTTGCCTTAACAGATACCCACTTCTTTTTTCTACAAACATTATGCTGTGTTTGGCTTTGGCAGCCATTTTTGTAGA
ACAAAGCAAACTTTTTGCTGAAGAAATTTCACCTTCACCACTGCTCTTTGTCCCTATGATAATGTCTAG

Coding sequence (CDS)

ATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGAAGGCAGAGAAGAAGGTTGACCAAAATGTCTTGGCTGCTGCTTCACTGTCGTCTCAGCCCCAGCCCTTGGATTGGTGGGACGACTTCTCCCAGAGGATTACTGGACCATTATCCCAGTCAAAGAATCCAACAAAATTTGAGTCAGTTTTCAAAATTTCAAGAAAGACATTCAGCTATATCTGTTCTCTTGTTAAGGAAGCTATGATGGCCAAAACTTCCAATTTTACTGACTTAAACGGGAAACCTTTGTCAGTAAACGACCAAGTCGCCGTTGCTCTTAGGCGGCTTAGTTCTGGTGAATCATTATCAATTATTGGTGATTCATTTGGAATGAATCAGTCATCAGTTTCCCAAATAACTTGGCGTTTTGTGGAGGCGATGGAGGAGAAAGGGCTCCACCATCTGTCATGGCCTTCAACAGAAGAAGATATGGATCAGATAAAGTCCAAGTTTAAGAAAATCAAAGGCCTTCCTAATTGTTGTGGCGTAATCGAAACAACGCACATTATGATGACGTTGCCAACAGCAGAATCTGCAAACGGCGTCTGGCTTGACCGTGAGAAAAACTGCAGCATGATCTTGCAAGTGATTGTAGACCCAGAAATGAGATTCTGTGACATCATGGCGGGTTGGCCAGGAAGTCTGAGCGACGCTCTTGTGCTCCAAAGCTCGGGATTTTTCAAACTTTCCCAAGATGGGGAGCGGTTAAATGGAAAGAATATGAAGCTCTCAGAAAGTTCAGAGCTAGGAGAGTATATTATAGGAGACTCTGGTTTCCCCCTCTTGCCATGGCTACTAACTCCTTATCAAGGGAAAGGCCTTTCGGATTATCAGACCGAGTTTAACAAGCGACATTACGCCACCCGGTTGGTGGCTCAAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGGTAATGTGGAAGCCTGATAAACACAGGCTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAGTGATCGATATGGAGGACGAGGTGCAAGACGAGATGCCCTTGTCTCATCATCACGACTCCGGTTACCGACAACAAAGTTGCAAATTCGTCGACAATACTGCCTCTGTCGTGAGGGAGAAGCTCTCCATGTACTTGTCTGGAAAATTGCCTCCCTAG

Protein sequence

MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHIMMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKLSQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHDSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP
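
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. The sketch below assumes the standard nuclear genetic code (the page does not name a translation table) and uses a hypothetical `translate` function; only the first and last few codons of the CDS are used as input, to keep the example short.

```python
from itertools import product

# Assumption: standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First nine codons of the CDS shown above.
print(translate("ATGGGACCCATTAGAGGGTTCAAGAGG"))
# Last six codons of the CDS, including the terminal TAG stop.
print(translate("GGAAAATTGCCTCCCTAG"))
```

The two fragments translate to the beginning (MGPIRGFKR) and the end (GKLPP) of the protein sequence listed above.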
Homology
BLAST of MC09g1194 vs. ExPASy Swiss-Prot
Match: Q9M2U3 (Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1)

HSP 1 Score: 523.1 bits (1346), Expect = 2.7e-147
Identity = 257/408 (62.99%), Positives = 311/408 (76.23%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLS------------------SQPQPLDWWDDFSQ 60
           MGPI+  K+KK+AEKKVD+NVL AA+ +                  S  Q LDWWD FS+
Sbjct: 1   MGPIKTIKKKKRAEKKVDRNVLLAATAAATSASAAAALNNNDDDDDSSSQSLDWWDGFSR 60

Query: 61  RITGPLSQSKNPTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAV 120
           RI G    S +P  FESVFKISRKTF YICSLVK    AK +NF+D NG PLS+ND+VAV
Sbjct: 61  RIYG---GSTDPKTFESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLNDRVAV 120

Query: 121 ALRRLSSGESLSIIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKF 180
           ALRRL SGESLS+IG++FGMNQS+VSQITWRFVE+MEE+ +HHLSWPS    +D+IKSKF
Sbjct: 121 ALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPS---KLDEIKSKF 180

Query: 181 KKIKGLPNCCGVIETTHIMMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGW 240
           +KI GLPNCCG I+ THI+M LP  E +N VWLD EKN SM LQ +VDP+MRF D++AGW
Sbjct: 181 EKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAGW 240

Query: 241 PGSLSDALVLQSSGFFKLSQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQG 300
           PGSL+D +VL++SGF+KL + G+RLNG+ + LSE +EL EYI+GDSGFPLLPWLLTPYQG
Sbjct: 241 PGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQG 300

Query: 301 KGLSDYQTEFNKRHYATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNI 360
           K  S  QTEFNKRH      AQ AL++LK+ W+II GVMW PD++RLPRII VCCLLHNI
Sbjct: 301 KPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHNI 360

Query: 361 VIDMEDEVQDEMPLSHHHDSGYRQQSCKFVDNTASVVREKLSMYLSGK 391
           +IDMED+  D+ PLS  HD  YRQ+SCK  D  +SV+R++LS  L GK
Sbjct: 361 IIDMEDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQLCGK 402
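
The percentages in each HSP header are the identical (or positive-scoring) positions divided by the alignment length, gaps included. A minimal sketch of recomputing them from a header line follows; `hsp_percentages` is a hypothetical helper, and the regular expression is an assumption about the line layout seen on this page rather than a general BLAST parser.

```python
import re

# Assumption: header lines look like
# "Identity = 257/408 (62.99%), Positives = 311/408 (76.23%), ...".
# The optional "i" tolerates a misspelled "Postives" label.
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) rounded to two decimals."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length_a, positive, length_b = map(int, match.groups())
    return (
        round(100 * identical / length_a, 2),
        round(100 * positive / length_b, 2),
    )


header = "Identity = 257/408 (62.99%), Positives = 311/408 (76.23%), Query Frame = 0"
print(hsp_percentages(header))
```

For the Q9M2U3 hit this reproduces the reported 62.99% identity and 76.23% positives over an alignment of 408 columns.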

BLAST of MC09g1194 vs. ExPASy Swiss-Prot
Match: Q94K49 (Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=3702 GN=ALP1 PE=1 SV=1)

HSP 1 Score: 365.9 bits (938), Expect = 5.6e-100
Identity = 175/380 (46.05%), Positives = 257/380 (67.63%), Query Frame = 0

Query: 9   RKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESVFKISRKTF 68
           + KK  K  ++  + A  L  +    DWWD F  R + P   S     F+  F+ S+ TF
Sbjct: 17  KAKKLAKNKEKKRVNAVPLDPEAIDCDWWDTFWLRNSSPSVPSDEDYAFKHFFRASKTTF 76

Query: 69  SYICSLVKEAMMAK-TSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSFGMNQSSV 128
           SYICSLV+E ++++  S   ++ G+ LSV  QVA+ALRRL+SG+S   +G +FG+ QS+V
Sbjct: 77  SYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGDSQVSVGAAFGVGQSTV 136

Query: 129 SQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHIMMTLPTA 188
           SQ+TWRF+EA+EE+  HHL WP ++  +++IKSKF+++ GLPNCCG I+TTHI+MTLP  
Sbjct: 137 SQVTWRFIEALEERAKHHLRWPDSDR-IEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPAV 196

Query: 189 ESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKLSQDGERL 248
           ++++  W D+EKN SM LQ + D EMRF +++ GWPG ++ + +L+ SGFFKL ++ + L
Sbjct: 197 QASDD-WCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQIL 256

Query: 249 NGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATRLVAQRAL 308
           +G    LS+ +++ EY++G   +PLLPWL+TP+     SD    FN+RH   R VA  A 
Sbjct: 257 DGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSMVAFNERHEKVRSVAATAF 316

Query: 309 TRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHDSGYRQQ 368
            +LK  W+I+  VMW+PD+ +LP IILVCCLLHNI+ID  D +Q+++PLS HHDSGY  +
Sbjct: 317 QQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQEDVPLSGHHDSGYADR 376

Query: 369 SCKFVDNTASVVREKLSMYL 388
            CK  +   S +R  L+ +L
Sbjct: 377 YCKQTEPLGSELRGCLTEHL 394

BLAST of MC09g1194 vs. ExPASy Swiss-Prot
Match: Q6AZB8 (Putative nuclease HARBI1 OS=Danio rerio OX=7955 GN=harbi1 PE=2 SV=1)

HSP 1 Score: 121.7 bits (304), Expect = 1.8e-26
Identity = 77/289 (26.64%), Positives = 145/289 (50.17%), Query Frame = 0

Query: 59  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGD 118
           + F   R+   Y+  L+K++++ +T        + +S + Q+  AL   +SG   S +GD
Sbjct: 37  NTFGFPREFIYYLVELLKDSLLRRTQ-----RSRAISPDVQILAALGFYTSGSFQSKMGD 96

Query: 119 SFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETT 178
           + G++Q+S+S+      +A+ EK    + +   E    Q K +F +I G+PN  GV++  
Sbjct: 97  AIGISQASMSRCVSNVTKALIEKAPEFIGFTRDEATKQQFKDEFYRIAGIPNVTGVVDCA 156

Query: 179 HIMMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFF 238
           HI +  P A+ ++  +++++   S+  Q++ D           WPGSL+D  V + S   
Sbjct: 157 HIAIKAPNADDSS--YVNKKGFHSINCQLVCDARGLLLSAETHWPGSLTDRAVFKQSNVA 216

Query: 239 KLSQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQG-KGLSDYQTEFNKRHY 298
           KL ++            E+ + G +++GD+ +PL  WL+TP Q  +  +DY+  +N  H 
Sbjct: 217 KLFEE-----------QENDDEG-WLLGDNRYPLKKWLMTPVQSPESPADYR--YNLAHT 276

Query: 299 ATRLVAQRALTRLKEMWKIIKG----VMWKPDKHRLPRIILVCCLLHNI 343
            T  +  R    ++  ++ + G    + + P+K     II  CC+LHNI
Sbjct: 277 TTHEIVDRTFRAIQTRFRCLDGAKGYLQYSPEK--CSHIIQACCVLHNI 302

BLAST of MC09g1194 vs. ExPASy Swiss-Prot
Match: Q17QR8 (Putative nuclease HARBI1 OS=Bos taurus OX=9913 GN=HARBI1 PE=2 SV=1)

HSP 1 Score: 113.2 bits (282), Expect = 6.6e-24
Identity = 77/296 (26.01%), Positives = 141/296 (47.64%), Query Frame = 0

Query: 59  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGD 118
           S++   R+   Y+  L+  ++   T        + +S   Q+  AL   +SG   + +GD
Sbjct: 37  SMYGFPRQFIYYLVELLGASLSRPTQ-----RSRAISPETQILAALGFYTSGSFQTRMGD 96

Query: 119 SFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETT 178
           + G++Q+S+S+      EA+ E+    + +P+ E  +  +K +F  + G+P   GV++  
Sbjct: 97  AIGISQASMSRCVANVTEALVERASQFIHFPADEASVQALKDEFYGLAGIPGVIGVVDCM 156

Query: 179 HIMMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFF 238
           H+ +  P AE  +  +++R+   S+   ++ D       +   WPGSL D +VLQ S   
Sbjct: 157 HVAIKAPNAEDLS--YVNRKGLHSLNCLMVCDIRGALMTVETSWPGSLQDCVVLQQS--- 216

Query: 239 KLSQDGERLNGKNMKLSESSELG----EYIIGDSGFPLLPWLLTP-YQGKGLSDYQTEFN 298
                          LS   E G     +++GDS F L  WL+TP +  +  ++Y+  +N
Sbjct: 217 --------------SLSSQFEAGMHKESWLLGDSSFFLRTWLMTPLHIPETPAEYR--YN 276

Query: 299 KRHYATRLVAQRALTRLKEMWKIIKG----VMWKPDKHRLPRIILVCCLLHNIVID 346
             H AT  V ++    L   ++ + G    + + P+K     IIL CC+LHNI ++
Sbjct: 277 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKS--SHIILACCVLHNISLE 304

BLAST of MC09g1194 vs. ExPASy Swiss-Prot
Match: B0BN95 (Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 113.2 bits (282), Expect = 6.6e-24
Identity = 79/296 (26.69%), Positives = 141/296 (47.64%), Query Frame = 0

Query: 59  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGD 118
           S++   R+   Y+  L+  ++   T        + +S   Q+  AL   +SG   + +GD
Sbjct: 37  SMYGFPRQFIYYLVELLGASLSRPTQ-----RSRAISPETQILAALGFYTSGSFQTRMGD 96

Query: 119 SFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETT 178
           + G++Q+S+S+      EA+ E+    + +P+ E  +  +K +F  + G+P   G ++  
Sbjct: 97  AIGISQASMSRCVANVTEALVERASQFIHFPADEAAIQSLKDEFYGLAGMPGVIGAVDCI 156

Query: 179 HIMMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFF 238
           H+ +  P AE  +  +++R+   S+   V+ D       +   WPGSL D  VLQ S   
Sbjct: 157 HVAIKAPNAEDLS--YVNRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQQS--- 216

Query: 239 KLSQDGERLNGKNMKLSESSELG----EYIIGDSGFPLLPWLLTP-YQGKGLSDYQTEFN 298
                          LS   E G     +++GDS F L  WLLTP +  +  ++Y+  +N
Sbjct: 217 --------------SLSSQFETGMPKDSWLLGDSSFFLHTWLLTPLHIPETPAEYR--YN 276

Query: 299 KRHYATRLVAQRALTRLKEMWKIIKG----VMWKPDKHRLPRIILVCCLLHNIVID 346
           + H AT  V ++ L  L   ++ + G    + + P+K     IIL CC+LHNI ++
Sbjct: 277 RAHSATHSVIEKTLRTLCCRFRCLDGSKGALQYSPEKS--SHIILACCVLHNISLE 304

BLAST of MC09g1194 vs. NCBI nr
Match: XP_022138922.1 (protein ALP1-like [Momordica charantia])

HSP 1 Score: 798 bits (2061), Expect = 1.36e-291
Identity = 392/393 (99.75%), Positives = 392/393 (99.75%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV 60
           MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV
Sbjct: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV 60

Query: 61  FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF 120
           FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF
Sbjct: 61  FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF 120

Query: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI 180
           GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI
Sbjct: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI 180

Query: 181 MMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL 240
           MMTLPTAES NGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL
Sbjct: 181 MMTLPTAESXNGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL 240

Query: 241 SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR 300
           SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR
Sbjct: 241 SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR 300

Query: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360
           LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH
Sbjct: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360

Query: 361 DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393
           DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP
Sbjct: 361 DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393

BLAST of MC09g1194 vs. NCBI nr
Match: XP_038891834.1 (protein ALP1-like [Benincasa hispida])

HSP 1 Score: 755 bits (1950), Expect = 1.55e-272
Identity = 370/393 (94.15%), Positives = 379/393 (96.44%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV 60
           MGPIRGFKRKKK EKKVDQNV AAASLSSQ QPLDWWD+FSQRITGPLSQSKN TKFESV
Sbjct: 134 MGPIRGFKRKKKVEKKVDQNVFAAASLSSQLQPLDWWDEFSQRITGPLSQSKN-TKFESV 193

Query: 61  FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF 120
           FKISRKTFSYICSLVKE MMAKTSNFTDLNGKPLS+NDQVAVALRRL SGESLS IG+SF
Sbjct: 194 FKISRKTFSYICSLVKEVMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGESF 253

Query: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI 180
           GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKI+GLPNCCGVIETTHI
Sbjct: 254 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTHI 313

Query: 181 MMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL 240
           MMTLPT ESANG+WLDREKNCSMILQVIVDPEMRFCDI+ GWPGSLSDALVLQSSGFFKL
Sbjct: 314 MMTLPTTESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKL 373

Query: 241 SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR 300
           SQD ERLNGK MKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRH+ATR
Sbjct: 374 SQDSERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFATR 433

Query: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360
           LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH
Sbjct: 434 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 493

Query: 361 DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393
           D  YRQQSC+FVDNTAS+ REKLSMYLSGKLPP
Sbjct: 494 DPSYRQQSCEFVDNTASIAREKLSMYLSGKLPP 525

BLAST of MC09g1194 vs. NCBI nr
Match: XP_004147700.1 (protein ALP1-like [Cucumis sativus] >KGN50531.1 hypothetical protein Csa_000507 [Cucumis sativus])

HSP 1 Score: 749 bits (1933), Expect = 4.24e-272
Identity = 365/393 (92.88%), Positives = 379/393 (96.44%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV 60
           MGPIRGFKRKKK EKKVDQNV A+ASLSSQ QPLDWWD+FSQRITGPLSQSKN TKFESV
Sbjct: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKN-TKFESV 60

Query: 61  FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF 120
           FKISRKTFSYICSLVKE MMAKTS+FTDLNGKPLS+NDQVAVALRRL SGESLS IGDSF
Sbjct: 61  FKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSF 120

Query: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI 180
           G+NQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMD+IKSKFKKI+GLPNCCGV+ETTHI
Sbjct: 121 GLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVVETTHI 180

Query: 181 MMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL 240
           MMTLPT+ESANG+WLDREKNCSMILQVIVDPEMRFCDI+ GWPGSLSDALVLQSSGFFKL
Sbjct: 181 MMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKL 240

Query: 241 SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR 300
           SQDGERLNGK MKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRH+ATR
Sbjct: 241 SQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATR 300

Query: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360
           LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH
Sbjct: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360

Query: 361 DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393
           D  YRQQSC+FVDNTAS+ REKLSMYLSGKLPP
Sbjct: 361 DPSYRQQSCEFVDNTASISREKLSMYLSGKLPP 392

BLAST of MC09g1194 vs. NCBI nr
Match: XP_008461643.1 (PREDICTED: putative nuclease HARBI1 [Cucumis melo])

HSP 1 Score: 748 bits (1930), Expect = 1.21e-271
Identity = 364/393 (92.62%), Positives = 379/393 (96.44%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV 60
           MGPIRGFKRKKK EKKVDQNV A+ASLSSQ QPLDWWD+FSQRITGPLSQSKN TKFESV
Sbjct: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKN-TKFESV 60

Query: 61  FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF 120
           FKISRKTFSYICSLVKE MMAKTS+FTDLNGKPLS+NDQVAVALRRL SGESLS IGDSF
Sbjct: 61  FKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSF 120

Query: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI 180
           G+NQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMD+IKSKFKKI+GLPNCCGVIETTHI
Sbjct: 121 GLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVIETTHI 180

Query: 181 MMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL 240
           MMTLPT+ESANG+WLDREKNCSMILQVIVDPEMRFCDI+ GWPGSLSD+LVLQSSGFFKL
Sbjct: 181 MMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDSLVLQSSGFFKL 240

Query: 241 SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR 300
           SQDGERLNGK M+LSESSELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRH+ATR
Sbjct: 241 SQDGERLNGKKMRLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATR 300

Query: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360
           LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH
Sbjct: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360

Query: 361 DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393
           D  YRQQSC+FVDNTAS+ REKLSMYLSGKLPP
Sbjct: 361 DPSYRQQSCEFVDNTASIAREKLSMYLSGKLPP 392

BLAST of MC09g1194 vs. NCBI nr
Match: XP_022941714.1 (protein ALP1-like [Cucurbita moschata])

HSP 1 Score: 740 bits (1910), Expect = 1.21e-268
Identity = 363/393 (92.37%), Positives = 377/393 (95.93%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV 60
           MGPIRGFKRKKK   KVDQNVL  +SL+SQPQPLDWWD+FSQRITGPLS+SKN T FESV
Sbjct: 1   MGPIRGFKRKKK---KVDQNVLVPSSLTSQPQPLDWWDEFSQRITGPLSESKN-TNFESV 60

Query: 61  FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF 120
           FKISRKTFSYI SLVKEAMMAKTSNFTDLNGKPLS+NDQVAVALRRLSSGESLS IGDSF
Sbjct: 61  FKISRKTFSYISSLVKEAMMAKTSNFTDLNGKPLSINDQVAVALRRLSSGESLSNIGDSF 120

Query: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI 180
           GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEE MD+IKSKFKKIKGLPNCCGVIETTHI
Sbjct: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEGMDEIKSKFKKIKGLPNCCGVIETTHI 180

Query: 181 MMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL 240
           MMTLPT ESA+GVWLDREKNCSM+LQVIVDPEMRFCDI+ GWPGSLSDALVLQSSGFFKL
Sbjct: 181 MMTLPTTESAHGVWLDREKNCSMLLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKL 240

Query: 241 SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR 300
           SQDGERLNGK MKLSESSE+GEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRH+ATR
Sbjct: 241 SQDGERLNGKKMKLSESSEVGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFATR 300

Query: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360
           LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRI+LVCCLLHNIVIDMEDEVQDEMPLSHHH
Sbjct: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIVLVCCLLHNIVIDMEDEVQDEMPLSHHH 360

Query: 361 DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393
           D  YRQQSC+FVDNTAS+ REKLSMYLSGKLPP
Sbjct: 361 DPSYRQQSCEFVDNTASMAREKLSMYLSGKLPP 389

BLAST of MC09g1194 vs. ExPASy TrEMBL
Match: A0A6J1CCK2 (protein ALP1-like OS=Momordica charantia OX=3673 GN=LOC111009982 PE=3 SV=1)

HSP 1 Score: 798 bits (2061), Expect = 6.57e-292
Identity = 392/393 (99.75%), Positives = 392/393 (99.75%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV 60
           MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV
Sbjct: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV 60

Query: 61  FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF 120
           FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF
Sbjct: 61  FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF 120

Query: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI 180
           GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI
Sbjct: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI 180

Query: 181 MMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL 240
           MMTLPTAES NGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL
Sbjct: 181 MMTLPTAESXNGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL 240

Query: 241 SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR 300
           SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR
Sbjct: 241 SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR 300

Query: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360
           LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH
Sbjct: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360

Query: 361 DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393
           DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP
Sbjct: 361 DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393

BLAST of MC09g1194 vs. ExPASy TrEMBL
Match: A0A0A0KS64 (DDE Tnp4 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G180900 PE=3 SV=1)

HSP 1 Score: 749 bits (1933), Expect = 2.05e-272
Identity = 365/393 (92.88%), Positives = 379/393 (96.44%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV 60
           MGPIRGFKRKKK EKKVDQNV A+ASLSSQ QPLDWWD+FSQRITGPLSQSKN TKFESV
Sbjct: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKN-TKFESV 60

Query: 61  FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF 120
           FKISRKTFSYICSLVKE MMAKTS+FTDLNGKPLS+NDQVAVALRRL SGESLS IGDSF
Sbjct: 61  FKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSF 120

Query: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI 180
           G+NQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMD+IKSKFKKI+GLPNCCGV+ETTHI
Sbjct: 121 GLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVVETTHI 180

Query: 181 MMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL 240
           MMTLPT+ESANG+WLDREKNCSMILQVIVDPEMRFCDI+ GWPGSLSDALVLQSSGFFKL
Sbjct: 181 MMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKL 240

Query: 241 SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR 300
           SQDGERLNGK MKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRH+ATR
Sbjct: 241 SQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATR 300

Query: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360
           LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH
Sbjct: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360

Query: 361 DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393
           D  YRQQSC+FVDNTAS+ REKLSMYLSGKLPP
Sbjct: 361 DPSYRQQSCEFVDNTASISREKLSMYLSGKLPP 392

BLAST of MC09g1194 vs. ExPASy TrEMBL
Match: A0A1S3CEZ1 (putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103500196 PE=3 SV=1)

HSP 1 Score: 748 bits (1930), Expect = 5.88e-272
Identity = 364/393 (92.62%), Positives = 379/393 (96.44%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV 60
           MGPIRGFKRKKK EKKVDQNV A+ASLSSQ QPLDWWD+FSQRITGPLSQSKN TKFESV
Sbjct: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKN-TKFESV 60

Query: 61  FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF 120
           FKISRKTFSYICSLVKE MMAKTS+FTDLNGKPLS+NDQVAVALRRL SGESLS IGDSF
Sbjct: 61  FKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSF 120

Query: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI 180
           G+NQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMD+IKSKFKKI+GLPNCCGVIETTHI
Sbjct: 121 GLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVIETTHI 180

Query: 181 MMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL 240
           MMTLPT+ESANG+WLDREKNCSMILQVIVDPEMRFCDI+ GWPGSLSD+LVLQSSGFFKL
Sbjct: 181 MMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDSLVLQSSGFFKL 240

Query: 241 SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR 300
           SQDGERLNGK M+LSESSELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRH+ATR
Sbjct: 241 SQDGERLNGKKMRLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATR 300

Query: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360
           LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH
Sbjct: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360

Query: 361 DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393
           D  YRQQSC+FVDNTAS+ REKLSMYLSGKLPP
Sbjct: 361 DPSYRQQSCEFVDNTASIAREKLSMYLSGKLPP 392

BLAST of MC09g1194 vs. ExPASy TrEMBL
Match: A0A6J1FP85 (protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111446995 PE=3 SV=1)

HSP 1 Score: 740 bits (1910), Expect = 5.86e-269
Identity = 363/393 (92.37%), Positives = 377/393 (95.93%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV 60
           MGPIRGFKRKKK   KVDQNVL  +SL+SQPQPLDWWD+FSQRITGPLS+SKN T FESV
Sbjct: 1   MGPIRGFKRKKK---KVDQNVLVPSSLTSQPQPLDWWDEFSQRITGPLSESKN-TNFESV 60

Query: 61  FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF 120
           FKISRKTFSYI SLVKEAMMAKTSNFTDLNGKPLS+NDQVAVALRRLSSGESLS IGDSF
Sbjct: 61  FKISRKTFSYISSLVKEAMMAKTSNFTDLNGKPLSINDQVAVALRRLSSGESLSNIGDSF 120

Query: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI 180
           GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEE MD+IKSKFKKIKGLPNCCGVIETTHI
Sbjct: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEGMDEIKSKFKKIKGLPNCCGVIETTHI 180

Query: 181 MMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL 240
           MMTLPT ESA+GVWLDREKNCSM+LQVIVDPEMRFCDI+ GWPGSLSDALVLQSSGFFKL
Sbjct: 181 MMTLPTTESAHGVWLDREKNCSMLLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKL 240

Query: 241 SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR 300
           SQDGERLNGK MKLSESSE+GEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRH+ATR
Sbjct: 241 SQDGERLNGKKMKLSESSEVGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFATR 300

Query: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360
           LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRI+LVCCLLHNIVIDMEDEVQDEMPLSHHH
Sbjct: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIVLVCCLLHNIVIDMEDEVQDEMPLSHHH 360

Query: 361 DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393
           D  YRQQSC+FVDNTAS+ REKLSMYLSGKLPP
Sbjct: 361 DPSYRQQSCEFVDNTASMAREKLSMYLSGKLPP 389
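
The percentages in each HSP summary line follow directly from the counts: for the Cucurbita moschata hit above, 363 identical and 377 positive positions over 393 aligned columns. The Python sketch below recomputes them, assuming the summary line has the layout shown on this page. `parse_hsp_stats` is a hypothetical helper written for illustration, not part of any BLAST distribution.

```python
import re

# Assumed layout: "Identity = a/n (x%), Positives = b/n (y%), ...".
# The optional "i" also accepts the "Postives" spelling some exports use.
_HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Hypothetical helper: return counts and recomputed percentages."""
    m = _HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    identical, aligned = int(m.group(1)), int(m.group(2))
    positives, aligned_pos = int(m.group(4)), int(m.group(5))
    return {
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        # Percentages are recomputed from the counts, not copied from the text.
        "pct_identity": round(100 * identical / aligned, 2),
        "pct_positives": round(100 * positives / aligned_pos, 2),
    }

line = "Identity = 363/393 (92.37%), Positives = 377/393 (95.93%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["pct_identity"], stats["pct_positives"])
```

Recomputing from the counts rather than trusting the printed percentage gives a quick consistency check when these lines are scraped from a page.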

BLAST of MC09g1194 vs. ExPASy TrEMBL
Match: A0A6J1J6K0 (protein ALP1-like OS=Cucurbita maxima OX=3661 GN=LOC111483923 PE=3 SV=1)

HSP 1 Score: 738 bits (1904), Expect = 4.81e-268
Identity = 362/393 (92.11%), Positives = 376/393 (95.67%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV 60
           MGPIRGFKRKKK   KVDQNVL  +SL+SQPQPLDWWD+FSQRITGPLS+SKN T FESV
Sbjct: 1   MGPIRGFKRKKK---KVDQNVLVPSSLTSQPQPLDWWDEFSQRITGPLSESKN-TNFESV 60

Query: 61  FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF 120
           FKISRKTFSYI SLVKEAMMAKTSNFTDLNGKPLS+NDQVAVALRRLSSGESLS IGDSF
Sbjct: 61  FKISRKTFSYISSLVKEAMMAKTSNFTDLNGKPLSINDQVAVALRRLSSGESLSNIGDSF 120

Query: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI 180
           GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEE MDQIKSKFKKIKGLPNCCGVIETTHI
Sbjct: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEGMDQIKSKFKKIKGLPNCCGVIETTHI 180

Query: 181 MMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL 240
           MMTLPT ESA+GVWLDREKNCSM+LQVIVDPEMRFCDI+ GWPGSLSDALVLQSSGFF+L
Sbjct: 181 MMTLPTTESAHGVWLDREKNCSMLLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFRL 240

Query: 241 SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR 300
           SQDGERLNGK MKLSESSE+GEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRH+ATR
Sbjct: 241 SQDGERLNGKKMKLSESSEVGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFATR 300

Query: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360
           LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRI+LVCCLLHNIVIDMEDEVQDEMPLSHHH
Sbjct: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIVLVCCLLHNIVIDMEDEVQDEMPLSHHH 360

Query: 361 DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393
           D  YRQQSC+FVDNTAS+ REKLSMYL GKLPP
Sbjct: 361 DPSYRQQSCEFVDNTASMAREKLSMYLLGKLPP 389

BLAST of MC09g1194 vs. TAIR 10
Match: AT3G55350.1 (PIF / Ping-Pong family of plant transposases )

HSP 1 Score: 523.1 bits (1346), Expect = 2.0e-148
Identity = 257/408 (62.99%), Positives = 311/408 (76.23%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLS------------------SQPQPLDWWDDFSQ 60
           MGPI+  K+KK+AEKKVD+NVL AA+ +                  S  Q LDWWD FS+
Sbjct: 1   MGPIKTIKKKKRAEKKVDRNVLLAATAAATSASAAAALNNNDDDDDSSSQSLDWWDGFSR 60

Query: 61  RITGPLSQSKNPTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAV 120
           RI G    S +P  FESVFKISRKTF YICSLVK    AK +NF+D NG PLS+ND+VAV
Sbjct: 61  RIYG---GSTDPKTFESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLNDRVAV 120

Query: 121 ALRRLSSGESLSIIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKF 180
           ALRRL SGESLS+IG++FGMNQS+VSQITWRFVE+MEE+ +HHLSWPS    +D+IKSKF
Sbjct: 121 ALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPS---KLDEIKSKF 180

Query: 181 KKIKGLPNCCGVIETTHIMMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGW 240
           +KI GLPNCCG I+ THI+M LP  E +N VWLD EKN SM LQ +VDP+MRF D++AGW
Sbjct: 181 EKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAGW 240

Query: 241 PGSLSDALVLQSSGFFKLSQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQG 300
           PGSL+D +VL++SGF+KL + G+RLNG+ + LSE +EL EYI+GDSGFPLLPWLLTPYQG
Sbjct: 241 PGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQG 300

Query: 301 KGLSDYQTEFNKRHYATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNI 360
           K  S  QTEFNKRH      AQ AL++LK+ W+II GVMW PD++RLPRII VCCLLHNI
Sbjct: 301 KPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHNI 360

Query: 361 VIDMEDEVQDEMPLSHHHDSGYRQQSCKFVDNTASVVREKLSMYLSGK 391
           +IDMED+  D+ PLS  HD  YRQ+SCK  D  +SV+R++LS  L GK
Sbjct: 361 IIDMEDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQLCGK 402

BLAST of MC09g1194 vs. TAIR 10
Match: AT3G63270.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: PIF / Ping-Pong family of plant transposases (TAIR:AT3G55350.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 365.9 bits (938), Expect = 4.0e-101
Identity = 175/380 (46.05%), Positives = 257/380 (67.63%), Query Frame = 0

Query: 9   RKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESVFKISRKTF 68
           + KK  K  ++  + A  L  +    DWWD F  R + P   S     F+  F+ S+ TF
Sbjct: 17  KAKKLAKNKEKKRVNAVPLDPEAIDCDWWDTFWLRNSSPSVPSDEDYAFKHFFRASKTTF 76

Query: 69  SYICSLVKEAMMAK-TSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSFGMNQSSV 128
           SYICSLV+E ++++  S   ++ G+ LSV  QVA+ALRRL+SG+S   +G +FG+ QS+V
Sbjct: 77  SYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGDSQVSVGAAFGVGQSTV 136

Query: 129 SQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHIMMTLPTA 188
           SQ+TWRF+EA+EE+  HHL WP ++  +++IKSKF+++ GLPNCCG I+TTHI+MTLP  
Sbjct: 137 SQVTWRFIEALEERAKHHLRWPDSDR-IEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPAV 196

Query: 189 ESANGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKLSQDGERL 248
           ++++  W D+EKN SM LQ + D EMRF +++ GWPG ++ + +L+ SGFFKL ++ + L
Sbjct: 197 QASDD-WCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQIL 256

Query: 249 NGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATRLVAQRAL 308
           +G    LS+ +++ EY++G   +PLLPWL+TP+     SD    FN+RH   R VA  A 
Sbjct: 257 DGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSMVAFNERHEKVRSVAATAF 316

Query: 309 TRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHDSGYRQQ 368
            +LK  W+I+  VMW+PD+ +LP IILVCCLLHNI+ID  D +Q+++PLS HHDSGY  +
Sbjct: 317 QQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQEDVPLSGHHDSGYADR 376

Query: 369 SCKFVDNTASVVREKLSMYL 388
            CK  +   S +R  L+ +L
Sbjct: 377 YCKQTEPLGSELRGCLTEHL 394

BLAST of MC09g1194 vs. TAIR 10
Match: AT5G12010.1 (unknown protein; INVOLVED IN: response to salt stress; LOCATED IN: chloroplast, plasma membrane, membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G29780.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 143.7 bits (361), Expect = 3.2e-34
Identity = 90/325 (27.69%), Positives = 165/325 (50.77%), Query Frame = 0

Query: 36  WWDDFSQRITGPLSQSKNPTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLS 95
           WW++ S R+  P         F+  F++S+ TF  IC  +  A+  + +   +     + 
Sbjct: 161 WWEECS-RLDYP------EEDFKKAFRMSKSTFELICDELNSAVAKEDTALRN----AIP 220

Query: 96  VNDQVAVALRRLSSGESLSIIGDSFGMNQSSVSQITWRFVEAMEEKGL-HHLSWPSTEED 155
           V  +VAV + RL++GE L ++   FG+  S+  ++     +A+++  +  +L WP  +E 
Sbjct: 221 VRQRVAVCIWRLATGEPLRLVSKKFGLGISTCHKLVLEVCKAIKDVLMPKYLQWPD-DES 280

Query: 156 MDQIKSKFKKIKGLPNCCGVIETTHIMMTLPTAESAN-----GVWLDREKNCSMILQVIV 215
           +  I+ +F+ + G+PN  G + TTHI +  P    A+         +++ + S+ +Q +V
Sbjct: 281 LRNIRERFESVSGIPNVVGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITIQAVV 340

Query: 216 DPEMRFCDIMAGWPGSLSDALVLQSSGFFKLSQDGERLNGKNMKLSESSELGEYIIGDSG 275
           +P+  F D+  GWPGS+ D  VL+ S  ++ + +G  L G             ++ G  G
Sbjct: 341 NPKGVFTDLCIGWPGSMPDDKVLEKSLLYQRANNGGLLKGM------------WVAGGPG 400

Query: 276 FPLLPWLLTPYQGKGLSDYQTEFNKRHYATRLVAQRALTRLKEMWKIIKGVMWKPDKHRL 335
            PLL W+L PY  + L+  Q  FN++    + VA+ A  RLK  W  ++    +     L
Sbjct: 401 HPLLDWVLVPYTQQNLTWTQHAFNEKMSEVQGVAKEAFGRLKGRWACLQ-KRTEVKLQDL 460

Query: 336 PRIILVCCLLHNIVIDMEDEVQDEM 355
           P ++  CC+LHNI    E++++ E+
Sbjct: 461 PTVLGACCVLHNICEMREEKMEPEL 460

BLAST of MC09g1194 vs. TAIR 10
Match: AT4G29780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G12010.1); Has 945 Blast hits to 944 proteins in 87 species: Archae - 0; Bacteria - 0; Metazoa - 519; Fungi - 43; Plants - 365; Viruses - 0; Other Eukaryotes - 18 (source: NCBI BLink). )

HSP 1 Score: 120.6 bits (301), Expect = 2.9e-27
Identity = 90/326 (27.61%), Positives = 156/326 (47.85%), Query Frame = 0

Query: 35  DWWDDFSQRITGPLSQSKNPTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPL 94
           DWWD    R++ P        +F   F++S+ TF+ IC  +   +  K +   D    P 
Sbjct: 198 DWWD----RVSRP---DFPEDEFRREFRMSKSTFNLICEELDTTVTKKNTMLRDAIPAP- 257

Query: 95  SVNDQVAVALRRLSSGESLSIIGDSFGMNQSSVSQITWRFVEAMEEKGL-HHLSWPSTEE 154
               +V V + RL++G  L  + + FG+  S+  ++      A+ +  +  +L WPS + 
Sbjct: 258 ---KRVGVCVWRLATGAPLRHVSERFGLGISTCHKLVIEVCRAIYDVLMPKYLLWPS-DS 317

Query: 155 DMDQIKSKFKKIKGLPNCCGVIETTHIMMTLPTAESA-----NGVWLDREKNCSMILQVI 214
           +++  K+KF+ +  +PN  G I TTHI +  P    A          +++ + S+ +Q +
Sbjct: 318 EINSTKAKFESVHKIPNVVGSIYTTHIPIIAPKVHVAAYFNKRHTERNQKTSYSITVQGV 377

Query: 215 VDPEMRFCDIMAGWPGSLSDALVLQSSGFFKLSQDGERLNGKNMKLSESSELGEYIIGDS 274
           V+ +  F D+  G PGSL+D  +L+ S          R       L +S     +I+G+S
Sbjct: 378 VNADGIFTDVCIGNPGSLTDDQILEKSSL-------SRQRAARGMLRDS-----WIVGNS 437

Query: 275 GFPLLPWLLTPYQGKGLSDYQTEFNKRHYATRLVAQRALTRLKEMWKIIKGVMWKPDKHR 334
           GFPL  +LL PY  + L+  Q  FN+     + +A  A  RLK  W  ++    +     
Sbjct: 438 GFPLTDYLLVPYTRQNLTWTQHAFNESIGEIQGIATAAFERLKGRWACLQ-KRTEVKLQD 497

Query: 335 LPRIILVCCLLHNIVIDMEDEVQDEM 355
           LP ++  CC+LHNI    ++E+  E+
Sbjct: 498 LPYVLGACCVLHNICEMRKEEMLPEL 498

BLAST of MC09g1194 vs. TAIR 10
Match: AT1G72270.1 (CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G27010.1); Has 772 Blast hits to 657 proteins in 120 species: Archae - 0; Bacteria - 0; Metazoa - 344; Fungi - 94; Plants - 322; Viruses - 0; Other Eukaryotes - 12 (source: NCBI BLink). )

HSP 1 Score: 97.8 bits (242), Expect = 2.0e-20
Identity = 71/252 (28.17%), Positives = 116/252 (46.03%), Query Frame = 0

Query: 101 AVALRRLSSGESLSIIGDSFGMNQSS-VSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIK 160
           A  + RL+ G S   +   FG + +S  S+  +   + + EK           + +D  K
Sbjct: 124 AATIFRLAHGASYECLVHRFGFDSTSQASRSFFTVCKLINEK---------LSQQLDDPK 183

Query: 161 SKFKKIKGLPNCCGVIETTHIMMTLPTAESANGVWLDREKNCSMILQVIVDPEMRFCDIM 220
             F     LPNC GV+                G  L  +   S+++Q +VD   RF DI 
Sbjct: 184 PDFSP-NLLPNCYGVVGFGRF--------EVKGKLLGAKG--SILVQALVDSNGRFVDIS 243

Query: 221 AGWPGSLSDALVLQSSGFFKLSQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTP 280
           AGWP ++    + + +  F +++  E L+G   KL     +  YI+GDS  PLLPWL+TP
Sbjct: 244 AGWPSTMKPEAIFRQTKLFSIAE--EVLSGAPTKLGNGVLVPRYILGDSCLPLLPWLVTP 303

Query: 281 YQ-GKGLSDYQTEFNKRHYATRLVAQRALTRLKEMWKIIKGVMWKPDK-HRLPRIILVCC 340
           Y        ++ EFN   +      + A  +++  W+I+    WKP+    +P +I   C
Sbjct: 304 YDLTSDEESFREEFNNVVHTGLHSVEIAFAKVRARWRIL-DKKWKPETIEFMPFVITTGC 352

Query: 341 LLHNIVIDMEDE 350
           LLHN +++  D+
Sbjct: 364 LLHNFLVNSGDD 352

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
Q9M2U3          2.7e-147   62.99     Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1
Q94K49          5.6e-100   46.05     Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=...
Q6AZB8          1.8e-26    26.64     Putative nuclease HARBI1 OS=Danio rerio OX=7955 GN=harbi1 PE=2 SV=1
Q17QR8          6.6e-24    26.01     Putative nuclease HARBI1 OS=Bos taurus OX=9913 GN=HARBI1 PE=2 SV=1
B0BN95          6.6e-24    26.69     Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1

Match Name      E-value    Identity  Description
XP_022138922.1  1.36e-291  99.75     protein ALP1-like [Momordica charantia]
XP_038891834.1  1.55e-272  94.15     protein ALP1-like [Benincasa hispida]
XP_004147700.1  4.24e-272  92.88     protein ALP1-like [Cucumis sativus] >KGN50531.1 hypothetical protein Csa_000507 ...
XP_008461643.1  1.21e-271  92.62     PREDICTED: putative nuclease HARBI1 [Cucumis melo]
XP_022941714.1  1.21e-268  92.37     protein ALP1-like [Cucurbita moschata]

Match Name      E-value    Identity  Description
A0A6J1CCK2      6.57e-292  99.75     protein ALP1-like OS=Momordica charantia OX=3673 GN=LOC111009982 PE=3 SV=1
A0A0A0KS64      2.05e-272  92.88     DDE Tnp4 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G180900 PE...
A0A1S3CEZ1      5.88e-272  92.62     putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103500196 PE=3 SV=1
A0A6J1FP85      5.86e-269  92.37     protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111446995 PE=3 SV=1
A0A6J1J6K0      4.81e-268  92.11     protein ALP1-like OS=Cucurbita maxima OX=3661 GN=LOC111483923 PE=3 SV=1

Match Name      E-value    Identity  Description
AT3G55350.1     2.0e-148   62.99     PIF / Ping-Pong family of plant transposases
AT3G63270.1     4.0e-101   46.05     CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int...
AT5G12010.1     3.2e-34    27.69     unknown protein; INVOLVED IN: response to salt stress; LOCATED IN: chloroplast, ...
AT4G29780.1     2.9e-27    27.61     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G72270.1     2.0e-20    28.17     CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR0217...
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                Source   Source Term      Source Description  Alignment
IPR027806  Harbinger transposase-derived nuclease domain  PFAM     PF13359          DDE_Tnp_4           coord: 176..341; e-value: 4.7E-29; score: 101.1
None       No IPR available                               PANTHER  PTHR22930        UNCHARACTERIZED     coord: 1..389
None       No IPR available                               PANTHER  PTHR22930:SF205  PROTEIN ALP1-LIKE   coord: 1..389
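
The Pfam coordinates above place the DDE_Tnp_4 domain at residues 176..341 of the predicted protein. A short sketch of the arithmetic follows, under two assumptions that this page does not state explicitly: that the coordinates are 1-based and inclusive, and that the protein is 393 residues long (taken from the end of the BLAST query range). `parse_coord` and `span_length` are hypothetical helper names.

```python
import re

def parse_coord(text):
    """Hypothetical helper: pull (start, end) out of a 'coord: a..b' string."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError("no 'coord: start..end' field found")
    return int(m.group(1)), int(m.group(2))

def span_length(start, end):
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1

# Assumption: protein length inferred from the BLAST query range (1..393).
PROTEIN_LENGTH = 393

start, end = parse_coord("coord: 176..341")
domain_len = span_length(start, end)
coverage = domain_len / PROTEIN_LENGTH
print(domain_len, round(100 * coverage, 1))
```

Under those assumptions the domain spans 166 residues, roughly 42% of the protein.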

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
MC09g1194.1   MC09g1194.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0046872      metal ion binding