Moc03g01650 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc03g01650
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: BED-type domain-containing protein
Location: chr3: 1245877 .. 1248155 (+)
RNA-Seq Expression: Moc03g01650
Synteny: Moc03g01650
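
The Location field above can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the helper name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the page does not state explicitly.

```python
import re

# Pattern for strings such as "chr3: 1245877 .. 1248155 (+)".
_LOCATION_RE = re.compile(
    r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
)

def parse_location(text):
    """Split a location string into chromosome, start, end and strand.

    Hypothetical helper for illustration only. The length is computed
    under the assumption of 1-based, inclusive coordinates.
    """
    match = _LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,  # inclusive span (assumption)
    }

loc = parse_location("chr3: 1245877 .. 1248155 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 2279 bases on the forward strand of chr3.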
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCATCTTCATCAAATTCAAATCCATCCCAAGAAACAACCTCTTCTAGCATTGAAGATGATTCAAAACCTCTTTGGCAATATGTAACTAAAATTCAAAAATTGAGTGAGAGGGGTGGAAATTTTTCATGGCAATGCAACTTTTGTCAAGCCATCAAGAAGAGCTCTTACACAAGAGTTAGAGCTCACTTGTTAAAGATAAGTGGTCAAGGAATAGGGATATGTTTAAAAGTTACTCCTACAGATATTGCGGATATGGAGAAATTGGAAGAAGAAGCGAAAAATCGGAAGGAGAGAAAAGCCCCTAAAAATGTTCGTTTACCACCTTCATTCATATCAGTCGGTGGTGTTAATGTGAGTAATTCTCCTGGCACGAGTAATATTGAGCCAAAAAAGAGGAAAGGCACTCCAAGTGCAATTGAGAAGTCATTCAACAAGGCATCTCGAGATCAACTGAATGCACTCATTGCACGAATGTTTTATTCTGCTGGCTTGCCATTTCATTTAGCTAGAAACCCACACTTTAGAGGTGCCTTTAGTTATGCGGCGAACCATATGTTGACCGGATATGTACCTCCGGGATTTAATTCGTTGAGGACGAGTCTTTTACAACAAGAGAAGGCGAATATTGAGAGGTTATTAATACCAATTAAAGGTGAATGGCGTTTGAAAGGAGTGAGCATTGTGAGTGATGGATGGAGTGACTCACAAAGGAGGCCTTTAATTAACTTTATGGCCATCTCTGAAGGTAGACCGATGTTTTTGAAAGCAGTAGATTGTTCTGACGAGGTAAAAGACAAGTTTTTTATTGCTAATTTGATGAAAGAAGTGATCAATGAAGTTGGTCCTGACAATGTAGTGTAAGTGATAACCGATAATGCTCCTAATTGCAAAGGTGCAGGACAACTCATTGAAGGACAATTTCCAATGATTGTATGGACACCATGTGTAGTACATACATTGAATCTTGCCCTGAAGAACATTTGTGCTGCTAAGAATGTTGAAGACAATCAAATTGCATATGGAGAATGCAGCTGGATATCTAATGTTGATGGAGATGTCATGGTTGTGAAACATTTTATCATGAATCATTCTATGAGGTTGTCCATGTTTAATGAATTTGTACCTCTTAAATTGTTATATGTGGCTGAAACACGTTTTGCATCGATTATTGTTATGTTGAAGAGGTTTAAATTGATTAAAGGTGGCTTACAAGCCATGGTCATCAGTGACAAGTGGGCAAATTATAGAGAAGATGATGTGAGAAAGGCCCAACATGTAAAGGAGTTATTGCTTAATGATTTATGGTGGGACAAAATCGACTACATTCTTTCCTTCACTTCACCTATATACGACATGATTAGAGCTTGTGATACAGACAAGCCTTGTTTGCACTTGGTATATGATATGTGGGATACGATGATCGAAAAGGTGAAGAAAATCATATATAGACATGAAGGATTGCAGTCGAACGAAAATTCATCTTTTTATGATGTGGTGCACACCATTCTTGTTGATCGTTGGAACAAAAATAATACTCCACTACATTGTTTAGCACATTCTCTAAATCCAAGGTATTTACTGCTTAATTACTATCTATCATTTAATTCTTACTGAGATTAGCATGATTTATATTTAACATGAGAGTTTAACATTTTTGTAAAAACTTAAATTTAGGTATTATAGTGAAGAGTGGCTTGCAGAAGACTCTAATCGTGTGCCTCCGAGTCAAGATGTGGAATTAACTAGGGAGAGAATGAAGTTGTTGAAGAGGTATTTTATTAGCCCTCTGGATCGTACAAAGGTGATTACAGAATTTGCAAATTTCTCTACAAAATCCGGAGATTTTGCAGATTACGATTCTAATGGCGGATAGATATGTATTAGATCCTAGAAGTTGGTGGGCAACTTATGGTGTCTATGCACCCATGCTTCAAGCAATTGCCTTTAAATTACTTGGAACACCTTCTTCCCCCTCATGTTGCGAAAGAAATTGGAGC
ACATACTCTTTTGTTAACTCTGTCAAGAGGAATAAAATGACACATAAACGTACAGAAGATTTGGTATTCATCCATAGCAATCTTCGTCTCTTATCAAGAAGAACCCCAGAATATTCCCAAGGAGAGACTAAGTCATGGGATATTGCTGGAGATAGTTTCGATTCTCTTGAAGACGTTGGCATGCTTGAAGTAGCAAATTTGTCTTTAGATGAACCAGATTTAGAGGCTGAAATTCTTGACGAGGATGATCACGCTGACAAGGATGATATGGAGACTTGA

mRNA sequence

ATGGCATCTTCATCAAATTCAAATCCATCCCAAGAAACAACCTCTTCTAGCATTGAAGATGATTCAAAACCTCTTTGGCAATATGTAACTAAAATTCAAAAATTGAGTGAGAGGGGTGGAAATTTTTCATGGCAATGCAACTTTTGTCAAGCCATCAAGAAGAGCTCTTACACAAGAGTTAGAGCTCACTTGTTAAAGATAAGTGGTCAAGGAATAGGGATATGTTTAAAAGTTACTCCTACAGATATTGCGGATATGGAGAAATTGGAAGAAGAAGCGAAAAATCGGAAGGAGAGAAAAGCCCCTAAAAATGTTCGTTTACCACCTTCATTCATATCAGTCGGTGGTGTTAATGTGAGTAATTCTCCTGGCACGAGTAATATTGAGCCAAAAAAGAGGAAAGGCACTCCAAGTGCAATTGAGAAGTCATTCAACAAGGCATCTCGAGATCAACTGAATGCACTCATTGCACGAATGTTTTATTCTGCTGGCTTGCCATTTCATTTAGCTAGAAACCCACACTTTAGAGGTGCCTTTAGTTATGCGGCGAACCATATGTTGACCGGATATGTACCTCCGGGATTTAATTCGTTGAGGACGAGTCTTTTACAACAAGAGAAGGCGAATATTGAGAGGTTATTAATACCAATTAAAGGTGAATGGCGTTTGAAAGGAGTGAGCATTGTGAGTGATGGATGGAGTGACTCACAAAGGAGGCCTTTAATTAACTTTATGGCCATCTCTGAAGGACAACTCATTGAAGGACAATTTCCAATGATTGTATGGACACCATGTGTAGTACATACATTGAATCTTGCCCTGAAGAACATTTGTGCTGCTAAGAATGTTGAAGACAATCAAATTGCATATGGAGAATGCAGCTGGATATCTAATGTTGATGGAGATGTCATGGTTGTGAAACATTTTATCATGAATCATTCTATGAGGTTGTCCATGTTTAATGAATTTGTACCTCTTAAATTGTTATATGTGGCTGAAACACGTTTTGCATCGATTATTGTTATGTTGAAGAGGTTTAAATTGATTAAAGGTGGCTTACAAGCCATGGTCATCAGTGACAAGTGGGCAAATTATAGAGAAGATGATGTGAGAAAGGCCCAACATGTAAAGGAGTTATTGCTTAATGATTTATGGTGGGACAAAATCGACTACATTCTTTCCTTCACTTCACCTATATACGACATGATTAGAGCTTGTGATACAGACAAGCCTTGTTTGCACTTGGTATATGATATGTGGGATACGATGATCGAAAAGGTGAAGAAAATCATATATAGACATGAAGGATTGCAGTCGAACGAAAATTCATCTTTTTATGATGTGGTGCACACCATTCTTGTTGATCGTTGGAACAAAAATAATACTCCACTACATTGTTTAGCACATTCTCTAAATCCAAGGTATTATAGTGAAGAGTGGCTTGCAGAAGACTCTAATCGTGTGCCTCCGAGTCAAGATGTGGAATTAACTAGGGAGAGAATGAAGTTGTTGAAGAGATATGTATTAGATCCTAGAAGTTGGTGGGCAACTTATGGTGTCTATGCACCCATGCTTCAAGCAATTGCCTTTAAATTACTTGGAACACCTTCTTCCCCCTCATGTTGCGAAAGAAATTGGAGCACATACTCTTTTGTTAACTCTGTCAAGAGGAATAAAATGACACATAAACGTACAGAAGATTTGGTATTCATCCATAGCAATCTTCGTCTCTTATCAAGAAGAACCCCAGAATATTCCCAAGGAGAGACTAAGTCATGGGATATTGCTGGAGATAGTTTCGATTCTCTTGAAGACGTTGGCATGCTTGAAGTAGCAAATTTGTCTTTAGATGAACCAGATTTAGAGGCTGAAATTCTTGACGAGGATGATCACGCTGACAAGGATGATATGGAGACTTGA

Coding sequence (CDS)

ATGGCATCTTCATCAAATTCAAATCCATCCCAAGAAACAACCTCTTCTAGCATTGAAGATGATTCAAAACCTCTTTGGCAATATGTAACTAAAATTCAAAAATTGAGTGAGAGGGGTGGAAATTTTTCATGGCAATGCAACTTTTGTCAAGCCATCAAGAAGAGCTCTTACACAAGAGTTAGAGCTCACTTGTTAAAGATAAGTGGTCAAGGAATAGGGATATGTTTAAAAGTTACTCCTACAGATATTGCGGATATGGAGAAATTGGAAGAAGAAGCGAAAAATCGGAAGGAGAGAAAAGCCCCTAAAAATGTTCGTTTACCACCTTCATTCATATCAGTCGGTGGTGTTAATGTGAGTAATTCTCCTGGCACGAGTAATATTGAGCCAAAAAAGAGGAAAGGCACTCCAAGTGCAATTGAGAAGTCATTCAACAAGGCATCTCGAGATCAACTGAATGCACTCATTGCACGAATGTTTTATTCTGCTGGCTTGCCATTTCATTTAGCTAGAAACCCACACTTTAGAGGTGCCTTTAGTTATGCGGCGAACCATATGTTGACCGGATATGTACCTCCGGGATTTAATTCGTTGAGGACGAGTCTTTTACAACAAGAGAAGGCGAATATTGAGAGGTTATTAATACCAATTAAAGGTGAATGGCGTTTGAAAGGAGTGAGCATTGTGAGTGATGGATGGAGTGACTCACAAAGGAGGCCTTTAATTAACTTTATGGCCATCTCTGAAGGACAACTCATTGAAGGACAATTTCCAATGATTGTATGGACACCATGTGTAGTACATACATTGAATCTTGCCCTGAAGAACATTTGTGCTGCTAAGAATGTTGAAGACAATCAAATTGCATATGGAGAATGCAGCTGGATATCTAATGTTGATGGAGATGTCATGGTTGTGAAACATTTTATCATGAATCATTCTATGAGGTTGTCCATGTTTAATGAATTTGTACCTCTTAAATTGTTATATGTGGCTGAAACACGTTTTGCATCGATTATTGTTATGTTGAAGAGGTTTAAATTGATTAAAGGTGGCTTACAAGCCATGGTCATCAGTGACAAGTGGGCAAATTATAGAGAAGATGATGTGAGAAAGGCCCAACATGTAAAGGAGTTATTGCTTAATGATTTATGGTGGGACAAAATCGACTACATTCTTTCCTTCACTTCACCTATATACGACATGATTAGAGCTTGTGATACAGACAAGCCTTGTTTGCACTTGGTATATGATATGTGGGATACGATGATCGAAAAGGTGAAGAAAATCATATATAGACATGAAGGATTGCAGTCGAACGAAAATTCATCTTTTTATGATGTGGTGCACACCATTCTTGTTGATCGTTGGAACAAAAATAATACTCCACTACATTGTTTAGCACATTCTCTAAATCCAAGGTATTATAGTGAAGAGTGGCTTGCAGAAGACTCTAATCGTGTGCCTCCGAGTCAAGATGTGGAATTAACTAGGGAGAGAATGAAGTTGTTGAAGAGATATGTATTAGATCCTAGAAGTTGGTGGGCAACTTATGGTGTCTATGCACCCATGCTTCAAGCAATTGCCTTTAAATTACTTGGAACACCTTCTTCCCCCTCATGTTGCGAAAGAAATTGGAGCACATACTCTTTTGTTAACTCTGTCAAGAGGAATAAAATGACACATAAACGTACAGAAGATTTGGTATTCATCCATAGCAATCTTCGTCTCTTATCAAGAAGAACCCCAGAATATTCCCAAGGAGAGACTAAGTCATGGGATATTGCTGGAGATAGTTTCGATTCTCTTGAAGACGTTGGCATGCTTGAAGTAGCAAATTTGTCTTTAGATGAACCAGATTTAGAGGCTGAAATTCTTGACGAGGATGATCACGCTGACAAGGATGATATGGAGACTTGA

Protein sequence

MASSSNSNPSQETTSSSIEDDSKPLWQYVTKIQKLSERGGNFSWQCNFCQAIKKSSYTRVRAHLLKISGQGIGICLKVTPTDIADMEKLEEEAKNRKERKAPKNVRLPPSFISVGGVNVSNSPGTSNIEPKKRKGTPSAIEKSFNKASRDQLNALIARMFYSAGLPFHLARNPHFRGAFSYAANHMLTGYVPPGFNSLRTSLLQQEKANIERLLIPIKGEWRLKGVSIVSDGWSDSQRRPLINFMAISEGQLIEGQFPMIVWTPCVVHTLNLALKNICAAKNVEDNQIAYGECSWISNVDGDVMVVKHFIMNHSMRLSMFNEFVPLKLLYVAETRFASIIVMLKRFKLIKGGLQAMVISDKWANYREDDVRKAQHVKELLLNDLWWDKIDYILSFTSPIYDMIRACDTDKPCLHLVYDMWDTMIEKVKKIIYRHEGLQSNENSSFYDVVHTILVDRWNKNNTPLHCLAHSLNPRYYSEEWLAEDSNRVPPSQDVELTRERMKLLKRYVLDPRSWWATYGVYAPMLQAIAFKLLGTPSSPSCCERNWSTYSFVNSVKRNKMTHKRTEDLVFIHSNLRLLSRRTPEYSQGETKSWDIAGDSFDSLEDVGMLEVANLSLDEPDLEAEILDEDDHADKDDMET
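
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch under stated assumptions: it uses the standard genetic code (NCBI translation table 1), which the page does not name explicitly, and `translate` is a hypothetical helper, not a function of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI table 1, an assumption), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 11 codons of the CDS listed above.
print(translate("ATGGCATCTTCATCAAATTCAAATCCATCCCAA"))  # MASSSNSNPSQ
```

Applied to the first 33 bases of the CDS, the routine reproduces the first 11 residues of the protein sequence; the final codons `GATATGGAGACTTGA` likewise give `DMET` followed by the stop codon.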
Homology
BLAST of Moc03g01650 vs. NCBI nr
Match: XP_038721052.1 (uncharacterized protein LOC120013346 isoform X1 [Tripterygium wilfordii] >XP_038721053.1 uncharacterized protein LOC120013346 isoform X1 [Tripterygium wilfordii])

HSP 1 Score: 817.0 bits (2109), Expect = 1.1e-232
Identity = 420/718 (58.50%), Positives = 498/718 (69.36%), Query Frame = 0

Query: 2   ASSSNSNPSQETTSSSIEDDSKPLWQYVTKIQKLSERGGNFSWQCNFCQAIKKSSYTRVR 61
           A+ S S PS   +S  +ED  KPLW+YV + +K    GGN SW CNFC   K +SYTRVR
Sbjct: 13  ATGSASTPS--ASSGGVEDSGKPLWRYVKRQEKDGGGGGNVSWDCNFCHMQKTTSYTRVR 72

Query: 62  AHLLKISGQGIGICLKVTPTDIADMEKLEEEAKNRKERKAPKNVRLPPSFISVGGVNVSN 121
           AHLLK+   GI  C  VT  DIA+M++LEEEAKNR+   APK V LPPS +  GG+    
Sbjct: 73  AHLLKVPRLGISGCKNVTSKDIAEMQRLEEEAKNREMASAPKKVPLPPSSMFSGGMY--- 132

Query: 122 SPGTSNIEPKKRK-----GTPSAIEKSFNKASRDQLNALIARMFYSAGLPFHLARNPHFR 181
                  E KKRK       PS +EKSFN  +R+QL+ALIAR FY++GLPFHLAR+P++ 
Sbjct: 133 ---AIGFESKKRKEVGSSNPPSPLEKSFNLQARNQLHALIARFFYASGLPFHLARSPYYV 192

Query: 182 GAFSYAANHMLTGYVPPGFNSLRTSLLQQEKANIERLLIPIKGEWRLKGVSIVSDGWSDS 241
             F +A +H L GY+PPG+N LRT+LLQQEKAN+ERLL PIK  WR KGVSIVSDGWSDS
Sbjct: 193 SMFLFACSHNLAGYLPPGYNLLRTTLLQQEKANVERLLQPIKSTWREKGVSIVSDGWSDS 252

Query: 242 QRRPLINFMAISE----------------------------------------------- 301
           QRRPLINFMA+SE                                               
Sbjct: 253 QRRPLINFMAVSESGPMFLKAVDCSGETKDKFFIFNLMKEVIEEVGPQNVVQVITDNAKN 312

Query: 302 ----GQLIEGQFPMIVWTPCVVHTLNLALKNICAAKNVEDNQIAYGECSWISNVDGDVMV 361
               G L+EG +P IVWTPCVVHTLNLAL+NICAAKN+E+NQ+ Y ECSWI+ V GDV +
Sbjct: 313 CAGAGLLVEGLYPNIVWTPCVVHTLNLALQNICAAKNLENNQVVYDECSWITIVSGDVTI 372

Query: 362 VKHFIMNHSMRLSMFNEFVPLKLLYVAETRFASIIVMLKRFKLIKGGLQAMVISDKWANY 421
           +K+FIMNHSMRL++FNEFVPLKLL +A TRFAS++VMLKRF LIK  L +MVIS++W +Y
Sbjct: 373 IKNFIMNHSMRLAIFNEFVPLKLLSIATTRFASVLVMLKRFMLIKRSLMSMVISEQWNSY 432

Query: 422 REDDVRKAQHVKELLLNDLWWDKIDYILSFTSPIYDMIRACDTDKPCLHLVYDMWDTMIE 481
           REDD  KA+ VKE +L+D+WWD IDYIL FT+PIYDM+RACDTDKPCLHLVYDMWD+MIE
Sbjct: 433 REDDAGKAKFVKEKVLDDVWWDSIDYILKFTAPIYDMLRACDTDKPCLHLVYDMWDSMIE 492

Query: 482 KVKKIIYRHEGLQSNENSSFYDVVHTILVDRWNKNNTPLHCLAHSLNPRYYSEEWLAEDS 541
           KV+  IYR EG +  E+S FYDVVH ILV RWNKNNTPLHCLAHSLNPRYYSE+WL ED 
Sbjct: 493 KVRLAIYRKEGKRVEESSKFYDVVHGILVSRWNKNNTPLHCLAHSLNPRYYSEKWLNEDP 552

Query: 542 NRVPPSQDVELTRERMKLLKRYV----------------------------------LDP 601
            RVPP +DVE+ RERMK LK+Y                                   +DP
Sbjct: 553 TRVPPHKDVEVARERMKCLKKYFSNSEERMMVTMEFVNFSSMSGDFADSDSIHHRNDMDP 612

Query: 602 RSWWATYGVYAPMLQAIAFKLLGTPSSPSCCERNWSTYSFVNSVKRNKMTHKRTEDLVFI 630
           +SWW T+G  AP+LQ +A K+L  PSS SC ERNWSTYSFV+SV+RN+M  KR EDLVFI
Sbjct: 613 KSWWVTFGASAPLLQNLALKILVQPSSSSCAERNWSTYSFVHSVRRNQMAPKRAEDLVFI 672
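
The two summary lines that open each HSP (score and E-value, then identity and positive counts) follow a fixed pattern, so they can be extracted and cross-checked. The sketch below is illustrative only: `parse_hsp_summary` is a hypothetical helper, and the pattern is inferred solely from the lines shown on this page.

```python
import re

_SCORE_RE = re.compile(
    r"HSP\s+(\d+)\s+Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
# "Posi?tives" accepts both "Positives" and the misspelt "Postives".
_IDENT_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)

def parse_hsp_summary(score_line, identity_line):
    """Extract the numeric fields from the two HSP summary lines."""
    s = _SCORE_RE.search(score_line)
    i = _IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("HSP summary lines not recognised")
    return {
        "hsp": int(s.group(1)),
        "bits": float(s.group(2)),
        "raw_score": int(s.group(3)),
        "evalue": float(s.group(4)),
        "identical": int(i.group(1)),
        "aligned": int(i.group(2)),
        "positives": int(i.group(4)),
        # Percentages recomputed from the counts, for cross-checking.
        "identity_pct": round(100 * int(i.group(1)) / int(i.group(2)), 2),
        "positive_pct": round(100 * int(i.group(4)) / int(i.group(5)), 2),
    }

hsp = parse_hsp_summary(
    "HSP 1 Score: 817.0 bits (2109), Expect = 1.1e-232",
    "Identity = 420/718 (58.50%), Positives = 498/718 (69.36%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positive_pct"])
```

Recomputing the percentages from the raw counts gives a quick consistency check on the reported values for each hit.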

BLAST of Moc03g01650 vs. NCBI nr
Match: KAG5532188.1 (hypothetical protein RHGRI_026721 [Rhododendron griersonianum])

HSP 1 Score: 815.1 bits (2104), Expect = 4.3e-232
Identity = 424/720 (58.89%), Positives = 507/720 (70.42%), Query Frame = 0

Query: 4   SSNSNPSQETTSSSIEDDSKPLWQYVTKIQKLSERGGNFSWQCNFCQAIKKSSYTRVRAH 63
           SS+S PSQ        D + PLW YVTK+QKLS  GGN  WQCNFC  IKKSSYTRV+ H
Sbjct: 14  SSSSQPSQPNEG----DANNPLWSYVTKLQKLSGAGGNTQWQCNFCGVIKKSSYTRVKGH 73

Query: 64  LLKISGQGIGICLKVTPTDIADMEKLEEEAKNRKERKAPKNVRLPPSFISVGGVNVSNSP 123
           LLK+S  GIG C KVT   +A M+KLE+EAK + +  APK V LPPS  S  G  +    
Sbjct: 74  LLKLS-NGIGPCTKVTNEALATMKKLEDEAKAKMKDNAPKRVPLPPSTRS--GFVLEGYD 133

Query: 124 GTSNIEPKKRKGTPSAIEKSFNKASRDQLNALIARMFYSAGLPFHLARNPHFRGAFSYAA 183
                    R  T S +EK+++K  RDQL+A IARMFYS G+PF+LARNP++  ++ +AA
Sbjct: 134 AKKQRTSGGRTETISPLEKAYDKNKRDQLHAEIARMFYSGGVPFNLARNPYYVTSYQFAA 193

Query: 184 NHMLTGYVPPGFNSLRTSLLQQEKANIERLLIPIKGEWRLKGVSIVSDGWSDSQRRPLIN 243
           N+ L+GY+PPG+N LRT+LLQQEK N+ERLL+PIKG WR KGVSIVSDGWSDSQRRPLIN
Sbjct: 194 NNPLSGYIPPGYNLLRTTLLQQEKTNVERLLLPIKGTWREKGVSIVSDGWSDSQRRPLIN 253

Query: 244 FMAISE---------------------------------------------------GQL 303
           FMA++E                                                   G L
Sbjct: 254 FMAVTEGGPMFLKAVDCSGETKDKYFICELMREVIEEVGPDNVVQVITDNAKNCAGAGLL 313

Query: 304 IEGQFPMIVWTPCVVHTLNLALKNICAAKNVEDNQIAYGECSWISNVDGDVMVVKHFIMN 363
           IEG +P I WTPCVVHTLNLAL+NICAAKNVE+NQ+ Y ECSWI+ +  DV  +K+FIMN
Sbjct: 314 IEGLYPNISWTPCVVHTLNLALQNICAAKNVENNQVTYDECSWITIIADDVSFIKNFIMN 373

Query: 364 HSMRLSMFNEFVPLKLLYVAETRFASIIVMLKRFKLIKGGLQAMVISDKWANYREDDVRK 423
           HSMRL++FN+FVPLKLL VA TRFAS++VMLKRFKL+K  LQ MVIS +W +YREDD  K
Sbjct: 374 HSMRLAIFNDFVPLKLLSVASTRFASVMVMLKRFKLLKASLQTMVISPRWNSYREDDTGK 433

Query: 424 AQHVKELLLNDLWWDKIDYILSFTSPIYDMIRACDTDKPCLHLVYDMWDTMIEKVKKIIY 483
           A+ VKE +L+D+WWD IDYILSFTSP+YDM+R CDTDKPCLHLVYDMWDTMIEKVK  IY
Sbjct: 434 AKFVKEKVLDDIWWDSIDYILSFTSPMYDMLRICDTDKPCLHLVYDMWDTMIEKVKVAIY 493

Query: 484 RHEGLQSNENSSFYDVVHTILVDRWNKNNTPLHCLAHSLNPRYYSEEWLAEDSNRVPPSQ 543
           RHEG +  ++S+FYDVVHTILVDRWNKN+TPLHCLAHSLNPRYYS+EWL E  +RVPP +
Sbjct: 494 RHEGKRHEDSSTFYDVVHTILVDRWNKNSTPLHCLAHSLNPRYYSDEWLNEGPSRVPPHK 553

Query: 544 DVELTRERMKLLK----------------------------------RYVLDPRSWWATY 603
           DVE+ RERMK +K                                  RY +DP+SWW TY
Sbjct: 554 DVEVARERMKCMKKYFPNSVDRSKVNLEFANFSSKAGEFADSDSIHDRYAMDPKSWWVTY 613

Query: 604 GVYAPMLQAIAFKLLGTPSSPSCCERNWSTYSFVNSVKRNKMTHKRTEDLVFIHSNLRLL 638
           G  AP+LQ+IA KLL  PSS S C+RNWSTYSFV+S KRNKMT KR EDLVFIHSNLRLL
Sbjct: 614 GASAPLLQSIALKLLVQPSS-SSCKRNWSTYSFVHSAKRNKMTPKRAEDLVFIHSNLRLL 673

BLAST of Moc03g01650 vs. NCBI nr
Match: KAG5522171.1 (hypothetical protein RHGRI_034377 [Rhododendron griersonianum])

HSP 1 Score: 811.2 bits (2094), Expect = 6.3e-231
Identity = 415/719 (57.72%), Positives = 501/719 (69.68%), Query Frame = 0

Query: 2   ASSSNSNPSQETTSSSIEDDSKPLWQYVTKIQKLSERGGNFSWQCNFCQAIKKSSYTRVR 61
           ++ S S     ++ ++  D + PLW YVTK++KL + GGN  WQCNFC  +KKSSYTRVR
Sbjct: 7   SAGSGSGSISSSSQTNAVDTNNPLWSYVTKLEKLGDHGGNTLWQCNFCSVVKKSSYTRVR 66

Query: 62  AHLLKISGQGIGICLKVTPTDIADMEKLEEEAKNRKERKAPKNVRLPPSFISVGGVNVSN 121
            HLLK+S  GIG C  VT   +A M+KL+EEAK R +   PK V LPPS  S G V    
Sbjct: 67  GHLLKLS-NGIGPCKNVTKEVLATMKKLDEEAKARMKENEPKKVPLPPSTRSSGFV---- 126

Query: 122 SPGTSNIEPKK------RKGTPSAIEKSFNKASRDQLNALIARMFYSAGLPFHLARNPHF 181
                  E KK      R  T S +EK+F++  RDQL+A IARMFYS G+PF+LARNP++
Sbjct: 127 ---LEGYETKKQRTSGSRSETISPVEKAFDRGKRDQLHAEIARMFYSGGVPFNLARNPYY 186

Query: 182 RGAFSYAANHMLTGYVPPGFNSLRTSLLQQEKANIERLLIPIKGEWRLKGVSIVSDGWSD 241
             ++ +AAN+ L+GY+PPG+N LRT+LLQQEK N+ERLL PIKG WR KGVSIVSDGWSD
Sbjct: 187 VSSYQFAANNPLSGYIPPGYNLLRTTLLQQEKTNVERLLQPIKGTWREKGVSIVSDGWSD 246

Query: 242 SQRRPLINFMAISE---------------------------------------------- 301
           SQRRPLINFMA++E                                              
Sbjct: 247 SQRRPLINFMAVTEGGPMFLKAVDCSGDTKDKYFICDLLKDVIEEVGPENVVQVITDNAK 306

Query: 302 -----GQLIEGQFPMIVWTPCVVHTLNLALKNICAAKNVEDNQIAYGECSWISNVDGDVM 361
                G LIEG +P   WTPCVVHTLNLAL+NICAAKNVE+NQ+ Y ECSWI+ +  DV 
Sbjct: 307 NCAGAGLLIEGLYPNKSWTPCVVHTLNLALQNICAAKNVENNQVTYDECSWITIIADDVS 366

Query: 362 VVKHFIMNHSMRLSMFNEFVPLKLLYVAETRFASIIVMLKRFKLIKGGLQAMVISDKWAN 421
            +K++IMNHSMRL++FN+FVPLKLL VA TRFAS++VMLKRFKL+K  LQ MVIS +W +
Sbjct: 367 FIKNYIMNHSMRLAIFNDFVPLKLLSVASTRFASVMVMLKRFKLLKTSLQTMVISPRWNS 426

Query: 422 YREDDVRKAQHVKELLLNDLWWDKIDYILSFTSPIYDMIRACDTDKPCLHLVYDMWDTMI 481
           YREDD  KA+ VKE +L+D+WWD+IDYILSFTSP+YDM+R CDTDKPCLHLVYDMWDTMI
Sbjct: 427 YREDDTGKAKFVKEKVLDDIWWDEIDYILSFTSPMYDMLRICDTDKPCLHLVYDMWDTMI 486

Query: 482 EKVKKIIYRHEGLQSNENSSFYDVVHTILVDRWNKNNTPLHCLAHSLNPRYYSEEWLAED 541
           EKVK  IYRHEG +  ++S+FY+VV+ ILVDRWNKNNTPLHCLAHSLNPRYYS+EWL E 
Sbjct: 487 EKVKVAIYRHEGKRHEDSSTFYEVVYAILVDRWNKNNTPLHCLAHSLNPRYYSDEWLHEG 546

Query: 542 SNRVPPSQDVELTRERMKLLK----------------------------------RYVLD 601
            +RVPP +DVE+ RERMK +K                                  RY +D
Sbjct: 547 PSRVPPHKDVEVARERMKCMKRYFPNSADRSKANMEFANFSSKAGEFGDSDSIHDRYAMD 606

Query: 602 PRSWWATYGVYAPMLQAIAFKLLGTPSSPSCCERNWSTYSFVNSVKRNKMTHKRTEDLVF 630
           P+SWW TYG  AP+LQ++A KLL  PSS SC ERNWSTYSFV+S KRNKMT KR EDLVF
Sbjct: 607 PKSWWVTYGASAPLLQSVALKLLVQPSSSSCSERNWSTYSFVHSAKRNKMTPKRAEDLVF 666

BLAST of Moc03g01650 vs. NCBI nr
Match: XP_030544727.1 (uncharacterized protein LOC115751129 [Rhodamnia argentea])

HSP 1 Score: 796.6 bits (2056), Expect = 1.6e-226
Identity = 404/706 (57.22%), Positives = 493/706 (69.83%), Query Frame = 0

Query: 15  SSSIEDDSKPLWQYVTKIQKLSERGGNFSWQCNFCQAIKKSSYTRVRAHLLKISGQGIGI 74
           SSS ED +KPLW+++TK+    + GGN SW+CNFC+   KSSYTR+RAHLL ISG+GI  
Sbjct: 3   SSSNEDLNKPLWRFITKLANHGDAGGNCSWKCNFCEKDFKSSYTRIRAHLLMISGRGIAK 62

Query: 75  CLKVTPTDIADMEKLEEEAKNRKERKAPKNVRLPPSFISVGGVNVSNSPGTSNIEPKKRK 134
           C KV   D+ +ME+L++E  +R +  AP+NV LPPS       + S+    S  + KKRK
Sbjct: 63  CGKVEKQDLLEMERLDKEVNDRLQSNAPRNVPLPPS-------SCSSVQMDSTFDSKKRK 122

Query: 135 ---------GTPSAIEKSFNKASRDQLNALIARMFYSAGLPFHLARNPHFRGAFSYAANH 194
                    G+  AIEK+FN + R+ L+ LIARMFYSAGLPF+LA NP+F  AF+YAANH
Sbjct: 123 MGSGSGSGSGSGGAIEKAFNISQREHLDHLIARMFYSAGLPFNLANNPYFHEAFTYAANH 182

Query: 195 MLTGYVPPGFNSLRTSLLQQEKANIERLLIPIKGEWRLKGVSIVSDGWSDSQRRPLINFM 254
            + GYVPP FN LRT+LLQ+EKANIE L  PIK  W+ KGVSIVSDGWSDSQRRPLINFM
Sbjct: 183 NIAGYVPPKFNLLRTTLLQKEKANIESLCTPIKKMWKDKGVSIVSDGWSDSQRRPLINFM 242

Query: 255 AISE---------------------------------------------------GQLIE 314
           A+S+                                                   GQLIE
Sbjct: 243 AVSDGNPMFIKSVDCSNERKDMHFIFNLLKEVIIEVGHENVVQVITDNASNCKGAGQLIE 302

Query: 315 GQFPMIVWTPCVVHTLNLALKNICAAKNVEDNQIAYGECSWISNVDGDVMVVKHFIMNHS 374
            +FP IVWTPCVVHTLNLALKNICAAKNVE N++AY EC+WI+ +  DVM +K+FIMNHS
Sbjct: 303 QEFPSIVWTPCVVHTLNLALKNICAAKNVEANEVAYDECNWITKIVDDVMQIKNFIMNHS 362

Query: 375 MRLSMFNEFVPLKLLYVAETRFASIIVMLKRFKLIKGGLQAMVISDKWANYREDDVRKAQ 434
           MRL+++NEF+ LKLL VA+TRFAS+IVMLKRFKLIK GLQ MVI +KW+ YR+D+  +A+
Sbjct: 363 MRLAIYNEFIHLKLLSVADTRFASMIVMLKRFKLIKNGLQNMVICEKWSTYRDDNQGRAR 422

Query: 435 HVKELLLNDLWWDKIDYILSFTSPIYDMIRACDTDKPCLHLVYDMWDTMIEKVKKIIYRH 494
            VKE +L+DLWWD IDYILSFT+PIYDM+R CDTDKPCLHLVYDMWD MIEKVKK+IY H
Sbjct: 423 FVKEKVLDDLWWDSIDYILSFTAPIYDMLRVCDTDKPCLHLVYDMWDLMIEKVKKVIYMH 482

Query: 495 EGLQSNENSSFYDVVHTILVDRWNKNNTPLHCLAHSLNPRYYSEEWLAEDSNRVPPSQDV 554
           E  + +E S+FY+VVH ILVDRW +NNTPLHCLAH+LNP+YYS++WL EDS RV P  D 
Sbjct: 483 EVKRLHEESTFYEVVHKILVDRWTRNNTPLHCLAHALNPKYYSDQWLDEDSTRVSPHMDY 542

Query: 555 ELTRERMKLLK----------------------------------RYVLDPRSWWATYGV 614
           E+  ER K L+                                  RY ++P+ WW  +G 
Sbjct: 543 EINEERKKCLRKYFPNEDELYKVSVEYADFACNSGRFQDPHSIRDRYFMEPKMWWVEHGA 602

Query: 615 YAPMLQAIAFKLLGTPSSPSCCERNWSTYSFVNSVKRNKMTHKRTEDLVFIHSNLRLLSR 627
            APMLQ IAFKLL  PSS SC ERNWSTYSFV S +R++MT +R EDLVFIHSNLRLLSR
Sbjct: 603 CAPMLQKIAFKLLAQPSSSSCAERNWSTYSFVQSARRHRMTPRRAEDLVFIHSNLRLLSR 662

BLAST of Moc03g01650 vs. NCBI nr
Match: XP_028124679.1 (uncharacterized protein LOC114321673 [Camellia sinensis])

HSP 1 Score: 787.7 bits (2033), Expect = 7.4e-224
Identity = 402/717 (56.07%), Positives = 490/717 (68.34%), Query Frame = 0

Query: 4   SSNSNPSQETTSSSIEDDSKPLWQYVTKIQKLSERGGNFSWQCNFCQAIKKSSYTRVRAH 63
           SSN  P  +       D++KPLW+Y+TK+ K  E G N   QC FC    K SYTR RAH
Sbjct: 6   SSNDIPDDDF------DENKPLWKYITKLAKPGEGGRNCQLQCKFCNNTFKGSYTRARAH 65

Query: 64  LLKISGQGIGICLKVTPTDIADMEKLEEEAKNRKERKAPKNVRLPPSFISVGGVNVSNSP 123
           LLK+ G+GIG C K+T   ++ ++   + A+ R +   PKNV LPPS +SVG      S 
Sbjct: 66  LLKLPGKGIGGCKKITSQKLSQLQNENDAAELRIKNALPKNVPLPPSTVSVGSSYAYESK 125

Query: 124 GTSNIEPKKRKGTP------SAIEKSFNKASRDQLNALIARMFYSAGLPFHLARNPHFRG 183
                E     G+       SAIEK+FN   R+QL+  IARMFYS G+PF+LARNP++  
Sbjct: 126 KRRTCESGSGSGSAPGSGSGSAIEKAFNLKDREQLHFEIARMFYSGGVPFNLARNPYYVS 185

Query: 184 AFSYAANHMLTGYVPPGFNSLRTSLLQQEKANIERLLIPIKGEWRLKGVSIVSDGWSDSQ 243
           ++++AANH + GY+PPG+N LRT+LLQ+E+ANI+RLL PI+G W+ KGVSIVSDGWSDSQ
Sbjct: 186 SYTFAANHNIPGYLPPGYNLLRTTLLQKERANIDRLLQPIRGTWKEKGVSIVSDGWSDSQ 245

Query: 244 RRPLINFMAISE------------------------------------------------ 303
           RRPLINFMA+SE                                                
Sbjct: 246 RRPLINFMAVSESGPMFIKAVDCSGETKDKYFIANLMKEVINEVGASNVVQVITDNAPNC 305

Query: 304 ---GQLIEGQFPMIVWTPCVVHTLNLALKNICAAKNVEDNQIAYGECSWISNVDGDVMVV 363
              GQLIE QFP I+WTPCVVHTLNLALKNICAAKN+E N++ Y ECSWI+++ GD +++
Sbjct: 306 KAAGQLIEAQFPHILWTPCVVHTLNLALKNICAAKNIERNEVTYEECSWITDIAGDGLMI 365

Query: 364 KHFIMNHSMRLSMFNEFVPLKLLYVAETRFASIIVMLKRFKLIKGGLQAMVISDKWANYR 423
           K+FI NHSMRL+M+NEFV LKLL VAETRFAS IVMLKR KLIK GLQA+VISDKW  YR
Sbjct: 366 KNFISNHSMRLAMYNEFVSLKLLSVAETRFASTIVMLKRLKLIKRGLQAIVISDKWNCYR 425

Query: 424 EDDVRKAQHVKELLLNDLWWDKIDYILSFTSPIYDMIRACDTDKPCLHLVYDMWDTMIEK 483
           EDDV  A+ VK+ +L+D+WWD IDYILSF  PIYDM+RACDTDKPCLHLVYDMWD+MIEK
Sbjct: 426 EDDVGNAKFVKKKILDDVWWDYIDYILSFMGPIYDMLRACDTDKPCLHLVYDMWDSMIEK 485

Query: 484 VKKIIYRHEGLQSNENSSFYDVVHTILVDRWNKNNTPLHCLAHSLNPRYYSEEWLAEDSN 543
           VK  IYRHEG + +E S+FY+VVH ILVDRWNKNNTPLHC+AHSLN +YYS+EWL ED N
Sbjct: 486 VKMSIYRHEGKRIDEESTFYNVVHQILVDRWNKNNTPLHCMAHSLNSKYYSDEWLHEDQN 545

Query: 544 RVPPSQDVELTRERMKLLKRYV----------------------------------LDPR 603
           RVPP +D E+TRER K  KRY                                   +DP 
Sbjct: 546 RVPPYRDEEVTRERSKCFKRYFEDITERTKVITEFGKFSGYVEAFSEYDSIHNRWHMDPY 605

Query: 604 SWWATYGVYAPMLQAIAFKLLGTPSSPSCCERNWSTYSFVNSVKRNKMTHKRTEDLVFIH 630
           +WW  Y  YAP LQ +  +LL  P+S SCCERNWSTYSF++SV+RNKM  +R EDLVFIH
Sbjct: 606 TWWCAYSAYAPSLQKLVLRLLVQPASSSCCERNWSTYSFIHSVRRNKMVPQRAEDLVFIH 665

BLAST of Moc03g01650 vs. ExPASy TrEMBL
Match: A0A5B7AFB0 (Uncharacterized protein OS=Davidia involucrata OX=16924 GN=Din_023134 PE=4 SV=1)

HSP 1 Score: 841.6 bits (2173), Expect = 2.1e-240
Identity = 434/727 (59.70%), Positives = 523/727 (71.94%), Query Frame = 0

Query: 1   MASSSNSNPSQETTSSSIEDDSKPLWQYVTKIQKLSERGGNFSWQCNFCQAIKKSSYTRV 60
           MAS+  S  +  ++SS  ED +KPLW+YV K  KLS+ GGN SWQCNFC  +KKSSYTRV
Sbjct: 1   MASTGAS--TAPSSSSQDEDINKPLWKYVAKFDKLSDGGGNISWQCNFCHQVKKSSYTRV 60

Query: 61  RAHLLKISGQGIGICLKVTPTDIADMEKLEEEAKNRKERKAPKNVRLPPSFISVGGVNVS 120
           RAHLL++ GQGI  C KVT  DI +M+KLE+E K R +  A K V LP S IS+ G   S
Sbjct: 61  RAHLLRLPGQGIVACSKVTTKDILEMQKLEDEVKLRLKSNALKKVPLPHSIISLCG---S 120

Query: 121 NSPGTSNIEPKKRKGTPSA----IEKSFNKASRDQLNALIARMFYSAGLPFHLARNPHFR 180
            S      + KKRK T S     +EK+FN  + +QL+A IARMFYS+GLPFHLARNP++ 
Sbjct: 121 TSFSQEGYDSKKRKTTSSGSGNPLEKTFNMVAHEQLHAEIARMFYSSGLPFHLARNPYYV 180

Query: 181 GAFSYAANHMLTGYVPPGFNSLRTSLLQQEKANIERLLIPIKGEWRLKGVSIVSDGWSDS 240
            +F++AAN+ + GY+PPG+N LRT+LLQ EK NIERLL PIKG W+ KGVSIVSDGWS+S
Sbjct: 181 SSFTFAANNPIMGYLPPGYNLLRTTLLQIEKENIERLLQPIKGTWKEKGVSIVSDGWSNS 240

Query: 241 QRRPLINFMAISE----------------------------------------------- 300
           QRRPLINFMA++E                                               
Sbjct: 241 QRRPLINFMAVTEDGPMFLKVVDCSGETKDKYFIANLMREVINEVGHENVIQIITDNAPN 300

Query: 301 ----GQLIEGQFPMIVWTPCVVHTLNLALKNICAAKNVEDNQIAYGECSWISNVDGDVMV 360
               GQ+IE QF  I WTPCVVHTLNLALKNICAAKNVE+NQ+ Y ECSWIS++ GDVM 
Sbjct: 301 CKGAGQMIESQFSNIFWTPCVVHTLNLALKNICAAKNVENNQLTYNECSWISDIAGDVMQ 360

Query: 361 VKHFIMNHSMRLSMFNEFVPLKLLYVAETRFASIIVMLKRFKLIKGGLQAMVISDKWANY 420
           +KHFIMNHS+RL MFNEFV LKLL VA+TRFAS+IVM +RFKLIK GLQAMVISDKW+ Y
Sbjct: 361 IKHFIMNHSLRLVMFNEFVTLKLLSVADTRFASVIVMFRRFKLIKHGLQAMVISDKWSYY 420

Query: 421 REDDVRKAQHVKELLLNDLWWDKIDYILSFTSPIYDMIRACDTDKPCLHLVYDMWDTMIE 480
           +EDDV + + VKE +LND+WWD IDYILSFT+PIY+M++ACDTDKPCLHLVYDMWD+M+E
Sbjct: 421 QEDDVGRGRFVKEKVLNDIWWDSIDYILSFTTPIYEMLKACDTDKPCLHLVYDMWDSMME 480

Query: 481 KVKKIIYRHEGLQSNENSSFYDVVHTILVDRWNKNNTPLHCLAHSLNPRYYSEEWLAEDS 540
           KVK  IYRHE  +  E+S+FYDVVH ILVDRWNKNNTPLHCLAHSLNP+YYS EWL E+ 
Sbjct: 481 KVKIAIYRHEEKRYEESSTFYDVVHNILVDRWNKNNTPLHCLAHSLNPKYYSNEWLHENP 540

Query: 541 NRVPPSQDVELTRERMKLLK----------------------------------RYVLDP 600
           NRVPP ++ E+++ER+K LK                                  RY++DP
Sbjct: 541 NRVPPYKNFEISQERLKCLKRYFSNSEDRTKVTVEYAKFSTKSGDFGKVDSIHDRYIMDP 600

Query: 601 RSWWATYGVYAPMLQAIAFKLLGTPSSPSCCERNWSTYSFVNSVKRNKMTHKRTEDLVFI 639
           +SWW  +G  APMLQ++A KLL  PSS SCCERNWSTYSFV+SV+RNKMT +  EDLVF+
Sbjct: 601 KSWWVIHGSSAPMLQSLALKLLVQPSSSSCCERNWSTYSFVHSVRRNKMTPQCAEDLVFV 660

BLAST of Moc03g01650 vs. ExPASy TrEMBL
Match: A0A443N8D6 (DUF659 domain-containing protein/Dimer_Tnp_hAT domain-containing protein OS=Cinnamomum micranthum f. kanehirae OX=337451 GN=CKAN_00314100 PE=4 SV=1)

HSP 1 Score: 786.9 bits (2031), Expect = 6.1e-224
Identity = 424/722 (58.73%), Positives = 486/722 (67.31%), Query Frame = 0

Query: 5   SNSNPSQETTSSSIEDDSKPLWQYVTKIQKLSERGGNFSWQCNFCQAIKKSSYTRVRAHL 64
           S+S+PS E      +D +KPLW+YVTK +KL+E GGN +WQ                   
Sbjct: 67  SSSHPSNE------KDINKPLWKYVTKFEKLNEGGGNITWQ------------------- 126

Query: 65  LKISGQGIGICLKVTPTDIADMEKLEEEAKNRKERKAPKNVRLPPSFISVGGVNVS-NSP 124
                                     EE K R +  APK V LP   ++V   ++S NS 
Sbjct: 127 -------------------------YEEVKLRMKANAPKKVPLPVPSVAVSSNSMSMNSI 186

Query: 125 GTSNIEPKKRK----GTPSAIEKSFNKASRDQLNALIARMFYSAGLPFHLARNPHFRGAF 184
                + KKRK    G  + IEK+FN  + DQL+A IARMFYSAGLPFHLARNPHF  AF
Sbjct: 187 MHGGFDSKKRKTSGSGNSNPIEKAFNIGAHDQLHAEIARMFYSAGLPFHLARNPHFVNAF 246

Query: 185 SYAANHMLTGYVPPGFNSLRTSLLQQEKANIERLLIPIKGEWRLKGVSIVSDGWSDSQRR 244
           ++AAN  LTGYVPPG+N LRTSLLQ+EKANIERLL PIKG WR KGVSIVSDGWSDSQRR
Sbjct: 247 TFAANSPLTGYVPPGYNMLRTSLLQREKANIERLLQPIKGTWREKGVSIVSDGWSDSQRR 306

Query: 245 PLINFMAISE-------------------------------------------------- 304
           PLI+FMA++E                                                  
Sbjct: 307 PLIHFMAVTEGGPMFLKAVDCSGETKDKYFIANLMKEVINDVGHENVVQVITDNAPNCKG 366

Query: 305 -GQLIEGQFPMIVWTPCVVHTLNLALKNICAAKNVEDNQIAYGECSWISNVDGDVMVVKH 364
            GQ+IE QFP I+WTPCVVHTLNLAL NICAAKNVE+NQ+ YGECSWI ++ GDVM +KH
Sbjct: 367 AGQIIESQFPNIIWTPCVVHTLNLALMNICAAKNVENNQLTYGECSWILDIVGDVMHIKH 426

Query: 365 FIMNHSMRLSMFNEFVPLKLLYVAETRFASIIVMLKRFKLIKGGLQAMVISDKWANYRED 424
           FIMNHSMRL+MFNEFV LKLL VA+TRFAS IVMLKRFKLIK GLQAMVISDKW+ YRE 
Sbjct: 427 FIMNHSMRLAMFNEFVTLKLLSVADTRFASSIVMLKRFKLIKRGLQAMVISDKWSCYREG 486

Query: 425 DVRKAQHVKELLLNDLWWDKIDYILSFTSPIYDMIRACDTDKPCLHLVYDMWDTMIEKVK 484
           DV  A+ VKE LL+D+WWD IDYILSFTSPIYDM+R CDTDKPCLHLVYDMWDTMIEKVK
Sbjct: 487 DVGTARFVKEKLLDDIWWDSIDYILSFTSPIYDMLRLCDTDKPCLHLVYDMWDTMIEKVK 546

Query: 485 KIIYRHEGLQSNENSSFYDVVHTILVDRWNKNNTPLHCLAHSLNPRYYSEEWLAEDSNRV 544
             I+RHEG + +E S FYDVVH ILVD WNKNNTPLHCLAHSLNPRYYS+EWL ED +RV
Sbjct: 547 TTIFRHEGKRHDELSGFYDVVHQILVDHWNKNNTPLHCLAHSLNPRYYSDEWLQEDPSRV 606

Query: 545 PPSQDVELTRERMKLL----------------------------------KRYVLDPRSW 604
           PP +DVE++RER K L                                   RY +DP SW
Sbjct: 607 PPYKDVEVSRERKKCLAKYFPTSEERTMVNMEFANFSLRIGEFGEYDSLHDRYHMDPTSW 666

Query: 605 WATYGVYAPMLQAIAFKLLGTPSSPSCCERNWSTYSFVNSVKRNKMTHKRTEDLVFIHSN 637
           WA +G  AP LQ++AFKLL  PSS SCCERNWSTYSFV+SV+RNKMT KR EDLVFIHSN
Sbjct: 667 WAIHGACAPKLQSLAFKLLMQPSSSSCCERNWSTYSFVHSVRRNKMTPKRAEDLVFIHSN 726

BLAST of Moc03g01650 vs. ExPASy TrEMBL
Match: A0A445LNL1 (Uncharacterized protein OS=Glycine soja OX=3848 GN=D0Y65_003769 PE=4 SV=1)

HSP 1 Score: 751.1 bits (1938), Expect = 3.7e-213
Identity = 384/703 (54.62%), Positives = 483/703 (68.71%), Query Frame = 0

Query: 5   SNSNPSQETTSSSIEDDSKPLWQYVTKIQKLSERGGNFSWQCNFCQAIKKSSYTRVRAHL 64
           S S+PSQ   +   +DD+KPLW YVTKI+ ++  GGN+  +CN C      SYTRVRAHL
Sbjct: 110 SASSPSQ---AKEQDDDTKPLWTYVTKIKSVA-GGGNYEIKCNICDFTFNGSYTRVRAHL 169

Query: 65  LKISGQGIGICLKVTPTDIADMEKLEEEAKNRKERKAPKNVRLPPSFISVGGVNVSNSPG 124
           LK++G+G+ +C KVT   + D++K++++A  R ER   K+V LPP  +S      +N+ G
Sbjct: 170 LKMTGKGVRVCQKVTVAKLIDLKKIDKKATLRVERSKTKSVSLPP--VSTQHQMDTNTLG 229

Query: 125 TSNIEPKKRKGTPSAIEKSFNKASRDQLNALIARMFYSAGLPFHLARNPHFRGAFSYAAN 184
              ++PKKRK   S++E +FN  +R+ L+  IARMFYS+GLPFHLARNPH+R AF+YAAN
Sbjct: 230 ---VDPKKRK--TSSVENAFNLQARETLDHEIARMFYSSGLPFHLARNPHYRKAFAYAAN 289

Query: 185 HMLTGYVPPGFNSLRTSLLQQEKANIERLLIPIKGEWRLKGVSIVSDGWSDSQRRPLINF 244
           + ++GY PPG+N LRT+LLQ E+ ++E  L PIK  W  KGVSIVSDGWSD QRR LINF
Sbjct: 290 NQISGYQPPGYNKLRTTLLQNERRHVENFLQPIKNAWSQKGVSIVSDGWSDPQRRSLINF 349

Query: 245 MAISE---------------------------------------------------GQLI 304
           M ++E                                                   G +I
Sbjct: 350 MVVTESGPMFLKAIDCSNEIKDKDFIAKHMREVIMEVGHSNVVQIVMDNAAVCKVAGLII 409

Query: 305 EGQFPMIVWTPCVVHTLNLALKNICAAKNVEDNQIAYGECSWISNVDGDVMVVKHFIMNH 364
           E +FP I WTPCVVHTLNLALKNICAAKN E N +AY ECSWI+ +  D M VK+F+M+H
Sbjct: 410 EAEFPSIYWTPCVVHTLNLALKNICAAKNTEKNNVAYEECSWITQIADDAMFVKNFVMSH 469

Query: 365 SMRLSMFNEFVPLKLLYVAETRFASIIVMLKRFKLIKGGLQAMVISDKWANYREDDVRKA 424
           SMRLS+FN    LKLL +A TRF S IVMLKRFK +K GLQ MVISD+W++Y+EDDV KA
Sbjct: 470 SMRLSIFNS---LKLLSIAPTRFVSTIVMLKRFKQLKKGLQEMVISDQWSSYKEDDVAKA 529

Query: 425 QHVKELLLNDLWWDKIDYILSFTSPIYDMIRACDTDKPCLHLVYDMWDTMIEKVKKIIYR 484
           + VK+ LL+D WWDK+DYILSFTSPIYD++R  DT    LHLVY+MWD+MIEKVK  IY+
Sbjct: 530 KFVKDTLLDDKWWDKVDYILSFTSPIYDVLRRTDTKASSLHLVYEMWDSMIEKVKNAIYQ 589

Query: 485 HEGLQSNENSSFYDVVHTILVDRWNKNNTPLHCLAHSLNPRYYSEEWLAEDSNRVPPSQD 544
           +E  + +E S+FY+VVH+IL+DRW K++TPLHCLAHSLNPRYYS EWL+EDSNRVPP QD
Sbjct: 590 YERKEESEGSTFYEVVHSILIDRWTKSSTPLHCLAHSLNPRYYSHEWLSEDSNRVPPHQD 649

Query: 545 VELTRERMKLLKRYVL----------------------------------DPRSWWATYG 604
           +ELTRER+K  KR+ L                                  DP++WW  +G
Sbjct: 650 MELTRERLKCFKRFFLDVDVRRKVNIEFANFSDGREGFDDLDSLNDRGQMDPKAWWLVHG 709

Query: 605 VYAPMLQAIAFKLLGTPSSPSCCERNWSTYSFVNSVKRNKMTHKRTEDLVFIHSNLRLLS 623
           + AP+LQ IA KLL  P S SCCERNWSTYSF++S+KRNKMT  R EDLVF+HSNLRLLS
Sbjct: 710 INAPILQKIALKLLAQPCSSSCCERNWSTYSFIHSLKRNKMTPHRAEDLVFVHSNLRLLS 769

BLAST of Moc03g01650 vs. ExPASy TrEMBL
Match: A0A2N9J950 (BED-type domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS61684 PE=4 SV=1)

HSP 1 Score: 728.0 bits (1878), Expect = 3.4e-206
Identity = 373/702 (53.13%), Positives = 468/702 (66.67%), Query Frame = 0

Query: 13  TTSSSIEDDSKPLWQYVTKIQKLSERGGNFSWQCNFCQAIKKSSYTRVRAHLLKISGQGI 72
           ++  S+ D + PLW YVT+I+K S  GGN S++CN+CQ I K SY+RV+AHLL+IS  GI
Sbjct: 6   SSGGSMPDGNAPLWTYVTRIEKSSGGGGNMSFKCNYCQEIYKGSYSRVKAHLLRISNVGI 65

Query: 73  GICLKVTPTDIADMEKLEEEAKNRKERKAPKNVRLPPSFISVGGVNVSNSPGTSNIEPKK 132
             C KVT     +M+KL++ A  +K  K    + LPP        + S S  +S    +K
Sbjct: 66  KGCPKVTAEHKLEMQKLQDAADQKKISK-ESTIPLPPG-------DGSESISSSMFGSRK 125

Query: 133 RKGT-PSAIEKSFNKASRDQLNALIARMFYSAGLPFHLARNPHFRGAFSYAANHMLTGYV 192
           RK T  S +E +FN   ++ L +LIARMFYS G+PFH ARNPH+  ++ YAAN+++TGYV
Sbjct: 126 RKLTGKSPLEMAFNNGCKEHLTSLIARMFYSGGIPFHFARNPHYVNSYKYAANNVITGYV 185

Query: 193 PPGFNSLRTSLLQQEKANIERLLIPIKGEWRLKGVSIVSDGWSDSQRRPLINFMAISE-- 252
           PPG+N+LRT+LLQ+E+AN+ER+L PIK  W+ KGVS+VSDGW+D QRRPLINFMA SE  
Sbjct: 186 PPGYNALRTTLLQKERANVERMLKPIKDGWKEKGVSVVSDGWTDPQRRPLINFMATSEGA 245

Query: 253 -------------------------------------------------GQLIEGQFPMI 312
                                                            G LIE ++P I
Sbjct: 246 PVFLKAIDGTKEYKDKYYISELLMNVIKEIGPEKVVQVITDNAYVMKAAGSLIEAEYPHI 305

Query: 313 VWTPCVVHTLNLALKNICAAKNVEDNQIAYGECSWISNVDGDVMVVKHFIMNHSMRLSMF 372
            WTPCVVHTLNLALKNICA KN E N +AY EC+WI+ +  D   ++ FI NHSMRL++F
Sbjct: 306 FWTPCVVHTLNLALKNICAPKNTERNAVAYAECNWIAQIADDASFIRVFITNHSMRLAIF 365

Query: 373 NEFVPLKLLYVAETRFASIIVMLKRFKLIKGGLQAMVISDKWANYREDDVRKAQHVKELL 432
           NE  PLKLL VA+TRFASI+V LKR KLIK  LQ+MVIS++W +Y+EDDV KA  V++++
Sbjct: 366 NEISPLKLLSVADTRFASILVTLKRMKLIKRSLQSMVISEEWTSYKEDDVGKATRVRDII 425

Query: 433 LNDLWWDKIDYILSFTSPIYDMIRACDTDKPCLHLVYDMWDTMIEKVKKIIYRHEGLQSN 492
           L+DLWWD++DYI+ FTSPIYDM+RA DTD+P LHLVYDMWDTMIEKVK II+RHEG Q  
Sbjct: 426 LDDLWWDRVDYIILFTSPIYDMLRAADTDRPTLHLVYDMWDTMIEKVKAIIFRHEGKQEG 485

Query: 493 ENSSFYDVVHTILVDRWNKNNTPLHCLAHSLNPRYYSEEWLAEDSNRVPPSQDVELTRER 552
           E S+FY+VV+ IL+DRW KN TPLHC+AHSLNP+YYS EWL    NRVPP QD+E++ ER
Sbjct: 486 EVSTFYNVVYDILIDRWTKNCTPLHCMAHSLNPKYYSTEWLELSPNRVPPHQDLEISEER 545

Query: 553 MKLLKRYVLDPR-----------------------------------SWWATYGVYAPML 612
            K L+RY LD                                      WW  YG Y P L
Sbjct: 546 NKCLERYFLDDHERTLAKTEFGKFSRGIINGKIHVAMVKDRSEIDAIDWWQCYGAYFPKL 605

Query: 613 QAIAFKLLGTPSSPSCCERNWSTYSFVNSVKRNKMTHKRTEDLVFIHSNLRLLSRRTPEY 627
           Q+IA KLL  P S SCCERNWSTYSF++S++RNKMT  R +DLVF+HSNLRLLSR   EY
Sbjct: 606 QSIASKLLVQPCSSSCCERNWSTYSFIHSLRRNKMTPARAQDLVFVHSNLRLLSRSNDEY 665

BLAST of Moc03g01650 vs. ExPASy TrEMBL
Match: A0A2U1N3G2 (BED-type domain-containing protein OS=Artemisia annua OX=35608 GN=CTI12_AA311960 PE=4 SV=1)

HSP 1 Score: 723.0 bits (1865), Expect = 1.1e-204
Identity = 366/713 (51.33%), Positives = 472/713 (66.20%), Query Frame = 0

Query: 2   ASSSNSNPSQETTSSSIEDDSKPLWQYVTKIQKLSERGGNFSWQCNFCQAIKKSSYTRVR 61
           +SSS  N +  +  ++   D  PLW+YVTKI K +E GG + ++CNFC+ IK  SY+RVR
Sbjct: 3   SSSSAGNNASGSVPATQATDKGPLWEYVTKISKTAETGGTWKFRCNFCEEIKTGSYSRVR 62

Query: 62  AHLLKISGQGIGICLKVTPTDIADMEKLEEEAKNRKERKAPKNVRLPPSFISVGGVNVSN 121
           AHLL+IS +GI  C +V P  + +M+  E+E ++ K   APK+V LP      GG +  N
Sbjct: 63  AHLLQISNKGISTCKRVKPESLIEMKNKEKECEDAKSNSAPKDVPLP-----CGGSDFEN 122

Query: 122 SPGTSNIEPKKRKGTPSAIEKSFNKASRDQLNALIARMFYSAGLPFHLARNPHFRGAFSY 181
           +        KKRK + S + ++F+  +R QL+  IARMF++ GLPF+LARNPH+  AF++
Sbjct: 123 T-------LKKRKSSSSPLVRAFDVDTRTQLDQEIARMFFTGGLPFNLARNPHYMRAFTF 182

Query: 182 AANHMLTGYVPPGFNSLRTSLLQQEKANIERLLIPIKGEWRLKGVSIVSDGWSDSQRRPL 241
           AANH L GYVPPG+N LRT+LLQQEK+N+ERLL PIK  WR KGV+IV+DGWSD QRRP+
Sbjct: 183 AANHNLGGYVPPGYNKLRTTLLQQEKSNVERLLKPIKETWREKGVTIVTDGWSDPQRRPI 242

Query: 242 INFMAI---------------------------------------------------SEG 301
           INFMA                                                      G
Sbjct: 243 INFMATCGNGPMFIKAVNCMGEVKKSEFIASLMKEVIDEIGHQNVVQIITDNAANCKGAG 302

Query: 302 QLIEGQFPMIVWTPCVVHTLNLALKNICAAKNVEDNQIAYGECSWISNVDGDVMVVKHFI 361
           ++IEG +P I WTPCVVHTLNLALKNICAAKN E+N   Y EC WI+ V  D M +K+FI
Sbjct: 303 EIIEGLYPQIYWTPCVVHTLNLALKNICAAKNTENNYEVYDECHWITEVHEDAMQIKNFI 362

Query: 362 MNHSMRLSMFNEFVPLKLLYVAETRFASIIVMLKRFKLIKGGLQAMVISDKWANYREDDV 421
           MNH+MRLSM+N F  LKLL VA+TRFAS IVMLKRFK IK  L+ MV+S++WA+YR+DD 
Sbjct: 363 MNHTMRLSMYNRFSSLKLLSVADTRFASTIVMLKRFKFIKRSLETMVMSEEWASYRDDDQ 422

Query: 422 RKAQHVKELLLNDLWWDKIDYILSFTSPIYDMIRACDTDKPCLHLVYDMWDTMIEKVKKI 481
            KA+ V++ +LN+ WWD++ YIL+FT+PIYDM+RACDTD+PCLHLVY+MWD+M+EKVK  
Sbjct: 423 EKARFVRDKVLNEYWWDQVTYILNFTAPIYDMVRACDTDRPCLHLVYEMWDSMVEKVKVE 482

Query: 482 IYRHEGLQSNENSSFYDVVHTILVDRWNKNNTPLHCLAHSLNPRYYSEEWLAEDSNRVPP 541
           IY+HE    +  SSFYDVVH IL+ RW K++TPLHCLAH LNPR+YSEEWL ED  R+PP
Sbjct: 483 IYKHEDKTLDMFSSFYDVVHDILIARWTKSSTPLHCLAHYLNPRFYSEEWLNEDRARLPP 542

Query: 542 SQDVELTRERM----------------------------------KLLKRYVLDPRSWWA 601
             D +++  R                                    L KR   D + WWA
Sbjct: 543 HTDGDVSYGRQLCFRRLYPNDEDYDKVLYDYADFSLKSGPFSDVTSLSKRGTTDAKRWWA 602

Query: 602 TYGVYAPMLQAIAFKLLGTPSSPSCCERNWSTYSFVNSVKRNKMTHKRTEDLVFIHSNLR 630
            +G   P+LQA+AF++LG P+S SCCERNWSTYSF++S++RNK+T KR EDLVFIH+NLR
Sbjct: 603 NFGAKTPLLQALAFRVLGQPTSSSCCERNWSTYSFIHSLRRNKLTPKRAEDLVFIHNNLR 662

BLAST of Moc03g01650 vs. TAIR 10
Match: AT1G79740.1 (hAT transposon superfamily)

HSP 1 Score: 123.2 bits (308), Expect = 7.3e-28
Identity = 133/623 (21.35%), Positives = 254/623 (40.77%), Query Frame = 0

Query: 26  WQYVTKIQKLSERGGNFSWQCNFCQAIKKSSYTRVRAHLLKISGQGIGICLKVTPTDIAD 85
           W+Y  K+       GN   +C FC  +     +R++ HL ++  +G+  C KV   D+ D
Sbjct: 9   WEYAEKLD------GN-KVKCKFCSRVLNGGISRLKHHLSRLPSKGVNPCAKVR-DDVTD 68

Query: 86  MEKLEEEAKNRKERKAPKNVRLPPSFISVGGVNVSNSPGTSNIEPKKRKGTPSAIEKSFN 145
             +      + K+     N   PP  +S       ++P +  + P          E+S  
Sbjct: 69  RVR---SILSAKDDPPITNKYKPPPPLS----PPFDAPASKLVFPSSPPNAQDIAERS-- 128

Query: 146 KASRDQLNALIARMFYSAGLPFHLARNPHFRGAFSYAANHMLTGYVP--PGF--NSLRTS 205
                     I+  F+   + F +AR+P +        +HML       PGF   S +T 
Sbjct: 129 ----------ISLFFFENKIDFAVARSPSY--------HHMLDAVAKCGPGFVAPSPKTE 188

Query: 206 LLQQEKANIERLLIPIKGEWRLKGVSIVSDGWSDSQRRPLINFMAISEGQL--------- 265
            L + K++I   L   + EW   G +I+++ W+D++ R LINF   S  ++         
Sbjct: 189 WLDRVKSDISLQLKDTEKEWVTTGCTIIAEAWTDNKSRALINFSVSSPSRIFFHKSVDAS 248

Query: 266 --------IEGQFPMIVWTPCVVHTLNLALKNI---------------------CAAKNV 325
                   +   F  ++      H + + + N                      CA++ +
Sbjct: 249 SYFKNSKCLADLFDSVIQDIGQEHIVQIIMDNSFCYTGISNHLLQNYATIFVSPCASQCL 308

Query: 326 EDNQIAYGECSWISNVDGDVMVVKHFIMNHSMRLSMFNEFV-PLKLLYVAETRFASIIVM 385
                 + +  W++       V+  F+ N+S  L +  +      ++    TR  S  + 
Sbjct: 309 NIILEEFSKVDWVNQCISQAQVISKFVYNNSPVLDLLRKLTGGQDIIRSGVTRSVSNFLS 368

Query: 386 LKRFKLIKGGLQAMVISDKWANYREDDVRKAQHVK--ELLLNDLWWDKIDYILSFTSPIY 445
           L+     K  L+ M    ++      +  K Q +    +L ++ +W  ++  ++ + PI 
Sbjct: 369 LQSMMKQKARLKHMFNCPEYTT----NTNKPQSISCVNILEDNDFWRAVEESVAISEPIL 428

Query: 446 DMIRACDTDKPCLHLVYDMWDTMIEKVKKIIYRHEGLQSNENSSFYDVVHTILVDRWNKN 505
            ++R   T KP +  +Y+    ++ K K+ I  +  +  N++  F D+V T     W ++
Sbjct: 429 KVLREVSTGKPAVGSIYE----LMSKAKESIRTYYIMDENKHKVFSDIVDT----NWCEH 488

Query: 506 -NTPLHCLAHSLNPR-YYSEE-----WLAED----SNRVPPSQDV--ELTRE-------- 565
            ++PLH  A  LNP   Y+ E      L ED      ++ P+ D+  ++T +        
Sbjct: 489 LHSPLHAAAAFLNPSIQYNPEIKFLTSLKEDFFKVLEKLLPTSDLRRDITNQIFTFTRAK 548

Query: 566 -----RMKLLKRYVLDPRSWWATYGVYAPMLQAIAFKLLGTPSSPSCCERNWSTYSFVNS 578
                 + +  R  + P  WW  +G  AP+LQ +A ++L    S    ER WST+  ++ 
Sbjct: 549 GMFGCNLAMEARDSVSPGLWWEQFGDSAPVLQRVAIRILSQVCSGYNLERQWSTFQQMHW 584

BLAST of Moc03g01650 vs. TAIR 10
Match: AT3G17450.1 (hAT dimerisation domain-containing protein)

HSP 1 Score: 121.3 bits (303), Expect = 2.8e-27
Identity = 158/737 (21.44%), Positives = 267/737 (36.23%), Query Frame = 0

Query: 45  QCNFCQAIKKSSYTRVRAHLLKISGQGIGICLKVTPTDIADMEKLEEEAKNRKERKAPKN 104
           +CN+C  I      R + HL +I G+       V P   A  E   +  +N K  +A K 
Sbjct: 150 KCNYCNKIVSGGINRFKQHLARIPGE-------VAPCKTAPEEVYVKIKENMKWHRAGKR 209

Query: 105 VRLPPSFISVGGV---NVSNSP---------------------GTSNIEPKKRKGTPSAI 164
              P     +G +    VS  P                     G       KRK   S  
Sbjct: 210 QNRPDD--EMGALTFRTVSQDPDQEEDREDHDFYPTSQDRLMLGNGRFSKDKRKSFDSTN 269

Query: 165 EKSFNKA------------------------------SRDQLNALIARMFYSAGLPFHLA 224
            +S ++A                              SR  + + I++  +  G+P   A
Sbjct: 270 MRSVSEAKTKRARMIPFQSPSSSKQRKLYSSCSNRVVSRKDVTSSISKFLHHVGVPTEAA 329

Query: 225 RNPHFRGAFSYAANHMLTGYVPPGFNSLRTSLLQQEKANIERLLIPIKGEWRLKGVSIVS 284
            + +F+        +   G+V P        LLQ+E + I+  L   +  W + G SI++
Sbjct: 330 NSLYFQKMIELIGMYG-EGFVVPSSQLFSGRLLQEEMSTIKSYLREYRSSWVVTGCSIMA 389

Query: 285 DGWSDSQRRPLINFMAI------------------------------------------- 344
           D W++++ + +I+F+                                             
Sbjct: 390 DTWTNTEGKKMISFLVSCPRGVYFHSSIDATDIVEDALSLFKCLDKLVDDIGEENVVQVI 449

Query: 345 --------SEGQLIEGQFPMIVWTPCVVHTLNLALKNICAAKNVEDNQIAYGECSWISNV 404
                   S G+L+E +   + WTPC +H   L L++             + +  ++S  
Sbjct: 450 TQNTAIFRSAGKLLEEKRKNLYWTPCAIHCTELVLED-------------FSKLEFVSEC 509

Query: 405 DGDVMVVKHFIMNHSMRLS-MFNEFVP-LKLLYVAETRFASIIVMLKRFKLIKGGLQAMV 464
                 +  FI N +  L+ M NEF   L LL  A  R AS    L+     K  L+ + 
Sbjct: 510 LEKAQRITRFIYNQTWLLNLMKNEFTQGLDLLRPAVMRHASGFTTLQSLMDHKASLRGLF 569

Query: 465 ISDKW-ANYREDDVRKAQHVKELLLNDLWWDKIDYILSFTSPIYDMIRACDT--DKPCLH 524
            SD W  +       + + V++++L+ ++W K+ Y+L    P+  +I   +   D+  + 
Sbjct: 570 QSDGWILSQTAAKSEEGREVEKMVLSAVFWKKVQYVLKSVDPVMQVIHMINDGGDRLSMP 629

Query: 525 LVYDMWDTMIEKVKKIIYRHEGLQSNENSSFYDVVHTILVDRWNK-NNTPLHCLAHSLNP 584
             Y         +K I         ++++  Y     ++  RWN   + PL+  A+  NP
Sbjct: 630 YAYGYMCCAKMAIKSI--------HSDDARKYGPFWRVIEYRWNPLFHHPLYVAAYFFNP 689

Query: 585 RY-YSEEWLAEDS-----NRVPPSQDVELTRERMKLLK-------------------RYV 639
            Y Y  +++A+       N      + + TR    L++                   R  
Sbjct: 690 AYKYRPDFMAQSEVVRGVNECIVRLEPDNTRRITALMQIPDYTCAKADFGTDIAIGTRTE 749

BLAST of Moc03g01650 vs. TAIR 10
Match: AT5G33406.1 (hAT dimerisation domain-containing protein / transposase-related)

HSP 1 Score: 100.9 bits (250), Expect = 3.9e-21
Identity = 75/288 (26.04%), Positives = 123/288 (42.71%), Query Frame = 0

Query: 332 AETRFASIIVMLKRFKLIKGGLQAMVISDKWANYREDDVRKAQHVKELLLNDLWWDKIDY 391
           A TR A+  + L +F  +K  L+ MV SD+W   +         +K     + +W  + +
Sbjct: 15  AITRIATSFITLAQFHRLKDNLRKMVHSDEWNASKWTKEAGGMKIKSFFFQESFWKNVLH 74

Query: 392 ILSFTSPIYDMIRACDTD-KPCLHLVYDMWDTMIEKV-KKIIYRHEGLQSNENSSFYDVV 451
            L    P+  ++R  D + KP +  +Y   D   E + K   Y+ E          Y + 
Sbjct: 75  ALKLGGPLIQVLRMVDGERKPPMGYIYGAMDQAKETIMKSFTYKEEN---------YKMA 134

Query: 452 HTILVDRWN-KNNTPLHCLAHSLNPRYY--------SEEWLA-------------EDSNR 511
             I+  RW+ + + PLH   + LNP ++         EE L              E  ++
Sbjct: 135 FEIIDRRWDIQLHRPLHAAGYYLNPEFHYGQPDDIGYEEVLGGFLGCLGRLVPKIETQDK 194

Query: 512 VPPSQD-----VELTRERMKLLKRYVLDPRSWWATYGVYAPMLQAIAFKLLGTPSSPSCC 571
           +    D       L    M +  R  + P  WW+ YG   P LQ  A K+L    S + C
Sbjct: 195 IITELDAFKKATGLFGIPMAIRLRTKMSPAEWWSAYGSSTPNLQNFAIKVLSLTCSATGC 254

Query: 572 ERNWSTYSFVNSVKRNKMTHKRTEDLVFIHSNLRLLSRRTPEYSQGET 591
           ERNW  +  +++ +RN++T  R  D++F+  N R L RR   Y + +T
Sbjct: 255 ERNWGVFQLLHTKRRNRLTQCRLNDMIFVKYN-RALQRR---YKRNDT 289

BLAST of Moc03g01650 vs. TAIR 10
Match: AT4G08267.1 (hAT transposon superfamily protein)

HSP 1 Score: 85.5 bits (210), Expect = 1.7e-16
Identity = 41/87 (47.13%), Positives = 56/87 (64.37%), Query Frame = 0

Query: 247 ISEGQLIEGQFPMIVWTPCVVHTLNLALKNICA-AKNVEDNQIAYGECSWISNVDGDVMV 306
           +  G LI  +F  I WTPCVVHTLNLALKN CA + +  +N++ Y  C WI  +  +V  
Sbjct: 24  VKSGALISAKFSTIFWTPCVVHTLNLALKNTCAPSLSTRNNEVVYKACYWIKFISENVTW 83

Query: 307 VKHFIMNHSMRLSMFNEFVPLKLLYVA 333
           +K+ IMN+ +RL MF E   LKLL ++
Sbjct: 84  IKNSIMNYGVRLVMFTEHCDLKLLTIS 110

BLAST of Moc03g01650 vs. TAIR 10
Match: AT1G43260.1 (hAT transposon superfamily protein)

HSP 1 Score: 62.4 bits (150), Expect = 1.5e-09
Identity = 47/172 (27.33%), Positives = 72/172 (41.86%), Query Frame = 0

Query: 156 IARMFYSAGLPFHLARNPHFRGAFSYAANHMLTGYVPPGFNSLRTSLLQQEKANIERLLI 215
           +AR  YS G+PF+   N   R      A     G  PP    LR  LL++E   ++ L+ 
Sbjct: 39  VARWVYSHGIPFNAIANDDLRRMLE-VAGQFGPGVTPPSQYQLREPLLKEEVVRMKGLME 98

Query: 216 PIKGEWRLKGVSIVSDGWSDSQRRPLINFMAISEGQLIEGQFPMIVWTPCVVHTLNLALK 275
             + EWR+ G S+ +D WSD +RR ++N                 +   C   T+ L+ K
Sbjct: 99  EQEDEWRVNGCSVTTDSWSDRKRRSIMN-----------------LCINCKEGTMFLSSK 158

Query: 276 NICAAKNVEDNQIAYGECSWISNVDGD--VMVVKHFIMNHSMRLSMFNEFVP 326
           +     +  +   AY     I N+ GD  V VV +   N+     +  E  P
Sbjct: 159 DCFDDSHTGEYIFAYVNEYCIKNLGGDHVVQVVTNNATNNITAAKLLKEVRP 192

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038721052.1  1.1e-232  58.50     uncharacterized protein LOC120013346 isoform X1 [Tripterygium wilfordii] >XP_038...
KAG5532188.1    4.3e-232  58.89     hypothetical protein RHGRI_026721 [Rhododendron griersonianum]
KAG5522171.1    6.3e-231  57.72     hypothetical protein RHGRI_034377 [Rhododendron griersonianum]
XP_030544727.1  1.6e-226  57.22     uncharacterized protein LOC115751129 [Rhodamnia argentea]
XP_028124679.1  7.4e-224  56.07     uncharacterized protein LOC114321673 [Camellia sinensis]
Match Name  E-value   Identity  Description
A0A5B7AFB0  2.1e-240  59.70     Uncharacterized protein OS=Davidia involucrata OX=16924 GN=Din_023134 PE=4 SV=1
A0A443N8D6  6.1e-224  58.73     DUF659 domain-containing protein/Dimer_Tnp_hAT domain-containing protein OS=Cinn...
A0A445LNL1  3.7e-213  54.62     Uncharacterized protein OS=Glycine soja OX=3848 GN=D0Y65_003769 PE=4 SV=1
A0A2N9J950  3.4e-206  53.13     BED-type domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS61684...
A0A2U1N3G2  1.1e-204  51.33     BED-type domain-containing protein OS=Artemisia annua OX=35608 GN=CTI12_AA311960...
Match Name   E-value  Identity  Description
AT1G79740.1  7.3e-28  21.35     hAT transposon superfamily
AT3G17450.1  2.8e-27  21.44     hAT dimerisation domain-containing protein
AT5G33406.1  3.9e-21  26.04     hAT dimerisation domain-containing protein / transposase-related
AT4G08267.1  1.7e-16  47.13     hAT transposon superfamily protein
AT1G43260.1  1.5e-09  27.33     hAT transposon superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term   IPR Description                      Source       Source Term     Source Description                  Alignment
IPR008906  HAT, C-terminal dimerisation domain  PFAM         PF05699         Dimer_Tnp_hAT                       coord: 503..575; e-value: 6.9E-8; score: 32.2
IPR007021  Domain of unknown function DUF659    PFAM         PF04937         DUF659                              coord: 193..249; e-value: 4.1E-12; score: 46.2
None       No IPR available                     MOBIDB_LITE  mobidb-lite     disorder_prediction                 coord: 1..20
None       No IPR available                     PANTHER      PTHR32166       OSJNBA0013A04.12 PROTEIN            coord: 511..621
None       No IPR available                     PANTHER      PTHR32166:SF81  HAT TRANSPOSON SUPERFAMILY PROTEIN  coord: 511..621; coord: 11..250; coord: 250..511
None       No IPR available                     PANTHER      PTHR32166       OSJNBA0013A04.12 PROTEIN            coord: 11..250; coord: 250..511
IPR012337  Ribonuclease H-like superfamily      SUPERFAMILY  53098           Ribonuclease H-like                 coord: 224..578
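
The `coord` values above give the start and end of each predicted domain on the protein. As a rough sketch, assuming these are 1-based inclusive residue positions (an assumption about the coordinate convention, not something this page states), domain lengths and the overlap between two annotations can be computed as follows. The function names are hypothetical.

```python
def span_length(start, end):
    """Number of residues in a 1-based inclusive interval (assumed convention)."""
    return end - start + 1

def overlap(a, b):
    """Number of residues shared by two inclusive (start, end) intervals."""
    lo = max(a[0], b[0])
    hi = min(a[1], b[1])
    return max(0, hi - lo + 1)

# Coordinates copied from the InterPro annotations of this gene.
domains = {
    "Dimer_Tnp_hAT (PF05699)": (503, 575),
    "DUF659 (PF04937)": (193, 249),
    "Ribonuclease H-like (SUPERFAMILY 53098)": (224, 578),
}

rnase_h = domains["Ribonuclease H-like (SUPERFAMILY 53098)"]
for name, coords in domains.items():
    print(name, span_length(*coords), overlap(coords, rnase_h))
```

Under that assumption, the hAT dimerisation domain lies entirely inside the Ribonuclease H-like region, while DUF659 only partly overlaps it.
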

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Moc03g01650.1  Moc03g01650.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0046983      protein dimerization activity