MS000541 (gene) Bitter gourd (TR) v1

Overview
Name: MS000541
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: L10-interacting MYB domain-containing protein
Location: scaffold64: 232952 .. 237163 (-)
RNA-Seq Expression: MS000541
Synteny: MS000541
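
The Location field packs the scaffold name, start, end, and strand into one string. A minimal sketch of how it could be split apart and the genomic span computed, assuming the coordinates are 1-based and inclusive (the record does not state this); the function names `parse_location` and `span_length` are hypothetical:

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("scaffold64: 232952 .. 237163 (-)")
print(seqid, strand, span_length(start, end))
```

The "(-)" strand means the feature lies on the reverse strand of scaffold64, so a sequence cut directly from the scaffold at these coordinates would need to be reverse-complemented to match the gene orientation.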
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GTTTCAGCTTTAAGTGAGAAATATGGGCCCGACTTGACAAAAGAGTACATTAGAAATAGGCTTAGAACTTTGAAGAAACAATATCGTATTCTGAAGGAACTTCTTTCTCATGATGGGTTCAGTTGGGATGAGACAAGAAAGATTATCGTTGCAAATAACTCAGTATGGGATGATTATATCAAGGTAAGTGGAAAATTTCAGCATTTGCTTAATAGTGTGAGTGATATTGTTTTGTCATGCATTCGATCATGTTCTTTGAATTAATAGCGTCAAACTTCTATTCTTGTTTCTTTTTTTTTTTCCCTTTTGTTTTGAGGTTAGAAATATTTCATAACTAAAGAGATGTATAATAGGAGAGAAGAAAAGATATTCTACCTCTCACTCATAGCAAGGTGATCACAAAAGTAACTTTCTAATTGGCTTGAATGGGAAAAAATAATGGTTACAAAAATCTCTATGTAAAAGAGCACCACCTAGCTAGGAAAAAATTGTCTTGTCAAAATCCATTTCTCGTTCTTCTCAATGAAATATGTTGGGTTCCTTTCAATCTAAATCTCCTAAAGAGTAATATTCAACCAAGTAGCCTCGGTCTACATCCTAAGAACCAAAAGTAGTATGTTTTTCTTTACTCTTCTTTATTTTTCAGATGAGAAACCATACATTTTATAGATGATATGAAATTACAAAATGAGGGAAGGAAAGTTAAACCACAAGCCAAAGAATTTTCAAGAATTCTTCCAATTGGCACAAAGGTAACAAAGATTGTAGTTGTAAGAAGGTAAACTTTACACTATGAAAGAGCTACAAGAATTACATGACCCAAAAAGTTATCAAATGGCTTTCTATGCTATTGAAAAACCTATTGTTCTTTTCAATCCAAAATCACCTCAAGTATTTTTCTCTTAAAAGATTCTTTGATTTATTTTGTACCGAAGTCCCCAAATTGCTTTAATAGTATTGATCAACAATGTTTTAGCTTTACTTTTTTTTTTTTTTTTTTGATATCCATGAGTGTCTGGACCAGCTTGCGCGCACCTCGACTAATCTCAAGGGGCAATCGCCTGACCCTATAACATTTGGTGCCAGAAAACACATAGGAATTACTAATTCCTAAGGTAGGTGGCCATCATGGGGTTTGAACCCATTTTAGCTTTACTTTTGAATCTATTACCACAAAACAATTGGAAAATATTGTCTTTGGATTCTTTGCCAAAAACCCATGAAATATTTACTGTCATTGGTCCTATCCTCCTTCATTCAGGAGACACAATAACTAGAGTTTAGAGCAAATTCCTGAAGTTCTCTAAAATCATTATATATCATTATGGAATTTCTTCAAAACTGTCATTGAGCAGGTAACTGGAAATTGGAAAATGAGTTACATGATAAAATGGGGAAGAATTAATATATTAAATCCTTTTGCAGCAATGGAGGTTTAATCCGGTTTGGTGGTCTCCTTTAAGAAAGTTCTTGTTTTTAGTAGGTGGTCATCTTTTTTGACTCTTGGTCTTTTACAAAGATCTTTTCGGTTCGGTTGAAGCATGTTGCAGGTTTTGTAGCTTGTAGGATTTGATTTTGGTGGCTTTGGTTAGTTGGTTTTCAAGTGGTTCTTCTCAGCCTTGCTAGATCTTTCCTTGGAGAAATTCGATACTTTATTTATTTTTGGCTCTTGTAATTTTTAGAGATCTTTACTTTATCTGTTTGGTTGTTTGTTCTTTTCACCATCTTTTGGAGTTCTTGAATATTCTTCGATCTTTTCATCTATCAATGAAAAGTTTTTTTCTTGTTAAAAAAAAGAAAAAAAAAATTAACATCCTGTACGAGGGACAATCAATTTAATCGAGTTAAATGGTGAATTGTTCATTGATCACCCATCATGCTGCTTGTAAAGTTTCTGCTCTGATCTCATTTTTTTAATATGATCTAAACTACACTGGATAGGTACTATTCAATGTACAACTGATCATTTCAATGATACTTCCTTTTCTCTGTATCTGATG
CATTCAATCTGAACTTTAGCTCCGGAATAACTATATGTTATTATTTTCTCTGTGCATAGGTCAATCTTGAGGCCAGAAGCTTTCGTGGTAGAATTTTTGAAAACTATGATCAACTTTGCATCTTCTTTGGATACTACAATATGGAGACTTTTGACTTCCACGTTGCTATGTATGATGGAAAGAATGGTTGTGAAGGGAATCCCTTGAGGTGGACAAGTGAAATGGACTGTTGTCTCAGTGGAGTCCTCGTGGAGCAAGTGATTCTTGGGAATAAAAATAGTAGATAATGAATTCAAGACTGTTGCATATAATGCAGCTATATTGGCTATAAGAGAGAGGTTTTCTCTTCAATTGACAAAAGATCAAGTTAAGGATCGTTTTAAATCATGGAAAAGAGAGTACTTTGTGCTGAGGAACCTCTTAGACCAAGGTGACTTTGAATGGGATGATCAGCGAAAGATGTTGGTTGCAAAGGACTCAGTATGGGATGTGTCTGTTGAGGTATATAACATTTCAGTTGGTTTTTCACTGGTCCATAGAATGATGTATATGATTTTTCTTTTCTTTTTCTTTTCTCCTCGGACGGTGTTCAAATGGTACTACGTACAATTGAAACAGAGAAACCCAGATGCTAGACTTCTTAGAGGGAGGGTCATTGAGAATTATGATGAATTGTGTATTATTATTGGGTATGACAATCCATCTGAAAGTTCTCTCAATCCTGCTAATGTTAATTTGGATTTAACTGCTAATAATGAAGCTATAAATGCTGGAGTTGTATGCTACAATCAAAGTAACAATGCAGCAGAAAAAGAAAATTTCATAACTTGGACTGAGGAGATGGATACCTGCTTATCGAAGCTGCTAGTTGAGCAAGTGGTTCTTGGAAACAGGATTGAGGAAACATTTAAGACTGCAGCTTACACGGCTGCTCTTACAGTTTTAAATGAGAGATTTGCGTTGGATTTGACTAAAGAAAACATTAGAAGCAGGTTAAACACATGGGAAAAGCAGTATGGAAGAGTGAAGTTACTCCTCTCCCATGATGGGTTTGAGTGGGATGAAAGACACAAGATGGTTGTTGCCAATGACTTTGATTGGACTGCATACATTAAGGTATGTATATATCGTTTTATCTTTCAATATGGTCTGATATTATTCTTTGGATGAAAAATCATCTTTGTTAAGATAAATGAAAGAACAAGATAATGGCAATACAAAAACATGAGTGAGTCCAAATGAGACGAACCCAATCACTGAGGCTTTGTGAAGAATCAAAACTTAAGAAAAATTACAAGATAATTCTGATGGGATGCCCAGAAAGGGCATTAAAGTAGGCCAAACTTTACAGCTCATCCCATTACATAATGTTATTTTGATAGCTGTAGAGCATGTGTCAGTGAGATACATTCTTCTCCATTGTGACTTTCTCACAAGTTAAGGGTCATTTTATGTTTGTTGGGTTACTAAAAAATATATGGTAAATTTAATATTTTATAGATTCCTAACACTCCCACACACCCGGAGGTTTAGAGAACCTTTTGCTCTGATATCATATTAAATTACTAATGAATCTAAAAGTTTAAATTGATGGGTTACCGTGAATATAATCCTTTATATGTGTATTCTCTAGTTAGTTGAGATATCTTTGAAATCAACTGGTGAAAAGAGAAAGCCTTTTGGATATGTCCAGTTTCAACTCTGTGTTTGACGTTTTGTTGGAAGGAAGTTAATTGCATTTTCTGAGTATGTAGACTGAGAAATTTGTTCTTCACATTTTGATTTAAGTACGTAAATCAAATGAGATTATGTCTGGTTCTTTGTACTCTAAATGGTCACATTTCTTGTACTATACCAAAATTAGATTGCTTAACCATGGAGTAGGGAGGGTTTTTGTTGGAGGATTCAGAGTTGATGAACAGGGTTCATCTTATCAATTTAATCATGCAATTTCCTTAATTGATGATGAACGCTGGATTTTATATTTCTTCTCATCTAAA
ACCTCTTATTAATGTAGATGTTAAATTATAACTTTTACTTTTACCATTCTATATTGTTTTTCTGTTTATTGATTTTGGCTAAGGACATGTGAAATATTGCACATTTGCTCATTGCTTTTCAACACACATTCCAGAAACACCCCGATGACCAGGACTTGCGAGCAAAATCGATCGACAATTACAATGAGCTGTGTATGATTTTTGGCAATGAA

mRNA sequence

GTTTCAGCTTTAAGTGAGAAATATGGGCCCGACTTGACAAAAGAGTACATTAGAAATAGGCTTAGAACTTTGAAGAAACAATATCGTATTCTGAAGGAACTTCTTTCTCATGATGGGTTCAGTTGGGATGAGACAAGAAAGATTATCGTTGCAAATAACTCAGTATGGGATGATTATATCAAGGTCAATCTTGAGGCCAGAAGCTTTCGTGGTAGAATTTTTGAAAACTATGATCAACTTTGCATCTTCTTTGGATACTACAATATGGAGACTTTTGACTTCCACGTTGCTATGTATGATGGAAAGAATGGTTGTGAAGGGAATCCCTTGAGGTGGACAAGTGAAATGGACTGTTGTCTCAGTGGAGTCCTCGTGGAGCAAGTGATTCTTGGGAATAAAAATAGTAATGAATTCAAGACTGTTGCATATAATGCAGCTATATTGGCTATAAGAGAGAGGTTTTCTCTTCAATTGACAAAAGATCAAGTTAAGGATCGTTTTAAATCATGGAAAAGAGAGTACTTTGTGCTGAGGAACCTCTTAGACCAAGGTGACTTTGAATGGGATGATCAGCGAAAGATGTTGGTTGCAAAGGACTCAGTATGGGATGTGTCTGTTGAGAGAAACCCAGATGCTAGACTTCTTAGAGGGAGGGTCATTGAGAATTATGATGAATTGTGTATTATTATTGGGTATGACAATCCATCTGAAAGTTCTCTCAATCCTGCTAATGTTAATTTGGATTTAACTGCTAATAATGAAGCTATAAATGCTGGAGTTGTATGCTACAATCAAAGTAACAATGCAGCAGAAAAAGAAAATTTCATAACTTGGACTGAGGAGATGGATACCTGCTTATCGAAGCTGCTAGTTGAGCAAGTGGTTCTTGGAAACAGGATTGAGGAAACATTTAAGACTGCAGCTTACACGGCTGCTCTTACAGTTTTAAATGAGAGATTTGCGTTGGATTTGACTAAAGAAAACATTAGAAGCAGGTTAAACACATGGGAAAAGCAGTATGGAAGAGTGAAGTTACTCCTCTCCCATGATGGGTTTGAGTGGGATGAAAGACACAAGATGGTTGTTGCCAATGACTTTGATTGGACTGCATACATTAAGAAACACCCCGATGACCAGGACTTGCGAGCAAAATCGATCGACAATTACAATGAGCTGTGTATGATTTTTGGCAATGAA

Coding sequence (CDS)

GTTTCAGCTTTAAGTGAGAAATATGGGCCCGACTTGACAAAAGAGTACATTAGAAATAGGCTTAGAACTTTGAAGAAACAATATCGTATTCTGAAGGAACTTCTTTCTCATGATGGGTTCAGTTGGGATGAGACAAGAAAGATTATCGTTGCAAATAACTCAGTATGGGATGATTATATCAAGGTCAATCTTGAGGCCAGAAGCTTTCGTGGTAGAATTTTTGAAAACTATGATCAACTTTGCATCTTCTTTGGATACTACAATATGGAGACTTTTGACTTCCACGTTGCTATGTATGATGGAAAGAATGGTTGTGAAGGGAATCCCTTGAGGTGGACAAGTGAAATGGACTGTTGTCTCAGTGGAGTCCTCGTGGAGCAAGTGATTCTTGGGAATAAAAATAGTAATGAATTCAAGACTGTTGCATATAATGCAGCTATATTGGCTATAAGAGAGAGGTTTTCTCTTCAATTGACAAAAGATCAAGTTAAGGATCGTTTTAAATCATGGAAAAGAGAGTACTTTGTGCTGAGGAACCTCTTAGACCAAGGTGACTTTGAATGGGATGATCAGCGAAAGATGTTGGTTGCAAAGGACTCAGTATGGGATGTGTCTGTTGAGAGAAACCCAGATGCTAGACTTCTTAGAGGGAGGGTCATTGAGAATTATGATGAATTGTGTATTATTATTGGGTATGACAATCCATCTGAAAGTTCTCTCAATCCTGCTAATGTTAATTTGGATTTAACTGCTAATAATGAAGCTATAAATGCTGGAGTTGTATGCTACAATCAAAGTAACAATGCAGCAGAAAAAGAAAATTTCATAACTTGGACTGAGGAGATGGATACCTGCTTATCGAAGCTGCTAGTTGAGCAAGTGGTTCTTGGAAACAGGATTGAGGAAACATTTAAGACTGCAGCTTACACGGCTGCTCTTACAGTTTTAAATGAGAGATTTGCGTTGGATTTGACTAAAGAAAACATTAGAAGCAGGTTAAACACATGGGAAAAGCAGTATGGAAGAGTGAAGTTACTCCTCTCCCATGATGGGTTTGAGTGGGATGAAAGACACAAGATGGTTGTTGCCAATGACTTTGATTGGACTGCATACATTAAGAAACACCCCGATGACCAGGACTTGCGAGCAAAATCGATCGACAATTACAATGAGCTGTGTATGATTTTTGGCAATGAA

Protein sequence

VSALSEKYGPDLTKEYIRNRLRTLKKQYRILKELLSHDGFSWDETRKIIVANNSVWDDYIKVNLEARSFRGRIFENYDQLCIFFGYYNMETFDFHVAMYDGKNGCEGNPLRWTSEMDCCLSGVLVEQVILGNKNSNEFKTVAYNAAILAIRERFSLQLTKDQVKDRFKSWKREYFVLRNLLDQGDFEWDDQRKMLVAKDSVWDVSVERNPDARLLRGRVIENYDELCIIIGYDNPSESSLNPANVNLDLTANNEAINAGVVCYNQSNNAAEKENFITWTEEMDTCLSKLLVEQVVLGNRIEETFKTAAYTAALTVLNERFALDLTKENIRSRLNTWEKQYGRVKLLLSHDGFEWDERHKMVVANDFDWTAYIKKHPDDQDLRAKSIDNYNELCMIFGNE
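
The protein sequence corresponds to a frame-0 translation of the CDS. A minimal sketch of that translation, assuming the standard genetic code; the helper name `translate` is hypothetical, and only the first 30 nucleotides of the CDS are used here:

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

def translate(cds):
    """Translate a nucleotide string in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First 30 nucleotides of the CDS listed above.
print(translate("GTTTCAGCTTTAAGTGAGAAATATGGGCCC"))  # VSALSEKYGP
```

The output matches the first ten residues of the protein sequence. The CDS as listed begins with GTT rather than ATG and ends without a stop codon, so the function translates whatever it is given and does not look for start or stop signals.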
Homology
BLAST of MS000541 vs. NCBI nr
Match: XP_030959168.1 (uncharacterized protein LOC115981123 [Quercus lobata])

HSP 1 Score: 493.0 bits (1268), Expect = 2.4e-135
Identity = 241/406 (59.36%), Positives = 311/406 (76.60%), Query Frame = 0

Query: 1   VSALSEKYGPDLTKEYIRNRLRTLKKQYRILKELLSHDGFSWDETRKIIVANNSVWDDYI 60
           V AL+E++GPDLTKE+IRNRLRT +KQY ILKELLSH GF WD  +K+I+A++SVWDDY+
Sbjct: 222 VLALNERFGPDLTKEHIRNRLRTWRKQYLILKELLSHSGFKWDAMQKMIIASDSVWDDYV 281

Query: 61  KVNLEARSFRGRIFENYDQLCIFFGYYN-----METFDFHVAMYDGKNGCEGNPLRWTSE 120
           K + +AR FR R  +NYDQL I FG  +     ++  D       GK    G  +RWT E
Sbjct: 282 KTHPDARIFRNRFIQNYDQLFIIFGDSHEAAEPVDVIDVSPVRCGGKVKDLGKNVRWTFE 341

Query: 121 MDCCLSGVLVEQVILGNKN--SNEFKTVAYNAAILAIRERFSLQLTKDQVKDRFKSWKRE 180
           MD CL  VLVEQVILGNKN   N+FK  AY AA+LAI+ERF L LTKD V++R K+WK++
Sbjct: 342 MDRCLGKVLVEQVILGNKNRLDNKFKPAAYEAAVLAIKERFHLDLTKDHVRNRLKTWKKQ 401

Query: 181 YFVLRNLLDQGDFEWDDQRKMLVAKDSVWDVSVERNPDARLLRGRVIENYDELCIIIGYD 240
           Y +L+ LLDQ DFEWD++RKM++A DS W+  ++ NPDAR ++GRVI NY+ELC+IIG +
Sbjct: 402 YDILQELLDQRDFEWDERRKMVIANDSAWNEYIKINPDARTVQGRVINNYEELCVIIGCN 461

Query: 241 NPSESSLNPANVNLDLTANNEAINAGVVCYNQSNNAAEKENFITWTEEMDTCLSKLLVEQ 300
           +P ESS+N A  NLDL A NEA+ A    YN+ +NA +K  +I+WT+EMD CL++LLV+Q
Sbjct: 462 DPPESSVNIAENNLDLIAENEAVVAEEKYYNEVDNAKDKVKYISWTDEMDRCLTQLLVQQ 521

Query: 301 VVLGNRIEETFKTAAYTAALTVLNERFALDLTKENIRSRLNTWEKQYGRVKLLLSHDGFE 360
           V+LGN++++ FK  AY AALTVLNE+F LDLTKENIR+RL TW+KQYG VK LLSH GFE
Sbjct: 522 VMLGNKLDKNFKPVAYMAALTVLNEKFGLDLTKENIRNRLKTWKKQYGLVKELLSHGGFE 581

Query: 361 WDERHKMVVANDFDWTAYIKKHPDDQDLRAKSIDNYNELCMIFGNE 400
           WD+R+KMVVA D DW  YIK++PD + LRA+SI+NY++L +I GNE
Sbjct: 582 WDDRYKMVVATDSDWNEYIKRYPDARQLRARSIENYDDLRIIVGNE 627
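
The percentages in each HSP summary are the match counts divided by the alignment length (including gap columns). A minimal sketch that recomputes them from a summary line; the function name `hsp_percentages` is hypothetical, and the example line below is typed in by hand from the first HSP:

```python
import re

def hsp_percentages(summary_line):
    """Return {label: percent} for every 'Label = n/m' field in the line."""
    stats = {}
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", summary_line):
        stats[label] = round(100 * int(num) / int(den), 2)
    return stats

line = "Identity = 241/406 (59.36%), Positives = 311/406 (76.60%), Query Frame = 0"
print(hsp_percentages(line))
```

Fields without a fraction, such as "Query Frame = 0", are skipped by the pattern, so the same function can be applied to the other HSP summary lines in this report.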

BLAST of MS000541 vs. NCBI nr
Match: XP_023877154.1 (uncharacterized protein LOC111989590 [Quercus suber])

HSP 1 Score: 487.3 bits (1253), Expect = 1.3e-133
Identity = 238/406 (58.62%), Positives = 311/406 (76.60%), Query Frame = 0

Query: 1   VSALSEKYGPDLTKEYIRNRLRTLKKQYRILKELLSHDGFSWDETRKIIVANNSVWDDYI 60
           V AL+E++GPDLTKE+IRNRLRT +KQY ILKELLSH+GF WD  +K+I+A++SVWDDY+
Sbjct: 222 VLALNERFGPDLTKEHIRNRLRTWRKQYLILKELLSHNGFKWDAMQKMIIASDSVWDDYV 281

Query: 61  KVNLEARSFRGRIFENYDQLCIFFGYYN-----METFDFHVAMYDGKNGCEGNPLRWTSE 120
           K + +AR FR R  +NYDQL I FG  +     ++  D       GK    G  +RWT E
Sbjct: 282 KTHPDARIFRNRFIQNYDQLFIIFGDSHEAAEPVDVIDVSPVRCGGKAKDLGKNVRWTFE 341

Query: 121 MDCCLSGVLVEQVILGNKN--SNEFKTVAYNAAILAIRERFSLQLTKDQVKDRFKSWKRE 180
           MD CL  VLVEQVILGNKN   N+FK  AY AA+LAI+ERF L LTKD V++R K+WK++
Sbjct: 342 MDRCLGKVLVEQVILGNKNRLDNKFKPAAYEAAVLAIKERFHLDLTKDHVRNRLKTWKKQ 401

Query: 181 YFVLRNLLDQGDFEWDDQRKMLVAKDSVWDVSVERNPDARLLRGRVIENYDELCIIIGYD 240
           + +L+ LLDQ DFEWD++RKM++A DS W+  V+ NPDAR ++GRVI NY+ELC+IIG +
Sbjct: 402 FDILQELLDQRDFEWDERRKMVIANDSAWNEYVKINPDARTVQGRVINNYEELCVIIGCN 461

Query: 241 NPSESSLNPANVNLDLTANNEAINAGVVCYNQSNNAAEKENFITWTEEMDTCLSKLLVEQ 300
           +P ESS+N A  NLDL A NEA+ A    YN+ +NA +K  +I+WT+EMD CL++LLV+Q
Sbjct: 462 DPPESSVNIAENNLDLIAENEAVVAEETYYNEVDNAKDKGKYISWTDEMDRCLTQLLVQQ 521

Query: 301 VVLGNRIEETFKTAAYTAALTVLNERFALDLTKENIRSRLNTWEKQYGRVKLLLSHDGFE 360
           V+LGN++++ FK  AY AA+TVLNE+F LDLTKENIR+RL TW+KQYG VK LLS  GF+
Sbjct: 522 VMLGNKLDKNFKPVAYMAAVTVLNEKFGLDLTKENIRNRLKTWKKQYGLVKELLSQGGFK 581

Query: 361 WDERHKMVVANDFDWTAYIKKHPDDQDLRAKSIDNYNELCMIFGNE 400
           WDER+KMVVA D DW  YIK++PD + L+A+SI+NY++L +I GNE
Sbjct: 582 WDERYKMVVATDSDWNEYIKRYPDARQLQARSIENYDDLRIIVGNE 627

BLAST of MS000541 vs. NCBI nr
Match: KAF3973412.1 (hypothetical protein CMV_003146 [Castanea mollissima])

HSP 1 Score: 486.5 bits (1251), Expect = 2.2e-133
Identity = 240/406 (59.11%), Positives = 307/406 (75.62%), Query Frame = 0

Query: 1   VSALSEKYGPDLTKEYIRNRLRTLKKQYRILKELLSHDGFSWDETRKIIVANNSVWDDYI 60
           V AL+E++GPDLTKE+IRNRLRT +KQY ILKELLSH GF WD  +K+I+A++SVWDDY+
Sbjct: 236 VLALNERFGPDLTKEHIRNRLRTWRKQYLILKELLSHSGFKWDAMQKMIIASDSVWDDYV 295

Query: 61  KVNLEARSFRGRIFENYDQLCIFFGYYN-----METFDFHVAMYDGKNGCEGNPLRWTSE 120
           K + +AR FR R  +NYDQL I FG  +     ++  D       GK    G  +RWT E
Sbjct: 296 KTHPDARIFRNRFIQNYDQLFIIFGDSHEAAEPVDVIDVSPVRCGGKAKDLGKNVRWTFE 355

Query: 121 MDCCLSGVLVEQVILGNKN--SNEFKTVAYNAAILAIRERFSLQLTKDQVKDRFKSWKRE 180
           MD CL  VLVEQVILGNKN   N+FK  AY AA+  I+ERF L LTKD V++R K+WK++
Sbjct: 356 MDRCLGKVLVEQVILGNKNRLDNKFKPAAYEAAVFTIKERFHLDLTKDHVRNRLKTWKKQ 415

Query: 181 YFVLRNLLDQGDFEWDDQRKMLVAKDSVWDVSVERNPDARLLRGRVIENYDELCIIIGYD 240
           Y +L+ LLDQ DFEWD++RKM++A DS  +  V+ NPDAR ++GRVI NY+ELC+IIG +
Sbjct: 416 YDILQELLDQRDFEWDERRKMVIANDSACNEYVKINPDARTVQGRVINNYEELCVIIGCN 475

Query: 241 NPSESSLNPANVNLDLTANNEAINAGVVCYNQSNNAAEKENFITWTEEMDTCLSKLLVEQ 300
           +P ESS+N A  NLDL A NEA+ A    YN+ +NA +K  +I+WT+EMD CL++LLV+Q
Sbjct: 476 DPPESSVNIAENNLDLIAENEAVVAEETYYNEVDNAKDKGKYISWTDEMDRCLTQLLVQQ 535

Query: 301 VVLGNRIEETFKTAAYTAALTVLNERFALDLTKENIRSRLNTWEKQYGRVKLLLSHDGFE 360
           V+LGN++++ FK  AY AALTVLNE+F LDLTKENIR+RL TW+KQYG VK LLSH GFE
Sbjct: 536 VMLGNKLDKNFKPVAYMAALTVLNEKFGLDLTKENIRNRLKTWKKQYGLVKELLSHGGFE 595

Query: 361 WDERHKMVVANDFDWTAYIKKHPDDQDLRAKSIDNYNELCMIFGNE 400
           WDER+KMVVA D DW  YIK+ PD + LRA+SI+NY++L +I GNE
Sbjct: 596 WDERYKMVVATDSDWNEYIKRSPDARQLRARSIENYDDLRIIVGNE 641

BLAST of MS000541 vs. NCBI nr
Match: KAF7815604.1 (L10-interacting MYB domain-containing protein-like isoform X4 [Senna tora])

HSP 1 Score: 457.2 bits (1175), Expect = 1.4e-124
Identity = 219/410 (53.41%), Positives = 301/410 (73.41%), Query Frame = 0

Query: 1   VSALSEKYGPDLTKEYIRNRLRTLKKQYRILKELLSHDGFSWDETRKIIVANNSVWDDYI 60
           VSAL+ K+GP +TK++I+N L+T KKQY ++KELLSH  F WDETRK+IVAN+S W+ YI
Sbjct: 219 VSALNAKFGPCITKDHIKNHLKTWKKQYELIKELLSHAEFEWDETRKMIVANDSTWNHYI 278

Query: 61  KVNLEARSFRGRIFENYDQLCIFFGYYN--------METFDFHVAMYDGKNGCEGNPLRW 120
           K + +AR+FR R+FENYDQLC  +G  N        +E         D     +G  +RW
Sbjct: 279 KKHPDARTFRARVFENYDQLCTIYGSRNESVQCNEPLEALSNSPVQVDSSFKDQGKHMRW 338

Query: 121 TSEMDCCLSGVLVEQVILGNKN--SNEFKTVAYNAAILAIRERFSLQLTKDQVKDRFKSW 180
           T+EMD CLS VLVEQ+ LGNK+    + K+ AY AA+LAI ERF L L KDQVK+R K+ 
Sbjct: 339 TTEMDRCLSEVLVEQIKLGNKSKFDKKIKSAAYEAAVLAINERFQLDLIKDQVKNRLKTM 398

Query: 181 KREYFVLRNLLDQGDFEWDDQRKMLVAKDSVWDVSVERNPDARLLRGRVIENYDELCIII 240
           ++++ +L+ +LDQ  FEWD++RKM+ A DSVW+  ++ NPDARLL+GRVI NYDE+CII+
Sbjct: 399 RKQFEILKEILDQSGFEWDEKRKMVNATDSVWNEYIKINPDARLLKGRVIRNYDEMCIIV 458

Query: 241 GYDNPSESSLN-PANVNLDLTANNEAINAGVVCYNQSNNAAEKENFITWTEEMDTCLSKL 300
           G+ +P ++SLN     NL LT N+E ++     Y+ +NNA +K  ++ WT+ MD CL++L
Sbjct: 459 GHSDPPDNSLNGGGGANLGLTINDEVMDGQEAYYHGTNNAKDKSKYVAWTDAMDRCLTEL 518

Query: 301 LVEQVVLGNRIEETFKTAAYTAALTVLNERFALDLTKENIRSRLNTWEKQYGRVKLLLSH 360
           LV+QV++GN++E+ FK++AY +A++VLNE+F L LTKENI +R+ +W KQYG VK +LSH
Sbjct: 519 LVKQVMMGNKLEKNFKSSAYMSAVSVLNEKFGLHLTKENIMNRMKSWRKQYGLVKEMLSH 578

Query: 361 DGFEWDERHKMVVANDFDWTAYIKKHPDDQDLRAKSIDNYNELCMIFGNE 400
            GFEWD+R KMVVAND +W  YIKKHP  + LR++ I+NYNEL +I GNE
Sbjct: 579 GGFEWDDRRKMVVANDSEWNEYIKKHPKARHLRSRCIENYNELGVIVGNE 628

BLAST of MS000541 vs. NCBI nr
Match: KAA8550002.1 (hypothetical protein F0562_001686 [Nyssa sinensis])

HSP 1 Score: 454.1 bits (1167), Expect = 1.2e-123
Identity = 228/423 (53.90%), Positives = 293/423 (69.27%), Query Frame = 0

Query: 1   VSALSEKYGPDLTKEYIRNRLRTLKKQYRILKELLSHDGFSWDETRKIIVANNSVWDDYI 60
           V+AL+EK+GPD+TK++I+NRL+T KKQY ILKELLSH GF WDE RK+++ N+S W+DYI
Sbjct: 240 VTALNEKFGPDITKDHIKNRLKTWKKQYGILKELLSHIGFKWDEARKMVIGNDSAWNDYI 299

Query: 61  KVNLEARSFRGRIFENYDQLCIFF------GYYNMETFDFHVAMYDGKNGCE---GNPLR 120
           K + +A  FRGR+ ENYD LCI F      G Y+    D   ++     G E    +P+R
Sbjct: 300 KTHHDAHPFRGRVVENYDHLCIIFGNNHATGSYSRTVDDIVHSLAGDSEGVEAINASPIR 359

Query: 121 -------------WTSEMDCCLSGVLVEQVILGNKN--SNEFKTVAYNAAILAIRERFSL 180
                        WT+EMD CLS +LV+QV LGNK+   N+FK  AY AA+LA+ ERF L
Sbjct: 360 CYSGLRDEEKNMEWTNEMDRCLSTILVKQVKLGNKSKLDNKFKPAAYAAAVLALSERFQL 419

Query: 181 QLTKDQVKDRFKSWKREYFVLRNLLDQGDFEWDDQRKMLVAKDSVWDVSVERNPDARLLR 240
             T D V++R K+WK+ Y  ++ +LDQ +F+WD +RKM+   DSVW   ++ NPDARLL 
Sbjct: 420 DFTNDHVRNRIKTWKKLYGSVKEILDQSEFKWDKERKMITTNDSVWHDYIKINPDARLLH 479

Query: 241 GRVIENYDELCIIIGYDNPSESSLNPANVNLDLTANNEAINAGVVCYNQSNNAAEKENFI 300
           GRVIENYDELC IIG DNP+ESS N A  ++D  A+NE +   V   +Q +NA E+  +I
Sbjct: 480 GRVIENYDELCAIIGNDNPTESSKNDAEADMDWAADNEDVETEVAYQSQRDNAKERGKYI 539

Query: 301 TWTEEMDTCLSKLLVEQVVLGNRIEETFKTAAYTAALTVLNERFALDLTKENIRSRLNTW 360
            WT+EMD CL + LVEQV LGN++E+ FK  AYTA LT LNE F LDLTKENI+SRL TW
Sbjct: 540 IWTDEMDCCLMEKLVEQVKLGNKLEKNFKPVAYTAVLTALNENFVLDLTKENIKSRLKTW 599

Query: 361 EKQYGRVKLLLSHDGFEWDERHKMVVANDFDWTAYIKKHPDDQDLRAKSIDNYNELCMIF 400
           +K YG VK +LSH GF WDE+ KMVVA D  W  YIK HPD + LRA+SI+NY+EL +I 
Sbjct: 600 KKVYGLVKEVLSHRGFVWDEKRKMVVATDSVWNEYIKMHPDAKFLRARSIENYDELRIII 659

BLAST of MS000541 vs. ExPASy Swiss-Prot
Match: O82368 (Uncharacterized protein At2g29880 OS=Arabidopsis thaliana OX=3702 GN=At2g29880 PE=2 SV=1)

HSP 1 Score: 61.6 bits (148), Expect = 2.3e-08
Identity = 36/135 (26.67%), Positives = 68/135 (50.37%), Query Frame = 0

Query: 265 QSNNAAEKENFITWTEEMDTCLSKLLVEQVVLGNRIEE--TFKTAAYTAALTVLNERFAL 324
           +++   +K  +++W+++    L+ +LV+ +  G R +     KT      L +LN++F  
Sbjct: 9   ETSKKKKKGPYMSWSDQECYELTAILVDAIKRGWRDKNGTISKTTVERKILPLLNKKFKC 68

Query: 325 DLTKENIRSRLNTWEKQYG-RVKLLLSHDGFEWDERHKMVVANDFDWTAYIKKHPDDQDL 384
           + T  N  SR+ + +K+Y     L     GF WD   K   A D  W AY+  HP+   +
Sbjct: 69  NKTYTNYLSRMKSMKKEYSVYAALFWFSSGFGWDPITKQFTAPDDVWAAYLMGHPNHHHM 128

Query: 385 RAKSIDNYNELCMIF 397
           R  + +++ +L +IF
Sbjct: 129 RTSTFEDFEDLQLIF 143

BLAST of MS000541 vs. ExPASy TrEMBL
Match: A0A7N2KMQ1 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 493.0 bits (1268), Expect = 1.1e-135
Identity = 241/406 (59.36%), Positives = 311/406 (76.60%), Query Frame = 0

Query: 1   VSALSEKYGPDLTKEYIRNRLRTLKKQYRILKELLSHDGFSWDETRKIIVANNSVWDDYI 60
           V AL+E++GPDLTKE+IRNRLRT +KQY ILKELLSH GF WD  +K+I+A++SVWDDY+
Sbjct: 222 VLALNERFGPDLTKEHIRNRLRTWRKQYLILKELLSHSGFKWDAMQKMIIASDSVWDDYV 281

Query: 61  KVNLEARSFRGRIFENYDQLCIFFGYYN-----METFDFHVAMYDGKNGCEGNPLRWTSE 120
           K + +AR FR R  +NYDQL I FG  +     ++  D       GK    G  +RWT E
Sbjct: 282 KTHPDARIFRNRFIQNYDQLFIIFGDSHEAAEPVDVIDVSPVRCGGKVKDLGKNVRWTFE 341

Query: 121 MDCCLSGVLVEQVILGNKN--SNEFKTVAYNAAILAIRERFSLQLTKDQVKDRFKSWKRE 180
           MD CL  VLVEQVILGNKN   N+FK  AY AA+LAI+ERF L LTKD V++R K+WK++
Sbjct: 342 MDRCLGKVLVEQVILGNKNRLDNKFKPAAYEAAVLAIKERFHLDLTKDHVRNRLKTWKKQ 401

Query: 181 YFVLRNLLDQGDFEWDDQRKMLVAKDSVWDVSVERNPDARLLRGRVIENYDELCIIIGYD 240
           Y +L+ LLDQ DFEWD++RKM++A DS W+  ++ NPDAR ++GRVI NY+ELC+IIG +
Sbjct: 402 YDILQELLDQRDFEWDERRKMVIANDSAWNEYIKINPDARTVQGRVINNYEELCVIIGCN 461

Query: 241 NPSESSLNPANVNLDLTANNEAINAGVVCYNQSNNAAEKENFITWTEEMDTCLSKLLVEQ 300
           +P ESS+N A  NLDL A NEA+ A    YN+ +NA +K  +I+WT+EMD CL++LLV+Q
Sbjct: 462 DPPESSVNIAENNLDLIAENEAVVAEEKYYNEVDNAKDKVKYISWTDEMDRCLTQLLVQQ 521

Query: 301 VVLGNRIEETFKTAAYTAALTVLNERFALDLTKENIRSRLNTWEKQYGRVKLLLSHDGFE 360
           V+LGN++++ FK  AY AALTVLNE+F LDLTKENIR+RL TW+KQYG VK LLSH GFE
Sbjct: 522 VMLGNKLDKNFKPVAYMAALTVLNEKFGLDLTKENIRNRLKTWKKQYGLVKELLSHGGFE 581

Query: 361 WDERHKMVVANDFDWTAYIKKHPDDQDLRAKSIDNYNELCMIFGNE 400
           WD+R+KMVVA D DW  YIK++PD + LRA+SI+NY++L +I GNE
Sbjct: 582 WDDRYKMVVATDSDWNEYIKRYPDARQLRARSIENYDDLRIIVGNE 627

BLAST of MS000541 vs. ExPASy TrEMBL
Match: A0A2N9FX33 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS19734 PE=4 SV=1)

HSP 1 Score: 491.9 bits (1265), Expect = 2.5e-135
Identity = 239/403 (59.31%), Positives = 308/403 (76.43%), Query Frame = 0

Query: 4   LSEKYGPDLTKEYIRNRLRTLKKQYRILKELLSHDGFSWDETRKIIVANNSVWDDYIKVN 63
           L+E++GP+L+KE+IRNRLRT +KQY IL ELLSH+GF WDE +K+I+A++S+WDDY+K +
Sbjct: 225 LNERFGPELSKEHIRNRLRTWRKQYLILNELLSHNGFKWDEMQKMIIASDSIWDDYVKTH 284

Query: 64  LEARSFRGRIFENYDQLCIFFGYYNMET-----FDFHVAMYDGKNGCEGNPLRWTSEMDC 123
            +AR FR R  +NYDQL I FG YN ET      D       GK   +G  +RWT EMD 
Sbjct: 285 PDARIFRNRFIQNYDQLYIIFGNYN-ETREPIPIDASPVQCGGKARDQGKNMRWTYEMDR 344

Query: 124 CLSGVLVEQVILGNKN--SNEFKTVAYNAAILAIRERFSLQLTKDQVKDRFKSWKREYFV 183
           CL  VLVEQVILGNKN   N+FK  AY AA+LAI+++F + L KD V++R K+WK++Y +
Sbjct: 345 CLGKVLVEQVILGNKNKLDNKFKPAAYEAAVLAIKKQFHIDLMKDHVRNRLKTWKKQYDI 404

Query: 184 LRNLLDQGDFEWDDQRKMLVAKDSVWDVSVERNPDARLLRGRVIENYDELCIIIGYDNPS 243
           L+ LLDQ  FEWD +RKM++A DS W+  ++ NPDAR ++GRVI NY+ELC+IIGY++P 
Sbjct: 405 LQELLDQSGFEWDGRRKMVIANDSAWNEYLKINPDARTVQGRVINNYEELCVIIGYNDPP 464

Query: 244 ESSLNPANVNLDLTANNEAINAGVVCYNQSNNAAEKENFITWTEEMDTCLSKLLVEQVVL 303
           ESSLN A  NLDL   NEA+ A    YN+ +NA +K  +I+WT+EMD CL++LLVEQV+L
Sbjct: 465 ESSLNIAENNLDLIVENEAVVAEEAYYNEIDNAKDKGKYISWTDEMDRCLTQLLVEQVML 524

Query: 304 GNRIEETFKTAAYTAALTVLNERFALDLTKENIRSRLNTWEKQYGRVKLLLSHDGFEWDE 363
           GN++E+ FK  AY  ALTVLNE+F LDLT+ENIR+RL TW+KQYG VK LLSH GFEWDE
Sbjct: 525 GNKLEKNFKPVAYMTALTVLNEKFGLDLTRENIRNRLKTWKKQYGLVKELLSHSGFEWDE 584

Query: 364 RHKMVVANDFDWTAYIKKHPDDQDLRAKSIDNYNELCMIFGNE 400
           R+KMVVA D DW  YIK+HPD + LRA+SI+NY+EL +I GNE
Sbjct: 585 RYKMVVAPDSDWNEYIKRHPDARQLRARSIENYDELRIIVGNE 626

BLAST of MS000541 vs. ExPASy TrEMBL
Match: A0A5J5C7S2 (Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_001686 PE=4 SV=1)

HSP 1 Score: 454.1 bits (1167), Expect = 5.9e-124
Identity = 228/423 (53.90%), Positives = 293/423 (69.27%), Query Frame = 0

Query: 1   VSALSEKYGPDLTKEYIRNRLRTLKKQYRILKELLSHDGFSWDETRKIIVANNSVWDDYI 60
           V+AL+EK+GPD+TK++I+NRL+T KKQY ILKELLSH GF WDE RK+++ N+S W+DYI
Sbjct: 240 VTALNEKFGPDITKDHIKNRLKTWKKQYGILKELLSHIGFKWDEARKMVIGNDSAWNDYI 299

Query: 61  KVNLEARSFRGRIFENYDQLCIFF------GYYNMETFDFHVAMYDGKNGCE---GNPLR 120
           K + +A  FRGR+ ENYD LCI F      G Y+    D   ++     G E    +P+R
Sbjct: 300 KTHHDAHPFRGRVVENYDHLCIIFGNNHATGSYSRTVDDIVHSLAGDSEGVEAINASPIR 359

Query: 121 -------------WTSEMDCCLSGVLVEQVILGNKN--SNEFKTVAYNAAILAIRERFSL 180
                        WT+EMD CLS +LV+QV LGNK+   N+FK  AY AA+LA+ ERF L
Sbjct: 360 CYSGLRDEEKNMEWTNEMDRCLSTILVKQVKLGNKSKLDNKFKPAAYAAAVLALSERFQL 419

Query: 181 QLTKDQVKDRFKSWKREYFVLRNLLDQGDFEWDDQRKMLVAKDSVWDVSVERNPDARLLR 240
             T D V++R K+WK+ Y  ++ +LDQ +F+WD +RKM+   DSVW   ++ NPDARLL 
Sbjct: 420 DFTNDHVRNRIKTWKKLYGSVKEILDQSEFKWDKERKMITTNDSVWHDYIKINPDARLLH 479

Query: 241 GRVIENYDELCIIIGYDNPSESSLNPANVNLDLTANNEAINAGVVCYNQSNNAAEKENFI 300
           GRVIENYDELC IIG DNP+ESS N A  ++D  A+NE +   V   +Q +NA E+  +I
Sbjct: 480 GRVIENYDELCAIIGNDNPTESSKNDAEADMDWAADNEDVETEVAYQSQRDNAKERGKYI 539

Query: 301 TWTEEMDTCLSKLLVEQVVLGNRIEETFKTAAYTAALTVLNERFALDLTKENIRSRLNTW 360
            WT+EMD CL + LVEQV LGN++E+ FK  AYTA LT LNE F LDLTKENI+SRL TW
Sbjct: 540 IWTDEMDCCLMEKLVEQVKLGNKLEKNFKPVAYTAVLTALNENFVLDLTKENIKSRLKTW 599

Query: 361 EKQYGRVKLLLSHDGFEWDERHKMVVANDFDWTAYIKKHPDDQDLRAKSIDNYNELCMIF 400
           +K YG VK +LSH GF WDE+ KMVVA D  W  YIK HPD + LRA+SI+NY+EL +I 
Sbjct: 600 KKVYGLVKEVLSHRGFVWDEKRKMVVATDSVWNEYIKMHPDAKFLRARSIENYDELRIII 659

BLAST of MS000541 vs. ExPASy TrEMBL
Match: A0A371EED3 (L10-interacting MYB domain-containing protein (Fragment) OS=Mucuna pruriens OX=157652 GN=LIMYB PE=4 SV=1)

HSP 1 Score: 449.1 bits (1154), Expect = 1.9e-122
Identity = 215/409 (52.57%), Positives = 295/409 (72.13%), Query Frame = 0

Query: 1   VSALSEKYGPDLTKEYIRNRLRTLKKQYRILKELLSHDGFSWDETRKIIVANNSVWDDYI 60
           VSA++ K+G  LTK  I+NRL+T K+QY +LKE+LSH GF WDET+K+I+AN+S W+DYI
Sbjct: 283 VSAINAKFGLHLTKFNIKNRLKTWKRQYELLKEILSHTGFKWDETKKMIIANDSTWNDYI 342

Query: 61  KVNLEARSFRGRIFENYDQLCIFFGYYNMETF--------DFHVAMYDGKNGCEGNPLRW 120
           + +L+ R+FRGR+FENYDQ CI FG++N   +        +     YD     +G  +RW
Sbjct: 343 RTHLDTRTFRGRVFENYDQFCIIFGHFNEPLYWDESEPCDEICPVNYDINVKDQGRQMRW 402

Query: 121 TSEMDCCLSGVLVEQVILGNKNSNEF--KTVAYNAAILAIRERFSLQLTKDQVKDRFKSW 180
           TS+MD CLS +LV+Q+  GN++  ++  K  A+ AA+LAI E+F L L K+ +K+R K+W
Sbjct: 403 TSDMDSCLSAILVQQIKQGNRSRYDYKLKPAAFEAAVLAINEKFQLYLAKEHIKNRLKTW 462

Query: 181 KREYFVLRNLLDQGDFEWDDQRKMLVAKDSVWDVSVERNPDARLLRGRVIENYDELCIII 240
           K++Y +L+ L+DQ  FEWD++RKM++A DSVW+  +++NPDARLL+GRVI NYDELCIII
Sbjct: 463 KKQYDILKELMDQSGFEWDERRKMIIANDSVWNEYIKKNPDARLLKGRVIRNYDELCIII 522

Query: 241 GYDNPSESSLNPANVNLDLTANNEAINAGVVCYNQSNNAAEKENFITWTEEMDTCLSKLL 300
           G+ +P +SS+N A  N+  T +N  +       ++   A EK   +TWT+EMD CL++LL
Sbjct: 523 GHCDPPDSSMNGACTNMGFTKDNGVMEVQETNCHRIIYAKEKGKNVTWTDEMDHCLTELL 582

Query: 301 VEQVVLGNRIEETFKTAAYTAALTVLNERFALDLTKENIRSRLNTWEKQYGRVKLLLSHD 360
             QV+LGN++E+ FKT+AY AALTVLNERF L+LTKENI SRL TW+KQY  +K +L   
Sbjct: 583 FNQVMLGNKLEKNFKTSAYIAALTVLNERFDLNLTKENIISRLKTWKKQYDLLKEMLLQR 642

Query: 361 GFEWDERHKMVVANDFDWTAYIKKHPDDQDLRAKSIDNYNELCMIFGNE 400
            FEWDE  KM VA D +W  YIKKHPD + LR + I+NY+EL MI GNE
Sbjct: 643 RFEWDEERKMAVATDLEWDEYIKKHPDAKHLRDRRIENYHELGMIVGNE 691

BLAST of MS000541 vs. ExPASy TrEMBL
Match: A0A5B7BRF2 (Uncharacterized protein OS=Davidia involucrata OX=16924 GN=Din_039932 PE=4 SV=1)

HSP 1 Score: 446.8 bits (1148), Expect = 9.4e-122
Identity = 224/422 (53.08%), Positives = 293/422 (69.43%), Query Frame = 0

Query: 1   VSALSEKYGPDLTKEYIRNRLRTLKKQYRILKELLSHDGFSWDETRKIIVANNSVWDDYI 60
           V+AL+EK+GPD+TK++I+NRL+T KKQY ILKELLSH GF WDE RK+++ ++S+W+DYI
Sbjct: 222 VTALNEKFGPDITKDHIKNRLKTWKKQYGILKELLSHTGFKWDEARKMVIGDDSIWNDYI 281

Query: 61  KVNLEARSFRGRIFENYDQLCIFF------GYYNMETFDFHVAMYDGKNGCE---GNPLR 120
           K + +A  FRGR+ ENYD LCI F      G Y+    D   ++     G E    +P+R
Sbjct: 282 KTHHDAHLFRGRVVENYDHLCIIFGNNHATGSYSRTADDIVHSLAGDSEGVEAINASPIR 341

Query: 121 -------------WTSEMDCCLSGVLVEQVILGNKN--SNEFKTVAYNAAILAIRERFSL 180
                        WT+EMD CLS +LVEQV LGNK+   N+FK  AY+AA+ A+ ERF L
Sbjct: 342 CYSGLRDQEKNMKWTNEMDYCLSTILVEQVKLGNKSKLDNKFKPAAYDAAVSALSERFQL 401

Query: 181 QLTKDQVKDRFKSWKREYFVLRNLLDQGDFEWDDQRKMLVAKDSVWDVSVERNPDARLLR 240
             TKD V++R K+WK+ Y  ++ LLD  +F+WD++ KM+ A DSVW   ++  PDARLL+
Sbjct: 402 DFTKDHVRNRIKTWKKLYGSMKELLDHSEFKWDEELKMVTANDSVWHDYIKIKPDARLLQ 461

Query: 241 GRVIENYDELCIIIGYDNPSESSLNPANVNLDLTANNEAINAGVVCYNQSNNAAEKENFI 300
           G VIENYDELC+IIG DNP+ESS N A  ++D  A+NE I   V   +Q +N  E+  +I
Sbjct: 462 GLVIENYDELCVIIGNDNPTESSKNDAEADMDWAADNEGIETEVAYQSQPDNGKERGKYI 521

Query: 301 TWTEEMDTCLSKLLVEQVVLGNRIEETFKTAAYTAALTVLNERFALDLTKENIRSRLNTW 360
            WT+EMD CL++ LVEQV LGN++E+ FK  AYTA +T LNE FALDLTKENI+SRL TW
Sbjct: 522 IWTDEMDRCLTEKLVEQVKLGNKLEKNFKPVAYTAVVTTLNENFALDLTKENIKSRLKTW 581

Query: 361 EKQYGRVKLLLSHDGFEWDERHKMVVANDFDWTAYIKKHPDDQDLRAKSIDNYNELCMIF 399
           +K YG VK +LSH GF WDE  KMVVA D  W  YIK HPD + LRA+SI+ ++EL +I 
Sbjct: 582 KKLYGLVKEVLSHRGFVWDEERKMVVATDSVWNEYIKMHPDAKFLRARSIEYFDELRIII 641

BLAST of MS000541 vs. TAIR 10
Match: AT2G24960.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 12 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 229.6 bits (584), Expect = 4.5e-60
Identity = 130/410 (31.71%), Positives = 209/410 (50.98%), Query Frame = 0

Query: 1   VSALSEKYGPDLTKEYIRNRLRTLKKQYRILKELLSHDGFSWDETRKIIVANNSVWDDYI 60
           ++  + K+G    K+ +++R   L KQY  +K LL H GF WD+T + ++ ++S+W  Y+
Sbjct: 50  LTVFNSKFGSQYDKDVLKSRYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYL 109

Query: 61  KVNLEARSFRGRIFENYDQLCIFFGY------YNMETFDFHVA-MYDGK----NGCEGNP 120
           K + EAR ++ +   N+  LC+ +GY      Y+M + D  +    +G+    +G E + 
Sbjct: 110 KAHPEARVYKTKPVLNFSDLCLIYGYTVADGRYSMSSHDLEIEDEINGESVVLSGKESSK 169

Query: 121 LRWTSEMDCCLSGVLVEQVILGNKNSNEFKTVAYNAAILAIRERFSLQLTKDQVKDRFKS 180
             WT EMD     ++V+Q+  GNK  N F   A+   ++    RFS Q  K  ++ R+  
Sbjct: 170 TEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYNK 229

Query: 181 WKREYFVLRNLLDQGDFEWDDQRKMLVAKDSVWDVSVERNPDARLLRGRVIENYDELCII 240
             + Y  +  +L +  F WD+ R M+ A D+VWD  ++ +P AR  R + + +Y++L  I
Sbjct: 230 LLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLDTI 289

Query: 241 IGYDNPSESSLNPANVNLDLTANNEAINAGVVCYNQSNNAAEKENFITWTEEMDTCLSKL 300
                        A    D   +  A        +Q  N+     F  WT  MD  L  L
Sbjct: 290 FACQ---------AEQGTDHRDDGSAAQTSETKASQEQNSDRTRIF--WTPPMDYHLIDL 349

Query: 301 LVEQVVLGNRIEETFKTAAYTAALTVLNERFALDLTKENIRSRLNTWEKQYGRVKLLLSH 360
           LVEQV  GNR+ +TF T+A+   +T  N +F     K+ +++R     + Y  +K LL  
Sbjct: 350 LVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKNRYKHLRRLYNDIKFLLEQ 409

Query: 361 DGFEWDERHKMVVANDFDWTAYIKKHPDDQDLRAKSIDNYNELCMIFGNE 400
           +GF WD R  MV+A+D  W  YI+ HP+ +  R K+I +Y  LC IFG E
Sbjct: 410 NGFSWDARRDMVIADDDIWNTYIQAHPEARSYRVKTIPSYPNLCFIFGKE 448

BLAST of MS000541 vs. TAIR 10
Match: AT2G24960.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes - 50 (source: NCBI BLink). )

HSP 1 Score: 216.5 bits (550), Expect = 4.0e-56
Identity = 130/433 (30.02%), Positives = 209/433 (48.27%), Query Frame = 0

Query: 1   VSALSEKYGPDLTKEYIRNRLRTLKKQYRILKELLSHDGFSWDETRKIIVANNSVWDDYI 60
           ++  + K+G    K+ +++R   L KQY  +K LL H GF WD+T + ++ ++S+W  Y+
Sbjct: 50  LTVFNSKFGSQYDKDVLKSRYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYL 109

Query: 61  KVNLEARSFRGRIFENYDQLCIFFGY------YNMETFDFHVA-MYDGK----NGCEGNP 120
           K + EAR ++ +   N+  LC+ +GY      Y+M + D  +    +G+    +G E + 
Sbjct: 110 KAHPEARVYKTKPVLNFSDLCLIYGYTVADGRYSMSSHDLEIEDEINGESVVLSGKESSK 169

Query: 121 LRWTSEMDCCLSGVLVEQVILGNKNSNEFKTVAYNAAILAIRERFSLQLTKDQVKDRFKS 180
             WT EMD     ++V+Q+  GNK  N F   A+   ++    RFS Q  K  ++ R+  
Sbjct: 170 TEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYNK 229

Query: 181 WKREYFVLRNLLDQGDFEWDDQRKMLVAKDSVWDVSVERNPDARLLRGRVIENYDELCII 240
             + Y  +  +L +  F WD+ R M+ A D+VWD  ++ +P AR  R + + +Y++L  I
Sbjct: 230 LLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLDTI 289

Query: 241 IGYDNPSESSLNPANVNLDLTANNEAINAGVVCYNQSNNAAEKENFITWTEEMDTCLSKL 300
                        A    D   +  A        +Q  N+     F  WT  MD  L  L
Sbjct: 290 FACQ---------AEQGTDHRDDGSAAQTSETKASQEQNSDRTRIF--WTPPMDYHLIDL 349

Query: 301 LVEQVVLGNRIEETFKTAAYTAALTVLNERFALDLTKENIRSRLNTWEKQYGRVKLLLSH 360
           LVEQV  GNR+ +TF T+A+   +T  N +F     K+ +++R     + Y  +K LL  
Sbjct: 350 LVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKNRYKHLRRLYNDIKFLLEQ 409

Query: 361 DGFEWDERHKMVVANDFDWTAYI-----------------------KKHPDDQDLRAKSI 400
           +GF WD R  MV+A+D  W  YI                       + HP+ +  R K+I
Sbjct: 410 NGFSWDARRDMVIADDDIWNTYIQACHILFLFKISVICLCLQMKHVQAHPEARSYRVKTI 469

BLAST of MS000541 vs. TAIR 10
Match: AT4G02210.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes - 26 (source: NCBI BLink). )

HSP 1 Score: 147.5 bits (371), Expect = 2.3e-35
Identity = 88/306 (28.76%), Positives = 156/306 (50.98%), Query Frame = 0

Query: 102 KNGCEGNPLRWTSEMDCCLSGVLVEQVILGNK-NSNEFKTVAYNAAILAIRERFSLQLTK 161
           +NG E     WT EMD     ++VEQV  GN+   + F   A+     +   +F     K
Sbjct: 4   RNGNERLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYGK 63

Query: 162 DQVKDRFKSWKREYFVLRNLLDQGDFEWDDQRKMLVAKDSVWDVSVERNPDARLLRGRVI 221
           D +K+R K+ +  +  + NLL +  F WDD R+M+VA + VWD  ++ +PD+R  R + I
Sbjct: 64  DVLKNRHKTLRNLFKSVNNLLIEDGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKSI 123

Query: 222 ENYDELCIIIG---YDNPSESSLNPANVNLDLTANNEAINAGVVCYNQSNNAAEKENFI- 281
             Y +LC++      ++ +E S++    +  L   ++  N   +C + +  +  K + + 
Sbjct: 124 PCYKDLCLVYSDGMSEHKAEESISEGE-SKTLIQEDDGYNR--ICESSTVRSNSKGSSVT 183

Query: 282 ----TWTEEMDTCLSKLLVEQVVLGNRIEETFKTAAYTAALTVLNERFALDLTKENIRSR 341
               TW   MD     L+++Q   GN+IE  F+  A+T  + + N +F  +   + +++R
Sbjct: 184 RCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNR 243

Query: 342 LNTWEKQYGRVKLLLSHDGFEWDERHKMVVANDFDWTAYIKKHPDDQDLRAKSIDNYNEL 399
             +  +Q+  +K +L  DGF WD   +MV A++  W  YIK H D +    + I  Y +L
Sbjct: 244 YKSLRRQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDL 303

BLAST of MS000541 vs. TAIR 10
Match: AT4G02210.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24960.2). )

HSP 1 Score: 147.5 bits (371), Expect = 2.3e-35
Identity = 88/306 (28.76%), Positives = 156/306 (50.98%), Query Frame = 0

Query: 102 KNGCEGNPLRWTSEMDCCLSGVLVEQVILGNK-NSNEFKTVAYNAAILAIRERFSLQLTK 161
           +NG E     WT EMD     ++VEQV  GN+   + F   A+     +   +F     K
Sbjct: 4   RNGNERLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYGK 63

Query: 162 DQVKDRFKSWKREYFVLRNLLDQGDFEWDDQRKMLVAKDSVWDVSVERNPDARLLRGRVI 221
           D +K+R K+ +  +  + NLL +  F WDD R+M+VA + VWD  ++ +PD+R  R + I
Sbjct: 64  DVLKNRHKTLRNLFKSVNNLLIEDGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKSI 123

Query: 222 ENYDELCIIIG---YDNPSESSLNPANVNLDLTANNEAINAGVVCYNQSNNAAEKENFI- 281
             Y +LC++      ++ +E S++    +  L   ++  N   +C + +  +  K + + 
Sbjct: 124 PCYKDLCLVYSDGMSEHKAEESISEGE-SKTLIQEDDGYNR--ICESSTVRSNSKGSSVT 183

Query: 282 ----TWTEEMDTCLSKLLVEQVVLGNRIEETFKTAAYTAALTVLNERFALDLTKENIRSR 341
               TW   MD     L+++Q   GN+IE  F+  A+T  + + N +F  +   + +++R
Sbjct: 184 RCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNR 243

Query: 342 LNTWEKQYGRVKLLLSHDGFEWDERHKMVVANDFDWTAYIKKHPDDQDLRAKSIDNYNEL 399
             +  +Q+  +K +L  DGF WD   +MV A++  W  YIK H D +    + I  Y +L
Sbjct: 244 YKSLRRQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDL 303

BLAST of MS000541 vs. TAIR 10
Match: AT4G02550.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 350 Blast hits to 284 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 13; Plants - 331; Viruses - 0; Other Eukaryotes - 6 (source: NCBI BLink). )

HSP 1 Score: 91.3 bits (225), Expect = 1.9e-18
Identity = 50/135 (37.04%), Positives = 72/135 (53.33%), Query Frame = 0

Query: 106 EGNPLRWTSEMDCCLSGVLVEQVILGNKNSNEFKTVAYNAAILAIRERFSLQLTKDQVKD 165
           +G  + W+  MD CL   L  Q   GNK    F   AY AA +A+  RF+L LT  +  +
Sbjct: 16  KGRNVIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAIN 75

Query: 166 RFKSWKREYFVLRNLLDQGDFEWDDQRKML-VAKDSVWDVSVERNPDARLLRGRVIENYD 225
           R K+ K+ Y V+R++L +  F W+   KM+    D +W   +  NPDA+  RG+ IE Y+
Sbjct: 76  RLKTIKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYE 135

Query: 226 ELCIIIG-YDNPSES 239
           EL  + G Y  P  S
Sbjct: 136 ELRTVCGDYQTPGSS 150
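
The percentages on each HSP summary line follow directly from the reported counts: identical (or positive-scoring) positions divided by the aligned length. The sketch below reproduces that arithmetic for two of the TAIR 10 hits above. The function name `percent` and the `hsps` dictionary are illustrative names, not part of the BLAST output, and the counts are copied by hand from the summary lines.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the aligned length, rounded to 2 places."""
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the HSP summary lines (hypothetical container, for illustration).
hsps = {
    "AT4G02210.1": {"identities": 88, "positives": 156, "length": 306},
    "AT4G02550.2": {"identities": 50, "positives": 72, "length": 135},
}

for name, hsp in hsps.items():
    identity = percent(hsp["identities"], hsp["length"])
    positives = percent(hsp["positives"], hsp["length"])
    print(f"{name}: identity {identity}%, positives {positives}%")
```

The aligned length includes gap columns, so it can differ from the number of query residues covered by the HSP.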

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_030959168.1  2.4e-135  59.36     uncharacterized protein LOC115981123 [Quercus lobata]
XP_023877154.1  1.3e-133  58.62     uncharacterized protein LOC111989590 [Quercus suber]
KAF3973412.1    2.2e-133  59.11     hypothetical protein CMV_003146 [Castanea mollissima]
KAF7815604.1    1.4e-124  53.41     L10-interacting MYB domain-containing protein-like isoform X4 [Senna tora]
KAA8550002.1    1.2e-123  53.90     hypothetical protein F0562_001686 [Nyssa sinensis]

Match Name      E-value   Identity  Description
O82368          2.3e-08   26.67     Uncharacterized protein At2g29880 OS=Arabidopsis thaliana OX=3702 GN=At2g29880 P...

Match Name      E-value   Identity  Description
A0A7N2KMQ1      1.1e-135  59.36     Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1
A0A2N9FX33      2.5e-135  59.31     Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS19734 PE=4 SV=1
A0A5J5C7S2      5.9e-124  53.90     Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_001686 PE=4 SV=1
A0A371EED3      1.9e-122  52.57     L10-interacting MYB domain-containing protein (Fragment) OS=Mucuna pruriens OX=1...
A0A5B7BRF2      9.4e-122  53.08     Uncharacterized protein OS=Davidia involucrata OX=16924 GN=Din_039932 PE=4 SV=1

Match Name      E-value   Identity  Description
AT2G24960.2     4.5e-60   31.71     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G24960.1     4.0e-56   30.02     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G02210.1     2.3e-35   28.76     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G02210.2     2.3e-35   28.76     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G02550.2     1.9e-18   37.04     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term   IPR Description       Source   Source Term     Source Description                        Alignment
IPR024752  Myb/SANT-like domain  PFAM     PF12776         Myb_DNA-bind_3                            coord: 277..371; e-value: 9.3E-23; score: 81.0
                                                                                                    coord: 111..203; e-value: 1.3E-20; score: 74.2
                                                                                                    coord: 3..59; e-value: 7.2E-13; score: 49.4
None       No IPR available      PANTHER  PTHR46929       EXPRESSED PROTEIN                         coord: 262..398
                                                                                                    coord: 1..86
                                                                                                    coord: 103..262
None       No IPR available      PANTHER  PTHR46929:SF12  MYB/SANT-LIKE DNA-BINDING DOMAIN PROTEIN  coord: 262..398
                                                                                                    coord: 1..86
                                                                                                    coord: 103..262

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
MS000541.1    MS000541.1   mRNA